TY - JOUR
T1 - The Human Phenotype Ontology
T2 - Semantic Unification of Common and Rare Disease
AU - Groza, Tudor
AU - Köhler, Sebastian
AU - Moldenhauer, Dawid
AU - Vasilevsky, Nicole
AU - Baynam, Gareth
AU - Zemojtel, Tomasz
AU - Schriml, Lynn Marie
AU - Kibbe, Warren Alden
AU - Schofield, Paul N.
AU - Beck, Tim
AU - Vasant, Drashtti
AU - Brookes, Anthony J.
AU - Zankl, Andreas
AU - Washington, Nicole L.
AU - Mungall, Christopher J.
AU - Lewis, Suzanna E.
AU - Haendel, Melissa A.
AU - Parkinson, Helen
AU - Robinson, Peter N.
N1 - Publisher Copyright:
© 2015 The Authors.
PY - 2015
Y1 - 2015
N2 - The Human Phenotype Ontology (HPO) is widely used in the rare disease community for differential diagnostics, phenotype-driven analysis of next-generation sequence-variation data, and translational research, but a comparable resource has not been available for common disease. Here, we have developed a concept-recognition procedure that analyzes the frequencies of HPO disease annotations as identified in over five million PubMed abstracts by employing an iterative procedure to optimize precision and recall of the identified terms. We derived disease models for 3,145 common human diseases comprising a total of 132,006 HPO annotations. The HPO now comprises over 250,000 phenotypic annotations for over 10,000 rare and common diseases and can be used for examining the phenotypic overlap among common diseases that share risk alleles, as well as between Mendelian diseases and common diseases linked by genomic location. The annotations, as well as the HPO itself, are freely available.
AB - The Human Phenotype Ontology (HPO) is widely used in the rare disease community for differential diagnostics, phenotype-driven analysis of next-generation sequence-variation data, and translational research, but a comparable resource has not been available for common disease. Here, we have developed a concept-recognition procedure that analyzes the frequencies of HPO disease annotations as identified in over five million PubMed abstracts by employing an iterative procedure to optimize precision and recall of the identified terms. We derived disease models for 3,145 common human diseases comprising a total of 132,006 HPO annotations. The HPO now comprises over 250,000 phenotypic annotations for over 10,000 rare and common diseases and can be used for examining the phenotypic overlap among common diseases that share risk alleles, as well as between Mendelian diseases and common diseases linked by genomic location. The annotations, as well as the HPO itself, are freely available.
UR - http://www.scopus.com/inward/record.url?scp=84937523757&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84937523757&partnerID=8YFLogxK
U2 - 10.1016/j.ajhg.2015.05.020
DO - 10.1016/j.ajhg.2015.05.020
M3 - Article
C2 - 26119816
AN - SCOPUS:84937523757
SN - 0002-9297
VL - 97
SP - 111
EP - 124
JO - American Journal of Human Genetics
JF - American Journal of Human Genetics
IS - 1
ER -