Data-driven method to enhance craniofacial and oral phenotype vocabularies

Rashmi Mishra; Andrea Burke; Bonnie Gitman; Payal Verma; Mark Engelstad; Melissa A. Haendel; Ilias Alevizos; William A. Gahl; Michael T. Collins; Janice S. Lee; Murat Sincan

doi:10.1016/j.adaj.2019.05.029

Data-driven method to enhance craniofacial and oral phenotype vocabularies

Rashmi Mishra, Andrea Burke, Bonnie Gitman, Payal Verma, Mark Engelstad, Melissa A. Haendel, Ilias Alevizos, William A. Gahl, Michael T. Collins, Janice S. Lee, Murat Sincan

Research output: Contribution to journal › Article › peer-review

6 Scopus citations

Abstract

Background: A significant amount of clinical information captured as free-text narratives could be better used for several applications, such as clinical decision support, ontology development, evidence-based practice, and research. The Human Phenotype Ontology (HPO) is specifically used for semantic comparisons for diagnostic purposes. All these functions require quality coverage of the domain of interest. The authors used natural language processing to capture craniofacial and oral phenotype signatures from electronic health records and then used these signatures for evaluation of existing oral phenotype ontology coverage. Methods: The authors applied a text-processing pipeline based on the clinical Text Analysis and Knowledge Extraction System to annotate the clinical notes with Unified Medical Language System codes. The authors extracted the disease or disorder phenotype terms, which were then compared with HPO terms and their synonyms. Results: The authors retrieved 2,153 deidentified clinical notes from 558 patients. Finally, 2,416 unique diseases or disorders phenotype terms were extracted, which included 210 craniofacial or oral phenotype terms. Twenty-six of these phenotypes were not found in the HPO. Conclusions: The authors demonstrated that natural language processing tools could extract relevant phenotype terms from clinical narratives, which could help identify gaps in existing ontologies and enhance craniofacial and dental phenotyping vocabularies. Practical Implications: The expansion of terms in the dental, oral, and craniofacial domains in the HPO is particularly important as the dental community moves toward electronic health records.

Original language	English (US)
Pages (from-to)	933-939.e2
Journal	Journal of the American Dental Association
Volume	150
Issue number	11
DOIs	https://doi.org/10.1016/j.adaj.2019.05.029
State	Published - Nov 2019

Keywords

Natural language processing
craniofacial and oral phenotypes
evidence-based dentistry
ontology

ASJC Scopus subject areas

General Dentistry

Access to Document

10.1016/j.adaj.2019.05.029

Cite this

@article{ab1bebdbac98494bb6c6e9e2aeaa350c,

title = "Data-driven method to enhance craniofacial and oral phenotype vocabularies",

abstract = "Background: A significant amount of clinical information captured as free-text narratives could be better used for several applications, such as clinical decision support, ontology development, evidence-based practice, and research. The Human Phenotype Ontology (HPO) is specifically used for semantic comparisons for diagnostic purposes. All these functions require quality coverage of the domain of interest. The authors used natural language processing to capture craniofacial and oral phenotype signatures from electronic health records and then used these signatures for evaluation of existing oral phenotype ontology coverage. Methods: The authors applied a text-processing pipeline based on the clinical Text Analysis and Knowledge Extraction System to annotate the clinical notes with Unified Medical Language System codes. The authors extracted the disease or disorder phenotype terms, which were then compared with HPO terms and their synonyms. Results: The authors retrieved 2,153 deidentified clinical notes from 558 patients. Finally, 2,416 unique diseases or disorders phenotype terms were extracted, which included 210 craniofacial or oral phenotype terms. Twenty-six of these phenotypes were not found in the HPO. Conclusions: The authors demonstrated that natural language processing tools could extract relevant phenotype terms from clinical narratives, which could help identify gaps in existing ontologies and enhance craniofacial and dental phenotyping vocabularies. Practical Implications: The expansion of terms in the dental, oral, and craniofacial domains in the HPO is particularly important as the dental community moves toward electronic health records.",

keywords = "Natural language processing, craniofacial and oral phenotypes, evidence-based dentistry, ontology",

author = "Rashmi Mishra and Andrea Burke and Bonnie Gitman and Payal Verma and Mark Engelstad and Haendel, {Melissa A.} and Ilias Alevizos and Gahl, {William A.} and Collins, {Michael T.} and Lee, {Janice S.} and Murat Sincan",

note = "Publisher Copyright: {\textcopyright} 2019 American Dental Association",

year = "2019",

month = nov,

doi = "10.1016/j.adaj.2019.05.029",

language = "English (US)",

volume = "150",

pages = "933--939.e2",

journal = "Journal of the American Dental Association",

issn = "0002-8177",

publisher = "American Dental Association",

number = "11",

}

TY - JOUR

T1 - Data-driven method to enhance craniofacial and oral phenotype vocabularies

AU - Mishra, Rashmi

AU - Burke, Andrea

AU - Gitman, Bonnie

AU - Verma, Payal

AU - Engelstad, Mark

AU - Haendel, Melissa A.

AU - Alevizos, Ilias

AU - Gahl, William A.

AU - Collins, Michael T.

AU - Lee, Janice S.

AU - Sincan, Murat

PY - 2019/11

Y1 - 2019/11

N2 - Background: A significant amount of clinical information captured as free-text narratives could be better used for several applications, such as clinical decision support, ontology development, evidence-based practice, and research. The Human Phenotype Ontology (HPO) is specifically used for semantic comparisons for diagnostic purposes. All these functions require quality coverage of the domain of interest. The authors used natural language processing to capture craniofacial and oral phenotype signatures from electronic health records and then used these signatures for evaluation of existing oral phenotype ontology coverage. Methods: The authors applied a text-processing pipeline based on the clinical Text Analysis and Knowledge Extraction System to annotate the clinical notes with Unified Medical Language System codes. The authors extracted the disease or disorder phenotype terms, which were then compared with HPO terms and their synonyms. Results: The authors retrieved 2,153 deidentified clinical notes from 558 patients. Finally, 2,416 unique diseases or disorders phenotype terms were extracted, which included 210 craniofacial or oral phenotype terms. Twenty-six of these phenotypes were not found in the HPO. Conclusions: The authors demonstrated that natural language processing tools could extract relevant phenotype terms from clinical narratives, which could help identify gaps in existing ontologies and enhance craniofacial and dental phenotyping vocabularies. Practical Implications: The expansion of terms in the dental, oral, and craniofacial domains in the HPO is particularly important as the dental community moves toward electronic health records.

AB - Background: A significant amount of clinical information captured as free-text narratives could be better used for several applications, such as clinical decision support, ontology development, evidence-based practice, and research. The Human Phenotype Ontology (HPO) is specifically used for semantic comparisons for diagnostic purposes. All these functions require quality coverage of the domain of interest. The authors used natural language processing to capture craniofacial and oral phenotype signatures from electronic health records and then used these signatures for evaluation of existing oral phenotype ontology coverage. Methods: The authors applied a text-processing pipeline based on the clinical Text Analysis and Knowledge Extraction System to annotate the clinical notes with Unified Medical Language System codes. The authors extracted the disease or disorder phenotype terms, which were then compared with HPO terms and their synonyms. Results: The authors retrieved 2,153 deidentified clinical notes from 558 patients. Finally, 2,416 unique diseases or disorders phenotype terms were extracted, which included 210 craniofacial or oral phenotype terms. Twenty-six of these phenotypes were not found in the HPO. Conclusions: The authors demonstrated that natural language processing tools could extract relevant phenotype terms from clinical narratives, which could help identify gaps in existing ontologies and enhance craniofacial and dental phenotyping vocabularies. Practical Implications: The expansion of terms in the dental, oral, and craniofacial domains in the HPO is particularly important as the dental community moves toward electronic health records.

KW - Natural language processing

KW - craniofacial and oral phenotypes

KW - evidence-based dentistry

KW - ontology

UR - http://www.scopus.com/inward/record.url?scp=85073630645&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85073630645&partnerID=8YFLogxK

U2 - 10.1016/j.adaj.2019.05.029

DO - 10.1016/j.adaj.2019.05.029

M3 - Article

C2 - 31668172

AN - SCOPUS:85073630645

SN - 0002-8177

VL - 150

SP - 933-939.e2

JO - Journal of the American Dental Association

JF - Journal of the American Dental Association

IS - 11

ER -

Data-driven method to enhance craniofacial and oral phenotype vocabularies

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this