Recognizing noun phrases in medical discharge summaries: an evaluation of two natural language parsers.

K. A. Spackman; W. R. Hersh

Recognizing noun phrases in medical discharge summaries: an evaluation of two natural language parsers.

K. A. Spackman, W. R. Hersh

Medical Informatics and Clinical Epidemiology

Research output: Contribution to journal › Article › peer-review

5 Scopus citations

Abstract

We evaluated the ability of two natural language parsers, CLARIT and the Xerox Tagger, to identify simple, noun phrases in medical discharge summaries. In twenty randomly selected discharge summaries, there were 1909 unique simple noun phrases. CLARIT and the Xerox Tagger exactly identified 77.0% and 68.7% of the phrases, respectively, and partially identified 85.7% and 80.8% of the phrases. Neither system had been specially modified or tuned to the medical domain. These results suggest that it is possible to apply existing natural language processing (NLP) techniques to large bodies of medical text, in order to empirically identify the terminology used in medicine. Virtually all the noun phrases could be regarded as having special medical connotation and would be candidates for entry into a controlled medical vocabulary.

Original language	English (US)
Pages (from-to)	155-158
Number of pages	4
Journal	Proceedings : a conference of the American Medical Informatics Association / ... AMIA Annual Fall Symposium. AMIA Fall Symposium
State	Published - 1996

ASJC Scopus subject areas

General Medicine

Cite this

@article{952501de6b63407dbf687b8c2691aab4,

title = "Recognizing noun phrases in medical discharge summaries: an evaluation of two natural language parsers.",

abstract = "We evaluated the ability of two natural language parsers, CLARIT and the Xerox Tagger, to identify simple, noun phrases in medical discharge summaries. In twenty randomly selected discharge summaries, there were 1909 unique simple noun phrases. CLARIT and the Xerox Tagger exactly identified 77.0% and 68.7% of the phrases, respectively, and partially identified 85.7% and 80.8% of the phrases. Neither system had been specially modified or tuned to the medical domain. These results suggest that it is possible to apply existing natural language processing (NLP) techniques to large bodies of medical text, in order to empirically identify the terminology used in medicine. Virtually all the noun phrases could be regarded as having special medical connotation and would be candidates for entry into a controlled medical vocabulary.",

author = "Spackman, {K. A.} and Hersh, {W. R.}",

note = "Copyright: This record is sourced from MEDLINE{\textregistered}/PubMed{\textregistered}, a database of the U.S. National Library of Medicine",

year = "1996",

language = "English (US)",

pages = "155--158",

journal = "Proceedings : a conference of the American Medical Informatics Association / ... AMIA Annual Fall Symposium. AMIA Fall Symposium",

issn = "1091-8280",

publisher = "Hanley and Belfus Inc.",

}

TY - JOUR

T1 - Recognizing noun phrases in medical discharge summaries

T2 - an evaluation of two natural language parsers.

AU - Spackman, K. A.

AU - Hersh, W. R.

N1 - Copyright: This record is sourced from MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine

PY - 1996

Y1 - 1996

N2 - We evaluated the ability of two natural language parsers, CLARIT and the Xerox Tagger, to identify simple, noun phrases in medical discharge summaries. In twenty randomly selected discharge summaries, there were 1909 unique simple noun phrases. CLARIT and the Xerox Tagger exactly identified 77.0% and 68.7% of the phrases, respectively, and partially identified 85.7% and 80.8% of the phrases. Neither system had been specially modified or tuned to the medical domain. These results suggest that it is possible to apply existing natural language processing (NLP) techniques to large bodies of medical text, in order to empirically identify the terminology used in medicine. Virtually all the noun phrases could be regarded as having special medical connotation and would be candidates for entry into a controlled medical vocabulary.

AB - We evaluated the ability of two natural language parsers, CLARIT and the Xerox Tagger, to identify simple, noun phrases in medical discharge summaries. In twenty randomly selected discharge summaries, there were 1909 unique simple noun phrases. CLARIT and the Xerox Tagger exactly identified 77.0% and 68.7% of the phrases, respectively, and partially identified 85.7% and 80.8% of the phrases. Neither system had been specially modified or tuned to the medical domain. These results suggest that it is possible to apply existing natural language processing (NLP) techniques to large bodies of medical text, in order to empirically identify the terminology used in medicine. Virtually all the noun phrases could be regarded as having special medical connotation and would be candidates for entry into a controlled medical vocabulary.

UR - http://www.scopus.com/inward/record.url?scp=0030341882&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0030341882&partnerID=8YFLogxK

M3 - Article

C2 - 8947647

AN - SCOPUS:0030341882

SN - 1091-8280

SP - 155

EP - 158

JO - Proceedings : a conference of the American Medical Informatics Association / ... AMIA Annual Fall Symposium. AMIA Fall Symposium

JF - Proceedings : a conference of the American Medical Informatics Association / ... AMIA Annual Fall Symposium. AMIA Fall Symposium

ER -

Recognizing noun phrases in medical discharge summaries: an evaluation of two natural language parsers.

Abstract

ASJC Scopus subject areas

Other files and links

Fingerprint

Cite this