Assessing thesaurus-based query expansion using the UMLS Metathesaurus.

William (Bill) Hersh, S. Price, L. Donohoe

Research output: Contribution to journalArticle

88 Citations (Scopus)

Abstract

OBJECTIVES: Assess query expansion using thesaurus relationships and definitions in the UMLS Metathesaurus for improving searching performance. METHODS: The queries from a MEDLINE test collection (OHSUMED) were expanded using synonym, hierarchical, and related term information as well as term definitions from the UMLS Metathesaurus. Documents were retrieved from a word-statistical retrieval system and assessed for recall and precision based on relevance judgments from the test collection. RESULTS: All types of query expansion degraded aggregate retrieval performance as measured by recall and precision, although 38.6% of the queries with synonym expansion and up to 29.7% of the queries with hierarchical expansion showed improvement. CONCLUSIONS: Thesaurus-based query expansion causes a decline in retrieval performance generally but improves it in specific instances. Further research must focus on identifying instances where performance improves and how it can be exploited by real users.

Original languageEnglish (US)
Pages (from-to)344-348
Number of pages5
JournalProceedings / AMIA ... Annual Symposium. AMIA Symposium
StatePublished - 2000

Fingerprint

Unified Medical Language System
Controlled Vocabulary
MEDLINE
Research

Cite this

Assessing thesaurus-based query expansion using the UMLS Metathesaurus. / Hersh, William (Bill); Price, S.; Donohoe, L.

In: Proceedings / AMIA ... Annual Symposium. AMIA Symposium, 2000, p. 344-348.

Research output: Contribution to journalArticle

@article{b907867dcf494624a2f7f4d4a4bffd3a,
title = "Assessing thesaurus-based query expansion using the UMLS Metathesaurus.",
abstract = "OBJECTIVES: Assess query expansion using thesaurus relationships and definitions in the UMLS Metathesaurus for improving searching performance. METHODS: The queries from a MEDLINE test collection (OHSUMED) were expanded using synonym, hierarchical, and related term information as well as term definitions from the UMLS Metathesaurus. Documents were retrieved from a word-statistical retrieval system and assessed for recall and precision based on relevance judgments from the test collection. RESULTS: All types of query expansion degraded aggregate retrieval performance as measured by recall and precision, although 38.6{\%} of the queries with synonym expansion and up to 29.7{\%} of the queries with hierarchical expansion showed improvement. CONCLUSIONS: Thesaurus-based query expansion causes a decline in retrieval performance generally but improves it in specific instances. Further research must focus on identifying instances where performance improves and how it can be exploited by real users.",
author = "Hersh, {William (Bill)} and S. Price and L. Donohoe",
year = "2000",
language = "English (US)",
pages = "344--348",
journal = "Proceedings / AMIA . Annual Symposium. AMIA Symposium",
issn = "1531-605X",
publisher = "Hanley & Belfus",

}

TY - JOUR

T1 - Assessing thesaurus-based query expansion using the UMLS Metathesaurus.

AU - Hersh, William (Bill)

AU - Price, S.

AU - Donohoe, L.

PY - 2000

Y1 - 2000

N2 - OBJECTIVES: Assess query expansion using thesaurus relationships and definitions in the UMLS Metathesaurus for improving searching performance. METHODS: The queries from a MEDLINE test collection (OHSUMED) were expanded using synonym, hierarchical, and related term information as well as term definitions from the UMLS Metathesaurus. Documents were retrieved from a word-statistical retrieval system and assessed for recall and precision based on relevance judgments from the test collection. RESULTS: All types of query expansion degraded aggregate retrieval performance as measured by recall and precision, although 38.6% of the queries with synonym expansion and up to 29.7% of the queries with hierarchical expansion showed improvement. CONCLUSIONS: Thesaurus-based query expansion causes a decline in retrieval performance generally but improves it in specific instances. Further research must focus on identifying instances where performance improves and how it can be exploited by real users.

AB - OBJECTIVES: Assess query expansion using thesaurus relationships and definitions in the UMLS Metathesaurus for improving searching performance. METHODS: The queries from a MEDLINE test collection (OHSUMED) were expanded using synonym, hierarchical, and related term information as well as term definitions from the UMLS Metathesaurus. Documents were retrieved from a word-statistical retrieval system and assessed for recall and precision based on relevance judgments from the test collection. RESULTS: All types of query expansion degraded aggregate retrieval performance as measured by recall and precision, although 38.6% of the queries with synonym expansion and up to 29.7% of the queries with hierarchical expansion showed improvement. CONCLUSIONS: Thesaurus-based query expansion causes a decline in retrieval performance generally but improves it in specific instances. Further research must focus on identifying instances where performance improves and how it can be exploited by real users.

UR - http://www.scopus.com/inward/record.url?scp=0034567115&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0034567115&partnerID=8YFLogxK

M3 - Article

C2 - 11079902

AN - SCOPUS:0034567115

SP - 344

EP - 348

JO - Proceedings / AMIA . Annual Symposium. AMIA Symposium

JF - Proceedings / AMIA . Annual Symposium. AMIA Symposium

SN - 1531-605X

ER -