A comparison of retrieval effectiveness for three methods of indexing medical literature

William (Bill) Hersh, D. H. Hickam

Research output: Contribution to journalArticle

21 Citations (Scopus)

Abstract

Conventional approaches to indexing medical literature include the human assignment of terms from a controlled vocabulary, such as MeSH, or the computer assignment of all words in the title and abstract as indexing terms. Human indexing suffers from inconsistency, while word-based indexing suffers from the multiple meanings of words. SAPHIRE is a computer program designed to provide indexing using controlled terms that are assigned by computer, based on their occurrence in the title and abstract. In this first evaluation of SAPHIRE, the authors compared the retrieval performance of the three indexing approaches-human-based MEDLINE with text words; machine-based SAPHIRE with text words; and text words only-for searches by measuring recall and precision for each search using a test collection of 200 abstracts. The abstracts were judged by human reviewers for relevance as applied to 12 literature queries. The results suggest that text word indexing is more effective than indexing with MeSH terms. SAPHIRE's indexing performance was slightly inferior but the program has other advantageous features.

Original languageEnglish (US)
Pages (from-to)292-300
Number of pages9
JournalAmerican Journal of the Medical Sciences
Volume303
Issue number5
StatePublished - 1992

Fingerprint

Controlled Vocabulary
MEDLINE
Software

Keywords

  • AIDSLINE
  • Automated indexing
  • Information retrieval
  • MEDLINE
  • Meta-1
  • Recall/precision analysis
  • Unified medical language systems project

ASJC Scopus subject areas

  • Medicine(all)

Cite this

A comparison of retrieval effectiveness for three methods of indexing medical literature. / Hersh, William (Bill); Hickam, D. H.

In: American Journal of the Medical Sciences, Vol. 303, No. 5, 1992, p. 292-300.

Research output: Contribution to journalArticle

@article{6f04bd4af80440de8925589419ecf69d,
title = "A comparison of retrieval effectiveness for three methods of indexing medical literature",
abstract = "Conventional approaches to indexing medical literature include the human assignment of terms from a controlled vocabulary, such as MeSH, or the computer assignment of all words in the title and abstract as indexing terms. Human indexing suffers from inconsistency, while word-based indexing suffers from the multiple meanings of words. SAPHIRE is a computer program designed to provide indexing using controlled terms that are assigned by computer, based on their occurrence in the title and abstract. In this first evaluation of SAPHIRE, the authors compared the retrieval performance of the three indexing approaches-human-based MEDLINE with text words; machine-based SAPHIRE with text words; and text words only-for searches by measuring recall and precision for each search using a test collection of 200 abstracts. The abstracts were judged by human reviewers for relevance as applied to 12 literature queries. The results suggest that text word indexing is more effective than indexing with MeSH terms. SAPHIRE's indexing performance was slightly inferior but the program has other advantageous features.",
keywords = "AIDSLINE, Automated indexing, Information retrieval, MEDLINE, Meta-1, Recall/precision analysis, Unified medical language systems project",
author = "Hersh, {William (Bill)} and Hickam, {D. H.}",
year = "1992",
language = "English (US)",
volume = "303",
pages = "292--300",
journal = "American Journal of the Medical Sciences",
issn = "0002-9629",
publisher = "Lippincott Williams and Wilkins",
number = "5",

}

TY - JOUR

T1 - A comparison of retrieval effectiveness for three methods of indexing medical literature

AU - Hersh, William (Bill)

AU - Hickam, D. H.

PY - 1992

Y1 - 1992

N2 - Conventional approaches to indexing medical literature include the human assignment of terms from a controlled vocabulary, such as MeSH, or the computer assignment of all words in the title and abstract as indexing terms. Human indexing suffers from inconsistency, while word-based indexing suffers from the multiple meanings of words. SAPHIRE is a computer program designed to provide indexing using controlled terms that are assigned by computer, based on their occurrence in the title and abstract. In this first evaluation of SAPHIRE, the authors compared the retrieval performance of the three indexing approaches-human-based MEDLINE with text words; machine-based SAPHIRE with text words; and text words only-for searches by measuring recall and precision for each search using a test collection of 200 abstracts. The abstracts were judged by human reviewers for relevance as applied to 12 literature queries. The results suggest that text word indexing is more effective than indexing with MeSH terms. SAPHIRE's indexing performance was slightly inferior but the program has other advantageous features.

AB - Conventional approaches to indexing medical literature include the human assignment of terms from a controlled vocabulary, such as MeSH, or the computer assignment of all words in the title and abstract as indexing terms. Human indexing suffers from inconsistency, while word-based indexing suffers from the multiple meanings of words. SAPHIRE is a computer program designed to provide indexing using controlled terms that are assigned by computer, based on their occurrence in the title and abstract. In this first evaluation of SAPHIRE, the authors compared the retrieval performance of the three indexing approaches-human-based MEDLINE with text words; machine-based SAPHIRE with text words; and text words only-for searches by measuring recall and precision for each search using a test collection of 200 abstracts. The abstracts were judged by human reviewers for relevance as applied to 12 literature queries. The results suggest that text word indexing is more effective than indexing with MeSH terms. SAPHIRE's indexing performance was slightly inferior but the program has other advantageous features.

KW - AIDSLINE

KW - Automated indexing

KW - Information retrieval

KW - MEDLINE

KW - Meta-1

KW - Recall/precision analysis

KW - Unified medical language systems project

UR - http://www.scopus.com/inward/record.url?scp=0026771569&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0026771569&partnerID=8YFLogxK

M3 - Article

C2 - 1580316

AN - SCOPUS:0026771569

VL - 303

SP - 292

EP - 300

JO - American Journal of the Medical Sciences

JF - American Journal of the Medical Sciences

SN - 0002-9629

IS - 5

ER -