Automatic classification of PubMed abstracts with Latent semantic indexing: Working notes

Joel Robert Adams, Steven Bedrick

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Scopus citations


The 2014 BioASQ challenge 2a tasks participants with assigning semantic tags to biomedical journal abstracts. We present a system that uses Latent Semantic Analysis to identify semantically similar documents in MEDLINE to an unlabeled abstract, and then uses a novel ranking scheme to select a list of MeSH headers from candidates drawn from the most similar documents. Our approach achieved good precision, but suffered in terms of recall. We describe several possible strategies to improve our system's performance.

Original languageEnglish (US)
Title of host publicationCLEF 2014 - Working Notes for CLEF 2014 Conference
Number of pages8
StatePublished - 2014
Event2014 Cross Language Evaluation Forum Conference, CLEF 2014 - Sheffield, United Kingdom
Duration: Sep 15 2014Sep 18 2014


Other2014 Cross Language Evaluation Forum Conference, CLEF 2014
Country/TerritoryUnited Kingdom

ASJC Scopus subject areas

  • Computer Science(all)

Cite this