Automatic indexing of journal abstracts with latent semantic analysis

Joel Robert Adams, Steven Bedrick

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The BioASQ “Task on Large-Scale Online Biomedical Semantic Indexing” charges participants with assigning semantic tags to biomedical journal abstracts. We present a system that takes as input a biomedical abstract and uses latent semantic analysis to identify similar documents in the MEDLINE database. The system then uses a novel ranking scheme to select a list of MeSH tags from candidates drawn from the most similar documents. Our approach achieved better than baseline performance in both precision and recall. We suggest several possible strategies to improve the system’s performance.

Original languageEnglish (US)
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
PublisherSpringer Verlag
Pages200-208
Number of pages9
Volume9283
ISBN (Print)9783319240268
DOIs
StatePublished - 2015
Event6th International Conference on Labs of the Evaluation Forum, CLEF 2015 - Toulouse, France
Duration: Sep 8 2015Sep 11 2015

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume9283
ISSN (Print)03029743
ISSN (Electronic)16113349

Other

Other6th International Conference on Labs of the Evaluation Forum, CLEF 2015
CountryFrance
CityToulouse
Period9/8/159/11/15

Fingerprint

Automatic indexing
Latent Semantic Analysis
Indexing
Semantics
System Performance
Baseline
Ranking
Charge
Mesh

ASJC Scopus subject areas

  • Computer Science(all)
  • Theoretical Computer Science

Cite this

Adams, J. R., & Bedrick, S. (2015). Automatic indexing of journal abstracts with latent semantic analysis. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9283, pp. 200-208). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 9283). Springer Verlag. https://doi.org/10.1007/978-3-319-24027-5_17

Automatic indexing of journal abstracts with latent semantic analysis. / Adams, Joel Robert; Bedrick, Steven.

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 9283 Springer Verlag, 2015. p. 200-208 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 9283).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Adams, JR & Bedrick, S 2015, Automatic indexing of journal abstracts with latent semantic analysis. in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). vol. 9283, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 9283, Springer Verlag, pp. 200-208, 6th International Conference on Labs of the Evaluation Forum, CLEF 2015, Toulouse, France, 9/8/15. https://doi.org/10.1007/978-3-319-24027-5_17
Adams JR, Bedrick S. Automatic indexing of journal abstracts with latent semantic analysis. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 9283. Springer Verlag. 2015. p. 200-208. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-319-24027-5_17
Adams, Joel Robert ; Bedrick, Steven. / Automatic indexing of journal abstracts with latent semantic analysis. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Vol. 9283 Springer Verlag, 2015. pp. 200-208 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
@inproceedings{8b93a60ce6734ec0b71760cacc7dd7ea,
title = "Automatic indexing of journal abstracts with latent semantic analysis",
abstract = "The BioASQ “Task on Large-Scale Online Biomedical Semantic Indexing” charges participants with assigning semantic tags to biomedical journal abstracts. We present a system that takes as input a biomedical abstract and uses latent semantic analysis to identify similar documents in the MEDLINE database. The system then uses a novel ranking scheme to select a list of MeSH tags from candidates drawn from the most similar documents. Our approach achieved better than baseline performance in both precision and recall. We suggest several possible strategies to improve the system’s performance.",
author = "Adams, {Joel Robert} and Steven Bedrick",
year = "2015",
doi = "10.1007/978-3-319-24027-5_17",
language = "English (US)",
isbn = "9783319240268",
volume = "9283",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
pages = "200--208",
booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

}

TY - GEN

T1 - Automatic indexing of journal abstracts with latent semantic analysis

AU - Adams, Joel Robert

AU - Bedrick, Steven

PY - 2015

Y1 - 2015

N2 - The BioASQ “Task on Large-Scale Online Biomedical Semantic Indexing” charges participants with assigning semantic tags to biomedical journal abstracts. We present a system that takes as input a biomedical abstract and uses latent semantic analysis to identify similar documents in the MEDLINE database. The system then uses a novel ranking scheme to select a list of MeSH tags from candidates drawn from the most similar documents. Our approach achieved better than baseline performance in both precision and recall. We suggest several possible strategies to improve the system’s performance.

AB - The BioASQ “Task on Large-Scale Online Biomedical Semantic Indexing” charges participants with assigning semantic tags to biomedical journal abstracts. We present a system that takes as input a biomedical abstract and uses latent semantic analysis to identify similar documents in the MEDLINE database. The system then uses a novel ranking scheme to select a list of MeSH tags from candidates drawn from the most similar documents. Our approach achieved better than baseline performance in both precision and recall. We suggest several possible strategies to improve the system’s performance.

UR - http://www.scopus.com/inward/record.url?scp=84945930956&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84945930956&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-24027-5_17

DO - 10.1007/978-3-319-24027-5_17

M3 - Conference contribution

SN - 9783319240268

VL - 9283

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 200

EP - 208

BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

PB - Springer Verlag

ER -