The OHSU biomedical question answering system framework

A. M. Cohen, J. Yang, S. Fisher, B. Roark, W. R. Hersh

Research output: Contribution to journal › Conference article

1 Scopus citation

Abstract

The Oregon Health & Science University submission to the TREC 2007 Genomics Track approached the entity list question answering task using a modular, object-oriented system framework. A system object coordinates a collection of processing objects into a pipe that constructs a set of queries, retrieves passages, and then processes those passages into a final output answer set. Using the framework, we applied multiple levels of synonym expansion and a ranked series of topic queries with a range of specificities in order to retrieve all of the likely relevant passages, with the most likely ranked higher. We then applied sentence pruning to the head and tail of each passage using both NLP and term-based techniques. Overall scores finished around the TREC Genomics mean for each of the four measures. Careful passage retrieval, including synonym expansion and multiple query construction, together with sentence pruning, was essential in achieving acceptable performance on this task.
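The pipe described in the abstract can be illustrated with a minimal sketch. It is not the OHSU implementation: every name here (State, System, the stage functions, SYNONYMS, CORPUS) is hypothetical, the corpus is a toy in-memory list, and only term-based pruning is shown (the NLP-based pruning mentioned in the abstract is omitted).

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set

# Hypothetical illustration only: none of these names or data come from the paper.
SYNONYMS: Dict[str, List[str]] = {"tumor": ["tumour", "neoplasm"]}
CORPUS = [
    "Background is given here. The p53 gene suppresses tumour growth. See also section 2.",
    "Mice were housed in cages. Diet was standard.",
    "A neoplasm was observed. Funding was provided by a grant.",
]

@dataclass
class State:
    """Data handed from one processing stage to the next."""
    topic_terms: List[str]
    queries: List[List[Set[str]]] = field(default_factory=list)  # most specific first
    passages: List[str] = field(default_factory=list)            # in ranked order
    answers: List[str] = field(default_factory=list)

def expand(term: str) -> Set[str]:
    """Synonym expansion: the term plus any known synonyms."""
    return {term, *SYNONYMS.get(term, [])}

def build_queries(state: State) -> State:
    """Ranked queries: all terms required first, then each term on its own."""
    groups = [expand(t) for t in state.topic_terms]
    state.queries = [groups] + [[g] for g in groups]
    return state

def retrieve(state: State) -> State:
    """Run queries in order so passages hit by more specific queries rank higher."""
    for query in state.queries:
        for passage in CORPUS:
            text = passage.lower()
            hit = all(any(s in text for s in group) for group in query)
            if hit and passage not in state.passages:
                state.passages.append(passage)
    return state

def prune(state: State) -> State:
    """Term-based pruning: drop head and tail sentences with no topic term."""
    vocab = set().union(*(expand(t) for t in state.topic_terms))
    for passage in state.passages:
        sents = [s.strip() for s in passage.split(".") if s.strip()]
        hits = [any(v in s.lower() for v in vocab) for s in sents]
        if True not in hits:
            continue
        first = hits.index(True)
        last = len(hits) - 1 - hits[::-1].index(True)
        state.answers.append(". ".join(sents[first:last + 1]) + ".")
    return state

class System:
    """Coordinates a sequence of processing stages into one pipe."""
    def __init__(self, stages):
        self.stages = stages

    def run(self, topic_terms: List[str]) -> List[str]:
        state = State(topic_terms=list(topic_terms))
        for stage in self.stages:
            state = stage(state)
        return state.answers

answers = System([build_queries, retrieve, prune]).run(["p53", "tumor"])
print(answers)
```

Because each stage only reads and updates the shared state, a stage can be swapped (for example, a different pruning strategy) without touching the others, which is the kind of modularity the abstract attributes to the framework.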

Original language: English (US)
Journal: NIST Special Publication
State: Published - Dec 1 2007
Event: 16th Text REtrieval Conference, TREC 2007 - Gaithersburg, MD, United States
Duration: Nov 6 2007 to Nov 9 2007

ASJC Scopus subject areas

  • Engineering(all)
