Using media fusion and domain dimensions to improve precision in medical image retrieval

Saïd Radhouani; Jayashree Kalpathy-Cramer; Steven Bedrick; Brian Bakke; William Hersh

doi:10.1007/978-3-642-15751-6_27

Using media fusion and domain dimensions to improve precision in medical image retrieval

Saïd Radhouani, Jayashree Kalpathy-Cramer, Steven Bedrick, Brian Bakke, William Hersh

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

6 Scopus citations

Abstract

In this paper, we focus on improving retrieval performance, especially early precision, in the task of solving medical multimodal queries. The queries we deal with consist of a visual component, given as a set of image-examples, and textual annotation, provided as a set of words. The queries' semantic content can be classified along three domain dimensions: anatomy, pathology, and modality. To solve these queries, we interpret their semantic content using both textual and visual data. Medical images often are accompanied by textual annotations, which in turn typically include explicit mention of their image's anatomy or pathology. Annotations rarely include explicit mention of image modality, however. To address this, we use an image's visual features to identify its modality. Our system thereby performs image retrieval by combining purely visual information about an image with information derived from its textual annotations. In order to experimentally evaluate our approach, we performed a set of experiments using the 2009 ImageCLEFmed collection using our integrated system as well as a purely textual retrieval system. Our integrated approach consistently outperformed our text-only system by 43% in MAP and by 71% in precision within the top 5 retrieved documents. We conclude that this improved performance is due to our method of combining visual and textual features.

Original language	English (US)
Title of host publication	Multilingual Information Access Evaluation II
Subtitle of host publication	Multimedia Experiments - 10th Workshop of the Cross-Language Evaluation Forum, CLEF 2009, Revised Selected Papers
Pages	223-230
Number of pages	8
DOIs	https://doi.org/10.1007/978-3-642-15751-6_27
State	Published - 2010
Event	10th Workshop of the Cross-Language Evaluation Forum, CLEF 2009 - Corfu, Greece Duration: Sep 30 2009 → Oct 2 2009

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	6242 LNCS
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Other

Other	10th Workshop of the Cross-Language Evaluation Forum, CLEF 2009
Country/Territory	Greece
City	Corfu
Period	9/30/09 → 10/2/09

Keywords

Domain Dimensions
Image Classification
Image Modality Extraction
Media Fusion
Medical Image Retrieval
Performance Evaluation

ASJC Scopus subject areas

Theoretical Computer Science
General Computer Science

Access to Document

10.1007/978-3-642-15751-6_27

Cite this

Radhouani, S., Kalpathy-Cramer, J., Bedrick, S., Bakke, B., & Hersh, W. (2010). Using media fusion and domain dimensions to improve precision in medical image retrieval. In Multilingual Information Access Evaluation II: Multimedia Experiments - 10th Workshop of the Cross-Language Evaluation Forum, CLEF 2009, Revised Selected Papers (pp. 223-230). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 6242 LNCS). https://doi.org/10.1007/978-3-642-15751-6_27

Using media fusion and domain dimensions to improve precision in medical image retrieval. / Radhouani, Saïd; Kalpathy-Cramer, Jayashree; Bedrick, Steven et al.
Multilingual Information Access Evaluation II: Multimedia Experiments - 10th Workshop of the Cross-Language Evaluation Forum, CLEF 2009, Revised Selected Papers. 2010. p. 223-230 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 6242 LNCS).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Radhouani, S, Kalpathy-Cramer, J, Bedrick, S, Bakke, B & Hersh, W 2010, Using media fusion and domain dimensions to improve precision in medical image retrieval. in Multilingual Information Access Evaluation II: Multimedia Experiments - 10th Workshop of the Cross-Language Evaluation Forum, CLEF 2009, Revised Selected Papers. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 6242 LNCS, pp. 223-230, 10th Workshop of the Cross-Language Evaluation Forum, CLEF 2009, Corfu, Greece, 9/30/09. https://doi.org/10.1007/978-3-642-15751-6_27

Radhouani S, Kalpathy-Cramer J, Bedrick S, Bakke B, Hersh W. Using media fusion and domain dimensions to improve precision in medical image retrieval. In Multilingual Information Access Evaluation II: Multimedia Experiments - 10th Workshop of the Cross-Language Evaluation Forum, CLEF 2009, Revised Selected Papers. 2010. p. 223-230. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-642-15751-6_27

Radhouani, Saïd ; Kalpathy-Cramer, Jayashree ; Bedrick, Steven et al. / Using media fusion and domain dimensions to improve precision in medical image retrieval. Multilingual Information Access Evaluation II: Multimedia Experiments - 10th Workshop of the Cross-Language Evaluation Forum, CLEF 2009, Revised Selected Papers. 2010. pp. 223-230 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{aaec138a536545559598e734c3b0b1aa,

title = "Using media fusion and domain dimensions to improve precision in medical image retrieval",

abstract = "In this paper, we focus on improving retrieval performance, especially early precision, in the task of solving medical multimodal queries. The queries we deal with consist of a visual component, given as a set of image-examples, and textual annotation, provided as a set of words. The queries' semantic content can be classified along three domain dimensions: anatomy, pathology, and modality. To solve these queries, we interpret their semantic content using both textual and visual data. Medical images often are accompanied by textual annotations, which in turn typically include explicit mention of their image's anatomy or pathology. Annotations rarely include explicit mention of image modality, however. To address this, we use an image's visual features to identify its modality. Our system thereby performs image retrieval by combining purely visual information about an image with information derived from its textual annotations. In order to experimentally evaluate our approach, we performed a set of experiments using the 2009 ImageCLEFmed collection using our integrated system as well as a purely textual retrieval system. Our integrated approach consistently outperformed our text-only system by 43% in MAP and by 71% in precision within the top 5 retrieved documents. We conclude that this improved performance is due to our method of combining visual and textual features.",

keywords = "Domain Dimensions, Image Classification, Image Modality Extraction, Media Fusion, Medical Image Retrieval, Performance Evaluation",

author = "Sa{\"i}d Radhouani and Jayashree Kalpathy-Cramer and Steven Bedrick and Brian Bakke and William Hersh",

year = "2010",

doi = "10.1007/978-3-642-15751-6_27",

language = "English (US)",

isbn = "3642157505",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

pages = "223--230",

booktitle = "Multilingual Information Access Evaluation II",

note = "10th Workshop of the Cross-Language Evaluation Forum, CLEF 2009 ; Conference date: 30-09-2009 Through 02-10-2009",

}

TY - GEN

T1 - Using media fusion and domain dimensions to improve precision in medical image retrieval

AU - Radhouani, Saïd

AU - Kalpathy-Cramer, Jayashree

AU - Bedrick, Steven

AU - Bakke, Brian

AU - Hersh, William

PY - 2010

Y1 - 2010

N2 - In this paper, we focus on improving retrieval performance, especially early precision, in the task of solving medical multimodal queries. The queries we deal with consist of a visual component, given as a set of image-examples, and textual annotation, provided as a set of words. The queries' semantic content can be classified along three domain dimensions: anatomy, pathology, and modality. To solve these queries, we interpret their semantic content using both textual and visual data. Medical images often are accompanied by textual annotations, which in turn typically include explicit mention of their image's anatomy or pathology. Annotations rarely include explicit mention of image modality, however. To address this, we use an image's visual features to identify its modality. Our system thereby performs image retrieval by combining purely visual information about an image with information derived from its textual annotations. In order to experimentally evaluate our approach, we performed a set of experiments using the 2009 ImageCLEFmed collection using our integrated system as well as a purely textual retrieval system. Our integrated approach consistently outperformed our text-only system by 43% in MAP and by 71% in precision within the top 5 retrieved documents. We conclude that this improved performance is due to our method of combining visual and textual features.

AB - In this paper, we focus on improving retrieval performance, especially early precision, in the task of solving medical multimodal queries. The queries we deal with consist of a visual component, given as a set of image-examples, and textual annotation, provided as a set of words. The queries' semantic content can be classified along three domain dimensions: anatomy, pathology, and modality. To solve these queries, we interpret their semantic content using both textual and visual data. Medical images often are accompanied by textual annotations, which in turn typically include explicit mention of their image's anatomy or pathology. Annotations rarely include explicit mention of image modality, however. To address this, we use an image's visual features to identify its modality. Our system thereby performs image retrieval by combining purely visual information about an image with information derived from its textual annotations. In order to experimentally evaluate our approach, we performed a set of experiments using the 2009 ImageCLEFmed collection using our integrated system as well as a purely textual retrieval system. Our integrated approach consistently outperformed our text-only system by 43% in MAP and by 71% in precision within the top 5 retrieved documents. We conclude that this improved performance is due to our method of combining visual and textual features.

KW - Domain Dimensions

KW - Image Classification

KW - Image Modality Extraction

KW - Media Fusion

KW - Medical Image Retrieval

KW - Performance Evaluation

UR - http://www.scopus.com/inward/record.url?scp=78049328603&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=78049328603&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-15751-6_27

DO - 10.1007/978-3-642-15751-6_27

M3 - Conference contribution

AN - SCOPUS:78049328603

SN - 3642157505

SN - 9783642157509

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 223

EP - 230

BT - Multilingual Information Access Evaluation II

T2 - 10th Workshop of the Cross-Language Evaluation Forum, CLEF 2009

Y2 - 30 September 2009 through 2 October 2009

ER -

Using media fusion and domain dimensions to improve precision in medical image retrieval

Abstract

Publication series

Other

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this