Comparing the quality of accessing the medical literature using content? Based visual and textual information retrieval

Henning Müller; Jayashree Kalpathy-Cramer; Charles E. Kahn; William Hersh

doi:10.1117/12.811416

Comparing the quality of accessing the medical literature using content? Based visual and textual information retrieval

Henning Müller, Jayashree Kalpathy-Cramer, Charles E. Kahn, William Hersh

Medical Informatics and Clinical Epidemiology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

8 Scopus citations

Abstract

Content-based visual information (or image) retrieval (CBIR) has been an extremely active research domain within medical imaging over the past ten years, with the goal of improving the management of visual medical information. Many technical solutions have been proposed, and application scenarios for image retrieval as well as image classification have been set up. However, in contrast to medical information retrieval using textual methods, visual retrieval has only rarely been applied in clinical practice. This is despite the large amount and variety of visual information produced in hospitals every day. This information overload imposes a significant burden upon clinicians, and CBIR technologies have the potential to help the situation. However, in order for CBIR to become an accepted clinical tool, it must demonstrate a higher level of technical maturity than it has to date. Since 2004, the ImageCLEF benchmark has included a task for the comparison of visual information retrieval algorithms for medical applications. In 2005, a task for medical image classification was introduced and both tasks have been run successfully for the past four years. These benchmarks allow an annual comparison of visual retrieval techniques based on the same data sets and the same query tasks, enabling the meaningful comparison of various retrieval techniques. The datasets used from 2004?2007 contained images and annotations from medical teaching files. In 2008, however, the dataset used was made up of 67,000 images (along with their associated figure captions and the full text of their corresponding articles) from two Radiological Society of North America (RSNA) scientific journals. This article describes the results of the medical image retrieval task of the ImageCLEF 2008 evaluation campaign. We compare the retrieval results of both visual and textual information retrieval systems from 15 research groups on the aforementioned data set. The results show clearly that, currently, visual retrieval alone does not achieve the performance necessary for real?world clinical applications. Most of the common visual retrieval techniques have a MAP (Mean Average Precision) of around 2-3%, which is much lower than that achieved using textual retrieval (MAP=29%). Advanced machine learning techniques, together with good training data, have been shown to improve the performance of visual retrieval systems in the past. Multimodal retrieval (basing retrieval on both visual and textual information) can achieve better results than purely visual, but only when carefully applied. In many cases, multimodal retrieval systems performed even worse than purely textual retrieval systems. On the other hand, some multimodal retrieval systems demonstrated significantly increased early precision, which has been shown to be a desirable behavior in real?world systems.

Original language	English (US)
Title of host publication	Medical Imaging 2009
Subtitle of host publication	Advanced PACS-based Imaging Informatics and Therapeutic Applications
DOIs	https://doi.org/10.1117/12.811416
State	Published - 2009
Event	Medical Imaging 2009: Advanced PACS-based Imaging Informatics and Therapeutic Applications - Lake Buena Vista, FL, United States Duration: Feb 11 2009 → Feb 12 2009

Publication series

Name	Progress in Biomedical Optics and Imaging - Proceedings of SPIE
Volume	7264
ISSN (Print)	1605-7422

Other

Other	Medical Imaging 2009: Advanced PACS-based Imaging Informatics and Therapeutic Applications
Country/Territory	United States
City	Lake Buena Vista, FL
Period	2/11/09 → 2/12/09

Keywords

Content-based image retrieval
Evaluation
Information retrieval
Medical image retrieval
Multimodal information search
Scientific literature

ASJC Scopus subject areas

Electronic, Optical and Magnetic Materials
Atomic and Molecular Physics, and Optics
Radiology Nuclear Medicine and imaging
Biomaterials

Access to Document

10.1117/12.811416

Cite this

Müller, H., Kalpathy-Cramer, J., Kahn, C. E., & Hersh, W. (2009). Comparing the quality of accessing the medical literature using content? Based visual and textual information retrieval. In Medical Imaging 2009: Advanced PACS-based Imaging Informatics and Therapeutic Applications Article 726405 (Progress in Biomedical Optics and Imaging - Proceedings of SPIE; Vol. 7264). https://doi.org/10.1117/12.811416

Comparing the quality of accessing the medical literature using content? Based visual and textual information retrieval. / Müller, Henning; Kalpathy-Cramer, Jayashree; Kahn, Charles E. et al.
Medical Imaging 2009: Advanced PACS-based Imaging Informatics and Therapeutic Applications. 2009. 726405 (Progress in Biomedical Optics and Imaging - Proceedings of SPIE; Vol. 7264).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Müller, H, Kalpathy-Cramer, J, Kahn, CE & Hersh, W 2009, Comparing the quality of accessing the medical literature using content? Based visual and textual information retrieval. in Medical Imaging 2009: Advanced PACS-based Imaging Informatics and Therapeutic Applications., 726405, Progress in Biomedical Optics and Imaging - Proceedings of SPIE, vol. 7264, Medical Imaging 2009: Advanced PACS-based Imaging Informatics and Therapeutic Applications, Lake Buena Vista, FL, United States, 2/11/09. https://doi.org/10.1117/12.811416

@inproceedings{7894cfc84b6a4d96b18f594d65873077,

title = "Comparing the quality of accessing the medical literature using content? Based visual and textual information retrieval",

abstract = "Content-based visual information (or image) retrieval (CBIR) has been an extremely active research domain within medical imaging over the past ten years, with the goal of improving the management of visual medical information. Many technical solutions have been proposed, and application scenarios for image retrieval as well as image classification have been set up. However, in contrast to medical information retrieval using textual methods, visual retrieval has only rarely been applied in clinical practice. This is despite the large amount and variety of visual information produced in hospitals every day. This information overload imposes a significant burden upon clinicians, and CBIR technologies have the potential to help the situation. However, in order for CBIR to become an accepted clinical tool, it must demonstrate a higher level of technical maturity than it has to date. Since 2004, the ImageCLEF benchmark has included a task for the comparison of visual information retrieval algorithms for medical applications. In 2005, a task for medical image classification was introduced and both tasks have been run successfully for the past four years. These benchmarks allow an annual comparison of visual retrieval techniques based on the same data sets and the same query tasks, enabling the meaningful comparison of various retrieval techniques. The datasets used from 2004?2007 contained images and annotations from medical teaching files. In 2008, however, the dataset used was made up of 67,000 images (along with their associated figure captions and the full text of their corresponding articles) from two Radiological Society of North America (RSNA) scientific journals. This article describes the results of the medical image retrieval task of the ImageCLEF 2008 evaluation campaign. We compare the retrieval results of both visual and textual information retrieval systems from 15 research groups on the aforementioned data set. The results show clearly that, currently, visual retrieval alone does not achieve the performance necessary for real?world clinical applications. Most of the common visual retrieval techniques have a MAP (Mean Average Precision) of around 2-3%, which is much lower than that achieved using textual retrieval (MAP=29%). Advanced machine learning techniques, together with good training data, have been shown to improve the performance of visual retrieval systems in the past. Multimodal retrieval (basing retrieval on both visual and textual information) can achieve better results than purely visual, but only when carefully applied. In many cases, multimodal retrieval systems performed even worse than purely textual retrieval systems. On the other hand, some multimodal retrieval systems demonstrated significantly increased early precision, which has been shown to be a desirable behavior in real?world systems.",

keywords = "Content-based image retrieval, Evaluation, Information retrieval, Medical image retrieval, Multimodal information search, Scientific literature",

author = "Henning M{\"u}ller and Jayashree Kalpathy-Cramer and Kahn, {Charles E.} and William Hersh",

year = "2009",

doi = "10.1117/12.811416",

language = "English (US)",

isbn = "9780819475152",

series = "Progress in Biomedical Optics and Imaging - Proceedings of SPIE",

booktitle = "Medical Imaging 2009",

note = "Medical Imaging 2009: Advanced PACS-based Imaging Informatics and Therapeutic Applications ; Conference date: 11-02-2009 Through 12-02-2009",

}

TY - GEN

T1 - Comparing the quality of accessing the medical literature using content? Based visual and textual information retrieval

AU - Müller, Henning

AU - Kalpathy-Cramer, Jayashree

AU - Kahn, Charles E.

AU - Hersh, William

PY - 2009

Y1 - 2009

N2 - Content-based visual information (or image) retrieval (CBIR) has been an extremely active research domain within medical imaging over the past ten years, with the goal of improving the management of visual medical information. Many technical solutions have been proposed, and application scenarios for image retrieval as well as image classification have been set up. However, in contrast to medical information retrieval using textual methods, visual retrieval has only rarely been applied in clinical practice. This is despite the large amount and variety of visual information produced in hospitals every day. This information overload imposes a significant burden upon clinicians, and CBIR technologies have the potential to help the situation. However, in order for CBIR to become an accepted clinical tool, it must demonstrate a higher level of technical maturity than it has to date. Since 2004, the ImageCLEF benchmark has included a task for the comparison of visual information retrieval algorithms for medical applications. In 2005, a task for medical image classification was introduced and both tasks have been run successfully for the past four years. These benchmarks allow an annual comparison of visual retrieval techniques based on the same data sets and the same query tasks, enabling the meaningful comparison of various retrieval techniques. The datasets used from 2004?2007 contained images and annotations from medical teaching files. In 2008, however, the dataset used was made up of 67,000 images (along with their associated figure captions and the full text of their corresponding articles) from two Radiological Society of North America (RSNA) scientific journals. This article describes the results of the medical image retrieval task of the ImageCLEF 2008 evaluation campaign. We compare the retrieval results of both visual and textual information retrieval systems from 15 research groups on the aforementioned data set. The results show clearly that, currently, visual retrieval alone does not achieve the performance necessary for real?world clinical applications. Most of the common visual retrieval techniques have a MAP (Mean Average Precision) of around 2-3%, which is much lower than that achieved using textual retrieval (MAP=29%). Advanced machine learning techniques, together with good training data, have been shown to improve the performance of visual retrieval systems in the past. Multimodal retrieval (basing retrieval on both visual and textual information) can achieve better results than purely visual, but only when carefully applied. In many cases, multimodal retrieval systems performed even worse than purely textual retrieval systems. On the other hand, some multimodal retrieval systems demonstrated significantly increased early precision, which has been shown to be a desirable behavior in real?world systems.

AB - Content-based visual information (or image) retrieval (CBIR) has been an extremely active research domain within medical imaging over the past ten years, with the goal of improving the management of visual medical information. Many technical solutions have been proposed, and application scenarios for image retrieval as well as image classification have been set up. However, in contrast to medical information retrieval using textual methods, visual retrieval has only rarely been applied in clinical practice. This is despite the large amount and variety of visual information produced in hospitals every day. This information overload imposes a significant burden upon clinicians, and CBIR technologies have the potential to help the situation. However, in order for CBIR to become an accepted clinical tool, it must demonstrate a higher level of technical maturity than it has to date. Since 2004, the ImageCLEF benchmark has included a task for the comparison of visual information retrieval algorithms for medical applications. In 2005, a task for medical image classification was introduced and both tasks have been run successfully for the past four years. These benchmarks allow an annual comparison of visual retrieval techniques based on the same data sets and the same query tasks, enabling the meaningful comparison of various retrieval techniques. The datasets used from 2004?2007 contained images and annotations from medical teaching files. In 2008, however, the dataset used was made up of 67,000 images (along with their associated figure captions and the full text of their corresponding articles) from two Radiological Society of North America (RSNA) scientific journals. This article describes the results of the medical image retrieval task of the ImageCLEF 2008 evaluation campaign. We compare the retrieval results of both visual and textual information retrieval systems from 15 research groups on the aforementioned data set. The results show clearly that, currently, visual retrieval alone does not achieve the performance necessary for real?world clinical applications. Most of the common visual retrieval techniques have a MAP (Mean Average Precision) of around 2-3%, which is much lower than that achieved using textual retrieval (MAP=29%). Advanced machine learning techniques, together with good training data, have been shown to improve the performance of visual retrieval systems in the past. Multimodal retrieval (basing retrieval on both visual and textual information) can achieve better results than purely visual, but only when carefully applied. In many cases, multimodal retrieval systems performed even worse than purely textual retrieval systems. On the other hand, some multimodal retrieval systems demonstrated significantly increased early precision, which has been shown to be a desirable behavior in real?world systems.

KW - Content-based image retrieval

KW - Evaluation

KW - Information retrieval

KW - Medical image retrieval

KW - Multimodal information search

KW - Scientific literature

UR - http://www.scopus.com/inward/record.url?scp=67149099668&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=67149099668&partnerID=8YFLogxK

U2 - 10.1117/12.811416

DO - 10.1117/12.811416

M3 - Conference contribution

AN - SCOPUS:67149099668

SN - 9780819475152

T3 - Progress in Biomedical Optics and Imaging - Proceedings of SPIE

BT - Medical Imaging 2009

T2 - Medical Imaging 2009: Advanced PACS-based Imaging Informatics and Therapeutic Applications

Y2 - 11 February 2009 through 12 February 2009

ER -

Comparing the quality of accessing the medical literature using content? Based visual and textual information retrieval

Abstract

Publication series

Other

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this