TY - GEN
T1 - Using heterogeneous annotation and visual information for the benchmarking of image retrieval systems
AU - Müller, Henning
AU - Clough, Paul
AU - Hersh, William
AU - Deselaers, Thomas
AU - Lehmann, Thomas M.
AU - Janvier, Bruno
AU - Geissbuhler, Antoine
N1 - Copyright:
Copyright 2008 Elsevier B.V., All rights reserved.
PY - 2006
Y1 - 2006
N2 - Many image retrieval systems, and the evaluation methodologies of these systems, make use of either visual or textual information only. Only a few combine textual and visual features for retrieval and evaluation. If text is used, it often relies on having a standardised and complete annotation schema for the entire collection. This, in combination with high-level semantic queries, makes visual/textual combinations almost useless, as the information need can often be satisfied using textual features alone. In reality, many collections do have some form of annotation, but it is often heterogeneous and incomplete. Web-based image repositories such as Flickr even allow collective, as well as multilingual, annotation of multimedia objects. This article describes an image retrieval evaluation campaign called ImageCLEF. Unlike previous evaluations, we offer a range of realistic tasks and image collections in which combining text and visual features is likely to obtain the best results. In particular, we offer a medical retrieval task that models exactly this situation of heterogeneous annotation by combining four collections with annotations of varying quality, structure, extent and language. Two collections have an annotation per case rather than per image, which is normal in the medical domain, making it difficult to relate parts of the accompanying text to the corresponding images. This is also typical of image retrieval from the web, in which adjacent text does not always describe an image. The ImageCLEF benchmark shows the need for realistic and standardised datasets, search tasks and ground truths for visual information retrieval evaluation.
AB - Many image retrieval systems, and the evaluation methodologies of these systems, make use of either visual or textual information only. Only a few combine textual and visual features for retrieval and evaluation. If text is used, it often relies on having a standardised and complete annotation schema for the entire collection. This, in combination with high-level semantic queries, makes visual/textual combinations almost useless, as the information need can often be satisfied using textual features alone. In reality, many collections do have some form of annotation, but it is often heterogeneous and incomplete. Web-based image repositories such as Flickr even allow collective, as well as multilingual, annotation of multimedia objects. This article describes an image retrieval evaluation campaign called ImageCLEF. Unlike previous evaluations, we offer a range of realistic tasks and image collections in which combining text and visual features is likely to obtain the best results. In particular, we offer a medical retrieval task that models exactly this situation of heterogeneous annotation by combining four collections with annotations of varying quality, structure, extent and language. Two collections have an annotation per case rather than per image, which is normal in the medical domain, making it difficult to relate parts of the accompanying text to the corresponding images. This is also typical of image retrieval from the web, in which adjacent text does not always describe an image. The ImageCLEF benchmark shows the need for realistic and standardised datasets, search tasks and ground truths for visual information retrieval evaluation.
UR - http://www.scopus.com/inward/record.url?scp=33645685975&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33645685975&partnerID=8YFLogxK
U2 - 10.1117/12.660259
DO - 10.1117/12.660259
M3 - Conference contribution
AN - SCOPUS:33645685975
SN - 0819461016
SN - 9780819461018
T3 - Proceedings of SPIE - The International Society for Optical Engineering
BT - Internet Imaging VII - Proceedings of SPIE-IS and T Electronic Imaging
T2 - Internet Imaging VII
Y2 - 18 January 2006 through 19 January 2006
ER -