Using heterogeneous annotation and visual information for the benchmarking of image retrieval systems

Henning Müller, Paul Clough, William Hersh, Thomas Deselaers, Thomas M. Lehmann, Bruno Janvier, Antoine Geissbuhler

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Scopus citations

Abstract

Many image retrieval systems, and the evaluation methodologies of these systems, make use of either visual or textual information only. Only few combine textual and visual features for retrieval and evaluation. If text is used, it is often relies upon having a standardised and complete annotation schema for the entire collection. This, in combination with high-level semantic queries, makes visual/textual combinations almost useless as the information need can often be solved using just textual features. In reality, many collections do have some form of annotation but this is often heterogeneous and incomplete. Web-based image repositories such as FlickR even allow collective, as well as multilingual annotation of multimedia objects. This article describes an image retrieval evaluation campaign called ImageCLEF. Unlike previous evaluations, we offer a range of realistic tasks and image collections in which combining text and visual features is likely to obtain the best results. In particular, we offer a medical retrieval task which models exactly the situation of heterogenous annotation by combining four collections with annotations of varying quality, structure, extent and language. Two collections have an annotation per case and not per image, which is normal in the medical domain, making it difficult to relate parts of the accompanying text to corresponding images. This is also typical of image retrieval from the web in which adjacent text does not always describe an image. The ImageCLEF benchmark shows the need for realistic and standardised datasets, search tasks and ground truths for visual information retrieval evaluation.

Original languageEnglish (US)
Title of host publicationInternet Imaging VII - Proceedings of SPIE-IS and T Electronic Imaging
DOIs
StatePublished - Apr 17 2006
EventInternet Imaging VII - San Jose, CA, United States
Duration: Jan 18 2006Jan 19 2006

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Volume6061
ISSN (Print)0277-786X

Other

OtherInternet Imaging VII
CountryUnited States
CitySan Jose, CA
Period1/18/061/19/06

ASJC Scopus subject areas

  • Electronic, Optical and Magnetic Materials
  • Condensed Matter Physics
  • Computer Science Applications
  • Applied Mathematics
  • Electrical and Electronic Engineering

Fingerprint Dive into the research topics of 'Using heterogeneous annotation and visual information for the benchmarking of image retrieval systems'. Together they form a unique fingerprint.

  • Cite this

    Müller, H., Clough, P., Hersh, W., Deselaers, T., Lehmann, T. M., Janvier, B., & Geissbuhler, A. (2006). Using heterogeneous annotation and visual information for the benchmarking of image retrieval systems. In Internet Imaging VII - Proceedings of SPIE-IS and T Electronic Imaging [606105] (Proceedings of SPIE - The International Society for Optical Engineering; Vol. 6061). https://doi.org/10.1117/12.660259