Analyzing web log files of the health on the net honmedia search engine to define typical image search tasks for image retrieval evaluation

Henning Müller, Célia Boyer, Arnaud Gaudinat, William (Bill) Hersh, Antoine Geissbuhler

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Medical institutions produce ever-increasing amount of diverse information. The digital form makes these data available for the use on more than a single patient. Images are no exception to this. However, less is known about how medical professionals search for visual medical information and how they want to use it outside of the context of a single patient. This article analyzes ten months of usage log files of the Health on the Net (HON) medical media search engine. Key words were extracted from all queries and the most frequent terms and subjects were identified. The dataset required much pre-treatment. Problems included national character sets, spelling errors and the use of terms in several languages. The results show that media search, particularly for images, was frequently used. The most common queries were for general concepts (e.g., heart, lung). To define realistic information needs for the ImageCLEFmed challenge evaluation (Cross Language Evaluation Forum medical image retrieval), we used frequent queries that were still specific enough to at least cover two of the three axes on modality, anatomic region, and pathology. Several research groups evaluated their image retrieval algorithms based on these defined topics.

Original languageEnglish (US)
Title of host publicationStudies in Health Technology and Informatics
Pages1319-1323
Number of pages5
Volume129
StatePublished - 2007
Event12th World Congress on Medical Informatics, MEDINFO 2007 - Brisbane, QLD, Australia
Duration: Aug 20 2007Aug 24 2007

Other

Other12th World Congress on Medical Informatics, MEDINFO 2007
CountryAustralia
CityBrisbane, QLD
Period8/20/078/24/07

Fingerprint

Search Engine
Image retrieval
Search engines
Language
Character sets
Health
Pathology
Lung
Research
Therapeutics
Datasets

Keywords

  • image retrieval evaluation
  • log files analysis

ASJC Scopus subject areas

  • Biomedical Engineering
  • Health Informatics
  • Health Information Management

Cite this

Müller, H., Boyer, C., Gaudinat, A., Hersh, W. B., & Geissbuhler, A. (2007). Analyzing web log files of the health on the net honmedia search engine to define typical image search tasks for image retrieval evaluation. In Studies in Health Technology and Informatics (Vol. 129, pp. 1319-1323)

Analyzing web log files of the health on the net honmedia search engine to define typical image search tasks for image retrieval evaluation. / Müller, Henning; Boyer, Célia; Gaudinat, Arnaud; Hersh, William (Bill); Geissbuhler, Antoine.

Studies in Health Technology and Informatics. Vol. 129 2007. p. 1319-1323.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Müller, H, Boyer, C, Gaudinat, A, Hersh, WB & Geissbuhler, A 2007, Analyzing web log files of the health on the net honmedia search engine to define typical image search tasks for image retrieval evaluation. in Studies in Health Technology and Informatics. vol. 129, pp. 1319-1323, 12th World Congress on Medical Informatics, MEDINFO 2007, Brisbane, QLD, Australia, 8/20/07.
Müller H, Boyer C, Gaudinat A, Hersh WB, Geissbuhler A. Analyzing web log files of the health on the net honmedia search engine to define typical image search tasks for image retrieval evaluation. In Studies in Health Technology and Informatics. Vol. 129. 2007. p. 1319-1323
Müller, Henning ; Boyer, Célia ; Gaudinat, Arnaud ; Hersh, William (Bill) ; Geissbuhler, Antoine. / Analyzing web log files of the health on the net honmedia search engine to define typical image search tasks for image retrieval evaluation. Studies in Health Technology and Informatics. Vol. 129 2007. pp. 1319-1323
@inproceedings{3427ef8b5ba14d35aab72efc5e337bda,
title = "Analyzing web log files of the health on the net honmedia search engine to define typical image search tasks for image retrieval evaluation",
abstract = "Medical institutions produce ever-increasing amount of diverse information. The digital form makes these data available for the use on more than a single patient. Images are no exception to this. However, less is known about how medical professionals search for visual medical information and how they want to use it outside of the context of a single patient. This article analyzes ten months of usage log files of the Health on the Net (HON) medical media search engine. Key words were extracted from all queries and the most frequent terms and subjects were identified. The dataset required much pre-treatment. Problems included national character sets, spelling errors and the use of terms in several languages. The results show that media search, particularly for images, was frequently used. The most common queries were for general concepts (e.g., heart, lung). To define realistic information needs for the ImageCLEFmed challenge evaluation (Cross Language Evaluation Forum medical image retrieval), we used frequent queries that were still specific enough to at least cover two of the three axes on modality, anatomic region, and pathology. Several research groups evaluated their image retrieval algorithms based on these defined topics.",
keywords = "image retrieval evaluation, log files analysis",
author = "Henning M{\"u}ller and C{\'e}lia Boyer and Arnaud Gaudinat and Hersh, {William (Bill)} and Antoine Geissbuhler",
year = "2007",
language = "English (US)",
isbn = "9781586037741",
volume = "129",
pages = "1319--1323",
booktitle = "Studies in Health Technology and Informatics",

}

TY - GEN

T1 - Analyzing web log files of the health on the net honmedia search engine to define typical image search tasks for image retrieval evaluation

AU - Müller, Henning

AU - Boyer, Célia

AU - Gaudinat, Arnaud

AU - Hersh, William (Bill)

AU - Geissbuhler, Antoine

PY - 2007

Y1 - 2007

N2 - Medical institutions produce ever-increasing amount of diverse information. The digital form makes these data available for the use on more than a single patient. Images are no exception to this. However, less is known about how medical professionals search for visual medical information and how they want to use it outside of the context of a single patient. This article analyzes ten months of usage log files of the Health on the Net (HON) medical media search engine. Key words were extracted from all queries and the most frequent terms and subjects were identified. The dataset required much pre-treatment. Problems included national character sets, spelling errors and the use of terms in several languages. The results show that media search, particularly for images, was frequently used. The most common queries were for general concepts (e.g., heart, lung). To define realistic information needs for the ImageCLEFmed challenge evaluation (Cross Language Evaluation Forum medical image retrieval), we used frequent queries that were still specific enough to at least cover two of the three axes on modality, anatomic region, and pathology. Several research groups evaluated their image retrieval algorithms based on these defined topics.

AB - Medical institutions produce ever-increasing amount of diverse information. The digital form makes these data available for the use on more than a single patient. Images are no exception to this. However, less is known about how medical professionals search for visual medical information and how they want to use it outside of the context of a single patient. This article analyzes ten months of usage log files of the Health on the Net (HON) medical media search engine. Key words were extracted from all queries and the most frequent terms and subjects were identified. The dataset required much pre-treatment. Problems included national character sets, spelling errors and the use of terms in several languages. The results show that media search, particularly for images, was frequently used. The most common queries were for general concepts (e.g., heart, lung). To define realistic information needs for the ImageCLEFmed challenge evaluation (Cross Language Evaluation Forum medical image retrieval), we used frequent queries that were still specific enough to at least cover two of the three axes on modality, anatomic region, and pathology. Several research groups evaluated their image retrieval algorithms based on these defined topics.

KW - image retrieval evaluation

KW - log files analysis

UR - http://www.scopus.com/inward/record.url?scp=78149298044&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=78149298044&partnerID=8YFLogxK

M3 - Conference contribution

SN - 9781586037741

VL - 129

SP - 1319

EP - 1323

BT - Studies in Health Technology and Informatics

ER -