The DIRAC AWEAR audio-visual platform for detection of unexpected and incongruent events

Jörn Anemüller; Jörg Hendrik Bach; Barbara Caputo; Michal Havlena; Luo Jie; Hendrik Kayser; Bastian Leibe; Petr Motlicek; Tomas Pajdla; Misha Pavel; Akihiko Torii; Luc Van Gool; Alon Zweig; Hynek Hermansky

doi:10.1145/1452392.1452451

The DIRAC AWEAR audio-visual platform for detection of unexpected and incongruent events

Jörn Anemüller, Jörg Hendrik Bach, Barbara Caputo, Michal Havlena, Luo Jie, Hendrik Kayser, Bastian Leibe, Petr Motlicek, Tomas Pajdla, Misha Pavel, Akihiko Torii, Luc Van Gool, Alon Zweig, Hynek Hermansky

Biomedical Engineering

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

2 Scopus citations

Abstract

It is of prime importance in everyday human life to cope with and respond appropriately to events that are not foreseen by prior experience. Machines to a large extent lack the ability to respond appropriately to such inputs. An important class of unexpected events is defined by incongruent combinations of inputs from different modalities and therefore multimodal information provides a crucial cue for the identification of such events, e.g., the sound of a voice is being heard while the person in the field-of-view does not move her lips. In the project DIRAC ("Detection and Identification of Rare Audio-visual Cues") we have been developing algorithmic approaches to the detection of such events, as well as an experimental hardware platform to test it. An audio-visual platform ("AWEAR" - audio-visual wearable device) has been constructed with the goal to help users with disabilities or a high cognitive load to deal with unexpected events. Key hardware components include stereo panoramic vision sensors and 6-channel worn-behind-the-ear (hearing aid) microphone arrays. Data have been recorded to study audio-visual tracking, a/v scene/object classification and a/v detection of incongruencies.

Original language	English (US)
Title of host publication	ICMI'08
Subtitle of host publication	Proceedings of the 10th International Conference on Multimodal Interfaces
Pages	289-292
Number of pages	4
DOIs	https://doi.org/10.1145/1452392.1452451
State	Published - 2008
Event	10th International Conference on Multimodal Interfaces, ICMI'08 - Chania, Crete, Greece Duration: Oct 20 2008 → Oct 22 2008

Publication series

Name	ICMI'08: Proceedings of the 10th International Conference on Multimodal Interfaces

Other

Other	10th International Conference on Multimodal Interfaces, ICMI'08
Country/Territory	Greece
City	Chania, Crete
Period	10/20/08 → 10/22/08

Keywords

Audio-visual
Augmented cognition
Event detection
Multimodal interaction
Sensor platform

ASJC Scopus subject areas

Computer Science Applications
Human-Computer Interaction

Access to Document

10.1145/1452392.1452451

Cite this

Anemüller, J., Bach, J. H., Caputo, B., Havlena, M., Jie, L., Kayser, H., Leibe, B., Motlicek, P., Pajdla, T., Pavel, M., Torii, A., Gool, L. V., Zweig, A., & Hermansky, H. (2008). The DIRAC AWEAR audio-visual platform for detection of unexpected and incongruent events. In ICMI'08: Proceedings of the 10th International Conference on Multimodal Interfaces (pp. 289-292). (ICMI'08: Proceedings of the 10th International Conference on Multimodal Interfaces). https://doi.org/10.1145/1452392.1452451

The DIRAC AWEAR audio-visual platform for detection of unexpected and incongruent events. / Anemüller, Jörn; Bach, Jörg Hendrik; Caputo, Barbara et al.
ICMI'08: Proceedings of the 10th International Conference on Multimodal Interfaces. 2008. p. 289-292 (ICMI'08: Proceedings of the 10th International Conference on Multimodal Interfaces).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Anemüller, J, Bach, JH, Caputo, B, Havlena, M, Jie, L, Kayser, H, Leibe, B, Motlicek, P, Pajdla, T, Pavel, M, Torii, A, Gool, LV, Zweig, A & Hermansky, H 2008, The DIRAC AWEAR audio-visual platform for detection of unexpected and incongruent events. in ICMI'08: Proceedings of the 10th International Conference on Multimodal Interfaces. ICMI'08: Proceedings of the 10th International Conference on Multimodal Interfaces, pp. 289-292, 10th International Conference on Multimodal Interfaces, ICMI'08, Chania, Crete, Greece, 10/20/08. https://doi.org/10.1145/1452392.1452451

Anemüller J, Bach JH, Caputo B, Havlena M, Jie L, Kayser H et al. The DIRAC AWEAR audio-visual platform for detection of unexpected and incongruent events. In ICMI'08: Proceedings of the 10th International Conference on Multimodal Interfaces. 2008. p. 289-292. (ICMI'08: Proceedings of the 10th International Conference on Multimodal Interfaces). doi: 10.1145/1452392.1452451

@inproceedings{a48979352a2548e398ae1e94d02ce8ec,

title = "The DIRAC AWEAR audio-visual platform for detection of unexpected and incongruent events",

abstract = "It is of prime importance in everyday human life to cope with and respond appropriately to events that are not foreseen by prior experience. Machines to a large extent lack the ability to respond appropriately to such inputs. An important class of unexpected events is defined by incongruent combinations of inputs from different modalities and therefore multimodal information provides a crucial cue for the identification of such events, e.g., the sound of a voice is being heard while the person in the field-of-view does not move her lips. In the project DIRAC ({"}Detection and Identification of Rare Audio-visual Cues{"}) we have been developing algorithmic approaches to the detection of such events, as well as an experimental hardware platform to test it. An audio-visual platform ({"}AWEAR{"} - audio-visual wearable device) has been constructed with the goal to help users with disabilities or a high cognitive load to deal with unexpected events. Key hardware components include stereo panoramic vision sensors and 6-channel worn-behind-the-ear (hearing aid) microphone arrays. Data have been recorded to study audio-visual tracking, a/v scene/object classification and a/v detection of incongruencies.",

keywords = "Audio-visual, Augmented cognition, Event detection, Multimodal interaction, Sensor platform",

author = "J{\"o}rn Anem{\"u}ller and Bach, {J{\"o}rg Hendrik} and Barbara Caputo and Michal Havlena and Luo Jie and Hendrik Kayser and Bastian Leibe and Petr Motlicek and Tomas Pajdla and Misha Pavel and Akihiko Torii and Gool, {Luc Van} and Alon Zweig and Hynek Hermansky",

year = "2008",

doi = "10.1145/1452392.1452451",

language = "English (US)",

isbn = "9781605581989",

series = "ICMI'08: Proceedings of the 10th International Conference on Multimodal Interfaces",

pages = "289--292",

booktitle = "ICMI'08",

note = "10th International Conference on Multimodal Interfaces, ICMI'08 ; Conference date: 20-10-2008 Through 22-10-2008",

}

TY - GEN

T1 - The DIRAC AWEAR audio-visual platform for detection of unexpected and incongruent events

AU - Anemüller, Jörn

AU - Bach, Jörg Hendrik

AU - Caputo, Barbara

AU - Havlena, Michal

AU - Jie, Luo

AU - Kayser, Hendrik

AU - Leibe, Bastian

AU - Motlicek, Petr

AU - Pajdla, Tomas

AU - Pavel, Misha

AU - Torii, Akihiko

AU - Gool, Luc Van

AU - Zweig, Alon

AU - Hermansky, Hynek

PY - 2008

Y1 - 2008

N2 - It is of prime importance in everyday human life to cope with and respond appropriately to events that are not foreseen by prior experience. Machines to a large extent lack the ability to respond appropriately to such inputs. An important class of unexpected events is defined by incongruent combinations of inputs from different modalities and therefore multimodal information provides a crucial cue for the identification of such events, e.g., the sound of a voice is being heard while the person in the field-of-view does not move her lips. In the project DIRAC ("Detection and Identification of Rare Audio-visual Cues") we have been developing algorithmic approaches to the detection of such events, as well as an experimental hardware platform to test it. An audio-visual platform ("AWEAR" - audio-visual wearable device) has been constructed with the goal to help users with disabilities or a high cognitive load to deal with unexpected events. Key hardware components include stereo panoramic vision sensors and 6-channel worn-behind-the-ear (hearing aid) microphone arrays. Data have been recorded to study audio-visual tracking, a/v scene/object classification and a/v detection of incongruencies.

AB - It is of prime importance in everyday human life to cope with and respond appropriately to events that are not foreseen by prior experience. Machines to a large extent lack the ability to respond appropriately to such inputs. An important class of unexpected events is defined by incongruent combinations of inputs from different modalities and therefore multimodal information provides a crucial cue for the identification of such events, e.g., the sound of a voice is being heard while the person in the field-of-view does not move her lips. In the project DIRAC ("Detection and Identification of Rare Audio-visual Cues") we have been developing algorithmic approaches to the detection of such events, as well as an experimental hardware platform to test it. An audio-visual platform ("AWEAR" - audio-visual wearable device) has been constructed with the goal to help users with disabilities or a high cognitive load to deal with unexpected events. Key hardware components include stereo panoramic vision sensors and 6-channel worn-behind-the-ear (hearing aid) microphone arrays. Data have been recorded to study audio-visual tracking, a/v scene/object classification and a/v detection of incongruencies.

KW - Audio-visual

KW - Augmented cognition

KW - Event detection

KW - Multimodal interaction

KW - Sensor platform

UR - http://www.scopus.com/inward/record.url?scp=63449125358&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=63449125358&partnerID=8YFLogxK

U2 - 10.1145/1452392.1452451

DO - 10.1145/1452392.1452451

M3 - Conference contribution

AN - SCOPUS:63449125358

SN - 9781605581989

T3 - ICMI'08: Proceedings of the 10th International Conference on Multimodal Interfaces

SP - 289

EP - 292

BT - ICMI'08

T2 - 10th International Conference on Multimodal Interfaces, ICMI'08

Y2 - 20 October 2008 through 22 October 2008

ER -

The DIRAC AWEAR audio-visual platform for detection of unexpected and incongruent events

Abstract

Publication series

Other

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this