Building an ASR system for Noisy environments: SRI's 2001 SPINE evaluation system

Venkata Ramana Rao Gadde, Andreas Stolcke, Dimitra Vergyri, Jing Zheng, Kemal Sonmez, Anand Venkataraman

Research output: Contribution to conferencePaper

9 Scopus citations

Abstract

We describe SRI's recognition system as used in the 2001 DARPA Speech in Noisy Environments (SPINE) evaluation. The SPINE task involves recognition of speech in simulated military environments. The task had some unique challenges, including segmentation of foreground speech from noisy background, the need for robust acoustic models to handle noisy speech, and development of language models from limited training data. In developing the SRI evaluation system for this task, we addressed each of these challenges using a combination of state-of-the-art techniques, including several types of feature normalization, model adaptation, class-based language modeling, multi-pass segmentation and recognition, and word posterior-based decoding and system combination.

Original languageEnglish (US)
Pages1577-1580
Number of pages4
StatePublished - Jan 1 2002
Externally publishedYes
Event7th International Conference on Spoken Language Processing, ICSLP 2002 - Denver, United States
Duration: Sep 16 2002Sep 20 2002

Other

Other7th International Conference on Spoken Language Processing, ICSLP 2002
CountryUnited States
CityDenver
Period9/16/029/20/02

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language

Fingerprint Dive into the research topics of 'Building an ASR system for Noisy environments: SRI's 2001 SPINE evaluation system'. Together they form a unique fingerprint.

  • Cite this

    Gadde, V. R. R., Stolcke, A., Vergyri, D., Zheng, J., Sonmez, K., & Venkataraman, A. (2002). Building an ASR system for Noisy environments: SRI's 2001 SPINE evaluation system. 1577-1580. Paper presented at 7th International Conference on Spoken Language Processing, ICSLP 2002, Denver, United States.