Speech recognition as feature extraction for speaker recognition

A. Stolcke, E. Shriberg, L. Ferrer, S. Kajarekar, Mustafa (Kemal) Sonmez, G. Tur

Research output: Chapter in Book/Report/Conference proceedingConference contribution

15 Scopus citations

Abstract

Information from speech recognition can be used in various ways in state-of-the-art speaker recognition systems. This includes the obvious use of recognized words to enable the use of text-dependent speaker modeling techniques when the words spoken are not given. Furthermore, it has been shown that the choice of words and phones itself can be a useful indicator of speaker identity. Also, recognizer output enables higher-level features, in particular those related to prosodic properties of speech. Finally, we discuss the use of mere byproducts of word recognition, such as subword unit alignments, pronunciations, and speaker adaptation transforms to derive powerful nonstandard features for speaker modeling. We present specific techniques and results from SRI's NIST speaker recognition evaluation system.

Original languageEnglish (US)
Title of host publicationProceedings - SAFE 2007: Workshop on Signal Processing Applications for Public Security and Forensics
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Print)1424412269, 9781424412266
StatePublished - 2007
Externally publishedYes
EventWorkshop on Signal Processing Applications for Public Security and Forensics, SAFE 2007 - Washington, United States
Duration: Apr 11 2007Apr 13 2007

Other

OtherWorkshop on Signal Processing Applications for Public Security and Forensics, SAFE 2007
CountryUnited States
CityWashington
Period4/11/074/13/07

Keywords

  • High-level features
  • Prosody
  • Speaker adaptation
  • Speaker recognition
  • Speech recognition

ASJC Scopus subject areas

  • Law
  • Computer Vision and Pattern Recognition
  • Signal Processing

Fingerprint Dive into the research topics of 'Speech recognition as feature extraction for speaker recognition'. Together they form a unique fingerprint.

  • Cite this

    Stolcke, A., Shriberg, E., Ferrer, L., Kajarekar, S., Sonmez, M. K., & Tur, G. (2007). Speech recognition as feature extraction for speaker recognition. In Proceedings - SAFE 2007: Workshop on Signal Processing Applications for Public Security and Forensics [4218939] Institute of Electrical and Electronics Engineers Inc..