"TalkPrinting": Improving speaker recognition by modeling stylistic features

Sachin Kajarekar, Kemal Sönmez, Luciana Ferrer, Venkata Gadde, Anand Venkataraman, Elizabeth Shriberg, Andreas Stolcke, Harry Bratt

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

Automatic speaker recognition is an important technology for intelligence gathering, law enforcement, and audio mining. Conventional speaker recognition systems, which are based on independent short-term spectral samples, suffer from a lack of noise robustness and are unable to model a speaker's idiosyncratic stylistic features. This paper describes "TalkPrinting", a program of research aimed at adding such stylistic features to conventional systems. Results on three preliminary systems based on stylistic features demonstrate that (1) the new features alone carry significant speaker information; (2) they also carry significant complementary information compared to the conventional features; and (3) they provide increasing improvements in performance with increasing test durations.

Original languageEnglish (US)
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
EditorsHsinchun Chen, Daniel D. Zeng, Therani Madhusudan, Richard Miranda, Jenny Schroeder, Chris Demchak
PublisherSpringer-Verlag
Pages350-354
Number of pages5
ISBN (Print)354040189X, 9783540401896
DOIs
StatePublished - 2003

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume2665
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Fingerprint Dive into the research topics of '"TalkPrinting": Improving speaker recognition by modeling stylistic features'. Together they form a unique fingerprint.

  • Cite this

    Kajarekar, S., Sönmez, K., Ferrer, L., Gadde, V., Venkataraman, A., Shriberg, E., Stolcke, A., & Bratt, H. (2003). "TalkPrinting": Improving speaker recognition by modeling stylistic features. In H. Chen, D. D. Zeng, T. Madhusudan, R. Miranda, J. Schroeder, & C. Demchak (Eds.), Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (pp. 350-354). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 2665). Springer-Verlag. https://doi.org/10.1007/3-540-44853-5_28