"TalkPrinting": Improving speaker recognition by modeling stylistic features

Sachin Kajarekar; Kemal Sönmez; Luciana Ferrer; Venkata Gadde; Anand Venkataraman; Elizabeth Shriberg; Andreas Stolcke; Harry Bratt

doi:10.1007/3-540-44853-5_28

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Abstract

Automatic speaker recognition is an important technology for intelligence gathering, law enforcement, and audio mining. Conventional speaker recognition systems, which are based on independent short-term spectral samples, suffer from a lack of noise robustness and are unable to model a speaker's idiosyncratic stylistic features. This paper describes "TalkPrinting", a program of research aimed at adding such stylistic features to conventional systems. Results on three preliminary systems based on stylistic features demonstrate that (1) the new features alone carry significant speaker information; (2) they also carry significant complementary information compared to the conventional features; and (3) they provide increasing improvements in performance with increasing test durations.

Original language	English (US)
Title of host publication	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Editors	Hsinchun Chen, Daniel D. Zeng, Therani Madhusudan, Richard Miranda, Jenny Schroeder, Chris Demchak
Publisher	Springer-Verlag
Pages	350-354
Number of pages	5
ISBN (Print)	354040189X, 9783540401896
DOIs	https://doi.org/10.1007/3-540-44853-5_28
State	Published - 2003
Externally published	Yes

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	2665
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

ASJC Scopus subject areas

Theoretical Computer Science
General Computer Science

Cite this

Kajarekar, S., Sönmez, K., Ferrer, L., Gadde, V., Venkataraman, A., Shriberg, E., Stolcke, A., & Bratt, H. (2003). "TalkPrinting": Improving speaker recognition by modeling stylistic features. In H. Chen, D. D. Zeng, T. Madhusudan, R. Miranda, J. Schroeder, & C. Demchak (Eds.), Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (pp. 350-354). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 2665). Springer-Verlag. https://doi.org/10.1007/3-540-44853-5_28

@inbook{c5db59b0ade84ad4a113f367faa9de2a,
title = "{"}TalkPrinting{"}: Improving speaker recognition by modeling stylistic features",
abstract = "Automatic speaker recognition is an important technology for intelligence gathering, law enforcement, and audio mining. Conventional speaker recognition systems, which are based on independent short-term spectral samples, suffer from a lack of noise robustness and are unable to model a speaker's idiosyncratic stylistic features. This paper describes {"}TalkPrinting{"}, a program of research aimed at adding such stylistic features to conventional systems. Results on three preliminary systems based on stylistic features demonstrate that (1) the new features alone carry significant speaker information; (2) they also carry significant complementary information compared to the conventional features; and (3) they provide increasing improvements in performance with increasing test durations.",
author = "Sachin Kajarekar and Kemal S{\"o}nmez and Luciana Ferrer and Venkata Gadde and Anand Venkataraman and Elizabeth Shriberg and Andreas Stolcke and Harry Bratt",
year = "2003",
doi = "10.1007/3-540-44853-5_28",
language = "English (US)",
isbn = "354040189X",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer-Verlag",
pages = "350--354",
editor = "Hsinchun Chen and Zeng, {Daniel D.} and Therani Madhusudan and Richard Miranda and Jenny Schroeder and Chris Demchak",
booktitle = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
}

TY - CHAP
T1 - "TalkPrinting"
T2 - Improving speaker recognition by modeling stylistic features
AU - Kajarekar, Sachin
AU - Sönmez, Kemal
AU - Ferrer, Luciana
AU - Gadde, Venkata
AU - Venkataraman, Anand
AU - Shriberg, Elizabeth
AU - Stolcke, Andreas
AU - Bratt, Harry
PY - 2003
Y1 - 2003

AB - Automatic speaker recognition is an important technology for intelligence gathering, law enforcement, and audio mining. Conventional speaker recognition systems, which are based on independent short-term spectral samples, suffer from a lack of noise robustness and are unable to model a speaker's idiosyncratic stylistic features. This paper describes "TalkPrinting", a program of research aimed at adding such stylistic features to conventional systems. Results on three preliminary systems based on stylistic features demonstrate that (1) the new features alone carry significant speaker information; (2) they also carry significant complementary information compared to the conventional features; and (3) they provide increasing improvements in performance with increasing test durations.

UR - http://www.scopus.com/inward/record.url?scp=35248812449&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=35248812449&partnerID=8YFLogxK
U2 - 10.1007/3-540-44853-5_28
DO - 10.1007/3-540-44853-5_28
M3 - Chapter
AN - SCOPUS:35248812449
SN - 354040189X
SN - 9783540401896
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 350
EP - 354
BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
A2 - Chen, Hsinchun
A2 - Zeng, Daniel D.
A2 - Madhusudan, Therani
A2 - Miranda, Richard
A2 - Schroeder, Jenny
A2 - Demchak, Chris
PB - Springer-Verlag
ER -
