Parameterization of prosodic feature distributions for SVM modeling in speaker recognition

Luciana Ferrer, Elizabeth Shriberg, Sachin Kajarekar, Kemal Sönmez

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

27 Scopus citations

Abstract

Multiple recent studies have shown that speaker recognition performance using frame-based cepstral features is improved by adding higher-level information, including prosodic and lexical features. This paper explores the important question of finding a good kernel for a system that models syllable-based prosodic features using support vector machines (SVMs). The system has been the best-performing of our high-level systems in the last two NIST evaluations, and gives significant improvements when combined with cepstral-based systems. We introduce two new methods for transforming the syllable-level features into a single high-dimensional vector that can be well modeled by SVMs, resulting in significant gains in speaker recognition performance.
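The abstract does not spell out the two transforms, so the following is only a minimal sketch of the general idea of turning a variable-length set of syllable-level features into one fixed-length vector that a kernel can compare. It uses plain per-feature binning, which is an assumption chosen for illustration and not the paper's method. The function names, the bin edges, and the pitch and duration values are all hypothetical.

```python
from bisect import bisect_right


def binned_vector(syllables, edges_per_feature):
    """Map a variable-length list of per-syllable feature tuples to one
    fixed-length vector of normalized bin frequencies (illustrative only)."""
    vector = []
    total = len(syllables) or 1  # avoid division by zero for empty input
    for dim, edges in enumerate(edges_per_feature):
        # len(edges) boundaries define len(edges) + 1 bins for this feature.
        counts = [0] * (len(edges) + 1)
        for syllable in syllables:
            counts[bisect_right(edges, syllable[dim])] += 1
        vector.extend(count / total for count in counts)
    return vector


def linear_kernel(u, v):
    """Plain dot product between two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


# Hypothetical data: (pitch in Hz, duration in seconds) per syllable.
utt_a = [(110.0, 0.12), (135.0, 0.20), (180.0, 0.31)]
utt_b = [(210.0, 0.09), (220.0, 0.15)]

# Hypothetical bin boundaries, one list per feature dimension.
edges = [[120.0, 160.0], [0.10, 0.25]]

vec_a = binned_vector(utt_a, edges)
vec_b = binned_vector(utt_b, edges)
similarity = linear_kernel(vec_a, vec_b)
```

Both utterances map to vectors of the same length regardless of how many syllables they contain, which is the property an SVM kernel needs. In a real system the vectors would be passed to an SVM trainer rather than compared directly.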

Original language: English (US)
Title of host publication: 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07
Pages: IV233-IV236
DOIs: https://doi.org/10.1109/ICASSP.2007.366892
State: Published - Aug 6 2007
Externally published: Yes
Event: 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07 - Honolulu, HI, United States
Duration: Apr 15 2007 to Apr 20 2007

Publication series

Name: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Volume: 4
ISSN (Print): 1520-6149

Other

Other: 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07
Country: United States
City: Honolulu, HI
Period: 4/15/07 to 4/20/07

Keywords

  • GMM
  • Prosody
  • SVM
  • Speaker recognition

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering

Cite this

Ferrer, L., Shriberg, E., Kajarekar, S., & Sönmez, K. (2007). Parameterization of prosodic feature distributions for SVM modeling in speaker recognition. In 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07 (pp. IV233-IV236). [4218080] (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings; Vol. 4). https://doi.org/10.1109/ICASSP.2007.366892