SPEAKER TRACKING AND DETECTION WITH MULTIPLE SPEAKERS

Kemal Sönmez, Larry Heck, Mitchel Weintraub

Research output: Contribution to conferencePaperpeer-review

19 Scopus citations

Abstract

We describe a speaker tracking and detection system, for Switchboard conversations, that uses a two-speaker and silence hidden Markov model (HMM) with a minimum state duration constraint and Gaussian mixture model (GMM) state distributions adapted from a single gender- and handset-independent imposter model distribution. Speaker tracking is used to segment speakers for detection, which is carried out by averaging frame scores of the Viterbi path and HNORM'ing via a novel parameter interpolation extension of HNORM for use with files of arbitrary lengths. Use of duration statistics augmenting the acoustic scores is also introduced via a nonlinear combination function. Results are reported on the NIST 1998 Multispeaker development evaluation dataset.

Original languageEnglish (US)
Pages2219-2222
Number of pages4
StatePublished - 1999
Event6th European Conference on Speech Communication and Technology, EUROSPEECH 1999 - Budapest, Hungary
Duration: Sep 5 1999Sep 9 1999

Conference

Conference6th European Conference on Speech Communication and Technology, EUROSPEECH 1999
Country/TerritoryHungary
CityBudapest
Period9/5/999/9/99

ASJC Scopus subject areas

  • Computer Science Applications
  • Software
  • Linguistics and Language
  • Communication

Fingerprint

Dive into the research topics of 'SPEAKER TRACKING AND DETECTION WITH MULTIPLE SPEAKERS'. Together they form a unique fingerprint.

Cite this