Evaluation of speaker mimic technology for personalizing SGD voices

Esther Klabbers, Alexander Kain, Jan P.H. Van Santen

Research output: Contribution to conferencePaperpeer-review

3 Scopus citations

Abstract

In this paper, we demonstrate the use of state-of-the-art speech technology to transform speech from a source speaker to mimic a particular target speaker with the intention of providng personalized voices to users of Speech Generating Devices (SGDs). This speaker mimicry (SM) capability allows us to use high-quality acoustic inventories from professional speakers and transform them to a different target speaker using a very limited set of sentences from that speaker. This technology targets future SGD users who still have a limited vocabulary or available previous recordings. The results of a perceptual study show that listeners can identify which SM voices most resemble their respective target voices. 1

Original languageEnglish (US)
Pages2154-2157
Number of pages4
StatePublished - 2010
Event11th Annual Conference of the International Speech Communication Association: Spoken Language Processing for All, INTERSPEECH 2010 - Makuhari, Chiba, Japan
Duration: Sep 26 2010Sep 30 2010

Other

Other11th Annual Conference of the International Speech Communication Association: Spoken Language Processing for All, INTERSPEECH 2010
CountryJapan
CityMakuhari, Chiba
Period9/26/109/30/10

Keywords

  • Perceptual evaluation
  • Prosody modeling
  • Speech synthesis
  • Voice transformation

ASJC Scopus subject areas

  • Language and Linguistics
  • Speech and Hearing

Fingerprint Dive into the research topics of 'Evaluation of speaker mimic technology for personalizing SGD voices'. Together they form a unique fingerprint.

Cite this