Effects of prosodic factors on spectral balance: Analysis and synthesis

Qi Miao, Xiaochuan Niu, Esther Klabbers, Jan Van Santen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In natural speech, prosodic factors such as accent, stress, phrasal position and speaking style play important roles in controlling several acoustic features, including segmental duration, pitch, and spectral balance, i.e., the amplitude pattern across different frequency ranges of the power spectrum. To synthesize speech that sounds natural, these effects need to be accurately modeled. In this study we describe and evaluate a synthesis method that mimics the effects of prosodic factors on spectral balance. We measure spectral balance by using the energy in four broad frequency bands that correspond to formant frequency ranges. An additive model is used to capture the effects of prosodic factors on spectral balance. A new sinusoidal synthesis module is implemented under Festival to predict the target spectral balance value for each band from analysis results and apply it to the amplitude parameters of the sinusoidal model during synthesis. In this study we evaluate an important strength of this system, which is its ability to reduce spectral discontinuities in unit concatenation.

Original languageEnglish (US)
Title of host publication3rd International Conference on Speech Prosody 2006
EditorsR. Hoffmann, H. Mixdorff
PublisherInternational Speech Communications Association
ISBN (Electronic)9780000000002
StatePublished - 2006
Event3rd International Conference on Speech Prosody, SP 2006 - Dresden, Germany
Duration: May 2 2006May 5 2006

Publication series

NameProceedings of the International Conference on Speech Prosody
ISSN (Print)2333-2042

Conference

Conference3rd International Conference on Speech Prosody, SP 2006
CountryGermany
CityDresden
Period5/2/065/5/06

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language

Fingerprint Dive into the research topics of 'Effects of prosodic factors on spectral balance: Analysis and synthesis'. Together they form a unique fingerprint.

Cite this