Prosodic factors for predicting local pitch shape

E. Klabbers, Jan Van Santen, J. Wouters

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

8 Citations (Scopus)

Abstract

In this paper, we investigate the predictive power of different prosodic factorization schemes with respect to pitch movement. We use this to propose an extension of a standard diphone database with diphones that have been recorded in different prosodic contexts. The goal of this research is to reduce the amount of pitch modification required, thereby improving the segmental quality of the synthetic voice. We present a factorization scheme based on the foot structure of utterances and show that this efficient scheme results in a fairly small number of additional diphones that need to be recorded.

Original language: English (US)
Title of host publication: Proceedings of 2002 IEEE Workshop on Speech Synthesis
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 123-126
Number of pages: 4
ISBN (Print): 0780373952, 9780780373952
DOIs: https://doi.org/10.1109/WSS.2002.1224387
State: Published - 2002
Event: 2002 IEEE Workshop on Speech Synthesis - Santa Monica, United States
Duration: Sep 11 2002 to Sep 13 2002

Fingerprint

Utterance
Data Base

ASJC Scopus subject areas

  • Language and Linguistics
  • Cultural Studies

Cite this

Klabbers, E., Van Santen, J., & Wouters, J. (2002). Prosodic factors for predicting local pitch shape. In Proceedings of 2002 IEEE Workshop on Speech Synthesis (pp. 123-126). [1224387] Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/WSS.2002.1224387

@inproceedings{188b362f1cc74f17a263f923615dd331,
title = "Prosodic factors for predicting local pitch shape",
abstract = "In this paper, we investigate the predictive power of different prosodic factorization schemes with respect to pitch movement. We use this to propose an extension of a standard diphone database with diphones that have been recorded in different prosodic contexts. The goal of this research is to reduce the amount of pitch modification required, thereby improving the segmental quality of the synthetic voice. We present a factorization scheme based on the foot structure of utterances and show that this efficient scheme results in a fairly small number of additional diphones that need to be recorded.",
author = "E. Klabbers and {Van Santen}, Jan and J. Wouters",
year = "2002",
doi = "10.1109/WSS.2002.1224387",
language = "English (US)",
isbn = "0780373952",
pages = "123--126",
booktitle = "Proceedings of 2002 IEEE Workshop on Speech Synthesis",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - GEN
T1 - Prosodic factors for predicting local pitch shape
AU - Klabbers, E.
AU - Van Santen, Jan
AU - Wouters, J.
PY - 2002
Y1 - 2002
N2 - In this paper, we investigate the predictive power of different prosodic factorization schemes with respect to pitch movement. We use this to propose an extension of a standard diphone database with diphones that have been recorded in different prosodic contexts. The goal of this research is to reduce the amount of pitch modification required, thereby improving the segmental quality of the synthetic voice. We present a factorization scheme based on the foot structure of utterances and show that this efficient scheme results in a fairly small number of additional diphones that need to be recorded.
AB - In this paper, we investigate the predictive power of different prosodic factorization schemes with respect to pitch movement. We use this to propose an extension of a standard diphone database with diphones that have been recorded in different prosodic contexts. The goal of this research is to reduce the amount of pitch modification required, thereby improving the segmental quality of the synthetic voice. We present a factorization scheme based on the foot structure of utterances and show that this efficient scheme results in a fairly small number of additional diphones that need to be recorded.
UR - http://www.scopus.com/inward/record.url?scp=84966340300&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84966340300&partnerID=8YFLogxK
U2 - 10.1109/WSS.2002.1224387
DO - 10.1109/WSS.2002.1224387
M3 - Conference contribution
AN - SCOPUS:84966340300
SN - 0780373952
SN - 9780780373952
SP - 123
EP - 126
BT - Proceedings of 2002 IEEE Workshop on Speech Synthesis
PB - Institute of Electrical and Electronics Engineers Inc.
ER -