Modeling coarticulation in continuous speech

Brian O. Bush; Alexander Kain

Institute on Development and Disability

Research output: Contribution to journal › Conference article › peer-review

2 Scopus citations

Abstract

Modeling coarticulation in speech has been largely limited to short sequences and/or limited phonetic context. We introduce a methodology for modeling both formant frequency and bandwidth in continuous speech, allowing examination of sentence-level coarticulation. The model represents continuous trajectories as a combination of overlapping local trajectories, which are represented by a weighted addition of acoustic event targets by sigmoidal coarticulation functions characterized by slope and position. Estimation is achieved using a combination of hill-climbing and grid-search, with global target, joint slope for identical contexts, and local position parameters. We evaluate model performance for two speakers using an intelligibility test that compares vocoded model output to a purely vocoded and a natural condition.
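As a rough illustration of the idea in the abstract, the sketch below blends two formant targets with a single sigmoid controlled by slope and position. It is not the authors' implementation: the logistic form, the two-target blend, the function names, and the target values in Hz are all assumptions made for illustration, and the abstract does not specify how overlapping local trajectories or bandwidths are combined.

```python
import math


def sigmoid(t, slope, position):
    """Assumed logistic coarticulation function: rises from 0 to 1 around `position`."""
    return 1.0 / (1.0 + math.exp(-slope * (t - position)))


def local_trajectory(t, target_a, target_b, slope, position):
    """Weighted addition of two targets; the weight moves from target_a to target_b."""
    w = sigmoid(t, slope, position)
    return (1.0 - w) * target_a + w * target_b


# Hypothetical second-formant targets (Hz) for two adjacent acoustic events.
times = [i * 0.01 for i in range(21)]  # 0.00 s to 0.20 s in 10 ms steps
track = [
    local_trajectory(t, 1200.0, 2200.0, slope=60.0, position=0.1)
    for t in times
]
```

In this sketch a larger `slope` gives a more abrupt transition between the two targets, and `position` shifts where the midpoint of the transition falls in time.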

Original language	English (US)
Pages (from-to)	193-197
Number of pages	5
Journal	Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
State	Published - 2014
Event	15th Annual Conference of the International Speech Communication Association: Celebrating the Diversity of Spoken Languages, INTERSPEECH 2014 - Singapore, Singapore
Duration	Sep 14 2014 → Sep 18 2014

Keywords

Coarticulation
Continuous speech
Formants

ASJC Scopus subject areas

Language and Linguistics
Human-Computer Interaction
Signal Processing
Software
Modeling and Simulation

Cite this

@article{2014f54271e1466d8d2781f048db2801,

title = "Modeling coarticulation in continuous speech",

abstract = "Modeling coarticulation in speech has been largely limited to short sequences and/or limited phonetic context. We introduce a methodology for modeling both formant frequency and bandwidth in continuous speech, allowing examination of sentence-level coarticulation. The model represents continuous trajectories as a combination of overlapping local trajectories, which are represented by a weighted addition of acoustic event targets by sigmoidal coarticulation functions characterized by slope and position. Estimation is achieved using a combination of hill-climbing and grid-search, with global target, joint slope for identical contexts, and local position parameters. We evaluate model performance for two speakers using an intelligibility test that compares vocoded model output to a purely vocoded and a natural condition.",

keywords = "Coarticulation, Continuous speech, Formants",

author = "Bush, {Brian O.} and Kain, Alexander",

note = "Publisher Copyright: Copyright {\textcopyright} 2014 ISCA.; 15th Annual Conference of the International Speech Communication Association: Celebrating the Diversity of Spoken Languages, INTERSPEECH 2014 ; Conference date: 14-09-2014 Through 18-09-2014",

year = "2014",

language = "English (US)",

pages = "193--197",

journal = "Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH",

issn = "2308-457X",

}

TY - JOUR

T1 - Modeling coarticulation in continuous speech

AU - Bush, Brian O.

AU - Kain, Alexander

PY - 2014

Y1 - 2014

N2 - Modeling coarticulation in speech has been largely limited to short sequences and/or limited phonetic context. We introduce a methodology for modeling both formant frequency and bandwidth in continuous speech, allowing examination of sentence-level coarticulation. The model represents continuous trajectories as a combination of overlapping local trajectories, which are represented by a weighted addition of acoustic event targets by sigmoidal coarticulation functions characterized by slope and position. Estimation is achieved using a combination of hill-climbing and grid-search, with global target, joint slope for identical contexts, and local position parameters. We evaluate model performance for two speakers using an intelligibility test that compares vocoded model output to a purely vocoded and a natural condition.

AB - Modeling coarticulation in speech has been largely limited to short sequences and/or limited phonetic context. We introduce a methodology for modeling both formant frequency and bandwidth in continuous speech, allowing examination of sentence-level coarticulation. The model represents continuous trajectories as a combination of overlapping local trajectories, which are represented by a weighted addition of acoustic event targets by sigmoidal coarticulation functions characterized by slope and position. Estimation is achieved using a combination of hill-climbing and grid-search, with global target, joint slope for identical contexts, and local position parameters. We evaluate model performance for two speakers using an intelligibility test that compares vocoded model output to a purely vocoded and a natural condition.

KW - Coarticulation

KW - Continuous speech

KW - Formants

UR - http://www.scopus.com/inward/record.url?scp=84910047777&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84910047777&partnerID=8YFLogxK

M3 - Conference article

AN - SCOPUS:84910047777

SN - 2308-457X

SP - 193

EP - 197

JO - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

JF - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

T2 - 15th Annual Conference of the International Speech Communication Association: Celebrating the Diversity of Spoken Languages, INTERSPEECH 2014

Y2 - 14 September 2014 through 18 September 2014

ER -
