Estimating phoneme formant targets and coarticulation parameters of conversational and clear speech

Brian O. Bush; Alexander Kain

doi:10.1109/ICASSP.2013.6639226

Institute on Development and Disability

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

4 Scopus citations

Abstract

We present a data-driven formant model and methodology for discovering its parameters, namely phoneme targets and coarticulation functions for consonant-vowel-consonant (CVC) words from fully-automatic formant data. The model uses formant targets that are speaker dependent, but independent of speaking style and phonemic context. We used a global error measure to search for optimal formant targets for all phonemes, including classes of sounds where formants are not directly observable. Analysis of coarticulation parameters found significant differences in parameters between clear and conversational speech. Estimated formant targets were largely in agreement with acoustic-phonetic expectations. An intelligibility test validated that resynthesized CVC words using modeled formant trajectories were nearly as intelligible as resynthesized CVC words using observed formant trajectories.
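The kind of model summarized above can be illustrated with a minimal sketch. It is not the authors' implementation, and the abstract does not give the functional form. The sketch assumes that a CVC formant trajectory is a convex combination of three style-independent phoneme targets, that the coarticulation functions are sigmoids with hypothetical position and slope parameters, and that the global error is a sum of squared differences over all tokens. The names `sigmoid`, `cvc_trajectory`, and `global_error` are illustrative only.

```python
import numpy as np

def sigmoid(t, position, slope):
    """Assumed coarticulation function: rises from 0 to 1 around `position`."""
    return 1.0 / (1.0 + np.exp(-slope * (t - position)))

def cvc_trajectory(t, targets, params):
    """Model one formant over normalized time t in [0, 1].

    targets: (c1, v, c2) formant targets in Hz (assumed style-independent).
    params:  (p1, s1, p2, s2) hypothetical position/slope values for the
             C1->V and V->C2 transitions (assumed style-dependent).
    """
    c1, v, c2 = targets
    p1, s1, p2, s2 = params
    w_in = sigmoid(t, p1, s1)    # C1 -> V transition
    w_out = sigmoid(t, p2, s2)   # V -> C2 transition
    # The three weights are non-negative and sum to 1 at every time point.
    w_c1 = 1.0 - w_in
    w_v = w_in * (1.0 - w_out)
    w_c2 = w_in * w_out
    return w_c1 * c1 + w_v * v + w_c2 * c2

def global_error(tokens, targets_by_phoneme, params_by_token):
    """Sum of squared errors over all tokens for one shared set of targets.

    tokens: list of (phonemes, t, observed), where phonemes is a 3-tuple
            of labels and observed is an array of formant values in Hz.
    """
    total = 0.0
    for (phonemes, t, observed), params in zip(tokens, params_by_token):
        targets = tuple(targets_by_phoneme[p] for p in phonemes)
        modeled = cvc_trajectory(t, targets, params)
        total += float(np.sum((modeled - observed) ** 2))
    return total

# Made-up example values, for illustration only.
t = np.linspace(0.0, 1.0, 101)
example_targets = {"b": 300.0, "ae": 700.0, "t": 400.0}
example_params = (0.25, 40.0, 0.75, 40.0)
example_traj = cvc_trajectory(t, (300.0, 700.0, 400.0), example_params)
example_tokens = [(("b", "ae", "t"), t, example_traj)]
example_error = global_error(example_tokens, example_targets, [example_params])
```

Under these assumptions, a search for targets would minimize `global_error` over the shared target table, while the per-token sigmoid parameters absorb differences between clear and conversational speech.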

Original language	English (US)
Title of host publication	2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013 - Proceedings
Pages	8017-8021
Number of pages	5
DOIs	https://doi.org/10.1109/ICASSP.2013.6639226
State	Published - Oct 18 2013
Event	2013 38th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013 - Vancouver, BC, Canada
Duration	May 26 2013 → May 31 2013

Publication series

Name	ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN (Print)	1520-6149

Other

Other	2013 38th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013
Country/Territory	Canada
City	Vancouver, BC
Period	5/26/13 → 5/31/13

Keywords

clear speech
coarticulation
formants

ASJC Scopus subject areas

Software
Signal Processing
Electrical and Electronic Engineering

Cite this

Bush, B. O., & Kain, A. (2013). Estimating phoneme formant targets and coarticulation parameters of conversational and clear speech. In 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013 - Proceedings (pp. 8017-8021). Article 6639226 (ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings). https://doi.org/10.1109/ICASSP.2013.6639226

@inproceedings{b39674cecb8f44878846a834d7ba5fe4,

title = "Estimating phoneme formant targets and coarticulation parameters of conversational and clear speech",

keywords = "clear speech, coarticulation, formants",

author = "Bush, {Brian O.} and Alexander Kain",

year = "2013",

month = oct,

day = "18",

doi = "10.1109/ICASSP.2013.6639226",

language = "English (US)",

isbn = "9781479903566",

series = "ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings",

pages = "8017--8021",

booktitle = "2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013 - Proceedings",

note = "2013 38th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013 ; Conference date: 26-05-2013 Through 31-05-2013",

}

TY - GEN

T1 - Estimating phoneme formant targets and coarticulation parameters of conversational and clear speech

AU - Bush, Brian O.

AU - Kain, Alexander

PY - 2013/10/18

Y1 - 2013/10/18

KW - clear speech

KW - coarticulation

KW - formants

UR - http://www.scopus.com/inward/record.url?scp=84890515469&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84890515469&partnerID=8YFLogxK

U2 - 10.1109/ICASSP.2013.6639226

DO - 10.1109/ICASSP.2013.6639226

M3 - Conference contribution

AN - SCOPUS:84890515469

SN - 9781479903566

T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

SP - 8017

EP - 8021

BT - 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013 - Proceedings

T2 - 2013 38th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013

Y2 - 26 May 2013 through 31 May 2013

ER -
