Hybridizing conversational and clear speech to determine the degree of contribution of acoustic features to intelligibility

Alexander Kain; Akiko Amano-Kusumoto; John Paul Hosom

doi:10.1121/1.2967844

Institute on Development and Disability

Research output: Contribution to journal › Article › peer-review

19 Scopus citations

Abstract

Speakers naturally adopt a special "clear" (CLR) speaking style in order to be better understood by listeners who are moderately impaired in their ability to understand speech due to a hearing impairment, the presence of background noise, or both. In contrast, speech intended for nonimpaired listeners in quiet environments is referred to as "conversational" (CNV). Studies have shown that the intelligibility of CLR speech is usually higher than that of CNV speech in adverse circumstances. It is not known which individual acoustic features or combinations of features cause the higher intelligibility of CLR speech. The objective of this study is to determine the contribution of some acoustic features to intelligibility for a single speaker. The proposed method creates "hybrid" (HYB) speech stimuli that selectively combine acoustic features of one sentence spoken in the CNV and CLR styles. The intelligibility of these stimuli is then measured in perceptual tests, using 96 phonetically balanced sentences. Results for one speaker show significant sentence-level intelligibility improvements over CNV speech when replacing certain combinations of short-term spectra, phoneme identities, and phoneme durations of CNV speech with those from CLR speech, but no improvements for combinations involving fundamental frequency, energy, or nonspeech events (pauses).

Original language	English (US)
Pages (from-to)	2308-2319
Number of pages	12
Journal	Journal of the Acoustical Society of America
Volume	124
Issue number	4
DOIs	https://doi.org/10.1121/1.2967844
State	Published - 2008

ASJC Scopus subject areas

Arts and Humanities (miscellaneous)
Acoustics and Ultrasonics

Cite this

@article{475b2ea3bfad477195408672418c7cb6,

title = "Hybridizing conversational and clear speech to determine the degree of contribution of acoustic features to intelligibility",

abstract = "Speakers naturally adopt a special {"}clear{"} (CLR) speaking style in order to be better understood by listeners who are moderately impaired in their ability to understand speech due to a hearing impairment, the presence of background noise, or both. In contrast, speech intended for nonimpaired listeners in quiet environments is referred to as {"}conversational{"} (CNV). Studies have shown that the intelligibility of CLR speech is usually higher than that of CNV speech in adverse circumstances. It is not known which individual acoustic features or combinations of features cause the higher intelligibility of CLR speech. The objective of this study is to determine the contribution of some acoustic features to intelligibility for a single speaker. The proposed method creates {"}hybrid{"} (HYB) speech stimuli that selectively combine acoustic features of one sentence spoken in the CNV and CLR styles. The intelligibility of these stimuli is then measured in perceptual tests, using 96 phonetically balanced sentences. Results for one speaker show significant sentence-level intelligibility improvements over CNV speech when replacing certain combinations of short-term spectra, phoneme identities, and phoneme durations of CNV speech with those from CLR speech, but no improvements for combinations involving fundamental frequency, energy, or nonspeech events (pauses).",

author = "Kain, Alexander and Amano-Kusumoto, Akiko and Hosom, {John Paul}",

year = "2008",

doi = "10.1121/1.2967844",

language = "English (US)",

volume = "124",

pages = "2308--2319",

journal = "Journal of the Acoustical Society of America",

issn = "0001-4966",

publisher = "Acoustical Society of America",

number = "4",

}

TY - JOUR

T1 - Hybridizing conversational and clear speech to determine the degree of contribution of acoustic features to intelligibility

AU - Kain, Alexander

AU - Amano-Kusumoto, Akiko

AU - Hosom, John Paul

PY - 2008

Y1 - 2008

N2 - Speakers naturally adopt a special "clear" (CLR) speaking style in order to be better understood by listeners who are moderately impaired in their ability to understand speech due to a hearing impairment, the presence of background noise, or both. In contrast, speech intended for nonimpaired listeners in quiet environments is referred to as "conversational" (CNV). Studies have shown that the intelligibility of CLR speech is usually higher than that of CNV speech in adverse circumstances. It is not known which individual acoustic features or combinations of features cause the higher intelligibility of CLR speech. The objective of this study is to determine the contribution of some acoustic features to intelligibility for a single speaker. The proposed method creates "hybrid" (HYB) speech stimuli that selectively combine acoustic features of one sentence spoken in the CNV and CLR styles. The intelligibility of these stimuli is then measured in perceptual tests, using 96 phonetically balanced sentences. Results for one speaker show significant sentence-level intelligibility improvements over CNV speech when replacing certain combinations of short-term spectra, phoneme identities, and phoneme durations of CNV speech with those from CLR speech, but no improvements for combinations involving fundamental frequency, energy, or nonspeech events (pauses).

AB - Speakers naturally adopt a special "clear" (CLR) speaking style in order to be better understood by listeners who are moderately impaired in their ability to understand speech due to a hearing impairment, the presence of background noise, or both. In contrast, speech intended for nonimpaired listeners in quiet environments is referred to as "conversational" (CNV). Studies have shown that the intelligibility of CLR speech is usually higher than that of CNV speech in adverse circumstances. It is not known which individual acoustic features or combinations of features cause the higher intelligibility of CLR speech. The objective of this study is to determine the contribution of some acoustic features to intelligibility for a single speaker. The proposed method creates "hybrid" (HYB) speech stimuli that selectively combine acoustic features of one sentence spoken in the CNV and CLR styles. The intelligibility of these stimuli is then measured in perceptual tests, using 96 phonetically balanced sentences. Results for one speaker show significant sentence-level intelligibility improvements over CNV speech when replacing certain combinations of short-term spectra, phoneme identities, and phoneme durations of CNV speech with those from CLR speech, but no improvements for combinations involving fundamental frequency, energy, or nonspeech events (pauses).

UR - http://www.scopus.com/inward/record.url?scp=53949098981&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=53949098981&partnerID=8YFLogxK

U2 - 10.1121/1.2967844

DO - 10.1121/1.2967844

M3 - Article

C2 - 19062869

AN - SCOPUS:53949098981

SN - 0001-4966

VL - 124

SP - 2308

EP - 2319

JO - Journal of the Acoustical Society of America

JF - Journal of the Acoustical Society of America

IS - 4

ER -
