Hybridizing conversational and clear speech to determine the degree of contribution of acoustic features to intelligibility

Alexander Kain, Akiko Amano-Kusumoto, John Paul Hosom

Research output: Contribution to journalArticle

14 Citations (Scopus)

Abstract

Speakers naturally adopt a special "clear" (CLR) speaking style in order to be better understood by listeners who are moderately impaired in their ability to understand speech due to a hearing impairment, the presence of background noise, or both. In contrast, speech intended for nonimpaired listeners in quiet environments is referred to as "conversational" (CNV). Studies have shown that the intelligibility of CLR speech is usually higher than that of CNV speech in adverse circumstances. It is not known which individual acoustic features or combinations of features cause the higher intelligibility of CLR speech. The objective of this study is to determine the contribution of some acoustic features to intelligibility for a single speaker. The proposed method creates "hybrid" (HYB) speech stimuli that selectively combine acoustic features of one sentence spoken in the CNV and CLR styles. The intelligibility of these stimuli is then measured in perceptual tests, using 96 phonetically balanced sentences. Results for one speaker show significant sentence-level intelligibility improvements over CNV speech when replacing certain combinations of short-term spectra, phoneme identities, and phoneme durations of CNV speech with those from CLR speech, but no improvements for combinations involving fundamental frequency, energy, or nonspeech events (pauses).

Original languageEnglish (US)
Pages (from-to)2308-2319
Number of pages12
JournalJournal of the Acoustical Society of America
Volume124
Issue number4
DOIs
StatePublished - 2008

Fingerprint

intelligibility
acoustics
sentences
phonemes
stimuli
Acoustics
Intelligibility
impairment
background noise
hearing
causes

ASJC Scopus subject areas

  • Acoustics and Ultrasonics

Cite this

Hybridizing conversational and clear speech to determine the degree of contribution of acoustic features to intelligibility. / Kain, Alexander; Amano-Kusumoto, Akiko; Hosom, John Paul.

In: Journal of the Acoustical Society of America, Vol. 124, No. 4, 2008, p. 2308-2319.

Research output: Contribution to journalArticle

@article{475b2ea3bfad477195408672418c7cb6,
title = "Hybridizing conversational and clear speech to determine the degree of contribution of acoustic features to intelligibility",
abstract = "Speakers naturally adopt a special {"}clear{"} (CLR) speaking style in order to be better understood by listeners who are moderately impaired in their ability to understand speech due to a hearing impairment, the presence of background noise, or both. In contrast, speech intended for nonimpaired listeners in quiet environments is referred to as {"}conversational{"} (CNV). Studies have shown that the intelligibility of CLR speech is usually higher than that of CNV speech in adverse circumstances. It is not known which individual acoustic features or combinations of features cause the higher intelligibility of CLR speech. The objective of this study is to determine the contribution of some acoustic features to intelligibility for a single speaker. The proposed method creates {"}hybrid{"} (HYB) speech stimuli that selectively combine acoustic features of one sentence spoken in the CNV and CLR styles. The intelligibility of these stimuli is then measured in perceptual tests, using 96 phonetically balanced sentences. Results for one speaker show significant sentence-level intelligibility improvements over CNV speech when replacing certain combinations of short-term spectra, phoneme identities, and phoneme durations of CNV speech with those from CLR speech, but no improvements for combinations involving fundamental frequency, energy, or nonspeech events (pauses).",
author = "Alexander Kain and Akiko Amano-Kusumoto and Hosom, {John Paul}",
year = "2008",
doi = "10.1121/1.2967844",
language = "English (US)",
volume = "124",
pages = "2308--2319",
journal = "Journal of the Acoustical Society of America",
issn = "0001-4966",
publisher = "Acoustical Society of America",
number = "4",

}

TY - JOUR

T1 - Hybridizing conversational and clear speech to determine the degree of contribution of acoustic features to intelligibility

AU - Kain, Alexander

AU - Amano-Kusumoto, Akiko

AU - Hosom, John Paul

PY - 2008

Y1 - 2008

N2 - Speakers naturally adopt a special "clear" (CLR) speaking style in order to be better understood by listeners who are moderately impaired in their ability to understand speech due to a hearing impairment, the presence of background noise, or both. In contrast, speech intended for nonimpaired listeners in quiet environments is referred to as "conversational" (CNV). Studies have shown that the intelligibility of CLR speech is usually higher than that of CNV speech in adverse circumstances. It is not known which individual acoustic features or combinations of features cause the higher intelligibility of CLR speech. The objective of this study is to determine the contribution of some acoustic features to intelligibility for a single speaker. The proposed method creates "hybrid" (HYB) speech stimuli that selectively combine acoustic features of one sentence spoken in the CNV and CLR styles. The intelligibility of these stimuli is then measured in perceptual tests, using 96 phonetically balanced sentences. Results for one speaker show significant sentence-level intelligibility improvements over CNV speech when replacing certain combinations of short-term spectra, phoneme identities, and phoneme durations of CNV speech with those from CLR speech, but no improvements for combinations involving fundamental frequency, energy, or nonspeech events (pauses).

AB - Speakers naturally adopt a special "clear" (CLR) speaking style in order to be better understood by listeners who are moderately impaired in their ability to understand speech due to a hearing impairment, the presence of background noise, or both. In contrast, speech intended for nonimpaired listeners in quiet environments is referred to as "conversational" (CNV). Studies have shown that the intelligibility of CLR speech is usually higher than that of CNV speech in adverse circumstances. It is not known which individual acoustic features or combinations of features cause the higher intelligibility of CLR speech. The objective of this study is to determine the contribution of some acoustic features to intelligibility for a single speaker. The proposed method creates "hybrid" (HYB) speech stimuli that selectively combine acoustic features of one sentence spoken in the CNV and CLR styles. The intelligibility of these stimuli is then measured in perceptual tests, using 96 phonetically balanced sentences. Results for one speaker show significant sentence-level intelligibility improvements over CNV speech when replacing certain combinations of short-term spectra, phoneme identities, and phoneme durations of CNV speech with those from CLR speech, but no improvements for combinations involving fundamental frequency, energy, or nonspeech events (pauses).

UR - http://www.scopus.com/inward/record.url?scp=53949098981&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=53949098981&partnerID=8YFLogxK

U2 - 10.1121/1.2967844

DO - 10.1121/1.2967844

M3 - Article

C2 - 19062869

AN - SCOPUS:53949098981

VL - 124

SP - 2308

EP - 2319

JO - Journal of the Acoustical Society of America

JF - Journal of the Acoustical Society of America

SN - 0001-4966

IS - 4

ER -