Improving the intelligibility of dysarthric speech

Alexander Kain, John Paul Hosom, Xiaochuan Niu, Jan Van Santen, Melanie Fried-Oken, Janice Staehely

Research output: Contribution to journalArticle

50 Citations (Scopus)

Abstract

Dysarthria is a speech motor disorder usually resulting in a substantive decrease in speech intelligibility by the general population. In this study, we have significantly improved the intelligibility of dysarthric vowels of one speaker from 48% to 54%, as evaluated by a vowel identification task using 64 CVC stimuli judged by 24 listeners. Improvement was obtained by transforming the vowels of a speaker with dysarthria to more closely match the vowel space of a non-dysarthric (target) speaker. The optimal mapping feature set, from a list of 21 candidate feature sets, proved to be one utilizing vowel duration and F1-F3 stable points, which were calculated using shape-constrained isotonic regression. The choice of speaker-specific or speaker-independent vowel formant targets appeared to be insignificant. Comparisons with "oracle" conditions were performed in order to evaluate the analysis/re-synthesis system independently of the transformation function.

Original languageEnglish (US)
Pages (from-to)743-759
Number of pages17
JournalSpeech Communication
Volume49
Issue number9
DOIs
StatePublished - Sep 2007

Fingerprint

Speech Intelligibility
Speech intelligibility
Dysarthria
Isotonic Regression
Speech Disorders
Target
listener
Disorder
stimulus
candidacy
Synthesis
regression
Decrease
Evaluate
Population
Speech
Intelligibility
Motor Disorders

Keywords

  • Dysarthria
  • Intelligibility
  • Speech modification
  • Speech processing
  • Speech transformation

ASJC Scopus subject areas

  • Signal Processing
  • Electrical and Electronic Engineering
  • Experimental and Cognitive Psychology
  • Linguistics and Language

Cite this

Improving the intelligibility of dysarthric speech. / Kain, Alexander; Hosom, John Paul; Niu, Xiaochuan; Van Santen, Jan; Fried-Oken, Melanie; Staehely, Janice.

In: Speech Communication, Vol. 49, No. 9, 09.2007, p. 743-759.

Research output: Contribution to journalArticle

Kain, Alexander ; Hosom, John Paul ; Niu, Xiaochuan ; Van Santen, Jan ; Fried-Oken, Melanie ; Staehely, Janice. / Improving the intelligibility of dysarthric speech. In: Speech Communication. 2007 ; Vol. 49, No. 9. pp. 743-759.
@article{87aaa5a5b2ce484ea4528fc9d2ab375b,
title = "Improving the intelligibility of dysarthric speech",
abstract = "Dysarthria is a speech motor disorder usually resulting in a substantive decrease in speech intelligibility by the general population. In this study, we have significantly improved the intelligibility of dysarthric vowels of one speaker from 48{\%} to 54{\%}, as evaluated by a vowel identification task using 64 CVC stimuli judged by 24 listeners. Improvement was obtained by transforming the vowels of a speaker with dysarthria to more closely match the vowel space of a non-dysarthric (target) speaker. The optimal mapping feature set, from a list of 21 candidate feature sets, proved to be one utilizing vowel duration and F1-F3 stable points, which were calculated using shape-constrained isotonic regression. The choice of speaker-specific or speaker-independent vowel formant targets appeared to be insignificant. Comparisons with {"}oracle{"} conditions were performed in order to evaluate the analysis/re-synthesis system independently of the transformation function.",
keywords = "Dysarthria, Intelligibility, Speech modification, Speech processing, Speech transformation",
author = "Alexander Kain and Hosom, {John Paul} and Xiaochuan Niu and {Van Santen}, Jan and Melanie Fried-Oken and Janice Staehely",
year = "2007",
month = "9",
doi = "10.1016/j.specom.2007.05.001",
language = "English (US)",
volume = "49",
pages = "743--759",
journal = "Speech Communication",
issn = "0167-6393",
publisher = "Elsevier",
number = "9",

}

TY - JOUR

T1 - Improving the intelligibility of dysarthric speech

AU - Kain, Alexander

AU - Hosom, John Paul

AU - Niu, Xiaochuan

AU - Van Santen, Jan

AU - Fried-Oken, Melanie

AU - Staehely, Janice

PY - 2007/9

Y1 - 2007/9

N2 - Dysarthria is a speech motor disorder usually resulting in a substantive decrease in speech intelligibility by the general population. In this study, we have significantly improved the intelligibility of dysarthric vowels of one speaker from 48% to 54%, as evaluated by a vowel identification task using 64 CVC stimuli judged by 24 listeners. Improvement was obtained by transforming the vowels of a speaker with dysarthria to more closely match the vowel space of a non-dysarthric (target) speaker. The optimal mapping feature set, from a list of 21 candidate feature sets, proved to be one utilizing vowel duration and F1-F3 stable points, which were calculated using shape-constrained isotonic regression. The choice of speaker-specific or speaker-independent vowel formant targets appeared to be insignificant. Comparisons with "oracle" conditions were performed in order to evaluate the analysis/re-synthesis system independently of the transformation function.

AB - Dysarthria is a speech motor disorder usually resulting in a substantive decrease in speech intelligibility by the general population. In this study, we have significantly improved the intelligibility of dysarthric vowels of one speaker from 48% to 54%, as evaluated by a vowel identification task using 64 CVC stimuli judged by 24 listeners. Improvement was obtained by transforming the vowels of a speaker with dysarthria to more closely match the vowel space of a non-dysarthric (target) speaker. The optimal mapping feature set, from a list of 21 candidate feature sets, proved to be one utilizing vowel duration and F1-F3 stable points, which were calculated using shape-constrained isotonic regression. The choice of speaker-specific or speaker-independent vowel formant targets appeared to be insignificant. Comparisons with "oracle" conditions were performed in order to evaluate the analysis/re-synthesis system independently of the transformation function.

KW - Dysarthria

KW - Intelligibility

KW - Speech modification

KW - Speech processing

KW - Speech transformation

UR - http://www.scopus.com/inward/record.url?scp=34447635527&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=34447635527&partnerID=8YFLogxK

U2 - 10.1016/j.specom.2007.05.001

DO - 10.1016/j.specom.2007.05.001

M3 - Article

VL - 49

SP - 743

EP - 759

JO - Speech Communication

JF - Speech Communication

SN - 0167-6393

IS - 9

ER -