TY - JOUR
T1 - Spoken language derived measures for detecting mild cognitive impairment
AU - Roark, Brian
AU - Mitchell, Margaret
AU - Hosom, John Paul
AU - Hollingshead, Kristy
AU - Kaye, Jeffrey
N1 - Funding Information:
Manuscript received August 06, 2010; revised November 05, 2010; accepted January 17, 2011. Date of publication February 07, 2011; date of current version July 20, 2011. This research was supported in part by the National Science Foundation (NSF) under Grants #IIS-0447214 and #BCS-0826654, National Institute of Health/National Institute of Aging (NIH/NIA) grants #P30AG08017 and #R01AG024059, and pilot grants from the Oregon Center for Aging and Technology (ORCATECH, NIH #1P30AG024978-01) and the Oregon Partnership for Alzheimer’s Research. Any opinions, findings, conclusions or recommendations expressed in this publication are those of the authors and do not necessarily reflect the views of the NSF or NIH. The associate editor coordinating the review of this manuscript and approving it for publication was Dr. Gokhan Tur.
PY - 2011
Y1 - 2011
N2 - Spoken responses produced by subjects during neuropsychological exams can provide diagnostic markers beyond exam performance. In particular, characteristics of the spoken language itself can discriminate between subject groups. We present results on the utility of such markers in discriminating between healthy elderly subjects and subjects with mild cognitive impairment (MCI). Given the audio and transcript of a spoken narrative recall task, a range of markers are automatically derived. These markers include speech features such as pause frequency and duration, and many linguistic complexity measures. We examine measures calculated from manually annotated time alignments (of the transcript with the audio) and syntactic parse trees, as well as the same measures calculated from automatic (forced) time alignments and automatic parses. We show statistically significant differences between clinical subject groups for a number of measures. These differences are largely preserved with automation. We then present classification results, and demonstrate a statistically significant improvement in the area under the ROC curve (AUC) when using automatic spoken language derived features in addition to the neuropsychological test scores. Our results indicate that using multiple, complementary measures can aid in automatic detection of MCI.
AB - Spoken responses produced by subjects during neuropsychological exams can provide diagnostic markers beyond exam performance. In particular, characteristics of the spoken language itself can discriminate between subject groups. We present results on the utility of such markers in discriminating between healthy elderly subjects and subjects with mild cognitive impairment (MCI). Given the audio and transcript of a spoken narrative recall task, a range of markers are automatically derived. These markers include speech features such as pause frequency and duration, and many linguistic complexity measures. We examine measures calculated from manually annotated time alignments (of the transcript with the audio) and syntactic parse trees, as well as the same measures calculated from automatic (forced) time alignments and automatic parses. We show statistically significant differences between clinical subject groups for a number of measures. These differences are largely preserved with automation. We then present classification results, and demonstrate a statistically significant improvement in the area under the ROC curve (AUC) when using automatic spoken language derived features in addition to the neuropsychological test scores. Our results indicate that using multiple, complementary measures can aid in automatic detection of MCI.
KW - Forced alignment
KW - linguistic complexity
KW - mild cognitive impairment (MCI)
KW - parsing
KW - spoken language understanding
UR - http://www.scopus.com/inward/record.url?scp=79960666270&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79960666270&partnerID=8YFLogxK
U2 - 10.1109/TASL.2011.2112351
DO - 10.1109/TASL.2011.2112351
M3 - Article
AN - SCOPUS:79960666270
SN - 1558-7916
VL - 19
SP - 2081
EP - 2090
JO - IEEE Transactions on Speech and Audio Processing
JF - IEEE Transactions on Speech and Audio Processing
IS - 7
M1 - 5710404
ER -