TY - GEN
T1 - The Post-Stroke Speech Transcription (PSST) Challenge
AU - Gale, Robert C.
AU - Fleegle, Mikala
AU - Fergadiotis, Gerasimos
AU - Bedrick, Steven
N1 - Publisher Copyright:
© European Language Resources Association (ELRA)
PY - 2022
Y1 - 2022
N2 - We present the outcome of the Post-Stroke Speech Transcription (PSST) challenge. For the challenge, we prepared a new data resource of responses to two confrontation naming tests found in AphasiaBank, extracting audio and adding new phonemic transcripts for each response. The challenge consisted of two tasks. Task A asked challengers to build an automatic speech recognizer (ASR) for phonemic transcription of the PSST samples, evaluated in terms of phoneme error rate (PER) as well as a finer-grained metric derived from phonological feature theory, feature error rate (FER). The best model had a 9.9% FER / 20.0% PER, improving on our baseline by a relative 18% and 24%, respectively. Task B approximated a downstream assessment task, asking challengers to identify whether each recording contained a correctly pronounced target word. Challengers were unable to improve on the baseline algorithm; however, using this algorithm with the improved transcripts from Task A resulted in 92.8% accuracy / 0.921 F1, a relative improvement of 2.8% and 3.3%, respectively.
KW - anomia
KW - aphasia
KW - automatic speech recognition
KW - speech language pathology assessment
UR - http://www.scopus.com/inward/record.url?scp=85145873865&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85145873865&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85145873865
T3 - Proceedings - 4th RaPID Workshop: Resources and Processing of Linguistic, Para-Linguistic and Extra-Linguistic Data from People with Various Forms of Cognitive/Psychiatric/Developmental Impairments, as part of the 13th Edition of the Language Resources and Evaluation Conference, LREC 2022
SP - 41
EP - 55
BT - Proceedings - 4th RaPID Workshop
A2 - Kokkinakis, Dimitrios
A2 - Themistocleous, Charalambos K.
A2 - Fors, Kristina Lundholm
A2 - Tsanas, Athanasios
A2 - Fraser, Kathleen C.
PB - European Language Resources Association (ELRA)
T2 - 4th RaPID Workshop: Resources and Processing of Linguistic, Para-Linguistic and Extra-Linguistic Data from People with Various Forms of Cognitive/Psychiatric/Developmental Impairments, RAPID 2022
Y2 - 25 June 2022
ER -