The Post-Stroke Speech Transcription (PSST) Challenge

Robert C. Gale, Mikala Fleegle, Gerasimos Fergadiotis, Steven Bedrick

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

3 Scopus citations

Abstract

We present the outcome of the Post-Stroke Speech Transcription (PSST) challenge. For the challenge, we prepared a new data resource of responses to two confrontation naming tests found in AphasiaBank, extracting audio and adding new phonemic transcripts for each response. The challenge consisted of two tasks. Task A asked challengers to build an automatic speech recognizer (ASR) for phonemic transcription of the PSST samples, evaluated in terms of phoneme error rate (PER) as well as a finer-grained metric derived from phonological feature theory, feature error rate (FER). The best model had a 9.9% FER / 20.0% PER, improving on our baseline by a relative 18% and 24%, respectively. Task B approximated a downstream assessment task, asking challengers to identify whether each recording contained a correctly pronounced target word. Challengers were unable to improve on the baseline algorithm; however, using this algorithm with the improved transcripts from Task A resulted in 92.8% accuracy / 0.921 F1, a relative improvement of 2.8% and 3.3%, respectively.
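As an illustration of the Task A metric, phoneme error rate is conventionally computed as the edit (Levenshtein) distance between the reference and hypothesis phoneme sequences, divided by the number of reference phonemes. The following is a minimal Python sketch under that conventional definition, not the challenge's own scoring code. The function names, the ARPAbet-style symbols in the example, and the choice to pool errors across all utterances are assumptions made for illustration.

```python
from typing import List, Sequence


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance between two phoneme sequences (unit costs)."""
    # prev[j] holds the distance between the reference prefix processed so far
    # and the first j hypothesis phonemes.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference phonemes
        for j, h in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if r == h else 1)
            deletion = prev[j] + 1       # reference phoneme missing in hypothesis
            insertion = curr[j - 1] + 1  # extra phoneme in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1]


def phoneme_error_rate(refs: List[Sequence[str]], hyps: List[Sequence[str]]) -> float:
    """Pooled PER: total edit errors divided by total reference phonemes."""
    if len(refs) != len(hyps):
        raise ValueError("refs and hyps must have the same number of utterances")
    total_ref = sum(len(r) for r in refs)
    if total_ref == 0:
        raise ValueError("reference transcripts contain no phonemes")
    total_errors = sum(edit_distance(r, h) for r, h in zip(refs, hyps))
    return total_errors / total_ref


# Hypothetical example: one substitution and one deletion over six reference phonemes.
refs = [["K", "AE", "T"], ["D", "AO", "G"]]
hyps = [["K", "AH", "T"], ["D", "AO"]]
per = phoneme_error_rate(refs, hyps)
```

FER, as described in the abstract, is a finer-grained metric derived from phonological feature theory. Its exact definition is not given here, so it is not sketched.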

Original language: English (US)
Title of host publication: Proceedings - 4th RaPID Workshop
Subtitle of host publication: Resources and Processing of Linguistic, Para-Linguistic and Extra-Linguistic Data from People with Various Forms of Cognitive/Psychiatric/Developmental Impairments, as part of the 13th Edition of the Language Resources and Evaluation Conference, LREC 2022
Editors: Dimitrios Kokkinakis, Charalambos K. Themistocleous, Kristina Lundholm Fors, Athanasios Tsanas, Kathleen C. Fraser
Publisher: European Language Resources Association (ELRA)
Pages: 41-55
Number of pages: 15
ISBN (Electronic): 9791095546771
State: Published - 2022
Event: 4th RaPID Workshop: Resources and Processing of Linguistic, Para-Linguistic and Extra-Linguistic Data from People with Various Forms of Cognitive/Psychiatric/Developmental Impairments, RAPID 2022 - Marseille, France
Duration: Jun 25 2022 → …

Publication series

Name: Proceedings - 4th RaPID Workshop: Resources and Processing of Linguistic, Para-Linguistic and Extra-Linguistic Data from People with Various Forms of Cognitive/Psychiatric/Developmental Impairments, as part of the 13th Edition of the Language Resources and Evaluation Conference, LREC 2022

Conference

Conference: 4th RaPID Workshop: Resources and Processing of Linguistic, Para-Linguistic and Extra-Linguistic Data from People with Various Forms of Cognitive/Psychiatric/Developmental Impairments, RAPID 2022
Country/Territory: France
City: Marseille
Period: 6/25/22 → …

Keywords

  • anomia
  • aphasia
  • automatic speech recognition
  • speech language pathology assessment

ASJC Scopus subject areas

  • Language and Linguistics
  • Education
  • Library and Information Sciences
  • Linguistics and Language
