Perceptual experiments for diagnostic testing of text-to-speech systems

Jan P.H. van Santen

Research output: Contribution to journalArticlepeer-review

34 Scopus citations

Abstract

This paper describes perceptual methods for diagnosing problems in text-to-speech systems. Special attention is paid to two issues. First, coverageof the domain of a text-to-speech system. Since thisdomain involves an enormous range of contexts, it iscriticial for diagnostics, and also for overall evaluation, that test materials cover this range to the fullest extent possible. Automatic text generation algorithms that make extensive use of “greedy” algorithms are described that serve this purpose. Second, speech generated by text-to-speech systems tends to have a great variety of problems. A battery of experimental paradigms is discussed that address different facets of speech quality and intelligibility. Included are: (a) “word pointing” method for detection of problematic concatenative units, (b) “minimal pairs intelligibility test”-an expanded diagnostic rhyme test; (c) automatically scored orthographic name transcription task; (d) mean opinion score paradigm with problem categorization; and (e) paired comparison paradigm with strength-of-choice rating. The methods are applied in a series of experiments on high-end text-to-speech systems.

Original languageEnglish (US)
Pages (from-to)49-100
Number of pages52
JournalComputer Speech and Language
Volume7
Issue number1
DOIs
StatePublished - Jan 1993

ASJC Scopus subject areas

  • Software
  • Theoretical Computer Science
  • Human-Computer Interaction

Fingerprint Dive into the research topics of 'Perceptual experiments for diagnostic testing of text-to-speech systems'. Together they form a unique fingerprint.

Cite this