The American English SALA-II data collection

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

1 Citation (Scopus)

Abstract

We discuss the collection of the American English SALA-II speech corpus. We focus on how we designed the prompt sheets to ensure maximum variability and on our strategy for recruiting the required 4000 speakers. We also present results on the effectiveness of the phonetically rich sentence. This paper should benefit others who are interested in using this corpus, or who are planning to collect a speech corpus with a large number of speakers.

Original language: English (US)
Title of host publication: Proceedings of the 4th International Conference on Language Resources and Evaluation, LREC 2004
Publisher: European Language Resources Association (ELRA)
Pages: 567-570
Number of pages: 4
ISBN (Electronic): 2951740816, 9782951740815
State: Published - Jan 1 2004
Event: 4th International Conference on Language Resources and Evaluation, LREC 2004 - Lisbon, Portugal
Duration: May 26 2004 to May 28 2004

Fingerprint

Data Collection
American English
Planning
Prompts

ASJC Scopus subject areas

  • Library and Information Sciences
  • Education
  • Language and Linguistics
  • Linguistics and Language

Cite this

Heeman, P. (2004). The American English SALA-II data collection. In Proceedings of the 4th International Conference on Language Resources and Evaluation, LREC 2004 (pp. 567-570). European Language Resources Association (ELRA).

@inproceedings{b9c695c1913144c9b22874b0334ed677,
title = "The American English SALA-II data collection",
abstract = "We discuss the collection of the American English SALA-II speech corpus. We focus on how we designed the prompt sheets to ensure maximum variability and on our strategy for recruiting the required 4000 speakers. We also present results on the effectiveness of the phonetically rich sentence. This paper should benefit others who are interested in using this corpus, or who are planning to collect a speech corpus with a large number of speakers.",
author = "Peter Heeman",
year = "2004",
month = "1",
day = "1",
language = "English (US)",
pages = "567--570",
booktitle = "Proceedings of the 4th International Conference on Language Resources and Evaluation, LREC 2004",
publisher = "European Language Resources Association (ELRA)",
}

TY - GEN

T1 - The American English SALA-II data collection

AU - Heeman, Peter

PY - 2004/1/1

Y1 - 2004/1/1

N2 - We discuss the collection of the American English SALA-II speech corpus. We focus on how we designed the prompt sheets to ensure maximum variability and on our strategy for recruiting the required 4000 speakers. We also present results on the effectiveness of the phonetically rich sentence. This paper should benefit others who are interested in using this corpus, or who are planning to collect a speech corpus with a large number of speakers.

UR - http://www.scopus.com/inward/record.url?scp=85037086553&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85037086553&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:85037086553

SP - 567

EP - 570

BT - Proceedings of the 4th International Conference on Language Resources and Evaluation, LREC 2004

PB - European Language Resources Association (ELRA)

ER -