Intelligibility of speech with filtered time trajectories of spectral envelopes

Takayuki Arai, Misha Pavel, Hynek Hermansky, Carlos Avendano

Research output: Chapter in Book/Report/Conference proceedingConference contribution

62 Citations (Scopus)

Abstract

The effect of filtering the time trajectories of spectral envelopes on speech intelligibility was investigated. Since LPC cepstrum forms the basis of many automatic speech recognition systems, we filtered time trajectories of LPC cepstrum of speech sounds, and the modified speech was reconstructed after the filtering. For processing, we applied low-pass, high-pass and band-pass filters. The results of the accuracy from the perceptual experiments for Japanese syllables show that speech intelligibility is not severely impaired as long as the filtered spectral components have 1) a rate of change faster than 1 Hz when high-pass filtered, 2) a rate of change slower than 24 Hz when low-pass filtered, and 3) a rate of change between 1 and 16 Hz when band-pass filtered.

Original languageEnglish (US)
Title of host publicationInternational Conference on Spoken Language Processing, ICSLP, Proceedings
Editors Anon
PublisherIEEE
Pages2490-2493
Number of pages4
Volume4
StatePublished - 1996
EventProceedings of the 1996 International Conference on Spoken Language Processing, ICSLP. Part 1 (of 4) - Philadelphia, PA, USA
Duration: Oct 3 1996Oct 6 1996

Other

OtherProceedings of the 1996 International Conference on Spoken Language Processing, ICSLP. Part 1 (of 4)
CityPhiladelphia, PA, USA
Period10/3/9610/6/96

Fingerprint

Speech intelligibility
Trajectories
Bandpass filters
Speech recognition
Acoustic waves
Processing
Experiments

ASJC Scopus subject areas

  • Computer Science(all)

Cite this

Arai, T., Pavel, M., Hermansky, H., & Avendano, C. (1996). Intelligibility of speech with filtered time trajectories of spectral envelopes. In Anon (Ed.), International Conference on Spoken Language Processing, ICSLP, Proceedings (Vol. 4, pp. 2490-2493). IEEE.

Intelligibility of speech with filtered time trajectories of spectral envelopes. / Arai, Takayuki; Pavel, Misha; Hermansky, Hynek; Avendano, Carlos.

International Conference on Spoken Language Processing, ICSLP, Proceedings. ed. / Anon. Vol. 4 IEEE, 1996. p. 2490-2493.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Arai, T, Pavel, M, Hermansky, H & Avendano, C 1996, Intelligibility of speech with filtered time trajectories of spectral envelopes. in Anon (ed.), International Conference on Spoken Language Processing, ICSLP, Proceedings. vol. 4, IEEE, pp. 2490-2493, Proceedings of the 1996 International Conference on Spoken Language Processing, ICSLP. Part 1 (of 4), Philadelphia, PA, USA, 10/3/96.
Arai T, Pavel M, Hermansky H, Avendano C. Intelligibility of speech with filtered time trajectories of spectral envelopes. In Anon, editor, International Conference on Spoken Language Processing, ICSLP, Proceedings. Vol. 4. IEEE. 1996. p. 2490-2493
Arai, Takayuki ; Pavel, Misha ; Hermansky, Hynek ; Avendano, Carlos. / Intelligibility of speech with filtered time trajectories of spectral envelopes. International Conference on Spoken Language Processing, ICSLP, Proceedings. editor / Anon. Vol. 4 IEEE, 1996. pp. 2490-2493
@inproceedings{ac613e6939854014a321a00568f19f38,
title = "Intelligibility of speech with filtered time trajectories of spectral envelopes",
abstract = "The effect of filtering the time trajectories of spectral envelopes on speech intelligibility was investigated. Since LPC cepstrum forms the basis of many automatic speech recognition systems, we filtered time trajectories of LPC cepstrum of speech sounds, and the modified speech was reconstructed after the filtering. For processing, we applied low-pass, high-pass and band-pass filters. The results of the accuracy from the perceptual experiments for Japanese syllables show that speech intelligibility is not severely impaired as long as the filtered spectral components have 1) a rate of change faster than 1 Hz when high-pass filtered, 2) a rate of change slower than 24 Hz when low-pass filtered, and 3) a rate of change between 1 and 16 Hz when band-pass filtered.",
author = "Takayuki Arai and Misha Pavel and Hynek Hermansky and Carlos Avendano",
year = "1996",
language = "English (US)",
volume = "4",
pages = "2490--2493",
editor = "Anon",
booktitle = "International Conference on Spoken Language Processing, ICSLP, Proceedings",
publisher = "IEEE",

}

TY - GEN

T1 - Intelligibility of speech with filtered time trajectories of spectral envelopes

AU - Arai, Takayuki

AU - Pavel, Misha

AU - Hermansky, Hynek

AU - Avendano, Carlos

PY - 1996

Y1 - 1996

N2 - The effect of filtering the time trajectories of spectral envelopes on speech intelligibility was investigated. Since LPC cepstrum forms the basis of many automatic speech recognition systems, we filtered time trajectories of LPC cepstrum of speech sounds, and the modified speech was reconstructed after the filtering. For processing, we applied low-pass, high-pass and band-pass filters. The results of the accuracy from the perceptual experiments for Japanese syllables show that speech intelligibility is not severely impaired as long as the filtered spectral components have 1) a rate of change faster than 1 Hz when high-pass filtered, 2) a rate of change slower than 24 Hz when low-pass filtered, and 3) a rate of change between 1 and 16 Hz when band-pass filtered.

AB - The effect of filtering the time trajectories of spectral envelopes on speech intelligibility was investigated. Since LPC cepstrum forms the basis of many automatic speech recognition systems, we filtered time trajectories of LPC cepstrum of speech sounds, and the modified speech was reconstructed after the filtering. For processing, we applied low-pass, high-pass and band-pass filters. The results of the accuracy from the perceptual experiments for Japanese syllables show that speech intelligibility is not severely impaired as long as the filtered spectral components have 1) a rate of change faster than 1 Hz when high-pass filtered, 2) a rate of change slower than 24 Hz when low-pass filtered, and 3) a rate of change between 1 and 16 Hz when band-pass filtered.

UR - http://www.scopus.com/inward/record.url?scp=0030369532&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0030369532&partnerID=8YFLogxK

M3 - Conference contribution

VL - 4

SP - 2490

EP - 2493

BT - International Conference on Spoken Language Processing, ICSLP, Proceedings

A2 - Anon, null

PB - IEEE

ER -