Least relative entropy for voiced/unvoiced speech classification

Darren K. Emge; Tulay Adali; M. Kemal Sonmez

Least relative entropy for voiced/unvoiced speech classification

Darren K. Emge, Tulay Adali, M. Kemal Sonmez

Research output: Contribution to conference › Paper › peer-review

Abstract

The aim of this work is to develop a flexible and efficient approach to the classification of the ratio of voiced to unvoiced excitation sources in continuous speech. To achieve this aim we adopt a probabilistic neural network approach. This is accomplished by designing a multi layer perceptron classifier trained by steepest descent minimization of the Least Relative Entropy (LRE) cost function. By using the LRE cost function we can directly output the ratio, as a probability, of excitation source, voiced to unvoiced, for a given speech segment. These output probabilities can then be used directly in other applications, such as low bit rate coders.

Original language	English (US)
Pages	2976-2980
Number of pages	5
State	Published - 1999
Externally published	Yes
Event	International Joint Conference on Neural Networks (IJCNN'99) - Washington, DC, USA Duration: Jul 10 1999 → Jul 16 1999

Other

Other	International Joint Conference on Neural Networks (IJCNN'99)
City	Washington, DC, USA
Period	7/10/99 → 7/16/99

ASJC Scopus subject areas

Software
Artificial Intelligence

Cite this

@conference{bf678e5c4cac4001b66b828a547ad93e,

title = "Least relative entropy for voiced/unvoiced speech classification",

abstract = "The aim of this work is to develop a flexible and efficient approach to the classification of the ratio of voiced to unvoiced excitation sources in continuous speech. To achieve this aim we adopt a probabilistic neural network approach. This is accomplished by designing a multi layer perceptron classifier trained by steepest descent minimization of the Least Relative Entropy (LRE) cost function. By using the LRE cost function we can directly output the ratio, as a probability, of excitation source, voiced to unvoiced, for a given speech segment. These output probabilities can then be used directly in other applications, such as low bit rate coders.",

author = "Emge, {Darren K.} and Tulay Adali and Sonmez, {M. Kemal}",

year = "1999",

language = "English (US)",

pages = "2976--2980",

note = "International Joint Conference on Neural Networks (IJCNN'99) ; Conference date: 10-07-1999 Through 16-07-1999",

}

TY - CONF

T1 - Least relative entropy for voiced/unvoiced speech classification

AU - Emge, Darren K.

AU - Adali, Tulay

AU - Sonmez, M. Kemal

PY - 1999

Y1 - 1999

N2 - The aim of this work is to develop a flexible and efficient approach to the classification of the ratio of voiced to unvoiced excitation sources in continuous speech. To achieve this aim we adopt a probabilistic neural network approach. This is accomplished by designing a multi layer perceptron classifier trained by steepest descent minimization of the Least Relative Entropy (LRE) cost function. By using the LRE cost function we can directly output the ratio, as a probability, of excitation source, voiced to unvoiced, for a given speech segment. These output probabilities can then be used directly in other applications, such as low bit rate coders.

AB - The aim of this work is to develop a flexible and efficient approach to the classification of the ratio of voiced to unvoiced excitation sources in continuous speech. To achieve this aim we adopt a probabilistic neural network approach. This is accomplished by designing a multi layer perceptron classifier trained by steepest descent minimization of the Least Relative Entropy (LRE) cost function. By using the LRE cost function we can directly output the ratio, as a probability, of excitation source, voiced to unvoiced, for a given speech segment. These output probabilities can then be used directly in other applications, such as low bit rate coders.

UR - http://www.scopus.com/inward/record.url?scp=0033351871&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0033351871&partnerID=8YFLogxK

M3 - Paper

AN - SCOPUS:0033351871

SP - 2976

EP - 2980

T2 - International Joint Conference on Neural Networks (IJCNN'99)

Y2 - 10 July 1999 through 16 July 1999

ER -

Least relative entropy for voiced/unvoiced speech classification

Abstract

Other

ASJC Scopus subject areas

Other files and links

Fingerprint

Cite this