A VQ-based single-channel audio separation for music/speech mixtures

Meysam Asgari; Mahdi Fallah; Elahe Abouie Mehrizi; Ali Mostafavi

doi:10.1109/UKSIM.2009.123

A VQ-based single-channel audio separation for music/speech mixtures

Meysam Asgari, Mahdi Fallah, Elahe Abouie Mehrizi, Ali Mostafavi

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

4 Scopus citations

Abstract

In this paper, we address the problem of audio ource separation with one single sensor, based on estimation of statistical model of the sources. We improve the-state-of-the art Vector Quantization (VQ) by considering apriori histograms of huge training data. This will result in a more accurate codebook for each source in contrast to the commonly used Linde-Buzo-Gray (LBG) algorithm. An optimum estimator is introduced in separation stage based on Discrete Fourier Transform (DFT) amplitudes. Finally, conducting different simulations, it is demonstrated that proposed approach efficiently segregated audio mixtures in terms of Signal to Distortion Ratio (SDR) measures as well as Mean Opinion Score (MOS) criterion.

Original language	English (US)
Title of host publication	11th International Conference on Computer Modelling and Simulation, UKSim 2009
Pages	223-227
Number of pages	5
DOIs	https://doi.org/10.1109/UKSIM.2009.123
State	Published - 2009
Externally published	Yes
Event	11th International Conference on Computer Modelling and Simulation, UKSim 2009 - Cambridge, United Kingdom Duration: Mar 25 2009 → Mar 27 2009

Publication series

Name	11th International Conference on Computer Modelling and Simulation, UKSim 2009

Conference

Conference	11th International Conference on Computer Modelling and Simulation, UKSim 2009
Country/Territory	United Kingdom
City	Cambridge
Period	3/25/09 → 3/27/09

Keywords

Single cahnnel audio sepatation
Vector quantization

ASJC Scopus subject areas

Computational Theory and Mathematics
Computer Science Applications
Modeling and Simulation

Access to Document

10.1109/UKSIM.2009.123

Cite this

A VQ-based single-channel audio separation for music/speech mixtures. / Asgari, Meysam; Fallah, Mahdi; Mehrizi, Elahe Abouie et al.
11th International Conference on Computer Modelling and Simulation, UKSim 2009. 2009. p. 223-227 4809767 (11th International Conference on Computer Modelling and Simulation, UKSim 2009).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Asgari, M, Fallah, M, Mehrizi, EA & Mostafavi, A 2009, A VQ-based single-channel audio separation for music/speech mixtures. in 11th International Conference on Computer Modelling and Simulation, UKSim 2009., 4809767, 11th International Conference on Computer Modelling and Simulation, UKSim 2009, pp. 223-227, 11th International Conference on Computer Modelling and Simulation, UKSim 2009, Cambridge, United Kingdom, 3/25/09. https://doi.org/10.1109/UKSIM.2009.123

@inproceedings{bb09b1b836d54988b01083a3545866c3,

title = "A VQ-based single-channel audio separation for music/speech mixtures",

abstract = "In this paper, we address the problem of audio ource separation with one single sensor, based on estimation of statistical model of the sources. We improve the-state-of-the art Vector Quantization (VQ) by considering apriori histograms of huge training data. This will result in a more accurate codebook for each source in contrast to the commonly used Linde-Buzo-Gray (LBG) algorithm. An optimum estimator is introduced in separation stage based on Discrete Fourier Transform (DFT) amplitudes. Finally, conducting different simulations, it is demonstrated that proposed approach efficiently segregated audio mixtures in terms of Signal to Distortion Ratio (SDR) measures as well as Mean Opinion Score (MOS) criterion.",

keywords = "Single cahnnel audio sepatation, Vector quantization",

author = "Meysam Asgari and Mahdi Fallah and Mehrizi, {Elahe Abouie} and Ali Mostafavi",

year = "2009",

doi = "10.1109/UKSIM.2009.123",

language = "English (US)",

isbn = "9780769535937",

series = "11th International Conference on Computer Modelling and Simulation, UKSim 2009",

pages = "223--227",

booktitle = "11th International Conference on Computer Modelling and Simulation, UKSim 2009",

}

TY - GEN

T1 - A VQ-based single-channel audio separation for music/speech mixtures

AU - Asgari, Meysam

AU - Fallah, Mahdi

AU - Mehrizi, Elahe Abouie

AU - Mostafavi, Ali

PY - 2009

Y1 - 2009

N2 - In this paper, we address the problem of audio ource separation with one single sensor, based on estimation of statistical model of the sources. We improve the-state-of-the art Vector Quantization (VQ) by considering apriori histograms of huge training data. This will result in a more accurate codebook for each source in contrast to the commonly used Linde-Buzo-Gray (LBG) algorithm. An optimum estimator is introduced in separation stage based on Discrete Fourier Transform (DFT) amplitudes. Finally, conducting different simulations, it is demonstrated that proposed approach efficiently segregated audio mixtures in terms of Signal to Distortion Ratio (SDR) measures as well as Mean Opinion Score (MOS) criterion.

AB - In this paper, we address the problem of audio ource separation with one single sensor, based on estimation of statistical model of the sources. We improve the-state-of-the art Vector Quantization (VQ) by considering apriori histograms of huge training data. This will result in a more accurate codebook for each source in contrast to the commonly used Linde-Buzo-Gray (LBG) algorithm. An optimum estimator is introduced in separation stage based on Discrete Fourier Transform (DFT) amplitudes. Finally, conducting different simulations, it is demonstrated that proposed approach efficiently segregated audio mixtures in terms of Signal to Distortion Ratio (SDR) measures as well as Mean Opinion Score (MOS) criterion.

KW - Single cahnnel audio sepatation

KW - Vector quantization

UR - http://www.scopus.com/inward/record.url?scp=69649085693&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=69649085693&partnerID=8YFLogxK

U2 - 10.1109/UKSIM.2009.123

DO - 10.1109/UKSIM.2009.123

M3 - Conference contribution

AN - SCOPUS:69649085693

SN - 9780769535937

T3 - 11th International Conference on Computer Modelling and Simulation, UKSim 2009

SP - 223

EP - 227

BT - 11th International Conference on Computer Modelling and Simulation, UKSim 2009

T2 - 11th International Conference on Computer Modelling and Simulation, UKSim 2009

Y2 - 25 March 2009 through 27 March 2009

ER -

A VQ-based single-channel audio separation for music/speech mixtures

Abstract

Publication series

Conference

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this