The 2nd ‘CHiME’ Speech Separation and Recognition Challenge: Approaches on single-channel speech separation and model-driven speech enhancement |
Signal Processing and Speech Communication (SPSC) Lab |
Demonstration of results obtained for 2nd CHiME Challenge |
Recovering a target speech signal from a multisource reverberant environment i.e. real-world conditions with acoustic clutter and fast varying noise sources is a challenging area of research. The 2nd CHiME challenge addressed the development of machine listening applications for operation in such an adverse noise scenarios. Here, we present the sound demonstration of speech separation and enhancement in highly non-stationary noisy environment, as presented by CHiME 2013 challenge. The data was provided by the 2nd CHiME challenge organization and the source separation and speech enhancement was done at Speech Communication and Signal Processing (SPSC) Lab, Technische Universität Graz (TU GRAZ). The related references are found at the bottom of this page.
References
[1] P. Mowlaee, J. A. Morales-Cordovilla, F. Pernkopf, H. Passentheiner, M. Hagmuller, and G. Kubin, "The 2nd ‘CHIME’ Speech Separation and Recognition Challenge: Approaches on Single-channel Speech Separation and Model-driven Speech Enhancement", in Proceeding of the 2nd CHiME Speech Separation and Recognition Challenge,", IEEE Int. Conf. Acoustics, Speech, Signal Processing, May. 2013, Vancouver, Canada, In Press .
[2] M. Stark, M. Wohlmayr, and F. Pernkopf, "Source-Filter based Single Channel Speech Separation using Pitch Information", IEEE Transactions on Audio, Speech and Language Processing, vol. 19, issue 2, pp. 242 - 255, Feb., 2011.
[3] P. Mowlaee, R. Saeidi, "Target Speaker Separation in a Multisource Environment Using Speaker-dependent Postfilter and Noise Estimation", IEEE Int. Conf. Acoustics, Speech, Signal Processing, May. 2013, Vancouver, Canada, In Press .
[4] P. Mowlaee, R. Saeidi, R. Martin, “Model-driven speech enhancement for multisource reverberant environment (Signal Separation Evaluation Campaign (SiSEC) 2011)”, Proceedings of the 10th International Conference on Latent Variable Analysis and Source Separation, LVA/ICA, 454-461, 2012. |
Development set
|
m6dB |
Observed speech |
m3dB |
0dB |
3dB
|
6dB |
9dB |
MD-SCSE |
SCSS |
s17_sbwt1a |
s10_sbwy4n |
s16_sbbs7p |
s22_sbbz6a |
s1_bris5a |
s3_pbwc7s |
spkr utterance |