Toward overcoming fundamental limitation in frequency-domain blind source separation for reverberant speech mixtures

Lae-Hoon Kim; M Hasegawa-Johnson

doi:10.1109/ACSSC.2010.5757618

Toward overcoming fundamental limitation in frequency-domain blind source separation for reverberant speech mixtures

Source

2010 Conference Record of the Forty Fourth Asilomar Conference on Signals, Systems and Computers > 542 - 545

Abstract

Blind source separation can be implemented in the frequency domain using one-tap multiplication operation in each frequency bin, but only when the frame length is long enough to disregard temporal aliasing effects. If we take a short-time frequency transformation with a window shorter than a room reverberation time, the justification above does not hold anymore. In this paper, we present an appropriate representation in the short-time frequency domain. The suitability is justified by showing the equivalence with the original time domain approach under the overlap-add context. Experimental validation using a corpus synthesized by convolution with measured sets of room impulse responses is also provided.