Cognitive Computation

, Volume 5, Issue 4, pp 493-503

Open Access This content is freely available online to anyone, anywhere at any time.

Improved Convolutive and Under-Determined Blind Audio Source Separation with MRF Smoothing

  • Rafał ZdunekAffiliated withInstitute of Telecommunications, Teleinformatics, and Acoustics, Wroclaw University of Technology Email author 


Convolutive and under-determined blind audio source separation from noisy recordings is a challenging problem. Several computational strategies have been proposed to address this problem. This study is concerned with several modifications to the expectation-minimization-based algorithm, which iteratively estimates the mixing and source parameters. This strategy assumes that any entry in each source spectrogram is modeled using superimposed Gaussian components, which are mutually and individually independent across frequency and time bins. In our approach, we resolve this issue by considering a locally smooth temporal and frequency structure in the power source spectrograms. Local smoothness is enforced by incorporating a Gibbs prior in the complete data likelihood function, which models the interactions between neighboring spectrogram bins using a Markov random field. Simulations using audio files derived from stereo audio source separation evaluation campaign 2008 demonstrate high efficiency with the proposed improvement.


Blind source separation Nonnegative matrix factorization Expectation-maximization Markov random field Simultaneous auto-regression