Skip to main content

Frequency-Domain Blind Source Separation

  • Chapter
Speech Enhancement

Part of the book series: Signals and Communication Technology ((SCT))

Abstract

This chapter discusses the frequency-domain approach to the blind source separation (BSS) of convolutively mixed acoustic signals. In this approach, independent component analysis (ICA) is employed in each frequency bin to calculate the frequency responses of separation filters. Since convolutive mixtures in the time domain can be approximated as multiple instantaneous mixtures in the frequency domain, the advantage of this approach is that ICA is applied just for instantaneous mixtures, which is very simple. However, the permutation ambiguity of ICA solutions then becomes a problem. This chapter mainly deals with a method for solving the permutation problem. The method utilizes the source location information that can be estimated from the ICA solutions. We also discuss other important topics for frequency-domain BSS, such as complex-valued ICA, scaling alignment and spectral smoothing. To show the effectiveness of this frequency-domain approach, we report experimental results for separating up to four sources with a 4-element linear array, and also six sources with an 8-element planar array.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. S. Haykin, Ed., Unsupervised Adaptive Filtering (Volume I: Blind Source Separation). John Wiley & Sons, 2000.

    Google Scholar 

  2. A. Cichocki and S. Amari, Adaptive Blind Signal and Image Processing. John Wiley & Sons, 2002.

    Google Scholar 

  3. A. Hyvärinen, J. Karhunen, and E. Oja, Independent Component Analysis. John Wiley & Sons, 2001.

    Google Scholar 

  4. T. W. Lee, Independent Component Analysis-Theory and Applications. Kluwer Academic Publishers, 1998.

    Google Scholar 

  5. S. Amari, S. Douglas, A. Cichocki, and H. Yang, “Multichannel blind deconvolution and equalization using the natural gradient,” in Proc. IEEE Workshop on Signal Processing Advances in Wireless Communications, 1997, pp. 101–104.

    Google Scholar 

  6. M. Kawamoto, K. Matsuoka, and N. Ohnishi, “A method of blind separation for convolved non-stationary signals,” Neurocomputing, vol. 22, pp. 157–171, 1998.

    Article  Google Scholar 

  7. K. Matsuoka and S. Nakashima, “Minimal distortion principle for blind source separation,” in Proc. ICA, 2001, pp. 722–727.

    Google Scholar 

  8. S. C. Douglas and X. Sun, “Convolutive blind separation of speech mixtures using the natural gradient,” Speech Communication, vol. 39, pp. 65–78, 2003.

    Article  Google Scholar 

  9. H. Buchner, R. Aichner, and W. Kellermann, “Blind source separation for convolutive mixtures: a unified treatment,” in Audio Signal Processing for Next-Generation Multimedia Communication Systems, Y. Huang and J. Benesty, Eds., Kluwer Academic Publishers, 2004, pp. 255–293.

    Google Scholar 

  10. T. Takatani, T. Nishikawa, H. Saruwatari, and K. Shikano, “High-fidelity blind separation of acoustic signals using SIMO-model-based independent component analysis,” IEICE Trans. Fundamentals, vol. E87-A, pp. 2063–2072, Aug. 2004.

    Google Scholar 

  11. P. Smaragdis, “Blind separation of convolved mixtures in the frequency domain,” Neurocomputing, vol. 22, pp. 21–34, 1998.

    Article  MATH  Google Scholar 

  12. L. Parra and C. Spence, “Convolutive blind separation of non-stationary sources,” IEEE Trans. Speech Audio Processing, vol. 8, pp. 320–327, May 2000.

    Article  Google Scholar 

  13. L. Schobben and W. Sommen, “A frequency domain blind signal separation method based on decorrelation,” IEEE Trans. Signal Processing, vol. 50, pp. 1855–1865, Aug. 2002.

    Article  Google Scholar 

  14. J. Anemüller and B. Kollmeier, “Amplitude modulation decorrelation for convolutive blind source separation,” in Proc. ICA, 2000, pp. 215–220.

    Google Scholar 

  15. N. Murata, S. Ikeda, and A. Ziehe, “An approach to blind source separation based on temporal structure of speech signals,” Neurocomputing, vol. 41, pp. 1–24, Oct. 2001.

    Article  Google Scholar 

  16. F. Asano, S. Ikeda, M. Ogawa, H. Asoh, and N. Kitawaki, “Combined approach of array processing and independent component analysis for blind separation of acoustic signals,” IEEE Trans. Speech Audio Processing, vol. 11, pp. 204–215, May 2003.

    Article  Google Scholar 

  17. S. Kurita, H. Saruwatari, S. Kajita, K. Takeda, and F. Itakura, “Evaluation of blind signal separation method using directivity pattern under reverberant conditions,” in Proc. IEEE ICASSP, 2000, pp. 3140–3143.

    Google Scholar 

  18. H. Saruwatari, S. Kurita, K. Takeda, F. Itakura, T. Nishikawa, and K. Shikano, “Blind source separation combining independent component analysis and beamforming,” EURASIP Journal on Applied Signal Processing, vol. 2003, no. 11, pp. 1135–1146, 2003.

    Article  Google Scholar 

  19. M. Z. Ikram and D. R. Morgan, “A beamforming approach to permutation alignment for multichannel frequency-domain blind speech separation,” in Proc. IEEE ICASSP, 2002, pp. 881–884.

    Google Scholar 

  20. H. Sawada, R. Mukai, S. Araki, and S. Makino, “Polar coordinate based nonlinear function for frequency domain blind source separation,” IEICE Trans. Fundamentals, vol. E86-A, pp. 590–596, Mar. 2003.

    Google Scholar 

  21. S. Araki, S. Makino, Y. Hinamoto, R. Mukai, T. Nishikawa, and H. Saruwatari, “Equivalence between frequency domain blind source separation and frequency domain adaptive beamforming for convolutive mixtures,” EURASIP Journal on Applied Signal Processing, vol. 2003, no. 11, pp. 1157–1166, 2003.

    Article  Google Scholar 

  22. H. Sawada, R. Mukai, S. Araki, and S. Makino, “A robust and precise method for solving the permutation problem of frequency-domain blind source separation,” IEEE Trans. Speech Audio Processing, vol. 12, pp. 530–538, Sept. 2004.

    Article  Google Scholar 

  23. R. Mukai, H. Sawada, S. Araki, and S. Makino, “Frequency domain blind source separation using small and large spacing sensor pairs,” in Proc. ISCAS, vol. V, 2004, pp. 1–4.

    Google Scholar 

  24. R. Mukai, H. Sawada, S. Araki, and S. Makino, “Frequency domain blind source separation for many speech signals,” in Proc. ICA (LNCS 3195), 2004, pp. 461–469.

    Google Scholar 

  25. S. Winter, H. Sawada, and S. Makino, “Geometrical understanding of the PCA subspace method for overdetermined blind source separation,” in Proc. IEEE ICASSP, 2003, pp. 769–772.

    Google Scholar 

  26. H. Sawada, R. Mukai, S. de la Kethulle, S. Araki, and S. Makino, “Spectral smoothing for frequency-domain blind source separation,” in Proc. IWAENC, 2003, pp. 311–314.

    Google Scholar 

  27. A. Hyvärinen, “Fast and robust fixed-point algorithm for independent component analysis,” IEEE Trans. Neural Networks, vol. 10, pp. 626–634, 1999.

    Article  Google Scholar 

  28. E. Bingham and A. Hyvärinen, “A fast fixed-point algorithm for independent component analysis of complex valued signals,” International Journal of Neural Systems, vol. 10, pp. 1–8, Feb. 2000.

    Google Scholar 

  29. M. Joho and P. Schniter, “Frequency domain realization of a multichannel blind deconvolution algorithm based on the natural gradient,” in Proc. ICA, 2003, pp. 543–548.

    Google Scholar 

  30. A. D. Back and A. C. Tsoi, “Blind deconvolution of signals using a complex recurrent network,” in Proc. Neural Networks for Signal Processing, 1994, pp. 565–574.

    Google Scholar 

  31. R. H. Lambert and A. J. Bell, “Blind separation of multiple speakers in a multipath environment,” in Proc. IEEE ICASSP, 1997, pp. 423–426.

    Google Scholar 

  32. T. W. Lee, A. J. Bell, and R. Orglmeister, “Blind source separation of real world signals,” in Proc. ICNN, 1997, pp. 2129–2135.

    Google Scholar 

  33. J. J. Shynk, “Frequency-domain and multirate adaptive filtering,” IEEE Signal Processing Magazine, vol. 9, pp. 14–37, Jan. 1992.

    Article  Google Scholar 

  34. H. Sawada, S. Winter, R. Mukai, S. Araki, and S. Makino, “Estimating the number of sources for frequency-domain blind source separation,” in Proc. ICA (LNCS 3195), 2004, pp. 610–617.

    Google Scholar 

  35. A. Bell and T. Sejnowski, “An information-maximization approach to blind separation and blind deconvolution,” Neural Computation, vol. 7, pp. 1129–1159, 1995.

    Google Scholar 

  36. S. Amari, “Natural gradient works efficiently in learning,” Neural Computation, vol. 10, pp. 251–276, 1998.

    Article  Google Scholar 

  37. J.-F. Cardoso, “Blind beamforming for non-Gaussian signals,” IEE Proceedings-F, pp. 362–370, Dec. 1993.

    Google Scholar 

  38. K. Matsuoka, M. Ohya, and M. Kawamoto, “A neural net for blind separation of nonstationary signals,” Neural Networks, vol. 8, pp. 411–419, 1995.

    Article  Google Scholar 

  39. R. O. Schmidt, “Multiple emitter location and signal parameter estimation,” IEEE Trans. Antennas and Propagation, vol. 34, pp. 276–280, Mar. 1986.

    Article  Google Scholar 

  40. H. Sawada, R. Mukai, and S. Makino, “Direction of arrival estimation for multiple source signals using independent component analysis,” in Proc. International Symposium on Signal Processing and its Applications, 2003, pp. 411–414.

    Google Scholar 

  41. B. D. Van Veen and K. M. Buckley, “Beamforming: a versatile approach to spatial filtering,” IEEE ASSP Magazine, pp. 2–24, Apr. 1988.

    Google Scholar 

  42. J.-F. Cardoso, “Source separation using higher order moments,” in Proc. IEEE ICASSP, vol. 4, 1989, pp. 2109–2112.

    Google Scholar 

  43. V. C. Soon, L. Tong, Y. F. Huang, and R. Liu, “A robust method for wideband signal separation,” in Proc. ISCAS, vol. 1, 1993, pp. 703–706.

    Google Scholar 

  44. R. O. Duda, P. E. Hart, and D. G. Stork, Pattern Classification, 2nd ed. Wiley Interscience, 2000.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Sawada, H., Mukai, R., Araki, S., Makino, S. (2005). Frequency-Domain Blind Source Separation. In: Speech Enhancement. Signals and Communication Technology. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-27489-8_13

Download citation

  • DOI: https://doi.org/10.1007/3-540-27489-8_13

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-24039-6

  • Online ISBN: 978-3-540-27489-6

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics