Frequency-Domain Blind Source Separation

Sawada, Hiroshi; Mukai, Ryo; Araki, Shoko; Makino, Shoji

doi:10.1007/3-540-27489-8_13

Part of the book series: Signals and Communication Technology (SCT)

Abstract

This chapter discusses the frequency-domain approach to the blind source separation (BSS) of convolutively mixed acoustic signals. In this approach, independent component analysis (ICA) is employed in each frequency bin to calculate the frequency responses of separation filters. Since convolutive mixtures in the time domain can be approximated as multiple instantaneous mixtures in the frequency domain, the advantage of this approach is that ICA is applied just for instantaneous mixtures, which is very simple. However, the permutation ambiguity of ICA solutions then becomes a problem. This chapter mainly deals with a method for solving the permutation problem. The method utilizes the source location information that can be estimated from the ICA solutions. We also discuss other important topics for frequency-domain BSS, such as complex-valued ICA, scaling alignment and spectral smoothing. To show the effectiveness of this frequency-domain approach, we report experimental results for separating up to four sources with a 4-element linear array, and also six sources with an 8-element planar array.
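
The two central ideas of the abstract can be illustrated with a minimal NumPy sketch. It is not the chapter's implementation: the sources, mixing filters, sizes, and the chosen bin index are all hypothetical values made up for illustration, and the mixture is built with circular convolution so that the per-bin relation holds exactly (with real short-time Fourier frames it is only an approximation). The "separation" matrix here is simply the inverse of the known mixing matrix, standing in for what ICA would estimate blindly.

```python
import numpy as np

rng = np.random.default_rng(0)
n_src, n_mic, n_taps, n_fft = 2, 2, 8, 64  # hypothetical sizes

# Hypothetical sources and mixing filters (random, for illustration only).
s = rng.standard_normal((n_src, n_fft))
h = rng.standard_normal((n_mic, n_src, n_taps))

# Time domain: circular convolutive mixture x_j[n] = sum_k sum_t h_jk[t] s_k[n - t].
x = np.zeros((n_mic, n_fft))
for j in range(n_mic):
    for k in range(n_src):
        for t in range(n_taps):
            x[j] += h[j, k, t] * np.roll(s[k], t)

# Frequency domain: one instantaneous mixing matrix H[:, :, f] per bin f.
S = np.fft.fft(s, axis=1)            # shape (n_src, n_fft)
X = np.fft.fft(x, axis=1)            # shape (n_mic, n_fft)
H = np.fft.fft(h, n=n_fft, axis=2)   # shape (n_mic, n_src, n_fft)
X_from_bins = np.einsum("jkf,kf->jf", H, S)
bins_match = np.allclose(X, X_from_bins)

# Permutation ambiguity in a single bin: W and P @ W both separate the
# mixture, but they deliver the sources in a different order.
f = 5                                # arbitrary bin index
W = np.linalg.inv(H[:, :, f])        # stand-in for an ICA solution
P = np.array([[0, 1], [1, 0]])       # swaps the two outputs
Y = W @ X[:, f]
Y_perm = (P @ W) @ X[:, f]

print("per-bin instantaneous model holds:", bins_match)
print("outputs swapped by permutation:", np.allclose(Y_perm, Y[::-1]))
```

Because ICA in each bin cannot tell `W` from `P @ W`, the output order may differ from bin to bin; this is the permutation problem that the chapter addresses using source location information.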

Author information

Authors and Affiliations

NTT Communication Science Laboratories, Soraku-gun, Kyoto, 619-0237, Japan
Hiroshi Sawada, Ryo Mukai, Shoko Araki & Shoji Makino

About this chapter

Cite this chapter

Sawada, H., Mukai, R., Araki, S., Makino, S. (2005). Frequency-Domain Blind Source Separation. In: Speech Enhancement. Signals and Communication Technology. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-27489-8_13

DOI: https://doi.org/10.1007/3-540-27489-8_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24039-6
Online ISBN: 978-3-540-27489-6
eBook Packages: Engineering, Engineering (R0)
