Independent Vector Analysis for Convolutive Blind Speech Separation

Lee, Intae; Kim, Taesu; Lee, Te-Won

doi:10.1007/978-1-4020-6479-1_6

Intae Lee³,
Taesu Kim³ &
Te-Won Lee³

Part of the book series: Signals and Communication Technology ((SCT))

2584 Accesses
21 Citations

As a method to tackle blind source separation (BSS) in the frequency domain, we introduce independent vector analysis (IVA), an extension of independent component analysis (ICA) from univariate components to multivariate components. Given a mixture of statistically independent multivariate sources where the mixing is constrained to be component-wise, ICA needs to be followed by an additional algorithmic scheme in order to correct the permutation disorder that occurs after the component-wise separation, whereas IVA utilizes the inner dependency of the multivariate components and separates the fellow source components together. The efficiency of this new formulation in solving the permutation problem has been proven in its application to convolutive mixture of independent speech signals. Maximum likelihood (ML) approaches or information theoretic approaches have been employed where the time–frequency model of speech has been modelled by several multivariate joint densities, and natural gradient or Newton method algorithms have been derived. Here, we present a gentle tutorial on IVA for the separation of speech signals in the frequency domain.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

P. Smaragdis, “Blind separation of convolved mixtures in the frequency domain,” Neurocomputing, vol. 22, pp. 21-34, 1998.
Article MATH Google Scholar
L. Parra and C. Spence, “Convolutive blind separation of non-stationary sources,” IEEE Trans. on Speech and Audio Processing, vol. 8, no. 3, pp. 320-327, 2000.
Article Google Scholar
J. Anemueller and B. Kollmeier, “Amplitude modulation decorrelation for convolutive blind source separation,” Proc. Int. Conf. on Independent Component Analysis and Blind Source Separation, pp. 215-220, 2000.
Google Scholar
N. Murata, S. Ikeda, and A. Ziehe, “An approach to blind source separation based on temporal structure of speech signals,” Neurocomputing, vol. 41, pp. 1-24, 2001.
Article MATH Google Scholar
M. Z. Ikram and D. R. Morgan, “A beamforming approach to permutation alignment for multichannel frequency-domain blind speech separation,” Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, pp. 881-884, 2002.
Google Scholar
H. Sawada, R. Mukai , S. Araki, and S. Makino, “A robust and precise method for solving the permutation problem of frequency-domain blind source separation,” Proc. Int. Conf. on Independent Component Analysis and Blind Source Separation, pp. 505-510, 2003.
Google Scholar
T. Kim, H. T. Attias, S.-Y. Lee, and T.-W. Lee, “Blind source separation exploiting higher-order frequency dependencies,” IEEE Trans. on Speech and Audio Processing, vol. 15, no. 1, pp. 70-79, 2007.
Article Google Scholar
I. Lee, T. Kim, and T.-W. Lee, “Complex FastIVA: a robust maximum like-lihood approach of MICA for convolutive BSS,” Lecture Notes in Computer Science, vol. 3889, pp. 625-632, 2006.
Article Google Scholar
A. Hiroe, “Solution of permutation problem in frequency domain ICA, using multivariate probability density functions,” Lecture Notes in Computer Science, vol. 3889, pp. 601-608, 2006.
Article Google Scholar
M. Davies, “Audio Source Separation,” Mathematics in Signal Processing 5, Oxford University Press, pp. 57-68, 2002.
Google Scholar
I. Lee, T. Kim, and T.-W. Lee, “Fast Fixed-Point Independent Vector Analysis Algorithms for Convolutive Blind Source Separation”, Signal Processing, vol. 87, no. 8, pp. 1859-1871, 2007.
Article Google Scholar
A. J. Bell and T. J. Sejnowski, “An information maximization approach to blind separation and blind deconvolution,” Neural Computation, vol. 7, no. 6, pp. 1129-1159, 1995.
Article Google Scholar
J.-F. Cardoso, “Infomax and maximum likelihood for source separation,” IEEE Signal Processing Letters, vol. 4, no. 4, 1997.
Google Scholar
E. Bingham and A. Hyvärinen, “A fast fixed-point algorithm for independent componenet analysis of complex-valued signals,” Int. J. of Neural Systems, vol. 10, no. 1, pp. 1-8, 2000.
Google Scholar
J.-F. Cardoso, “Entropic contrasts for source separation: Geometry and stability,” S. Haykin, Ed. Unsupervised Adaptive Filtering, vol. 1, pp. 139-189, John Wiley and Sons, 2000.
Google Scholar
J.-F. Cardoso, “Multidimensional independent component analysis,” Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, pp. 1941-1944, 1998.
Google Scholar
A. Hyvärinen and P. O. Hoyer, “Emergence of phase and shift invariant features by decomposition of natural images into independent feature subspaces,” Neural Computation, vol. 12, no. 7, pp. 1705-1720, 2000.
Article Google Scholar
D. H. Brandwood, “A complex gradient operator and its application in adaptive array theory,” IEE Proc. F and H, vol. 130, no. 1, pp. 11-16, 1983.
MathSciNet Google Scholar
A. van den Bos, “Complex gradient and Hessian,” IEE Proceedings on Vision, Image and Signal Processing, vol. 141, pp. 380-382, 1994.
Article Google Scholar
H. Brehm and W. Stammler, “Description and generation of spherically invariant speech-model signals,” Signal Processing, no. 12, pp. 119-141, 1987.
Article Google Scholar
H. Buchner, R. Aichner, and W. Kellermann, “Blind source separation for convolutive mixtures: A unified treatment,” Audio Signal Processing for NextGeneration Multimedia Communication Systems, Y. Huang and J. Benesty Ed., Kluwer Academic Publishers, Boston, pp. 255-293, 2004.
Google Scholar
I. Lee and T.-W. Lee, “On the Assumption of Spherical Symmetry and Sparseness for the Frequency-Domain Speech Model”, IEEE Trans. on Speech, Audio and Language Processing, vol. 15, no. 5, pp. 1521-1528, 2007.
Article Google Scholar
G.-J. Jang, I. Lee, and T.-W. Lee, “Independent Vector Analysis using NonSpherical Joint Densities for the Separation of Speech Signals”, Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, vol. 2, pp. 629-632, 2007.
Google Scholar
S.-I. Amari, A. Cichocki, and H. H. Yang, “A new learning algorithm for blind signal separation,” Adv. Neural Information Processing Systems, vol. 8, pp. 757-763, 1996.
Google Scholar
J.-F. Cardoso, “The invariant approach to source separation,” Proc. International Symposium on Nonlinear Theory and Applications (NOLTA), vol. 1, pp. 55-60, 1995.
Google Scholar
A. Hyvärinen and E. Oja, “A Fast fixed-Point algorithm for independent component analysis,” Neural Computation, vol. 9, no. 7, pp. 1483-1492, 1997.
Article Google Scholar
A. Hyvärinen, “Fast and Robust fixed-point algorithms for independent componenet analysis, IEEE Trans. on Neural Networks, vol. 10, no. 3, pp. 626-634, 1999.
Google Scholar
E. G. Learned-Miller and J. W. Fisher III, “ICA using spacings estimates of entropy,” J. of Machine Learning Research, vol. 4, pp. 1271-1295, 2003.
Article MathSciNet Google Scholar
M. Z. Ikram and D. R. Morgan, “Exploring permutation inconsistency in blind separation of signals in a reverberant environment,” Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, pp. 1041-1044, 2000.
Google Scholar
K. Matsuoka and S. Nakashima, “Minimal distortion principle for blind source separation,” Proc. Int. Conf. on Independent Component Analysis and Blind Source Separation, pp. 722-727, 2001.
Google Scholar

Download references

Author information

Authors and Affiliations

Institute for Neural Computation, University of California, San Diego, 9500 Gilman Drive, 0523, 92093-0523, La Jolla, CA, USA
Intae Lee, Taesu Kim & Te-Won Lee

Authors

Intae Lee
View author publications
You can also search for this author in PubMed Google Scholar
Taesu Kim
View author publications
You can also search for this author in PubMed Google Scholar
Te-Won Lee
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

NTT Corporation, 2-4 Hikaridai, 619-0237, Soraku-gun, Kyoto, Japan
Shoji Makino & Hiroshi Sawada &
University of California, San Diego, 9500 Gilman Drive, 0523, 92093-0523, La Jolla, CA, USA
Te-Won Lee

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Lee, I., Kim, T., Lee, TW. (2007). Independent Vector Analysis for Convolutive Blind Speech Separation. In: Makino, S., Sawada, H., Lee, TW. (eds) Blind Speech Separation. Signals and Communication Technology. Springer, Dordrecht. https://doi.org/10.1007/978-1-4020-6479-1_6

Download citation

DOI: https://doi.org/10.1007/978-1-4020-6479-1_6
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-6478-4
Online ISBN: 978-1-4020-6479-1
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics