Skip to main content

Independent Vector Analysis for Convolutive Blind Speech Separation

  • Chapter
Blind Speech Separation

Part of the book series: Signals and Communication Technology ((SCT))

As a method to tackle blind source separation (BSS) in the frequency domain, we introduce independent vector analysis (IVA), an extension of independent component analysis (ICA) from univariate components to multivariate components. Given a mixture of statistically independent multivariate sources where the mixing is constrained to be component-wise, ICA needs to be followed by an additional algorithmic scheme in order to correct the permutation disorder that occurs after the component-wise separation, whereas IVA utilizes the inner dependency of the multivariate components and separates the fellow source components together. The efficiency of this new formulation in solving the permutation problem has been proven in its application to convolutive mixture of independent speech signals. Maximum likelihood (ML) approaches or information theoretic approaches have been employed where the time–frequency model of speech has been modelled by several multivariate joint densities, and natural gradient or Newton method algorithms have been derived. Here, we present a gentle tutorial on IVA for the separation of speech signals in the frequency domain.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. P. Smaragdis, “Blind separation of convolved mixtures in the frequency domain,” Neurocomputing, vol. 22, pp. 21-34, 1998.

    Article  MATH  Google Scholar 

  2. L. Parra and C. Spence, “Convolutive blind separation of non-stationary sources,” IEEE Trans. on Speech and Audio Processing, vol. 8, no. 3, pp. 320-327, 2000.

    Article  Google Scholar 

  3. J. Anemueller and B. Kollmeier, “Amplitude modulation decorrelation for convolutive blind source separation,” Proc. Int. Conf. on Independent Component Analysis and Blind Source Separation, pp. 215-220, 2000.

    Google Scholar 

  4. N. Murata, S. Ikeda, and A. Ziehe, “An approach to blind source separation based on temporal structure of speech signals,” Neurocomputing, vol. 41, pp. 1-24, 2001.

    Article  MATH  Google Scholar 

  5. M. Z. Ikram and D. R. Morgan, “A beamforming approach to permutation alignment for multichannel frequency-domain blind speech separation,” Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, pp. 881-884, 2002.

    Google Scholar 

  6. H. Sawada, R. Mukai , S. Araki, and S. Makino, “A robust and precise method for solving the permutation problem of frequency-domain blind source separation,” Proc. Int. Conf. on Independent Component Analysis and Blind Source Separation, pp. 505-510, 2003.

    Google Scholar 

  7. T. Kim, H. T. Attias, S.-Y. Lee, and T.-W. Lee, “Blind source separation exploiting higher-order frequency dependencies,” IEEE Trans. on Speech and Audio Processing, vol. 15, no. 1, pp. 70-79, 2007.

    Article  Google Scholar 

  8. I. Lee, T. Kim, and T.-W. Lee, “Complex FastIVA: a robust maximum like-lihood approach of MICA for convolutive BSS,” Lecture Notes in Computer Science, vol. 3889, pp. 625-632, 2006.

    Article  Google Scholar 

  9. A. Hiroe, “Solution of permutation problem in frequency domain ICA, using multivariate probability density functions,” Lecture Notes in Computer Science, vol. 3889, pp. 601-608, 2006.

    Article  Google Scholar 

  10. M. Davies, “Audio Source Separation,” Mathematics in Signal Processing 5, Oxford University Press, pp. 57-68, 2002.

    Google Scholar 

  11. I. Lee, T. Kim, and T.-W. Lee, “Fast Fixed-Point Independent Vector Analysis Algorithms for Convolutive Blind Source Separation”, Signal Processing, vol. 87, no. 8, pp. 1859-1871, 2007.

    Article  Google Scholar 

  12. A. J. Bell and T. J. Sejnowski, “An information maximization approach to blind separation and blind deconvolution,” Neural Computation, vol. 7, no. 6, pp. 1129-1159, 1995.

    Article  Google Scholar 

  13. J.-F. Cardoso, “Infomax and maximum likelihood for source separation,” IEEE Signal Processing Letters, vol. 4, no. 4, 1997.

    Google Scholar 

  14. E. Bingham and A. Hyvärinen, “A fast fixed-point algorithm for independent componenet analysis of complex-valued signals,” Int. J. of Neural Systems, vol. 10, no. 1, pp. 1-8, 2000.

    Google Scholar 

  15. J.-F. Cardoso, “Entropic contrasts for source separation: Geometry and stability,” S. Haykin, Ed. Unsupervised Adaptive Filtering, vol. 1, pp. 139-189, John Wiley and Sons, 2000.

    Google Scholar 

  16. J.-F. Cardoso, “Multidimensional independent component analysis,” Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, pp. 1941-1944, 1998.

    Google Scholar 

  17. A. Hyvärinen and P. O. Hoyer, “Emergence of phase and shift invariant features by decomposition of natural images into independent feature subspaces,” Neural Computation, vol. 12, no. 7, pp. 1705-1720, 2000.

    Article  Google Scholar 

  18. D. H. Brandwood, “A complex gradient operator and its application in adaptive array theory,” IEE Proc. F and H, vol. 130, no. 1, pp. 11-16, 1983.

    MathSciNet  Google Scholar 

  19. A. van den Bos, “Complex gradient and Hessian,” IEE Proceedings on Vision, Image and Signal Processing, vol. 141, pp. 380-382, 1994.

    Article  Google Scholar 

  20. H. Brehm and W. Stammler, “Description and generation of spherically invariant speech-model signals,” Signal Processing, no. 12, pp. 119-141, 1987.

    Article  Google Scholar 

  21. H. Buchner, R. Aichner, and W. Kellermann, “Blind source separation for convolutive mixtures: A unified treatment,” Audio Signal Processing for NextGeneration Multimedia Communication Systems, Y. Huang and J. Benesty Ed., Kluwer Academic Publishers, Boston, pp. 255-293, 2004.

    Google Scholar 

  22. I. Lee and T.-W. Lee, “On the Assumption of Spherical Symmetry and Sparseness for the Frequency-Domain Speech Model”, IEEE Trans. on Speech, Audio and Language Processing, vol. 15, no. 5, pp. 1521-1528, 2007.

    Article  Google Scholar 

  23. G.-J. Jang, I. Lee, and T.-W. Lee, “Independent Vector Analysis using NonSpherical Joint Densities for the Separation of Speech Signals”, Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, vol. 2, pp. 629-632, 2007.

    Google Scholar 

  24. S.-I. Amari, A. Cichocki, and H. H. Yang, “A new learning algorithm for blind signal separation,” Adv. Neural Information Processing Systems, vol. 8, pp. 757-763, 1996.

    Google Scholar 

  25. J.-F. Cardoso, “The invariant approach to source separation,” Proc. International Symposium on Nonlinear Theory and Applications (NOLTA), vol. 1, pp. 55-60, 1995.

    Google Scholar 

  26. A. Hyvärinen and E. Oja, “A Fast fixed-Point algorithm for independent component analysis,” Neural Computation, vol. 9, no. 7, pp. 1483-1492, 1997.

    Article  Google Scholar 

  27. A. Hyvärinen, “Fast and Robust fixed-point algorithms for independent componenet analysis, IEEE Trans. on Neural Networks, vol. 10, no. 3, pp. 626-634, 1999.

    Google Scholar 

  28. E. G. Learned-Miller and J. W. Fisher III, “ICA using spacings estimates of entropy,” J. of Machine Learning Research, vol. 4, pp. 1271-1295, 2003.

    Article  MathSciNet  Google Scholar 

  29. M. Z. Ikram and D. R. Morgan, “Exploring permutation inconsistency in blind separation of signals in a reverberant environment,” Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, pp. 1041-1044, 2000.

    Google Scholar 

  30. K. Matsuoka and S. Nakashima, “Minimal distortion principle for blind source separation,” Proc. Int. Conf. on Independent Component Analysis and Blind Source Separation, pp. 722-727, 2001.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer

About this chapter

Cite this chapter

Lee, I., Kim, T., Lee, TW. (2007). Independent Vector Analysis for Convolutive Blind Speech Separation. In: Makino, S., Sawada, H., Lee, TW. (eds) Blind Speech Separation. Signals and Communication Technology. Springer, Dordrecht. https://doi.org/10.1007/978-1-4020-6479-1_6

Download citation

  • DOI: https://doi.org/10.1007/978-1-4020-6479-1_6

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-1-4020-6478-4

  • Online ISBN: 978-1-4020-6479-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics