Skip to main content

A New Improved Method to Permutation Ambiguity in BSS with Strong Reverberation

  • Conference paper
Advanced Research on Electronic Commerce, Web Application, and Communication (ECWAC 2011)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 143))

  • 1979 Accesses

Abstract

The major problem of blind source separation in frequency domain is the permutation ambiguity between different frequency bins, which is the key factor to recover the original sources correctly. A new idea is to consider the frequency components from the same source as a multivariate vector with a certain probability density function, and the vectors from different sources are independent each other. An algorithm based on this idea is proposed to solve the permutation ambiguity problem of BSS in frequency domain, and some approximate cost functions are compared with the existing algorithm in frequency domain. The computer simulations to two true speeches with strong reverberation are shown to verify the efficiency of the proposed algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Hyvarinen, A., et al.: Independent component analysis. John Wiley and Sons, Chichester (2001)

    Book  MATH  Google Scholar 

  2. Benesty, J., Sonhi, M.M., Huang, Y., et al.: Handbook of Speech Processing. Springer, Heidelberg (2008)

    Book  Google Scholar 

  3. Saruwatari, H., Kurita, S., Takeda, K.: Blind source separation combining frequency-domain ICA and beamforming. ICASSP 5(7), 2733–2736 (2001)

    Google Scholar 

  4. Murata, N., Ikeda, S., Ziehe, A.: An approach to blind source separation based on temporal structure of speech signals. Neurocomputing 41(1-4), 1–24 (2001)

    Article  MATH  Google Scholar 

  5. Zhu, J., Wang, H., Li, H.: Joint algorithm for permutation problem in frequency-domain in blind speech source separation. Computer Applications 28(6), 1552–1554 (2008) (in chinese)

    Article  MATH  Google Scholar 

  6. Lee, I., Kim, T., Lee, T.-w.: Fast fixed-point independent vector analysis algorithms for convolutive blind source separation. Signal Processing 87, 1859–1871 (2007)

    Article  MATH  Google Scholar 

  7. Matsuoka, K., Nakashima, S.: Minimal distortion principle for blind source separation. In: Proceeding of the 41st SICE Annual (SICE 2002), Washington, vol. 4, pp. 2138–2143 (2002)

    Google Scholar 

  8. Davies, M.: Audio Source Separation. Mathematics in Signal Processing, vol. 5, pp. 57–68. Oxford University Press, Oxford (2002)

    MATH  Google Scholar 

  9. Lee, I., Lee, T.-W.: On the Assumption of Spherical Symmetry and Sparseness for the Frequency-Domain Speech Model. IEEE Trans. on Speech, Audio and Language Processing 15(5), 1521–1528 (2007)

    Article  Google Scholar 

  10. Lehmann, E., Johansson, A.: Diffuse Reverberation Model for Efficient Image-Source Simulation of Room Impulse Responses. IEEE Trans. on Audio, Speech and Language Processing 18(6), 1429–1439 (2010)

    Article  Google Scholar 

  11. Hiroe, A.: Solution of permutation problem in frequency domain ICA, using multivariate probability density functions. In: Rosca, J.P., Erdogmus, D., Príncipe, J.C., Haykin, S. (eds.) ICA 2006. LNCS, vol. 3889, pp. 601–608. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  12. Araki, S., Mukai, R., Makino, S., Nishikawa, T., et al.: The fundamental limitation of frequency domain blind source separation for convolutive mixtures of speech. IEEE Trans. Speech and Audio Processing 11, 109–116 (2003)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Li, H., Fan, G. (2011). A New Improved Method to Permutation Ambiguity in BSS with Strong Reverberation. In: Shen, G., Huang, X. (eds) Advanced Research on Electronic Commerce, Web Application, and Communication. ECWAC 2011. Communications in Computer and Information Science, vol 143. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20367-1_34

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-20367-1_34

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-20366-4

  • Online ISBN: 978-3-642-20367-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics