Skip to main content

Underdetermined Blind Separation of Speech Signals with Delays in Different Time-Frequency Domains

  • Conference paper
Nonlinear Speech Modeling and Applications (NN 2004)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3445))

Included in the following conference series:

Abstract

This paper is devoted to the problem of speech signal separation from a set of observables, when the mixing system is underdetermined and static with unknown delays. The approaches appeared in the literature so far have shown that algorithms based on the property of sparsity of the original signals (effectively satisfied by speech sources) can be successfully applied to such a problem, specially if implemented in the time-frequency domain. Here, a survey on the usage of different time-frequency transforms within the already available three-step procedure for the addressed separation problem is carried out. The novelty of the contribution can be seen from this perspective: Wavelet, Complex Wavelet and Stockwell Transforms are the new transforms used in our problem, in substitution of the usual Short Time Fourier Transform (STFT). Their performances are analyzed and compared to those attainable through the STFT, evaluating how much different is the influence that their sparseness and spectral disjointness properties on the algorithm behavior.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Jutten, C., Hérault, J., Comon, P., Sorouchiary, E.: Blind Separation of Sources, Parts I, II and III. Signal Processing 24(1), 1–29 (1991)

    Article  MATH  Google Scholar 

  2. Haykin, S.: Unsupervised Adaptive Filtering, vol. 1: Blind Source Separation. Wiley Series on Adaptive and Learning Systems for Signal Processing, Communications and Control, Simon Haykin Series Editor (2000)

    Google Scholar 

  3. Haykin, S.: Unsupervised Adaptive Filtering, vol. 2: Blind Deconvolution. Wiley Series on Adaptive and Learning Systems for Signal Processing, Communications and Control, Simon Haykin Series Editor (2000)

    Google Scholar 

  4. Cichocki, A., Amari, S.: Adaptive Blind Signal and Image Processing. In: Wiley (ed.) Learning Algorithms and Application (2002)

    Google Scholar 

  5. Lee, T.W., Lewicki, M.S., Girolami, M., Bell, A.J., Sejnowski, T.J.: Blind Source Separation of More Sources Using Overcomplete Representations. IEEE Signal Processing Letters 6(4), 87–90 (1999)

    Article  Google Scholar 

  6. Hyvärinen, A., Karhunen, J., Oja, E.: Independent Component Analysis. Wiley & Sons, New York (2001)

    Book  Google Scholar 

  7. Lee, T.W., Girolami, M., Bell, A.J., Sejnowski, T.J.: A Unifying Information Theoretic Framework for Independent Component Analysis. Computers & Mathematics with Applications 31(11), 1–21 (2000)

    Article  MathSciNet  Google Scholar 

  8. Theis, F.J., Lang, E.W.: Geometric Overcomplete ICA. In: Proc. of ESANN 2002, pp. 217–223 (2002)

    Google Scholar 

  9. Puntonet, G.C., Prieto, A., Jutten, C., Rodrìguez-Alvarez, M., Ortega, J.: Separation of Sources: a Geometry Based Procedure for Recostruction of n-Valued Signal. Elsevier Signal Processing 46(3), 267–284 (1995)

    MATH  Google Scholar 

  10. Bofill, P., Zibulevsky, M.: Blind Separation of More Sources than Mixtures Using the Sparsity of the Short-Time Fourier Transform. In: International Workshop on Independent Component Analysis and Blind Signal Separation, (Helsinki, Finland), pp. 87–92 (June 2000)

    Google Scholar 

  11. Bell, A.J., Sejnovsky, T.J.: An Information-Maximization Approach to Blind Separation and Blind Deconvolution. Neural Comput. 7, 1129–1159 (1995)

    Article  Google Scholar 

  12. Cardoso, J.F.: Informax and Maximum Likelihood for Blind Source Separation. IEEE Sign. Process: Letters 4, 109–111 (1997)

    Article  Google Scholar 

  13. Amari, S.: Natural Gradient Learning for Over- and Under-Complete Bases in ICA. Neural Computation 11(8), 1875–1883 (1999)

    Article  MathSciNet  Google Scholar 

  14. Theis, F.J., Lang, E.W., Lautenschlager, M.A., Puntonet, C.G.: A Theoretical Framework for Overcomplete Geometric BMMR. In: Proc. of SIP 2002, pp. 201–206 (2002)

    Google Scholar 

  15. Zibulevsky, M., Kisilev, P., Zeevi, Y.Y., Pearlmutter, B.A.: Blind Source Separation via Multinode Sparse Representation. In: Leen, T.K., Dietterich, T.G., Tresp, V. (eds.) Advances in Neural Information Processing Systems, vol. 13, MIT Press, Cambridge (2001)

    Google Scholar 

  16. Theis, F.J., Lang, E.W.: Formalization of the Two-Step Approach to Overcomplete BSS. SIP (2002)

    Google Scholar 

  17. Bofill, P.: Underdetermined Blind Separation of Delayed Sound Sources in the Frequency Domain. Neurocomputing, Special Issue ICA and BSS (March 2, 2001)

    Google Scholar 

  18. Lobo, M.S., Vandenberghe, L., Boyd, S., Lebret, H.: Applications of Second Order Cone Programming. Linear Algebra and Its Applications 284, 193–228 (1998)

    Article  MATH  MathSciNet  Google Scholar 

  19. Yilmaz, O., Rickard, S.: Blind Separation of Speech Mixtures via Time-Frequency Masking. IEEE Transaction on Signal Processing 52(7) (July 2004)

    Google Scholar 

  20. Daubechies, I.: Ten Lectures on Wavelets. Society for Industrial and Applied Mathematics, Philadelphia (1992)

    Google Scholar 

  21. Vetterli, M., Kovačević, J.: Wavelets and Subband Coding. Prentice Hall, Englewood Cliffs (1995)

    MATH  Google Scholar 

  22. Mallat, S.G.: A Wavelet Tour of Signal Processing. Academic Press, London (1998)

    MATH  Google Scholar 

  23. Mallat, S.G.: A Theory for Multiresolution Signal Decomposition: The Wavelet Representation. IEEE Transactions on Pattern Analysis and Machine Intelligence 11(7) (1989)

    Google Scholar 

  24. Coifman, R.R., Wickerhauser, M.V.: Entropy-Based Algorithms for Best Basis Selection. IEEE Trans. Inform. Theory 38(2)

    Google Scholar 

  25. Stockwell, R.G., Mansinha, L., Lowe, R.P.: Localization of the Complex Spectrum: The S Transform. IEEE Trans. Signal Process 44, 998–1001 (1996)

    Article  Google Scholar 

  26. Daubechies, I.: The Wavelet Transform, Time Frequency Localization and Signal Analysis. IEEE Transactions on Information Theory 36(5) (September 1990)

    Google Scholar 

  27. Kingsbury, N.G.: The Dual Tree Complex Wavelet Transform: a New Technique for Shift Invariance and Directional Filters. In: Proc. IEEE DSP Workshop Bryce Canyon (August 1998)

    Google Scholar 

  28. Sawada, H., Mukai, R., Araki, S., Makino, S.: Convolutive Blind Source Separation for More Than Two Sources in the Frequency Domain. In: ICASSP 2004

    Google Scholar 

  29. Winter, S., Sawada, H., Araki, S., Makino, S.: Hierarchical Clustering Applied to Overcomplete BSS for Convolutive Mixtures. In: Workshop on Statistical and Perceptual Audio Processing SAPA 2004, Jeju Korea, October 3 (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bastari, A., Squartini, S., Piazza, F. (2005). Underdetermined Blind Separation of Speech Signals with Delays in Different Time-Frequency Domains. In: Chollet, G., Esposito, A., Faundez-Zanuy, M., Marinaro, M. (eds) Nonlinear Speech Modeling and Applications. NN 2004. Lecture Notes in Computer Science(), vol 3445. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11520153_7

Download citation

  • DOI: https://doi.org/10.1007/11520153_7

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-27441-4

  • Online ISBN: 978-3-540-31886-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics