Underdetermined Blind Separation of Speech Signals with Delays in Different Time-Frequency Domains

Bastari, Alessandro; Squartini, Stefano; Piazza, Francesco

doi:10.1007/11520153_7

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3445)

Included in the following conference series:

International School on Neural Networks, Initiated by IIASS and EMFCSC

Abstract

This paper addresses the separation of speech signals from a set of observed mixtures when the mixing system is underdetermined and static with unknown delays. Approaches reported in the literature so far have shown that algorithms exploiting the sparsity of the original signals (a property that speech sources effectively satisfy) can be applied successfully to this problem, especially when implemented in the time-frequency domain. Here we survey the use of different time-frequency transforms within the existing three-step procedure for this separation problem. The novelty of the contribution lies in the transforms considered: the Wavelet, Complex Wavelet and Stockwell Transforms are used in place of the usual Short Time Fourier Transform (STFT). Their performance is analyzed and compared with that attainable through the STFT, evaluating how their sparseness and spectral disjointness properties influence the behavior of the algorithm.
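
As a rough illustration of the sparsity property the abstract relies on, the sketch below compares how many coefficients are needed to hold 95% of a signal's energy in the time domain and in the STFT domain. It is not the paper's procedure: the toy signal (two tone bursts standing in for speech), the 95% threshold, the window settings and the function names `stft`, `energy_fraction` and `make_test_signal` are all assumptions made for this example.

```python
import numpy as np

def stft(x, win_len=256, hop=128):
    """Plain Hann-windowed STFT; returns a (frames, bins) complex array."""
    window = np.hanning(win_len)
    n_frames = 1 + (len(x) - win_len) // hop
    frames = np.stack(
        [x[i * hop : i * hop + win_len] * window for i in range(n_frames)]
    )
    return np.fft.rfft(frames, axis=1)

def energy_fraction(coeffs, energy=0.95):
    """Fraction of coefficients needed to hold `energy` of the total energy.

    A smaller value means a sparser representation.
    """
    power = np.sort(np.abs(coeffs).ravel() ** 2)[::-1]  # largest first
    cumulative = np.cumsum(power) / power.sum()
    needed = np.searchsorted(cumulative, energy) + 1
    return needed / power.size

def make_test_signal(fs=8000, seconds=1.0):
    """Toy stand-in for speech (assumption): two tone bursts with a gap."""
    t = np.arange(int(fs * seconds)) / fs
    x = np.sin(2 * np.pi * 440 * t) * (t < 0.4)
    x = x + np.sin(2 * np.pi * 1200 * t) * (t > 0.6)
    return x

x = make_test_signal()
time_frac = energy_fraction(x)        # sparsity in the time domain
tf_frac = energy_fraction(stft(x))    # sparsity in the STFT domain
print(f"time domain: {time_frac:.3f}, STFT domain: {tf_frac:.3f}")
```

For this toy signal the STFT value comes out smaller, meaning fewer coefficients carry most of the energy. The same measure could in principle be applied to the coefficients of the other transforms, using an implementation of the reader's choice.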

Author information

Authors and Affiliations

Dipartimento di Elettronica, Intelligenza Artificiale e Telecomunicazioni, Università Politecnica delle Marche, Via Brecce Bianche 12, I-60121, Ancona, Italy
Alessandro Bastari, Stefano Squartini & Francesco Piazza

Editor information

Editors and Affiliations

CNRS LTCI/TSI Paris, 46 rue Barrault, 75634, Paris Cedex 13, France
Gérard Chollet
Department of Psychology, Second University of Naples, and IIASS, Via Pellegrino 19, 84019, Vietri sul Mare, SA, Italy
Anna Esposito
Escola Universitària Politècnica de Mataró, Universitat Politècnica de Catalunya, Barcelona, Spain
Marcos Faundez-Zanuy
Dipartimento di Fisica “E.R. Caianiello”, Università degli Studi di Salerno, Via S. Allende, 84081, Baronissi, SA, Italy
Maria Marinaro

About this paper

Cite this paper

Bastari, A., Squartini, S., Piazza, F. (2005). Underdetermined Blind Separation of Speech Signals with Delays in Different Time-Frequency Domains. In: Chollet, G., Esposito, A., Faundez-Zanuy, M., Marinaro, M. (eds) Nonlinear Speech Modeling and Applications. NN 2004. Lecture Notes in Computer Science, vol 3445. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11520153_7

DOI: https://doi.org/10.1007/11520153_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27441-4
Online ISBN: 978-3-540-31886-6
eBook Packages: Computer Science, Computer Science (R0)
