Advertisement

Underdetermined Blind Source Separation Using Linear Separation System

  • Jan Cermak
  • Zdenek Smekal
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5398)

Abstract

In automatic speech and speech emotion recognition, a good quality of input speech signal is often required. The hit rate of recognizers is lowered by degradation of speech quality due to noise. Blind source separation can be used to enhance the speech signal as a part of preprocessing techniques. This paper presents a multi channel linear blind source separation method that can be applied even in underdetermined case i.e. when the number of source signals is higher than the number of sensors. Experiments have shown that our system outperforms conventional time-frequency binary masking in both determined and underdetermined cases and significantly increases the hit rate of speech recognizers.

Keywords

array signal processing beamforming blind source separation speech processing time-frequency binary masking 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Hyvarinen, A., Oja, E.: Independent Component Analysis: Algorithms and Applications. Neural Networks 13(4-5), 411–430 (2000)CrossRefGoogle Scholar
  2. 2.
    Johnson, D., Dungeon, D.: Array Signal Processing. Prentice Hall, Englewood Cliffs (1993)Google Scholar
  3. 3.
    Yilmaz, O., Rickard, S.: Blind Separation of Speech Mixtures via Time-Frequency Masking. IEEE Transactions on Signal Processing 52(7) (2004)Google Scholar
  4. 4.
    Cermak, J., Araki, S., Sawada, H., Makino, S.: Blind Source Separation Based on a Beamformer Array and Time-Frequency Binary Masking. In: ICASSP 2007, vol. 1, pp. 145–148 (2007) ISBN 1–4244–0728–1Google Scholar
  5. 5.
    Perceptual Evaluation of Speech Quality (PESQ). ITU-T Recommendation P.862, http://www.itu.int/rec/T-REC-p
  6. 6.
    Nouza, J., Zdansky, J., Cerva, P., Kolorenc, J.: A System for Information Retrieval from Large Records of Broadcast Programs. In: Sojka, P., Kopeček, I., Pala, K. (eds.) TSD 2006. LNCS, vol. 4188, pp. 485–492. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  7. 7.
    Methods for Subjective Determination of Transmission Quality. ITU-T Recommendation P.800, http://www.itu.int/rec/T-REC-p

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Jan Cermak
    • 1
  • Zdenek Smekal
    • 2
  1. 1.Institute of Photonics and ElectronicsAcademy of Sciences of the Czech RepublicPragueCzech Republic
  2. 2.Faculty of Electrical Engineering and CommunicationBrno University of TechnologyBrnoCzech Republic

Personalised recommendations