Sound Localization Based on Excitation Source Information for Intelligent Home Service Robots

  • Keun-Chang Kwak
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5099)

Abstract

This paper is concerned with Sound Localization (SL) using Excitation Source Information (ESI) and effective angle estimation for intelligent home service robots that are equipped with multi-channel sound board and three low-cost condenser microphones. The main goal is to localize a caller by estimating time-delay with features obtained from the excitation source based on Linear Prediction (LP) residual and Hilbert envelop, when the speaker calls robot’s name in all directions. For performance analysis, we collected SL-DB (sound localization database) with the variation of distance and angle under test-bed environments like home. Here the localization success rate (LSR) and average localization error (ALE) from field of view (FOV) range of robot camera are used as localization performance criterion. The experimental results reveal that the presented method shows a good performance in comparison with the well-known Time Delay of Arrival (TDOA) and Generalized Cross Correlation- Phase Transform (GCC-PHAT) method.

Keywords

Sound localization excitation source information intelligent home service robots effective angle estimation low-cost microphones 

References

  1. 1.
    Raykar, V.C., Yegnanarayana, B., Prasanna, S.R.M., Duraiswami, R.: Speaker localization using excitation source information in speech. IEEE Trans. on Speech and Audio Processing 13(5), 751–761 (2005)CrossRefGoogle Scholar
  2. 2.
    Murty, K.S.R., Yegnanarayana, B.: Combining evidence from residual phase and MFCC features for speaker recognition. IEEE Signal Processing Letters 13(1), 52–55 (2006)CrossRefGoogle Scholar
  3. 3.
    Rao, K.S., Prasanna, S.R.M., Yegnanarayana, B.: Determination of instants of significant excitation in speech using Hilbert envelope and group delay function 14(10), 762-765 (2007)Google Scholar
  4. 4.
    Knapp, C.H., Carter, G.C.: The generalized correlation method for estimation of time delay. IEEE Trans. on Acoustic, Speech, and Signal Processing ASSP-24, 320–327 (1976)CrossRefGoogle Scholar
  5. 5.
    Huang, J., Supaongprapa, T., Terakura, I., Wang, F., Ohnishi, N., Sugie, N.: A model based sound localization system and its application to robot navigation. Robotics and Autonomous Systems, 199-209 (1999)Google Scholar
  6. 6.
    Kwak, K.C., Kim, H.J., Bae, K.S., Yoon, H.S.: Speaker identification and verification for intelligent service robots. In: Int. Conference on Artificial Intelligence (ICAI 2007), Las vegas, pp. 515–519 (2007)Google Scholar
  7. 7.
    Park, B.C., Ban, K.D., Kwak, K.C., Yoon, H.S.: Sound source localization based on audio-visual information for intelligent service robots. In: The 8th Int. Symposium on Advanced Intelligent Systems (ISIS, Sokcho 2007), pp. 364–367 (2007)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Keun-Chang Kwak
    • 1
  1. 1.Dept. of Control, Instrumentation, and Robotic EngineeringChosun UniversityGwangjuKorea

Personalised recommendations