Skip to main content

Wavelet Packet Based Mel Frequency Cepstral Features for Text Independent Speaker Identification

  • Conference paper
Intelligent Informatics

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 182))

Abstract

The present research proposes a paradigm which combines the Wavelet Packet Transform (WPT) with the distinguished Mel Frequency Cepstral Coefficients (MFCC) for extraction of speech feature vectors in the task of text independent speaker identification. The proposed technique overcomes the single resolution limitation of MFCC by incorporating the multi resolution analysis offered by WPT. To check the accuracy of the proposed paradigm in the real life scenario, it is tested on the speaker database by using Hidden Markov Model (HMM) and Gaussian Mixture Model (GMM) as classifiers and their relative performance for identification purpose is compared. The identification results of the MFCC features and the Wavelet Packet based Mel Frequency Cepstral (WP-MFC) Features are compared to validate the efficiency of the proposed paradigm. Accuracy as high as 100% was achieved in some cases using WP-MFC Features.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Reynolds, D.A.: Speaker Identification and Verification Using Gaussian Mixture Speaker Models. Speech Communication 17 (1995)

    Google Scholar 

  2. Bolt Richard, H., Cooper Franklin, S., David Edward Jr., E., Denes Peter, B., Pickett James, M., Stevens Kenneth, N.: Speaker Identification by Speech Spectograms: A Scientists’ View of its Reliability for Legal Purposes. The Acoustic Society of America 47 (1970)

    Google Scholar 

  3. Reynolds Douglas, A.: Identification, Experimental Evaluation of Features for Robust Speaker. IEEE Transactions on Speech and Audio Processing 77, 257–285 (1994)

    Google Scholar 

  4. Gaikwad Santosh, K., Gawali Bharti, W., Pravin, Y.: A Review on Speech Recognition Technique. International Journal of Computer Applications 10 (2010)

    Google Scholar 

  5. Sirko, M., Michael, P., Ralf, S., Hermann, N.: Computing Mel-frequency coefficients on Power Spectrum. IEEE Proceedings of IEEE 1, 73–76 (2001)

    Google Scholar 

  6. Chen, S.-H., Luo, Y.-R.: Speaker Verification Using MFCC and Support. In: Proceedings of the International MultiConference of Engineers and Computer Scientists (2009)

    Google Scholar 

  7. Rabiner, L.: A tutorial on hidden Markov models and selected applications in speech recognition, pp. 257–286 (1989)

    Google Scholar 

  8. Blimes, J.A.: A gentle tutorial of the EM algorithm and its application to parameter estimation for gaussian mixture and hidden markov models. International Computer Science Institute (1998)

    Google Scholar 

  9. Reynolds, D.A., Campbell, W.M.: Springer Handbook of Speech Processing. Text Independent Speaker Recognition. Springer (2008)

    Google Scholar 

  10. Mallat, S.G.: A theory for multiresolution signal decomposition: the wavelet representation. IEEE 111, 674–693 (1989)

    Google Scholar 

  11. Robi, P.: The Engineers Ultimate Guide to Wavelet Analysis (2012), http://users.rowan.edu/~polikar/wavelets/wttutorial.html (accessed March 20, 2012)

  12. VoxForge (2012), http://www.voxforge.org/home/downloads/speech/english (accessed February 20, 2012)

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Srivastava, S., Bhardwaj, S., Bhandari, A., Gupta, K., Bahl, H., Gupta, J.R.P. (2013). Wavelet Packet Based Mel Frequency Cepstral Features for Text Independent Speaker Identification. In: Abraham, A., Thampi, S. (eds) Intelligent Informatics. Advances in Intelligent Systems and Computing, vol 182. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32063-7_26

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-32063-7_26

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-32062-0

  • Online ISBN: 978-3-642-32063-7

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics