Wavelet Packet Based Mel Frequency Cepstral Features for Text Independent Speaker Identification

Srivastava, Smriti; Bhardwaj, Saurabh; Bhandari, Abhishek; Gupta, Krit; Bahl, Hitesh; Gupta, J. R. P.

doi:10.1007/978-3-642-32063-7_26

Smriti Srivastava³,
Saurabh Bhardwaj³,
Abhishek Bhandari³,
Krit Gupta³,
Hitesh Bahl³ &
…
J. R. P. Gupta³

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 182))

1812 Accesses
2 Citations

Abstract

The present research proposes a paradigm which combines the Wavelet Packet Transform (WPT) with the distinguished Mel Frequency Cepstral Coefficients (MFCC) for extraction of speech feature vectors in the task of text independent speaker identification. The proposed technique overcomes the single resolution limitation of MFCC by incorporating the multi resolution analysis offered by WPT. To check the accuracy of the proposed paradigm in the real life scenario, it is tested on the speaker database by using Hidden Markov Model (HMM) and Gaussian Mixture Model (GMM) as classifiers and their relative performance for identification purpose is compared. The identification results of the MFCC features and the Wavelet Packet based Mel Frequency Cepstral (WP-MFC) Features are compared to validate the efficiency of the proposed paradigm. Accuracy as high as 100% was achieved in some cases using WP-MFC Features.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Reynolds, D.A.: Speaker Identification and Verification Using Gaussian Mixture Speaker Models. Speech Communication 17 (1995)
Google Scholar
Bolt Richard, H., Cooper Franklin, S., David Edward Jr., E., Denes Peter, B., Pickett James, M., Stevens Kenneth, N.: Speaker Identification by Speech Spectograms: A Scientists’ View of its Reliability for Legal Purposes. The Acoustic Society of America 47 (1970)
Google Scholar
Reynolds Douglas, A.: Identification, Experimental Evaluation of Features for Robust Speaker. IEEE Transactions on Speech and Audio Processing 77, 257–285 (1994)
Google Scholar
Gaikwad Santosh, K., Gawali Bharti, W., Pravin, Y.: A Review on Speech Recognition Technique. International Journal of Computer Applications 10 (2010)
Google Scholar
Sirko, M., Michael, P., Ralf, S., Hermann, N.: Computing Mel-frequency coefficients on Power Spectrum. IEEE Proceedings of IEEE 1, 73–76 (2001)
Google Scholar
Chen, S.-H., Luo, Y.-R.: Speaker Verification Using MFCC and Support. In: Proceedings of the International MultiConference of Engineers and Computer Scientists (2009)
Google Scholar
Rabiner, L.: A tutorial on hidden Markov models and selected applications in speech recognition, pp. 257–286 (1989)
Google Scholar
Blimes, J.A.: A gentle tutorial of the EM algorithm and its application to parameter estimation for gaussian mixture and hidden markov models. International Computer Science Institute (1998)
Google Scholar
Reynolds, D.A., Campbell, W.M.: Springer Handbook of Speech Processing. Text Independent Speaker Recognition. Springer (2008)
Google Scholar
Mallat, S.G.: A theory for multiresolution signal decomposition: the wavelet representation. IEEE 111, 674–693 (1989)
Google Scholar
Robi, P.: The Engineers Ultimate Guide to Wavelet Analysis (2012), http://users.rowan.edu/~polikar/wavelets/wttutorial.html (accessed March 20, 2012)
VoxForge (2012), http://www.voxforge.org/home/downloads/speech/english (accessed February 20, 2012)

Download references

Author information

Authors and Affiliations

Netaji Subhas Institute of Technology, New Delhi, 110078, India
Smriti Srivastava, Saurabh Bhardwaj, Abhishek Bhandari, Krit Gupta, Hitesh Bahl & J. R. P. Gupta

Authors

Smriti Srivastava
View author publications
You can also search for this author in PubMed Google Scholar
Saurabh Bhardwaj
View author publications
You can also search for this author in PubMed Google Scholar
Abhishek Bhandari
View author publications
You can also search for this author in PubMed Google Scholar
Krit Gupta
View author publications
You can also search for this author in PubMed Google Scholar
Hitesh Bahl
View author publications
You can also search for this author in PubMed Google Scholar
J. R. P. Gupta
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

(MIR Labs), Scientific Network for Innovation and, Machine Intelligence Research Labs, MIR Labs Campus, Auburn, 98071, Washington, USA
Ajith Abraham
Technology and Management, Indian Institute of Information, Technopark Campus, Trivandrum, 695581, India
Sabu M Thampi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Srivastava, S., Bhardwaj, S., Bhandari, A., Gupta, K., Bahl, H., Gupta, J.R.P. (2013). Wavelet Packet Based Mel Frequency Cepstral Features for Text Independent Speaker Identification. In: Abraham, A., Thampi, S. (eds) Intelligent Informatics. Advances in Intelligent Systems and Computing, vol 182. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32063-7_26

Download citation

DOI: https://doi.org/10.1007/978-3-642-32063-7_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32062-0
Online ISBN: 978-3-642-32063-7
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics