Importance of Utterance Partitioning in SVM Classifier with GMM Supervectors for Text-Independent Speaker Verification

Sen, Nirmalya; Patil, Hemant. A.; Mandal, Shyamal Kr. Das; Rao, K. Sreenivasa

doi:10.1007/978-3-319-03844-5_76

Nirmalya Sen²¹,
Hemant. A. Patil²³,
Shyamal Kr. Das Mandal²¹ &
…
K. Sreenivasa Rao²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8284))

2650 Accesses
2 Citations

Abstract

This paper compares performances between GMM-UBM classifier and SVM classifier with GMM supervector as the linear kernel for text-independent speaker verification. The MFCC feature set has been used for this comparison. Experimental evaluation was conducted on the POLYCOST database. The importance of utterance partitioning for training speech has been discussed. Results reveal that, without utterance partitioning, the accuracy of SVM classifier with GMM supervectors for small test segment is poor. For proper utterance partitioning of the training speech, the SVM classifier with GMM supervectors performs significantly better compared to GMM-UBM baseline. The detailed derivation of GMM supervector has also been discussed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Reynolds, D.A., Quatieri, T.F., Dunn, R.B.: Speaker Verification Using Gaussian Mixture Models. Digital Signal Processing 10, 19–41 (2000)
Article Google Scholar
Cristianini, N., Taylor, J.S.: Support Vector Machines. Cambridge University Press, Cambridge (2000)
Google Scholar
Campbell, W.M., Sturim, D.E., Reynolds, D.A.: Support Vector Machines using GMM Supervectors for Speaker Verification. IEEE Signal Processing Lett. 13, 308–311 (2006)
Article Google Scholar
Mak, M.W., Rao, W.: Utterance partitioning with acoustic vector resampling for GMM-SVM speaker verification. Speech Communication 53, 119–130 (2011)
Article Google Scholar
Petrovska, D., et al.: POLYCOST: A Telephonic speech database for speaker recognition. In: RLA2C, Avignon, France, April 20-23, pp. 211–214 (1998)
Google Scholar
Hsu, C.W., Chang, C.C., Lin, C.J.: A Practical Guide to Support Vector Classification, http://www.csie.ntu.edu.tw/~cjlin/libsvm
Campbell, J.P.: Speaker Recognition: A Tutorial. Proceedings of the IEEE 85(9), 1437–1462 (1997)
Article Google Scholar
Do, M.N.: Fast approximation of Kullback–Leibler distance for dependence trees and hidden Markov models. IEEE Signal Processing Lett. 10(4), 115–118 (2003)
Article MathSciNet Google Scholar
Davis, S.B., Mermelsteine, P.: Comparison of parametric representation for monosyllabic word recognition in continuously spoken sentences. IEEE Trans. Acoust., Speech, Signal Processing ASSP-28(4), 357–365 (1980)
Article Google Scholar
Sahidullah, M., Saha, G.: Design, analysis and experimental evaluation of block based transformation in MFCC computation for speaker recognition. Speech Communication 54(4), 543–565 (2012)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Signal Processing Group, C.E.T, IIT Kharagpur, India
Nirmalya Sen & Shyamal Kr. Das Mandal
S.I.T, IIT Kharagpur, India
K. Sreenivasa Rao
DA-IICT, Gandhinagar, Gujarat, India
Hemant. A. Patil

Authors

Nirmalya Sen
View author publications
You can also search for this author in PubMed Google Scholar
Hemant. A. Patil
View author publications
You can also search for this author in PubMed Google Scholar
Shyamal Kr. Das Mandal
View author publications
You can also search for this author in PubMed Google Scholar
K. Sreenivasa Rao
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Business Information Systems, FSIC, National University of Ireland, University College Cork, O’Rahilly Buildings, Cork, Ireland
Rajendra Prasath
Research Centre in Computer Science, V.H.N.Senthikumara Nadar College (Autonomous), 626 001, Virudhunagar, Tamil Nadu, India
T. Kathirvalavakumar

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sen, N., Patil, H.A., Mandal, S.K.D., Rao, K.S. (2013). Importance of Utterance Partitioning in SVM Classifier with GMM Supervectors for Text-Independent Speaker Verification. In: Prasath, R., Kathirvalavakumar, T. (eds) Mining Intelligence and Knowledge Exploration. Lecture Notes in Computer Science(), vol 8284. Springer, Cham. https://doi.org/10.1007/978-3-319-03844-5_76

Download citation

DOI: https://doi.org/10.1007/978-3-319-03844-5_76
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-03843-8
Online ISBN: 978-3-319-03844-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics