Abstract
In this work we propose the segment-level probabilistic sequence kernelĀ (SLPSK) as dynamic kernel to be used in support vector machineĀ (SVM) for classification of varying length patterns of long duration speech represented as sets of feature vectors. SLPSK is built upon a set of Gaussian basis functions, where half of the basis functions contain class specific information while the other half implicates the common characteristics of all the speech utterances of all classes. The proposed kernel is computed between the pair of examples, by partitioning the speech signal into fixed number of segments and then matching the corresponding segments. We study the performance of the SVM-based classifiers using the proposed SLPSK using different pooling technique for speech emotion recognition and speaker identification and compare with that of the SVM-based classifiers using other kernels for varying length patterns.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Dileep, A.D., Chandra Sekhar, C.: GMM-based intermediate matching kernel for classification of varying length patterns of long duration speech using support vector machines. IEEE Trans. Neural Netw. Learn. Syst. 25(8), 1421ā1432 (2014)
Smith, N., Gales, M., Niranjan, M.: Data-dependent kernels in SVM classification of speech patterns. Technical report CUED/F-INFENG/TR.387, Cambridge University Engineering Department, Trumpington Street, Cambridge, CB2 1PZ, U.K., April 2001
Lee, K-A., You, C.H., Li, H., Kinnunen, T.: A GMM-based probabilistic sequence kernel for speaker verification. In: Proceedings of INTERSPEECH, Antwerp, Belgium, pp. 294ā297, August 2007
Campbell, W.M., Sturim, D.E., Reynolds, D.A.: Support vector machines using GMM supervectors for speaker verification. IEEE Signal Process. Lett. 13(5), 308ā311 (2006)
You, C.H., Lee, K.A., Li, H.: An SVM kernel with GMM-supervector based on the Bhattacharyya distance for speaker recognition. IEEE Signal Process. Lett. 16(1), 49ā52 (2009)
Dileep, A.D., Chandra Sekhar, C.: Speaker recognition using pyramid match kernel based support vector machines. Int. J. Speech Technol. 15(3), 365ā379 (2012)
Sachdev, A., Dileep, A.D., Thenkanidiyoor, V.: Example-specific density based matching kernel for classificationof varying length patterns of speech using support vector machines. In: Proceedings of ICONIP, Istanbul, Turkey, pp.177ā184, November 2015
Yu, K., Lv, F., Huang, T., Wang, J., Yang, J., Gong, Y.: Locality-constrained linear coding for image classification. In: Proceedings of CVPR 2010, pp. 3360ā3367. IEEE (2010)
Yang, J., Yu, K., Gong, Y., Huang, T.: Linear spatial pyramid matching using sparse coding for image classification. In: Proceedings of CVPR 2009, pp. 1794ā1801. IEEE (2009)
Chang, C.-C., Lin, C.-J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. (TIST) 2(3), 27 (2011)
Burkhardt, F., Paeschke, A., Rolfes, M., Weiss, W.S.B.: A database of German emotional speech. In: Proceedings of INTERSPEECH, Lisbon, Portugal, pp. 1517ā1520, September 2005
Steidl, S.: Automatic classification of emotion-related user states inspontaneous childernās speech. Ph.D. thesis, Der Technischen FakultƤt der UniversitƤt Erlangen-NĆ¼rnberg, Germany (2009)
The NIST year 2003 speaker recognition evaluation plan (2003). http://www.itl.nist.gov/iad/mig/tests/sre/2003/
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
Ā© 2016 Springer International Publishing AG
About this paper
Cite this paper
Gupta, S., Thenkanidiyoor, V., Aroor Dinesh, D. (2016). Segment-Level Probabilistic Sequence Kernel Based Support Vector Machines for Classification of Varying Length Patterns of Speech . In: Hirose, A., Ozawa, S., Doya, K., Ikeda, K., Lee, M., Liu, D. (eds) Neural Information Processing. ICONIP 2016. Lecture Notes in Computer Science(), vol 9950. Springer, Cham. https://doi.org/10.1007/978-3-319-46681-1_39
Download citation
DOI: https://doi.org/10.1007/978-3-319-46681-1_39
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-46680-4
Online ISBN: 978-3-319-46681-1
eBook Packages: Computer ScienceComputer Science (R0)