Skip to main content

Segment-Level Probabilistic Sequence Kernel Based Support Vector Machines for Classification of Varying Length Patterns of Speech

  • Conference paper
  • First Online:
Neural Information Processing (ICONIP 2016)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9950))

Included in the following conference series:

Abstract

In this work we propose the segment-level probabilistic sequence kernelĀ (SLPSK) as dynamic kernel to be used in support vector machineĀ (SVM) for classification of varying length patterns of long duration speech represented as sets of feature vectors. SLPSK is built upon a set of Gaussian basis functions, where half of the basis functions contain class specific information while the other half implicates the common characteristics of all the speech utterances of all classes. The proposed kernel is computed between the pair of examples, by partitioning the speech signal into fixed number of segments and then matching the corresponding segments. We study the performance of the SVM-based classifiers using the proposed SLPSK using different pooling technique for speech emotion recognition and speaker identification and compare with that of the SVM-based classifiers using other kernels for varying length patterns.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Dileep, A.D., Chandra Sekhar, C.: GMM-based intermediate matching kernel for classification of varying length patterns of long duration speech using support vector machines. IEEE Trans. Neural Netw. Learn. Syst. 25(8), 1421ā€“1432 (2014)

    ArticleĀ  Google ScholarĀ 

  2. Smith, N., Gales, M., Niranjan, M.: Data-dependent kernels in SVM classification of speech patterns. Technical report CUED/F-INFENG/TR.387, Cambridge University Engineering Department, Trumpington Street, Cambridge, CB2 1PZ, U.K., April 2001

    Google ScholarĀ 

  3. Lee, K-A., You, C.H., Li, H., Kinnunen, T.: A GMM-based probabilistic sequence kernel for speaker verification. In: Proceedings of INTERSPEECH, Antwerp, Belgium, pp. 294ā€“297, August 2007

    Google ScholarĀ 

  4. Campbell, W.M., Sturim, D.E., Reynolds, D.A.: Support vector machines using GMM supervectors for speaker verification. IEEE Signal Process. Lett. 13(5), 308ā€“311 (2006)

    ArticleĀ  Google ScholarĀ 

  5. You, C.H., Lee, K.A., Li, H.: An SVM kernel with GMM-supervector based on the Bhattacharyya distance for speaker recognition. IEEE Signal Process. Lett. 16(1), 49ā€“52 (2009)

    ArticleĀ  Google ScholarĀ 

  6. Dileep, A.D., Chandra Sekhar, C.: Speaker recognition using pyramid match kernel based support vector machines. Int. J. Speech Technol. 15(3), 365ā€“379 (2012)

    ArticleĀ  Google ScholarĀ 

  7. Sachdev, A., Dileep, A.D., Thenkanidiyoor, V.: Example-specific density based matching kernel for classificationof varying length patterns of speech using support vector machines. In: Proceedings of ICONIP, Istanbul, Turkey, pp.177ā€“184, November 2015

    Google ScholarĀ 

  8. Yu, K., Lv, F., Huang, T., Wang, J., Yang, J., Gong, Y.: Locality-constrained linear coding for image classification. In: Proceedings of CVPR 2010, pp. 3360ā€“3367. IEEE (2010)

    Google ScholarĀ 

  9. Yang, J., Yu, K., Gong, Y., Huang, T.: Linear spatial pyramid matching using sparse coding for image classification. In: Proceedings of CVPR 2009, pp. 1794ā€“1801. IEEE (2009)

    Google ScholarĀ 

  10. Chang, C.-C., Lin, C.-J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. (TIST) 2(3), 27 (2011)

    Google ScholarĀ 

  11. Burkhardt, F., Paeschke, A., Rolfes, M., Weiss, W.S.B.: A database of German emotional speech. In: Proceedings of INTERSPEECH, Lisbon, Portugal, pp. 1517ā€“1520, September 2005

    Google ScholarĀ 

  12. Steidl, S.: Automatic classification of emotion-related user states inspontaneous childernā€™s speech. Ph.D. thesis, Der Technischen FakultƤt der UniversitƤt Erlangen-NĆ¼rnberg, Germany (2009)

    Google ScholarĀ 

  13. The NIST year 2003 speaker recognition evaluation plan (2003). http://www.itl.nist.gov/iad/mig/tests/sre/2003/

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dileep Aroor Dinesh .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

Ā© 2016 Springer International Publishing AG

About this paper

Cite this paper

Gupta, S., Thenkanidiyoor, V., Aroor Dinesh, D. (2016). Segment-Level Probabilistic Sequence Kernel Based Support Vector Machines for Classification of Varying Length Patterns of Speech . In: Hirose, A., Ozawa, S., Doya, K., Ikeda, K., Lee, M., Liu, D. (eds) Neural Information Processing. ICONIP 2016. Lecture Notes in Computer Science(), vol 9950. Springer, Cham. https://doi.org/10.1007/978-3-319-46681-1_39

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-46681-1_39

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-46680-4

  • Online ISBN: 978-3-319-46681-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics