Application of Vector Quantization in Emotion Recognition from Human Speech

Khanna, Preeti; Sasi Kumar, M.

doi:10.1007/978-3-642-19423-8_13

Preeti Khanna⁴ &
M. Sasi Kumar⁵

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 141))

Included in the following conference series:

International Conference on Information Intelligence, Systems, Technology and Management

1134 Accesses
5 Citations

Abstract

Recognition of emotions from speech is a complex task that is furthermore complicated by the fact that there is no unambiguous answer to what the “correct” emotion is for a given speech sample. In this paper, we discuss emotion classification of a well known German database consisting of 6 basic emotions: sadness, boredom, neutral, fear, happiness, and anger using Mel frequency Cepstral Coefficients (MFCCs). A concern with MFCC is the large number of features. We discuss the use of LBG-VQ algorithm to minimize the amount of data to be handled. At last, emotion classification is done using Euclidean distance, Manhattan distance and Chebyshev distance of the codebooks between neutral state and other emotional states for the same sample.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Emotion Recognition from Speech Using Multiple Features and Clusters

Machine Learning Approach for Emotional Speech Classification

Speech Emotion Recognition of Tamil Language: An Implementation with Linear and Nonlinear Feature

References

Cowie, R., Douglas-Cowie, E., Tsapatsoulis, N., Votsis, G., Kollias, S., Fellenz, W., Taylor, J.: Emotion recognition in human-computer interactions. IEEE Signal Proceedings 18(1), 32–80 (2001)
Article Google Scholar
Litman, D., Forbes, K.: Recognizing emotions from student speech in tutoring dialogues. In: The Proceedings of the ASRU 2003 (2003)
Google Scholar
Lee, C.M., Narayanan, S.: Towards detecting emotion in spoken dialogs. IEEE Trans. on Speech and Audio Processing 13(2) (2005)
Google Scholar
Tato, R., Santos, R., Kompe, R., Pardo, J.: Emotional space improves emotion recognition. In: The Proceedings of the Seventh International Conference on Spoken Language Processing, vol. 3, pp. 2029–2032 (2002)
Google Scholar
Yacoub, S., Simske, S., Lin, X., Burns, J.: Recognition of emotions in interactive voice response systems. In: The Proceedings of the Eighth European Conference on Speech Communication and Technology, pp. 729–732 (2003)
Google Scholar
Oudeyer, P.Y.: The production and recognition of emotions in speech: features and algorithms. International Journal of Human Computer Interaction 59(1-2), 157–183 (2003)
Google Scholar
Yu, F., Chang, E., Xu, Y.Q., Shum, H.Y.: Emotion detection from speech to enrich multimedia content. In: The Proceedings of the Second IEEE Pacific Rim Conference on Multimedia, pp. 550–557 (2001)
Google Scholar
Kwon, O.W., Chan, K., Hao, J., Lee, T.W.: Emotion recognition by speech signals. In: The Proceedings of the Eighth European Conference on Speech Communication and Technology (EUROSPEECH), pp. 125–128 (2003)
Google Scholar
German Emotional Speech Database, http://emotion-research.net/biblio/tuDatabase
Deller, J., Hansen, J., Proakis, J.: Discrete-time processing of speech signals, 2nd edn. IEEE Press, New York (2000)
Google Scholar
Soong, F., Rosenberg, E., Juang, B., Rabiner, L.: A vector quantization approach to speaker recognition. AT&T Technical Journal 66, 14–26 (1987)
Article Google Scholar
Linde, Y., Buzo, A., Gray, R.: An algorithm for vector quantizer design. IEEE Transactions on Communications 28, 84–95 (1980)
Article Google Scholar

Download references

Author information

Authors and Affiliations

SBM, SVKM’s NMIMS, Vile Parle, Mumbai, India
Preeti Khanna
CDAC, Kharghar, Navi, Mumbai, India
M. Sasi Kumar

Authors

Preeti Khanna
View author publications
You can also search for this author in PubMed Google Scholar
M. Sasi Kumar
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Computer Science, College of Engineering and Science, Louisiana Tech University, 71272, Ruston, LA, USA
Sumeet Dua
CISE Department, CSE 301, University of Florida, 32611, Gainesville, FL, USA
Sartaj Sahni
Management Development Institute, 122 007, Sukhrali, Gurgaon, India
D. P. Goyal

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Khanna, P., Sasi Kumar, M. (2011). Application of Vector Quantization in Emotion Recognition from Human Speech. In: Dua, S., Sahni, S., Goyal, D.P. (eds) Information Intelligence, Systems, Technology and Management. ICISTM 2011. Communications in Computer and Information Science, vol 141. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19423-8_13

Download citation

DOI: https://doi.org/10.1007/978-3-642-19423-8_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19422-1
Online ISBN: 978-3-642-19423-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Application of Vector Quantization in Emotion Recognition from Human Speech

Abstract

Access this chapter

Preview

Similar content being viewed by others

Emotion Recognition from Speech Using Multiple Features and Clusters

Machine Learning Approach for Emotional Speech Classification

Speech Emotion Recognition of Tamil Language: An Implementation with Linear and Nonlinear Feature

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Application of Vector Quantization in Emotion Recognition from Human Speech

Abstract

Access this chapter

Preview

Similar content being viewed by others

Emotion Recognition from Speech Using Multiple Features and Clusters

Machine Learning Approach for Emotional Speech Classification

Speech Emotion Recognition of Tamil Language: An Implementation with Linear and Nonlinear Feature

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation