Skip to main content

Comparative Analysis of Different Classifiers for Speech Emotion Recognition

  • Conference paper
  • First Online:
Proceedings of International Conference on Advances in Computer Engineering and Communication Systems

Part of the book series: Learning and Analytics in Intelligent Systems ((LAIS,volume 20))

Abstract

Speech Emotion Recognition has been a vital topic of research in human–machine interface applications for many years. It attempts to make human–machine interactions more intelligent by understanding the emotional state of human beings. This paper presents an up-to-date survey of Speech Emotion Recognition discussing the important approaches regarding the use of different classification algorithms to recognize emotions. The focus is mainly on classifiers like Multilayer Perceptron (MLP), Support Vector Machine, Decision Tree, Random Forest, and Convolutional Neural Network (CNN). First, an acted emotional dataset, RAVDESS, will be discussed in detail. Second, the features that were extracted and selected will be addressed. Then, the focus is shifted to these classifier algorithms that categorize the input data into four classes of emotions: happy, angry, sad, and neutral. Each algorithm is implemented and its performance is compared with the others. Finally, conclusions about the best working model and limitations of each classifier used for Speech Emotion Recognition System are presented.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. H. Meng, T. Yan, F. Yuan, A. Wei, Speech emotion recognition from 3D Log-Mel spectrograms with deep learning networks. IEEE (2019)

    Google Scholar 

  2. H. Ali, M. Hariharan, S. Yaacob, A.H. Adom, Facial emotion recognition using empirical mode decomposition. Expert Syst. Appl. 42(3), 1261–1277 (2015)

    Article  Google Scholar 

  3. Z.T. Liu, M. Wu, W.H. Cao, J.W. Mao, J.P. Xu, G.Z. Tan, Speech emotion recognition based on feature selection and extreme learning machine decision tree. Neurocomputing 273, 271–280 (2018)

    Article  Google Scholar 

  4. M. Ragot, N. Martin, S. Em, N. Pallamin, J.M. Diverrez, Emotion recognition using physiological signals: Laboratory vs. wearable sensors, in International Conference on Applied Human Factors and Ergonomics. Springer, pp. 15–22 (2017)

    Google Scholar 

  5. C.M. Lee, S.S. Narayan, Towards detecting emotions in spoken dialogues. IEEE Trans. Speech Audio Process. (2005)

    Google Scholar 

  6. S.N. Roopa, M. Prabhakaran, P. Betty, Speech emotion recognition using deep learning. Int. J. Recent Technol. Eng. (2018)

    Google Scholar 

  7. Z. Li, A study on emotional feature analysis and recognition in speech signal. J. China Inst. Commun. (2000)

    Google Scholar 

  8. S. Wu, T.H. Falk, W.Y. Chan, Automatic speech emotion recognition using modulation spectral features. Speech Commun. 53, 768–785 (2011)

    Article  Google Scholar 

  9. A.B. Ingale, D.S.Chaudhari, Speech emotion recognition. Int. J. Soft Comput. Eng. (2012)

    Google Scholar 

  10. Z. Yongzhao, C. Peng, Research and implementation of emotional feature and recognition in speech signal. J. Jiangsu Univ. (2005)

    Google Scholar 

  11. M. El Ayadi, M.S. Kamel, F. Karray, Survey on Speech emotion recognition: features, classification schemes and databases. Pattern Recogn. (2011)

    Google Scholar 

  12. C. Huang, W. Gong, W. Fu, D. Feng, A Research of Speech Emotion Recogntion Based on Deep Belief Network and SVM (Hindawi, 2014)

    Google Scholar 

  13. L. Kerkeni, S. Youssef, M. Mbarki, K. Raoof, M. Ali Mahjoub, Speech emotion recognition: methods and cases study. ICAART (2018)

    Google Scholar 

  14. N. Ratna Kanth, S. Saraswathi, A suvery on speech emotion recognition. Adv. Comput. Sci. Inf. Technol. (2014)

    Google Scholar 

  15. M. Waqas Bhatti, Y. Wang, L. Guan, A neural network approach for human emotion recognition in speech, in International Symposium on Circuits and Systems (2014)

    Google Scholar 

  16. K. Scherer,, Vocal communication of emotion: a review of research paradigms. Speech Commun. (2003)

    Google Scholar 

  17. M. Swain, A. Routray, P. Kabisatpathy, Databases, features and classifiers for speech emotion recognition: a review. Int. J. Speech Technol. (2018)

    Google Scholar 

  18. https://www.kaggle.com/uwrfkaggler/ravdess-emotional-speech-audio

  19. M. Swain, A. Routray, P. Kabisatpathy,Databases, features and classifiers for speech emotion recognition. Int. J. Speech Technol. (2018)

    Google Scholar 

  20. Q. Mao, M. Dong, Z. Huang, Y.Zhan, Learning salient features for speech emotion recognition using convolutional neural networks. IEEE (2014)

    Google Scholar 

  21. P. Shen, Z. Changjun, X. Chen, Automatic speech emotion recognition using support vector machine, in International Conference in Electronic and Mechanical Engineering and Information Technology (2011)

    Google Scholar 

  22. Y. Chavhan, M.L. Dhore, P. Yesaware, Speech emotion recognition using support vector machine. Int. J. Comput. Appl. (2010)

    Google Scholar 

  23. S.R. Gunn, Support Vector Machines for Classification and Regression [PhD thesis] (1998)

    Google Scholar 

  24. T.P. Robinson, Real-time recognition of affective states from nonverbal features of speech and its application in public speaking skill. IEEE (2011)

    Google Scholar 

  25. I.Chiriacescu, Automatic Emotion Analysis Based on Speech. M.Sc, THESIS Delft University of Technology (2009)

    Google Scholar 

  26. F. Noroozi, S. Tomasz, D. Kamińska, G. Anbarjafari, Vocal–based emotion recognition using random forests and decision tree. Int. J. Speech Technol. (2017)

    Google Scholar 

  27. Z. Ciota, Feature extraction of spoken dialogues for emotion recognition. ICSP (2006)

    Google Scholar 

  28. J. Zhu, X. Wu, Z.Lv, Speech emotionr recognition algorithm based on SVM. Comput. Sci. Appl. (2011)

    Google Scholar 

  29. O. Abdel-Hamid, A.R Mohamed, H. Jiang, L. Deng, G. Penn, D. Yu, Convolutional neural networks for speech recognition. IEEE/ACM Trans. Audio Speech Lang. Process (2014)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Albert Priya Valentina .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Madhavi, A., Priya Valentina, A., Mounika, K., Rohit, B., Nagma, S. (2021). Comparative Analysis of Different Classifiers for Speech Emotion Recognition. In: Kiran Mai, C., Kiranmayee, B.V., Favorskaya, M.N., Chandra Satapathy, S., Raju, K.S. (eds) Proceedings of International Conference on Advances in Computer Engineering and Communication Systems. Learning and Analytics in Intelligent Systems, vol 20. Springer, Singapore. https://doi.org/10.1007/978-981-15-9293-5_48

Download citation

  • DOI: https://doi.org/10.1007/978-981-15-9293-5_48

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-15-9292-8

  • Online ISBN: 978-981-15-9293-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics