Abstract
Speech Emotion Recognition has been a vital topic of research in human–machine interface applications for many years. It attempts to make human–machine interactions more intelligent by understanding the emotional state of human beings. This paper presents an up-to-date survey of Speech Emotion Recognition discussing the important approaches regarding the use of different classification algorithms to recognize emotions. The focus is mainly on classifiers like Multilayer Perceptron (MLP), Support Vector Machine, Decision Tree, Random Forest, and Convolutional Neural Network (CNN). First, an acted emotional dataset, RAVDESS, will be discussed in detail. Second, the features that were extracted and selected will be addressed. Then, the focus is shifted to these classifier algorithms that categorize the input data into four classes of emotions: happy, angry, sad, and neutral. Each algorithm is implemented and its performance is compared with the others. Finally, conclusions about the best working model and limitations of each classifier used for Speech Emotion Recognition System are presented.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
H. Meng, T. Yan, F. Yuan, A. Wei, Speech emotion recognition from 3D Log-Mel spectrograms with deep learning networks. IEEE (2019)
H. Ali, M. Hariharan, S. Yaacob, A.H. Adom, Facial emotion recognition using empirical mode decomposition. Expert Syst. Appl. 42(3), 1261–1277 (2015)
Z.T. Liu, M. Wu, W.H. Cao, J.W. Mao, J.P. Xu, G.Z. Tan, Speech emotion recognition based on feature selection and extreme learning machine decision tree. Neurocomputing 273, 271–280 (2018)
M. Ragot, N. Martin, S. Em, N. Pallamin, J.M. Diverrez, Emotion recognition using physiological signals: Laboratory vs. wearable sensors, in International Conference on Applied Human Factors and Ergonomics. Springer, pp. 15–22 (2017)
C.M. Lee, S.S. Narayan, Towards detecting emotions in spoken dialogues. IEEE Trans. Speech Audio Process. (2005)
S.N. Roopa, M. Prabhakaran, P. Betty, Speech emotion recognition using deep learning. Int. J. Recent Technol. Eng. (2018)
Z. Li, A study on emotional feature analysis and recognition in speech signal. J. China Inst. Commun. (2000)
S. Wu, T.H. Falk, W.Y. Chan, Automatic speech emotion recognition using modulation spectral features. Speech Commun. 53, 768–785 (2011)
A.B. Ingale, D.S.Chaudhari, Speech emotion recognition. Int. J. Soft Comput. Eng. (2012)
Z. Yongzhao, C. Peng, Research and implementation of emotional feature and recognition in speech signal. J. Jiangsu Univ. (2005)
M. El Ayadi, M.S. Kamel, F. Karray, Survey on Speech emotion recognition: features, classification schemes and databases. Pattern Recogn. (2011)
C. Huang, W. Gong, W. Fu, D. Feng, A Research of Speech Emotion Recogntion Based on Deep Belief Network and SVM (Hindawi, 2014)
L. Kerkeni, S. Youssef, M. Mbarki, K. Raoof, M. Ali Mahjoub, Speech emotion recognition: methods and cases study. ICAART (2018)
N. Ratna Kanth, S. Saraswathi, A suvery on speech emotion recognition. Adv. Comput. Sci. Inf. Technol. (2014)
M. Waqas Bhatti, Y. Wang, L. Guan, A neural network approach for human emotion recognition in speech, in International Symposium on Circuits and Systems (2014)
K. Scherer,, Vocal communication of emotion: a review of research paradigms. Speech Commun. (2003)
M. Swain, A. Routray, P. Kabisatpathy, Databases, features and classifiers for speech emotion recognition: a review. Int. J. Speech Technol. (2018)
https://www.kaggle.com/uwrfkaggler/ravdess-emotional-speech-audio
M. Swain, A. Routray, P. Kabisatpathy,Databases, features and classifiers for speech emotion recognition. Int. J. Speech Technol. (2018)
Q. Mao, M. Dong, Z. Huang, Y.Zhan, Learning salient features for speech emotion recognition using convolutional neural networks. IEEE (2014)
P. Shen, Z. Changjun, X. Chen, Automatic speech emotion recognition using support vector machine, in International Conference in Electronic and Mechanical Engineering and Information Technology (2011)
Y. Chavhan, M.L. Dhore, P. Yesaware, Speech emotion recognition using support vector machine. Int. J. Comput. Appl. (2010)
S.R. Gunn, Support Vector Machines for Classification and Regression [PhD thesis] (1998)
T.P. Robinson, Real-time recognition of affective states from nonverbal features of speech and its application in public speaking skill. IEEE (2011)
I.Chiriacescu, Automatic Emotion Analysis Based on Speech. M.Sc, THESIS Delft University of Technology (2009)
F. Noroozi, S. Tomasz, D. Kamińska, G. Anbarjafari, Vocal–based emotion recognition using random forests and decision tree. Int. J. Speech Technol. (2017)
Z. Ciota, Feature extraction of spoken dialogues for emotion recognition. ICSP (2006)
J. Zhu, X. Wu, Z.Lv, Speech emotionr recognition algorithm based on SVM. Comput. Sci. Appl. (2011)
O. Abdel-Hamid, A.R Mohamed, H. Jiang, L. Deng, G. Penn, D. Yu, Convolutional neural networks for speech recognition. IEEE/ACM Trans. Audio Speech Lang. Process (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Madhavi, A., Priya Valentina, A., Mounika, K., Rohit, B., Nagma, S. (2021). Comparative Analysis of Different Classifiers for Speech Emotion Recognition. In: Kiran Mai, C., Kiranmayee, B.V., Favorskaya, M.N., Chandra Satapathy, S., Raju, K.S. (eds) Proceedings of International Conference on Advances in Computer Engineering and Communication Systems. Learning and Analytics in Intelligent Systems, vol 20. Springer, Singapore. https://doi.org/10.1007/978-981-15-9293-5_48
Download citation
DOI: https://doi.org/10.1007/978-981-15-9293-5_48
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-9292-8
Online ISBN: 978-981-15-9293-5
eBook Packages: Computer ScienceComputer Science (R0)