Abstract
Emotional information in speech signal is an important information resource that expresses the state of mind of a person. For robots to plan their actions autonomously and interact with people, recognizing human emotions is a critical parameter required for emotion recognition. In this paper, the emotion is recognized and identified from Tamil speech signal of the speaker so as to make conversation between human and computer more native and natural. The most human non-verbal cues such as pitch, loudness, spectrum and speech rate are efficient carriers of emotion. In this proposed work, there are three important aspects of designing a speech emotion recognition system. For developing a speech emotion recognition system, three aspects are taken into consideration. They are database, feature extraction and emotion classifiers. A database is created for Tamil language, and the Mel-Frequency Cepstral Coefficient (MFCC) and energy are calculated from both recorded audio signal and real-time audio signal. From these extracted prosodic and spectral features, emotions are classified using support vector machine (SVM) with overall 85.4% accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Agrawal, A., Mishra, N.K.: Fusion based emotion recognition system. In: Proceedings—2016 International Conference on Computational Science and Computational Intelligence, CSCI 2016, pp. 727–732. Institute of Electrical and Electronics Engineers Inc. (2017). https://doi.org/10.1109/CSCI.2016.0142
Desai, D.: Emotion recognition using speech signal: a review. Int. Res. J. Eng. Technol. (IRJET) 05(04), 1599–1605 (2018)
Mustafa, M.B., Yusoof, M.A.M., Don, Z.M., Malekzadeh, M.: Speech emotion recognition research: an analysis of research focus. Int. J. Speech Technol. 21(1), 137–156 (2018). https://doi.org/10.1007/s10772-018-9493-x
Ke, X., Zhu, Y., Wen, L., Zhang, W.: Speech emotion recognition based on SVM and ANN. Int. J. Mach. Learn. Comput. 8(3), 198–202 (2018). https://doi.org/10.18178/ijmlc.2018.8.3.687
Abdollahpour, M., Zamani, J., Rad, H.S.: Feature representation for speech emotion recognition. In: 2017 25th Iranian Conference on Electrical Engineering, ICEE 2017, pp. 1465–1468. Institute of Electrical and Electronics Engineers Inc. (2017). https://doi.org/10.1109/IranianCEE.2017.7985273
Jamil, N., Apandi, F., Hamzah, R.: Influences of age in emotion recognition of spontaneous speech. In: 2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) (2017)
Albornoz, E.M., Milone, D.H.: Emotion recognition in never-seen languages a novel ensemble method with emotion profiles. IEEE Trans. Affect. Comput. 8(1), 43–53 (2017)
Joe, C.V.: Developing Tamil emotional speech corpus and evaluating using SVM. In: 2014 International Conference on Science Engineering and Management Research, ICSEMR 2014. Institute of Electrical and Electronics Engineers Inc. (2014). https://doi.org/10.1109/ICSEMR.2014.7043627
Fragopanagos, N., Taylor, J.G.: Emotion recognition in human-computer interaction. IEEE Trans. Speech Audio Process 18(9), 389–405 (2017)
Chandran, A., Pravena, D., Govind, D.: Development of speech emotion recognition system using deep belief networks in Malayalam language. In: 2017 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2017, Jan 2017, vol. 2017, pp. 676–680. Institute of Electrical and Electronics Engineers Inc. (2017). https://doi.org/10.1109/ICACCI.2017.8125919
Chernykh, V., Prikhodko, P.: Emotion recognition from speech with recurrent neural networks (2017). Retrieved from http://arxiv.org/abs/1701.08071
El Ayadi, M., Kamel, M.S., Karray, F.: Survey on speech emotion recognition: features, classification schemes, and databases. Pattern Recognit. 44(3), 572–587 (2011). https://doi.org/10.1016/j.patcog.2010.09.020
Garg, P., Sehgal, S.: Comparison of emotion recognition models in spoken dialogs. Int. J. Softw. Hardw. Res. Eng. 3 (2015)
Joseph, A., Sridhar, R.: Performance evaluation of various classifiers in emotion recognition using discrete wavelet transform, linear predictor coefficients and formant features (2017). https://doi.org/10.1007/978-981-10-2525-9
Javier, G., Sundgren, D., Rahmani, R., Larsson, A., Moran, A., Bonet, I.: Speech emotion recognition in emotional feedback for human-robot interaction. Int. J. Adv. Res. Artif. Intell. 4(2) (2015). https://doi.org/10.14569/ijarai.2015.040204
Srikanth, M., Pravena, D., Govind, D.: Tamil speech emotion recognition using deep belief network (DBN). In: International Symposium on Signal Processing and Intelligent Recognition Systems (SIRS), pp. 328–336 (2017)
Dave, N.: Feature extraction methods LPC, PLP and MFCC in speech recognition. Int. J. Adv. Res. Eng. Technol. 1(VI), 1–5 (2013)
Gupta, D., Richhariya, B., Borah, P.: A fuzzy twin support vector machine based on information entropy for class imbalance learning. Neural Comput. Appl. 1–12 (2018). https://doi.org/10.1007/s00521-018-3551-9
Tanveer, M., Shubham, K.: Smooth twin support vector machines via unconstrained convex minimization. Filomat 31(8), 2195–2210 (2017). https://doi.org/10.2298/FIL1708195T
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Sowmya, V., Rajeswari, A. (2020). Speech Emotion Recognition for Tamil Language Speakers. In: Agarwal, S., Verma, S., Agrawal, D. (eds) Machine Intelligence and Signal Processing. MISP 2019. Advances in Intelligent Systems and Computing, vol 1085. Springer, Singapore. https://doi.org/10.1007/978-981-15-1366-4_10
Download citation
DOI: https://doi.org/10.1007/978-981-15-1366-4_10
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-1365-7
Online ISBN: 978-981-15-1366-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)