Speech Emotion Recognition for Tamil Language Speakers

Sowmya, V.; Rajeswari, A.

doi:10.1007/978-981-15-1366-4_10

V. Sowmya¹⁷ &
A. Rajeswari¹⁷

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1085))

Included in the following conference series:

International Conference on Machine Intelligence and Signal Processing

506 Accesses
5 Citations

Abstract

Emotional information in speech signal is an important information resource that expresses the state of mind of a person. For robots to plan their actions autonomously and interact with people, recognizing human emotions is a critical parameter required for emotion recognition. In this paper, the emotion is recognized and identified from Tamil speech signal of the speaker so as to make conversation between human and computer more native and natural. The most human non-verbal cues such as pitch, loudness, spectrum and speech rate are efficient carriers of emotion. In this proposed work, there are three important aspects of designing a speech emotion recognition system. For developing a speech emotion recognition system, three aspects are taken into consideration. They are database, feature extraction and emotion classifiers. A database is created for Tamil language, and the Mel-Frequency Cepstral Coefficient (MFCC) and energy are calculated from both recorded audio signal and real-time audio signal. From these extracted prosodic and spectral features, emotions are classified using support vector machine (SVM) with overall 85.4% accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Agrawal, A., Mishra, N.K.: Fusion based emotion recognition system. In: Proceedings—2016 International Conference on Computational Science and Computational Intelligence, CSCI 2016, pp. 727–732. Institute of Electrical and Electronics Engineers Inc. (2017). https://doi.org/10.1109/CSCI.2016.0142
Desai, D.: Emotion recognition using speech signal: a review. Int. Res. J. Eng. Technol. (IRJET) 05(04), 1599–1605 (2018)
Google Scholar
Mustafa, M.B., Yusoof, M.A.M., Don, Z.M., Malekzadeh, M.: Speech emotion recognition research: an analysis of research focus. Int. J. Speech Technol. 21(1), 137–156 (2018). https://doi.org/10.1007/s10772-018-9493-x
Article Google Scholar
Ke, X., Zhu, Y., Wen, L., Zhang, W.: Speech emotion recognition based on SVM and ANN. Int. J. Mach. Learn. Comput. 8(3), 198–202 (2018). https://doi.org/10.18178/ijmlc.2018.8.3.687
Article Google Scholar
Abdollahpour, M., Zamani, J., Rad, H.S.: Feature representation for speech emotion recognition. In: 2017 25th Iranian Conference on Electrical Engineering, ICEE 2017, pp. 1465–1468. Institute of Electrical and Electronics Engineers Inc. (2017). https://doi.org/10.1109/IranianCEE.2017.7985273
Jamil, N., Apandi, F., Hamzah, R.: Influences of age in emotion recognition of spontaneous speech. In: 2017 International Conference on Speech Technology and Human-Computer Dialogue (SpeD) (2017)
Google Scholar
Albornoz, E.M., Milone, D.H.: Emotion recognition in never-seen languages a novel ensemble method with emotion profiles. IEEE Trans. Affect. Comput. 8(1), 43–53 (2017)
Article Google Scholar
Joe, C.V.: Developing Tamil emotional speech corpus and evaluating using SVM. In: 2014 International Conference on Science Engineering and Management Research, ICSEMR 2014. Institute of Electrical and Electronics Engineers Inc. (2014). https://doi.org/10.1109/ICSEMR.2014.7043627
Fragopanagos, N., Taylor, J.G.: Emotion recognition in human-computer interaction. IEEE Trans. Speech Audio Process 18(9), 389–405 (2017)
Google Scholar
Chandran, A., Pravena, D., Govind, D.: Development of speech emotion recognition system using deep belief networks in Malayalam language. In: 2017 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2017, Jan 2017, vol. 2017, pp. 676–680. Institute of Electrical and Electronics Engineers Inc. (2017). https://doi.org/10.1109/ICACCI.2017.8125919
Chernykh, V., Prikhodko, P.: Emotion recognition from speech with recurrent neural networks (2017). Retrieved from http://arxiv.org/abs/1701.08071
El Ayadi, M., Kamel, M.S., Karray, F.: Survey on speech emotion recognition: features, classification schemes, and databases. Pattern Recognit. 44(3), 572–587 (2011). https://doi.org/10.1016/j.patcog.2010.09.020
Article Google Scholar
Garg, P., Sehgal, S.: Comparison of emotion recognition models in spoken dialogs. Int. J. Softw. Hardw. Res. Eng. 3 (2015)
Google Scholar
Joseph, A., Sridhar, R.: Performance evaluation of various classifiers in emotion recognition using discrete wavelet transform, linear predictor coefficients and formant features (2017). https://doi.org/10.1007/978-981-10-2525-9
Google Scholar
Javier, G., Sundgren, D., Rahmani, R., Larsson, A., Moran, A., Bonet, I.: Speech emotion recognition in emotional feedback for human-robot interaction. Int. J. Adv. Res. Artif. Intell. 4(2) (2015). https://doi.org/10.14569/ijarai.2015.040204
Srikanth, M., Pravena, D., Govind, D.: Tamil speech emotion recognition using deep belief network (DBN). In: International Symposium on Signal Processing and Intelligent Recognition Systems (SIRS), pp. 328–336 (2017)
Google Scholar
Dave, N.: Feature extraction methods LPC, PLP and MFCC in speech recognition. Int. J. Adv. Res. Eng. Technol. 1(VI), 1–5 (2013)
Google Scholar
Gupta, D., Richhariya, B., Borah, P.: A fuzzy twin support vector machine based on information entropy for class imbalance learning. Neural Comput. Appl. 1–12 (2018). https://doi.org/10.1007/s00521-018-3551-9
Article Google Scholar
Tanveer, M., Shubham, K.: Smooth twin support vector machines via unconstrained convex minimization. Filomat 31(8), 2195–2210 (2017). https://doi.org/10.2298/FIL1708195T
Article MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Coimbatore Institute of Technology, Coimbatore, Tamil Nadu, India
V. Sowmya & A. Rajeswari

Authors

V. Sowmya
View author publications
You can also search for this author in PubMed Google Scholar
A. Rajeswari
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to V. Sowmya .

Editor information

Editors and Affiliations

Department of Information Technology, Indian Institute of Information Technology Allahabad, Allahabad, India
Sonali Agarwal
Department of Information Technology, Indian Institute of Information Technology Allahabad, Allahabad, India
Shekhar Verma
Department of Electrical Engineering and Computing Science, University of Cincinnati, Cincinnati, OH, USA
Dharma P. Agrawal

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sowmya, V., Rajeswari, A. (2020). Speech Emotion Recognition for Tamil Language Speakers. In: Agarwal, S., Verma, S., Agrawal, D. (eds) Machine Intelligence and Signal Processing. MISP 2019. Advances in Intelligent Systems and Computing, vol 1085. Springer, Singapore. https://doi.org/10.1007/978-981-15-1366-4_10

Download citation

DOI: https://doi.org/10.1007/978-981-15-1366-4_10
Published: 26 February 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-1365-7
Online ISBN: 978-981-15-1366-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics