Abstract
Nowadays, technological advances pose new frontiers to society. Artificial Intelligence (AI) has become one of the main research areas of interest due to its enormous possibilities. AI applications are spreading all over various fields, human-computer interaction is no exception. Since the turn of the millennium, the human-machine communication paradigm is shifting to a more efficient way where peripherals such as remote controls, mice or keyboards do not play the center role anymore. Machines are now expected to exhibit a human like behavior, in the sense of detecting, feeling or perceiving human actions, and reacting suitably. Non-intrusive sensing of stress, attention, burnout and emotions for instance, have become possible during human-machine interaction, using AI. In this context, areas such of sensors, images and audio processing and recognition, to name just a few, have been significantly developed. Emotion is an essential part of what means to be human, but it is still disregarded by most technical fields as something not to be considered in scientific or engineering projects. However, the understanding of emotion as an aspect of decision-making processes and of modelling of human behavior is essential to create a better connection between humans and their tools and machines. As voice remains the principal mean of communication of men and is also becoming a usual way of human-machine interaction, detecting emotions throughout speech becomes a powerful toll. In this work, an overview of such issues is done, and a framework to detect emotions throughout speech analysis is presented.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Cowie, R., et al.: Emotion recognition in human-computer interaction. IEEE Signal Process. Mag. 18(1), 32–80 (2001)
Rodrigues, M.F., Gonçalves, S.M., Santos, R., Fdez-Riverola, F., Carneiro, D.: Intelligent Tutoring: Active Monitoring And Recommendation. In: Interdisciplinary Perspectives on Contemporary Conflict Resolution, pp. 205–224. IGI Global (2016)
Carneiro, D., Rocha, H., Novais, P.: An environment for studying visual emotion perception. In: International Symposium on Ambient Intelligence, pp. 238–245. Springer, Cham, June 2017
Parrott, W.G. (ed.): Emotions in Social Psychology: Essential Readings. Psychology Press, Philadelphia (2001)
Wierzbicka, A.: Defining emotion concepts. Cogn. Sci. 16(4), 539–581 (1992)
Wu, C.H., Huang, Y.M., Hwang, J.P.: Review of affective computing in education/learning: trends and challenges. Br. J. Edu. Technol. 47(6), 1304–1323 (2016)
Barrett, L.F., Lewis, M., Haviland-Jones, J.M. (eds.): Handbook of Emotions. Guilford Publications, New York (2016)
Myers, D.G.: Theories of Emotion. Psychology, 7th edn., p. 500. Worth Publishers, New York (2004)
Averill, J.R.: The Acquisition of Emotions During Adulthood. The Social Construction of Emotions, pp. 98–118. Basil Blackwell, Oxford (1986)
Sharma, R.N.U., KIM, C.: Emotion recognition in spontaneous emotional utterances from movie sequences. In: WSEAS International Conference on Electronics, Control & Signal Processing [S.l.: s.n.] (2002)
Kandali, A.B., Routray, A., Basu, T.K.: Comparison of features based on MFCCs and Eigen values of autocorrelation matrix for cross-lingual vocal emotion recognition in five languages of Assam. In: IEEE India Conference (INDICON), 2009 Annual IEEE [S.l.], pp. 1–4 (2009)
Wu, C.-H., Liang, W.-B.: Emotion recognition of affective speech based on multiple classifiers using acoustic-prosodic information and semantic labels. IEEE Trans. Affect. Comput. 2(1), 10–21 (2011)
Ortony, A., Turner, T.J.: What’s basic about basic emotions? Psychological review. Am. Psychol. Assoc. 97(3), 315 (1990)
Plutchik, R.: The Emotions: Facts, Theories, and a New Model. New Random House, New York (1962)
Murray, I.R., Arnott, J.L.: Toward the simulation of emotion in synthetic speech: a review of the literature on human vocal emotion. J. Acoust. Soc. Am. 93(2), 1097–1108 (1993)
Alim, S.A., Rashid, N.K.A.: Some commonly used speech feature extraction algorithms. In: From Natural to Artificial Intelligence-Algorithms and Applications. IntechOpen (2018)
Eyben, F., et al.: Recent developments in openSMILE, the munich open-source multimedia feature extractor. In: Proceedings of ACM Multimedia 2013, pp. 835–838. ACM, Barcelona (2013)
Douglas-Cowie, E., et al.: Multimodal databases of everyday emotion: facing up to complexity. In: INTERSPEECH [S.l.: s.n.], pp. 813–816 (2005)
Bänziger, T., Scherer, K.R.: Using actor portrayals to systematically study multimodal emotion expression: the GEMEP corpus. In: Affective Computing and Intelligent Interaction [S.l.], pp. 476–487. Springer (2007)
Engberg, I.S., Hansen, A.V.: Documentation of the danish emotional speech database DES. Internal AAU report, Center for Person Kommunikation, Denmark (1996)
Burkhardt, F., et al.: A database of German emotional speech. In: Interspeech [S.l.: s.n.], pp. 1517–1520 (2005)
Sneddon, I., et al.: The belfast induced natural emotion database. IEEE Trans. Affect. Comput. 3(1), 32–41 (2012)
Douglas-Cowie, E., et al.: Emotional speech: towards a new generation of databases. Speech Commun. 40(1), 33–60 (2003)
Devillers, L., et al.: Real life emotions in French and English TV video clips: an integrated annotation protocol combining continuous and discrete approaches. In: 5th International Conference on Language Resources and Evaluation (LREC 2006), Genoa, Italy [S.l.: s.n.] (2006)
El Ayadi, M., Kamel, M.S., Karray, F.: Survey on speech emotion recognition: features, classification schemes, and databases. Pattern Recogn. 44(3), 572–587 (2011)
Teixeira, A., Rodrigues, M., Carneiro, D., Novais, P.: HORUS: an emotion recognition tool. In: Proceedings of SAI Intelligent Systems Conference, pp. 126–140. Springer, Cham, September 2019
Gonçalves, S., Rodrigues, M., Carneiro, D., Fdez-Riverola, F., Novais, P.: Boosting learning: non-intrusive monitoring of student’s efficiency. In: Methodologies and Intelligent Systems for Technology Enhanced Learning, pp. 73–80. Springer, Cham (2015)
Rodrigues, M., Novais, F.F.R.P.: An approach to assessing stress in eLearning students. In: Proceedings of the 11th European Conference on e-Learning: ECEL, p. 461 (2012)
Acknowledgment
This work has been supported by FCT - Fundação para a Ciência e a Tecnologia within the R&D Units project scope UIDB/00319/2020 and DSAIPA/AI/0099/2019 and “This work has been supported by national funds through FCT - Fundação para a Ciência e Tecnologia through project UIDB/04728/2020”.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Rodrigues, M., Durães, D., Santos, R., Analide, C. (2021). Emotion Detection Throughout the Speech. In: Arai, K., Kapoor, S., Bhatia, R. (eds) Intelligent Systems and Applications. IntelliSys 2020. Advances in Intelligent Systems and Computing, vol 1251. Springer, Cham. https://doi.org/10.1007/978-3-030-55187-2_25
Download citation
DOI: https://doi.org/10.1007/978-3-030-55187-2_25
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-55186-5
Online ISBN: 978-3-030-55187-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)