Abstract
In recent years, the diffusion of social networks has made large amounts of user-generated data containing people's opinions and feelings available. Such data are mostly unstructured and hence need to be enriched with a large set of metadata to allow for efficient data indexing and querying. In this work we focus on videos, and we extend traditional metadata extraction techniques by taking emotional metadata into account, in order to enable data analysis from an affective perspective. To this end, we present a three-phase methodology for the automatic extraction of emotional metadata from videos through facial expression recognition algorithms. We also propose a simple but versatile metadata model that captures variations in emotions among video chunks. Experiments on a real-world video dataset show that our non-linear classifier achieves 72% classification accuracy in facial expression recognition.
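The abstract describes extracting emotional metadata per video chunk, so that variations in emotions across the video are preserved. The sketch below illustrates one plausible form such a model could take; it is not the paper's actual pipeline. The `classify_frame` function is a hypothetical stand-in for a real facial expression recognizer (such as the paper's non-linear classifier), and the fixed-size chunking and majority-vote aggregation are illustrative assumptions.

```python
from collections import Counter
from dataclasses import dataclass

# Six basic emotion labels, as commonly used in facial expression
# recognition work (an assumption; the paper's label set is not
# given in this excerpt).
EMOTIONS = ["anger", "disgust", "fear", "happiness", "sadness", "surprise"]

@dataclass
class ChunkMetadata:
    start_frame: int
    end_frame: int
    dominant_emotion: str
    confidence: float  # fraction of frames agreeing with the dominant label

def classify_frame(frame) -> str:
    # Placeholder: a real system would detect the face in the frame
    # and classify its expression; here the label is precomputed.
    return frame["label"]

def extract_metadata(frames, chunk_size=4):
    """Split a frame sequence into fixed-size chunks and record the
    dominant emotion of each chunk, so the resulting metadata captures
    emotion variations across the video."""
    metadata = []
    for start in range(0, len(frames), chunk_size):
        chunk = frames[start:start + chunk_size]
        counts = Counter(classify_frame(f) for f in chunk)
        label, votes = counts.most_common(1)[0]
        metadata.append(ChunkMetadata(start, start + len(chunk) - 1,
                                      label, votes / len(chunk)))
    return metadata

# Toy input: 5 "happiness" frames followed by 3 "surprise" frames.
frames = [{"label": "happiness"}] * 5 + [{"label": "surprise"}] * 3
for m in extract_metadata(frames):
    print(m.start_frame, m.end_frame, m.dominant_emotion, m.confidence)
```

Storing one `(chunk range, dominant emotion, confidence)` record per chunk keeps the metadata compact while still supporting affective queries such as "videos whose final chunk is predominantly sad".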
Copyright information
© 2018 Springer Nature Switzerland AG
Cite this paper
Mircoli, A., Cimini, G. (2018). Automatic Extraction of Affective Metadata from Videos Through Emotion Recognition Algorithms. In: Benczúr, A., et al. New Trends in Databases and Information Systems. ADBIS 2018. Communications in Computer and Information Science, vol 909. Springer, Cham. https://doi.org/10.1007/978-3-030-00063-9_19
Print ISBN: 978-3-030-00062-2
Online ISBN: 978-3-030-00063-9