Action unit detection in 3D facial videos with application in facial expression retrieval and recognition

Danelakis, Antonios; Theoharis, Theoharis; Pratikakis, Ioannis

doi:10.1007/s11042-018-5699-9

Action unit detection in 3D facial videos with application in facial expression retrieval and recognition

Published: 28 March 2018

Volume 77, pages 24813–24841, (2018)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Antonios Danelakis ORCID: orcid.org/0000-0002-5925-2179¹,
Theoharis Theoharis¹ &
Ioannis Pratikakis²

615 Accesses
13 Citations
3 Altmetric
Explore all metrics

Abstract

This work introduces a new scheme for action unit detection in 3D facial videos. Sets of features that define action unit activation in a robust manner are proposed. These features are computed based on eight detected facial landmarks on each facial mesh that involve angles, areas and distances. Support vector machine classifiers are then trained using the above features in order to perform action unit detection. The proposed AU detection scheme is used in a dynamic 3D facial expression retrieval and recognition pipeline, highlighting the most important AU s, in terms of providing facial expression information, and at the same time, resulting in better performance than state-of-the-art methodologies.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Facial Expression Recognition Using Machine Learning and Deep Learning Techniques: A Systematic Review

Article 13 April 2024

A review on face recognition systems: recent approaches and challenges

Article 30 July 2020

Real-Time Human Pose Detection and Recognition Using MediaPipe

References

Ashraf AB, Lucey S, Cohn JF, Chen T, Ambadar Z, Prkachin KM, Solomon PE (2009) The painful face – pain expression recognition using active appearance models. Image Vis Comput 27(12):1788–1796
Article Google Scholar
Baltrušaitis T, Mahmoud M, Robinson P (2015) Cross-dataset learning and person-specific normalisation for automatic action unit detection. In: Automatic face and gesture recognition (FG), pp 1–6
Bartlett M, Littlewort G, Frank M, Lainscsek C, Fasel I, Movellan J (2006) Fully automatic facial action recognition in spontaneous behavior. In: 7th international conference on automatic face and gesture recognition (FGR), pp 223–230
Bayramoglu N, Zhao G, Pietikäinen M (2013) CS-3DLBP And geometry based person independent 3d facial action unit detection. In: International conference on biometrics, pp 1–6
Berretti S, Del Bimbo A, Pala P (2013) Automatic facial expression recognition in real-time from dynamic sequences of 3D face scans. Vis Comput 29(12):1333–1350
Article Google Scholar
Canavan SJ, Sun Y, Zhang X, Yin L (2012) A dynamic curvature based approach for facial activity analysis in 3D space. In: CVPR workshops, pp 14–19
Chang Y, Vieira MB, Turk M, Velho L (2005) Automatic 3D facial expression analysis in videos. In: IEEE workshop AMFG ’05, pp 293–307
Chen J, Liu X, Tu P, Aragones A (2013) Learning person-specific models for facial expression and action unit recognition. Pattern Recogn Lett 34(15):1964–1970
Article Google Scholar
Choi SS, Cha SH (2010) A survey of binary similarity and distance measures. Journal of Systemics Cybernetics and Informatics 8(1):43–48
Google Scholar
Chu WS, De La Torre F, Cohn JF (2013) Selective transfer machine for personalized facial action unit detection. In: The IEEE conference on computer vision and pattern recognition (CVPR), pp 3515–3522
Cosker D, Krumhuber E, Hilton A (2011) A FACS valid 3D dynamic action unit database with applications to 3D dynamic morphable facial modeling. In: Proc. ICCV ’11, pp 2296–2303
Dahmane M, Meunier J (2014) Prototype-based modeling for facial expression analysis. IEEE Trans Multimedia 16(6):1574–1584
Article Google Scholar
Danelakis A, Theoharis T, Pratikakis I (2014) Geotopo: dynamic 3D facial expression retrieval using topological and geometric information. In: Proc. 3d object retrieval 2014, pp 1–8
Danelakis A, Theoharis T, Pratikakis I, Perakis P (2016) An effective methodology for dynamic 3D facial expression retrieval. Pattern Recogn 52:174–185
Article Google Scholar
Dapogny A, Bailly K, Dubuisson S (2016) Confidence-weighted local expression predictions for occlusion handling in expression recognition and action unit detection. Int J Comput Vis 126(2-4):255–271
Article MathSciNet Google Scholar
Devijver PA, Kittler J (1982) Pattern recognition: a statistical approach. Prentice Hall, Englewood Cliffs
MATH Google Scholar
Donato G, Bartlett MS, Hager JC, Ekman P, Sejnowski TJ (1999) Classifying facial actions. IEEE Trans Pattern Anal Mach Intell 21(10):974–989
Article Google Scholar
Drira H, Ben Amor B, Daoudi M, Berretti S (2013) A dense deformation field for facial expression analysis in dynamic sequences of 3D scans. Springer International Publishing, Berlin, pp 148–159
Google Scholar
Ekman P, Friesen W (1978) Facial action coding system: a technique for the measurement of facial movement. Consulting Psychologists Press, Palo Alto
Google Scholar
Fang T, Zhao X, Shah SK, Kakadiaris IA (2011) 4D facial expression recognition. In: ICCV ’11, pp 1594–1601
Fang T, Zhao X, Ocegueda O, Shah SK, Kakadiaris IA (2012) 3d/4D facial expression analysis: an advanced annotated face model approach. Image Vis Comput 30(10):738–749
Article Google Scholar
Fasel B, Luettin J (1999) Automatic facial expression analysis: a survey. Pattern Recogn 36(1):259–275
Article MATH Google Scholar
Gao G, Xu M, Shen J, Ma H, Yan S (2016) Cast2face: assigning character names onto faces in movie with actor-character correspondence. IEEE Trans Circuits Syst Video Technol 26(12):2299–2312
Article Google Scholar
Gehrig T, Ekenel H (2011) A common framework for real-time emotion recognition and facial action unit detection. In: IEEE computer society conference on computer vision and pattern recognition workshops (CVPRW), pp 1–6
Gudi A, Tasli HE, den Uyl TM, Maroulis A (2015) Deep learning based facs action unit occurrence and intensity estimation. In: Automatic face and gesture recognition (FG), pp 1–5
Huang Y, Li Y, Fan N (2010) Robust symbolic dual-view facial expression recognition with skin wrinkles: Local versus global approach. IEEE Trans Multimedia 12(6):536–543
Article Google Scholar
Jaiswal S, Valstar M (2016) Deep learning the dynamic appearance and shape of facial action units. In: 2016 IEEE winter conference on applications of computer vision (WACV), pp 1–8
Jeni LA, Lórincz A, Nagy T, Palotai Z, Sebók J, Szabó Z, Takács D (2012) 3D shape estimation in video sequences provides high precision evaluation of facial expressions. Image Vis Comput 30(10):785–795
Article Google Scholar
Jeni LA, Girard JM, Cohn JF, Torre FDL (2013) Continuous au intensity estimation using localized, sparse facial feature space. In: 2013 10th IEEE international conference and workshops on automatic face and gesture recognition (FG), pp 1–7
Jiang B, Valstar MF, Pantic M (2011) Action unit detection using sparse appearance descriptors in space-time video volumes. In: Automatic face gesture recognition and workshops, pp 314–321
Khademi M, Morency LP (2014) Relative facial action unit detection. In: IEEE winter conference on applications of computer vision (WACV), pp 1090–1095
Kotsia I, Zafeiriou S, Pitas I (2008) Texture and shape information fusion for facial expression and facial action unit recognition. Pattern Recogn 41(3):833–851
Article MATH Google Scholar
Le V, Tang H, Huang TS (2011) Expression recognition from 3D dynamic faces using robust spatio-temporal shape features. In: IEEE FG ’11, pp 414–421
Lien JJJ, Kanade T, Cohn J, Li CC (1998) Subtly different facial expression recognition and expression intensity estimation. In: IEEE conference on computer vison and pattern recogntion, pp 853–859
Liu L, Cheng L, Liu Y, Jia Y, Rosenblum DS (2016) Recognizing complex activities by a probabilistic interval-based model. In: Proceedings of the thirtieth AAAI conference on artificial intelligence, pp 1266–1272
Liu Y, Nie L, Han L, Zhang L, Rosenblum DS (2016) Action2activity: recognizing complex activities from sensor data. Computer Vision and Pattern Recognition. arXiv:1611.01872
Liu Y, Nie L, Liu L, Rosenblum DS (2016) From action to activity: sensor-based activity recognition. Neurocomputing 181(Supplement C):108–115
Article Google Scholar
Lucey P, Cohn JF, Matthews I, Lucey S, Sridharan S, Howlett J, Prkachin KM (2011) Automatically detecting pain in video through facial action units. IEEE Trans Syst Man Cybern B Cybern 41(3):664–674
Article Google Scholar
Maalej A, Amor BB, Daoudi M, Srivastava A, Berretti S (2010) Local 3d shape analysis for facial expression recognition. In: 2010 20th international conference on pattern recognition, pp 4129–4132
Maalej A, Amor BB, Daoudi M, Srivastava A, Berretti S (2011) Shape analysis of local facial patches for 3d facial expression recognition. Pattern Recogn 44 (8):1581–1589
Article Google Scholar
Mahoor MH, Zhou M, Veon KL, Mavadati SM, Cohn JF (2011) Facial action unit recognition with sparse representation. In: Automatic face gesture recognition and workshops, pp 336–342
Mao Q, Rao Q, Yu Y, Dong M (2017) Hierarchical bayesian theme models for multipose facial expression recognition. IEEE Trans Multimedia 19(4):861–873
Article Google Scholar
Pantic M, Patras I (2006) Dynamics of facial expression: recognition of facial actions and their temporal segments from face profile image sequences. IEEE Trans Syst Man Cybern B Cybern 36(2):433–449
Article Google Scholar
Pantic M, Rothkrantz LJM (2000) Automatic analysis of facial expressions: the state of the art. IEEE Trans Pattern Anal Mach Intell 22(12):1424–1445
Article Google Scholar
Pantic M, Rothkrantz LJM (2004) Facial action recognition for facial expression analysis from static face images. IEEE Trans Syst Man Cybern B Cybern 34(3):1449–1461
Article Google Scholar
Perakis P, Theoharis T, Kakadiaris IA (2014) Feature fusion for facial landmark detection. Pattern Recogn 47(9):2783–2793
Article Google Scholar
Pinto SCD, Mena-Chalco JP, Lopes FM, Velho L (2011) Cesar, r.: 3d facial expression analysis by using 2d and 3d wavelet transforms. In: 2011 18th IEEE international conference on image processing (ICIP). IEEE, pp 1281–1284
Ramanathan S, Kassim A, Venkatesh YV, Wah WS (2006) Human facial expression recognition using a 3d morphable model. In: 2006 International conference on image processing, pp 661–664
Reale M, Zhang X, Yin L (2013) Nebula feature: a space-time feature for posed and spontaneous 4D facial behavior analysis. In: 10th IEEE international conference and workshops on automatic face and gesture recognition (FG), pp 1–8
Rosato M, Chen X, Yin L (2008) Automatic registration of vertex correspondences for 3D facial expression analysis. In: IEEE international conference on biometrics: theory, applications and systems, pp 1–7
Ruiz A, Van de Weijer J, Binefa X (2015) From emotions to action units with hidden and semi-hidden-task learning. In: The IEEE international conference on computer vision (ICCV), pp 3703–3711
Sandbach G, Zafeiriou S, Pantic M (2012) Binary pattern analysis for 3D facial action unit detection. In: Proceedings of the british machine vision conference, BMVC 2012, Surrey, UK, pp 119.1–119.12
Sandbach G, Zafeiriou S, Pantic M (2012) Local normal binary patterns for 3D facial action unit detection. In: 19th IEEE international conference on image processing (ICIP), pp 1813–1816
Sandbach G, Zafeiriou S, Pantic M, Rueckert D (2012) Recognition of 3D facial expression dynamics. Elsevier Image and Vision Computing 30(10):762–773
Article Google Scholar
Savran A, Sankur B (2009) Automatic detection of facial actions from 3d data. In: 2009 IEEE 12th international conference on computer vision workshops, ICCV workshops, pp 1993–2000
Savran A, Sankur B (2010) Detecting action units on 3d faces. In: 2010 IEEE 18th signal processing and communications applications conference, pp 300–303
Savran A, Sankur B (2011) Detecting 3d facial action units via registration. In: 2011 IEEE 19th signal processing and communications applications conference (SIU), pp 371–374
Savran A, Sankur B, Bilge MT (2012) Comparative evaluation of 3d vs. 2d modality for automatic detection of facial action units. Pattern Recogn 45(2):767–782
Article Google Scholar
Senechal T, Bailly K, Prevost L (2014) Impact of action unit detection in automatic emotion recognition. Pattern Anal Applic 17(1):51–67
Article MathSciNet Google Scholar
Senechal T, McDuff D, el Kaliouby R (2015) Facial action unit detection using active learning and an efficient non-linear kernel approximation. In: The IEEE international conference on computer vision (ICCV) workshops, pp 10–18
Senechal T, Rapp V, Salam H, Seguier R, Bailly K, Prevost L (2011) Combining aam coefficients with lgbp histograms in the multi-kernel svm framework to detect facial action units. In: 2011 IEEE international conference on automatic face & gesture recognition and workshops (FG 2011). IEEE, pp 860–865
Senechal T, Rapp V, Salam H, Seguier R, Bailly K, Prevost L (2012) Facial action recognition combining heterogeneous features via multikernel learning. IEEE Trans Syst Man Cybern B Cybern 42(4):993–1005
Article Google Scholar
Simon T, Nguyen MH, Torre FDL, Cohn JF (2010) Action unit detection with segment-based svms. In: 2010 IEEE conference on computer vision and pattern recognition (CVPR), pp 2737–2744
Song Y, McDuff D, Vasisht D, Kapoor A (2015) Exploiting sparsity and co-occurrence structure for action unit recognition. In: 2015 11th IEEE international conference and workshops on automatic face and gesture recognition (FG), pp 1–8
Soyel H, Demirel H (2010) Optimal feature selection for 3d facial expression recognition using coarse-to-fine classification. Turk J Electr Eng Comput Sci 18 (6):1031–1040
Google Scholar
Sun Y, Yin L (2008) Facial expression recognition based on 3D dynamic range model sequences. In: Springer proc. ECCV ’08: part II, pp 58–71
Sun Y, Reale M, Yin L (2008) Recognizing partial facial action units based on 3d dynamic range data for facial expression recognition. In: Automatic face gesture recognition, pp 1–8
Sun Y, Reale M, Yin L (2008) Recognizing partial facial action units based on 3D dynamic range data for facial expression recognition. In: FG ’08, pp 1–8
Sun Y, Chen X, Rosato MJ, Yin L (2010) Tracking vertex flow and model adaptation for three-dimensional spatiotemporal face analysis. IEEE Trans Syst Man Cybern Syst Hum 40(3):461–474
Article Google Scholar
Tang H, Huang TS (2008) 3d facial expression recognition based on properties of line segments connecting facial feature points. In: 2008 8th IEEE international conference on automatic face gesture recognition, pp 1–6
Tao D, Song M, Li X, Shen J, Sun J, Wu X, Faloutsos C, Maybank SJ (2008) Bayesian tensor approach for 3-d face modeling. IEEE Trans Circuits Syst Video Technol 18(10):1397–1410
Article Google Scholar
Tawari A, Trivedi MM (2013) Face expression recognition by cross modal data association. IEEE Trans Multimedia 15(7):1543–1552
Article Google Scholar
Tian YI, Kanade T, Cohn JF (2001) Recognizing action units for facial expression analysis. IEEE Trans Pattern Anal Mach Intell 23(2):97–115
Article Google Scholar
Tong Y, Liao W, Ji Q (2007) Facial action unit recognition by exploiting their dynamic and semantic relationships. IEEE Trans Pattern Anal Mach Intell 29 (10):1683–1699
Article Google Scholar
Tsalakanidou F, Malassiotis S (2009) Robust facial action recognition from real-time 3D streams. In: CVPR ’09, pp 4–11
Tsalakanidou F, Malassiotis S (2010) Real-time 2d + 3D facial action and expression recognition. Pattern Recogn 43(5):1763–1775
Article Google Scholar
Tulyakov S, Vieriu RL, Sangineto E, Sebe N (2015) Facecept3d: real time 3d face tracking and analysis. In: The IEEE international conference on computer vision (ICCV) workshops, pp 29–33
Valstar M, Pantic M (2006) Fully automatic facial action unit detection and temporal analysis. In: Computer vision and pattern recognition workshop (CVPRW), pp 149–149
Valstar MF, Almaev T, Girard JM, McKeown G, Mehu M, Yin L, Pantic M, Cohn JF (2015) Fera 2015 - second facial expression recognition and analysis challenge. In: Automatic face and gesture recognition (FG), pp 1–8
Valstar MF, Sánchez-Lozano E, Cohn JF, Jeni LA, Girard JM, Zhang Z, Yin L, Pantic M (2017) Fera 2017-addressing head pose in the third facial expression recognition and analysis challenge. arXiv:1702.04174
van der Maaten L, Hendriks E (2012) Action unit classification using active appearance models and conditional random fields. Cogn Process 13(2):507–518
Article Google Scholar
Venkatesh YV, Kassim AK, Murthy OVR (2010) Resampling approach to facial expression recognition using 3d meshes. In: 2010 20th international conference on pattern recognition, pp 3772–3775
Walecki R, Rudovic O, Pavlovic V, Pantic M (2015) Variable-state latent conditional random fields for facial expression recognition and action unit detection. In: Automatic face and gesture recognition, vol 1, pp 1–8
Wegrzyn M, Vogt M, Kireclioglu B, Schneider J, Kissler J (2017) Mapping the emotional face. How individual face parts contribute to successful emotion recognition. PLOS ONE 12(5):1–15. https://doi.org/10.1371/journal.pone.0177239
Article Google Scholar
Wu CH, Wei WL, Lin JC, Lee WY (2013) Speaking effect removal on emotion recognition from facial expressions based on eigenface conversion. IEEE Trans Multimedia 15(8):1732–1744
Article Google Scholar
Xie L, Shen J, Han J, Zhu L, Shao L (2017) Dynamic multi-view hashing for online image retrieval. In: Proceedings of the twenty-sixth international joint conference on artificial intelligence, IJCAI-17, pp 3133–3139
Yan J, Zheng W, Xu Q, Lu G, Li H, Wang B (2016) Sparse kernel reduced-rank regression for bimodal emotion recognition from facial expression and speech. IEEE Trans Multimedia 18(7):1319–1329
Article Google Scholar
Yüce A, Gao H, Thiran JP (2015) Discriminant multi-label manifold embedding for facial action unit detection. In: Automatic face and gesture recognition (FG), pp 1–6
Yeasin M, Bullot B, Sharma R (2006) Recognition of facial expressions and measurement of levels of interest from video. IEEE Trans Multimedia 8(3):500–508
Article Google Scholar
Yin L, Wei X, Longo P, Bhuvanesh A (2006) Analyzing facial expressions using intensity-variant 3D data for human computer interaction. In: Proc. ICPR ’06, pp 1248–1251
Yin L, Chen X, Sun Y, Worm T, Reale M (2008) A high-resolution 3D dynamic facial expression database. In: IEEE Proc. FG ’08, pp 1–6
Yudin E, Wetzler A, Sela M, Kimmel R (2015) Improving 3d facial action unit detection with intrinsic normalization. In: Proceedings of the 1st international workshop on DIFFerential geometry in computer vision for analysis of shapes, images and trajectories (DIFF-CV 2015), 5, pp 1–10
Zafeiriou S, Pitas I (2008) Discriminant graph structures for facial expression recognition. IEEE Trans Multimedia 10(8):1528–1540
Article Google Scholar
Zen G, Porzi L, Sangineto E, Ricci E, Sebe N (2016) Learning personalized models for facial expression analysis and gesture recognition. IEEE Trans Multimedia 18(4):775–788
Article Google Scholar
Zeng J, Chu WS, Torre FDl, Cohn JF, Xiong Z (2015) Confidence preserving machine for facial action unit detection. In: 2015 IEEE international conference on computer vision (ICCV), pp 3622–3630
Zhang X, Reale M, Yin L (2013) Nebula feature: a space-time feature for posed and spontaneous 4D facial behavior analysis. In: IEEE FG ’13
Zhang X, Yin L, Cohn JF, Canavan S, Reale M, Horowitz A, Liu P, Girard J (2014) BP4d spontaneous: a high-resolution spontaneous 3D dynamic facial expression database. Image Vis Comput 32(10):692–706
Article Google Scholar
Zhang T, Zheng W, Cui Z, Zong Y, Yan J, Yan K (2016) A deep neural network-driven feature learning method for multi-view facial expression recognition. IEEE Trans Multimedia 18(12):2528–2536
Article Google Scholar
Zhao X, Dellandréa E, Chen L, Samaras D (2010) Au recognition on 3d faces based on an extended statistical facial feature model. In: 2010 fourth IEEE international conference on biometrics: theory applications and systems (BTAS). IEEE, pp 1–6
Zhao X, Dellandréa E, Zou J, Chen L (2013) A unified probabilistic framework for automatic 3d facial expression analysis based on a bayesian belief inference and statistical feature models. Image Vis Comput 31(3):231–245
Article Google Scholar
Zhao K, Chu WS, De la Torre F, Cohn JF, Zhang H (2015) Joint patch and multi-label learning for facial action unit detection. In: The IEEE conference on computer vision and pattern recognition (CVPR)
Zhen Q, Huang D, Wang Y, Chen L (2016) Muscular movement model-based automatic 3d/4d facial expression recognition. IEEE Trans Multimedia 18(7):1438–1450
Article Google Scholar
Zhu Y, la Torre FD, Cohn JF, Zhang YJ (2009) Dynamic cascades with bidirectional bootstrapping for spontaneous facial action unit detection. In: 2009 3rd international conference on affective computing and intelligent interaction and workshops, pp 1–8

Download references

Author information

Authors and Affiliations

Department of Computer & Information Science, Norwegian University of Science and Technology, 7034, Trondheim, Norway
Antonios Danelakis & Theoharis Theoharis
Department of Electrical & Computer Engineering, Democritus University of Thrace, 67100, Xanthi, Greece
Ioannis Pratikakis

Authors

Antonios Danelakis
View author publications
You can also search for this author in PubMed Google Scholar
Theoharis Theoharis
View author publications
You can also search for this author in PubMed Google Scholar
Ioannis Pratikakis
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Antonios Danelakis.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Danelakis, A., Theoharis, T. & Pratikakis, I. Action unit detection in 3D facial videos with application in facial expression retrieval and recognition. Multimed Tools Appl 77, 24813–24841 (2018). https://doi.org/10.1007/s11042-018-5699-9

Download citation

Received: 25 July 2017
Revised: 12 January 2018
Accepted: 22 January 2018
Published: 28 March 2018
Issue Date: October 2018
DOI: https://doi.org/10.1007/s11042-018-5699-9

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Action unit detection in 3D facial videos with application in facial expression retrieval and recognition

Abstract

Access this article

Similar content being viewed by others

Facial Expression Recognition Using Machine Learning and Deep Learning Techniques: A Systematic Review

A review on face recognition systems: recent approaches and challenges

Real-Time Human Pose Detection and Recognition Using MediaPipe

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Action unit detection in 3D facial videos with application in facial expression retrieval and recognition

Abstract

Access this article

Similar content being viewed by others

Facial Expression Recognition Using Machine Learning and Deep Learning Techniques: A Systematic Review

A review on face recognition systems: recent approaches and challenges

Real-Time Human Pose Detection and Recognition Using MediaPipe

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation