Abstract
We introduce a system that processes a sequence of images of a front-facing human face and recognises a set of facial expressions. We use an efficient appearance-based face tracker to locate the face in the image sequence and estimate the deformation of its non-rigid components. The tracker works in real time. It is robust to strong illumination changes and factors out changes in appearance caused by illumination from changes due to face deformation. We adopt a model-based approach for facial expression recognition. In our model, an image of a face is represented by a point in a deformation space. The variability of the classes of images associated with facial expressions is represented by a set of samples which model a low-dimensional manifold in the space of deformations. We introduce a probabilistic procedure based on a nearest-neighbour approach to combine the information provided by the incoming image sequence with the prior information stored in the expression manifold to compute a posterior probability associated with a facial expression. In the experiments conducted we show that this system is able to work in an unconstrained environment with strong changes in illumination and face location. It achieves an 89% recognition rate in a set of 333 sequences from the Cohn–Kanade database.
Similar content being viewed by others
References
Flanagan JL, Huang TS (2003) Special issue on human–computer multimodal interface. Proc IEEE, 91(9):1267–1468
Alkoot FM, Kittler J (2002) Moderating k-nn classifiers. Pattern Anal Appl 5:326–332
Baker S, Matthews I , Schneider J (2004) Automatic construction of active appearance models as an image coding problem. IEEE Trans Pattern Anal Mach Intell 26(10):1380–1384
Barlett MS, Littlewort G, Braathen B, Sejnowki T, Movellán J (2003) A prototype for automatic recognition of spontaneous facial actions. In: Becker S, Obermayer K (eds) Advances in neural information processing systems, vol 15. MIT Press, Cambridge, pp 1271–1278
Barlett MS, Littlewort G, Frank M, Lainscsek C, Fasel IR, Movellán J (2005) Recognizing facial expression: machine learning and application to spontaneous behaviour. In: Proceedings of CVPR, vol 2, pp 568–573
Bascle B, Blake A (1998) Separability of pose and expression in facial tracing and animation. In: Proceedings of international conference on computer vision. IEEE, Washington, pp 323–328
Basili JN (1979) Emotion recognition: the role of facial movement and the relative importance of upper and lower area of the face. J Pers Soc Psychol 37:2049–2059
Belhumeur PN, Hespanha JP, Kriegman DJ (1997) Eigenfaces vs. fisherfaces: recognition using class specific linear projection. IEEE Trans Pattern Anal Mach Intell 19(7):711–720
Belkin M, Niyogi P (2001) Laplacian eigenmaps and spectral techniques for embedding and clustering. In: Advances in neural information processing systems, pp 585–591
Black MJ, Jepson AD (1998) Eigentracking: robust matching and tracking of articulated objects using a view-based representation. Int J Comput Vis 26(1):63–84
Black MJ, Yacoob Y (1997) Recognizing facial expressions in image sequences using local parameterized models of image motion. Int J Comput Vis 25(1):23–48
Blanz V, Vetter T (1999) A morphable model for the synthesis of 3d faces. In: Proceedings of SIGGRAPH. ACM Press, New York, pp 187–194
Buenaposada JM (2001) Buenaposada and Luis Baumela: variations of grey world for face tracking. Image Process Commun 7(3, 4):51–61
Buenaposada JM (2002) Buenaposada and Luis Baumela: real-time tracking and estimation of plane pose. In: Proceedings of international conference on pattern recognition, vol II, QC, Canada, August 2002. IEEE, Washington, pp 697–700
Buenaposada JM, Muñoz E, Baumela L (2006) Efficiently estimating facial expression and illumination in appearance-based tracking. In: Proceedings British machine vision conference, vol I, pp 57–66
Chang Y, Hu C, Turk M (2004) Probabilistic expression analysis on manifolds. In: Proceedings of CVPR, vol 2, pp 520–527
Cohen I, Sebe N, Garg A, Chen LS, Huang TS (2003) Facial expression recognition from video sequences: temporal and static modeling. Comput Vis Image Underst 91:160–187
Cootes T, Edwards GJ, Taylor C (2001) Active appearance models. IEEE Trans Pattern Anal Mach Intell 23(6):681–685
Cowie R, Douglas-Cowie E, Tsapatsoulis N, Votsis G, Kollias S, Fellenz W, Taylor JG (2001) Emotion recognition in human–computer interaction. Signal Process Mag, 18(1):32–80
DeCarlo D, Metaxas D (2000) Optical flow constraints on deformable models with applications to face tracking. Int J Comput Vis 38(2):99–127
DeCarlo D, Metaxas D(2000) Optical flow constraints on deformable models with applications to face tracking. Int J Comput Vis 38(2):99–127
Duda RO, Hart PE, Stork DG (2000) Pattern classification. Wiley, New York
Ekman P (1993) Facial expression and emotion. Am Psychol 44:384–392
Ekman P (1994) Strong evidence for universals in facial expressions: a reply to Russell’s mistaken critique. Psychol Bull 115(2):268–287
Essa I, Pentland A (1997) Coding, analysis, interpretation, recognition of facial expressions. IEEE Trans Pattern Anal Mach Intell 19(7):757–763
Fasel B, Luettin J (2003) Automatic facial expression analysis: a survey. Pattern Recognit 36:259–275
Gao Y, Leung MKH, Hui SC, Tanada MW (2003) Facial expression recognition from line-based caricatures. Trans SMC A 33(3):407–412
Gee A, Cipolla R (1996) Fast visual tracking by temporal consensus. Image Vis Comput 14(2):105–114
Hager G, Belhumeur P (1998) Efficient region tracking with parametric models of geometry and illumination. IEEE Trans Pattern Anal Mach Intell 20(10):1025–1039
He X, Niyogi P (2003) Locality preserving projections. In: Thrun S, Saul L, Schölkopf B (eds) Advances in neural information processing systems, vol 16. MIT Press, Cambridge
Hu C, Ferris R, Turk M (2003) Active wavelet networks for face alignment. In: Proceedings of British machine vision conference
Jongwoo L, Ross D, Ruei-Sung L, Ming-Hsuan Y (2004) Incremental learning for visual tracking. In: Advances in neural information processing systems, vol 17. MIT Press, Cambridge, 793–800
Kanade T, Cohn J, Tian Y-L (2000) Comprehensive database for facial expression analysis. In: Proceedings of international conference on automatic face and gesture recognition, pp 46–53
Lanitis A, Taylor CJ, Cootes TF (1997) Automatic interpretation and coding of face images using flexible models. IEEE Trans Pattern Anal Mach Intell 19(7):743–756
Lien JJ, Kanade T, Cohn JF, Li C (1997) Detection, tracking and classification of action units in facial expression. J Rob Auton Syst 31:131–146
Lyons MJ, Budynek L, Akamatsu S (1999) Automatic classification of single facial images. IEEE Trans Pattern Anal Mach Intell 21(12):1357–1362
Mase K (1991) Recognition of facial expression from optical flow. IEICE Trans E 74(10):3474–3483
Matthews I, Baker S (2004) Active appearance models revisited. Int J Comput Vis 60(2):135–164
McTear MF (2002) Spoken dialogue technology: enabling the conversational user interface. ACM Comput Surv 34(1):90–169
Michel P, El Kaliouby R (2003) Real time facial expression recognition in video using support vector machines. In: Proceedings of international conference on multimodal interfaces. ACM, New York, pp 258–264
Muñoz E, Buenaposada JM, Baumela L (2005) Efficient model-based 3d tracking of deformable objects. In: Proceedings of international conference on computer vision, Beijing, China, vol I, pp 877–882,
Ohya J, Kitamura Y, Takemura H, Ishi H, Kishino F, Terashima N (1996) Virtual space teleconferencing: real-time reproduction of 3d human images. J Vis Commun Image Represent 6(1):1–25
Ojala T, Pietikainen M, Menpp T (2002) Multirresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans Pattern Anal Mach Intell 24(7):971–987
Oliver N, Pentland A, Bérard F (2000) Lafter: a real-time face and lips tracker with facial expression recognition. Pattern Recognit 33:1369–1382
Panti M, Rothkrantz LJM (2000) Automatic analysis of facial expressions: the state of the art. IEEE Trans Pattern Anal Mach Intell 22(12):1424–1445
Pantic M, Rothkrantz LJM (2000) Expert system for automatic analysis of facial expressions. Image Vis Comput 18(11):881–905
Picard RW (1997) Affective computing. MIT Press, Cambridge
Raducanu B, Graña M, Albizuri FX, d’Anjou A (2004) A probabilistic hit-and-miss transform for face localization. Pattern Anal Appl 7:117–127
Rani P, Liu C, Sarkar N (2006) An empirical study of machine learning techniques for affect recognition in human-robot interaction. Pattern Anal Appl 9:58–69
Romdhani S, Vetter T (2003) Efficient robust and accurate fitting of a 3rd morphable model. In: Proceedings of international conference on computer vision, vol 1, pp 59–66
Rose N (2006) Facial expression classification using gabor and log-gabor filters. In: Proceedings of international conference on automatic face and gesture recognition
Rosenblum M, Yacoob Y, Davis LS (1996) Human expression recognition from motion using radial basis function network architecture. IEEE Trans Neural Netw 7(5):1121–1138
Roweis S, Saul LK (2000) Nonlinear dimensionality reduction by locally linear embedding. Science 290(5500):2323–2326
Rowley H, Baluja S, Kanade T (1998) Neural network-based face detection. IEEE Trans Pattern Anal Mach Intell 20(1):23–28
Shan C, Gong S, McOwan PW (2005) Appearance manifold of facial expression. In: IEEE international workshop on human–computer interaction
Shan C, Gong S, McOwan PW (2006) Dynamic facial expression recognition using a bayesian temporal manifold model. In: Proceedings of British machine vision conference vol 1, pp, 297–306
Tenenbaum JB, de Silva V, Langford JC (2000) A global geometric framework for nonlinear dimensionality reduction. Science 290(5500):2319–2323
Terzopoulos D, Waters K (1993) Analysis and synthesis of facial image sequences using physical and anatomical models. IEEE Trans Pattern Anal Mach Intell 15(6):569–579
Tian Y, Kanade T, Cohn J (2001) Recognizing action units for facial expression analysis. IEEE Trans Pattern Anal Mach Intell 23(2):97–115
Tong Y, Liao W, Ji Q (2006). Inferring facial action units with causal relations. In: Proceedings of CVPR , vol 2, pp 1623–1630
Turk M, Pentland A (1991) Eigenfaces for recognition. J Cogn Neurosci 3(1):71–86
Viola P, Jones MJ (2004) Robust real-time face detection. Int J Comput Vis 57(2):137–154
Wang J, Yin L, Wei X, Sun Y (2006) 3d facial expression recognition based on primitive surface feature distribution. In: Proceedings of CVPR, vol 2, pp 1399–1406
Xu Y, Roy-Chowdhury AK (2007) Integrating motion, illumination and structure in video sequences with applications in illumination-invariant tracking. IEEE Trans Pattern Anal Mach Intell 29(5):793–806
Yacoob Y, Davis LS (1996) Recognizing human facial expressions from long image sequences using optical flow. IEEE Trans Pattern Anal Mach Intell 18(6):636–642
Yeasin M, Bullot B, Sharma R (2004) From facial expression to levels of interest: a spatio-temporal approach. In: Proceedings of CVPR, vol 2, 922–927
Zhang Y, Ji Q (2003) Facial expression understanding in image sequences using dynamic and active information fusion. In: Proceedings of international conference on computer vision, Nice, France
Zhang Y, Ji Q (2005) Active and dynamic information fusion for facial expression understanding from image sequences. IEEE Trans Pattern Anal Mach Intell 27(5):1–16
Zhang Z, Lyons M, Schuster M, Akamatsu S (1998) Comparison between geomtry-based and gabor wavelets-based facial expression recognition using multi-layer perceptron. In: Proceedings of international conference on automatic face and gesture recognition, Nara, Japan, pp 454–459
Zhao G, Piettikäinen M (2007) Dynamic texture recognition using local binary patterns with an application to facial expressions. IEEE Trans Pattern Anal Mach Intell 29(6):915–928
Acknowledgments
The authors gratefully acknowledge funding from the Spanish Ministerio de Educación y Ciencia under contract TRA2005-08529-C02-02 . They also thank the anonymous reviewers for their comments and Jeffrey Cohn and Takeo Kanade for providing the Cohn–Kanade image database.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Buenaposada, J.M., Muñoz, E. & Baumela, L. Recognising facial expressions in video sequences. Pattern Anal Applic 11, 101–116 (2008). https://doi.org/10.1007/s10044-007-0084-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10044-007-0084-8