Skip to main content

Combined Support Vector Machines and Hidden Markov Models for Modeling Facial Action Temporal Dynamics

  • Conference paper
Human–Computer Interaction (HCI 2007)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4796))

Included in the following conference series:

Abstract

The analysis of facial expression temporal dynamics is of great importance for many real-world applications. Being able to automatically analyse facial muscle actions (Action Units, AUs) in terms of recognising their neutral, onset, apex and offset phases would greatly benefit application areas as diverse as medicine, gaming and security. The base system in this paper uses Support Vector Machines (SVMs) and a set of simple geometrical features derived from automatically detected and tracked facial feature point data to segment a facial action into its temporal phases. We propose here two methods to improve on this base system in terms of classification accuracy. The first technique describes the original time-independent set of features over a period of time using polynomial parametrisation. The second technique replaces the SVM with a hybrid SVM/Hidden Markov Model (HMM) classifier to model time in the classifier. Our results show that both techniques contribute to an improved classification accuracy. Modeling the temporal dynamics by the hybrid SVM-HMM classifier attained a statistically significant increase of recall and precision by 4.5% and 7.0%, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Ekman, P., Friesen, W.V., Hager, J.C.: Facial Action Coding System. A Human Face, Salt Lake City (2002)

    Google Scholar 

  2. Cohn, J.F.: Foundations of human computing: Facial expression and emotion. In: Proc. ACM Int’l Conf. Multimodal Interfaces, vol. 1, pp. 610–616 (2006)

    Google Scholar 

  3. Tian, Y., Kanade, T., Cohn, J.: Recognizing action units for facial expression analysis. IEEE Trans. Pattern Analysis and Machine Intelligence 23(2), 97–115 (2001)

    Article  Google Scholar 

  4. Valstar, M.F., Pantic, M.: Fully automatic facial action unit detection and temporal analysis. In: Proc. IEEE Conference on Computer Vision and Pattern Recognition, p. 149 (2006)

    Google Scholar 

  5. Bartlett, M.S., Littlewort, G., Lainscsek, C., Fasel, I., Movellan, J.: Machine learning methods for fully automatic recognition of facial expressions and actions. In: Proc. IEEE Int’l Conf. on Systems, Man and Cybernetics, vol. 1, pp. 592–597 (2004)

    Google Scholar 

  6. Pantic, M., Rothkrantz, L.J.M.: Toward an affect-sensitive multimodal human-computer interaction. Proc. IEEE 91(9), 1370–1390 (2003)

    Article  Google Scholar 

  7. Tian, Y.L., Kanade, T., Cohn, J.F.: Handbook of Face Recognition. Springer, Heidelberg (2005)

    Book  Google Scholar 

  8. Cohn, J.F., Schmidt, K.L.: The timing of facial motion in posed and spontaneous smiles. J. Wavelets, Multi-resolution and Information Processing 2(2), 121–132 (2004)

    Article  Google Scholar 

  9. Bassili, J.N.: Facial motion in the perception of faces and of emotional expression. J. Experimental Psychology 4(3), 373–379 (1978)

    Google Scholar 

  10. Hess, U., Kleck, R.E.: Differentiating emotion elicited and deliberate emotional facial expressions. European J. of Social Psychology 20(5), 369–385 (1990)

    Article  Google Scholar 

  11. de Williams, A.C.: Facial expression of pain: An evolutionary account. Behavioral and Brain Sciences 25(4), 439–488 (2006)

    Article  Google Scholar 

  12. Ekman, P.: Darwin, deception, and facial expression. Annals of New York Ac. of sciences 1000, 105–221 (2003)

    Google Scholar 

  13. Ekman, P., Rosenberg, E.L.: What the face reveals: Basic and applied studies of spontaneous expression using the Facial Action Coding System. Oxford University Press, Oxford (2005)

    Google Scholar 

  14. Valstar, M.F., Pantic, M., Ambadar, Z., Cohn, J.F.: Spontaneous vs. posed facial behavior: automatic analysis of brow actions. In: Proc. ACM Intl. conf. on Multimodal Interfaces, pp. 162–170 (2006)

    Google Scholar 

  15. Fasel, I.R., Fortenberry, B., Movellan, J.R.: A generative framework for real time object detection and classification. Comp. Vision, and Image Understanding 98(1), 181–210 (2005)

    Google Scholar 

  16. Viola, P., Jones, M.: Robust real-time object detection. Technical report CRL 200001/01 (2001)

    Google Scholar 

  17. Friedman, J., Hastie, T., Tibshirani, R.: Additive logistic regression: A statistical view of boosting. The Annals of Statistics 28(2), 337–374 (2000)

    Article  MATH  MathSciNet  Google Scholar 

  18. Vukdadinovic, D., Pantic, M.: Fully automatic facial feature point detection using gabor feature based boosted features. In: Proc. IEEE Int’l Conf. on Systems, Man and Cybernetics, pp. 1692–1698 (2005)

    Google Scholar 

  19. Patras, I., Pantic, M.: Particle filtering with factorized likelihoods for tracking facial features. In: Proc. Int’l Conf. Automatic Face & Gesture Recognition, pp. 97–102 (2004)

    Google Scholar 

  20. Patras, I., Pantic, M.: Tracking deformable motion. In: Proc. Int’l Conf. Systems, Man and Cybernetics, pp. 1066–1071 (2005)

    Google Scholar 

  21. Bourlard, H., Morgan, N.: Hybrid hmm/ann systemds for speech recognition: Overview and new research directions. In: Giles, C.L., Gori, M. (eds.) Adaptive Processing of Sequences and Data Structures. LNCS (LNAI), vol. 1387, pp. 389–417. Springer, Heidelberg (1998)

    Chapter  Google Scholar 

  22. Kruger, S.E., Schaffner, M., Katz, M., Andelic, E., Wendemuth, A.: Speech recognition with support vector machines in a hybrid system. In: Interspeech, pp. 993–996 (2005)

    Google Scholar 

  23. Platt, J.: Probabilistic outputs for support vector machines and comparison to regularized likelihood methods, pp. 61–74. Cambridge, MA (2000)

    Google Scholar 

  24. Pantic, M., Valstar, M.F., Rademaker, R., Maat, L.: Web-based database for facial expression analysis. In: Proc. Int’l Conf. Multimedia & Expo, pp. 317–321 (2005)

    Google Scholar 

  25. Kanade, T., Cohn, J., Tian, Y.: Comprehensive database for facial expression analysis. In: IEEE Int’l Conf. on Automatic Face and Gesture Recognition, pp. 46–53 (2000)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Michael Lew Nicu Sebe Thomas S. Huang Erwin M. Bakker

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Valstar, M.F., Pantic, M. (2007). Combined Support Vector Machines and Hidden Markov Models for Modeling Facial Action Temporal Dynamics. In: Lew, M., Sebe, N., Huang, T.S., Bakker, E.M. (eds) Human–Computer Interaction. HCI 2007. Lecture Notes in Computer Science, vol 4796. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75773-3_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-75773-3_13

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-75772-6

  • Online ISBN: 978-3-540-75773-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics