Abstract
Motivated by model-based video compression, we are studying the modeling, analysis, and synthesis of human face motion. The geometrical shape of the face is modeled by a triangular mesh. Motion can be described at three levels. At the lowest level, motion is represented by the 3D displacements of the vertices of the triangles from one time instant to the next. At the intermediate level, one has the so-called “action units” (e.g., raising eye brows), each of which consists of the movements of several specific vertices. Finally, at the top level, one is concerned with the meaning of the motion. Restricting to the emotional aspect of face motion (not the speech aspect), one considers meanings such as: Happy, angry, anxious, etc.
In this paper, we describe some of our preliminary results on human face motion analysis. Specifically, algorithms are presented for: (i) extracting and tracking key feature points on the face, and (ii) estimating the global and local motion of the head/face. These two sets of algorithms interact with each other. Based on feature correspondences over successive time instants, motion is estimated. Then the estimated motion is used to help track the feature points at future time instants. We attempt to determine the motion of 26 key feature points on the face, from which the action units and meaning of the motion can be deduced.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
K. Aizawa, H. Harashima, T. Saito, Model-based analysis synthesis image coding (MBASIC) system for a person’s face, Proc. GLOBECOM-87, Nov. 1987, pp. 45-49.
Eric Badiqué, Knowledge-based facial area recognition and improved coding in a CCIT-compatible low-bitrate video-codec, Proc. PCS 90, 9.1 pp. 1-2.
C.S. Choi, K. Aizawa, H. Harashima, T. Takebe, Analysis and synthesis of facial expressions in model-based image coding, Proc. PCS-90, 9.15 pp. 1-2.
P. Ekman, W.V. Friesen, “Facial Action Coding System”, Consulting Psychologists Press, Inc., Palo Alto, CA.
R. Forchheimer, The motion estimation problem in semantic image coding, in Proc. PCS-87, Stockholm 1987, pp. 171-172.
R. Forchheimer, T. Kronander, Image coding—from waveforms to animation, IEEE Trans. ASSP, Dec. 1989.
M. Kaneko, A. Koike and Y. Hatori, Coding with knowledge-based analysis of motion pictures, PCS 87, 12.3.
M. Kaneko, A. Koike and Y. Hatori, Model-based image coding with 3D motion estimation and shape change detection, Proc. PCS-90, 9.5, pp. 1-2.
Michael Kass, A. Terzopoulus, et al., Snakes: active contour models, ICCV 1987, pp. 259-267.
F. Keppei, C.-E. Liedtke, Modeling of 3D objects from monocular TV-image sequences, CAIP 1989.
Mikael Rydfalk, Candide, a parameterized face, Department of Electrical Engineering, Linköping University, Linkôping, Sweden, 1989.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1992 Springer Science+Business Media New York
About this chapter
Cite this chapter
Huang, T.C., Reddy, S.C. (1992). Human Face Motion Analysis. In: Arcelli, C., Cordella, L.P., di Baja, G.S. (eds) Visual Form. Springer, Boston, MA. https://doi.org/10.1007/978-1-4899-0715-8_28
Download citation
DOI: https://doi.org/10.1007/978-1-4899-0715-8_28
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4899-0717-2
Online ISBN: 978-1-4899-0715-8
eBook Packages: Springer Book Archive