Abstract
The recognition of human actions such as pointing at objects (“Give me that...”) is difficult because they ought to be recognized independent of scene parameters such as viewing direction. Furthermore, the parameters of the action, such as pointing direction, are important pieces of information. One common way to achieve recognition is by using 3D human body tracking followed by action recognition based on the captured tracking data. General 3D body tracking is, however, still a difficult problem. In this paper, we are looking at human body tracking for action recognition from a context-driven perspective. Instead of the space of human body poses, we consider the space of possible actions of a given context and argue that 3D body tracking reduces to action tracking in the parameter space in which the actions live. This reduces the high-dimensional problem to a low-dimensional one. In our approach, we use parametric hidden Markov models to represent parametric movements; particle filtering is used to track in the space of action parameters. Our approach is content with monocular video data and we demonstrate its effectiveness on synthetic and on real image sequences. In the experiments we focus on human arm movements.
Chapter PDF
References
Asfour, T., Welke, K., Ude, A., Azad, P., Dillmann, R.: Perceiving Objects and Movemetns to Generate Actions on a Humanoid Robot. In: Kragic, D., Kyrki, V. (eds.) Unifying Perspectives in Computational and Robot Vision. LNEE, vol. 8, pp. 41–55. Springer, Heidelberg (2008)
Krüger, V., Kragic, D., Ude, A., Geib, C.: The meaning of action: A review on action recognition and mapping. Advanced Robotics 21, 1473–1501 (2007)
Wilson, A.D., Bobick, A.F.: Parametric hidden markov models for gesture recognition. PAMI 21, 884–900 (1999)
Ren, H., Xu, G., Kee, S.: Subject-independent Natural Action Recognition. In: International Conference on Automatic Face and Gesture Recognition, Seoul, Korea, May 17-19 (2004)
Lv, F., Nevatia, R.: Recognition and Segmentation of 3-D Human Action Using HMM and Multi-class AdaBoost. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006, Part IV. LNCS, vol. 3954, pp. 359–372. Springer, Heidelberg (2006)
Xiang, T., Gong, S.: Beyond Tracking: Modelling Action and Understanding Behavior. International Journal of Computer Vision 67, 21–51 (2006)
Lee, M., Nevatia, R.: Human pose tracking in monocular sequences using multilevel structured models. PAMI 31, 27–38 (2009)
Deutscher, J., Blake, A., Reid, I.: Articulated body motion capture by annealed particle filtering. In: CVPR, vol. 2, pp. 126–133 (2000)
Sidenbladh, H., Black, M.J., Sigal, L.: Implicit Probabilistic Models of Human Motion for Synthesis and Tracking. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002, Part I. LNCS, vol. 2350, pp. 784–800. Springer, Heidelberg (2002)
Sminchisescu, C., Triggs, B.: Covarinace Scaled Sampling for Monocular 3D Body Tracking. In: CVPR, Kauai Marriott, Hawaii (2001)
Gupta, A., Davis, L.: Objects in action: An approach for combining action understanding and object perception. In: CVPR (2007)
Kjellström, H., Romero, J., Martínez, D., Kragić, D.: Simultaneous Visual Recognition of Manipulation Actions and Manipulated Objects. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part II. LNCS, vol. 5303, pp. 336–349. Springer, Heidelberg (2008)
Helbig, H.B., Graf, M., Kiefer, M.: The role of action representation in visual object. Experimental Brain Research 174, 221–228 (2006)
Bub, D., Masson, M.: Gestural knowledge evoked by objects as part of conceptual representations. Aphasiology 20, 1112–1124 (2006)
Rizzolatti, G., Fogassi, L., Gallese, V.: Neurophysiological Mechanisms Underlying the Understanding and Imitation of Action. Nature Reviews 2, 661–670 (2001)
Rizzolatti, G., Fogassi, L., Gallese, V.: Parietal Cortex: from Sight to Action. Current Opinion in Neurobiology 7, 562–567 (1997)
Guerra-Filho, G., Aloimonos, Y.: A sensory-motor language for human activity understanding. HUMANOIDS (2006)
Jenkins, O., Mataric, M.: Deriving Action and Behavior Primitives from Human Motion Data. In: International Conference on Intelligent Robots and Systems, Lausanne, Switzerland, September 30-October 4, pp. 2551–2556 (2002)
Guerra-Filho, G., Aloimonos, Y.: A language for human action. Computer 40, 42–51 (2007)
Ivanov, Y., Bobick, A.: Recognition of Visual Activities and Interactions by Stochastic Parsing. PAMI 22, 852–872 (2000)
Moeslund, T., Hilton, A., Krueger, V.: A survey of advances in vision-based human motion capture and analysis. Computer Vision and Image Understanding 104, 90–127 (2006)
Isard, M., Blake, A.: Condensation – conditional density propagation for visual tracking. International Journal of Computer Vision 29, 5–28 (1998)
Gupta, A., Mittal, A., Davis, L.S.: Constraint integration for efficient multiview pose estimation with self-occlusions. PAMI 30, 493–506 (2008)
Gall, J., Patthoff, J., Schnoerr, C., Rosenhahn, B., Seidel, H.P.: Interacting and annealing particle filters: Mathematics and recipe for applications. Jounral of Mathematical Imaging and Vision 28, 1–18 (2007)
Urtasun, R., Fua, P.: 3D Human Body Tracking Using Deterministic Temporal Motion Models. In: Pajdla, T., Matas, J. (eds.) ECCV 2004, Part III. LNCS, vol. 3023, pp. 92–106. Springer, Heidelberg (2004)
Elgammal, A., Lee, C.S.: Inferring 3D body pose from silhouettes using activity manifold learning. In: CVPR (2004)
Wang, J.M., Fleet, D.J., Hertzmann, A.: Correction to ”gaussian process dynamical models for human motion”. PAMI 30, 1118 (2008)
Rabiner, L.R., Juang, B.H.: An introduction to hidden Markov models. IEEE ASSP Magazine, 4–15 (1986)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Herzog, D.L., Krüger, V. (2012). Tracking in Action Space. In: Kutulakos, K.N. (eds) Trends and Topics in Computer Vision. ECCV 2010. Lecture Notes in Computer Science, vol 6553. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35749-7_8
Download citation
DOI: https://doi.org/10.1007/978-3-642-35749-7_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35748-0
Online ISBN: 978-3-642-35749-7
eBook Packages: Computer ScienceComputer Science (R0)