Abstract
We present a new markerless generative approach for Human Motion Tracking using a single depth camera. It is based on a Sums of Spatial Gaussians (SoGs) representation for modeling the scene. In contrast to existing systems our approach does not require a multi-view camera setup, exemplar database or training data. The proposed system is accurate, fast and capable of tracking complex motions including 360° turns and self-occlusion of limited duration. The motivation behind our approach is that representing the depth data and a given a priori human model by a SoGs, we can construct an efficient continuously differentiable similarity measure and estimate an optimal pose for each input frame using a local optimization algorithm (Modified Gradient Ascent Linear Search, MGALS).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Baak, A.: Müller, M., Bharaj, G., Seidel, H.P., Theobalt, C.: A data-driven approach for real-time full body pose reconstruction from a depth camera. In: Proc. ICCV (2011)
Bleiweiss, A., Kutliroff, E., Eilat, G.: Markerless motion capture using a single depth sensor. In: SIGGRAPH ASIA, Sketches (2009)
Ganapathi, V., Plagemann, C., Koller, D., Thrun, S.: Real-time human pose tracking from range data. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part VI. LNCS, vol. 7577, pp. 738–751. Springer, Heidelberg (2012)
Ganapathi, V., Plagemann, C., Koller, D., Thrun, S.: Real time motion capture using a single time-of-light camera. In: Proc. CVPR (2010)
Girshick, R., Shotton, J., P.K., Criminisi, A., Fitzgibbon, A.: Efficient regression of general-activity human poses from depth images. In ICCV pp. 415–422 (2011)
Knoop, S., Vacek, S., Dillmann, R.: Fusion of 2d and 3d sensor data for articulated body tracking. Robotics and Autonomous Systems 57(3), 321–329 (2009)
Moeslund, T., Granum, E.: A survey of computer vision-based human motion capture. Computer Vision and Image Understanding 81(3), 231–268 (2001)
Moeslund, T., Hilton, A.: Krüger, V.: A survey of advances in human motion capture and analysis. Computer Vision and Image Understanding 104(2) (2006)
Pekelny, Y., Gotsman, C.: Articulated object reconstruction and markerless motion capture from depth video. CGF 27(2), 399–408 (2008)
Plagemann, C., Ganapathi, V., Koller, D., Thrun, S.: Real-time identification and localization of body parts from depth images. In: ICRA, Anchorage, USA (2010)
Poppe, R.: Vision-based human motion analysis: An overview. CVIU 108 (2007)
Shotton, J., Fitzgibbon, A., Cook, M., Sharp, T., Finocchio, M., Moore, R., Kipman, A., Blake, A.: Real-time human pose recognition in parts from single depth images. In: Proc. CVPR Computer Vision and Image Understanding (2011)
Stoll, C., Hasler, N., Gall, J., Seidel, H.P., Theobalt, C.: Fast articulated motion tracking using a sums of gaussians body model. In: ICCV (2011)
Taylor, J., Shotton, J., Sharp, T., Fitzgibbon, A.: The vitruvian manifold: Inferring dense correspondences for one-shot human pose estimation. In: CVPR (2012)
Zhu, Y., Curless, B., Seitz, S.M.: Kinematic self retargeting: A framework for human pose estimation. CVIU 114(12), 1362–1375 (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kurmankhojayev, D., Hasler, N., Theobalt, C. (2013). Monocular Pose Capture with a Depth Camera Using a Sums-of-Gaussians Body Model. In: Weickert, J., Hein, M., Schiele, B. (eds) Pattern Recognition. GCPR 2013. Lecture Notes in Computer Science, vol 8142. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40602-7_44
Download citation
DOI: https://doi.org/10.1007/978-3-642-40602-7_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40601-0
Online ISBN: 978-3-642-40602-7
eBook Packages: Computer ScienceComputer Science (R0)