Regularity Guaranteed Human Pose Correction

  • Wei ShenEmail author
  • Rui Lei
  • Dan Zeng
  • Zhijiang Zhang
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9004)


Benefited from the advantages provided by depth sensors, 3D human pose estimation has become feasible. However, the current estimation systems usually yield poor results due to severe occlusion and sensor noise in depth data. In this paper, we focus on a post-process step, pose correction, which takes the initial estimated poses as the input and deliver more reliable results. Although the regression based correction approach [1] has shown its effectiveness in decreasing the estimated errors, it cannot guarantee the regularity of corrected poses. To address this issue, we formulate pose correction as an optimization problem, which combines the output of the regression model with a pose prior model learned on a pre-captured motion data set. By considering the complexity and the geometric property of the pose data, the pose prior is estimated by von Mises-Fisher distributions in subspaces following divide-and-conquer strategies. By introducing the pose prior into our optimization framework, the regularity of the corrected poses is guaranteed. The experimental results on a challenging data set demonstrate that the proposed pose correction approach not only improves the accuracy, but also outputs more regular poses, compared to the-state-of-the-art.


Depth Image Temporal Constraint Prior Model Golf Swing Motion Capture Data 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.



This work was supported in part by the National Natural Science Foundation of China under Grant 61303095, in part by Research Fund for the Doctoral Program of Higher Education of China under Grant 20133108120017, in part by Innovation Program of Shanghai Municipal Education Commission under Grant 14YZ018, in part by Innovation Program of Shanghai University under Grant SDCX2013012 and in part by Cultivation Fund for the Young Faculty of Higher Education of Shanghai under Grant ZZSD13005. We thank Microsoft Corporation for providing the skeleton data set used in our experiments.


  1. 1.
    Shen, W., Deng, K., Bai, X., Leyvand, T., Guo, B., Tu, Z.: Exemplar-based human action pose correction and tagging. In: Proceedings of CVPR (2012)Google Scholar
  2. 2.
    Microsoft Corp. Kinect for XBOX 360. Redmond WAGoogle Scholar
  3. 3.
    Han, J., Shao, L., Xu, D., Shotton, J.: Enhanced computer vision with microsoft kinect sensor: a review. IEEE Trans. Cybern. 43, 1318–1334 (2013)CrossRefGoogle Scholar
  4. 4.
    Shotton, J., Fitzgibbon, A., Cook, M., Sharp, T., Finocchio, M., Moore, R., Kipman, A., Blake, A.: Real-time human pose recognition in parts from a single depth image. In: Proceedings of CVPR (2011)Google Scholar
  5. 5.
    Ganapathi, V., Plagemann, C., Koller, D., Thrun, S.: Real time motion capture using a single time-of-flight camera. In: Proceedings of CVPR, pp. 755–762 (2010)Google Scholar
  6. 6.
    Ye, M., Wang, X., Yang, R., Ren, L., Pollefeys, M.: Accurate 3D pose estimation from a single depth image. In: Proceedings of ICCV (2011)Google Scholar
  7. 7.
    Baak, A., Müller, M., Bharaj, G., Seidel, H.P., Theobalt, C.: A data-driven approach for real-time full body pose reconstruction from a depth camera. In: Proceedings of ICCV (2011)Google Scholar
  8. 8.
    Girshick, R., Shotton, J., Kohli, P., Criminisi, A., Fitzgibbon, A.: Efficient regression of general-activity human poses from depth images. In: Proceedings of ICCV (2011)Google Scholar
  9. 9.
    Sun, M., Kohli, P., Shotton, J.: Conditional regression forests for human pose estimation. In: Proceedings of CVPR (2012)Google Scholar
  10. 10.
    Shum, H.P.H., Ho, E.S.L., Jiang, Y., Takagi, S.: Real-time posture reconstruction for microsoft kinect. IEEE Trans. Cybern. 43, 1357–1369 (2013)CrossRefGoogle Scholar
  11. 11.
    Shen, W., Deng, K., Bai, X., Leyvand, T., Guo, B., Tu, Z.: Exemplar-based human action pose correction. IEEE Trans. Cybern. 44(7), 1053–1066 (2014)CrossRefGoogle Scholar
  12. 12.
    Wang, X., Zhang, Z., Ma, Y., Bai, X., Liu, W., Tu, Z.: Robust subspace discovery via relaxed rank minimization. Neural Comput. 26, 611–635 (2014)CrossRefMathSciNetGoogle Scholar
  13. 13.
    Wang, B., Tu, Z.: Sparse subspace denoising for image manifolds. In: Proceedings of CVPR, pp. 468–475 (2013)Google Scholar
  14. 14.
    Bentley, J.L.: Multidimensional divide-and-conquer. Commun. ACM 23, 214–229 (1980)CrossRefzbMATHMathSciNetGoogle Scholar
  15. 15.
    Fisher, N.I., Lewis, T., Embleton, B.J.J.: Statistical Analysis of Spherical Data. Cambridge University Press, Cambridge (1993)zbMATHGoogle Scholar
  16. 16.
    Wang, X., Bai, X., Ma, T., Liu, W., Latecki, L.J.: Fan shape model for object detection. In: Proceedings of CVPR, pp. 151–158 (2012)Google Scholar
  17. 17.
    Plagemann, C., Ganapathi, V., Koller, D., Thrun, S.: Real-time identification and localization of body parts from depth images. In: Proceedings of ICRA, pp. 3108–3113 (2010)Google Scholar
  18. 18.
    Breiman, L.: Random forests. Mach. Learn. 45, 5–32 (2001)CrossRefzbMATHGoogle Scholar
  19. 19.
    Quinlan, J.R.: Induction of decision trees. Mach. Learn. 1, 81–106 (1986)Google Scholar
  20. 20.
    Ganapathi, V., Plagemann, C., Koller, D., Thrun, S.: Real-time human pose tracking from range data. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part VI. LNCS, vol. 7577, pp. 738–751. Springer, Heidelberg (2012) CrossRefGoogle Scholar
  21. 21.
    Felzenszwalb, P.F., Huttenlocher, D.P.: Pictorial structures for object recognition. Int. J. Comput. Vis. 61, 55–79 (2005)CrossRefGoogle Scholar
  22. 22.
    Ren, X., Berg, A.C., Malik, J.: Recovering human body configurations using pairwise constraints between parts. In: Proceedings of ICCV, pp. 824–831 (2005)Google Scholar
  23. 23.
    Ramanan, D.: Learning to parse images of articulated bodies. In: Proceedings of NIPS, pp. 1129–1136 (2006)Google Scholar
  24. 24.
    Dantone, M., Gall, J., Leistner, C., Gool, L.J.V.: Human pose estimation using body parts dependent joint regressors. In: Proceedings of CVPR, pp. 3041–3048 (2013)Google Scholar
  25. 25.
    Ladicky, L., Torr, P.H.S., Zisserman, A.: Human pose estimation using a joint pixel-wise and part-wise formulation. In: Proceedings of CVPR, pp. 3578–3585 (2013)Google Scholar
  26. 26.
    Yang, Y., Ramanan, D.: Articulated pose estimation with flexible mixtures-of-parts. In: Proceedings of CVPR, pp. 1385–1392 (2011)Google Scholar
  27. 27.
    Yao, C., Bai, X., Liu, W., Latecki, L.J.: Human detection using learned part alphabet and pose dictionary. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014, Part V. LNCS, vol. 8693, pp. 251–266. Springer, Heidelberg (2014) CrossRefGoogle Scholar
  28. 28.
    Lehrmann, A.M., Gehler, P.V., Nowozin, S.: A non-parametric bayesian network prior of human pose. In: Proceedings of ICCV, pp. 1281–1288 (2013)Google Scholar
  29. 29.
    Murray, R.M., Li, Z., Sastry, S.S.: A Mathematical Introduction to Robotic Manipulation. CRC Press, Boca Raton (1994) zbMATHGoogle Scholar
  30. 30.
    Shi, J., Malik, J.: Normalized cuts and image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 22, 888–905 (2000)CrossRefGoogle Scholar
  31. 31.
    Sra, S.: A short note on parameter approximation for von mises-fisher distributions: and a fast implementation of \(i_s(x)\). Comput. Stat. 27, 177–190 (2011)CrossRefMathSciNetGoogle Scholar
  32. 32.
    Luo, Z.Q., Tseng, P.: On the convergence of the coordinate descent method for convex differentiable minimization. J. Optim. Theory Appl. 72, 7–35 (1992)CrossRefzbMATHMathSciNetGoogle Scholar
  33. 33.
    Rasmussen, C.E., Williams, C.: Gaussian Processes for Machine Learning. MIT Press, Cambridge (2006) zbMATHGoogle Scholar
  34. 34.
    Schölkopf, B., Smola, A., Williamson, R., Bartlett, P.L.: New support vector algorithms. Neural Comput. 12, 1207–1245 (2000)CrossRefGoogle Scholar
  35. 35.
    Cao, X., Wei, Y., Wen, F., Sun, J.: Face alignment by explicit shape regression. In: Proceedings of CVPR, pp. 2887–2894 (2012)Google Scholar
  36. 36.
    Zhou, Y., Yang, Y., Yi, M., Bai, X., Liu, W., Latecki, L.J.: Online multiple targets detection and tracking from mobile robot in cluttered indoor environments with depth camera. IJPRAI 28(1) (2014)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  1. 1.School of Communication and Information EngineeringShanghai UniversityShanghaiPeople’s Republic of China

Personalised recommendations