Virtual Reality

, Volume 20, Issue 3, pp 159–172 | Cite as

Optimizing human model reconstruction from RGB-D images based on skin detection

  • Guang Chen
  • Jituo Li
  • Jiping Zeng
  • Bei Wang
  • Guodong Lu
Original Article


This paper reconstructs human model from multi-view RGB-D images of an Xbox One Kinect. We preprocess the depth images by implicit surface de-noising and then part-wisely register them into a point cloud. A template model is selected from the human model database to fit the registered point cloud of a human body by Laplacian deformation. Skin detection of RGB-D images helps to tightly constrain the skin parts of human body in template fitting step in order to get more precise and lifelike human model. We propose a robust skin detection method that is not affected by clothing pattern and background. Experiments demonstrate the effectiveness of our method.


Human model reconstruction Kinect RGB-D image Skin detection 



This work was partially supported by National Natural Science Foundation of China (51575481, 61379096) and Project of Public Technology Research in Industry of Zhejiang Province (2014C31048).


  1. Anguelov D, Srinivasan P, Koller D, Thrun S, Rodgers J, Davis J (2005) SCAPE: shape completion and animation of people. ACM Trans Graph 24:408–416CrossRefGoogle Scholar
  2. Bălan AO, Black MJ (2008) The naked truth: estimating body shape under clothing. Computer Vision–ECCV 2008. Springer, Marseille, pp 15–29CrossRefGoogle Scholar
  3. Box GEP, Muller ME (1958) A note on the generation of random normal deviates. Ann Math Stat 29:610–611CrossRefzbMATHGoogle Scholar
  4. Chang W, Zwicker M (2011) Global registration of dynamic range scans for articulated model reconstruction. ACM Trans Graph 30:171–179Google Scholar
  5. Chen X, Guo Y, Zhou B, Zhao Q (2013) Deformable model for estimating clothed and naked human shapes from a single image. Vis Comput 29:1187–1196CrossRefGoogle Scholar
  6. Chen G, Li J, Wang B, Zeng J, Lu G, Zhang D (2016) Reconstructing 3D human models with a Kinect. Comput Anim Virtual Worlds 27:72–85CrossRefGoogle Scholar
  7. Hasler N, Stoll C, Sunkel M, Rosenhahn B, Seidel H-P (2009) A statistical model of human pose and body shape. Comput Graph Forum 28:337–346CrossRefGoogle Scholar
  8. Hasler N, Ackermann H, Rosenhahn B, Thormahlen T, Seidel H-P (2010) Multilinear pose and body shape estimation of dressed subjects from image sets. In: 2010 IEEE conference on computer vision and pattern recognition (CVPR), pp 1823–1830Google Scholar
  9. Huang QX, Adams B, Wicke M, Guibas LJ (2008) Non-rigid registration under isometric deformations. Comput Graph Forum 27:1449–1457CrossRefGoogle Scholar
  10. Izadi S et al (2011) KinectFusion: real-time 3D reconstruction and interaction using a moving depth camera. In: Proceedings of the 24th annual ACM symposium on user interface software and technology, pp 559–568Google Scholar
  11. Lee JY, Yoo SI (2002) An elliptical boundary model for skin color detection. In: Proceedings of the 2002 international conference on imaging science, systems, and technologyGoogle Scholar
  12. Li J, Lu G (2011) Skeleton driven animation based on implicit skinning. Comput Graph 35:945–954CrossRefGoogle Scholar
  13. Li J, Wang Y (2007) Automatically constructing skeletons and parametric structures for polygonal human bodies. In: Proceedings of the 25th computer graphics international conferenceGoogle Scholar
  14. Mitra NJ, Flöry S, Ovsjanikov M, Gelfand N, Guibas L, Pottmann H (2007) Dynamic geometry registration. In: Symposium on geometry processing, pp 173–182Google Scholar
  15. Newcombe RA et al (2011) KinectFusion: real-time dense surface mapping and tracking. In: 2011 10th IEEE international symposium on mixed and augmented reality (ISMAR), pp 127–136Google Scholar
  16. Otsu N (1975) A threshold selection method from gray-level histograms. Automatica 11:23–27Google Scholar
  17. Rother C, Kolmogorov V, Blake A (2004) Grabcut: interactive foreground extraction using iterated graph cuts. ACM Trans Graph 23:309–314CrossRefGoogle Scholar
  18. Tong J, Zhou J, Liu L, Pan Z, Yan H (2012) Scanning 3D full human bodies using Kinects. IEEE Trans Vis Comput Graph 18:643–650CrossRefGoogle Scholar
  19. Weiss A, Hirshberg D, Black MJ (2011) Home 3D body scans from noisy image and range data. In: Proceedings of the 2011 international conference on computer vision, pp 1951–1958Google Scholar
  20. Zeng M, Zheng J, Cheng X, Liu X (2013) Templateless quasi-rigid shape modeling with implicit loop-closure. In: 2013 IEEE conference on computer vision and pattern recognition (CVPR), pp 145–152Google Scholar
  21. Zeng M, Cao L, Dong H, Lin K, Wang M, Tong J (2015) Estimation of human body shape and cloth field in front of a Kinect. Neurocomputing 151:626–631CrossRefGoogle Scholar

Copyright information

© Springer-Verlag London 2016

Authors and Affiliations

  • Guang Chen
    • 1
  • Jituo Li
    • 1
  • Jiping Zeng
    • 1
  • Bei Wang
    • 1
  • Guodong Lu
    • 1
  1. 1.Research Center of Design and Production Innovation, College of Mechanical EngineeringZhejiang UniversityHangzhouPeople’s Republic of China

Personalised recommendations