A Unified Energy Minimization Framework for Model Fitting in Depth

  • Carl Yuheng Ren
  • Ian Reid
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7584)


In this paper we present a unified energy minimization framework for model fitting and pose recovery problems in depth cameras. 3D level-set embedding functions are used to represent object models implicitly and a novel 3D chamfer matching based energy function is minimized by adjusting the generic projection matrix, which could be parameterized differently according to specific applications. Our proposed energy function takes the advantage of the gradient of 3D level-set embedding function and can be efficiently solved by gradients-based optimization methods. We show various real-world applications, including real-time 3D tracking in depth, simultaneous calibration and tracking, and 3D point cloud modeling. We perform experiments on both real data and synthetic data to show the superior performance of our method for all the applications above.


Point Cloud Energy Function Depth Image Intrinsic Parameter Depth Camera 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    Oikonomidis, I., Kyriazis, N., Argyros, A.: Efficient model-based 3D tracking of hand articulations using kinect. In: Proceedings of the British Machine Vision Conference, pp. 101.1–101.11. BMVA Press (2011)Google Scholar
  2. 2.
    Newcombe, R.A., Izadi, S., Hilliges, O., Molyneaux, D., Kim, D., Davison, A.J., Kohli, P., Shotton, J., Hodges, S., Fitzgibbon, A.W.: Kinectfusion: Real-time dense surface mapping and tracking. In: ISMAR, pp. 127–136. IEEE (2011)Google Scholar
  3. 3.
    Koppula, H.S., Anand, A., Joachims, T., Saxena, A.: Semantic labeling of 3D point clouds for indoor scenes. In: NIPS, pp. 244–252 (2011)Google Scholar
  4. 4.
    Ueda, R.: Tracking 3D objects with point cloud library (2012),
  5. 5.
    Besl, P.J., McKay, N.D.: A method for registration of 3-D shapes. IEEE Trans. Pattern Anal. Mach. Intell. 14, 239–256 (1992)CrossRefGoogle Scholar
  6. 6.
    Thayananthan, A., Stenger, B., Torr, P.H.S., Cipolla, R.: Shape context and chamfer matching in cluttered scenes. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition, vol. 1, p. 127 (2003)Google Scholar
  7. 7.
    Daniel Herrera, C., Kannala, J., Heikkila, J.: Joint depth and color camera calibration with distortion correction. IEEE Transactions on Pattern Analysis and Machine Intelligence 99 (2012)Google Scholar
  8. 8.
    Zhang, Z.: A flexible new technique for camera calibration. IEEE Transactions on Pattern Analysis and Machine Intelligence 22, 1330–1334 (2000)CrossRefGoogle Scholar
  9. 9.
    Prisacariu, V., Reid, I.: Simultaneous monocular 2D segmentation, 3D pose recovery and 3D reconstruction. Technical report, University of Oxford (2012)Google Scholar
  10. 10.
    Lawrence, N.D.: Probabilistic non-linear principal component analysis with gaussian process latent variable models. Journal of Machine Learning Research 6, 1783–1816 (2005)MathSciNetzbMATHGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Carl Yuheng Ren
    • 1
  • Ian Reid
    • 1
  1. 1.Dept. Engineering ScienceOxford UniversityOxfordUK

Personalised recommendations