Model-based object pose in 25 lines of code

  • Daniel F. DeMenthon
  • Larry S. Davis
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 588)


We find the pose of an object from a single image when the relative geometry of four or more noncoplanar visible feature points is known. We first describe an algorithm, POS (Pose from Orthography and Scaling), that solves for the rotation matrix and the translation vector of the object by a linear algebra technique under the scaled orthographic projection approximation. We then describe an iterative algorithm, POSIT (POS with ITerations), that uses the pose found by POS to remove the “perspective distortions” from the image, then applies POS to the corrected image instead of the original image. POSIT generally converges to accurate pose measurements in a few iterations. Mathematica code is provided in an Appendix.


Feature Point Rotation Matrix Image Point Translation Vector Object Point 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    H.S. Baird, “Model-Bases Image Matching Using Location”, MIT Press, Cambridge, MA, 1985.Google Scholar
  2. 2.
    T.A. Cass, “Feature Matching for Object Localization in the Presence of Uncertainty”, MIT A.I. Memo 1113, May 1990.Google Scholar
  3. 3.
    D. DeMenthon and L.S. Davis, “Model-Based Object Pose in 25 Lines of Code”, Center for Automation Research Technical Report CAR-TR-599, December 1991.Google Scholar
  4. 4.
    R.M. Haralick, “Performance Characterization in Computer Vision”, University of Washington C.S. Technical Report, July 1991.Google Scholar
  5. 5.
    R. Horaud, B. Conio and O. Leboulleux, “An Analytical Solution for the Perspective-4-Point Problem”, Computer Vision, Graphics, and Image Processing, vol. 47, pp. 33–44, 1989.Google Scholar
  6. 6.
    C. Tomasi, “Shape and Motion from Image Streams: A Factorization Method”, Technical Report CMU-CS-91-172, Carnegie Mellon University, September 1991.Google Scholar
  7. 7.
    R.Y. Tsai, “A Versatile Camera Calibration Technique for High-Accuracy 3D Machine Vision Metrology Using Off-the-Shelf TV Cameras and Lenses,” IEEE J. Robotics and Automation, vol. 3, pp. 323–344, 1987.Google Scholar
  8. 8.
    S. Ullman and R. Basri, “Recognition by Linear Combinations of Models”, IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 13, pp. 992–1006, 1991.Google Scholar
  9. 9.
    J.S.C. Yuan, “A General Photogrammetric Method for Determining Object Position and Orientation”, IEEE Trans. on Robotics and Automation, vol. 5, pp. 129–142, 1989.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 1992

Authors and Affiliations

  • Daniel F. DeMenthon
    • 1
  • Larry S. Davis
    • 1
  1. 1.Computer Vision Laboratory, Center for Automation ResearchUniversity of MarylandUSA

Personalised recommendations