Skip to main content
Log in

Recognizing solid objects by alignment with an image

  • Published:
International Journal of Computer Vision Aims and scope Submit manuscript

Abstract

In this paper we consider the problem of recognizing solid objects from a single two-dimensional image of a three-dimensional scene. We develop a new method for computing a transformation from a three-dimensional model coordinate frame to the two-dimensional image coordinate frame, using three pairs of model and image points. We show that this transformation always exists for three noncollinear points, and is unique up to a reflective ambiguity. The solution method is closed-form and only involves second-order equations. We have implemented a recognition system that uses this transformation method to determine possible alignments of a model with an image. Each of these hypothesized matches is verified by comparing the entire edge contours of the aligned object with the image edges. Using the entire edge contours for verification, rather than a few local feature points, reduces the chance of finding false matches. The system has been tested on partly occluded objects in highly cluttered scenes.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

References

  • Ayache, N., and Faugeras, O.D., 1986. HYPER: A new approach for the recognition and positioning of two-dimensional objects. IEEE Trans. Patt. Anal. Mach. Intell. 8 (1): 44–54.

    Google Scholar 

  • Basri, R., and Ullman, S. 1988. The alignment of objects with smooth surfaces. Proc. 2nd Intern. Conf. Comput. Vision, pp. 482–488.

  • Bolles, R.C., and Cain, R.A. 1982. Recognizing and locating partially visible objects: The local feature focus method. Intern. J. Robotics Res. 1 (3): 57–82.

    Google Scholar 

  • Brooks, R.A. 1981. Symbolic reasoning around 3-D models and 2-D images. Artificial Intelligence J. 17: 285–348.

    Google Scholar 

  • Canny, J.F. 1986. A computational approach to edge detection. IEEE Trans. Patt. Anal. Mach. Intell. 8 (6): 34–43.

    Google Scholar 

  • Chin, R.T., and Dyer, C.R. 1986. Model-based recognition in robot vision. ACM Computing Surveys 18 (1): 67–108.

    Google Scholar 

  • Cyganski, D., and Orr, J.A. 1985. Applications of tensor theory to object recognition and orientation determination. IEEE Trans. Patt. Anal. Mach. Intell. PAMI-7 (6): 662–673.

    Google Scholar 

  • Duda, R.O., and Hart, P.E., 1973. Pattern Classification and Scene Analysis. Wiley: New York.

    Google Scholar 

  • Fischler, M.A., and Bolles, R.C. 1981. Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography, Comm. Assoc. Comput. Mach. 24 (6): 381–395.

    Google Scholar 

  • Goad, C. 1986. Fast 3D model-based vision. In From Pixels to Predicates: Recent Advances in Computational and Robotic Vision. A.P., Pentland, ed. Ablex: Norwood, N.J.

    Google Scholar 

  • Grimson, W.E.L., and Lozano-Pérez, T. 1987. Localizing overlapping parts by searching the interpretation tree. IEEE Trans. Patt. Anal. Mach. Intell. 9 (4):469–482.

    Google Scholar 

  • Grimson, W.E.L., and Huttenlocher, D.P. 1990. On the sensitivity of the Hough transform for object recognition. IEEE Trans. Patt. Anal. Mach. Intell. 12 (3): 255–274.

    Google Scholar 

  • Horn, B.K.P., and Weldon, E.J. 1985. Filtering closed curves. Proc. Conf. Comput. Vision Patt. Recog., pp. 478–484.

  • Horn, B.K.P. 1986. Robot Vision. MIT Press: Cambridge, MA.

    Google Scholar 

  • Huttenlocher, D.P., and Ullman, S. 1987. Object recognition using alignment. Proc. 1st Intern. Conf. Comput. Vision, pp. 102–111.

  • Huttenlocher, D.P., and Ullman, S. 1988. Recognizing solid objects by alignment. Proc. DARPA Image Understanding Workshop. Morgan Kaufman Publishers: San Mateo, CA, pp. 1114–1124.

    Google Scholar 

  • Ikeuchi, K. 1987. Precompiling a geometrical model into an interpretation tree for object recognition in bin-picking tasks. Proc. DARPA Image Understanding Workshop. Morgan Kaufmann Publishers: San Mateo, CA, pp. 321–338.

    Google Scholar 

  • Kanade, T., and Kender, J.R. 1983. Mapping image properties into shape constraits: Shewed symmetry, affine transformable patterns, and the shape-from-texture paradigm. In J., Beck et al. (eds.), Human and Machine Vision, Academic Press: Orlando, FL.

    Google Scholar 

  • Klein, F. 1939. Elementary Mathematics from an Advanced Standpoint: Geometry, MacMillan: New York.

    Google Scholar 

  • Lamdan, Y., Schwartz, J.T., and Wolfson, H.J. 1988. Object recognition by affine invariant matching. Proc. IEEE Conf. Comput. Vision Patt. Recog.

  • Linainmaa, S., Harwood, D., and Davis, L.S. 1985. Pose determination of a three-dimensional object using triangle pairs. CAR-TR-143, Center for Automation Research, University of Maryland.

  • Lowe, D.G. 1987. Three-dimensional object recognition from single two-dimensional images. Artificial Intelligence J. 31: 355–395.

    Google Scholar 

  • Lowe, D.G. 1988. Organization of smooth image curves at multiple scales. Proc. 2nd Intern. Conf. Comput. Vision, pp. 558–567.

  • Mokhtarian, F., and Mackworth, A. 1986. Scale-based description and recognition of planar curves and two-dimensional shapes. IEEE Trans. Patt. Anal. Mach. Intell. 8 (1).

  • Roberts, L.G. 1965. Machine perception of three-dimensional solids. J.T., Tippet et al. eds. MIT Press, Cambridge, MA.

    Google Scholar 

  • Shoham, D., and Ullman, S. 1988. Aligning a model to an image using minimal information. Proc. 2nd Intern. Conf. Comput. Vision.

  • Silberberg, T., Harwood, D., and Davis, L.S., 1986. Object recognition using oriented model points. Comput. Vision, Graphics and Image Process. 35: 47–71.

    Google Scholar 

  • Thompson, D., and Mundy, J.L. 1987. Three-dimensional model matching from an unconstrained viewpoint. Proc. IEEE Conf. Robotics and Automation, p. 280.

  • Ullman, S. 1987. An approach to object recognition: Aligning pictorial descriptions. MIT Artificial Intellignece Lab., Memo No. 931.

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Huttenlocher, D.P., Ullman, S. Recognizing solid objects by alignment with an image. Int J Comput Vision 5, 195–212 (1990). https://doi.org/10.1007/BF00054921

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF00054921

Keywords

Navigation