Abstract
In this paper we consider the problem of recognizing solid objects from a single two-dimensional image of a three-dimensional scene. We develop a new method for computing a transformation from a three-dimensional model coordinate frame to the two-dimensional image coordinate frame, using three pairs of model and image points. We show that this transformation always exists for three noncollinear points, and is unique up to a reflective ambiguity. The solution method is closed-form and only involves second-order equations. We have implemented a recognition system that uses this transformation method to determine possible alignments of a model with an image. Each of these hypothesized matches is verified by comparing the entire edge contours of the aligned object with the image edges. Using the entire edge contours for verification, rather than a few local feature points, reduces the chance of finding false matches. The system has been tested on partly occluded objects in highly cluttered scenes.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Ayache, N., and Faugeras, O.D., 1986. HYPER: A new approach for the recognition and positioning of two-dimensional objects. IEEE Trans. Patt. Anal. Mach. Intell. 8 (1): 44–54.
Basri, R., and Ullman, S. 1988. The alignment of objects with smooth surfaces. Proc. 2nd Intern. Conf. Comput. Vision, pp. 482–488.
Bolles, R.C., and Cain, R.A. 1982. Recognizing and locating partially visible objects: The local feature focus method. Intern. J. Robotics Res. 1 (3): 57–82.
Brooks, R.A. 1981. Symbolic reasoning around 3-D models and 2-D images. Artificial Intelligence J. 17: 285–348.
Canny, J.F. 1986. A computational approach to edge detection. IEEE Trans. Patt. Anal. Mach. Intell. 8 (6): 34–43.
Chin, R.T., and Dyer, C.R. 1986. Model-based recognition in robot vision. ACM Computing Surveys 18 (1): 67–108.
Cyganski, D., and Orr, J.A. 1985. Applications of tensor theory to object recognition and orientation determination. IEEE Trans. Patt. Anal. Mach. Intell. PAMI-7 (6): 662–673.
Duda, R.O., and Hart, P.E., 1973. Pattern Classification and Scene Analysis. Wiley: New York.
Fischler, M.A., and Bolles, R.C. 1981. Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography, Comm. Assoc. Comput. Mach. 24 (6): 381–395.
Goad, C. 1986. Fast 3D model-based vision. In From Pixels to Predicates: Recent Advances in Computational and Robotic Vision. A.P., Pentland, ed. Ablex: Norwood, N.J.
Grimson, W.E.L., and Lozano-Pérez, T. 1987. Localizing overlapping parts by searching the interpretation tree. IEEE Trans. Patt. Anal. Mach. Intell. 9 (4):469–482.
Grimson, W.E.L., and Huttenlocher, D.P. 1990. On the sensitivity of the Hough transform for object recognition. IEEE Trans. Patt. Anal. Mach. Intell. 12 (3): 255–274.
Horn, B.K.P., and Weldon, E.J. 1985. Filtering closed curves. Proc. Conf. Comput. Vision Patt. Recog., pp. 478–484.
Horn, B.K.P. 1986. Robot Vision. MIT Press: Cambridge, MA.
Huttenlocher, D.P., and Ullman, S. 1987. Object recognition using alignment. Proc. 1st Intern. Conf. Comput. Vision, pp. 102–111.
Huttenlocher, D.P., and Ullman, S. 1988. Recognizing solid objects by alignment. Proc. DARPA Image Understanding Workshop. Morgan Kaufman Publishers: San Mateo, CA, pp. 1114–1124.
Ikeuchi, K. 1987. Precompiling a geometrical model into an interpretation tree for object recognition in bin-picking tasks. Proc. DARPA Image Understanding Workshop. Morgan Kaufmann Publishers: San Mateo, CA, pp. 321–338.
Kanade, T., and Kender, J.R. 1983. Mapping image properties into shape constraits: Shewed symmetry, affine transformable patterns, and the shape-from-texture paradigm. In J., Beck et al. (eds.), Human and Machine Vision, Academic Press: Orlando, FL.
Klein, F. 1939. Elementary Mathematics from an Advanced Standpoint: Geometry, MacMillan: New York.
Lamdan, Y., Schwartz, J.T., and Wolfson, H.J. 1988. Object recognition by affine invariant matching. Proc. IEEE Conf. Comput. Vision Patt. Recog.
Linainmaa, S., Harwood, D., and Davis, L.S. 1985. Pose determination of a three-dimensional object using triangle pairs. CAR-TR-143, Center for Automation Research, University of Maryland.
Lowe, D.G. 1987. Three-dimensional object recognition from single two-dimensional images. Artificial Intelligence J. 31: 355–395.
Lowe, D.G. 1988. Organization of smooth image curves at multiple scales. Proc. 2nd Intern. Conf. Comput. Vision, pp. 558–567.
Mokhtarian, F., and Mackworth, A. 1986. Scale-based description and recognition of planar curves and two-dimensional shapes. IEEE Trans. Patt. Anal. Mach. Intell. 8 (1).
Roberts, L.G. 1965. Machine perception of three-dimensional solids. J.T., Tippet et al. eds. MIT Press, Cambridge, MA.
Shoham, D., and Ullman, S. 1988. Aligning a model to an image using minimal information. Proc. 2nd Intern. Conf. Comput. Vision.
Silberberg, T., Harwood, D., and Davis, L.S., 1986. Object recognition using oriented model points. Comput. Vision, Graphics and Image Process. 35: 47–71.
Thompson, D., and Mundy, J.L. 1987. Three-dimensional model matching from an unconstrained viewpoint. Proc. IEEE Conf. Robotics and Automation, p. 280.
Ullman, S. 1987. An approach to object recognition: Aligning pictorial descriptions. MIT Artificial Intellignece Lab., Memo No. 931.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Huttenlocher, D.P., Ullman, S. Recognizing solid objects by alignment with an image. Int J Comput Vision 5, 195–212 (1990). https://doi.org/10.1007/BF00054921
Issue Date:
DOI: https://doi.org/10.1007/BF00054921