Skip to main content
Log in

Recognition Using Region Correspondences

  • Published:
International Journal of Computer Vision Aims and scope Submit manuscript

Abstract

Recognition systems attempt to recover information about the identity of observed objects and their location in the environment. A fundamental problem in recognition is pose estimation. This is the problem of using a correspondence between some portions of an object model and some portions of an image to determine whether the image contains an instance of the object, and, in case it does, to determine the transformation that relates the model to the image. The current approaches to this problem are divided into methods that use “global” properties of the object (e.g., centroid and moments of inertia) and methods that use “local” properties of the object (e.g., corners and line segments). Global properties are sensitive to occlusion and, specifically, to self occlusion. Local properties are difficult to locate reliably, and their matching involves intensive computation.

We present a novel method for recognition that uses region information. In our approach the model and the image are divided into regions. Given a match between subsets of regions (without any explicit correspondence between different pieces of the regions) the alignment transformation is computed. The method applies to planar objects under similarity, affine, and projective transformations and to projections of 3-D objects undergoing affine and projective transformations. The new approach combines many of the advantages of the previous two approaches, while avoiding some of their pitfalls. Like the global methods, our approach makes use of region information that reflects the true shape of the object. But like local methods, our approach can handle occlusion.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Alter, T.D. and Grimson, W.E.L. 1993. Fast and robust 3D recognition by alignment. In Proc. Fourth Inter. Conf. Computer Vision, pp. 113-120.

  • Alter, T.D. and Jacobs, D. 1994. Error propagation in full 3D-from-2Dobject recognition. IEEE Conf. on Computer Vision and Pattern Recognition, pp. 892-898.

  • Amenta, N. 1994. Bounded boxes, Hausdorff distance, and a new proof of an interesting Helly-type theorem. Proceedings of the 10th Annual ACM Symposium on Computational Geometry, pp. 340- 347.

  • Ayache, N. and Faugeras, O. 1986. HYPER: A new approach for the recognition and positioning of two-dimensional objects. IEEE Trans. on Pattern Analysis and Machine Intelligence, 8(1):44-54.

    Google Scholar 

  • Baird, H. 1985. Model-Based Image Matching Using Location. MIT Press: Cambridge.

    Google Scholar 

  • Bajcsy, R. and Solina, F. 1987. Three dimensional object representation revisited. Proc. of The First Int. Conf. on Computer Vision, London, pp. 231-240.

  • Basri, R. 1996. Paraperspective ≡ Affine. Int. J. of Comp. Vis., 19(2):169-179.

    Google Scholar 

  • Basri, R. and Ullman, S. 1993. The alignment of objects with smooth surfaces. Computer Vision, Graphics, and Image Processing: Image Understanding, 57(3):331-345.

    Google Scholar 

  • Basri, R. and Jacobs, D.W. 1994. Recognition using region correspondences. The Weizmann Institute of Science, T.R. CS95-33.

  • Basri, R. and Jacobs, D.W. 1995. Recognition using region correspondences. IEEE Int. Conf. on Computer Vision, pp. 8-15.

  • Biederman, I. 1985. Human image understanding: Recent research and a theory. Computer Graphics, Vision, and Image Processing, 32:29-73.

    Google Scholar 

  • Binford, T. 1971. Visual perception by computer. IEEE Conf. on Systems and Control.

  • Breuel, T. 1991. Model based recognition using pruned correspondence search. IEEE Conf. on Computer Vision and Pattern Recognition, pp. 257-268.

  • Brooks, R. 1981. Symbolic reasoning among 3-D models and 2-D images. Artificial Intelligence, 17:285-348.

    Google Scholar 

  • Canny, J. 1983. A computational approach to edge detection. Trans. on Pattern Recognition and Machine Intelligence, 8(6):679-698.

    Google Scholar 

  • Cass, T. 1992. Polynomial time object recognition in the presence of clutter, occlusion and uncertainty. Second European Conf. on Computer Vision, pp. 834-842.

  • Conway, J.B. 1990. A Course in Functional Analysis. Springer-Verlag.

  • Coxeter, H.S.M. 1993. The Real Projective Plane. Springer-Verlag.

  • Darrell, T., Sclaroff, S., and Pentland, A. 1990. Segmentation by minimal description. IEEE Int. Conf. on Computer Vision, Japan, pp. 112-116.

  • Duda, R.O. and Hart, P.E. 1973. Pattern Classification and Scene Analysis. Wiley-Interscience Publication, John Wiley and Sons, Inc.

  • Dudani, S.A., Breeding, K.J., and McGhee, R.B. 1977. Aircraft identification by moments invariants. IEEE Trans. on Computations, C-26(1):39-46.

    Google Scholar 

  • Fischler, M.A. and Bolles, R.C. 1981. Random sample consensus: A paradigm for model fitting with application to image analysis and automated cartography. Com. of the A.C.M., 24(6):381-395.

    Google Scholar 

  • Forsyth, D., Mundy, J.L., Zisserman, A., Coelho, C., Heller, A., and Rothwell, C. 1991. Invariant descriptors for 3-D object recognition and pose. IEEE Trans. on Pattern Analysis and Machine Intelligence, 13(10):971-991.

    Google Scholar 

  • Hoffman, D.D. and Richards, W. 1985. Parts of recognition. Cognition, 18:65-96.

    Google Scholar 

  • Horaud, R. 1987. New methods for matching 3-D objects with single perspective views. IEEE Trans. Pattern Anal. Machine Intell., 9(3):401-412.

    Google Scholar 

  • Hu, M.K. 1962. Visual pattern recognition by moment invariants. IRE Trans. on Information Theory, IT-8:169-187.

    Google Scholar 

  • Huttenlocher, D.P. and Ullman, S. 1990. Recognizing solid objects by alignment with an image. Int. J. Computer Vision, 5(2):195- 212.

    Google Scholar 

  • Huttenlocher, D., Klanderman, G., and Rucklidge, W. 1993a. Comparing images using the Hausdorff distance. IEEE Trans. on Pattern Analysis and Machine Intelligence, 15(9):850-863.

    Google Scholar 

  • Huttenlocher, D., Noh, J., and Rucklidge, W. 1993b. Tracking non-rigid objects in complex scenes. 4th Int. Conf. on Computer Vision, pp. 93-101.

  • Jacobs, D. 1992. Space efficient 3D model indexing. IEEE Conf. on Computer Vision and Pattern Recognition, pp. 439-444.

  • Jacobs, D. 1996. Robust and efficient detection of convex groups. IEEE Trans. PAMI (18)1:23-37.

    Google Scholar 

  • Joshi, T., Ponce, J., Vijayakumar, B., and Kriegman, D. 1994. Hot curves for modelling and recognition of smooth curved 3D objects. IEEE Conf. on Computer Vision and Pattern Recognition, pp. 876- 880.

  • Koenderink, J. and van Doorn, A. 1991. Affine structure from motion. Journal of the Optical Society of America, 8(2):377-385.

    Google Scholar 

  • Kriegman, D. and Ponce, J. 1990. On recognizing and positioning curved 3-D objects from image contours. IEEE Trans. Pattern Anal. Machine Intell., 12(12):1127-1137.

    Google Scholar 

  • Lamdan, Y. and Wolfson, H.J. 1988. Geometric hashing: A general and efficient model-based recognition scheme. Second International Conf. Computer Vision, pp. 238-249.

  • Lamdan, Y., Schwartz, J.T., and Wolfson, H.J. 1990. Affine invariant model-based object recognition. IEEE Trans. Robotics and Automation, 6:578-589.

    Google Scholar 

  • Lowe, D. 1985. Perceptual Organization and Visual Recognition. Kluwer Academic Publishers: The Netherlands.

    Google Scholar 

  • Marr, D. and Nishihara, H. 1978. Representation and recognition of the spatial organization of three dimensional structure. Proceedings of the Royal Society of London B, 200:269-294.

    Google Scholar 

  • Pavlidis, T. and Horowitz, S. 1974. Segmentation of plane curves. IEEE Trans. on Computers, TC-23:860-870.

    Google Scholar 

  • Pentland, A. 1987. Recognition by parts. Proceedings of the First International Conf. on Computer Vision, pp. 612-620.

  • Persoon, E. and Fu, K.S. 1977. Shape descimination using Fourier descriptors. IEEE Trans. on Systems, Man and Cybernetics, 7:534- 541.

    Google Scholar 

  • Poelman, C.J. and Kanade, T. 1994. A paraperspective factorization method for shape and motion recovery. Proc. of European Conf. on Computer Vision.

  • Reeves, A.P., Prokop, R.J., Andrews, S.E., and Kuhl, F.P. 1988. Three-dimensional shape analysis using moments and Fourier descriptors. Trans. on Pattern Recognition and Machine Intelligence, 10(6):937-943.

    Google Scholar 

  • Richard, C.W. and Hemami, H. 1974. Identification of three dimensional objects using Fourier descriptors of the boundry curve. IEEE Trans. on Systems, Man and Cybernetics, 4(4):371- 378.

    Google Scholar 

  • Rothwell, C., Zisserman, A., Mundy, J., and Forsyth, D. 1992. Efficient model library access by projectively invariant indexing functions. IEEE Conf. on Computer Vision and Pattern Recognition, pp. 109-114.

  • Rothwell, C.A., Zisserman, A., Forsyth, D.A., and Mundy, J.L. 1992. Canonical frames for planar object recognition. Proc. of 2nd Eur. Conf. on Computer Vision, pp. 757-772.

  • Sadjadi, F.A. and Hall, E.L. 1980. Three-dimensional moment invariants. IEEE Trans. on Pattern Analysis and Machine Intelligence, 2(2):127-136.

    Google Scholar 

  • Subrahmonia, J., Cooper, D., and Keren, D. 1996. Practical reliable Bayesian recognition of 2D and 3D objects using implicit polynomials and algebraic invariants. IEEE Trans. on Pattern Analysis and Machine Intelligence, 18(5):505-519.

    Google Scholar 

  • Sugimoto, A. 1996. Object recognition by combining paraperspective images. Int. J. of Comp. Vis., 19(2):181-201.

    Google Scholar 

  • Thompson, D.W. and Mundy, J.L. 1987. Three dimensional model matching from an unconstrained viewpoint. Proc. of IEEE Int. Conf. on Robotics and Automation, pp. 208-220.

  • Tomasi, C. and Kanade, T. 1992. Shape and motion from image streams under orthography: A factorization method. International Journal of Computer Vision, 9(2):137-154.

    Google Scholar 

  • Ullman, S. and Basri, R. 1991. Recognition by linear combinations of models. IEEE Trans. on PAMI, 13(10):992- 1006.

    Google Scholar 

  • Weiss, I. 1993. Geometric invariants and object recognition. International Journal of Computer Vision, 10(3):207- 231.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Basri, R., Jacobs, D.W. Recognition Using Region Correspondences. International Journal of Computer Vision 25, 145–166 (1997). https://doi.org/10.1023/A:1007919917506

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1007919917506

Navigation