Recognition Using Region Correspondences

Basri, Ronen; Jacobs, David W.

doi:10.1023/A:1007919917506

Recognition Using Region Correspondences

Published: November 1997

Volume 25, pages 145–166, (1997)
Cite this article

International Journal of Computer Vision Aims and scope Submit manuscript

Ronen Basri¹ &
David W. Jacobs²

278 Accesses
24 Citations
Explore all metrics

Abstract

Recognition systems attempt to recover information about the identity of observed objects and their location in the environment. A fundamental problem in recognition is pose estimation. This is the problem of using a correspondence between some portions of an object model and some portions of an image to determine whether the image contains an instance of the object, and, in case it does, to determine the transformation that relates the model to the image. The current approaches to this problem are divided into methods that use “global” properties of the object (e.g., centroid and moments of inertia) and methods that use “local” properties of the object (e.g., corners and line segments). Global properties are sensitive to occlusion and, specifically, to self occlusion. Local properties are difficult to locate reliably, and their matching involves intensive computation.

We present a novel method for recognition that uses region information. In our approach the model and the image are divided into regions. Given a match between subsets of regions (without any explicit correspondence between different pieces of the regions) the alignment transformation is computed. The method applies to planar objects under similarity, affine, and projective transformations and to projections of 3-D objects undergoing affine and projective transformations. The new approach combines many of the advantages of the previous two approaches, while avoiding some of their pitfalls. Like the global methods, our approach makes use of region information that reflects the true shape of the object. But like local methods, our approach can handle occlusion.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Alter, T.D. and Grimson, W.E.L. 1993. Fast and robust 3D recognition by alignment. In Proc. Fourth Inter. Conf. Computer Vision, pp. 113-120.
Alter, T.D. and Jacobs, D. 1994. Error propagation in full 3D-from-2Dobject recognition. IEEE Conf. on Computer Vision and Pattern Recognition, pp. 892-898.
Amenta, N. 1994. Bounded boxes, Hausdorff distance, and a new proof of an interesting Helly-type theorem. Proceedings of the 10th Annual ACM Symposium on Computational Geometry, pp. 340- 347.
Ayache, N. and Faugeras, O. 1986. HYPER: A new approach for the recognition and positioning of two-dimensional objects. IEEE Trans. on Pattern Analysis and Machine Intelligence, 8(1):44-54.
Google Scholar
Baird, H. 1985. Model-Based Image Matching Using Location. MIT Press: Cambridge.
Google Scholar
Bajcsy, R. and Solina, F. 1987. Three dimensional object representation revisited. Proc. of The First Int. Conf. on Computer Vision, London, pp. 231-240.
Basri, R. 1996. Paraperspective ≡ Affine. Int. J. of Comp. Vis., 19(2):169-179.
Google Scholar
Basri, R. and Ullman, S. 1993. The alignment of objects with smooth surfaces. Computer Vision, Graphics, and Image Processing: Image Understanding, 57(3):331-345.
Google Scholar
Basri, R. and Jacobs, D.W. 1994. Recognition using region correspondences. The Weizmann Institute of Science, T.R. CS95-33.
Basri, R. and Jacobs, D.W. 1995. Recognition using region correspondences. IEEE Int. Conf. on Computer Vision, pp. 8-15.
Biederman, I. 1985. Human image understanding: Recent research and a theory. Computer Graphics, Vision, and Image Processing, 32:29-73.
Google Scholar
Binford, T. 1971. Visual perception by computer. IEEE Conf. on Systems and Control.
Breuel, T. 1991. Model based recognition using pruned correspondence search. IEEE Conf. on Computer Vision and Pattern Recognition, pp. 257-268.
Brooks, R. 1981. Symbolic reasoning among 3-D models and 2-D images. Artificial Intelligence, 17:285-348.
Google Scholar
Canny, J. 1983. A computational approach to edge detection. Trans. on Pattern Recognition and Machine Intelligence, 8(6):679-698.
Google Scholar
Cass, T. 1992. Polynomial time object recognition in the presence of clutter, occlusion and uncertainty. Second European Conf. on Computer Vision, pp. 834-842.
Conway, J.B. 1990. A Course in Functional Analysis. Springer-Verlag.
Coxeter, H.S.M. 1993. The Real Projective Plane. Springer-Verlag.
Darrell, T., Sclaroff, S., and Pentland, A. 1990. Segmentation by minimal description. IEEE Int. Conf. on Computer Vision, Japan, pp. 112-116.
Duda, R.O. and Hart, P.E. 1973. Pattern Classification and Scene Analysis. Wiley-Interscience Publication, John Wiley and Sons, Inc.
Dudani, S.A., Breeding, K.J., and McGhee, R.B. 1977. Aircraft identification by moments invariants. IEEE Trans. on Computations, C-26(1):39-46.
Google Scholar
Fischler, M.A. and Bolles, R.C. 1981. Random sample consensus: A paradigm for model fitting with application to image analysis and automated cartography. Com. of the A.C.M., 24(6):381-395.
Google Scholar
Forsyth, D., Mundy, J.L., Zisserman, A., Coelho, C., Heller, A., and Rothwell, C. 1991. Invariant descriptors for 3-D object recognition and pose. IEEE Trans. on Pattern Analysis and Machine Intelligence, 13(10):971-991.
Google Scholar
Hoffman, D.D. and Richards, W. 1985. Parts of recognition. Cognition, 18:65-96.
Google Scholar
Horaud, R. 1987. New methods for matching 3-D objects with single perspective views. IEEE Trans. Pattern Anal. Machine Intell., 9(3):401-412.
Google Scholar
Hu, M.K. 1962. Visual pattern recognition by moment invariants. IRE Trans. on Information Theory, IT-8:169-187.
Google Scholar
Huttenlocher, D.P. and Ullman, S. 1990. Recognizing solid objects by alignment with an image. Int. J. Computer Vision, 5(2):195- 212.
Google Scholar
Huttenlocher, D., Klanderman, G., and Rucklidge, W. 1993a. Comparing images using the Hausdorff distance. IEEE Trans. on Pattern Analysis and Machine Intelligence, 15(9):850-863.
Google Scholar
Huttenlocher, D., Noh, J., and Rucklidge, W. 1993b. Tracking non-rigid objects in complex scenes. 4th Int. Conf. on Computer Vision, pp. 93-101.
Jacobs, D. 1992. Space efficient 3D model indexing. IEEE Conf. on Computer Vision and Pattern Recognition, pp. 439-444.
Jacobs, D. 1996. Robust and efficient detection of convex groups. IEEE Trans. PAMI (18)1:23-37.
Google Scholar
Joshi, T., Ponce, J., Vijayakumar, B., and Kriegman, D. 1994. Hot curves for modelling and recognition of smooth curved 3D objects. IEEE Conf. on Computer Vision and Pattern Recognition, pp. 876- 880.
Koenderink, J. and van Doorn, A. 1991. Affine structure from motion. Journal of the Optical Society of America, 8(2):377-385.
Google Scholar
Kriegman, D. and Ponce, J. 1990. On recognizing and positioning curved 3-D objects from image contours. IEEE Trans. Pattern Anal. Machine Intell., 12(12):1127-1137.
Google Scholar
Lamdan, Y. and Wolfson, H.J. 1988. Geometric hashing: A general and efficient model-based recognition scheme. Second International Conf. Computer Vision, pp. 238-249.
Lamdan, Y., Schwartz, J.T., and Wolfson, H.J. 1990. Affine invariant model-based object recognition. IEEE Trans. Robotics and Automation, 6:578-589.
Google Scholar
Lowe, D. 1985. Perceptual Organization and Visual Recognition. Kluwer Academic Publishers: The Netherlands.
Google Scholar
Marr, D. and Nishihara, H. 1978. Representation and recognition of the spatial organization of three dimensional structure. Proceedings of the Royal Society of London B, 200:269-294.
Google Scholar
Pavlidis, T. and Horowitz, S. 1974. Segmentation of plane curves. IEEE Trans. on Computers, TC-23:860-870.
Google Scholar
Pentland, A. 1987. Recognition by parts. Proceedings of the First International Conf. on Computer Vision, pp. 612-620.
Persoon, E. and Fu, K.S. 1977. Shape descimination using Fourier descriptors. IEEE Trans. on Systems, Man and Cybernetics, 7:534- 541.
Google Scholar
Poelman, C.J. and Kanade, T. 1994. A paraperspective factorization method for shape and motion recovery. Proc. of European Conf. on Computer Vision.
Reeves, A.P., Prokop, R.J., Andrews, S.E., and Kuhl, F.P. 1988. Three-dimensional shape analysis using moments and Fourier descriptors. Trans. on Pattern Recognition and Machine Intelligence, 10(6):937-943.
Google Scholar
Richard, C.W. and Hemami, H. 1974. Identification of three dimensional objects using Fourier descriptors of the boundry curve. IEEE Trans. on Systems, Man and Cybernetics, 4(4):371- 378.
Google Scholar
Rothwell, C., Zisserman, A., Mundy, J., and Forsyth, D. 1992. Efficient model library access by projectively invariant indexing functions. IEEE Conf. on Computer Vision and Pattern Recognition, pp. 109-114.
Rothwell, C.A., Zisserman, A., Forsyth, D.A., and Mundy, J.L. 1992. Canonical frames for planar object recognition. Proc. of 2nd Eur. Conf. on Computer Vision, pp. 757-772.
Sadjadi, F.A. and Hall, E.L. 1980. Three-dimensional moment invariants. IEEE Trans. on Pattern Analysis and Machine Intelligence, 2(2):127-136.
Google Scholar
Subrahmonia, J., Cooper, D., and Keren, D. 1996. Practical reliable Bayesian recognition of 2D and 3D objects using implicit polynomials and algebraic invariants. IEEE Trans. on Pattern Analysis and Machine Intelligence, 18(5):505-519.
Google Scholar
Sugimoto, A. 1996. Object recognition by combining paraperspective images. Int. J. of Comp. Vis., 19(2):181-201.
Google Scholar
Thompson, D.W. and Mundy, J.L. 1987. Three dimensional model matching from an unconstrained viewpoint. Proc. of IEEE Int. Conf. on Robotics and Automation, pp. 208-220.
Tomasi, C. and Kanade, T. 1992. Shape and motion from image streams under orthography: A factorization method. International Journal of Computer Vision, 9(2):137-154.
Google Scholar
Ullman, S. and Basri, R. 1991. Recognition by linear combinations of models. IEEE Trans. on PAMI, 13(10):992- 1006.
Google Scholar
Weiss, I. 1993. Geometric invariants and object recognition. International Journal of Computer Vision, 10(3):207- 231.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Applied Math., The Weizmann Inst. of Science, Rehovot, 76100, Israel
Ronen Basri
Nec Research Institute, 4 Independence Way, Princeton, NJ, 08540, USA
David W. Jacobs

Authors

Ronen Basri
View author publications
You can also search for this author in PubMed Google Scholar
David W. Jacobs
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Basri, R., Jacobs, D.W. Recognition Using Region Correspondences. International Journal of Computer Vision 25, 145–166 (1997). https://doi.org/10.1023/A:1007919917506

Download citation

Issue Date: November 1997
DOI: https://doi.org/10.1023/A:1007919917506

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Recognition Using Region Correspondences

Abstract

Access this article

Similar content being viewed by others

3D Representation for Object Detection and Verification

Adaptive locally affine-invariant shape matching

Image Matching Using Generalized Scale-Space Interest Points

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Navigation

Recognition Using Region Correspondences

Abstract

Access this article

Similar content being viewed by others

3D Representation for Object Detection and Verification

Adaptive locally affine-invariant shape matching

Image Matching Using Generalized Scale-Space Interest Points

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation