Modeling 3D Objects from Stereo Views and Recognizing Them in Photographs

Kushal, Akash; Ponce, Jean

doi:10.1007/11744047_43

Akash Kushal & Jean Ponce

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 3952)

Included in the following conference series:

European Conference on Computer Vision

Abstract

Local appearance models in the neighborhood of salient image features, together with local and/or global geometric constraints, serve as the basis for several recent and effective approaches to 3D object recognition from photographs. However, these techniques typically either fail to explicitly account for the strong geometric constraints associated with multiple images of the same 3D object, or require a large set of training images with much overlap to construct relatively sparse object models. This paper proposes a simple new method for automatically constructing 3D object models consisting of dense assemblies of small surface patches and affine-invariant descriptions of the corresponding texture patterns from a few (7 to 12) stereo pairs. Similar constraints are used to effectively identify instances of these models in highly cluttered photographs taken from arbitrary and unknown viewpoints. Experiments with a dataset consisting of 80 test images of 9 objects, including comparisons with a number of baseline algorithms, demonstrate the promise of the proposed approach.

Author information

Authors and Affiliations

Department of Computer Science, University of Illinois at Urbana-Champaign, USA
Akash Kushal & Jean Ponce
Département d’Informatique, Ecole Normale Supérieure, Paris, France
Jean Ponce

Editor information

Editors and Affiliations

University of Ljubljana, Slovenia
Aleš Leonardis
Institute for Computer Graphics and Vision, TU Graz, Inffeldgasse 16, 8010, Graz, Austria
Horst Bischof
Vision-based Measurement Group, Inst. of El. Measurement and Meas. Sign. Proc., Graz University of Technology, Austria
Axel Pinz

About this paper

Cite this paper

Kushal, A., Ponce, J. (2006). Modeling 3D Objects from Stereo Views and Recognizing Them in Photographs. In: Leonardis, A., Bischof, H., Pinz, A. (eds) Computer Vision – ECCV 2006. ECCV 2006. Lecture Notes in Computer Science, vol 3952. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11744047_43

DOI: https://doi.org/10.1007/11744047_43
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33834-5
Online ISBN: 978-3-540-33835-2
eBook Packages: Computer Science, Computer Science (R0)
