Modeling and Recognition of Landmark Image Collections Using Iconic Scene Graphs

Li, Xiaowei; Wu, Changchang; Zach, Christopher; Lazebnik, Svetlana; Frahm, Jan-Michael

doi:10.1007/978-3-540-88682-2_33

Xiaowei Li⁴,
Changchang Wu⁴,
Christopher Zach⁴,
Svetlana Lazebnik⁴ &
…
Jan-Michael Frahm⁴

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 5302))

Included in the following conference series:

European Conference on Computer Vision

9574 Accesses
128 Citations

Abstract

This paper presents an approach for modeling landmark sites such as the Statue of Liberty based on large-scale contaminated image collections gathered from the Internet. Our system combines 2D appearance and 3D geometric constraints to efficiently extract scene summaries, build 3D models, and recognize instances of the landmark in new test images. We start by clustering images using low-dimensional global “gist” descriptors. Next, we perform geometric verification to retain only the clusters whose images share a common 3D structure. Each valid cluster is then represented by a single iconic view, and geometric relationships between iconic views are captured by an iconic scene graph. In addition to serving as a compact scene summary, this graph is used to guide structure from motion to efficiently produce 3D models of the different aspects of the landmark. The set of iconic images is also used for recognition, i.e., determining whether new test images contain the landmark. Results on three data sets consisting of tens of thousands of images demonstrate the potential of the proposed approach.

Download to read the full chapter text

Chapter PDF

Bundling centre for landmark image discovery

Article 01 December 2015

Hierarchical Image Geo-location on a World-Wide Scale

Graph-Based Discriminative Learning for Location Recognition

Article 12 November 2014

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Fergus, R., Perona, P., Zisserman, A.: A visual category filter for Google images. In: Pajdla, T., Matas, J(G.) (eds.) ECCV 2004. LNCS, vol. 3024, pp. 242–256. Springer, Heidelberg (2004)
Chapter Google Scholar
Berg, T., Forsyth, D.: Animals on the web. In: CVPR (2006)
Google Scholar
Schroff, F., Criminisi, A., Zisserman, A.: Harvesting image databases from the web. In: ICCV (2007)
Google Scholar
Goesele, M., Snavely, N., Curless, B., Hoppe, H., Seitz, S.M.: Multi-view stereo for community photo collections. In: ICCV (2007)
Google Scholar
Snavely, N., Seitz, S.M., Szeliski, R.: Photo tourism: Exploring photo collections in 3d. In: SIGGRAPH, pp. 835–846 (2006)
Google Scholar
Ni, K., Steedly, D., Dellaert, F.: Out-of-core bundle adjustment for large-scale 3d reconstruction. In: ICCV (2007)
Google Scholar
Snavely, N., Seitz, S.M., Szeliski, R.: Skeletal sets for efficient structure from motion. In: CVPR (2008)
Google Scholar
Simon, I., Snavely, N., Seitz, S.M.: Scene summarization for online image collections. In: ICCV (2007)
Google Scholar
Berg, T.L., Forsyth, D.: Automatic ranking of iconic images. Technical report, University of California, Berkeley (2007)
Google Scholar
Chum, O., Philbin, J., Sivic, J., Isard, M., Zisserman, A.: Total recall: Automatic query expansion with a generative feature model for object retrieval. In: ICCV (2007)
Google Scholar
Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Lost in quantization: Improving particular object retrieval in large scale image databases. In: CVPR (2008)
Google Scholar
Oliva, A., Torralba, A.: Modeling the shape of the scene: a holistic representation of the spatial envelope. IJCV 42(3), 145–175 (2001)
Article MATH Google Scholar
Hays, J., Efros, A.A.: Scene completion using millions of photographs. In: SIGGRAPH (2007)
Google Scholar
Lowe, D.: Distinctive image features from scale-invariant keypoints. IJCV 60, 91–110 (2004)
Article Google Scholar
Frahm, J.M., Pollefeys, M.: RANSAC for (quasi-)degenerate data (QDEGSAC). In: CVPR, vol. 1, pp. 453–460 (2006)
Google Scholar
Nister, D., Stewenius, H.: Scalable recognition with a vocabulary tree. In: CVPR (2006)
Google Scholar
Shi, J., Malik, J.: Normalized cuts and image segmentation. PAMI 22, 888–905 (2000)
Article Google Scholar
Beder, C., Steffen, R.: Determining an initial image pair for fixing the scale of a 3d reconstruction from an image sequence. In: Proc. DAGM, pp. 657–666 (2006)
Google Scholar
Nistér, D.: An efficient solution to the five-point relative pose problem. PAMI 26, 756–770 (2004)
Article Google Scholar
Lourakis, M., Argyros, A.: The design and implementation of a generic sparse bundle adjustment software package based on the Levenberg-Marquardt algorithm. Technical Report 340, Institute of Computer Science - FORTH (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

Dept. of Computer Science, University of North Carolina, Chapel Hill, NC 27599-3175, USA
Xiaowei Li, Changchang Wu, Christopher Zach, Svetlana Lazebnik & Jan-Michael Frahm

Authors

Xiaowei Li
View author publications
You can also search for this author in PubMed Google Scholar
Changchang Wu
View author publications
You can also search for this author in PubMed Google Scholar
Christopher Zach
View author publications
You can also search for this author in PubMed Google Scholar
Svetlana Lazebnik
View author publications
You can also search for this author in PubMed Google Scholar
Jan-Michael Frahm
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Computer Science Department, University of Illinois at Urbana Champaign, 3310 Siebel Hall, Urbana, IL 61801, USA
David Forsyth
Department of Computing, Oxford Brookes University, OX33 1HX, Wheatley, Oxford, UK
Philip Torr
Department of Engineering Science, University of Oxford, Parks Road, OX1 3PJ, Oxford, UK
Andrew Zisserman

Electronic Supplementary Material

Supplementary material (16,307 KB)

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, X., Wu, C., Zach, C., Lazebnik, S., Frahm, JM. (2008). Modeling and Recognition of Landmark Image Collections Using Iconic Scene Graphs. In: Forsyth, D., Torr, P., Zisserman, A. (eds) Computer Vision – ECCV 2008. ECCV 2008. Lecture Notes in Computer Science, vol 5302. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88682-2_33

Download citation

DOI: https://doi.org/10.1007/978-3-540-88682-2_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88681-5
Online ISBN: 978-3-540-88682-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Modeling and Recognition of Landmark Image Collections Using Iconic Scene Graphs

Abstract

Chapter PDF

Similar content being viewed by others

Bundling centre for landmark image discovery

Hierarchical Image Geo-location on a World-Wide Scale

Graph-Based Discriminative Learning for Location Recognition

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Electronic Supplementary Material

Supplementary material (16,307 KB)

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Modeling and Recognition of Landmark Image Collections Using Iconic Scene Graphs

Abstract

Chapter PDF

Similar content being viewed by others

Bundling centre for landmark image discovery

Hierarchical Image Geo-location on a World-Wide Scale

Graph-Based Discriminative Learning for Location Recognition

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Electronic Supplementary Material

Supplementary material (16,307 KB)

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation