International Journal of Computer Vision

, Volume 80, Issue 2, pp 189–210

Modeling the World from Internet Photo Collections

Article

DOI: 10.1007/s11263-007-0107-3

Cite this article as:
Snavely, N., Seitz, S.M. & Szeliski, R. Int J Comput Vis (2008) 80: 189. doi:10.1007/s11263-007-0107-3

Abstract

There are billions of photographs on the Internet, comprising the largest and most diverse photo collection ever assembled. How can computer vision researchers exploit this imagery? This paper explores this question from the standpoint of 3D scene modeling and visualization. We present structure-from-motion and image-based rendering algorithms that operate on hundreds of images downloaded as a result of keyword-based image search queries like “Notre Dame” or “Trevi Fountain.” This approach, which we call Photo Tourism, has enabled reconstructions of numerous well-known world sites. This paper presents these algorithms and results as a first step towards 3D modeling of the world’s well-photographed sites, cities, and landscapes from Internet imagery, and discusses key open problems and challenges for the research community.

Keywords

Structure from motion3D scene analysisInternet imageryPhoto browsers3D navigation

Copyright information

© Springer Science+Business Media, LLC 2007

Authors and Affiliations

  • Noah Snavely
    • 1
  • Steven M. Seitz
    • 1
  • Richard Szeliski
    • 2
  1. 1.University of WashingtonSeattleUSA
  2. 2.Microsoft ResearchRedmondUSA