Abstract
Despite recent advances in computer vision and large-scale indexing techniques, automatic geo-localization of images and videos remains a challenging task. The majority of existing computer vision solutions for geo-localization are limited to highly-visited urban regions for which a significant amount of geo-tagged imagery is available, and therefore, do not scale well to large and ordinary geo-spatial regions. In this chapter, we provide an overview of the major research themes in visual geo-localization, investigate the challenges, and point to problem areas that will benefit from common synthesis of perspectives from these research themes. In particular, we discuss how the availability of web-scale geo-referenced data affects visual geo-localization, what role semantic information plays in this problem, and how precise localization can be achieved using large-scale textured (RGB) and untextured (non-RGB) 3D models. We also introduce a few real-world applications which became feasible as a result of the capability of estimating an image’s geo-location. We conclude this chapter by providing an overview of the emerging trends in visual geo-localization and a summary of the rest of the chapters of the book.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Sheikh Y, Khan S, Shah M (2004) Feature-based georegistration of aerial images. GeoSensor Netw 4
Zitova B, Flusser J (2003) Image registration methods: a survey. Image Vis Comput 21(11): 977–1000
Kumar R, Sawhney H, Asmuth J, Pope A, Hsu S (1998) Registration of video to georeferenced imagery. In: Proceedings of fourteenth international conference on pattern recognition, vol 2, pp 1393–1400
Grant S, Brown M, Szeliski R (2007) City-scale location recognition. In: IEEE conference on computer vision and pattern recognition, CVPR’07
Zamir AR, Shah M (2014) Image geo-localization based on multiple nearest neighbor feature matching using generalized graphs. In: T-PAMI
Torii A, Sivic J, Pajdla T, Okutomi M (2013) Visual place recognition with repetitive structures. In: IEEE conference on computer vision and pattern recognition (CVPR)
Gronat P et al (2013) Learning and calibrating per-location classifiers for visual place recognition. In: IEEE conference on computer vision and pattern recognition (CVPR)
Knopp J, Sivic J, Pajdla T (2010) Avoiding confusing features in place recognition. Computer vision-ECCV. Springer, Heidelberg, pp 748–761.
Chen DM et al (2011) City-scale landmark identification on mobile devices. In: IEEE conference on computer vision and pattern recognition (CVPR)
Lin T-Y, Belongie S, Hays J (2013) Cross-view image geolocalization. In: IEEE conference on computer vision and pattern recognition (CVPR)
Hays J, Efros AA (2008) IM2GPS: estimating geographic information from a single image. In: IEEE conference on computer vision and pattern recognition (CVPR)
Vaca-Castano G, Zamir AR, Shah M (2012) City scale geo-spatial trajectory estimation of a moving camera. In: Computer vision and pattern recognition (CVPR)
Bansal M et al (2011) Geo-localization of street views with aerial image databases. In: Proceedings of the 19th ACM international conference on multimedia
Bansal M, Daniilidis K, Sawhney H (2012) Ultra-wide baseline facade matching for geo-localization computer vision-ECCV 2012. In: Workshops and demonstrations. Springer, Berlin
Doersch C et al (2012) What makes Paris look like Paris? ACM Trans Graph (TOG) 31(4):101
Lee YJ, Efros AA, Hebert M (2013) Style-aware mid-level representation for discovering visual connections in space and time. In: ICCV
Ardeshir S, Zamir AR, Shah M (2014) GIS-assisted object detection and geospatial localization. In: European conference on computer vision (ECCV)
Sullivan A. The view from your window contest. http://dish.andrewsullivan.com/vfyw-contest/
Wang C, Croitoru A, Stefanidis A, Agouris P (2007) Image-to-X registration using linear features and networks. In: FUZZ-IEEE
Wang C, Stefanidis A, Agouris P (2007) Relaxation matching for georegistration of aerial and satellite imagery. In: IEEE ICIP
Castaldo F, Zamir A, Angst R, Palmieri F, Savarese S (2015) The IEEE international conference on computer vision (ICCV) workshops, pp 9–17
Agarwal S, Snavely N, Simon I, Seitz SM, Szeliski R (2009) Building rome in a day. In: ICCV
Crandall D, Owens A, Snavely N, Huttenlocher D (2011) Discrete-continuous optimization for large-scale structure from motion. In: CVPR 2011. Best paper award runner-up
Snavely N, Simon I, Goesele M, Szeliski R, Seitz SM (2010) Scene reconstruction and visualization from community photo collections. In: Proceedings of the IEEE
Grzeszczuk R, Kosecka J, Hile H, Vedantham R (2009) Creating compact architectural models by geo-registering image collections. In: IEEE international workshop on 3-D digital imaging and modeling, ICCV
Li Y et al (2012) Worldwide pose estimation using 3d point clouds. Computer vision-ECCV 2012. Springer, Berlin, pp 15–29
Snavely N, Garg R, Seitz SM, Szeliski R (2008) Finding paths through the world’s photos, SIGGRAPH
Kosecka J, Zhang W (2007) Image based localization. IEEE Trans Rob
Li, Y, Snavely N, Huttenlocher DP (2010) Location recognition using prioritized feature matching. Computer vision-ECCV 2010. Springer, Berlin, pp 791–804
Baatz G et al (2012) Large scale visual geo-localization of images in mountainous terrain. Computer vision-ECCV 2012. Springer, Berlin, pp 517–530
Zakhor A et al (2013) User-driven geolocation of untagged desert imagery using digital elevation models
Baboud L et al (2011) Automatic photo-to-terrain alignment for the annotation of mountain pictures. In: IEEE conference on computer vision and pattern recognition (CVPR)
Snavely N, Seitz SM, Szeliski R (2006) Photo tourism: exploring photo collections in 3D. ACM Trans Graph (TOG) 25(3):835–846
Zamir AR, Dehghan A, Shah M (2013) Visual business recognition: a multimodal approach. In: Proceedings of the 21st ACM international conference on multimedia
Jacobs N, Miskell K, Pless R (2011) Webcam geo-localization using aggregate light levels. In: Applications of computer vision (WACV)
Jacobs N, Satkin S, Roman N, Speyer R, Pless R (2007) Geolocating static cameras. In: ICCV
Lalonde J-F, Narasimhan SG, Efros AA (2010) What do the sun and the sky tell us about the camera. IJCV 88(1):24–51
Zamir A, Shah M (2010) Accurate image localization based on google maps street view. Computer vision–ECCV 2010. Springer, Berlin, pp 255–268
Schindler G, Dellaert F (2012) 4D cities: analyzing, visualizing, and interacting with historical urban photo collections. J Multimed 7(2):124–131
Matzen K, Snavely N (2014) Scene chronology. Computer vision-ECCV 2014. Springer International Publishing, pp 615–630
Torii A et al (2015) 24/7 place recognition by view synthesis. In: CVPR 2015–28th IEEE conference on computer vision and pattern recognition
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems
Zhou B et al (2014) Learning deep features for scene recognition using places database. In: Advances in neural information processing systems
Eigen D, Puhrsch C, Fergus R (2014) Depth map prediction from a single image using a multi-scale deep network. In: Advances in neural information processing systems
Lin T-Y et al (2015) Learning deep representations for ground-to-aerial geolocalization. In: Proceedings of the IEEE conference on computer vision and pattern recognition
Workman S, Souvenir R, Jacobs N (2015) Wide-area image geolocalization with aerial reference imagery. arXiv:1510.03743
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Zamir, A.R., Hakeem, A., Van Gool, L., Shah, M., Szeliski, R. (2016). Introduction to Large-Scale Visual Geo-localization. In: Zamir, A., Hakeem, A., Van Gool, L., Shah, M., Szeliski, R. (eds) Large-Scale Visual Geo-Localization. Advances in Computer Vision and Pattern Recognition. Springer, Cham. https://doi.org/10.1007/978-3-319-25781-5_1
Download citation
DOI: https://doi.org/10.1007/978-3-319-25781-5_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25779-2
Online ISBN: 978-3-319-25781-5
eBook Packages: Computer ScienceComputer Science (R0)