A Real-World System for Image/Video Geo-localization

  • Himaanshu Gupta
  • Yi Chen
  • Minwoo Park
  • Kiran Gunda
  • Gang Qian
  • Dave Conger
  • Khurram Shafique
Chapter
Part of the Advances in Computer Vision and Pattern Recognition book series (ACVPR)

Abstract

Determining where an image was taken and geo-locating depicted structures are important tasks from a surveillance and intelligence standpoint. For example, the image might show terrorist training facilities or the vicinity of a safe house. To geo-localize, the user must combine prior knowledge of the area with subtle clues from the image in order to mitigate the tedious manual search of GIS reference data. This process is extremely challenging, time-consuming, and often yields poor accuracy. In this chapter, we describe WALDO (Wide Area Localization of Depicted Objects), a system that solves this challenging problem by combining the insight of analysts with the power of automated analysis for Internet-scale, geo-location-driven data mining. WALDO’s goal-driven constrained resource management leverages a full spectrum of data-driven, semantic, and geometric geo-localization experts and user tools.

Keywords

Geographic Information System Digital Elevation Model Query Image Candidate List Inverted Index 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

  1. 1.
  2. 2.
  3. 3.
  4. 4.
  5. 5.
    Campbell J (1993) Evaluation of the dark-object subtraction technique for adjustment of multispectral remote-sensing data. In: Proceedings SPIE, vol 1819Google Scholar
  6. 6.
    Park M, Chen Y, Shafique K (2013) Tag configuration matcher for geo-tagging. In: ACM SIGSPATIALGoogle Scholar
  7. 7.
    Belongie S, Malik J, Puzicha J (2002) Shape matching and object recognition using shape contexts. In: IEEE PAMI, vol 24Google Scholar
  8. 8.
    Philbin J, Chum O, Isard M, Sivic J, Zisserman A (2007) Object retrieval with large vocabularies and fast spatial matching. In: CVPRGoogle Scholar
  9. 9.
    Babenko A, Lempitsky V (2012) The inverted multi-index. In: IEEE conference on computer vision and pattern recognition (CVPR)Google Scholar
  10. 10.
    Ge T, He K, Ke Q, Sun J (2013) Optimized product quantization for approximate nearest neighbor search. In: IEEE conference on computer vision and pattern recognition (CVPR)Google Scholar
  11. 11.
    Park M, Gunda K, Gupta H, Shafique K (2014) Optimized transform coding for approximate KNN search. In: BMVCGoogle Scholar
  12. 12.
    Qian G, Chen Y, Gupta H, Gunda K, Shafique K (2015) Camera geolocation from mountain image. In: FusionGoogle Scholar
  13. 13.
    Baatz G, Saurer O, Koser K, Pollefeys M (2012) Large scale visual geo-localization of images in mountainous terrain. In: ECCVGoogle Scholar
  14. 14.
    He K, Sun J, Tang X (2009) Single image haze removal using dark channel prior. In: CVPRGoogle Scholar
  15. 15.
    Barinova O, Lempitsky V, Tretiak E, Kohli P (2010) Geometric image parsing in man-made environments. In: European conference on computer visionGoogle Scholar
  16. 16.
    Xu Y, Oh S, Hoogs A (2013) A minimum error vanishing point detection approach for uncalibrated monocular images of man-made environments. In: IEEE conference on computer vision and pattern recognitionGoogle Scholar
  17. 17.
    Agarwal S, Snavely N, Simon I, Seitz S, Szeliski R (2009) Building Rome in a day. In: ICCVGoogle Scholar
  18. 18.
    Hinton G (1999) Products of experts. In: Proceedings of the ninth International Conference on Artificial Neural Networks (ICANN99)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2016

Authors and Affiliations

  • Himaanshu Gupta
    • 1
  • Yi Chen
    • 1
  • Minwoo Park
    • 1
  • Kiran Gunda
    • 1
  • Gang Qian
    • 1
  • Dave Conger
    • 1
  • Khurram Shafique
    • 1
  1. 1.Object Video, Inc.RestonUSA

Personalised recommendations