Skip to main content

Georeferenced Social Multimedia as Volunteered Geographic Information

  • Chapter
  • First Online:

Part of the book series: GeoJournal Library ((GEJL,volume 118))

Abstract

We argue that georeferenced social multimedia is really a form of volunteered geographic information. For example, community-contributed images and videos available at websites such as Flickr often indicate the location where they were acquired, and, thus, potentially contain a wealth of information about what-is-where on the surface of the Earth. The challenge is how to extract this information from these complex and noisy data, preferably in an automated fashion. We describe a novel analysis framework termed proximate sensing that makes progress towards this goal by using the visual content of georeferenced ground-level images and videos to extract and map geographically relevant information. We describe several geographic knowledge discovery contexts along with case studies where this new analysis paradigm has the potential to map phenomena not easily observable through other means, if at all.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    http://www.flickr.com.

  2. 2.

    http://www.trekearth.com.

  3. 3.

    http://www.treknature.com.

  4. 4.

    http://www.wikipedia.org.

  5. 5.

    http://www.merriam-webster.com.

  6. 6.

    https://www.iarpa.gov/index.php/research-programs/finder.

  7. 7.

    http://www.geograph.org.uk.

  8. 8.

    http://scenicornot.datasciencelab.co.uk/.

  9. 9.

    http://twitter.com.

  10. 10.

    http://www.facebook.com.

References

  • Anderson JR, Hardy EE, Roach JT, Witmer RE (1976) A land use and land cover classification system for use with remote sensor data. US Geological Survey Professional Paper (964)

    Google Scholar 

  • Ballan L, Bertini M, Bimbo A, Seidenari L, Serra G (2011) Event detection and recognition for semantic annotation of video. Multimed Tools Appl 51:279–302

    Article  Google Scholar 

  • Cao L, Luo J, Kautz H, Huang T (2008) Annotating collections of photos using hierarchical event and scene models. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–8

    Google Scholar 

  • Cao L, Yu J, Luo J, Huang TS (2009) Enhancing semantic and geographic annotation of web images via logistic canonical correlation regression. In: Proceedings of the ACM international conference on multimedia, pp 125–134

    Google Scholar 

  • Chen WC, Battestini A, Gelfand N, Setlur V (2009) Visual summaries of popular landmarks from community photo collections. In: Proceedings of the ACM international conference on multimedia, pp 789–792

    Google Scholar 

  • Crandall D, Backstrom L, Huttenlocher D, Kleinberg J (2009) Mapping the world’s photos. In: Proceedings of the international world wide web conference, pp 761–770

    Google Scholar 

  • Cristani M, Perina A, Castellani U, Murino V (2008) Geo-located image analysis using latent representations. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–8

    Google Scholar 

  • Divvala S, Hoiem D, Hays J, Efros A, Hebert M (2009) An empirical study of context in object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1271–1278

    Google Scholar 

  • Fisher P, Comber AJ, Wadsworth R (2005) Land use and land cover: contradiction or complement. In: Fisher P, Unwin DJ (eds) Re-presenting GIS. Wiley, pp 85–98

    Google Scholar 

  • Gallagher A, Joshi D, Yu J, Luo J (2009) Geo-location inference from image content and user tags. In: Proceedings of the IEEE conference on computer vision and pattern recognition, workshop on internet vision, pp 55–62

    Google Scholar 

  • Goodchild MF (2007) Citizens as sensors: the world of volunteered geography. GeoJournal 69(4):211–221

    Article  Google Scholar 

  • Griffin G, Holub A, Perona P (2007) Caltech-256 object category dataset. Technical Report 7694, California Institute of Technology

    Google Scholar 

  • Hays J, Efros A (2008) IM2GPS: estimating geographic information from a single image. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–8

    Google Scholar 

  • Hofmann T (1999) Probabilistic latent semantic indexing. In: Proceedings of the International ACM SIGIR conference on research and development in information retrieval, pp 50–57

    Google Scholar 

  • Hofmann T (2001) Unsupervised learning by probabilistic latent semantic analysis. Mach Learn 42(1–2):177–196

    Article  Google Scholar 

  • Jacobs N, Satkin S, Roman N, Speyer R, Pless R (2007) Geolocating static cameras. In: Proceedings of the IEEE international conference on computer vision, pp 1–6

    Google Scholar 

  • Jiang YG, Ngo CW, Yang J (2007) Towards optimal bag-of-features for object categorization and semantic video retrieval. In: Proceedings of ACM international conference on image and video retrieval, pp 494–510

    Google Scholar 

  • Jiang YG, Yanagawa A, Chang SF, Ngo CW (2008) CU-VIREO374: fusing Columbia374 and VIREO374 for large scale semantic concept detection. Technical Report, Columbia University ADVENT #223-2008-1

    Google Scholar 

  • Jiang YG, Yang J, Ngo CW, Hauptmann A (2010) Representations of keypoint-based semantic concept detection: a comprehensive study. IEEE Trans Multimed 12(1):42–53

    Article  Google Scholar 

  • Joshi D, Luo J (2008) Inferring generic activities and events from image content and bags of geo-tags. In: Proceedings of the international conference on content-based image and video retrieval, pp 37–46

    Google Scholar 

  • Kennedy L, Naaman M (2008) Generating diverse and representative image search results for landmarks. In: Proceedings of the international world wide web conference, pp 297–306

    Google Scholar 

  • Kennedy L, Naaman M, Ahern S, Nair R, Rattenbury T (2007) How Flickr helps us make sense of the world: context and content in community-contributed media collections. In: Proceedings of the ACM international conference on multimedia, pp 631–640

    Google Scholar 

  • Leung D, Newsam S (2009) Proximate sensing using georeferenced community contributed photo collections. In: Proceedings of the ACM SIGSPATIAL international conference on advances in geographic information systems: workshop on location based social networks, pp 57–64

    Google Scholar 

  • Leung D, Newsam S (2010) Proximate sensing: inferring what-is-where from georeferenced photo collections. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–8

    Google Scholar 

  • Leung D, Newsam S (2012) Exploring geotagged images for land-use classification. In: Proceedings of the ACM international conference on multimedia: workshop on geotagging and its applications in multimedia, pp 3–8

    Google Scholar 

  • Lowe DG (1999) Object recognition from local scale-invariant features. Proceedings of the IEEE international conference on computer vision 2:1150–1157

    Article  Google Scholar 

  • Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110

    Article  Google Scholar 

  • Manjunath BS, Ohm JR, Vasudevan VV, Yamada A (1998) Color and texture descriptors. IEEE Trans Circuits Syst Video Technol 11:703–715

    Article  Google Scholar 

  • Manjunath BS, Salembier P, Sikora T (eds) (2002) Introduction to MPEG-7: multimedia content description interface. John Wiley & Sons

    Google Scholar 

  • Moxley E, Kleban J, Manjunath BS (2008) SpiritTagger: A geo-aware tag suggestion tool mined from Flickr. In: Proceedings of the ACM international conference on multimedia information retrieval, pp 24–30

    Google Scholar 

  • Naaman M, Yeh RB, Garcia-Molina H, Paepcke A (2005) Leveraging context to resolve identity in photo albums. In: Proceedings of the ACM/IEEE-CS joint conference on digital libraries, pp 178–187

    Google Scholar 

  • Oliva A, Torralba A (2001) Modeling the shape of the scene: a holistic representation of the spatial envelope. Int J Comput Vis 42(3):145–175

    Article  Google Scholar 

  • Ponce J, Hebert M, Schmid C, Zisserman A (eds) (2006) Toward category-level object recognition, LNCS, vol 4170. Springer

    Google Scholar 

  • Quack T, Leibe B, Van Gool L (2008) World-scale mining of objects and events from community photo collections. In: Proceedings of the international conference on content-based image and video retrieval, pp 47–56

    Google Scholar 

  • Snoek CGM, Worring M, van Gemert JC, Geusebroek JM, Smeulders AWM (2006) The challenge problem for automated detection of 101 semantic concepts in multimedia. In: Proceedings of the ACM international conference on multimedia, pp 421–430

    Google Scholar 

  • Standard Land Use Coding Manual (1965) Standard Land use coding manual. urban renewal administration, housing and home finance agency and bureau of public roads, Department of Commerce

    Google Scholar 

  • Torralba A, Murphy KP, Freeman WT (2004) Sharing features: efficient boosting procedures for multiclass object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 762–769

    Google Scholar 

  • Xie L, Newsam S (2011) IM2MAP: Deriving maps from georeferenced community contributed photo collections. In: Proceedings of the ACM international conference on multimedia: workshop on social media, pp 29–34

    Google Scholar 

  • Xiong Z, Divakaran A, Peker KA, Radhakrishnan R, Cabasson R (2003) Video summarization using MPEG-7 motion activity and audio descriptors. In: ISO/IEC 21000-7 FDIS, information technology—multimedia framework—Part 7: digital item adaptation, Kluwer Academic Publishers

    Google Scholar 

  • Yanagawa A, Chang SF, Kennedy L, Hsu W (2007) Columbia University’s baseline detectors for 374 LSCOM semantic visual concepts. Technical Report, Columbia University ADVENT #222-2006-8

    Google Scholar 

  • Yanai K, Yaegashi K, Qiu B (2009) Detecting cultural differences using consumer-generated geotagged photos. In: Proceedings of the international workshop on location and the web

    Google Scholar 

  • Zheng YT, Zhao M, Song Y, Adam H, Buddemeier U, Bissacco A, Brucher F, Chua TS, Neven H (2009) Tour the world: building a web-scale landmark recognition engine. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1085–1092

    Google Scholar 

  • Zhu S, Wang G, Ngo CW, Jiang YG (2010) On the sampling of Web images for learning visual concept classifiers. In: Proceedings of the ACM international conference on image and video retrieval, pp 50–57

    Google Scholar 

Download references

Acknowledgements

This work was funded in part by an National Science Foundation CAREER grant (IIS-1150115) and a US Department of Energy Early Career Scientist and Engineer/PECASE award.

The Geograph Britain and Ireland images in Fig. 2 are copyright the following users (starting at the top right and proceeding clockwise): Andrew Abbott, Richard Law, Colin Smith, and L S Wilson. The Geograph Britain and Ireland images in Fig. 10 are copyright the following users (left to right): Andy Beecroft and Gordon Hatton. All the images are licensed under the Creative Commons Attribution-Share Alike 3.0 Unported License.

The Flickr images in Fig. 8 are copyright the following users (starting at the top and proceeding clockwise): D.H. Parks, Perfect Zero, Monica’s Dad, umjanedoan, michaelz1, asmythie, Max Braun, wabatson, Monica’s Dad, MaxVT, and zenra. The images are licensed under the Creative Commons Attribution-Share Alike 3.0 Unported License.

The maps in Figs. 2 and 8 are copyright OpenStreetMap contributors. The data is made available under the Open Database License and the cartography is licensed under the Creative Commons Attribution-Share Alike License.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shawn Newsam .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Science+Business Media B.V., part of Springer Nature

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Newsam, S., Leung, D. (2019). Georeferenced Social Multimedia as Volunteered Geographic Information. In: Wang, S., Goodchild, M. (eds) CyberGIS for Geospatial Discovery and Innovation. GeoJournal Library, vol 118. Springer, Dordrecht. https://doi.org/10.1007/978-94-024-1531-5_12

Download citation

Publish with us

Policies and ethics