Abstract
We argue that georeferenced social multimedia is really a form of volunteered geographic information. For example, community-contributed images and videos available at websites such as Flickr often indicate the location where they were acquired, and, thus, potentially contain a wealth of information about what-is-where on the surface of the Earth. The challenge is how to extract this information from these complex and noisy data, preferably in an automated fashion. We describe a novel analysis framework termed proximate sensing that makes progress towards this goal by using the visual content of georeferenced ground-level images and videos to extract and map geographically relevant information. We describe several geographic knowledge discovery contexts along with case studies where this new analysis paradigm has the potential to map phenomena not easily observable through other means, if at all.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
- 10.
References
Anderson JR, Hardy EE, Roach JT, Witmer RE (1976) A land use and land cover classification system for use with remote sensor data. US Geological Survey Professional Paper (964)
Ballan L, Bertini M, Bimbo A, Seidenari L, Serra G (2011) Event detection and recognition for semantic annotation of video. Multimed Tools Appl 51:279–302
Cao L, Luo J, Kautz H, Huang T (2008) Annotating collections of photos using hierarchical event and scene models. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–8
Cao L, Yu J, Luo J, Huang TS (2009) Enhancing semantic and geographic annotation of web images via logistic canonical correlation regression. In: Proceedings of the ACM international conference on multimedia, pp 125–134
Chen WC, Battestini A, Gelfand N, Setlur V (2009) Visual summaries of popular landmarks from community photo collections. In: Proceedings of the ACM international conference on multimedia, pp 789–792
Crandall D, Backstrom L, Huttenlocher D, Kleinberg J (2009) Mapping the world’s photos. In: Proceedings of the international world wide web conference, pp 761–770
Cristani M, Perina A, Castellani U, Murino V (2008) Geo-located image analysis using latent representations. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–8
Divvala S, Hoiem D, Hays J, Efros A, Hebert M (2009) An empirical study of context in object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1271–1278
Fisher P, Comber AJ, Wadsworth R (2005) Land use and land cover: contradiction or complement. In: Fisher P, Unwin DJ (eds) Re-presenting GIS. Wiley, pp 85–98
Gallagher A, Joshi D, Yu J, Luo J (2009) Geo-location inference from image content and user tags. In: Proceedings of the IEEE conference on computer vision and pattern recognition, workshop on internet vision, pp 55–62
Goodchild MF (2007) Citizens as sensors: the world of volunteered geography. GeoJournal 69(4):211–221
Griffin G, Holub A, Perona P (2007) Caltech-256 object category dataset. Technical Report 7694, California Institute of Technology
Hays J, Efros A (2008) IM2GPS: estimating geographic information from a single image. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–8
Hofmann T (1999) Probabilistic latent semantic indexing. In: Proceedings of the International ACM SIGIR conference on research and development in information retrieval, pp 50–57
Hofmann T (2001) Unsupervised learning by probabilistic latent semantic analysis. Mach Learn 42(1–2):177–196
Jacobs N, Satkin S, Roman N, Speyer R, Pless R (2007) Geolocating static cameras. In: Proceedings of the IEEE international conference on computer vision, pp 1–6
Jiang YG, Ngo CW, Yang J (2007) Towards optimal bag-of-features for object categorization and semantic video retrieval. In: Proceedings of ACM international conference on image and video retrieval, pp 494–510
Jiang YG, Yanagawa A, Chang SF, Ngo CW (2008) CU-VIREO374: fusing Columbia374 and VIREO374 for large scale semantic concept detection. Technical Report, Columbia University ADVENT #223-2008-1
Jiang YG, Yang J, Ngo CW, Hauptmann A (2010) Representations of keypoint-based semantic concept detection: a comprehensive study. IEEE Trans Multimed 12(1):42–53
Joshi D, Luo J (2008) Inferring generic activities and events from image content and bags of geo-tags. In: Proceedings of the international conference on content-based image and video retrieval, pp 37–46
Kennedy L, Naaman M (2008) Generating diverse and representative image search results for landmarks. In: Proceedings of the international world wide web conference, pp 297–306
Kennedy L, Naaman M, Ahern S, Nair R, Rattenbury T (2007) How Flickr helps us make sense of the world: context and content in community-contributed media collections. In: Proceedings of the ACM international conference on multimedia, pp 631–640
Leung D, Newsam S (2009) Proximate sensing using georeferenced community contributed photo collections. In: Proceedings of the ACM SIGSPATIAL international conference on advances in geographic information systems: workshop on location based social networks, pp 57–64
Leung D, Newsam S (2010) Proximate sensing: inferring what-is-where from georeferenced photo collections. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1–8
Leung D, Newsam S (2012) Exploring geotagged images for land-use classification. In: Proceedings of the ACM international conference on multimedia: workshop on geotagging and its applications in multimedia, pp 3–8
Lowe DG (1999) Object recognition from local scale-invariant features. Proceedings of the IEEE international conference on computer vision 2:1150–1157
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110
Manjunath BS, Ohm JR, Vasudevan VV, Yamada A (1998) Color and texture descriptors. IEEE Trans Circuits Syst Video Technol 11:703–715
Manjunath BS, Salembier P, Sikora T (eds) (2002) Introduction to MPEG-7: multimedia content description interface. John Wiley & Sons
Moxley E, Kleban J, Manjunath BS (2008) SpiritTagger: A geo-aware tag suggestion tool mined from Flickr. In: Proceedings of the ACM international conference on multimedia information retrieval, pp 24–30
Naaman M, Yeh RB, Garcia-Molina H, Paepcke A (2005) Leveraging context to resolve identity in photo albums. In: Proceedings of the ACM/IEEE-CS joint conference on digital libraries, pp 178–187
Oliva A, Torralba A (2001) Modeling the shape of the scene: a holistic representation of the spatial envelope. Int J Comput Vis 42(3):145–175
Ponce J, Hebert M, Schmid C, Zisserman A (eds) (2006) Toward category-level object recognition, LNCS, vol 4170. Springer
Quack T, Leibe B, Van Gool L (2008) World-scale mining of objects and events from community photo collections. In: Proceedings of the international conference on content-based image and video retrieval, pp 47–56
Snoek CGM, Worring M, van Gemert JC, Geusebroek JM, Smeulders AWM (2006) The challenge problem for automated detection of 101 semantic concepts in multimedia. In: Proceedings of the ACM international conference on multimedia, pp 421–430
Standard Land Use Coding Manual (1965) Standard Land use coding manual. urban renewal administration, housing and home finance agency and bureau of public roads, Department of Commerce
Torralba A, Murphy KP, Freeman WT (2004) Sharing features: efficient boosting procedures for multiclass object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 762–769
Xie L, Newsam S (2011) IM2MAP: Deriving maps from georeferenced community contributed photo collections. In: Proceedings of the ACM international conference on multimedia: workshop on social media, pp 29–34
Xiong Z, Divakaran A, Peker KA, Radhakrishnan R, Cabasson R (2003) Video summarization using MPEG-7 motion activity and audio descriptors. In: ISO/IEC 21000-7 FDIS, information technology—multimedia framework—Part 7: digital item adaptation, Kluwer Academic Publishers
Yanagawa A, Chang SF, Kennedy L, Hsu W (2007) Columbia University’s baseline detectors for 374 LSCOM semantic visual concepts. Technical Report, Columbia University ADVENT #222-2006-8
Yanai K, Yaegashi K, Qiu B (2009) Detecting cultural differences using consumer-generated geotagged photos. In: Proceedings of the international workshop on location and the web
Zheng YT, Zhao M, Song Y, Adam H, Buddemeier U, Bissacco A, Brucher F, Chua TS, Neven H (2009) Tour the world: building a web-scale landmark recognition engine. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1085–1092
Zhu S, Wang G, Ngo CW, Jiang YG (2010) On the sampling of Web images for learning visual concept classifiers. In: Proceedings of the ACM international conference on image and video retrieval, pp 50–57
Acknowledgements
This work was funded in part by an National Science Foundation CAREER grant (IIS-1150115) and a US Department of Energy Early Career Scientist and Engineer/PECASE award.
The Geograph Britain and Ireland images in Fig. 2 are copyright the following users (starting at the top right and proceeding clockwise): Andrew Abbott, Richard Law, Colin Smith, and L S Wilson. The Geograph Britain and Ireland images in Fig. 10 are copyright the following users (left to right): Andy Beecroft and Gordon Hatton. All the images are licensed under the Creative Commons Attribution-Share Alike 3.0 Unported License.
The Flickr images in Fig. 8 are copyright the following users (starting at the top and proceeding clockwise): D.H. Parks, Perfect Zero, Monica’s Dad, umjanedoan, michaelz1, asmythie, Max Braun, wabatson, Monica’s Dad, MaxVT, and zenra. The images are licensed under the Creative Commons Attribution-Share Alike 3.0 Unported License.
The maps in Figs. 2 and 8 are copyright OpenStreetMap contributors. The data is made available under the Open Database License and the cartography is licensed under the Creative Commons Attribution-Share Alike License.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Science+Business Media B.V., part of Springer Nature
About this chapter
Cite this chapter
Newsam, S., Leung, D. (2019). Georeferenced Social Multimedia as Volunteered Geographic Information. In: Wang, S., Goodchild, M. (eds) CyberGIS for Geospatial Discovery and Innovation. GeoJournal Library, vol 118. Springer, Dordrecht. https://doi.org/10.1007/978-94-024-1531-5_12
Download citation
DOI: https://doi.org/10.1007/978-94-024-1531-5_12
Published:
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-024-1529-2
Online ISBN: 978-94-024-1531-5
eBook Packages: Earth and Environmental ScienceEarth and Environmental Science (R0)