Using Geotags to Derive Rich Tag-Clouds for Image Annotation

Joshi, Dhiraj; Luo, Jiebo; Yu, Jie; Lei, Phoury; Gallagher, Andrew

doi:10.1007/978-0-85729-436-4_11


Abstract

Geotagging has become popular for many multimedia applications. In this chapter, we present an integrated and intuitive system for location-driven tag suggestion, in the form of tag clouds, for geotagged photos. Potential tags from multiple sources are extracted and weighted. Sources include points of interest (POI) tags from a public Geographic Names Information System (GNIS) database, community tags from Flickr® pictures, and personal tags shared through users’ own, family, and friends’ photo collections. To increase the effectiveness of GNIS POI tags, bags of place-name tags are first retrieved, clustered, and then re-ranked using a combined tf-idf and spatial distance criterion. The community tags from photos taken in the vicinity of the input geotagged photo are ranked according to distance and visual similarity to the input photo. Personal tags from other personally related photos inherently carry significant weight, because they are more relevant than both the generic place-name tags and the community tags, and are ranked by weights that decay with differences in time and distance. Finally, a rich set of the most relevant location-driven tags is presented to the user as individual tag clouds under the three source categories. The tag clouds act as intuitive suggestions for tagging an input image. We also discuss quantitative and qualitative findings from a user study that we conducted. The evaluation revealed the respective benefits of the three categories to the effectiveness of the integrated tag suggestion system.
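
The personal-tag ranking described in the abstract, with weights that decay over time and distance differences, could be sketched roughly as follows. This is only an illustration, not the chapter's actual method: the exponential form of the decay, the scale parameters `time_scale_days` and `dist_scale_km`, the photo record layout, and the function names are all assumptions, since the abstract does not give the weighting formula.

```python
import math
from collections import defaultdict


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two latitude/longitude points."""
    earth_radius_km = 6371.0
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * earth_radius_km * math.asin(math.sqrt(a))


def rank_personal_tags(query, photos, time_scale_days=30.0, dist_scale_km=5.0):
    """Score each tag by summing weights that shrink as a tagged photo gets
    further from the query photo in time and in space.

    Assumption: an exponential decay in both dimensions; the scale values
    are arbitrary placeholders, not values from the chapter.
    """
    scores = defaultdict(float)
    for photo in photos:
        dt_days = abs(query["day"] - photo["day"])
        dist_km = haversine_km(query["lat"], query["lon"], photo["lat"], photo["lon"])
        weight = math.exp(-dt_days / time_scale_days) * math.exp(-dist_km / dist_scale_km)
        for tag in photo["tags"]:
            scores[tag] += weight
    # Highest-scoring tags first; the scores could drive font size in a tag cloud.
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical data: one recent photo at the same spot, one old photo far away.
query = {"lat": 43.16, "lon": -77.61, "day": 100}
photos = [
    {"lat": 43.16, "lon": -77.61, "day": 99, "tags": ["birthday"]},
    {"lat": 40.58, "lon": -73.96, "day": 10, "tags": ["beach"]},
]
ranked = rank_personal_tags(query, photos)
```

In this sketch, a tag attached to a recent, nearby photo outranks one from an old, distant photo. A multiplicative combination is used so that a large gap in either time or distance suppresses the tag; an additive combination would be an equally plausible reading of the abstract.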



Author information

Authors and Affiliations

Corporate Research and Engineering, Eastman Kodak Company, Rochester, USA
Dhiraj Joshi, Jiebo Luo, Jie Yu, Phoury Lei & Andrew Gallagher


Corresponding author

Correspondence to Dhiraj Joshi.

Editor information

Editors and Affiliations

School of Computer Engineering, Nanyang Technological University, Singapore, 639798, Singapore
Steven C.H. Hoi
Kodak Research Laboratories, Lake Avenue 1999, Rochester, 14650, New York, USA
Jiebo Luo
Media Informatics and Multimedia Systems, University of Oldenburg, Escherweg 2, Oldenburg, 26121, Germany
Susanne Boll
School of Computer Engineering, Nanyang Technological University, Singapore, 639798, Singapore
Dong Xu
Dept. Computer Science and Engineering, Michigan State University, Engineering Building 3115, East Lansing, 48824, Michigan, USA
Rong Jin
Dept. Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, Hong Kong/PR China
Irwin King


About this chapter

Cite this chapter

Joshi, D., Luo, J., Yu, J., Lei, P., Gallagher, A. (2011). Using Geotags to Derive Rich Tag-Clouds for Image Annotation. In: Hoi, S., Luo, J., Boll, S., Xu, D., Jin, R., King, I. (eds) Social Media Modeling and Computing. Springer, London. https://doi.org/10.1007/978-0-85729-436-4_11


DOI: https://doi.org/10.1007/978-0-85729-436-4_11
Publisher Name: Springer, London
Print ISBN: 978-0-85729-435-7
Online ISBN: 978-0-85729-436-4
eBook Packages: Computer Science, Computer Science (R0)
