Can Geotags Help Image Recognition?
In this paper, we propose to exploit geotags as additional information for visual recognition of consumer photos to improve its performance. Geotags, which represent places where the photos were taken, for photos can be obtained automatically by carrying a portable small GPS device with digital cameras. Geotags have potential to improve performance of visual image recognition, since recognition targets are unevenly distributed. For example, “beach” photos can be taken near the sea and “lion” photos can be taken only in a zoo except Africa.
To integrate geotag information into visual image recognition, we adopt two types of geographical information, raw values of latitude and longitude, and visual feature of aerial photos around the location the geotag represents. As classifiers, we use both a discriminative method and a generative method in the experiments.
The objective of this paper is to examine if geotags can help category-level image recognition. Note that we define an image recognition problem as deciding if an image is associated with a certain given concept such as “mountain” and “beach” in this paper. We propose a novel method to carry out geotagged image recognition in this paper. The experimental results demonstrate effectiveness of usage of geographical information for recognition of consumer photos.
KeywordsSupport Vector Machine Aerial Photo Visual Feature Latent Dirichlet Allocation Scale Invariant Feature Transform
- 2.Csurka, G., Bray, C., Dance, C., Fan, L.: Visual categorization with bags of keypoints. In: Proc. of ECCV Workshop on Statistical Learning in Computer Vision, pp. 59–74 (2004)Google Scholar
- 3.Fei-Fei, L., Perona, P.: A bayesian hierarchical model for learning natural scene categories. In: Proc. of IEEE Computer Vision and Pattern Recognition, pp. 524–531 (2005)Google Scholar
- 5.Kennedy, L., Naaman, M.: Generating diverse and representative image search results for landmarks. In: Proc. of the International World Wide Web Conference, pp. 297–306 (2008)Google Scholar
- 6.Lillesand, T.M., Kiefer, R.W., Chipman, J.W.: Remote sensing and image interpretation. John Wiley, Chichester (2004)Google Scholar
- 10.Sivic, J., Russell, B.C., Efros, A.A., Zisserman, A., Freeman, W.T.: Discovering objects and their localization in images. In: Proc. of IEEE International Conference on Computer Vision, pp. 370–377 (2005)Google Scholar
- 12.Varma, M., Ray, D.: Learning the discriminative power-invariance trade-off. In: Proc. of IEEE International Conference on Computer Vision, pp. 1150–1157 (2007)Google Scholar