Augmenting Image Processing with Social Tag Mining for Landmark Recognition

  • Amogh Mahapatra
  • Xin Wan
  • Yonghong Tian
  • Jaideep Srivastava
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6523)


Social Multimedia computing is a new approach which combines the contextual information available in the social networks with available multimedia content to achieve greater accuracy in traditional multimedia problems like face and landmark recognition. Tian et al.[12] introduce this concept and suggest various fields where this approach yields significant benefits. In this paper, this approach has been applied to the landmark recognition problem. The dataset of was used to select a set of images for a given landmark. Then image processing techniques were applied on the images and text mining techniques were applied on the accompanying social metadata to determine independent rankings. These rankings were combined using models similar to meta search engines to develop an improved integrated ranking system. Experiments have shown that the recombination approach gives better results than the separate analysis.


Social Mutimedia Computing Landmark Recognition 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Wan, H.L., Chowdhury, M.U.: Image Semantic Classification by Using SVM. Journal of Software 14, 1891–1899 (2003)zbMATHGoogle Scholar
  2. 2.
    Zhang, D., Wong, A., Indrawan, M., Lu, G.: Content-based Image Retrieval Using Gabor Texture Feature. In: Proceedings of First IEEE Pacific-Rim Conference on Multimedia (PCM 2000), Sydney, Australia, pp. 392–395 (2000)Google Scholar
  3. 3.
    Lowe, D.: Distinctive Image Features from Scale-Invariant Keypoints. Int’l J. Computer Vision 2(60), 91–110 (2004)CrossRefGoogle Scholar
  4. 4.
    Kennedy, L.S., Naaman, M.: Generating diverse and representative image search results for landmarks. In: Proceeding of the 17th International Conference on World Wide Web, WWW 2008, pp. 297–306. ACM Press, New York (2008)Google Scholar
  5. 5.
    Beis, J., Lowe, D.G: Shape indexing using approximate nearest-neighbour search in high-dimensional spaces. In: Conference on Computer Vision and Pattern Recognition, pp. 1000–1006, Puerto Rico (1997)Google Scholar
  6. 6.
    Oztekin, B., Karypis, G., Kumar, V.: Expert Agreement and Content Based Reranking in a Meta-search Environment Using Mearf. In: Proceedings of the 11th International World Wide Web Conference, pp. 333–344, Honolulu, Hawaii, USA, May 7-11 (2002)Google Scholar
  7. 7.
    Wiguna, W.S., Fernández-Tébar, J.J., García-Serrano, A.: Using a Fuzzy Model for Combining Search Results from Different Information Sources to Build a Metasearch Engine. In: International Conference 9th Fuzzy Daysin Dortmund, Germany, pp. 325–334 (2006)Google Scholar
  8. 8.
    Zheng, Y., Zhao, M., Song, H., Adam, H., Buddemeier, U., Bissacco, A., Brucher, F., Chua, T., Neven, H.: Tour the World: building a web-scale landmark recognition engine. In: Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR (2009)Google Scholar
  9. 9.
    Kennedy, L., Naaman, M., Ahern, S., Nair, R., Rattenbury, T.: How flickr helps us make sense of the world: context and content in community-contributed media collections. In: Proceedings of the 15th International Conference on Multimedia, Augsburg, Germany, September 25-29, pp. 631–640 (2007)Google Scholar
  10. 10.
    Simon, I., Snavely, N., Seitz, S.M.: Scene summarization for online image collections. In: Proceedings of the 11th IEEE International Conference on Computer Vision. IEEE, Los Alamitos (2007)Google Scholar
  11. 11.
    Tsai, C., Qamra, A., Chang, E.Y., Wang, Y.: Extent: Inferring Image Metadata from Context and Content. In: IEEE International Conference on Multimedia and Expo., pp. 1270–1273 (2005)Google Scholar
  12. 12.
    Tian, Y., Srivastava, J., Huang, T., Contractor, N.: Social Multimedia Computing. In: Computer IEEE Computer Society Digital Library, June 30. IEEE Computer Society, Los Alamitos (2010)Google Scholar
  13. 13.
    Ramos, J.: Using TF-IDF to Determine Word Relevance in Document Queries .In: First International Conference on. Machine Learning (2003)Google Scholar
  14. 14.
    Hoffman, T.: Probabilistic Latent Semantic Indexing. In: Uncertainity in Artificial Intelligence, UAI 1999, Stockholm (1999)Google Scholar
  15. 15.
    Explore About Interestingness,
  16. 16.
    Yager, R.R.: Induced aggregation operators. Fuzzy Sets and Systems 137, 59–69 (2003)MathSciNetCrossRefzbMATHGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Amogh Mahapatra
    • 1
  • Xin Wan
    • 1
  • Yonghong Tian
    • 2
  • Jaideep Srivastava
    • 1
  1. 1.Department of CSUniversity of MinnesotaUSA
  2. 2.School of EE & CSPeking UniversityChina

Personalised recommendations