Skip to main content

Social Image Tag Ranking by Two-View Learning

  • Chapter
  • 1040 Accesses

Abstract

Tags play a central role in text-based social image retrieval and browsing. However, the tags annotated by web users could be noisy, irrelevant, and often incomplete for describing the image contents, which may severely deteriorate the performance of text-based image retrieval models. In order to solve this problem, researchers have proposed techniques to rank the annotated tags of a social image according to their relevance to the visual content of the image. In this paper, we aim to overcome the challenge of social image tag ranking for a corpus of social images with rich user-generated tags by proposing a novel two-view learning approach. It can effectively exploit both textual and visual contents of social images to discover the complicated relationship between tags and images. Unlike the conventional learning approaches that usually assumes some parametric models, our method is completely data-driven and makes no assumption about the underlying models, making the proposed solution practically more effective. We formulate our method as an optimization task and present an efficient algorithm to solve it. To evaluate the efficacy of our method, we conducted an extensive set of experiments by applying our technique to both text-based social image retrieval and automatic image annotation tasks. Our empirical results showed that the proposed method can be more effective than the conventional approaches.

This book chapter is an extended version of the paper [35], which will appear at the fourth ACM Conference on Web Search and Data Mining (WSDM), Hong Kong, 2011.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    http://www.flickr.com/.

  2. 2.

    http://www.flickr.com/.

  3. 3.

    http://glaros.dtc.umn.edu/gkhome/views/cluto/.

References

  1. Baeza-Yates, R.A., Ribeiro-Neto, B.A.: Modern Information Retrieval. ACM Press/Addison-Wesley, New York (1999)

    Google Scholar 

  2. Boyd, S., Vandenberghe, L.: Convex Optimization. Cambridge University Press, Cambridge (2004)

    MATH  Google Scholar 

  3. Cilibrasi, R., Vitányi, P.M.B.: The google similarity distance. IEEE Trans. Knowl. Data Eng. 19(3), 370–383 (2007)

    Article  Google Scholar 

  4. Datta, R., Ge, W., Li, J., Wang, J.Z.: Toward bridging the annotation-retrieval gap in image search by a generative modeling approach. In: ACM Multimedia, pp. 977–986 (2006)

    Google Scholar 

  5. Farquhar, J.D.R., Hardoon, D.R., Meng, H., Shawe-Taylor, J., Szedmák, S.: Two view learning: Svm-2k, theory and practice. In: NIPS, 2005

    Google Scholar 

  6. Golder, S.A., Huberman, B.A.: The structure of collaborative tagging systems. J. Inf. Sci. 32, 198–208 (2005)

    Article  Google Scholar 

  7. Hoi, S.C., Lyu, M.R.: Web image learning for searching semantic concepts in image databases. In: Proceedings of the 13th International World Wide Web Conference (WWW2004), New York City, US, May 17–22 2004

    Google Scholar 

  8. Hoi, S.C.H., Jin, R., Lyu, M.R.: Learning nonparametric kernel matrices from pairwise constraints. In: Proceedings of the 24th International Conference on Machine Learning. ICML’07, Corvalis, Oregon, pp. 361–368. ACM, New York (2007)

    Chapter  Google Scholar 

  9. Jeon, J., Lavrenko, V., Manmatha, R.: Automatic image annotation and retrieval using cross-media relevance models. In: Proceedings of ACM Special Interest Group on Information Retrieval, pp. 119–126 (2003)

    Google Scholar 

  10. Jin, Y., Khan, L., Wang, L., Awad, M.: Image annotations by combining multiple evidence & wordnet. In: ACM Multimedia, pp. 706–715 (2005)

    Google Scholar 

  11. Lades, M., Vorbrüggen, J.C., Buhmann, J.M., Lange, J., von der Malsburg, C., Würtz, R.P., Konen, W.: Distortion invariant object recognition in the dynamic link architecture. IEEE Trans. Comput. 42(3), 300–311 (1993)

    Article  Google Scholar 

  12. Li, X., Snoek, C.G.M., Worring, M.: Learning tag relevance by neighbor voting for social image retrieval. In: Multimedia Information Retrieval, pp. 180–187 (2008)

    Google Scholar 

  13. Liu, D., Wang, M., Yang, L., Xua, X.-S., Zhang, H.J.: Tag quality improvement for social images. In: Multimedia and Expo, pp. 350–353 (2009)

    Google Scholar 

  14. Liu, D., Hua, X.-S., Yang, L., Wang, M., Zhang, H.-J.: Tag ranking. In: WWW, pp. 351–360 (2009)

    Chapter  Google Scholar 

  15. Liu, X., Ji, R., Yao, H., Xu, P., Sun, X., Liu, T.: Cross-media manifold learning for image retrieval & annotation. In: Multimedia Information Retrieval, pp. 141–148 (2008)

    Google Scholar 

  16. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)

    Article  Google Scholar 

  17. Marlow, C., Naaman, M., Boyd, D., Davis, M.: Ht06, tagging paper, taxonomy, flickr, academic article, to read. In: HYPERTEXT ’06: Proceedings of the Seventeenth Conference on Hypertext and Hypermedia, pp. 31–40 (2006)

    Chapter  Google Scholar 

  18. Ojala, T., Pietikäinen, M., Harwood, D.: A comparative study of texture measures with classification based on featured distributions. Pattern Recognit. 29(1), 51–59 (1996)

    Article  Google Scholar 

  19. Sigurbjörnsson, B., van Zwol, R.: Flickr tag recommendation based on collective knowledge. In: WWW, pp. 327–336 (2008)

    Chapter  Google Scholar 

  20. Spall, J.C.: Introduction to Stochastic Search and Optimization. Wiley, New York (2003)

    Book  MATH  Google Scholar 

  21. Wallraven, C., Caputo, B., Graf, A.: Recognition with local features: the kernel recipe. In: ICCV ’03: Proceedings of the Ninth IEEE International Conference on Computer Vision, p. 257. IEEE Computer Society, Washington (2003)

    Chapter  Google Scholar 

  22. Wang, C., Jing, F., Zhang, L., Zhang, H.: Image annotation refinement using random walk with restarts. In: ACM Multimedia, pp. 647–650 (2006)

    Google Scholar 

  23. Wang, C., Jing, F., Zhang, L., Zhang, H.-J.: Content-based image annotation refinement. In: CVPR, 2007

    Google Scholar 

  24. Wang, C., Zhang, L., Zhang, H.-J.: Learning to reduce the semantic gap in web image retrieval and annotation. In: Proceedings of ACM Special Interest Group on Information Retrieval, pp. 355–362 (2008)

    Google Scholar 

  25. Weinberger, K.Q., Slaney, M., van Zwol, R.: Resolving tag ambiguity. In: ACM Multimedia, pp. 111–120 (2008)

    Google Scholar 

  26. Wu, L., Hoi, S.C., Zhu, J., Jin, R., Yu, N.: Distance metric learning from uncertain side information with application to automated photo tagging. In: Proceedings of ACM International Conference on Multimedia (MM2009), Beijing, China, Oct. 19–24 2009

    Google Scholar 

  27. Wu, L., Hoi, S.C., Zhu, J., Jin, R., Yu, N.: Distance metric learning from uncertain side information for automated photo tagging. ACM Trans. Intell. Syst. Technol. 2(2), 13:1–13:28 (2011)

    Google Scholar 

  28. Wu, L., Yang, L., Yu, N., Hua, X.-S.: Learning to tag. In: WWW, pp. 361–370 (2009)

    Chapter  Google Scholar 

  29. Wu, P., Hoi, S.C., Zhao, P., He, Y.: Mining social images with distance metric learning for automated image tagging. In: Fourth ACM International Conference on Web Search and Data Mining (WSDM2011), Hong Kong, 2011

    Google Scholar 

  30. Zhang, T.: Sequential greedy approximation for certain convex optimization problems. IEEE Trans. Inf. Theory 49(3), 682–691 (2003)

    Article  MATH  Google Scholar 

  31. Zhao, Y., Karypis, G.: Empirical and theoretical comparisons of selected criterion functions for document clustering. Mach. Learn. 55(3), 311–331 (2004)

    Article  MATH  Google Scholar 

  32. Zhu, J., Hoi, S.C., Lyu, M.R.: Face annotation by transductive kernel fisher discriminant. IEEE Trans. Multimed. 10(01), 86–96 (2008)

    Article  Google Scholar 

  33. Zhu, J., Hoi, S.C.H., Lyu, M.R., Yan, S.: Near-duplicate keyframe retrieval by nonrigid image matching. In: ACM Multimedia, pp. 41–50 (2008)

    Google Scholar 

  34. Zhuang, J., Hoi, S.C.: Non-parametric kernel ranking approach for social image retrieval. In: ACM International Conference on Image and Video Retrieval (CIVR2010), Xian, PR China, 2010

    Google Scholar 

  35. Zhuang, J., Hoi, S.C.: A two-view learning approach for image tag ranking. In: Fourth ACM International Conference on Web Search and Data Mining (WSDM2011), Hong Kong, 2011

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jinfeng Zhuang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag London Limited

About this chapter

Cite this chapter

Zhuang, J., Hoi, S.C.H. (2011). Social Image Tag Ranking by Two-View Learning. In: Hoi, S., Luo, J., Boll, S., Xu, D., Jin, R., King, I. (eds) Social Media Modeling and Computing. Springer, London. https://doi.org/10.1007/978-0-85729-436-4_3

Download citation

  • DOI: https://doi.org/10.1007/978-0-85729-436-4_3

  • Publisher Name: Springer, London

  • Print ISBN: 978-0-85729-435-7

  • Online ISBN: 978-0-85729-436-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics