Social Image Tag Ranking by Two-View Learning

Zhuang, Jinfeng; Hoi, Steven C. H.

doi:10.1007/978-0-85729-436-4_3

Social Image Tag Ranking by Two-View Learning

Jinfeng Zhuang⁷ &
Steven C. H. Hoi⁷

Chapter

1040 Accesses

Abstract

Tags play a central role in text-based social image retrieval and browsing. However, the tags annotated by web users could be noisy, irrelevant, and often incomplete for describing the image contents, which may severely deteriorate the performance of text-based image retrieval models. In order to solve this problem, researchers have proposed techniques to rank the annotated tags of a social image according to their relevance to the visual content of the image. In this paper, we aim to overcome the challenge of social image tag ranking for a corpus of social images with rich user-generated tags by proposing a novel two-view learning approach. It can effectively exploit both textual and visual contents of social images to discover the complicated relationship between tags and images. Unlike the conventional learning approaches that usually assumes some parametric models, our method is completely data-driven and makes no assumption about the underlying models, making the proposed solution practically more effective. We formulate our method as an optimization task and present an efficient algorithm to solve it. To evaluate the efficacy of our method, we conducted an extensive set of experiments by applying our technique to both text-based social image retrieval and automatic image annotation tasks. Our empirical results showed that the proposed method can be more effective than the conventional approaches.

This book chapter is an extended version of the paper [35], which will appear at the fourth ACM Conference on Web Search and Data Mining (WSDM), Hong Kong, 2011.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

References

Baeza-Yates, R.A., Ribeiro-Neto, B.A.: Modern Information Retrieval. ACM Press/Addison-Wesley, New York (1999)
Google Scholar
Boyd, S., Vandenberghe, L.: Convex Optimization. Cambridge University Press, Cambridge (2004)
MATH Google Scholar
Cilibrasi, R., Vitányi, P.M.B.: The google similarity distance. IEEE Trans. Knowl. Data Eng. 19(3), 370–383 (2007)
Article Google Scholar
Datta, R., Ge, W., Li, J., Wang, J.Z.: Toward bridging the annotation-retrieval gap in image search by a generative modeling approach. In: ACM Multimedia, pp. 977–986 (2006)
Google Scholar
Farquhar, J.D.R., Hardoon, D.R., Meng, H., Shawe-Taylor, J., Szedmák, S.: Two view learning: Svm-2k, theory and practice. In: NIPS, 2005
Google Scholar
Golder, S.A., Huberman, B.A.: The structure of collaborative tagging systems. J. Inf. Sci. 32, 198–208 (2005)
Article Google Scholar
Hoi, S.C., Lyu, M.R.: Web image learning for searching semantic concepts in image databases. In: Proceedings of the 13th International World Wide Web Conference (WWW2004), New York City, US, May 17–22 2004
Google Scholar
Hoi, S.C.H., Jin, R., Lyu, M.R.: Learning nonparametric kernel matrices from pairwise constraints. In: Proceedings of the 24th International Conference on Machine Learning. ICML’07, Corvalis, Oregon, pp. 361–368. ACM, New York (2007)
Chapter Google Scholar
Jeon, J., Lavrenko, V., Manmatha, R.: Automatic image annotation and retrieval using cross-media relevance models. In: Proceedings of ACM Special Interest Group on Information Retrieval, pp. 119–126 (2003)
Google Scholar
Jin, Y., Khan, L., Wang, L., Awad, M.: Image annotations by combining multiple evidence & wordnet. In: ACM Multimedia, pp. 706–715 (2005)
Google Scholar
Lades, M., Vorbrüggen, J.C., Buhmann, J.M., Lange, J., von der Malsburg, C., Würtz, R.P., Konen, W.: Distortion invariant object recognition in the dynamic link architecture. IEEE Trans. Comput. 42(3), 300–311 (1993)
Article Google Scholar
Li, X., Snoek, C.G.M., Worring, M.: Learning tag relevance by neighbor voting for social image retrieval. In: Multimedia Information Retrieval, pp. 180–187 (2008)
Google Scholar
Liu, D., Wang, M., Yang, L., Xua, X.-S., Zhang, H.J.: Tag quality improvement for social images. In: Multimedia and Expo, pp. 350–353 (2009)
Google Scholar
Liu, D., Hua, X.-S., Yang, L., Wang, M., Zhang, H.-J.: Tag ranking. In: WWW, pp. 351–360 (2009)
Chapter Google Scholar
Liu, X., Ji, R., Yao, H., Xu, P., Sun, X., Liu, T.: Cross-media manifold learning for image retrieval & annotation. In: Multimedia Information Retrieval, pp. 141–148 (2008)
Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)
Article Google Scholar
Marlow, C., Naaman, M., Boyd, D., Davis, M.: Ht06, tagging paper, taxonomy, flickr, academic article, to read. In: HYPERTEXT ’06: Proceedings of the Seventeenth Conference on Hypertext and Hypermedia, pp. 31–40 (2006)
Chapter Google Scholar
Ojala, T., Pietikäinen, M., Harwood, D.: A comparative study of texture measures with classification based on featured distributions. Pattern Recognit. 29(1), 51–59 (1996)
Article Google Scholar
Sigurbjörnsson, B., van Zwol, R.: Flickr tag recommendation based on collective knowledge. In: WWW, pp. 327–336 (2008)
Chapter Google Scholar
Spall, J.C.: Introduction to Stochastic Search and Optimization. Wiley, New York (2003)
Book MATH Google Scholar
Wallraven, C., Caputo, B., Graf, A.: Recognition with local features: the kernel recipe. In: ICCV ’03: Proceedings of the Ninth IEEE International Conference on Computer Vision, p. 257. IEEE Computer Society, Washington (2003)
Chapter Google Scholar
Wang, C., Jing, F., Zhang, L., Zhang, H.: Image annotation refinement using random walk with restarts. In: ACM Multimedia, pp. 647–650 (2006)
Google Scholar
Wang, C., Jing, F., Zhang, L., Zhang, H.-J.: Content-based image annotation refinement. In: CVPR, 2007
Google Scholar
Wang, C., Zhang, L., Zhang, H.-J.: Learning to reduce the semantic gap in web image retrieval and annotation. In: Proceedings of ACM Special Interest Group on Information Retrieval, pp. 355–362 (2008)
Google Scholar
Weinberger, K.Q., Slaney, M., van Zwol, R.: Resolving tag ambiguity. In: ACM Multimedia, pp. 111–120 (2008)
Google Scholar
Wu, L., Hoi, S.C., Zhu, J., Jin, R., Yu, N.: Distance metric learning from uncertain side information with application to automated photo tagging. In: Proceedings of ACM International Conference on Multimedia (MM2009), Beijing, China, Oct. 19–24 2009
Google Scholar
Wu, L., Hoi, S.C., Zhu, J., Jin, R., Yu, N.: Distance metric learning from uncertain side information for automated photo tagging. ACM Trans. Intell. Syst. Technol. 2(2), 13:1–13:28 (2011)
Google Scholar
Wu, L., Yang, L., Yu, N., Hua, X.-S.: Learning to tag. In: WWW, pp. 361–370 (2009)
Chapter Google Scholar
Wu, P., Hoi, S.C., Zhao, P., He, Y.: Mining social images with distance metric learning for automated image tagging. In: Fourth ACM International Conference on Web Search and Data Mining (WSDM2011), Hong Kong, 2011
Google Scholar
Zhang, T.: Sequential greedy approximation for certain convex optimization problems. IEEE Trans. Inf. Theory 49(3), 682–691 (2003)
Article MATH Google Scholar
Zhao, Y., Karypis, G.: Empirical and theoretical comparisons of selected criterion functions for document clustering. Mach. Learn. 55(3), 311–331 (2004)
Article MATH Google Scholar
Zhu, J., Hoi, S.C., Lyu, M.R.: Face annotation by transductive kernel fisher discriminant. IEEE Trans. Multimed. 10(01), 86–96 (2008)
Article Google Scholar
Zhu, J., Hoi, S.C.H., Lyu, M.R., Yan, S.: Near-duplicate keyframe retrieval by nonrigid image matching. In: ACM Multimedia, pp. 41–50 (2008)
Google Scholar
Zhuang, J., Hoi, S.C.: Non-parametric kernel ranking approach for social image retrieval. In: ACM International Conference on Image and Video Retrieval (CIVR2010), Xian, PR China, 2010
Google Scholar
Zhuang, J., Hoi, S.C.: A two-view learning approach for image tag ranking. In: Fourth ACM International Conference on Web Search and Data Mining (WSDM2011), Hong Kong, 2011
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Engineering, Nanyang Technological University, Singapore, Singapore
Jinfeng Zhuang & Steven C. H. Hoi

Authors

Jinfeng Zhuang
View author publications
You can also search for this author in PubMed Google Scholar
Steven C. H. Hoi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jinfeng Zhuang .

Editor information

Editors and Affiliations

School of Computer Engineering, Nanyang Technological University, Singapore, 639798, Singapore
Steven C.H. Hoi
Kodak Research Laboratories, Lake Avenue 1999, Rochester, 14650, New York, USA
Jiebo Luo
Media Informatics and Multimedia Systems, University of Oldenburg, Escherweg 2, Oldenburg, 26121, Germany
Susanne Boll
School of Computer Engineering, Nanyang Technological University, Singapore, 639798, Singapore
Dong Xu
Dept. Computer Science and Engineering, Michigan State University, Engineering Building 3115, East Lansing, 48824, Michigan, USA
Rong Jin
Dept. Computer Science and Engineering, The Chinese University of Hong Kong, Shatin, Hong Kong/PR China
Irwin King

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Zhuang, J., Hoi, S.C.H. (2011). Social Image Tag Ranking by Two-View Learning. In: Hoi, S., Luo, J., Boll, S., Xu, D., Jin, R., King, I. (eds) Social Media Modeling and Computing. Springer, London. https://doi.org/10.1007/978-0-85729-436-4_3

Download citation

DOI: https://doi.org/10.1007/978-0-85729-436-4_3
Publisher Name: Springer, London
Print ISBN: 978-0-85729-435-7
Online ISBN: 978-0-85729-436-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics