Abstract
Tags play a central role in text-based social image retrieval and browsing. However, the tags annotated by web users could be noisy, irrelevant, and often incomplete for describing the image contents, which may severely deteriorate the performance of text-based image retrieval models. In order to solve this problem, researchers have proposed techniques to rank the annotated tags of a social image according to their relevance to the visual content of the image. In this paper, we aim to overcome the challenge of social image tag ranking for a corpus of social images with rich user-generated tags by proposing a novel two-view learning approach. It can effectively exploit both textual and visual contents of social images to discover the complicated relationship between tags and images. Unlike the conventional learning approaches that usually assumes some parametric models, our method is completely data-driven and makes no assumption about the underlying models, making the proposed solution practically more effective. We formulate our method as an optimization task and present an efficient algorithm to solve it. To evaluate the efficacy of our method, we conducted an extensive set of experiments by applying our technique to both text-based social image retrieval and automatic image annotation tasks. Our empirical results showed that the proposed method can be more effective than the conventional approaches.
This book chapter is an extended version of the paper [35], which will appear at the fourth ACM Conference on Web Search and Data Mining (WSDM), Hong Kong, 2011.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Baeza-Yates, R.A., Ribeiro-Neto, B.A.: Modern Information Retrieval. ACM Press/Addison-Wesley, New York (1999)
Boyd, S., Vandenberghe, L.: Convex Optimization. Cambridge University Press, Cambridge (2004)
Cilibrasi, R., Vitányi, P.M.B.: The google similarity distance. IEEE Trans. Knowl. Data Eng. 19(3), 370–383 (2007)
Datta, R., Ge, W., Li, J., Wang, J.Z.: Toward bridging the annotation-retrieval gap in image search by a generative modeling approach. In: ACM Multimedia, pp. 977–986 (2006)
Farquhar, J.D.R., Hardoon, D.R., Meng, H., Shawe-Taylor, J., Szedmák, S.: Two view learning: Svm-2k, theory and practice. In: NIPS, 2005
Golder, S.A., Huberman, B.A.: The structure of collaborative tagging systems. J. Inf. Sci. 32, 198–208 (2005)
Hoi, S.C., Lyu, M.R.: Web image learning for searching semantic concepts in image databases. In: Proceedings of the 13th International World Wide Web Conference (WWW2004), New York City, US, May 17–22 2004
Hoi, S.C.H., Jin, R., Lyu, M.R.: Learning nonparametric kernel matrices from pairwise constraints. In: Proceedings of the 24th International Conference on Machine Learning. ICML’07, Corvalis, Oregon, pp. 361–368. ACM, New York (2007)
Jeon, J., Lavrenko, V., Manmatha, R.: Automatic image annotation and retrieval using cross-media relevance models. In: Proceedings of ACM Special Interest Group on Information Retrieval, pp. 119–126 (2003)
Jin, Y., Khan, L., Wang, L., Awad, M.: Image annotations by combining multiple evidence & wordnet. In: ACM Multimedia, pp. 706–715 (2005)
Lades, M., Vorbrüggen, J.C., Buhmann, J.M., Lange, J., von der Malsburg, C., Würtz, R.P., Konen, W.: Distortion invariant object recognition in the dynamic link architecture. IEEE Trans. Comput. 42(3), 300–311 (1993)
Li, X., Snoek, C.G.M., Worring, M.: Learning tag relevance by neighbor voting for social image retrieval. In: Multimedia Information Retrieval, pp. 180–187 (2008)
Liu, D., Wang, M., Yang, L., Xua, X.-S., Zhang, H.J.: Tag quality improvement for social images. In: Multimedia and Expo, pp. 350–353 (2009)
Liu, D., Hua, X.-S., Yang, L., Wang, M., Zhang, H.-J.: Tag ranking. In: WWW, pp. 351–360 (2009)
Liu, X., Ji, R., Yao, H., Xu, P., Sun, X., Liu, T.: Cross-media manifold learning for image retrieval & annotation. In: Multimedia Information Retrieval, pp. 141–148 (2008)
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)
Marlow, C., Naaman, M., Boyd, D., Davis, M.: Ht06, tagging paper, taxonomy, flickr, academic article, to read. In: HYPERTEXT ’06: Proceedings of the Seventeenth Conference on Hypertext and Hypermedia, pp. 31–40 (2006)
Ojala, T., Pietikäinen, M., Harwood, D.: A comparative study of texture measures with classification based on featured distributions. Pattern Recognit. 29(1), 51–59 (1996)
Sigurbjörnsson, B., van Zwol, R.: Flickr tag recommendation based on collective knowledge. In: WWW, pp. 327–336 (2008)
Spall, J.C.: Introduction to Stochastic Search and Optimization. Wiley, New York (2003)
Wallraven, C., Caputo, B., Graf, A.: Recognition with local features: the kernel recipe. In: ICCV ’03: Proceedings of the Ninth IEEE International Conference on Computer Vision, p. 257. IEEE Computer Society, Washington (2003)
Wang, C., Jing, F., Zhang, L., Zhang, H.: Image annotation refinement using random walk with restarts. In: ACM Multimedia, pp. 647–650 (2006)
Wang, C., Jing, F., Zhang, L., Zhang, H.-J.: Content-based image annotation refinement. In: CVPR, 2007
Wang, C., Zhang, L., Zhang, H.-J.: Learning to reduce the semantic gap in web image retrieval and annotation. In: Proceedings of ACM Special Interest Group on Information Retrieval, pp. 355–362 (2008)
Weinberger, K.Q., Slaney, M., van Zwol, R.: Resolving tag ambiguity. In: ACM Multimedia, pp. 111–120 (2008)
Wu, L., Hoi, S.C., Zhu, J., Jin, R., Yu, N.: Distance metric learning from uncertain side information with application to automated photo tagging. In: Proceedings of ACM International Conference on Multimedia (MM2009), Beijing, China, Oct. 19–24 2009
Wu, L., Hoi, S.C., Zhu, J., Jin, R., Yu, N.: Distance metric learning from uncertain side information for automated photo tagging. ACM Trans. Intell. Syst. Technol. 2(2), 13:1–13:28 (2011)
Wu, L., Yang, L., Yu, N., Hua, X.-S.: Learning to tag. In: WWW, pp. 361–370 (2009)
Wu, P., Hoi, S.C., Zhao, P., He, Y.: Mining social images with distance metric learning for automated image tagging. In: Fourth ACM International Conference on Web Search and Data Mining (WSDM2011), Hong Kong, 2011
Zhang, T.: Sequential greedy approximation for certain convex optimization problems. IEEE Trans. Inf. Theory 49(3), 682–691 (2003)
Zhao, Y., Karypis, G.: Empirical and theoretical comparisons of selected criterion functions for document clustering. Mach. Learn. 55(3), 311–331 (2004)
Zhu, J., Hoi, S.C., Lyu, M.R.: Face annotation by transductive kernel fisher discriminant. IEEE Trans. Multimed. 10(01), 86–96 (2008)
Zhu, J., Hoi, S.C.H., Lyu, M.R., Yan, S.: Near-duplicate keyframe retrieval by nonrigid image matching. In: ACM Multimedia, pp. 41–50 (2008)
Zhuang, J., Hoi, S.C.: Non-parametric kernel ranking approach for social image retrieval. In: ACM International Conference on Image and Video Retrieval (CIVR2010), Xian, PR China, 2010
Zhuang, J., Hoi, S.C.: A two-view learning approach for image tag ranking. In: Fourth ACM International Conference on Web Search and Data Mining (WSDM2011), Hong Kong, 2011
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag London Limited
About this chapter
Cite this chapter
Zhuang, J., Hoi, S.C.H. (2011). Social Image Tag Ranking by Two-View Learning. In: Hoi, S., Luo, J., Boll, S., Xu, D., Jin, R., King, I. (eds) Social Media Modeling and Computing. Springer, London. https://doi.org/10.1007/978-0-85729-436-4_3
Download citation
DOI: https://doi.org/10.1007/978-0-85729-436-4_3
Publisher Name: Springer, London
Print ISBN: 978-0-85729-435-7
Online ISBN: 978-0-85729-436-4
eBook Packages: Computer ScienceComputer Science (R0)