Social image annotation via cross-domain subspace learning

Si, Si; Tao, Dacheng; Wang, Meng; Chan, Kwok-Ping

doi:10.1007/s11042-010-0567-2

Social image annotation via cross-domain subspace learning

Published: 20 July 2010

Volume 56, pages 91–108, (2012)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Si Si¹,
Dacheng Tao²,
Meng Wang³ &
…
Kwok-Ping Chan¹

291 Accesses
9 Citations
6 Altmetric
Explore all metrics

Abstract

In recent years, cross-domain learning algorithms have attracted much attention to solve labeled data insufficient problem. However, these cross-domain learning algorithms cannot be applied for subspace learning, which plays a key role in multimedia processing. This paper envisions the cross-domain discriminative subspace learning and provides an effective solution to cross-domain subspace learning. In particular, we propose the cross-domain discriminative locally linear embedding or CDLLE for short. CDLLE connects the training and the testing samples by minimizing the quadratic distance between the distribution of the training samples and that of the testing samples. Therefore, a common subspace for data representation can be preserved. We basically expect the discriminative information to separate the concepts in the training set can be shared to separate the concepts in the testing set as well and thus we have a chance to address above cross-domain problem duly. The margin maximization is duly adopted in CDLLE so the discriminative information for separating different classes can be well preserved. Finally, CDLLE encodes the local geometry of each training samples through a series of linear coefficients which can reconstruct a given sample by its intra-class neighbour samples and thus can locally preserve the intra-class local geometry. Experimental evidence on NUS-WIDE, a popular social image database collected from Flickr, and MSRA-MM, a popular real-world web image annotation database collected from the Internet by using Microsoft Live Search, demonstrates the effectiveness of CDLLE for real-world cross-domain applications.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Belkin M, Niyogi P, Sindhwani V (2006) Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples. J Mach Learn Res 7:2399–2434
MATH MathSciNet Google Scholar
Cai D, He X, Han J (2007) Semi-supervised discriminant analysis. IEEE International Conference on Computer Vision, pp. 1-7
Caruana R (1997) Multitask learning. Mach Lear 28(1):41–75
Article MathSciNet Google Scholar
Chua T-S, Tang J, Hong R, Li H, Luo Z, Zheng Y-T (2009) NUS-WIDE: A real-world web image database from national university of Singapore. ACM International Conference on Image and Video Retrieval, pp. 1-8
Dai W, Yang Q, Xue G, Yu Y (2007) Boosting for transfer learning. Processing of the 24th international conference on Machine learning, pp. 193-200
Duan L, Tsang IW, Xu D, Maybank SJ (2009) Domain transfer svm for video concept detection. Proceeding of the 21th conference on Computer Vision and Pattern Recognition
Fisher RA (1936) The use of multiple measurements in taxonomic problems. Ann Eugen 7(2):179–188
Article Google Scholar
He X, Niyogi P (2003) Locality preserving projections. Adv Neural Inf Process Syst 16:1–8
Google Scholar
Li H, Wang M, Hua X-S (2009) MSRA-MM 2.0: A large-scale web multimedia dataset. ICDM Workshop on Internet Multimedia Mining
Ling X, Dai W, Xue G, Yang Q, Yu Y (2008) Spectral domain-transfer learning. Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 488-496
Liu W, Tao D, Liu J (2008) Transductive component analysis. The 8th IEEE International Conference on Data Mining, pp. 433-442
Liu D, Hua XS, Yang L, Wang M, Zhang H-J (2009) Tag ranking. International World Wide Web Conference (WWW)
Mihalkova L, Mooney R (2006) Transfer learning with markov logic networks. ICML Workshop on Structural Knowledge Transfer for Machine Learning
Pan J, Kwok JT, Yang Q (2008) Transfer learning via dimensionality reduction. Proceedings of the 23th AAAI Conference on Artificial Intelligence, pp. 677-682
Parzen E (1962) On estimation of a probability density function and mode. Ann Math Stat 33(3):1065–1076
Article MATH MathSciNet Google Scholar
Roweis ST, Saul LK (2000) Nonlinear dimensionality reduction by locally linear embedding. Science 290:2323–2326
Article Google Scholar
Sebe N, Lew MS, Huijsmans DP (2000) Toward improved ranking metrics. IEEE Trans Pattern Anal Mach Intell 22(10):1132–1143
Article Google Scholar
Si S, Tao D, Chan KP Evolutionary cross-domain discriminative Hessian eigenmaps. IEEE Trans Image Process, to appear
Si S, Tao D, Geng B Bregmann divergence based regularization for transfer subspace learning. IEEE Trans Knowl Data Eng to appear
Snoek CG, Worring M, Smeulders AW (2005) Early versus late fusion in semantic video analysis. Proceeding of the 13th ACM international on Multimedia, pp. 399–402
Snoek CGM, Worring M, Geusebroek JM, Koelma DC, Seinstra FJ, Smeulders AWM (2006) The semantic pathfinder: Using an authoring metaphor for generic multimedia indexing. IEEE Trans Pattern Anal Mach Intell 28(10):1678–1689
Article Google Scholar
Song D, Tao D Biologically inspired feature manifold for scene classification. IEEE Trans Image Process, to appear
Tang J, Yan S, Hong R, Qi GJ, Chua TS (2009) Inferring semantic concepts from community-contributed images and noisy tags. Proceeding of the 17th ACM international on Multimedia, pp. 223–232
Tao D, Li X, Wu X, Maybank SJ (2007) General tensor discriminant analysis and gabor features for gait recognition. IEEE Trans Pattern Anal Mach Intell 29(10):1700–1715
Article Google Scholar
Wang M, Hua XS (2008) Study on the combination of video concept detectors. Proceeding of the 16th ACM International Conference on Multimedia, pp. 47–650
Wang M, Hua XS, Song Y, Yuan X, Li SP, Zhang HJ (2006) Automatic video annotation by semi-supervised learning with kernel density estimation. Proceeding of the 14th ACM International Conference on Multimedia, pp. 967–976
Wang M, Hua XS, Yuan X, Song Y, Dai LR (2007) Optimizing multi-graph learning: Towards a unified video annotation scheme. Proceeding of the 15th International Conference on Multimedia, pp. 862-871
Wang J, Jiang YG, Chang SF (2009) Label diagnosis through self tuning for web image search. Proceeding of the 21th conference on Computer Vision and Pattern Recognition
Wang M, Yang K, Hua XS, Zhang H-J (2009) Visual tag dictionary: interpreting tags with visual words. ACM Workshop on Web-Scale Multimedia Corpus, in association with ACM MM
Wu Z, Ke QF, Isard M, Sun J (2009) Bundling features for large scale partial-duplicate web image search. Proceeding of the 21th conference on Computer Vision and Pattern Recognition
Yang J, Hauptmann AG (2008) A framework for classifier adaptation and its applications in concept detection. Proceedings of the 1st ACM SIGMM International Conference on Multimedia Information Retrieval, pp. 467–474
Yang J, Yan R, Hauptmann AG (2007) Cross-domain video concept detection using adaptive svms. Proceeding of the 15th international conference on Multimedia, pp. 188-197
Zhang T, Tao D, Yang J (2008) Discriminative locality alignment. Proceeding of the 10th European Conference on Computer Vision, pp. 725-738
Zhang T, Tao D, Li X, Yang T (2008) A unifying framework for spectral analysis based dimensionality reduction. IEEE International Joint Conference on Neural Networks 1670-1677, June
Zhang T, Tao D, Yang J (2009) Patch alignment for dimensionality reduction. IEEE Trans Knowl Data Eng 21(9):1299–1313
Article Google Scholar
Zheng V, Yang E, Yang Q, Xiang W, Shen D (2008) Transferring localization models over time. Proceedings of the 23th international conference on Artificial intelligence, pp. 1421-1426

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of Hong Kong, Pokfulam, Hong Kong
Si Si & Kwok-Ping Chan
School of Computer Engineering, Nanyang Technological University, Nanyang Avenue, Singapore
Dacheng Tao
Microsoft Research Asia, Beijing, 100080, China
Meng Wang

Authors

Si Si
View author publications
You can also search for this author in PubMed Google Scholar
Dacheng Tao
View author publications
You can also search for this author in PubMed Google Scholar
Meng Wang
View author publications
You can also search for this author in PubMed Google Scholar
Kwok-Ping Chan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Si Si.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Si, S., Tao, D., Wang, M. et al. Social image annotation via cross-domain subspace learning. Multimed Tools Appl 56, 91–108 (2012). https://doi.org/10.1007/s11042-010-0567-2

Download citation

Published: 20 July 2010
Issue Date: January 2012
DOI: https://doi.org/10.1007/s11042-010-0567-2

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Social image annotation via cross-domain subspace learning

Abstract

Access this article

Similar content being viewed by others

Domain Invariant Subspace Learning for Cross-Modal Retrieval

Distributed cross-media multiple binary subspace learning

Manifold transfer subspace learning based on double relaxed discriminative regression

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Social image annotation via cross-domain subspace learning

Abstract

Access this article

Similar content being viewed by others

Domain Invariant Subspace Learning for Cross-Modal Retrieval

Distributed cross-media multiple binary subspace learning

Manifold transfer subspace learning based on double relaxed discriminative regression

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation