Skip to main content
Log in

Social image annotation via cross-domain subspace learning

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

In recent years, cross-domain learning algorithms have attracted much attention to solve labeled data insufficient problem. However, these cross-domain learning algorithms cannot be applied for subspace learning, which plays a key role in multimedia processing. This paper envisions the cross-domain discriminative subspace learning and provides an effective solution to cross-domain subspace learning. In particular, we propose the cross-domain discriminative locally linear embedding or CDLLE for short. CDLLE connects the training and the testing samples by minimizing the quadratic distance between the distribution of the training samples and that of the testing samples. Therefore, a common subspace for data representation can be preserved. We basically expect the discriminative information to separate the concepts in the training set can be shared to separate the concepts in the testing set as well and thus we have a chance to address above cross-domain problem duly. The margin maximization is duly adopted in CDLLE so the discriminative information for separating different classes can be well preserved. Finally, CDLLE encodes the local geometry of each training samples through a series of linear coefficients which can reconstruct a given sample by its intra-class neighbour samples and thus can locally preserve the intra-class local geometry. Experimental evidence on NUS-WIDE, a popular social image database collected from Flickr, and MSRA-MM, a popular real-world web image annotation database collected from the Internet by using Microsoft Live Search, demonstrates the effectiveness of CDLLE for real-world cross-domain applications.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  1. Belkin M, Niyogi P, Sindhwani V (2006) Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples. J Mach Learn Res 7:2399–2434

    MATH  MathSciNet  Google Scholar 

  2. Cai D, He X, Han J (2007) Semi-supervised discriminant analysis. IEEE International Conference on Computer Vision, pp. 1-7

  3. Caruana R (1997) Multitask learning. Mach Lear 28(1):41–75

    Article  MathSciNet  Google Scholar 

  4. Chua T-S, Tang J, Hong R, Li H, Luo Z, Zheng Y-T (2009) NUS-WIDE: A real-world web image database from national university of Singapore. ACM International Conference on Image and Video Retrieval, pp. 1-8

  5. Dai W, Yang Q, Xue G, Yu Y (2007) Boosting for transfer learning. Processing of the 24th international conference on Machine learning, pp. 193-200

  6. Duan L, Tsang IW, Xu D, Maybank SJ (2009) Domain transfer svm for video concept detection. Proceeding of the 21th conference on Computer Vision and Pattern Recognition

  7. Fisher RA (1936) The use of multiple measurements in taxonomic problems. Ann Eugen 7(2):179–188

    Article  Google Scholar 

  8. He X, Niyogi P (2003) Locality preserving projections. Adv Neural Inf Process Syst 16:1–8

    Google Scholar 

  9. Li H, Wang M, Hua X-S (2009) MSRA-MM 2.0: A large-scale web multimedia dataset. ICDM Workshop on Internet Multimedia Mining

  10. Ling X, Dai W, Xue G, Yang Q, Yu Y (2008) Spectral domain-transfer learning. Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 488-496

  11. Liu W, Tao D, Liu J (2008) Transductive component analysis. The 8th IEEE International Conference on Data Mining, pp. 433-442

  12. Liu D, Hua XS, Yang L, Wang M, Zhang H-J (2009) Tag ranking. International World Wide Web Conference (WWW)

  13. Mihalkova L, Mooney R (2006) Transfer learning with markov logic networks. ICML Workshop on Structural Knowledge Transfer for Machine Learning

  14. Pan J, Kwok JT, Yang Q (2008) Transfer learning via dimensionality reduction. Proceedings of the 23th AAAI Conference on Artificial Intelligence, pp. 677-682

  15. Parzen E (1962) On estimation of a probability density function and mode. Ann Math Stat 33(3):1065–1076

    Article  MATH  MathSciNet  Google Scholar 

  16. Roweis ST, Saul LK (2000) Nonlinear dimensionality reduction by locally linear embedding. Science 290:2323–2326

    Article  Google Scholar 

  17. Sebe N, Lew MS, Huijsmans DP (2000) Toward improved ranking metrics. IEEE Trans Pattern Anal Mach Intell 22(10):1132–1143

    Article  Google Scholar 

  18. Si S, Tao D, Chan KP Evolutionary cross-domain discriminative Hessian eigenmaps. IEEE Trans Image Process, to appear

  19. Si S, Tao D, Geng B Bregmann divergence based regularization for transfer subspace learning. IEEE Trans Knowl Data Eng to appear

  20. Snoek CG, Worring M, Smeulders AW (2005) Early versus late fusion in semantic video analysis. Proceeding of the 13th ACM international on Multimedia, pp. 399–402

  21. Snoek CGM, Worring M, Geusebroek JM, Koelma DC, Seinstra FJ, Smeulders AWM (2006) The semantic pathfinder: Using an authoring metaphor for generic multimedia indexing. IEEE Trans Pattern Anal Mach Intell 28(10):1678–1689

    Article  Google Scholar 

  22. Song D, Tao D Biologically inspired feature manifold for scene classification. IEEE Trans Image Process, to appear

  23. Tang J, Yan S, Hong R, Qi GJ, Chua TS (2009) Inferring semantic concepts from community-contributed images and noisy tags. Proceeding of the 17th ACM international on Multimedia, pp. 223–232

  24. Tao D, Li X, Wu X, Maybank SJ (2007) General tensor discriminant analysis and gabor features for gait recognition. IEEE Trans Pattern Anal Mach Intell 29(10):1700–1715

    Article  Google Scholar 

  25. Wang M, Hua XS (2008) Study on the combination of video concept detectors. Proceeding of the 16th ACM International Conference on Multimedia, pp. 47–650

  26. Wang M, Hua XS, Song Y, Yuan X, Li SP, Zhang HJ (2006) Automatic video annotation by semi-supervised learning with kernel density estimation. Proceeding of the 14th ACM International Conference on Multimedia, pp. 967–976

  27. Wang M, Hua XS, Yuan X, Song Y, Dai LR (2007) Optimizing multi-graph learning: Towards a unified video annotation scheme. Proceeding of the 15th International Conference on Multimedia, pp. 862-871

  28. Wang J, Jiang YG, Chang SF (2009) Label diagnosis through self tuning for web image search. Proceeding of the 21th conference on Computer Vision and Pattern Recognition

  29. Wang M, Yang K, Hua XS, Zhang H-J (2009) Visual tag dictionary: interpreting tags with visual words. ACM Workshop on Web-Scale Multimedia Corpus, in association with ACM MM

  30. Wu Z, Ke QF, Isard M, Sun J (2009) Bundling features for large scale partial-duplicate web image search. Proceeding of the 21th conference on Computer Vision and Pattern Recognition

  31. Yang J, Hauptmann AG (2008) A framework for classifier adaptation and its applications in concept detection. Proceedings of the 1st ACM SIGMM International Conference on Multimedia Information Retrieval, pp. 467–474

  32. Yang J, Yan R, Hauptmann AG (2007) Cross-domain video concept detection using adaptive svms. Proceeding of the 15th international conference on Multimedia, pp. 188-197

  33. Zhang T, Tao D, Yang J (2008) Discriminative locality alignment. Proceeding of the 10th European Conference on Computer Vision, pp. 725-738

  34. Zhang T, Tao D, Li X, Yang T (2008) A unifying framework for spectral analysis based dimensionality reduction. IEEE International Joint Conference on Neural Networks 1670-1677, June

  35. Zhang T, Tao D, Yang J (2009) Patch alignment for dimensionality reduction. IEEE Trans Knowl Data Eng 21(9):1299–1313

    Article  Google Scholar 

  36. Zheng V, Yang E, Yang Q, Xiang W, Shen D (2008) Transferring localization models over time. Proceedings of the 23th international conference on Artificial intelligence, pp. 1421-1426

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Si Si.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Si, S., Tao, D., Wang, M. et al. Social image annotation via cross-domain subspace learning. Multimed Tools Appl 56, 91–108 (2012). https://doi.org/10.1007/s11042-010-0567-2

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-010-0567-2

Keywords

Navigation