Skip to main content
Log in

Partial label metric learning by collapsing classes

  • Original Article
  • Published:
International Journal of Machine Learning and Cybernetics Aims and scope Submit manuscript

Abstract

Partial label learning (PLL) is a weakly supervised learning framework proposed recently, in which the ground-truth label of training sample is not precisely annotated but concealed in a set of candidate labels, which makes the accuracy of the existing PLL algorithms is usually lower than that of the traditional supervised learning algorithms. Since the accuracy of a learning algorithm is usually closely related to its distance metric, the metric learning technologies can be employed to improve the accuracy of the existing PLL algorithms. However, only a few PLL metric learning algorithms have been proposed up to the present. In view of this, a novel PLL metric learning algorithm is proposed by using the collapsing classes model in this paper. The basic idea is first to take each training sample and its neighbor with shared candidate labels as a similar pair, while each training sample and its neighbor without shared candidate labels as a dissimilar pair, then two probability distributions are defined based on the distance and label similarity of these pairs, respectively, finally the metric matrix is obtained via minimizing the Kullback–Leibler divergence of these two probability distributions. Experimental results on six UCI data sets and four real-world PLL data sets show that the proposed algorithm can obviously improve the accuracy of the existing PLL algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

References

  1. Beygelzimer A, Langford J (2009) The offset tree for learning with partial labels. In: Proceedings of the 15th ACM sigkdd international conference on knowledge discovery and data mining. ACM, pp 129–138

  2. Boyd S, Vandenberghe L (2004) Convex optimization. Cambridge University Press, Cambridge

    MATH  Google Scholar 

  3. Chen CH, Patel VM, Chellappa R (2017) Learning from ambiguously labeled face images. IEEE Trans Pattern Anal Mach Intell 40(7):1653–1667

    Google Scholar 

  4. Chen YC, Patel VM, Chellappa R, Phillips PJ (2014) Ambiguously labeled learning using dictionaries. IEEE Trans Inf Forensics Secur 9(12):2076–2088

    Google Scholar 

  5. Côme E, Oukhellou L, Denoeux T, Aknin P (2009) Learning from partially supervised data using mixture models and belief functions. Pattern Recognit 42(3):334–348

    MATH  Google Scholar 

  6. Cour T, Sapp B, Taskar B (2011) Learning from partial labels. J Mach Learn Res 12(May):1501–1536

    MathSciNet  MATH  Google Scholar 

  7. Dua D, Graff C (2019) UCI machine learning repository. University of California, School of Information and Computer Science, Irvine, CA. http://archive.ics.uci.edu/ml

  8. Globerson A, Roweis ST (2006) Metric learning by collapsing classes. In: Advances in neural information processing systems, pp 451–458

  9. Goldberger J, Hinton GE, Roweis ST, Salakhutdinov RR (2005) Neighbourhood components analysis. In: Advances in neural information processing systems, pp 513–520

  10. Gong C, Liu T, Tang Y, Yang J, Yang J, Tao D (2017) A regularization approach for instance-based superset label learning. IEEE Trans Cybern 48(3):967–978

    Google Scholar 

  11. Hüllermeier E, Beringer J (2006) Learning from ambiguously labeled examples. Intell Data Anal 10(5):419–439

    MATH  Google Scholar 

  12. Li C, Zhang J, Chen Z (2013) Structured output learning with candidate labels for local parts. In: Joint European conference on machine learning and knowledge discovery in databases, pp 336–352. Springer

  13. Liu L, Dietterich TG (2012) A conditional multinomial mixture model for superset label learning. In: Advances in neural information processing systems, pp 548–556

  14. Liu Y, Gao Q, Han J, Wang S, Gao X (2019) Graph and autoencoder based feature extraction for zero-shot learning. In: Proceedings of the 28th international joint conference on artificial intelligence, pp 15–36

  15. Liu Y, Gao Q, Li J, Han J, Shao L (2018) Zero shot learning via low-rank embedded semantic autoencoder. In: Proceedings of the 27th international joint conference on artificial intelligence, pp 2490–2496

  16. Liu Y, Gao Q, Miao S, Gao X, Nie F, Li Y (2016) A non-greedy algorithm for L1-norm LDA. IEEE Trans Image Process 26(2):684–695

    MathSciNet  MATH  Google Scholar 

  17. Liu Y, Gao X, Gao Q, Han J, Shao L (2020) Label-activating framework for zero-shot learning. Neural Netw 121:1–9

    Google Scholar 

  18. Luo J, Orabona F (2010) Learning from candidate labeling sets. In: Advances in neural information processing systems, pp 1504–1512

  19. Lyu G, Feng S, Wang T, Lang C, Li Y (2019) GM-PLL: Graph matching based partial label learning. IEEE Trans Knowl Data Eng. https://doi.org/10.1109/TKDE.2019.2933837

    Article  Google Scholar 

  20. Nguyen N, Caruana R (2008) Classification with partial labels. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, pp 551–559

  21. Raykar VC, Yu S, Zhao LH, Valadez GH, Florin C, Bogoni L, Moy L (2010) Learning from crowds. J Mach Learn Res 11(Apr):1297–1322

    MathSciNet  Google Scholar 

  22. Song J, Liu H, Geng F, Zhang C (2016) Weakly-supervised classification of pulmonary nodules based on shape characters. In: the 14th international conference on dependable, autonomic and secure computing. IEEE, pp 228–232

  23. Tang CZ, Zhang ML (2017) Confidence-rated discriminative partial label learning. In: Thirty-first AAAI conference on artificial intelligence

  24. Verma Y, Jawahar C (2012) Image annotation using metric learning in semantic neighbourhoods. In: European conference on computer vision. Springer, pp 836–849

  25. Wang S, Jin R (2009) An information geometry approach for distance metric learning. In: Artificial intelligence and statistics, pp 591–598

  26. Weinberger KQ, Saul LK (2009) Distance metric learning for large margin nearest neighbor classification. J Mach Learn Res 10(Feb):207–244

    MATH  Google Scholar 

  27. Wisniewski G, Pécheux N, Gahbiche-Braham S, Yvon F (2014) Cross-lingual part-of-speech tagging through ambiguous learning. In: Proceedings of the 2014 conference on empirical methods in natural language processing, pp 1779–1785

  28. Wu J, Pan S, Zhu X, Zhang C, Wu X (2018) Multi-instance learning with discriminative bag mapping. IEEE Trans Knowl Data Eng 30(6):1065–1080

    Google Scholar 

  29. Xing EP, Jordan MI, Russell SJ, Ng AY (2003) Distance metric learning with application to clustering with side-information. In: Advances in neural information processing systems, pp 521–528

  30. Xu BC, Ting KM, Zhou ZH (2019) Isolation set-kernel and its application to multi-instance learning. In: Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining, pp 941–949

  31. Xu N, Lv J, Geng X (2019) Partial label learning via label enhancement. In: AAAI conference on artificial intelligence

  32. Yu F, Zhang ML (2016) Maximum margin partial label learning. In: Asian conference on machine learning, pp 96–111

  33. Zhang ML, Yu F (2015) Solving the partial label learning problem: An instance-based approach. In: Twenty-fourth international joint conference on artificial intelligence

  34. Zhang ML, Yu F, Tang CZ (2017) Disambiguation-free partial label learning. IEEE Trans Knowl Data Eng 29(10):2155–2167

    Google Scholar 

  35. Zhang ML, Zhou BB, Liu XY (2016) Partial label learning via feature-aware disambiguation. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 1335–1344

  36. Zhang S, Chai J (2018) Partial label learning algorithm based on maximum margin. Sci Technol Eng 18(28):109–115

    Google Scholar 

  37. Zhou Y, Gu H (2018) Geometric mean metric learning for partial label data. Neurocomputing 275:394–402

    Google Scholar 

  38. Zhou Y, He J, Gu H (2016) Partial label learning via gaussian processes. IEEE Trans Cybernet 47(12):4443–4450

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jianjun He.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xu, S., Yang, M., Zhou, Y. et al. Partial label metric learning by collapsing classes. Int. J. Mach. Learn. & Cyber. 11, 2453–2460 (2020). https://doi.org/10.1007/s13042-020-01129-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13042-020-01129-z

Keywords

Navigation