Abstract
Partial label learning (PLL) is a weakly supervised learning framework proposed recently, in which the ground-truth label of training sample is not precisely annotated but concealed in a set of candidate labels, which makes the accuracy of the existing PLL algorithms is usually lower than that of the traditional supervised learning algorithms. Since the accuracy of a learning algorithm is usually closely related to its distance metric, the metric learning technologies can be employed to improve the accuracy of the existing PLL algorithms. However, only a few PLL metric learning algorithms have been proposed up to the present. In view of this, a novel PLL metric learning algorithm is proposed by using the collapsing classes model in this paper. The basic idea is first to take each training sample and its neighbor with shared candidate labels as a similar pair, while each training sample and its neighbor without shared candidate labels as a dissimilar pair, then two probability distributions are defined based on the distance and label similarity of these pairs, respectively, finally the metric matrix is obtained via minimizing the Kullback–Leibler divergence of these two probability distributions. Experimental results on six UCI data sets and four real-world PLL data sets show that the proposed algorithm can obviously improve the accuracy of the existing PLL algorithms.
Similar content being viewed by others
References
Beygelzimer A, Langford J (2009) The offset tree for learning with partial labels. In: Proceedings of the 15th ACM sigkdd international conference on knowledge discovery and data mining. ACM, pp 129–138
Boyd S, Vandenberghe L (2004) Convex optimization. Cambridge University Press, Cambridge
Chen CH, Patel VM, Chellappa R (2017) Learning from ambiguously labeled face images. IEEE Trans Pattern Anal Mach Intell 40(7):1653–1667
Chen YC, Patel VM, Chellappa R, Phillips PJ (2014) Ambiguously labeled learning using dictionaries. IEEE Trans Inf Forensics Secur 9(12):2076–2088
Côme E, Oukhellou L, Denoeux T, Aknin P (2009) Learning from partially supervised data using mixture models and belief functions. Pattern Recognit 42(3):334–348
Cour T, Sapp B, Taskar B (2011) Learning from partial labels. J Mach Learn Res 12(May):1501–1536
Dua D, Graff C (2019) UCI machine learning repository. University of California, School of Information and Computer Science, Irvine, CA. http://archive.ics.uci.edu/ml
Globerson A, Roweis ST (2006) Metric learning by collapsing classes. In: Advances in neural information processing systems, pp 451–458
Goldberger J, Hinton GE, Roweis ST, Salakhutdinov RR (2005) Neighbourhood components analysis. In: Advances in neural information processing systems, pp 513–520
Gong C, Liu T, Tang Y, Yang J, Yang J, Tao D (2017) A regularization approach for instance-based superset label learning. IEEE Trans Cybern 48(3):967–978
Hüllermeier E, Beringer J (2006) Learning from ambiguously labeled examples. Intell Data Anal 10(5):419–439
Li C, Zhang J, Chen Z (2013) Structured output learning with candidate labels for local parts. In: Joint European conference on machine learning and knowledge discovery in databases, pp 336–352. Springer
Liu L, Dietterich TG (2012) A conditional multinomial mixture model for superset label learning. In: Advances in neural information processing systems, pp 548–556
Liu Y, Gao Q, Han J, Wang S, Gao X (2019) Graph and autoencoder based feature extraction for zero-shot learning. In: Proceedings of the 28th international joint conference on artificial intelligence, pp 15–36
Liu Y, Gao Q, Li J, Han J, Shao L (2018) Zero shot learning via low-rank embedded semantic autoencoder. In: Proceedings of the 27th international joint conference on artificial intelligence, pp 2490–2496
Liu Y, Gao Q, Miao S, Gao X, Nie F, Li Y (2016) A non-greedy algorithm for L1-norm LDA. IEEE Trans Image Process 26(2):684–695
Liu Y, Gao X, Gao Q, Han J, Shao L (2020) Label-activating framework for zero-shot learning. Neural Netw 121:1–9
Luo J, Orabona F (2010) Learning from candidate labeling sets. In: Advances in neural information processing systems, pp 1504–1512
Lyu G, Feng S, Wang T, Lang C, Li Y (2019) GM-PLL: Graph matching based partial label learning. IEEE Trans Knowl Data Eng. https://doi.org/10.1109/TKDE.2019.2933837
Nguyen N, Caruana R (2008) Classification with partial labels. In: Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, pp 551–559
Raykar VC, Yu S, Zhao LH, Valadez GH, Florin C, Bogoni L, Moy L (2010) Learning from crowds. J Mach Learn Res 11(Apr):1297–1322
Song J, Liu H, Geng F, Zhang C (2016) Weakly-supervised classification of pulmonary nodules based on shape characters. In: the 14th international conference on dependable, autonomic and secure computing. IEEE, pp 228–232
Tang CZ, Zhang ML (2017) Confidence-rated discriminative partial label learning. In: Thirty-first AAAI conference on artificial intelligence
Verma Y, Jawahar C (2012) Image annotation using metric learning in semantic neighbourhoods. In: European conference on computer vision. Springer, pp 836–849
Wang S, Jin R (2009) An information geometry approach for distance metric learning. In: Artificial intelligence and statistics, pp 591–598
Weinberger KQ, Saul LK (2009) Distance metric learning for large margin nearest neighbor classification. J Mach Learn Res 10(Feb):207–244
Wisniewski G, Pécheux N, Gahbiche-Braham S, Yvon F (2014) Cross-lingual part-of-speech tagging through ambiguous learning. In: Proceedings of the 2014 conference on empirical methods in natural language processing, pp 1779–1785
Wu J, Pan S, Zhu X, Zhang C, Wu X (2018) Multi-instance learning with discriminative bag mapping. IEEE Trans Knowl Data Eng 30(6):1065–1080
Xing EP, Jordan MI, Russell SJ, Ng AY (2003) Distance metric learning with application to clustering with side-information. In: Advances in neural information processing systems, pp 521–528
Xu BC, Ting KM, Zhou ZH (2019) Isolation set-kernel and its application to multi-instance learning. In: Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining, pp 941–949
Xu N, Lv J, Geng X (2019) Partial label learning via label enhancement. In: AAAI conference on artificial intelligence
Yu F, Zhang ML (2016) Maximum margin partial label learning. In: Asian conference on machine learning, pp 96–111
Zhang ML, Yu F (2015) Solving the partial label learning problem: An instance-based approach. In: Twenty-fourth international joint conference on artificial intelligence
Zhang ML, Yu F, Tang CZ (2017) Disambiguation-free partial label learning. IEEE Trans Knowl Data Eng 29(10):2155–2167
Zhang ML, Zhou BB, Liu XY (2016) Partial label learning via feature-aware disambiguation. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. ACM, pp 1335–1344
Zhang S, Chai J (2018) Partial label learning algorithm based on maximum margin. Sci Technol Eng 18(28):109–115
Zhou Y, Gu H (2018) Geometric mean metric learning for partial label data. Neurocomputing 275:394–402
Zhou Y, He J, Gu H (2016) Partial label learning via gaussian processes. IEEE Trans Cybernet 47(12):4443–4450
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Xu, S., Yang, M., Zhou, Y. et al. Partial label metric learning by collapsing classes. Int. J. Mach. Learn. & Cyber. 11, 2453–2460 (2020). https://doi.org/10.1007/s13042-020-01129-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13042-020-01129-z