Abstract
Multi-label learning deals with the problem which each data example can be represented by an instance and associated with a set of labels, i.e., every example can be classified into multiple classes simultaneously. Most of the existing multi-label learning methods are supervised which cannot deal with such application scenarios where manually labeling the data is very expensive and time-consuming while the unlabeled data are very cheap and easy to obtain. This paper proposes an ensemble learning method which integrates multi-label learning and graph-based semi-supervised learning into one framework. The label correlation consistency is introduced to deal with the multi-label learning. The proposed method has been evaluated on five public multi-label datasets by comparing it with state-of-the-art supervised and semi-supervised multi-label methods according to multiple evaluation metrics to confirm its effectiveness. Experimental results show that the proposed method can achieve the comparable performance compared with the state-of-the-art methods. Furthermore, it is more confident on every single predicted label.
Similar content being viewed by others
Notes
http://mulan.sourceforge.net/datasets-mlc.html
References
Abbasi, M., Rabiee, H. R., and Gagné, C. Monocular 3d human pose estimation with a semi-supervised graph-based method. In 2015 International Conference on 3D Vision (2015), IEEE, pp. 518–526.
Arthur, D., and Vassilvitskii, S. k-means++: The advantages of careful seeding. Tech. rep., Stanford, 2006.
Boutell, M. R., Luo, J., Shen, X., and Brown, C. M. Learning multi-label scene classification. Pattern recognition 37, 9 (2004), 1757–1771.
Breiman, L. Random forests. Machine learning 45, 1 (2001), 5–32.
Chen, Z.-M., Wei, X.-S., Wang, P., and Guo, Y. Multi-label image recognition with graph convolutional networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2019), pp. 5177–5186.
Chung, F. R., and Graham, F. C. Spectral graph theory. No. 92. American Mathematical Soc., 1997.
Clare, A., and King, R. Knowledge discovery in multi-label phenotype data. Principles of data mining and knowledge discovery (2001), 42–53.
Demšar, J. Statistical comparisons of classifiers over multiple data sets. The Journal of Machine Learning Research 7 (2006), 1–30.
Elisseeff, A., and Weston, J. A kernel method for multi-labelled classification. In Advances in neural information processing systems (2002), pp. 681–687.
Erhan, D., Bengio, Y., Courville, A., Manzagol, P.-A., Vincent, P., and Bengio, S. Why does unsupervised pre-training help deep learning? The Journal of Machine Learning Research 11 (2010), 625–660.
Fürnkranz, J., Hüllermeier, E., Mencía, E. L., and Brinker, K. Multilabel classification via calibrated label ranking. Machine learning 73, 2 (2008), 133–153.
Ghamrawi, N., and McCallum, A. Collective multi-label classification. In Proceedings of the 14th ACM international conference on Information and knowledge management (2005), ACM, pp. 195–200.
Han, A. L.-F., Zeng, X., Wong, D. F., and Chao, L. S. Chinese named entity recognition with graph-based semi-supervised learning model. In Proceedings of the Eighth SIGHAN Workshop on Chinese Language Processing (2015), pp. 15–20.
Hastie, T., Tibshirani, R., and Friedman, J. The elements of statistical learning: data mining, inference, and prediction. Springer Science & Business Media, 2009.
Huang, S. J., and Zhou, Z. H. Multi-label learning by exploiting label correlations locally. In Twenty-Sixth AAAI Conference on Artificial Intelligence (2012), pp. 949–955.
Katakis, I., Tsoumakas, G., and Vlahavas, I. Multilabel text classification for automated tag suggestion. In Proceedings of the ECML/PKDD (2008), vol. 18, p. 5.
Kazawa, H., Izumitani, T., Taira, H., and Maeda, E. Maximal margin labeling for multi-topic text categorization. In Advances in neural information processing systems (2005), pp. 649–656.
Kong, X., Ng, M. K., and Zhou, Z. H. Transductive multilabel learning via label set propagation. IEEE Transactions on Knowledge & Data Engineering 25, 3 (2013), 704–719.
Li, C. X. Exploiting label correlations for multi-label classification. PhD Thesis, University of California (2011).
Liu, W., He, J., and Chang, S.-F. Large graph construction for scalable semi-supervised learning. In ICML (2010).
Ma, L., Ma, A., Ju, C., and Li, X. Graph-based semi-supervised learning for spectral-spatial hyperspectral image classification. pattern recognition letters 83 (2016), 133–142.
Madjarov, G., Kocev, D., Gjorgjevikj, D., and Džeroski, S. An extensive experimental comparison of methods for multi-label learning. Pattern Recognition 45, 9 (2012), 3084–3104.
Read, J., Pfahringer, B., Holmes, G., and Frank, E. Classifier chains for multi-label classification. Machine learning 85, 3 (2011), 333–359.
Schapire, R. E., and Singer, Y. Boostexter: A boosting-based system for text categorization. Machine learning 39, 2–3 (2000), 135–168.
Sun, J., Yu, H., Zhong, G., Dong, J., Zhang, S., and Yu, H. Random shapley forests: cooperative game-based random forests with consistency. IEEE transactions on cybernetics (2020).
Tang, J., Li, H., Qi, G.-J., and Chua, T.-S. Image annotation by graph-based inference with integrated multiple/single instance representations. IEEE Transactions on Multimedia 12, 2 (2009), 131–141.
Trohidis, K., Tsoumakas, G., Kalliris, G., and Vlahavas, I. P. Multi-label classification of music into emotions. In ISMIR (2008), vol. 8, pp. 325–330.
Tsoumakas, G., Katakis, I., and Vlahavas, I. Mining multi-label data. In Data mining and knowledge discovery handbook. Springer, 2009, pp. 667–685.
Tsoumakas, G., and Vlahavas, I. Random k-labelsets: An ensemble method for multilabel classification. Machine learning: ECML 2007 (2007), 406–417.
Vincent, P., Larochelle, H., Lajoie, I., Bengio, Y., and Manzagol, P.-A. Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion. The Journal of Machine Learning Research 11 (2010), 3371–3408.
Zha, Z. J., Mei, T., Wang, J., Wang, Z., and Hua, X. S. Graph-based semi-supervised learning with multiple labels. Journal of Visual Communication & Image Representation 20, 2 (2009), 97–103.
Zhang, J., Wu, Q., Shen, C., Zhang, J., and Lu, J. Multilabel image classification with regional latent semantic dependencies. IEEE Transactions on Multimedia 20, 10 (2018), 2801–2813.
Zhang, M.-L., and Zhang, K. Multi-label learning by exploiting label dependency. In Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining (2010), ACM, pp. 999–1008.
Zhang, M.-L., and Zhou, Z.-H. Ml-knn: A lazy learning approach to multi-label learning. Pattern recognition 40, 7 (2007), 2038–2048.
Zhang, M. L., and Zhou, Z. H. Multi-label learning by instance differentiation. In National Conference on Artificial Intelligence (2007), pp. 669–674.
Zhang, M.-L., and Zhou, Z.-H. A review on multi-label learning algorithms. IEEE transactions on knowledge and data engineering 26, 8 (2014), 1819–1837.
Zhang, Q., Sun, J., Zhong, G., and Dong, J. Random multi-graphs: a semi-supervised learning framework for classification of high dimensional data. Image and Vision Computing 60 (2017), 30–37.
Zhao, Y., Ball, R., Mosesian, J., de Palma, J.-F., and Lehman, B. Graph-based semi-supervised learning for fault detection and classification in solar photovoltaic arrays. IEEE Transactions on Power Electronics 30, 5 (2014), 2848–2858.
Zhu, X., and Goldberg, A. B. Introduction to semi-supervised learning. Synthesis lectures on artificial intelligence and machine learning 3, 1 (2009), 1–130.
Acknowledgements
This work was supported by the Major Project for New Generation of AI under Grant No. 2018AAA0100400, the Open Project Funding of Shandong Key Laboratory of computer networks, the Qingdao Agricultural University Research Foundation for Advanced Talents under Grant No. 1119012, the Joint Fund of the Equipments Pre-Research and Ministry of Education of China under Grant No. 6141A020337, the Science and Technology Program of Qingdao under Grant No. 21-1-4-ny-19-nsh, and the Natural Science Foundation of Shandong Province under Grant No. ZR2020MF131.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interest
The authors declare that they have no conflict of interest.
Ethical Approval
This article does not contain any studies with human participants or animals performed by any of the authors.
Informed Consent
Informed consent was obtained from all individual participants included in the study.
Rights and permissions
About this article
Cite this article
Zhang, Q., Zhong, G. & Dong, J. A Graph-based Semi-supervised Multi-label Learning Method Based on Label Correlation Consistency. Cogn Comput 13, 1564–1573 (2021). https://doi.org/10.1007/s12559-021-09912-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12559-021-09912-y