Abstract
Multi-label learning has received much attention due to its wide range of application domains. Multi-label data often has high-dimensional features, which brings more challenges to classification algorithms. Feature selection based on sparse learning is one of the most important approaches, which can select discriminative features to alleviate the curse of dimensionality. However, most of these methods do not consider feature redundancy and label correlation simultaneously. In this work, we propose a multi-label feature selection method that takes feature redundancy and label correlation into account. Specifically, the proposed method first divides features into groups according to feature correlation and then encourages competition within each group but relaxes competition between groups to eliminate redundant and irrelevant features. Moreover, we propose an approach to capture label correlation and exploit it to improve the feature selection. Finally, an iterative optimization algorithm is designed to obtain the feature weights for multi-label feature selection. Extensive experiments on various multi-label datasets demonstrate the superiority of the proposed method compared with some state-of-the-art multi-label feature selection methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Cai, Z., Zhu, W.: Multi-label feature selection via feature manifold learning and sparsity regularization. Int. J. Mach. Learn. Cybern. 9, 1321–1334 (2018)
Chen, L., et al.: Predicting gene phenotype by multi-label multi-class model based on essential functional features. Mol. Genet. Genom. 296(4), 905–918 (2021)
Chen, Y., Zou, C., Chen, J.: Label-aware graph representation learning for multi-label image classification. Neurocomputing 492, 50–61 (2022)
Chen, Z.M., Wei, X.S., Wang, P., Guo, Y.: Multi-label image recognition with graph convolutional networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 5177–5186 (2019)
Chou, H.C., Lee, C.C., Busso, C.: Exploiting co-occurrence frequency of emotions in perceptual evaluations to train a speech emotion classifier. In: Proceedings of Interspeech, vol. 2022, pp. 161–165 (2022)
Demšar, J.: Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res. 7, 1–30 (2006)
Hu, L., Li, Y., Gao, W., Zhang, P., Hu, J.: Multi-label feature selection with shared common mode. Pattern Recogn. 104, 107344 (2020)
Huang, J., Xu, Q., Qu, X., Lin, Y., Zheng, X.: Improving multi-label learning by correlation embedding. Appl. Sci. 11(24), 12145 (2021)
Jia, X., Sun, F., Li, H., Cao, Y., Zhang, X.: Image multi-label annotation based on supervised nonnegative matrix factorization with new matching measurement. Neurocomputing 219, 518–525 (2017)
Jian, L., Li, J., Shu, K., Liu, H.: Multi-label informed feature selection. IJCAI 16, 1627–33 (2016)
Jordan, M.I.: Serial order: a parallel distributed processing approach. In: Advances in Psychology, vol. 121, pp. 471–495. Elsevier (1997)
Kashef, S., Nezamabadi-pour, H., Nikpour, B.: Multilabel feature selection: a comprehensive review and guiding experiments. Wiley Interdiscip. Rev. Data Mining Knowl. Disc. 8, e1240 (2018)
Kong, D., Liu, J., Liu, B., Bao, X.: Uncorrelated group lasso. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 30 (2016)
Lee, J., Yu, I.: Park: memetic feature selection for multilabel text categorization using label frequency difference. Inf. Sci. 485, 263–280 (2019)
Liu, Y., Wen, K., Gao, Q., Gao, X., Nie, F.: SVM based multi-label learning with missing labels for image annotation. Pattern Recogn. 78, 307–317 (2018)
Sarfraz, S., Sharma, V., Stiefelhagen, R.: Efficient parameter-free clustering using first neighbor relations. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 8934–8943 (2019)
Tsoumakas, G., Spyromitros-Xioufis, E., Vilcek, J., Vlahavas, I.: Mulan: a java library for multi-label learning. J. Mach. Learn. Res. 12, 2411–2414 (2011)
Wu, B., Liu, Z., Wang, S., Hu, B.G., Ji, Q.: Multi-label learning with missing labels. In: 2014 22nd International Conference on Pattern Recognition, pp. 1964–1968. IEEE (2014)
Wu, X.Z., Zhou, Z.H.: A unified view of multi-label performance measures. In: International Conference on Machine Learning, pp. 3780–3788. PMLR (2017)
Zhang, J., Luo, Z., Li, C., Zhou, C., Li, S.: Manifold regularized discriminative feature selection for multi-label learning. Pattern Recogn. 95, 136–150 (2019)
Zhang, M.L., Zhou, Z.H.: ML-KNN: a lazy learning approach to multi-label learning. Pattern Recogn. 40(7), 2038–2048 (2007)
Zhang, Z., Liu, L., Li, J., Wu, X.: Integrating global and local feature selection for multi-label learning. ACM Trans. Knowl. Discov. Data 17(1), 1–37 (2023)
Acknowledgment
This research was supported by the National Key Research and Development Program of China (No. 2020YFC0833302), the National Natural Science Foundation of China (No. 62076059), the Science Project of Liaoning Province (2021-MS-105) and the 111 Project (B16009).
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Zhang, K., Liang, W., Cao, P., Yang, J., Li, W., Zaiane, O.R. (2023). Label Correlation Guided Feature Selection for Multi-label Learning. In: Yang, X., et al. Advanced Data Mining and Applications. ADMA 2023. Lecture Notes in Computer Science(), vol 14179. Springer, Cham. https://doi.org/10.1007/978-3-031-46674-8_27
Download citation
DOI: https://doi.org/10.1007/978-3-031-46674-8_27
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-46673-1
Online ISBN: 978-3-031-46674-8
eBook Packages: Computer ScienceComputer Science (R0)