Abstract
Instances in multi-label data sets are generally described as a high-dimensional feature vector, as brings the “curse of dimensionality” problem. To ease this problem, some multi-label feature selection algorithms have been proposed. However, they all handle feature selection problems with the assumption that all candidate features are available beforehand. While in some real applications, feature selection must be conducted in the online manner with dynamic features, for example, novel topics arise constantly with a set of features in social networks. Online streaming feature selection (OSFS), dealing with dynamic features, has attracted intensive interest in recent years. Some online feature selection methods are designed for single-label applications, They can not be directly applied in multi-label scenarios. In this paper, we propose a multi-label online streaming feature selection algorithm based on spectral granulation and mutual information (ML-OSMI), which takes high-order label correlations into consideration. Moreover, comprehensive experiments are conducted to verify the effectiveness of the proposed algorithm on twelve multi-label high-dimensional benchmark data sets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Hua, X.S., Qi, G.J.: Online multi-label active annotation: towards large-scale content-based video search. In: International Conference on Multimedia 2008, Vancouver, British Columbia, Canada, pp. 141–150, October 2008
Lai, H., Yan, P., Shu, X., Wei, Y., Yan, S.: Instance-aware hashing for multi-label image retrieval. IEEE Trans. Image Process. 25(6), 2469 (2016)
Trohidis, K., Tsoumakas, G., Kalliris, G., Vlahavas, I.P.: Multi-label classification of music into emotions. In: ISMIR 2008, 9th International Conference on Music Information Retrieval, Drexel University, Philadelphia, PA, USA, 14–18 September 2008, pp. 325–330 (2008)
Wu, B., Lyu, S., Hu, B.G., Ji, Q.: Multi-label learning with missing labels for image annotation and facial action unit recognition. Patt. Recogn. 48(7), 2279–2289 (2015)
Zhang, M.L., Zhou, Z.H.: Multilabel neural networks with applications to functional genomics and text categorization. IEEE Trans. Knowl. Data Eng. 18(10), 1338–1351 (2006)
Tsoumakas, G., Katakis, I., Vlahavas, I.P.: Mining multi-label data. In: Data Mining and Knowledge Discovery Handbook, 2nd edn., pp. 667–685 (2010)
Jian, L., Li, J., Shu, K., Liu, H.: Multi-label informed feature selection. In: Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, IJCAI 2016, New York, NY, USA, 9–15 July 2016, pp. 1627–1633 (2016)
Lee, J., Kim, D.W.: Mutual information-based multi-label feature selection using interaction information. Expert Syst. Appl. 42(4), 2013–2025 (2015)
Li, F., Miao, D., Pedrycz, W.: Granular multi-label feature selection based on mutual information. Patt. Recogn. 67, 410–423 (2017)
Wu, X., Yu, K., Wang, H., Ding, W.: Online streaming feature selection. In: Proceedings of the 27th International Conference on Machine Learning (ICML 2010), 21–24 June 2010, Haifa, Israel, pp. 1159–1166 (2010)
Wu, X., Yu, K., Ding, W., Wang, H.: Online feature selection with streaming features. IEEE Trans. Patt. Anal. Mach. Intell. 35(5), 1178 (2013)
Wang, J., et al.: Online feature selection with group structure analysis. IEEE Trans. Knowl. Data Eng. 27(11), 3029–3041 (2016)
Perkins, S., Theiler, J.: Online feature selection using grafting. In: Machine Learning, Proceedings of the Twentieth International Conference (ICML 2003), 21–24 August 2003, Washington, DC, USA, pp. 592–599 (2003)
Zhou, J., Foster, D.P., Stine, R.A., Ungar, L.H.: Streaming feature selection using alpha-investing. In: Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Chicago, Illinois, USA, 21–24 August 2005, pp. 384–393 (2005)
Cherman, E.A., Monard, M.C., Lee, H.D.: A comparison of multi-label feature selection methods using the problem transformation approach. Electr. Notes Theor. Comput. Sci. 292, 135–151 (2013)
Spolaôr, N., Monard, M.C., Lee, H.D.: Feature selection for multi-label learning. In: Proceedings of the 24th International Conference on Artificial Intelligence, Series, IJCAI 2015, pp. 4401–4402. AAAI Press (2015)
Lin, Y., Hu, Q., Liu, J., Duan, J.: Multi-label feature selection based on max-dependency and min-redundancy. Neurocomputing 168, 92–103 (2015)
Kimura, K., Sun, L., Kudo, M.: MLC toolbox: A MATLAB/OCTAVE library for multi-label classification. CoRR, abs/1704.02592 (2017). http://arxiv.org/abs/1704.02592
Peng, H., Long, F., Ding, C.: Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans. Patt. Anal. Mach. Intell. 27(8), 1226 (2005)
Nie, F., Huang, H., Cai, X., Ding, C.H.Q.: Efficient and robust feature selection via joint \(l_{2,1}\)-norms minimization. In: Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6–9 December 2010, Vancouver, British Columbia, Canada, pp. 1813–1821 (2010)
Lin, Y., Hu, Q., Zhang, J., Wu, X.: Multi-label feature selection with streaming labels. Inf. Sci. 372, 256–275 (2016)
Yu, K., Wu, X., Ding, W., Pei, J.: Towards scalable and accurate online feature selection for big data. In: 2014 IEEE International Conference on Data Mining, ICDM 2014, Shenzhen, China, 14–17 December 2014, pp. 660–669 (2014)
Sun, L., Kudo, M., Kimura, K.: Multi-label classification with meta-label-specific features. In: 23rd International Conference on Pattern Recognition, ICPR 2016, Cancún, Mexico, 4–8 December 2016, pp. 1612–1617 (2016)
Zhang, M.L., Zhou, Z.H.: ML-KNN: a lazy learning approach to multi-label learning. Patt. Recogn. 40(7), 2038–2048 (2007)
Kong, D., Ding, C.H.Q., Huang, H., Zhao, H.: Multi-label reliefF and F-statistic feature selections for image annotation. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA, 16–21 June 2012, pp. 2352–2359 (2012)
Acknowledgements
This work was supported by the National Key Research and Development Program of China (Grant no. 2016YFB1000900), the National Natural Science Foundation of China (Grant nos. 61572091, 61772096), Chongqing Basic and Frontier Research Project (cstc2015jcyjA40018) and The Science and Technology Project Affiliated to the Education Department of Chongqing Municipality (KJ1500438).
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Wang, H., Yu, D., Li, Y., Li, Z., Wang, G. (2018). Multi-label Online Streaming Feature Selection Based on Spectral Granulation and Mutual Information. In: Nguyen, H., Ha, QT., Li, T., Przybyła-Kasperek, M. (eds) Rough Sets. IJCRS 2018. Lecture Notes in Computer Science(), vol 11103. Springer, Cham. https://doi.org/10.1007/978-3-319-99368-3_17
Download citation
DOI: https://doi.org/10.1007/978-3-319-99368-3_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-99367-6
Online ISBN: 978-3-319-99368-3
eBook Packages: Computer ScienceComputer Science (R0)