Skip to main content

Multi-label Online Streaming Feature Selection Based on Spectral Granulation and Mutual Information

  • Conference paper
  • First Online:
Rough Sets (IJCRS 2018)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11103))

Included in the following conference series:

Abstract

Instances in multi-label data sets are generally described as a high-dimensional feature vector, as brings the “curse of dimensionality” problem. To ease this problem, some multi-label feature selection algorithms have been proposed. However, they all handle feature selection problems with the assumption that all candidate features are available beforehand. While in some real applications, feature selection must be conducted in the online manner with dynamic features, for example, novel topics arise constantly with a set of features in social networks. Online streaming feature selection (OSFS), dealing with dynamic features, has attracted intensive interest in recent years. Some online feature selection methods are designed for single-label applications, They can not be directly applied in multi-label scenarios. In this paper, we propose a multi-label online streaming feature selection algorithm based on spectral granulation and mutual information (ML-OSMI), which takes high-order label correlations into consideration. Moreover, comprehensive experiments are conducted to verify the effectiveness of the proposed algorithm on twelve multi-label high-dimensional benchmark data sets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://meka.sourceforge.net/#datasets.

  2. 2.

    https://github.com/KKimura360/MLC_toolbox.

References

  1. Hua, X.S., Qi, G.J.: Online multi-label active annotation: towards large-scale content-based video search. In: International Conference on Multimedia 2008, Vancouver, British Columbia, Canada, pp. 141–150, October 2008

    Google Scholar 

  2. Lai, H., Yan, P., Shu, X., Wei, Y., Yan, S.: Instance-aware hashing for multi-label image retrieval. IEEE Trans. Image Process. 25(6), 2469 (2016)

    Article  MathSciNet  Google Scholar 

  3. Trohidis, K., Tsoumakas, G., Kalliris, G., Vlahavas, I.P.: Multi-label classification of music into emotions. In: ISMIR 2008, 9th International Conference on Music Information Retrieval, Drexel University, Philadelphia, PA, USA, 14–18 September 2008, pp. 325–330 (2008)

    Google Scholar 

  4. Wu, B., Lyu, S., Hu, B.G., Ji, Q.: Multi-label learning with missing labels for image annotation and facial action unit recognition. Patt. Recogn. 48(7), 2279–2289 (2015)

    Article  Google Scholar 

  5. Zhang, M.L., Zhou, Z.H.: Multilabel neural networks with applications to functional genomics and text categorization. IEEE Trans. Knowl. Data Eng. 18(10), 1338–1351 (2006)

    Article  Google Scholar 

  6. Tsoumakas, G., Katakis, I., Vlahavas, I.P.: Mining multi-label data. In: Data Mining and Knowledge Discovery Handbook, 2nd edn., pp. 667–685 (2010)

    Google Scholar 

  7. Jian, L., Li, J., Shu, K., Liu, H.: Multi-label informed feature selection. In: Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, IJCAI 2016, New York, NY, USA, 9–15 July 2016, pp. 1627–1633 (2016)

    Google Scholar 

  8. Lee, J., Kim, D.W.: Mutual information-based multi-label feature selection using interaction information. Expert Syst. Appl. 42(4), 2013–2025 (2015)

    Article  Google Scholar 

  9. Li, F., Miao, D., Pedrycz, W.: Granular multi-label feature selection based on mutual information. Patt. Recogn. 67, 410–423 (2017)

    Article  Google Scholar 

  10. Wu, X., Yu, K., Wang, H., Ding, W.: Online streaming feature selection. In: Proceedings of the 27th International Conference on Machine Learning (ICML 2010), 21–24 June 2010, Haifa, Israel, pp. 1159–1166 (2010)

    Google Scholar 

  11. Wu, X., Yu, K., Ding, W., Wang, H.: Online feature selection with streaming features. IEEE Trans. Patt. Anal. Mach. Intell. 35(5), 1178 (2013)

    Article  Google Scholar 

  12. Wang, J., et al.: Online feature selection with group structure analysis. IEEE Trans. Knowl. Data Eng. 27(11), 3029–3041 (2016)

    Article  Google Scholar 

  13. Perkins, S., Theiler, J.: Online feature selection using grafting. In: Machine Learning, Proceedings of the Twentieth International Conference (ICML 2003), 21–24 August 2003, Washington, DC, USA, pp. 592–599 (2003)

    Google Scholar 

  14. Zhou, J., Foster, D.P., Stine, R.A., Ungar, L.H.: Streaming feature selection using alpha-investing. In: Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Chicago, Illinois, USA, 21–24 August 2005, pp. 384–393 (2005)

    Google Scholar 

  15. Cherman, E.A., Monard, M.C., Lee, H.D.: A comparison of multi-label feature selection methods using the problem transformation approach. Electr. Notes Theor. Comput. Sci. 292, 135–151 (2013)

    Article  Google Scholar 

  16. Spolaôr, N., Monard, M.C., Lee, H.D.: Feature selection for multi-label learning. In: Proceedings of the 24th International Conference on Artificial Intelligence, Series, IJCAI 2015, pp. 4401–4402. AAAI Press (2015)

    Google Scholar 

  17. Lin, Y., Hu, Q., Liu, J., Duan, J.: Multi-label feature selection based on max-dependency and min-redundancy. Neurocomputing 168, 92–103 (2015)

    Article  Google Scholar 

  18. Kimura, K., Sun, L., Kudo, M.: MLC toolbox: A MATLAB/OCTAVE library for multi-label classification. CoRR, abs/1704.02592 (2017). http://arxiv.org/abs/1704.02592

  19. Peng, H., Long, F., Ding, C.: Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans. Patt. Anal. Mach. Intell. 27(8), 1226 (2005)

    Article  Google Scholar 

  20. Nie, F., Huang, H., Cai, X., Ding, C.H.Q.: Efficient and robust feature selection via joint \(l_{2,1}\)-norms minimization. In: Advances in Neural Information Processing Systems 23: 24th Annual Conference on Neural Information Processing Systems 2010. Proceedings of a meeting held 6–9 December 2010, Vancouver, British Columbia, Canada, pp. 1813–1821 (2010)

    Google Scholar 

  21. Lin, Y., Hu, Q., Zhang, J., Wu, X.: Multi-label feature selection with streaming labels. Inf. Sci. 372, 256–275 (2016)

    Article  Google Scholar 

  22. Yu, K., Wu, X., Ding, W., Pei, J.: Towards scalable and accurate online feature selection for big data. In: 2014 IEEE International Conference on Data Mining, ICDM 2014, Shenzhen, China, 14–17 December 2014, pp. 660–669 (2014)

    Google Scholar 

  23. Sun, L., Kudo, M., Kimura, K.: Multi-label classification with meta-label-specific features. In: 23rd International Conference on Pattern Recognition, ICPR 2016, Cancún, Mexico, 4–8 December 2016, pp. 1612–1617 (2016)

    Google Scholar 

  24. Zhang, M.L., Zhou, Z.H.: ML-KNN: a lazy learning approach to multi-label learning. Patt. Recogn. 40(7), 2038–2048 (2007)

    Article  Google Scholar 

  25. Kong, D., Ding, C.H.Q., Huang, H., Zhao, H.: Multi-label reliefF and F-statistic feature selections for image annotation. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA, 16–21 June 2012, pp. 2352–2359 (2012)

    Google Scholar 

Download references

Acknowledgements

This work was supported by the National Key Research and Development Program of China (Grant no. 2016YFB1000900), the National Natural Science Foundation of China (Grant nos. 61572091, 61772096), Chongqing Basic and Frontier Research Project (cstc2015jcyjA40018) and The Science and Technology Project Affiliated to the Education Department of Chongqing Municipality (KJ1500438).

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Zhixing Li or Guoyin Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wang, H., Yu, D., Li, Y., Li, Z., Wang, G. (2018). Multi-label Online Streaming Feature Selection Based on Spectral Granulation and Mutual Information. In: Nguyen, H., Ha, QT., Li, T., Przybyła-Kasperek, M. (eds) Rough Sets. IJCRS 2018. Lecture Notes in Computer Science(), vol 11103. Springer, Cham. https://doi.org/10.1007/978-3-319-99368-3_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-99368-3_17

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-99367-6

  • Online ISBN: 978-3-319-99368-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics