Skip to main content
Log in

A correlation-based feature weighting filter for multi-label Naive Bayes

  • Original Research
  • Published:
International Journal of Information Technology Aims and scope Submit manuscript

Abstract

Multi-label classification is used to solve the problem where multiple labels are associated with single sample. Naive Bayes (NB) classifier is widely used for single label classification due to its high performance and simplicity. Therefore it is vital to extend NB for multi-label classification. In single label classification feature weighted NB gives high accuracy by solving the conditional independence assumption of NB. However, NB is not much explored for multi-label classification. This paper proposes correlation dependent feature weighted NB (MLCFWNB) for multi-label classification. The proposed MLCFWNB is tested over eight benchmark datasets. The experimental result suggest that MLCFWNB wins 60% times in case of different multi-label learning evaluation parameters.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

Data availability

The data supporting the findings of this study are available in the http://mulan.sourceforge.net/datasets-mlc.html.

References

  1. Bishop Christopher M et al (1995) Neural networks for pattern recognition. Oxford University Press

    Book  Google Scholar 

  2. Chaudhuri Abhilasha, Sahu Tirath Prasad (2021) Feature weighting for naïve bayes using multi objective artificial bee colony algorithm. Int J Comput Sci Eng 24(1):74–88

    Google Scholar 

  3. Hall M (2006) A decision tree-based attribute weighting filter for naive bayes. In: International conference on innovative techniques and applications of artificial intelligence, pages 59–70. Springer

  4. Hall MA (1999) Correlation-based feature selection for machine learning. PhD thesis, The University of Waikato

  5. Jiang Liangxiao, Li Chaoqun, Wang Shasha, Zhang Lungan (2016) Deep feature weighting for naive bayes and its application to text classification. Eng Appl Artif Intell 52:26–39

    Article  Google Scholar 

  6. Jiang Liangxiao, Zhang Lungan, Li Chaoqun, Jia Wu (2018) A correlation-based feature weighting filter for naive bayes. IEEE Trans Knowl Data Eng 31(2):201–213

    Article  Google Scholar 

  7. Kashef Shima, Nezamabadi-pour Hossein, Nikpour Bahareh (2018) Multilabel feature selection: a comprehensive review and guiding experiments. Wiley Interdiscip Rev Data Min Knowl Discov 8(2):e1240

    Article  Google Scholar 

  8. Lee C-H, Gutierrez F, Dou D (2011) Calculating feature weights in naive bayes with kullback-leibler measure. In: 2011 IEEE 11th international conference on data mining, pages 1146–1151. IEEE

  9. Niño-Adan Iratxe, Manjarres Diana, Landa-Torres Itziar, Portillo Eva (2021) Feature weighting methods: a review. Expert Syst Appl 184:115424

    Article  Google Scholar 

  10. Sharma Manoj, Kumar Naresh, Kumar Pardeep et al (2022) Naive bayes-correlation based feature weighting technique for sports match result prediction. Evol Intel 15(3):2171–2186

    Article  Google Scholar 

  11. Wang S, Jiang L, Li C (2014) A cfs-based feature weighting approach to naive bayes text classifiers. In: International conference on artificial neural networks, pages 555–562. Springer

  12. Yan X, Li W, Wu Q, Sheng VS (2015) A double weighted naive bayes for multi-label classification. In: International symposium on computational intelligence and intelligent systems, pages 382–389. Springer

  13. Yang Youlong, Ding Mengxiao (2019) Decision function with probability feature weighting based on bayesian network for multi-label classification. Neural Comput Appl 31(9):4819–4828

    Article  Google Scholar 

  14. Zhang H, Sheng S (2004) Learning weighted naive bayes with accurate ranking. In: Fourth IEEE international conference on data mining (ICDM’04), pages 567–570. IEEE

  15. Sharma A, Mishra PK (2022) Performance analysis of machine learning based optimized feature selection approaches for breast cancer diagnosis. Int J Inf Tecnol 14:1949–1960. https://doi.org/10.1007/s41870-021-00671-5

    Article  Google Scholar 

  16. Sharaff A, Jain M, Modugula G (2022) Feature based cluster ranking approach for single document summarization. Int J Inf Tecnol 14:2057–2065. https://doi.org/10.1007/s41870-021-00853-1

    Article  Google Scholar 

  17. Juneja K, Rana C (2020) An improved weighted decision tree approach for breast cancer prediction. Int J Inf Tecnol 12:797–804. https://doi.org/10.1007/s41870-018-0184-2

    Article  Google Scholar 

  18. Fadele AA, Kamsin A, Ahmad K et al (2022) A novel classification to categorise original hadith detection techniques. Int J Inf Tecnol 14:2361–2375. https://doi.org/10.1007/s41870-021-00649-3

    Article  Google Scholar 

  19. Dharmasena I, Domaratzki M, Muthukumarana S (2021) Modeling mobile apps user behavior using Bayesian networks. Int J Inf Tecnol 13:1269–1277. https://doi.org/10.1007/s41870-021-00699-7

    Article  Google Scholar 

  20. Qu G, Zhang H, Hartrick CT (2011) Multi-label classification with Bayes’ theorem. In 2011 4th International Conference on Biomedical Engineering and Informatics (BMEI) (Vol. 4, pp. 2281-2285). IEEE

  21. Tenenboim L, Rokach L, Shapira B (2010) Identification of label dependencies for multi-label classification. In MLD 2010 : second international workshop on learning from multi-label data

  22. Tsoumakas G, Katakis I (2007) Multi label classification: An overview. Int J Data Warehouse Min 3(3):1–13

    Article  Google Scholar 

  23. Tsoumakas G, Katakis I, Vlahava I (2009) Mining multi-label data. In: Maimon O, Rokach L (eds) Data mining and knowledge discovery handbook, 2nd edn. Springer, New York

    Google Scholar 

  24. Katakis I, Tsoumakas G, Vlahavas I (2008) Multilabel text classification for automated tag suggestion. In Proceedings of the ECML/PKDD 2008 discovery challenge

  25. Briggs F, Huang Y, Raich R, Eftaxias K, Lei Z, Cukierski W, Hadley Sarah F, Hadley A, Betts M, Fern Xiaoli Z, Irvine J, Neal L, Thomas A, Fodor G, Tsoumakas G, Ng Hong W, Nguyen Thi Ngoc T, Huttunen H, Ruusuvuori P, Manninen T, Diment A, Virtanen T, Marzat J, Defretin J, Callender D, Hurlburt C, Larrey K, Milakov M (2013) The 9th annual MLSP competition: new methods for acoustic classification of multiple simultaneous bird species in a noisy environment. In IEEE international workshop on machine learning for signal processing, MLSP 2013, Southampton, United Kingdom, September 22-25, 2013, pages 1–8

  26. Duygulu P, Barnard K (2002) Nando de Freitas, and David Forsyth, Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary , 7th European conference on computer vision, pp IV:97-112

  27. Tsoumakas G, Katakis I, Vlahavas I (2008) Effective and efficient multilabel classification in domains with large number of labels. In Proc ECML/PKDD 2008 workshop on mining multidimensional data (MMD’08)

  28. Read J, Pfahringer B, Holmes G (2008) Multi-label classification using ensembles of pruned sets. In ICDM ’08: proceedings of the 2008 Eighth IEEE international conference on data mining, volume 0, pages 995–1000, Washington, DC, USA. IEEE Computer Society

  29. Boutell Matthew R, Luo Jiebo, Shen Xipeng, Brown Christopher M (2004) Learning multi-label scene classification. Pattern Recogn 37(9):1757–1771

    Article  Google Scholar 

  30. Pestian John P, Brew C, Matykiewicz P, Hovermale DJ, Johnson N, Cohen KB, Duch W (2007) A shared task involving multi-label classification of clinical free text. In Proceedings of the workshop on BioNLP 2007: biological, translational, and clinical language processing (BioNLP ’07), pages 97–104

  31. Diplaris S, Tsoumakas G, Mitkas P, Vlahavas I (2005) Protein classification with multiple algorithms. In: Panhellenic conference on informaticspages. 448–456

  32. Chen S, Webb GI, Liu L, Ma X (2020) A novel selective naïve Bayes algorithm. Knowl-Based Syst 192:105361

    Article  Google Scholar 

  33. Kashef S, Nezamabadi-pour H, Nikpour B (2018) Multilabel feature selection: a comprehensive review and guiding experiments. Wiley Interdiscip Rev Data Min Knowl Discov 8(2):e1240

    Article  Google Scholar 

  34. Zhang Min-Ling, Peña José M, Robles Victor (2009) Feature selection for multi-label naive Bayes classification. Inf Sci 179(19):3218–3229

    Article  Google Scholar 

  35. Kim Hae-Cheon, Park Jin-Hyeong, Kim Dae-Won, Lee Jaesung (2020) Multilabel naïve Bayes classification considering label dependence. Pattern Recogn Lett 136:279–285

    Article  Google Scholar 

  36. Liangjun Yu, Gan Shengfeng, Chen Yu, He Meizhang (2020) Correlation-based weight adjusted naive Bayes. IEEE Access 8:51377–51387

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Gurudatta Verma.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Verma, G., Sahu, T.P. A correlation-based feature weighting filter for multi-label Naive Bayes. Int. j. inf. tecnol. 16, 611–619 (2024). https://doi.org/10.1007/s41870-023-01555-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s41870-023-01555-6

Keywords

Navigation