Abstract
Multi-label classification is used to solve the problem where multiple labels are associated with single sample. Naive Bayes (NB) classifier is widely used for single label classification due to its high performance and simplicity. Therefore it is vital to extend NB for multi-label classification. In single label classification feature weighted NB gives high accuracy by solving the conditional independence assumption of NB. However, NB is not much explored for multi-label classification. This paper proposes correlation dependent feature weighted NB (MLCFWNB) for multi-label classification. The proposed MLCFWNB is tested over eight benchmark datasets. The experimental result suggest that MLCFWNB wins 60% times in case of different multi-label learning evaluation parameters.
Similar content being viewed by others
Data availability
The data supporting the findings of this study are available in the http://mulan.sourceforge.net/datasets-mlc.html.
References
Bishop Christopher M et al (1995) Neural networks for pattern recognition. Oxford University Press
Chaudhuri Abhilasha, Sahu Tirath Prasad (2021) Feature weighting for naïve bayes using multi objective artificial bee colony algorithm. Int J Comput Sci Eng 24(1):74–88
Hall M (2006) A decision tree-based attribute weighting filter for naive bayes. In: International conference on innovative techniques and applications of artificial intelligence, pages 59–70. Springer
Hall MA (1999) Correlation-based feature selection for machine learning. PhD thesis, The University of Waikato
Jiang Liangxiao, Li Chaoqun, Wang Shasha, Zhang Lungan (2016) Deep feature weighting for naive bayes and its application to text classification. Eng Appl Artif Intell 52:26–39
Jiang Liangxiao, Zhang Lungan, Li Chaoqun, Jia Wu (2018) A correlation-based feature weighting filter for naive bayes. IEEE Trans Knowl Data Eng 31(2):201–213
Kashef Shima, Nezamabadi-pour Hossein, Nikpour Bahareh (2018) Multilabel feature selection: a comprehensive review and guiding experiments. Wiley Interdiscip Rev Data Min Knowl Discov 8(2):e1240
Lee C-H, Gutierrez F, Dou D (2011) Calculating feature weights in naive bayes with kullback-leibler measure. In: 2011 IEEE 11th international conference on data mining, pages 1146–1151. IEEE
Niño-Adan Iratxe, Manjarres Diana, Landa-Torres Itziar, Portillo Eva (2021) Feature weighting methods: a review. Expert Syst Appl 184:115424
Sharma Manoj, Kumar Naresh, Kumar Pardeep et al (2022) Naive bayes-correlation based feature weighting technique for sports match result prediction. Evol Intel 15(3):2171–2186
Wang S, Jiang L, Li C (2014) A cfs-based feature weighting approach to naive bayes text classifiers. In: International conference on artificial neural networks, pages 555–562. Springer
Yan X, Li W, Wu Q, Sheng VS (2015) A double weighted naive bayes for multi-label classification. In: International symposium on computational intelligence and intelligent systems, pages 382–389. Springer
Yang Youlong, Ding Mengxiao (2019) Decision function with probability feature weighting based on bayesian network for multi-label classification. Neural Comput Appl 31(9):4819–4828
Zhang H, Sheng S (2004) Learning weighted naive bayes with accurate ranking. In: Fourth IEEE international conference on data mining (ICDM’04), pages 567–570. IEEE
Sharma A, Mishra PK (2022) Performance analysis of machine learning based optimized feature selection approaches for breast cancer diagnosis. Int J Inf Tecnol 14:1949–1960. https://doi.org/10.1007/s41870-021-00671-5
Sharaff A, Jain M, Modugula G (2022) Feature based cluster ranking approach for single document summarization. Int J Inf Tecnol 14:2057–2065. https://doi.org/10.1007/s41870-021-00853-1
Juneja K, Rana C (2020) An improved weighted decision tree approach for breast cancer prediction. Int J Inf Tecnol 12:797–804. https://doi.org/10.1007/s41870-018-0184-2
Fadele AA, Kamsin A, Ahmad K et al (2022) A novel classification to categorise original hadith detection techniques. Int J Inf Tecnol 14:2361–2375. https://doi.org/10.1007/s41870-021-00649-3
Dharmasena I, Domaratzki M, Muthukumarana S (2021) Modeling mobile apps user behavior using Bayesian networks. Int J Inf Tecnol 13:1269–1277. https://doi.org/10.1007/s41870-021-00699-7
Qu G, Zhang H, Hartrick CT (2011) Multi-label classification with Bayes’ theorem. In 2011 4th International Conference on Biomedical Engineering and Informatics (BMEI) (Vol. 4, pp. 2281-2285). IEEE
Tenenboim L, Rokach L, Shapira B (2010) Identification of label dependencies for multi-label classification. In MLD 2010 : second international workshop on learning from multi-label data
Tsoumakas G, Katakis I (2007) Multi label classification: An overview. Int J Data Warehouse Min 3(3):1–13
Tsoumakas G, Katakis I, Vlahava I (2009) Mining multi-label data. In: Maimon O, Rokach L (eds) Data mining and knowledge discovery handbook, 2nd edn. Springer, New York
Katakis I, Tsoumakas G, Vlahavas I (2008) Multilabel text classification for automated tag suggestion. In Proceedings of the ECML/PKDD 2008 discovery challenge
Briggs F, Huang Y, Raich R, Eftaxias K, Lei Z, Cukierski W, Hadley Sarah F, Hadley A, Betts M, Fern Xiaoli Z, Irvine J, Neal L, Thomas A, Fodor G, Tsoumakas G, Ng Hong W, Nguyen Thi Ngoc T, Huttunen H, Ruusuvuori P, Manninen T, Diment A, Virtanen T, Marzat J, Defretin J, Callender D, Hurlburt C, Larrey K, Milakov M (2013) The 9th annual MLSP competition: new methods for acoustic classification of multiple simultaneous bird species in a noisy environment. In IEEE international workshop on machine learning for signal processing, MLSP 2013, Southampton, United Kingdom, September 22-25, 2013, pages 1–8
Duygulu P, Barnard K (2002) Nando de Freitas, and David Forsyth, Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary , 7th European conference on computer vision, pp IV:97-112
Tsoumakas G, Katakis I, Vlahavas I (2008) Effective and efficient multilabel classification in domains with large number of labels. In Proc ECML/PKDD 2008 workshop on mining multidimensional data (MMD’08)
Read J, Pfahringer B, Holmes G (2008) Multi-label classification using ensembles of pruned sets. In ICDM ’08: proceedings of the 2008 Eighth IEEE international conference on data mining, volume 0, pages 995–1000, Washington, DC, USA. IEEE Computer Society
Boutell Matthew R, Luo Jiebo, Shen Xipeng, Brown Christopher M (2004) Learning multi-label scene classification. Pattern Recogn 37(9):1757–1771
Pestian John P, Brew C, Matykiewicz P, Hovermale DJ, Johnson N, Cohen KB, Duch W (2007) A shared task involving multi-label classification of clinical free text. In Proceedings of the workshop on BioNLP 2007: biological, translational, and clinical language processing (BioNLP ’07), pages 97–104
Diplaris S, Tsoumakas G, Mitkas P, Vlahavas I (2005) Protein classification with multiple algorithms. In: Panhellenic conference on informaticspages. 448–456
Chen S, Webb GI, Liu L, Ma X (2020) A novel selective naïve Bayes algorithm. Knowl-Based Syst 192:105361
Kashef S, Nezamabadi-pour H, Nikpour B (2018) Multilabel feature selection: a comprehensive review and guiding experiments. Wiley Interdiscip Rev Data Min Knowl Discov 8(2):e1240
Zhang Min-Ling, Peña José M, Robles Victor (2009) Feature selection for multi-label naive Bayes classification. Inf Sci 179(19):3218–3229
Kim Hae-Cheon, Park Jin-Hyeong, Kim Dae-Won, Lee Jaesung (2020) Multilabel naïve Bayes classification considering label dependence. Pattern Recogn Lett 136:279–285
Liangjun Yu, Gan Shengfeng, Chen Yu, He Meizhang (2020) Correlation-based weight adjusted naive Bayes. IEEE Access 8:51377–51387
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Verma, G., Sahu, T.P. A correlation-based feature weighting filter for multi-label Naive Bayes. Int. j. inf. tecnol. 16, 611–619 (2024). https://doi.org/10.1007/s41870-023-01555-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41870-023-01555-6