Abstract
Extended naive Bayesian classifier (ENBC) is a general framework of NBCs, which is developed based on \(t\)-norm based ordered weighted averaging (\(t\)-OWA) operator and uses the weighted summation of products of margin probabilities to determine class-conditional probability. Since ENBC was proposed in 2006, there is no such a study which tests the performances of ENBC on the real classification datasets. Thus, in this paper we conduct an experimental investigation to ENBC’s generalization capability based on 44 benchmark KEEL and UCI datasets. The analysis shows that (1) ENBC is instable and its aggregation weights are sensitive to the order of training samples and (2) ENBC indeed has higher generalization capability than the existing NBCs, e.g., normal naive Bayesian and flexible naive Bayesian when its weight is properly selected.
Similar content being viewed by others
Notes
Strictly speaking, Eqs. (3) and (4) are not correct. Regarding this issue, please refer to the footnote on page 339 of John and Langley’s work [7].
For the meanings of mathematical symbols \(h_1\), \(h_2\), \(s_2\), \(t_1\), \(t_2\), \(t_3\), \(u_1\), and \(u_2\), etc., please refer to the table on page 585 of Yager’s work [19].
References
Friedman N, Geiger D, Goldszmidt M (1997) Bayesian network classifiers. Mach Learn 29(2–3):131–163
Hall M (2007) A decision tree-based attribute weighting filter for naive Bayes. Knowl Based Syst 20(2):120–126
Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009) The WEKA data mining software: an update. ACM SIGKDD Explor Newsl 11(1):10–18
Huang S, Li J, Ye JP, Fleisher A, Chen KW, Wu T, Reiman E (2013) A sparse structure learning algorithm for Gaussian Bayesian network identification from high-dimensional data. IEEE Trans Patt Anal Mach Int 35(6):1328–1342
Janez D (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30
Jiang LX, Zhang H, Cai ZH (2009) A novel bayes model: hidden naive Bayes. IEEE Trans Knowl Data Eng 21(10):1361–1371
John GH, Langley P (1995) Estimating continuous distributions in Bayesian classifiers. Proc Int Uncertain Artif Intel 338–345.
Liang X, Wei CP, Chen ZM (2013) An intuitionistic fuzzy weighted OWA operator and its application. Int J Mach Learn Cyber 4(6):713–719
Mitchell T (1997) Machine learning. McGraw Hill, New York
Parzen E (1962) On estimation of a probability density function and mode. Ann Math Stat 33(3):1065–1076
Pérez A, Larrañaga P, Inza I (2009) Bayesian classifiers based on kernel density estimation: flexible classifiers. Int J Approx Reason 50(2):341–362
Philippot E, Santosh KC, Belaïd A, Belaïd Y (2014) Bayesian networks for incomplete data analysis in form processing. Int J Mach Learn Cyber. doi:10.1007/s13042-014-0234-4
Subrahmanya N, Shin YC (2013) A variational Bayesian framework for group feature selection. Int J Mach Learn Cybern 4(6):609–619
Wang XZ, He YL, Wand DD (2014) Non-naive Bayesian classifiers for classification problems with continuous attributes. IEEE Trans on Cybern 44(1):21–39
Webb GI, Boughton JR, Wang ZH (2005) Not so naive Bayes: aggregating one-dependence estimators. Mach Learn 58(1):5–24
Webb GI, Boughton JR, Zheng F, Ting KM, Salem H (2012) Learning by extrapolation from marginal to full-multivariate probability distributions: decreasingly naive Bayesian classification. Mach Learn 86(2):233–272
Yager RR (1998) On ordered weighted averaging aggregation operators in multicriteria decision making. IEEE Trans Syst Man Cybern 18(1):183–190
Yager RR (2005) Extending multicriteria decision making by mixing t-norms and OWA operators. Int J Intell Syst 20(4):453–474
Yager RR (2006) An extension of the naive Bayesian classifier. Inform Sci 176(5):577–588
Zaidi NA, Cerquides J, Carman MJ, Webb GI (2013) Alleviating naive Bayes attribute independence assumption by attribute weighting. J Mach Learn Res 14(1):1947–1988
Acknowledgments
The authors are very grateful for the editors and two anonymous reviewers. Their valuable and constructive comments and suggestions helped us significantly improve this work.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Chen, Ss., Cao, Jj., Gan, Ll. et al. Experimental study on generalization capability of extended naive Bayesian classifier. Int. J. Mach. Learn. & Cyber. 9, 5–19 (2018). https://doi.org/10.1007/s13042-014-0311-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13042-014-0311-8