Skip to main content
Log in

Experimental study on generalization capability of extended naive Bayesian classifier

  • Original Article
  • Published:
International Journal of Machine Learning and Cybernetics Aims and scope Submit manuscript

Abstract

Extended naive Bayesian classifier (ENBC) is a general framework of NBCs, which is developed based on \(t\)-norm based ordered weighted averaging (\(t\)-OWA) operator and uses the weighted summation of products of margin probabilities to determine class-conditional probability. Since ENBC was proposed in 2006, there is no such a study which tests the performances of ENBC on the real classification datasets. Thus, in this paper we conduct an experimental investigation to ENBC’s generalization capability based on 44 benchmark KEEL and UCI datasets. The analysis shows that (1) ENBC is instable and its aggregation weights are sensitive to the order of training samples and (2) ENBC indeed has higher generalization capability than the existing NBCs, e.g., normal naive Bayesian and flexible naive Bayesian when its weight is properly selected.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

Notes

  1. http://sci2s.ugr.es/keel/category.php?cat=clas.

  2. http://archive.ics.uci.edu/ml/.

  3. Strictly speaking, Eqs. (3) and (4) are not correct. Regarding this issue, please refer to the footnote on page 339 of John and Langley’s work [7].

  4. For the meanings of mathematical symbols \(h_1\), \(h_2\), \(s_2\), \(t_1\), \(t_2\), \(t_3\), \(u_1\), and \(u_2\), etc., please refer to the table on page 585 of Yager’s work [19].

  5. http://www.cs.waikato.ac.nz/ml/weka/.

References

  1. Friedman N, Geiger D, Goldszmidt M (1997) Bayesian network classifiers. Mach Learn 29(2–3):131–163

    Article  MATH  Google Scholar 

  2. Hall M (2007) A decision tree-based attribute weighting filter for naive Bayes. Knowl Based Syst 20(2):120–126

    Article  Google Scholar 

  3. Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009) The WEKA data mining software: an update. ACM SIGKDD Explor Newsl 11(1):10–18

    Article  Google Scholar 

  4. Huang S, Li J, Ye JP, Fleisher A, Chen KW, Wu T, Reiman E (2013) A sparse structure learning algorithm for Gaussian Bayesian network identification from high-dimensional data. IEEE Trans Patt Anal Mach Int 35(6):1328–1342

    Article  Google Scholar 

  5. Janez D (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30

    MathSciNet  MATH  Google Scholar 

  6. Jiang LX, Zhang H, Cai ZH (2009) A novel bayes model: hidden naive Bayes. IEEE Trans Knowl Data Eng 21(10):1361–1371

    Article  Google Scholar 

  7. John GH, Langley P (1995) Estimating continuous distributions in Bayesian classifiers. Proc Int Uncertain Artif Intel 338–345.

  8. Liang X, Wei CP, Chen ZM (2013) An intuitionistic fuzzy weighted OWA operator and its application. Int J Mach Learn Cyber 4(6):713–719

    Article  Google Scholar 

  9. Mitchell T (1997) Machine learning. McGraw Hill, New York

    MATH  Google Scholar 

  10. Parzen E (1962) On estimation of a probability density function and mode. Ann Math Stat 33(3):1065–1076

    Article  MathSciNet  MATH  Google Scholar 

  11. Pérez A, Larrañaga P, Inza I (2009) Bayesian classifiers based on kernel density estimation: flexible classifiers. Int J Approx Reason 50(2):341–362

    Article  MATH  Google Scholar 

  12. Philippot E, Santosh KC, Belaïd A, Belaïd Y (2014) Bayesian networks for incomplete data analysis in form processing. Int J Mach Learn Cyber. doi:10.1007/s13042-014-0234-4

    Google Scholar 

  13. Subrahmanya N, Shin YC (2013) A variational Bayesian framework for group feature selection. Int J Mach Learn Cybern 4(6):609–619

    Article  Google Scholar 

  14. Wang XZ, He YL, Wand DD (2014) Non-naive Bayesian classifiers for classification problems with continuous attributes. IEEE Trans on Cybern 44(1):21–39

    Article  Google Scholar 

  15. Webb GI, Boughton JR, Wang ZH (2005) Not so naive Bayes: aggregating one-dependence estimators. Mach Learn 58(1):5–24

    Article  MATH  Google Scholar 

  16. Webb GI, Boughton JR, Zheng F, Ting KM, Salem H (2012) Learning by extrapolation from marginal to full-multivariate probability distributions: decreasingly naive Bayesian classification. Mach Learn 86(2):233–272

    Article  MathSciNet  MATH  Google Scholar 

  17. Yager RR (1998) On ordered weighted averaging aggregation operators in multicriteria decision making. IEEE Trans Syst Man Cybern 18(1):183–190

    Article  MATH  Google Scholar 

  18. Yager RR (2005) Extending multicriteria decision making by mixing t-norms and OWA operators. Int J Intell Syst 20(4):453–474

    Article  MATH  Google Scholar 

  19. Yager RR (2006) An extension of the naive Bayesian classifier. Inform Sci 176(5):577–588

    Article  MathSciNet  Google Scholar 

  20. Zaidi NA, Cerquides J, Carman MJ, Webb GI (2013) Alleviating naive Bayes attribute independence assumption by attribute weighting. J Mach Learn Res 14(1):1947–1988

    MathSciNet  MATH  Google Scholar 

Download references

Acknowledgments

The authors are very grateful for the editors and two anonymous reviewers. Their valuable and constructive comments and suggestions helped us significantly improve this work.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Si-si Chen.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chen, Ss., Cao, Jj., Gan, Ll. et al. Experimental study on generalization capability of extended naive Bayesian classifier. Int. J. Mach. Learn. & Cyber. 9, 5–19 (2018). https://doi.org/10.1007/s13042-014-0311-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13042-014-0311-8

Keywords

Navigation