Skip to main content

Bayes-N: An Algorithm for Learning Bayesian Networks from Data Using Local Measures of Information Gain Applied to Classification Problems

  • Conference paper
MICAI 2004: Advances in Artificial Intelligence (MICAI 2004)

Abstract

Bayes-N is an algorithm for Bayesian network learning from data based on local measures of information gain, applied to problems in which there is a given dependent or class variable and a set of independent or explanatory variables from which we want to predict the class variable on new cases. Given this setting, Bayes-N induces an ancestral ordering of all the variables generating a directed acyclic graph in which the class variable is a sink variable, with a subset of the explanatory variables as its parents. It is shown that classification using this variables as predictors performs better than the naive bayes classifier, and at least as good as other algorithms that learn Bayesian networks such as K2, PC and Bayes-9. It is also shown that the MDL measure of the networks generated by Bayes-N is comparable to those obtained by these other algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Pearl, J.: Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference, 2nd edn. Morgan Kauffman Pub., Inc., San Francisco (1988)

    Google Scholar 

  2. Cowell, R.G., Dawid, A.P., Lauritzen, S.L.: Probabilistic Networks and Expert Systems. Springer, Heidelberg (1999)

    MATH  Google Scholar 

  3. Jensen, F.V.: Bayesian Networks and Decision Graphs. Springer, Heidelberg (2001)

    MATH  Google Scholar 

  4. Freidman, N., Geiger, D., Goldszmidt, S.: Bayesian Networks classifiers. Machine Learning 29, 131–161 (1997)

    Article  Google Scholar 

  5. Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Francisco (1993)

    Google Scholar 

  6. Martinez-Morales, M.: An Algorithm for the Induction of Probabilistic Networks from Data. XII Reunion Nacional de Inteligencia Artificial, ITESM, Cuernavaca, Morelos, Mexico, Limusa. (1995)

    Google Scholar 

  7. Spirtes, P., Glymour, C., Scheines, R.: An algorithm for fast recovery of sparse causal graphs. Social Science Computer Review 9, 62–72 (1991)

    Article  Google Scholar 

  8. Cruz-Ramirez, N.: Building Bayesian Networks From Data: a Constraint Based Approach. Ph D Thesis. Department of Psychology, The University of Sheffield (2001)

    Google Scholar 

  9. Cooper, G.F., Herskovits, E.: A Bayesian Method for the induction of probabilistic networks from data. Machine Learning 9, 309–347 (1992)

    MATH  Google Scholar 

  10. Kullback, S.: Information Theory and Statistics. Dover, New York (1949)

    Google Scholar 

  11. Ku, H.H., Varner, R.N., Kullback, S.: Analysis of Multidimensional Contingency Tables. J. Amer. Statist. Assoc. 66, 55–64 (1971)

    Article  MATH  MathSciNet  Google Scholar 

  12. Feinberg, S.E.: The Analysis of Cross-Classified Categorical Data. The MIT Press, Cambridge (1981)

    MATH  Google Scholar 

  13. Whittaker, J.: Graphical Models in Applied Multivariate Analysis. John Wiley, Chichester (1990)

    Google Scholar 

  14. Shannon, C.E., Weaver, W.: The mathematical theory of communication. University of Illinois Press, Urbana (1949)

    MATH  Google Scholar 

  15. Spirtes, P., Glymour, C.: Causation, Prediction and Search. Springer, Heidelberg (1993)

    MATH  Google Scholar 

  16. Bickel, P.J., Doksum, K.A.: Mathematical Statistics: Basic Ideas and Selected Topics. Holden Day, Inc., Oakland (1977)

    MATH  Google Scholar 

  17. Bland, J.M., Altman, D.G.: Multiple significance tests: the Bonferroni method. BMJ 310, 170 (1995)

    Google Scholar 

  18. Cruz-Ramirez, N., Martinez-Morales, M.: Un algoritmo para generar redes Bayesianas a partir de Datos estadísticos. Primer Encuentro Nacional de Computación, ENC 1997, Querétaro (1997)

    Google Scholar 

  19. Kohavi, R.: A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection. In: 14th International Joint Conference on Artificial Intelligence IJCAI 1995, Montreal, Canada, Morgan Kaufmann, San Francisco (1995)

    Google Scholar 

  20. Han, J., Kamber, M.: Data Mining. Concepts and Techniques. Morgan Kaufmann, San Francisco (2001)

    Google Scholar 

  21. Cooper, G.F.: An Overview of the Representation and Discovery of Causal Relationships using Bayesian Networks. In: Glymour, C., Cooper, G.F. (eds.) Computation, Causation & Discovery, vol. 3-62, AAAI Press / MIT Press (1999)

    Google Scholar 

  22. Cross, S.S., Dube, A.K., et al.: Evaluation of a statistically derived decision tree for the cytodiagnosis of fine needle aspirates of the breast (FNAB). Cytopathology 8, 178–187 (1998)

    Article  Google Scholar 

  23. Cross, S.S., Downs, J., et al.: Which Decision Support Technologies Are Appropriate for the Cytodiagnosis of Breast Cancer? In: Jain, A., Jain, A., Jain, S., Jain, L. (eds.) Artificial Intelligence Techniques in Breast Cancer Diagnosis and Prognosis, vol. 39, pp. 265–295. World Scientific, Singapore (2000)

    Chapter  Google Scholar 

  24. Norsys. Norsys Software Corporation, Electronic source (2001), http://www.norsys.com

  25. Heckerman, D., Geiger, D., et al.: Learning Bayesian Networks: The combination of knowledge and statistical data, Technical Report MSR-TR-94-09, Microsoft Research, Redmond, Washington (1994)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Martínez-Morales, M., Cruz-Ramírez, N., Jiménez-Andrade, J.L., Garza-Domínguez, R. (2004). Bayes-N: An Algorithm for Learning Bayesian Networks from Data Using Local Measures of Information Gain Applied to Classification Problems. In: Monroy, R., Arroyo-Figueroa, G., Sucar, L.E., Sossa, H. (eds) MICAI 2004: Advances in Artificial Intelligence. MICAI 2004. Lecture Notes in Computer Science(), vol 2972. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24694-7_54

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-24694-7_54

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-21459-5

  • Online ISBN: 978-3-540-24694-7

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics