Abstract
Bayes-N is an algorithm for Bayesian network learning from data based on local measures of information gain, applied to problems in which there is a given dependent or class variable and a set of independent or explanatory variables from which we want to predict the class variable on new cases. Given this setting, Bayes-N induces an ancestral ordering of all the variables generating a directed acyclic graph in which the class variable is a sink variable, with a subset of the explanatory variables as its parents. It is shown that classification using this variables as predictors performs better than the naive bayes classifier, and at least as good as other algorithms that learn Bayesian networks such as K2, PC and Bayes-9. It is also shown that the MDL measure of the networks generated by Bayes-N is comparable to those obtained by these other algorithms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Pearl, J.: Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference, 2nd edn. Morgan Kauffman Pub., Inc., San Francisco (1988)
Cowell, R.G., Dawid, A.P., Lauritzen, S.L.: Probabilistic Networks and Expert Systems. Springer, Heidelberg (1999)
Jensen, F.V.: Bayesian Networks and Decision Graphs. Springer, Heidelberg (2001)
Freidman, N., Geiger, D., Goldszmidt, S.: Bayesian Networks classifiers. Machine Learning 29, 131–161 (1997)
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Francisco (1993)
Martinez-Morales, M.: An Algorithm for the Induction of Probabilistic Networks from Data. XII Reunion Nacional de Inteligencia Artificial, ITESM, Cuernavaca, Morelos, Mexico, Limusa. (1995)
Spirtes, P., Glymour, C., Scheines, R.: An algorithm for fast recovery of sparse causal graphs. Social Science Computer Review 9, 62–72 (1991)
Cruz-Ramirez, N.: Building Bayesian Networks From Data: a Constraint Based Approach. Ph D Thesis. Department of Psychology, The University of Sheffield (2001)
Cooper, G.F., Herskovits, E.: A Bayesian Method for the induction of probabilistic networks from data. Machine Learning 9, 309–347 (1992)
Kullback, S.: Information Theory and Statistics. Dover, New York (1949)
Ku, H.H., Varner, R.N., Kullback, S.: Analysis of Multidimensional Contingency Tables. J. Amer. Statist. Assoc. 66, 55–64 (1971)
Feinberg, S.E.: The Analysis of Cross-Classified Categorical Data. The MIT Press, Cambridge (1981)
Whittaker, J.: Graphical Models in Applied Multivariate Analysis. John Wiley, Chichester (1990)
Shannon, C.E., Weaver, W.: The mathematical theory of communication. University of Illinois Press, Urbana (1949)
Spirtes, P., Glymour, C.: Causation, Prediction and Search. Springer, Heidelberg (1993)
Bickel, P.J., Doksum, K.A.: Mathematical Statistics: Basic Ideas and Selected Topics. Holden Day, Inc., Oakland (1977)
Bland, J.M., Altman, D.G.: Multiple significance tests: the Bonferroni method. BMJ 310, 170 (1995)
Cruz-Ramirez, N., Martinez-Morales, M.: Un algoritmo para generar redes Bayesianas a partir de Datos estadísticos. Primer Encuentro Nacional de Computación, ENC 1997, Querétaro (1997)
Kohavi, R.: A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection. In: 14th International Joint Conference on Artificial Intelligence IJCAI 1995, Montreal, Canada, Morgan Kaufmann, San Francisco (1995)
Han, J., Kamber, M.: Data Mining. Concepts and Techniques. Morgan Kaufmann, San Francisco (2001)
Cooper, G.F.: An Overview of the Representation and Discovery of Causal Relationships using Bayesian Networks. In: Glymour, C., Cooper, G.F. (eds.) Computation, Causation & Discovery, vol. 3-62, AAAI Press / MIT Press (1999)
Cross, S.S., Dube, A.K., et al.: Evaluation of a statistically derived decision tree for the cytodiagnosis of fine needle aspirates of the breast (FNAB). Cytopathology 8, 178–187 (1998)
Cross, S.S., Downs, J., et al.: Which Decision Support Technologies Are Appropriate for the Cytodiagnosis of Breast Cancer? In: Jain, A., Jain, A., Jain, S., Jain, L. (eds.) Artificial Intelligence Techniques in Breast Cancer Diagnosis and Prognosis, vol. 39, pp. 265–295. World Scientific, Singapore (2000)
Norsys. Norsys Software Corporation, Electronic source (2001), http://www.norsys.com
Heckerman, D., Geiger, D., et al.: Learning Bayesian Networks: The combination of knowledge and statistical data, Technical Report MSR-TR-94-09, Microsoft Research, Redmond, Washington (1994)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Martínez-Morales, M., Cruz-Ramírez, N., Jiménez-Andrade, J.L., Garza-Domínguez, R. (2004). Bayes-N: An Algorithm for Learning Bayesian Networks from Data Using Local Measures of Information Gain Applied to Classification Problems. In: Monroy, R., Arroyo-Figueroa, G., Sucar, L.E., Sossa, H. (eds) MICAI 2004: Advances in Artificial Intelligence. MICAI 2004. Lecture Notes in Computer Science(), vol 2972. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24694-7_54
Download citation
DOI: https://doi.org/10.1007/978-3-540-24694-7_54
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21459-5
Online ISBN: 978-3-540-24694-7
eBook Packages: Springer Book Archive