Bayes-N: An Algorithm for Learning Bayesian Networks from Data Using Local Measures of Information Gain Applied to Classification Problems

Martínez-Morales, Manuel; Cruz-Ramírez, Nicandro; Jiménez-Andrade, José Luis; Garza-Domínguez, Ramiro

doi:10.1007/978-3-540-24694-7_54

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 2972)

Included in the following conference series:

Mexican International Conference on Artificial Intelligence

Abstract

Bayes-N is an algorithm for learning Bayesian networks from data based on local measures of information gain. It is applied to problems in which there is a given dependent (class) variable and a set of independent (explanatory) variables from which we want to predict the class variable for new cases. Given this setting, Bayes-N induces an ancestral ordering of all the variables, generating a directed acyclic graph in which the class variable is a sink variable with a subset of the explanatory variables as its parents. It is shown that classification using these variables as predictors performs better than the naive Bayes classifier, and at least as well as other algorithms that learn Bayesian networks, such as K2, PC and Bayes-9. It is also shown that the MDL measure of the networks generated by Bayes-N is comparable to that of the networks obtained by these other algorithms.
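
The abstract does not state the exact local measure that Bayes-N uses, so the following is only an illustrative sketch of one common notion of information gain: the reduction in Shannon entropy of a class variable after conditioning on a single explanatory variable. The function names (`entropy`, `information_gain`, `rank_by_gain`) and the toy data are assumptions made for this example and are not taken from the paper.

```python
import math
from collections import Counter


def entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    n = len(values)
    counts = Counter(values)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def information_gain(feature, labels):
    """Estimate H(labels) - H(labels | feature) from paired samples."""
    n = len(labels)
    groups = {}
    for x, y in zip(feature, labels):
        groups.setdefault(x, []).append(y)
    # Weighted average of the class entropy within each feature value.
    conditional = sum(len(ys) / n * entropy(ys) for ys in groups.values())
    return entropy(labels) - conditional


def rank_by_gain(columns, labels):
    """Sort explanatory variables by decreasing gain (hypothetical helper)."""
    gains = {name: information_gain(col, labels) for name, col in columns.items()}
    return sorted(gains.items(), key=lambda kv: kv[1], reverse=True)


# Toy data (assumed): x1 determines the class, x2 is independent of it.
labels = [1, 1, 0, 0]
columns = {"x1": [1, 1, 0, 0], "x2": [1, 0, 1, 0]}
ranking = rank_by_gain(columns, labels)
```

In this toy case `x1` ranks first because it removes all uncertainty about the class, while `x2` removes none. A ranking of this kind is one plausible way to select candidate parents for a class variable; whether Bayes-N proceeds this way is not confirmed by the abstract.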

Author information

Authors and Affiliations

Facultad de Física e Inteligencia Artificial, Universidad Veracruzana, Xalapa, Veracruz, México
Manuel Martínez-Morales, José Luis Jiménez-Andrade & Ramiro Garza-Domínguez
Laboratorio Nacional de Informática Avanzada (LANIA), Xalapa, Veracruz, México
Nicandro Cruz-Ramírez

Editor information

Editors and Affiliations

Computer Science Department, Tecnológico de Monterrey, Campus Estado de México, Carretera al lago de Guadalupe, Km 3.5, Atizapán, 52926, Mexico
Raúl Monroy
Instituto de Investigaciones Electricas, Reforma # 113, Col. Palmira, 62490, Morelos, Cuernavaca, Mexico
Gustavo Arroyo-Figueroa
Instituto Nacional de Astrofísica, Óptica y Electrónica, Luis Enrique Erro No. 1, 72840, Puebla, México
Luis Enrique Sucar
Centro de Investigación en Computación – IPN, Av. Juan de Dios Batíz, esquina con Miguel Othón de Mendizábal, Ciudad de México, 07738, México
Humberto Sossa

Cite this paper

Martínez-Morales, M., Cruz-Ramírez, N., Jiménez-Andrade, J.L., Garza-Domínguez, R. (2004). Bayes-N: An Algorithm for Learning Bayesian Networks from Data Using Local Measures of Information Gain Applied to Classification Problems. In: Monroy, R., Arroyo-Figueroa, G., Sucar, L.E., Sossa, H. (eds) MICAI 2004: Advances in Artificial Intelligence. MICAI 2004. Lecture Notes in Computer Science, vol 2972. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24694-7_54

DOI: https://doi.org/10.1007/978-3-540-24694-7_54
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21459-5
Online ISBN: 978-3-540-24694-7
eBook Packages: Springer Book Archive