Skip to main content

An Improved Approach for the Discovery of Causal Models via MML

  • Conference paper
  • First Online:
Advances in Knowledge Discovery and Data Mining (PAKDD 2002)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2336))

Included in the following conference series:

Abstract

Discovering a precise causal structure accurately reflecting the given data is one of the most essential tasks in the area of data mining and machine learning. One of the successful causal discovery approaches is the information-theoretic approach using the Minimum Message Length Principle[19]. This paper presents an improved and further experimental results of the MML discovery algorithm. We introduced a new encoding scheme for measuring the cost of describing the causal structure. Stiring function is also applied to further simplify the computational complexity and thus works more efficiently. The experimental results of the current version of the discovery system show that: (1) the current version is capable of discovering what discovered by previous system; (2) current system is capable of discovering more complicated causal models with large number of variables; (3) the new version works more efficiently compared with the previous version in terms of time complexity.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. H. L. Chin and G. F. Cooper. Stochastic simulation of Bayesian belief networks. In Proc. of 3rd Workshop on Uncertainty in AI, Seattle, 1987.

    Google Scholar 

  2. G. F. Cooper and E. Herskovits. A Bayesian method for constructing Bayesian belief networks from databases. In Proc. of 7th Conference on Uncertainty in AI. Morgan Kaufmann, 1991.

    Google Scholar 

  3. Honghua Dai, Kevin Korb, Chris Wallace, and Xindong Wu. A study of causal discovery with small samples and weak links. In Proceedings of the 15th International Joint Conference On Artificial Intelligence IJCAI’97, pages 1304–1309. Morgan Kaufmann Publishers, Inc., 1997.

    Google Scholar 

  4. Clark Glymour, Richard Scheines, Peter Spirtes, and Kevin Kelly. Discovering Causal Structure: Artificial Intelligence, Philosophy of Science, and Statistical Modeling. Academic Press, San Diego, 1987.

    MATH  Google Scholar 

  5. David Heckerman, Dan Geiger, and David M. Chickering. Learning bayesian networks: The combination of knowledge and statistical data. Machine Learning, 20(3):197–243, 1995.

    MATH  Google Scholar 

  6. Wai Lam and Fahiem Bacchus. Learning Bayesian belief networks: An approach based on the MDL principle. Computational Intelligence, 10:269–292, 1994.

    Article  Google Scholar 

  7. John C. Loehlin. Latent Variable Models: An Introduction to Factor, Path and Structural Analysis. Lawrence Erlbaum Associates, Hillsdale, New Jersey, second edition, 1992.

    Google Scholar 

  8. Richard Neapolitan. Probabilistic Reasoning in Expert Systems. Wiley, New York, 1990.

    Google Scholar 

  9. J.J. Oliver and R.A. Baxter. MML and Bayesianism: Similarities and differences. Tech Report 206, Computer Science, Monash University, 1994.

    Google Scholar 

  10. Judea Pearl. Probabilistic Reasoning in Intelligent Systems. Morgan Kaufmann, San Mateo, California, 1988.

    Google Scholar 

  11. R.W. Robinson. Counting unlabelled acyclic digraphs. In C.H.C. Little, editor, Lecture Notes in Mathematics: Combinatorial Mathematics V, pages 28–43. Springer-Verlag, 1977.

    Google Scholar 

  12. Peter Spirtes, Clark Glymour, and Richard Scheines. Causality from probability. In J.E. Tiles, G.T. McKee, and G.C. Dean, editors, Evolving Knowledge in Natural Science and Artificial Intelligence, London, 1990. Pitman.

    Google Scholar 

  13. Peter Spirtes, Clark Glymour, and Richard Scheines. Causation, Prediction, and Search. Springer-Verlag, New York, Berlin, Heideberg, 1993.

    MATH  Google Scholar 

  14. Peter Spirtes, Clark Glymour, and Richard Scheines. Causation, Prediction, and Search. MIT Press, New York, 2000.

    Google Scholar 

  15. Peter Spirtes, Clark Glymour, Richard Scheines, and C. Meek. TETRAD II: tools for causal modeling. Lawrence Erlbaum, Hillsdale, New Jersey, 1994.

    Google Scholar 

  16. Chris Wallace and David Boulton. An information measure for classification. Computer Journal, 11:185–194, 1968.

    MATH  Google Scholar 

  17. Chris Wallace and P.R. Freeman. Estimation and inference by compact coding. Journal of the Royal Statistical Society, B,49:240–252, 1987.

    MathSciNet  Google Scholar 

  18. Chris Wallace and Michael Georgeff. A general selection criterion for inductive inference. ECAI 84, Advances in Artificial Intelligence, pages 1–18, 1984.

    Google Scholar 

  19. Chris Wallace, Kevin Korb, and Honghua Dai. Causal discovery via MML. In Proceedings of the 13th International Conference on Machine Learning (ICML’96), pages 516–524, 1996.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Dai, H., Li, G. (2002). An Improved Approach for the Discovery of Causal Models via MML. In: Chen, MS., Yu, P.S., Liu, B. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2002. Lecture Notes in Computer Science(), vol 2336. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-47887-6_30

Download citation

  • DOI: https://doi.org/10.1007/3-540-47887-6_30

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-43704-8

  • Online ISBN: 978-3-540-47887-4

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics