Skip to main content
Log in

Experimental comparisons with respect to the usage of the promising relations in EDA-based causal discovery

  • CS and OR in Big Data and Cloud Com
  • Published:
Annals of Operations Research Aims and scope Submit manuscript

Abstract

A Bayesian network is a promising probabilistic model to represent causal relations between nodes (random variables). One of the major research issue in a Bayesian network is how to infer causal relations from a dataset by constructing better heuristic learning algorithms. Many kinds of approaches were so far introduced, and estimation of distribution algorithms (EDAs) are one of the promising causal discovery algorithms. However, the performance of EDAs is considerably dependent on the quality of the first population because new individuals are reproduced from the previous populations. In this paper, we introduce a new initialization method for EDAs that extracts promising candidate causal relations based on causal scores. Then, we used the promising relations to construct a better first population and to reproduce better individuals until the learning algorithm is terminated. Experimental results show that EDAs infer a more number of correct causal relations when promising relations were used in EDA based structure learning. It means that the performance of EDAs can be improved by providing better local search space, and it was the promising relations in this paper.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  • Armañanzas, R., Inza, I., Santana, R., Saeys, Y., Flores, J. L., Lozano, J. A., et al. (2008). A review of estimation of distribution algorithms in bioinformatics. BioData Mining, 1(6), 1–12.

    Google Scholar 

  • Baluja, S. (1994). Population-based incremental learning. A method for integrating genetic search based function optimization and competitive learning. Technical report, DTIC Document.

  • Baluja, S., & Davies, S. (1997). Using optimal dependency-trees for combinatorial optimization: Learning the structure of the search space. Technical report, DTIC Document.

  • Blanco, R., Inza, I., & Larranaga, P. (2003). Learning bayesian networks in the space of structures by estimation of distribution algorithms. International Journal of Intelligent Systems, 18(2), 205–220.

    Article  Google Scholar 

  • Butz, C. J., Hua, S., Chen, J., & Yao, H. (2009). A simple graphical approach for understanding probabilistic inference in bayesian networks. Information Sciences, 179(6), 699–716.

    Article  Google Scholar 

  • Chickering, D. M. (2002). Learning equivalence classes of bayesian-network structures. The Journal of Machine Learning Research, 2, 445–498.

    Google Scholar 

  • Daly, R., Shen, Q., & Aitken, S. (2011). Learning bayesian networks: Approaches and issues. The Knowledge Engineering Review, 26(02), 99–157.

    Article  Google Scholar 

  • De Bonet, J. S., Isbell, C. L., & Viola, P. (1997). Mimic: Finding optima by estimating probability densities. In M. Mozer et al. (Eds.), Advances in neural information processing systems (Vol. 9, pp. 424–430).

  • Ding, C., & Peng, W. (2014). A robust and efficient evolutionary algorithm based on probabilistic model. Journal of Computers, 9(6), 1462–1469.

    Article  Google Scholar 

  • Etxeberria, R., & Larranaga, P. (1999). Global optimization using bayesian networks. In Second symposium on artificial intelligence (CIMAF-99) (pp. 332–339). Habana, Cuba.

  • Handa, H. (2005). Estimation of distribution algorithms with mutation. In G. R. Raidl & J. Gottlieb (Eds.), Evolutionary computation in combinatorial optimization (pp. 112–121). Berlin: Springer.

  • Harik, G. R., Lobo, F. G., & Goldberg, D. E. (1999). The compact genetic algorithm. IEEE Transactions on Evolutionary Computation, 3(4), 287–297.

    Article  Google Scholar 

  • Ko, S., & Kim, D.-W. (2014). An efficient node ordering method using the conditional frequency for the k2 algorithm. Pattern Recognition Letters, 40, 80–87.

    Article  Google Scholar 

  • Larranaga, P., & Lozano, J. A. (2002). Estimation of distribution algorithms: A new tool for evolutionary computation (Vol. 2). Berlin: Springer.

    Google Scholar 

  • Li, Z., Li, P., Krishnan, A., & Liu, J. (2011). Large-scale dynamic gene regulatory network inference combining differential equation models with local dynamic bayesian network analysis. Bioinformatics, 27(19), 2686–2691.

    Article  Google Scholar 

  • Mühlenbein, H., & Paass, G. (1996). From recombination of genes to the estimation of distributions in. binary parameters. In W. Ebeling, I. Rechenberg, H.-P. Schwefel & H.-M. Voigt (Eds.), Parallel problem solving from naturePPSN IV (pp. 178–187). Berlin: Springer.

  • Neapolitan, R. E. (2004). Learning bayesian networks. Upper Saddle River: Pearson Prentice Hall.

  • Pearl, J. (2014). Probabilistic reasoning in intelligent systems: Networks of plausible inference. Burlington: Morgan Kaufmann.

    Google Scholar 

  • Pelikan, M., & Mühlenbein, H. (1999). The bivariate marginal distribution algorithm. In R. Roy, T. Furuhashi & P. K. Chawdhry (Eds.), Advances in soft computing (pp. 521–535). London: Springer.

  • Pelikan, M., & Sastry, K. (2009). Initial-population bias in the univariate estimation of distribution algorithm. In Proceedings of the 11th annual conference on genetic and evolutionary computation (pp. 429–436). ACM.

  • Pelikan, M. (2005). Hierarchical Bayesian optimization algorithm. Berlin: Springer.

    Book  Google Scholar 

  • Pelikan, M., Goldberg, D. E., & Cantu-Paz, E. (2000). Linkage problem, distribution estimation, and bayesian networks. Evolutionary Computation, 8(3), 311–340.

    Article  Google Scholar 

  • Pelikan, M., Goldberg, D. E., & Lobo, F. G. (2002). A survey of optimization by building and using probabilistic models. Computational Optimization and Applications, 21(1), 5–20.

    Article  Google Scholar 

  • Romero, T., Larrañaga, P., & Sierra, B. (2004). Learning bayesian networks in the space of orderings with estimation of distribution algorithms. International Journal of Pattern Recognition and Artificial Intelligence, 18(04), 607–625.

    Article  Google Scholar 

  • Santana, R., Armañanzas, R., Bielza, C., & Larrañaga, P. (2013). Network measures for information extraction in evolutionary algorithms. International Journal of Computational Intelligence Systems, 6(6), 1163–1188.

    Article  Google Scholar 

  • Yang, S., & Chang, K.-C. (2002). Comparison of score metrics for bayesian network learning. IEEE Transactions on Systems, Man and Cybernetics, Part A: Systems and Humans, 32(3), 419–428.

    Article  Google Scholar 

Download references

Acknowledgements

This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (2014R1A6A3A01058174) and by Ministry of Culture, Sports and Tourism (MCST) and Korea Creative Content Agency (KOCCA) in the Culture Technology (CT) Research & Development Program 2016.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dae-Won Kim.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ko, S., Lim, H., Ko, H. et al. Experimental comparisons with respect to the usage of the promising relations in EDA-based causal discovery. Ann Oper Res 265, 241–255 (2018). https://doi.org/10.1007/s10479-016-2390-2

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10479-016-2390-2

Keywords

Navigation