Analyzing the k Most Probable Solutions in EDAs Based on Bayesian Networks

Echegoyen, Carlos; Mendiburu, Alexander; Santana, Roberto; Lozano, Jose A.

doi:10.1007/978-3-642-12834-9_8

Carlos Echegoyen⁴,
Alexander Mendiburu⁴,
Roberto Santana⁵ &
…
Jose A. Lozano⁴

Part of the book series: Evolutionary Learning and Optimization ((ALO,volume 3))

748 Accesses
2 Citations

Abstract

Estimation of distribution algorithms (EDAs) have been successfully applied to a wide variety of problems but, for themost complex approaches, there is no clear understanding of the way these algorithms complete the search. For that reason, in this work we exploit the probabilistic models that EDAs based on Bayesian networks are able to learn in order to provide new information about their behavior. Particularly, we analyze the k solutions with the highest probability in the distributions estimated during the search. In order to study the relationship between the probabilistic model and the fitness function, we focus on calculating, for the k most probable solutions (MPSs), the probability values, the function values and the correlation between both sets of values at each step of the algorithm. Furthermore, the objective functions of the k MPSs are contrasted with the k best individuals in the population. We complete the analysis by calculating the position of the optimum in the k MPSs during the search and the genotypic diversity of these solutions. We carry out the analysis by optimizing functions of different natures such as Trap5, two variants of Ising spin glass and Max-SAT. The results not only show information about the relationship between the probabilistic model and the fitness function, but also allow us to observe characteristics of the search space, the quality of the setup of the parameters and even distinguish between successful and unsuccessful runs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Anumanchipalli, G.K., Ravishankar, M., Reddy, R.: Improving pronunciation inference using n-best list, acoustics and orthography. In: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing. ICASSP 2007, Honolulu, HI, vol. IV, pp. 925–928 (2007)
Google Scholar
Armañanzas, R., Inza, I., Santana, R., Saeys, Y., Flores, J.L., Lozano, J.A., Van de Peer, Y., Blanco, R., Robles, V., Bielza, C., Larrañaga, P.: A review of estimation of distribution algorithms in bioinformatics. BioData Mining 1(6) (2008)
Google Scholar
Barahona, F.: On the computational complexity of Ising spin glass model. Journal of Physics A: Mathematical and General 15(10) (1982)
Google Scholar
Brownlee, S., McCall, J., Brown, D.: Solving the MAXSAT problem using a multivariate EDA based on Markov networks. In: Proceedings of the 2007 conference on Genetic and Evolutionary Computation (GECCO-2007), London, England, pp. 2423–2428. ACM, New York (2007)
Chapter Google Scholar
Brownlee, S., McCall, J., Zhang, Q., Brown, D.: Approaches to selection and their effect on fitness modelling in an estimation of distribution algorithm. In: Proceedings of the 2008 Congress on Evolutionary Computation (CEC 2008), Hong Kong, pp. 2621–2628. IEEE Press, Los Alamitos (2008)
Google Scholar
Buntine, W.: Theory refinement on Bayesian networks. In: Proceedings of the Seventh Conference on Uncertainty in Artificial Intelligence, pp. 52–60 (1991)
Google Scholar
Castillo, E., Gutierrez, J.M., Hadi, A.S.: Expert Systems and Probabilistic Network Models. Springer, Heidelberg (1997)
Google Scholar
Cook, S.A.: The complexity of theorem-proving procedures. In: Proceedings of the Third Annual ACM Symposium on Theory of Computing, pp. 151–158. Shaker Heights, Ohio (1971)
Chapter Google Scholar
Cowell, R.: Sampling without replacement in junction trees. Technical Report Statistical Research Paper 15, City University, London (1997)
Google Scholar
Deb, K., Goldberg, D.E.: Sufficient conditions for deceptive and easy binary functions. Annals of Mathematics and Artificial Intelligence 10, 385–408 (1994)
Article MATH MathSciNet Google Scholar
Echegoyen, C., Lozano, J.A., Santana, R., Larrañaga, P.: Exact Bayesian network learning in estimation of distribution algorithms. In: Proceedings of the 2007 Congress on Evolutionary Computation (CEC 2007), pp. 1051–1058. IEEE Press, Los Alamitos (2007)
Chapter Google Scholar
Echegoyen, C., Mendiburu, A., Santana, R., Lozano, J.: Analyzing the probability of the optimum in EDAs based on Bayesian networks. In: Proceedings of the 2009 Congress on Evolutionary Computation (CEC 2009), Trondheim, Norway, pp. 1652–1659. IEEE Press, Los Alamitos (2009)
Chapter Google Scholar
Echegoyen, C., Mendiburu, A., Santana, R., Lozano, J.: A quantitative analysis of estimation of distribution algorithms based on Bayesian networks. Technical Report EHU-KZAA-TR-2-2009, Department of Computer Science and Artificial Intelligence (2009)
Google Scholar
Echegoyen, C., Santana, R., Lozano, J., Larrañaga, P.: The Impact of Exact Probabilistic Learning Algorithms in EDAs Based on Bayesian Networks. In: Linkage in Evolutionary Computation, pp. 109–139. Springer, Heidelberg (2008)
Chapter Google Scholar
Etxeberria, R., Larrañaga, P.: Global optimization using Bayesian networks. In: Ochoa, A., Soto, M.R., Santana, R. (eds.) Proceedings of the Second Symposium on Artificial Intelligence (CIMAF 1999), Havana, Cuba, pp. 151–173 (1999)
Google Scholar
Friedman, N., Yakhini, Z.: On the sample complexity of learning Bayesian networks. In: Proceedings of the 12th Conference on Uncertainty in Artificial Intelligence (UAI 1996), pp. 274–282. Morgan Kaufmann, San Francisco (1996)
Google Scholar
Goldberg, D.E.: Genetic Algorithms in Search, Optimization, and Machine Learning. Addison-Wesley, Reading (1989)
MATH Google Scholar
Hauschild, M., Pelikan, M.: Enhancing efficiency of hierarchical BOA via distance-based model restrictions. MEDAL Report No. 2008007, Missouri Estimation of Distribution Algorithms Laboratory (MEDAL) (April 2008)
Google Scholar
Hauschild, M., Pelikan, M., Sastry, K., Lima, C.: Analyzing Probabilistic Models in Hierarchical BOA. IEEE Transactions on Evolutionary Computation (to appear)
Google Scholar
Höns, R., Santana, R., Larrañaga, P., Lozano, J.A.: Optimization by max-propagation using Kikuchi approximations. Technical Report EHU-KZAA-IK-2/07, Department of Computer Science and Artificial Intelligence, University of the Basque Country (November 2007)
Google Scholar
Hoos, H., Stutzle, T.: SATLIB: An online resource for research on SAT. In: van Maaren, H., Gent, I.P., Walsh, T. (eds.) SAT 2000, pp. 283–292. IOS Press, Amsterdam (2000)
Google Scholar
Ising, E.: The theory of ferromagnetism. Zeitschrift fuer Physik 31, 253–258 (1925)
Article Google Scholar
Larrañaga, P., Lozano, J.A. (eds.): Estimation of Distribution Algorithms. A New Tool for Evolutionary Computation. Kluwer Academic Publishers, Dordrecht (2002)
MATH Google Scholar
Lima, C.F., Lobo, F.G., Pelikan, M.: From mating pool distributions to model overfitting. In: Proceedings of the ACM Genetic and Evolutionary Computation Conference (GECCO 2008), pp. 431–438. IEEE Press, Los Alamitos (2008)
Chapter Google Scholar
Lima, C.F., Pelikan, M., Goldberg, D.E., Lobo, F.G., Sastry, K., Hauschild, M.: Influence of selection and replacement strategies on linkage learning in BOA. In: Proceedings of the 2007 Congress on Evolutionary Computation (CEC 2007), pp. 1083–1090. IEEE Press, Los Alamitos (2007)
Chapter Google Scholar
Lima, C.F., Pelikan, M., Lobo, F.G., Goldberg, D.E.: Loopy Substructural Local Search for the Bayesian Optimization Algorithm. In: Engineering Stochastic Local Search Algorithms. Designing, Implementing and Analyzing Effective Heuristics, pp. 61–75. Springer, Heidelberg (2009)
Chapter Google Scholar
Lozano, J.A., Larrañaga, P., Inza, I., Bengoetxea, E. (eds.): Towards a New Evolutionary Computation: Advances on Estimation of Distribution Algorithms. Springer, Heidelberg (2006)
MATH Google Scholar
Mendiburu, A., Santana, R., Lozano, J.A.: Introducing belief propagation in estimation of distribution algorithms: A parallel framework. Technical Report EHU-KAT-IK-11/07, Department of Computer Science and Artificial Intelligence, University of the Basque Country (October 2007)
Google Scholar
Mühlenbein, H., Mahnig, T.: FDA – a scalable evolutionary algorithm for the optimization of additively decomposed functions. Evolutionary Computation 7(4), 353–376 (1999)
Article Google Scholar
Mühlenbein, H., Mahnig, T.: Evolutionary synthesis of Bayesian networks for optimization. In: Patel, M., Honavar, V., Balakrishnan, K. (eds.) Advances in Evolutionary Synthesis of Intelligent Agents, pp. 429–455. MIT Press, Cambridge (2001)
Google Scholar
Mühlenbein, H., Mahnig, T., Ochoa, A.: Schemata, distributions and graphical models in evolutionary optimization. Journal of Heuristics 5(2), 213–247 (1999)
Article Google Scholar
Mühlenbein, H., Paaß, G.: From recombination of genes to the estimation of distributions I. Binary parameters. In: Ebeling, W., Rechenberg, I., Voigt, H.-M., Schwefel, H.-P. (eds.) PPSN 1996. LNCS, vol. 1141, pp. 178–187. Springer, Heidelberg (1996)
Chapter Google Scholar
Murphy, K.: The Bayes Net Toolbox for Matlab. In: Computer science and Statistics: Proceedings of Interface, vol. 33 (2001)
Google Scholar
Nilsson, D.: An efficient algorithm for finding the M most probable configurations in probabilistic expert systems. Statistics and Computing 2, 159–173 (1998)
Article Google Scholar
Nilsson, D., Goldberger, J.: Sequentially finding the n-best list in Hidden Markov Models. In: Proceedings of he Seventeenth International Joint Conference on Artificial Intelligence, IJCAI 2001 (2001)
Google Scholar
Pelikan, M.: Hierarchical Bayesian Optimization Algorithm. Toward a New Generation of Evolutionary Algorithms. Studies in Fuzziness and Soft Computing. Springer, Heidelberg (2005)
MATH Google Scholar
Pelikan, M., Goldberg, D.E.: Hierarchical BOA solves Ising Spin Glasses and MAXSAT. In: Cantú-Paz, E., Foster, J.A., Deb, K., Davis, L., Roy, R., O’Reilly, U.-M., Beyer, H.-G., Kendall, G., Wilson, S.W., Harman, M., Wegener, J., Dasgupta, D., Potter, M.A., Schultz, A., Dowsland, K.A., Jonoska, N., Miller, J., Standish, R.K. (eds.) GECCO 2003. LNCS, vol. 2724, pp. 1271–1282. Springer, Heidelberg (2003)
Chapter Google Scholar
Pelikan, M., Goldberg, D.E., Cantú-Paz, E.: BOA: The Bayesian optimization algorithm. In: Proceedings of the Genetic and Evolutionary Computation Conference (GECCO1999), Orlando, FL, vol. I, pp. 525–532. Morgan Kaufmann Publishers, San Francisco (1999)
Google Scholar
Pelikan, M., Hartmann, A.K.: Searching for ground states of Ising spin glasses with hierarchical BOA and cluster exact approximation. In: Pelikan, M., Sastry, K., Cantú-Paz, E. (eds.) Scalable Optimization via Probabilistic Modeling: From Algorithms to Applications. Studies in Computational Intelligence, pp. 333–349. Springer, Heidelberg (2006)
Chapter Google Scholar
Pelikan, M., Sastry, K., Cantú-Paz, E. (eds.): Scalable Optimization via Probabilistic Modeling: From Algorithms to Applications. Studies in Computational Intelligence. Springer, Heidelberg (2006)
MATH Google Scholar
Santana, R.: Advances in Probabilistic Graphical Models for Optimization and Learning. Applications in Protein Modelling. PhD thesis, University of the Basque Country (2006)
Google Scholar
Santana, R., Echegoyen, C., Mendiburu, A., Bielza, C., Lozano, J.A., Larrañaga, P., Armañanzas, R., Shakya, S.: MATEDA: A suite of EDA programs in matlab. Technical Report EHU-KZAA-IK-2/09, Department of Computer Science and Artificial Intelligence (2009)
Google Scholar
Santana, R., Larrañaga, P., Lozano, J.A.: Research topics on discrete estimation of distribution algorithms. Memetic Computing 1(1), 35–54 (2009)
Article Google Scholar
Schwarz, G.: Estimating the dimension of a model. Annals of Statistics 7(2), 461–464 (1978)
Article Google Scholar
Yanover, C., Weiss, Y.: Approximate inference and protein-folding. In: Becker, S., Thrun, S., Obermayer, K. (eds.) Advances in Neural Information Processing Systems, vol. 15, pp. 1457–1464. MIT Press, Cambridge (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

Intelligent Systems Group, Department of Computer Science and Artificial Intelligence, University of the Basque Country, Paseo Manuel de Lardizábal 1, 20080, San Sebastián, Donostia, Spain
Carlos Echegoyen, Alexander Mendiburu & Jose A. Lozano
Universidad Politécnica de Madrid, Campus de Montegacedo sn., 28660, Boadilla del Monte, Madrid, Spain
Roberto Santana

Authors

Carlos Echegoyen
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Mendiburu
View author publications
You can also search for this author in PubMed Google Scholar
Roberto Santana
View author publications
You can also search for this author in PubMed Google Scholar
Jose A. Lozano
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Natural Computing Laboratory Department of Computer Science, National Chiao Tung University, 1001 Ta Hsueh Road, 300, HsinChu City, Taiwan
Ying-ping Chen

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Echegoyen, C., Mendiburu, A., Santana, R., Lozano, J.A. (2010). Analyzing the k Most Probable Solutions in EDAs Based on Bayesian Networks. In: Chen, Yp. (eds) Exploitation of Linkage Learning in Evolutionary Algorithms. Evolutionary Learning and Optimization, vol 3. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12834-9_8

Download citation

DOI: https://doi.org/10.1007/978-3-642-12834-9_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12833-2
Online ISBN: 978-3-642-12834-9
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics