Abstract
Estimation of distribution algorithms (EDAs) have been successfully applied to a wide variety of problems but, for the most complex approaches, there is no clear understanding of how these algorithms carry out the search. For that reason, in this work we exploit the probabilistic models that EDAs based on Bayesian networks are able to learn in order to provide new information about their behavior. In particular, we analyze the k solutions with the highest probability in the distributions estimated during the search. To study the relationship between the probabilistic model and the fitness function, we calculate, for the k most probable solutions (MPSs), the probability values, the function values and the correlation between both sets of values at each step of the algorithm. Furthermore, the objective function values of the k MPSs are compared with those of the k best individuals in the population. We complete the analysis by calculating the position of the optimum among the k MPSs during the search and the genotypic diversity of these solutions. We carry out the analysis by optimizing functions of different natures: Trap5, two variants of Ising spin glass, and Max-SAT. The results not only provide information about the relationship between the probabilistic model and the fitness function, but also allow us to observe characteristics of the search space, assess the quality of the parameter setup, and even distinguish between successful and unsuccessful runs.
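The core measurement described above (probabilities of the k MPSs, their function values, and the correlation between the two) can be illustrated with a minimal sketch. It makes several assumptions that are not taken from the chapter: the model is a hand-written chain-structured network rather than one learned by an EDA, the k MPSs are found by brute-force enumeration (feasible only for very small n), and the names `chain_probability`, `p_first` and `p_same` are hypothetical. Trap5 is written in its usual form, where a 5-bit block with u ones scores 5 if u = 5 and 4 - u otherwise.

```python
from itertools import product
from math import sqrt


def trap5(x):
    """Sum of deceptive 5-bit traps: a block with u ones scores 5 if u == 5, else 4 - u."""
    total = 0
    for i in range(0, len(x), 5):
        u = sum(x[i:i + 5])
        total += 5 if u == 5 else 4 - u
    return total


def chain_probability(x, p_first, p_same):
    """Joint probability under a hypothetical chain network X1 -> X2 -> ... -> Xn.

    p_first is P(X1 = 1); p_same is the probability that a variable
    takes the same value as its parent. Both are illustrative parameters.
    """
    prob = p_first if x[0] == 1 else 1.0 - p_first
    for prev, cur in zip(x, x[1:]):
        prob *= p_same if cur == prev else 1.0 - p_same
    return prob


def k_most_probable(n, k, p_first, p_same):
    """Brute-force k MPSs: score all 2**n points and keep the k most probable."""
    scored = [(chain_probability(x, p_first, p_same), x)
              for x in product((0, 1), repeat=n)]
    scored.sort(key=lambda t: t[0], reverse=True)
    return scored[:k]


def pearson(a, b):
    """Pearson correlation of two equal-length sequences; None if either is constant."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return None
    return cov / sqrt(var_a * var_b)


if __name__ == "__main__":
    # Two Trap5 blocks (n = 10), k = 8, illustrative model parameters.
    mps = k_most_probable(n=10, k=8, p_first=0.6, p_same=0.9)
    probs = [p for p, _ in mps]
    fits = [trap5(x) for _, x in mps]
    for p, x in mps:
        print(x, round(p, 5), trap5(x))
    print("correlation:", pearson(probs, fits))
```

In an actual EDA run the same three quantities would be recomputed at every generation from the model learned at that step, and enumeration would have to be replaced by an inference procedure that scales beyond a handful of variables.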
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Echegoyen, C., Mendiburu, A., Santana, R., Lozano, J.A. (2010). Analyzing the k Most Probable Solutions in EDAs Based on Bayesian Networks. In: Chen, Yp. (eds) Exploitation of Linkage Learning in Evolutionary Algorithms. Evolutionary Learning and Optimization, vol 3. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12834-9_8
DOI: https://doi.org/10.1007/978-3-642-12834-9_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12833-2
Online ISBN: 978-3-642-12834-9
eBook Packages: Engineering, Engineering (R0)