Skip to main content

Reinforcement Learning Estimation of Distribution Algorithm

  • Conference paper
  • First Online:
Genetic and Evolutionary Computation — GECCO 2003 (GECCO 2003)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2724))

Included in the following conference series:

Abstract

This paper proposes an algorithm for combinatorial optimizations that uses reinforcement learning and estimation of joint probability distribution of promising solutions to generate a new population of solutions. We call it Reinforcement Learning Estimation of Distribution Algorithm (RELEDA). For the estimation of the joint probability distribution we consider each variable as univariate. Then we update the probability of each variable by applying reinforcement learning method. Though we consider variables independent of one another, the proposed method can solve problems of highly correlated variables. To compare the efficiency of our proposed algorithm with other Estimation of Distribution Algorithms (EDAs) we provide the experimental results of the two problems: four peaks problem and bipolar function.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Baluja, S.: Population based incremental learning: A method for integrating genetic search based function optimization and competitive learning. Technical Report No. CMU-CS-94-163, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA (1994).

    Google Scholar 

  2. Baluja, S. and Caruana, R.: Removing the genetics from standard genetic algorithm. In A. Prieditis and S. Russell, editors, Proceedings of the International Conference on Machine Learning, Morgan Kaufmann, (1995) 38–46.

    Google Scholar 

  3. Baluja, S. and Davies, S.: Using optimal dependency trees for combinatorial optimization: Learning the structure of search space. Technical Report No. CMU-CS-97-107, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA (1997).

    Google Scholar 

  4. Berny, A.: Statistical Machine Learning and Combinatorial Optimization. In Kallel, L., Naudts, B. and Rogers, A., editors, Theoretical Aspects of Evolutionary Computing, Springer (2001).

    Google Scholar 

  5. De Bonet, J.S., Isbell, C.L. and Viola, P.: MIMIC: Finding Optima by estimating probability densities. Advances in Neural Information Processing Systems, 9 (1997).

    Google Scholar 

  6. Deb, K. and Goldberg, D.E.: Sufficient conditions for deceptive and easy binary functions. Annals of Mathematics and Artificial Intelligence, 10 (1994), 385–408.

    Article  MATH  MathSciNet  Google Scholar 

  7. Goldberg, D.E., Korb, B. and Deb, K.: Messy genetic algorithms: Motivation, analysis and first results. Complex Systems 3(5) (1989) 493–530.

    MATH  MathSciNet  Google Scholar 

  8. González, C., Lozano, J.A. and Larrañaga, P.: Mathematical modeling of discrete estimation of distribution algorithms. In P. Larrañaga and J.A. Lozano, editors, Estimation of Distribution Algorithms: A New Tool for Evolutionary Optimization. Kluwer Academic Publishers, Boston (2001).

    Google Scholar 

  9. Harik, G.: Learning gene linkage to efficiently solve problems of bounded difficulty using genetic algorithms. IlliGAL Report No. 97005, Illinois Genetic Algorithms Laboratory, University of Illinois, Urbana, Illinois, USA (1997).

    Google Scholar 

  10. Harik, G.: Linkage learning via probabilistic modeling in the ECGA. Illigal Report No. 99010, Illinois Genetic Algorithm Laboratory, University of Illinois, Urbana, Illinois, USA (1999).

    Google Scholar 

  11. Harik, G.R., Lobo, F.G. and Goldberg, D.E.: The compact genetic algorithm. In Proceedings of the IEEE Conference on Evolutionary Computation, (1998) 523–528

    Google Scholar 

  12. Holland, J.H.: Adaptation in Natural and Artificial Systems. The University of Michigan Press (1975).

    Google Scholar 

  13. Kargupta, H.: Revisiting the GEMGA: Scalable evolutionary optimization through linkage learning. In Proceedings of 1998 IEEE International Conference on Evolutionary Computation, IEEE Press, Piscataway, New Jersey, USA (1998) 603–608.

    Google Scholar 

  14. Larrañaga, P. and Lozano, J.A.: Estimation of Distribution Algorithms: A New Tool for Evolutionary Optimization. Kluwer Academic Publishers, Boston, (2001).

    Google Scholar 

  15. Mühlenbein, H.: The equation for response to selection and its use for prediction. Evolutionary Computation, 5(3) (1998) 303–346.

    Article  Google Scholar 

  16. Mühlenbein, H. and Mahnig, T.: The Factorized Distribution Algorithm for additively decomposed functions. Proceedings of the 1999 Congress on Evolutionary Computation, IEEE press (1999) 752–759.

    Google Scholar 

  17. Mühlenbein, H. and Paaß, G.: From recombination of genes to the estimation of distributions I. Binary parameters. In Lecture Notes in Computer Science 1411: Parallel Problem Solving from Nature-PPSN IV, (1996) 178–187.

    Chapter  Google Scholar 

  18. Paul, T.K. and Iba, H.: Linear and Combinatorial Optimizations by Estimation of Distribution Algorithms. 9th MPS Symposium on Evolutionary Computation, IPSJ Symposium 2003, Japan (2002), 99–106.

    Google Scholar 

  19. Pelikan, M., Goldberg, D.E. and Cantú-Paz, E.: Linkage Problem, Distribution Estimation and Bayesian Networks. Evolutionary Computation, 8(3) (2000) 311–340.

    Article  Google Scholar 

  20. Pelikan, M., Goldberg, D.E. and Lobo, F.G.: A survey of optimization by building and using probabilistic models. Technical Report, Illigal Report No. 99018, University of Illinois at Urbana-Champaign, USA (1999).

    Google Scholar 

  21. Pelikan, M. and Mühlenbein, H.: The bivariate marginal distribution algorithm. Advances in Soft Computing-Engineering Design and Manufacturing, (1999) 521–535.

    Google Scholar 

  22. Williams, R.J.: Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning 8 (1992) 229–256.

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Paul, T.K., Iba, H. (2003). Reinforcement Learning Estimation of Distribution Algorithm. In: Cantú-Paz, E., et al. Genetic and Evolutionary Computation — GECCO 2003. GECCO 2003. Lecture Notes in Computer Science, vol 2724. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45110-2_2

Download citation

  • DOI: https://doi.org/10.1007/3-540-45110-2_2

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-40603-7

  • Online ISBN: 978-3-540-45110-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics