Abstract
Improving on a previous paper, we explicitly relate reinforcement learning and selection learning (PBIL) algorithms for combinatorial optimization, understood here as the task of finding a fixed-length binary string that maximizes an arbitrary function. We show the equivalence of searching for an optimal string and searching for a probability distribution over strings that maximizes the expectation of the function; in this paper, however, we restrict attention to the family of Bernoulli distributions. We then introduce two gradient dynamical systems acting on probability vectors. The first maximizes the expectation of the function and leads to reinforcement learning algorithms, whereas the second maximizes the logarithm of that expectation and leads to selection learning algorithms. We conclude with a stability analysis of the solutions.
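The two dynamics named in the abstract can be sketched with a small Monte Carlo experiment. The snippet below is not from the paper; the OneMax objective, sample size, step size, and clipping bounds are illustrative assumptions. It estimates the gradient of the expectation of the function under a Bernoulli product distribution with the score-function (REINFORCE) estimator, derives the normalized gradient of its logarithm from it, and performs gradient ascent on the probability vector with either one.

```python
import random

def f(x):
    # Toy objective (OneMax): number of ones. Stands in for an arbitrary
    # function on fixed-length binary strings.
    return sum(x)

def sample(p):
    # Draw one binary string from the Bernoulli product distribution p.
    return [1 if random.random() < pi else 0 for pi in p]

def grad_estimates(p, n_samples=200):
    # Monte Carlo estimates of grad E_p[f] and grad log E_p[f] = grad E_p[f] / E_p[f].
    xs = [sample(p) for _ in range(n_samples)]
    fs = [f(x) for x in xs]
    mean_f = sum(fs) / n_samples
    grad = []
    for i, pi in enumerate(p):
        # Score-function (REINFORCE) estimator: E[f(x) * d log P(x) / d p_i],
        # where d log P(x) / d p_i = (x_i - p_i) / (p_i (1 - p_i)) for a Bernoulli bit.
        g = sum(fv * (x[i] - pi) / (pi * (1 - pi)) for x, fv in zip(xs, fs)) / n_samples
        grad.append(g)
    # Normalizing by the mean fitness gives the selection-style (log-expectation) gradient.
    grad_log = [g / mean_f for g in grad]
    return grad, grad_log

def ascend(p, lr=0.01, steps=300, use_log=False):
    # Gradient ascent on the probability vector; use_log selects the
    # selection-style dynamics, otherwise the reinforcement-style dynamics.
    for _ in range(steps):
        g, g_log = grad_estimates(p)
        d = g_log if use_log else g
        # Clip away from {0, 1} to keep the score-function estimator finite.
        p = [min(max(pi + lr * di, 0.05), 0.95) for pi, di in zip(p, d)]
    return p
```

Under both dynamics the probability vector drifts toward the all-ones corner on this toy objective; the normalized (log-expectation) variant rescales the step by the current mean fitness, which is what gives it its selection-like character.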
© 2000 Springer-Verlag Berlin Heidelberg
Cite this paper
Berny, A. (2000). Selection and Reinforcement Learning for Combinatorial Optimization. In: Schoenauer, M., et al. Parallel Problem Solving from Nature PPSN VI. PPSN 2000. Lecture Notes in Computer Science, vol 1917. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45356-3_59
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41056-0
Online ISBN: 978-3-540-45356-7