Rollout Algorithms for Combinatorial Optimization
Purchase on Springer.com
$39.95 / €34.95 / £29.95*
Rent the article at a discountRent now
* Final gross prices may vary according to local VAT.
We consider the approximate solution of discrete optimization problems using procedures that are capable of magnifying the effectiveness of any given heuristic algorithm through sequential application. In particular, we embed the problem within a dynamic programming framework, and we introduce several types of rollout algorithms, which are related to notions of policy iteration. We provide conditions guaranteeing that the rollout algorithm improves the performance of the original heuristic algorithm. The method is illustrated in the context of a machine maintenance and repair problem.
- Barto, A.G., S.J. Bradtke, and S.P. Singh. (1995). "Learning to Act Using Real-Time Dynamic Programming." Artificial Intelligence 72, 81–138. CrossRef
- Barto, A.S. and R. Sutton. (1997). Reinforcement Learning. MIT Press (forthcoming).
- Bertsekas, D.P. and J.N. Tsitsiklis. (1996). Neuro-Dynamic Programming. Belmont, MA: Athena Scientific.
- Glover, F., E. Taillard, and D. deWerra. (1993). "A User's Guide to Tabu Search." Annals of Operations Research 41, 3–28. CrossRef
- Pattipati, K.R. and M.G. Alexandridis. (1990). "Application of Heuristic Search and Information Theory to Sequential Fault Diagnosis." IEEE Transactions on Systems, Man, and Cybernetics 20, 872–887. CrossRef
- Tesauro, G. and G.R. Galperin. (1996). "On-Line Policy Improvement Using Monte Carlo Search." Unpublished report.
- Rollout Algorithms for Combinatorial Optimization
Journal of Heuristics
Volume 3, Issue 3 , pp 245-262
- Cover Date
- Print ISSN
- Online ISSN
- Kluwer Academic Publishers
- Additional Links