Abstract
We consider the approximate solution of discrete optimization problems using procedures that are capable of magnifying the effectiveness of any given heuristic algorithm through sequential application. In particular, we embed the problem within a dynamic programming framework, and we introduce several types of rollout algorithms, which are related to notions of policy iteration. We provide conditions guaranteeing that the rollout algorithm improves the performance of the original heuristic algorithm. The method is illustrated in the context of a machine maintenance and repair problem.
Similar content being viewed by others
References
Barto, A.G., S.J. Bradtke, and S.P. Singh. (1995). "Learning to Act Using Real-Time Dynamic Programming." Artificial Intelligence 72, 81–138.
Barto, A.S. and R. Sutton. (1997). Reinforcement Learning. MIT Press (forthcoming).
Bertsekas, D.P. and J.N. Tsitsiklis. (1996). Neuro-Dynamic Programming. Belmont, MA: Athena Scientific.
Glover, F., E. Taillard, and D. deWerra. (1993). "A User's Guide to Tabu Search." Annals of Operations Research 41, 3–28.
Pattipati, K.R. and M.G. Alexandridis. (1990). "Application of Heuristic Search and Information Theory to Sequential Fault Diagnosis." IEEE Transactions on Systems, Man, and Cybernetics 20, 872–887.
Tesauro, G. and G.R. Galperin. (1996). "On-Line Policy Improvement Using Monte Carlo Search." Unpublished report.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Bertsekas, D.P., Tsitsiklis, J.N. & Wu, C. Rollout Algorithms for Combinatorial Optimization. Journal of Heuristics 3, 245–262 (1997). https://doi.org/10.1023/A:1009635226865
Issue Date:
DOI: https://doi.org/10.1023/A:1009635226865