Abstract
Rollout algorithms have demonstrated excellent performance on a variety of dynamic and discrete optimization problems. Interpreted as an approximate dynamic programming algorithm, a rollout algorithm estimates the value-to-go at each decision stage by simulating future events while following a heuristic policy, referred to as the base policy. While in many cases rollout algorithms are guaranteed to perform as well as their base policies, there have been few theoretical results showing additional improvement in performance. In this paper, we perform a probabilistic analysis of the subset sum problem and 0–1 knapsack problem, giving theoretical evidence that rollout algorithms perform strictly better than their base policies. Using a stochastic model from the existing literature, we analyze two rollout methods that we refer to as the exhaustive rollout and consecutive rollout, both of which employ a simple greedy base policy. We prove that both methods yield a significant improvement in expected performance after a single iteration of the rollout algorithm, relative to the base policy.
Similar content being viewed by others
References
Bertsekas, D.P., Tsitsiklis, J., Wu, C.: Rollout algorithms for combinatorial optimization. J. Heuristics 3, 245–262 (1997)
Bertsekas, D.P., Castanon, D.A.: Rollout algorithms for stochastic scheduling problems. J. Heuristics 5, 89–108 (1999)
Bertsekas, D.P.: Dynamic Programming and Optimal Control, 3rd edn. Athena Scientific, Belmont, MA (2007)
Bertazzi, L.: Minimum and worst-case performance ratios of rollout algorithms. J. Optim. Theory Appl. 152, 378–393 (2012)
Borgwardt, K., Tremel, B.: The average quality of greedy-algorithms for the subset-sum-maximization problem. Math. Methods Oper. Res. 35, 113–149 (1991)
Kellerer, H., Pferschy, U., Pisinger, D.: Knapsack Problems. Springer, Berlin (2004)
Martello, S., Toth, P.: Knapsack Problems: Algorithms and Computer Implementations. Wiley, New York (1990)
Tesauro, G., Galperin, G.R.: On-line policy improvement using monte-carlo search. Adv. Neural Inf. Process. Syst. 9, 1068–1074 (1997)
Secomandi, N.: A rollout policy for the vehicle routing problem with stochastic demands. Oper. Res. 49, 796–802 (2001)
Tu, F., Pattipati, K.: Rollout strategies for sequential fault diagnosis. In: AUTOTESTCON Proceedings, pp. 269–295. IEEE (2002)
Li, Y., Krakow, L.W., Chong, E.K.P., Groom, K.N.: Approximate stochastic dynamic programming for sensor scheduling to track multiple targets. Digit. Signal Process. 19, 978–989 (2009)
Martello, S., Toth, P.: Worst-case analysis of greedy algorithms for the subset-sum problem. Math. Program. 28, 198–205 (1984)
D’Atri, G., Puech, C.: Probabilistic analysis of the subset-sum problem. Discret. Appl. Math. 4, 329–334 (1982)
Pferschy, U.: Stochastic analysis of greedy algorithms for the subset sum problem. Cent. Eur. J. Oper. Res. 7, 53–70 (1999)
Szkatula, K., Libura, M.: Probabilistic analysis of simple algorithms for binary knapsack problem. Control Cybern. 12, 147–157 (1983)
Szkatula, K., Libura, M.: On probabilistic properties of greedy-like algorithms for the binary knapsack problem. In: Proceedings of Advanced School on Stochastics in Combinatorial Optimization pp. 233–254 (1987)
Diubin, G., Korbut, A.: The average behaviour of greedy algorithms for the knapsack problem: general distributions. Math. Methods Oper. Res. 57, 449–479 (2003)
Calvin, J.M., Leung, J.Y.T.: Average-case analysis of a greedy algorithm for the 0/1 knapsack problem. Oper. Res. Lett. 31, 202–210 (2003)
Dean, B.C., Goemans, M.X., Vondrdk, J.: Approximating the stochastic knapsack problem: the benefit of adaptivity. In: Proceedings of the 45th Annual IEEE Symposium on Foundations of Computer Science, pp. 208–217. IEEE (2004)
Lueker, G.S.: Average-case analysis of off-line and on-line knapsack problems. In: Proceedings of the Sixth Annual ACM-SIAM Symposium on Discrete Algorithms, pp. 179–188. Society for Industrial and Applied Mathematics (1995)
Beier, R., Vöcking, B.: Random knapsack in expected polynomial time. In: Proceedings of the Thirty-Fifth Annual ACM Symposium on Theory of Computing, pp. 232–241. ACM (2003)
Acknowledgments
The authors would like to thank the two referees for the helpful comments. Research supported by NSF Grant 1029603 and ONR Grant N00014-12-1-0033; the first author is supported in part by a NSF graduate research fellowship.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Anita Schöbel.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Mastin, A., Jaillet, P. Average-Case Performance of Rollout Algorithms for Knapsack Problems. J Optim Theory Appl 165, 964–984 (2015). https://doi.org/10.1007/s10957-014-0603-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10957-014-0603-x