Abstract
We introduce the notion of a greedy policy for general stochastic control models. Sufficient conditions for the optimality of the greedy policy for finite and infinite horizon are given. Moreover, we derive error bounds if the greedy policy is not optimal. The main results are illustrated by Bayesian information models, discounted Bayesian search problems, stochastic scheduling problems, single-server queueing networks and deterministic dynamic programs.
Similar content being viewed by others
References
Friis S-H, Rieder U, Weishaupt J (1993) Optimal control of single-server queueing networks. ZOR-Mathem Meth Oper Res 37:187–205
Glazebrook KD (1987a) Sensitivity analysis for stochastic scheduling problems. Math Oper Res 12:205–223
Glazebrook KD (1987b) Evaluating the effects of machine breakdowns in stochastic scheduling problems. Naval Research Logistics 34:319–335
Gittins JC (1989) Multi-armed bandit allocation indices. Wiley, Chichester
Lehnerdt M (1982) On the structure of discrete sequential search problems and of their solutions. Optimization 13:523–557
Liebig T (1995) Strukturuntersuchungen in Bayesschen Suchproblemen. Dissertation, Universität Ulm
Liebig T (1996) Discounted Bayesian search problems with unknown detection probabilities. Mathem Meth Oper Res 44
Rieder U (1988) Bayessche Kontrollmodelle. Skript, Universität Ulm
Weishaupt J (1994) Optimal myopic policies and index policies. ZOR-Mathem Meth Oper Res 40:75–89
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Liebig, T., Rieder, U. Optimal greedy policies for stochastic control models. Mathematical Methods of Operations Research 44, 115–133 (1996). https://doi.org/10.1007/BF01246332
Received:
Issue Date:
DOI: https://doi.org/10.1007/BF01246332