Optimal greedy policies for stochastic control models

Liebig, Thilo; Rieder, Ulrich

doi:10.1007/BF01246332

Published: February 1996

Volume 44, pages 115–133 (1996)

Mathematical Methods of Operations Research

Abstract

We introduce the notion of a greedy policy for general stochastic control models. Sufficient conditions for the optimality of the greedy policy for finite and infinite horizon are given. Moreover, we derive error bounds if the greedy policy is not optimal. The main results are illustrated by Bayesian information models, discounted Bayesian search problems, stochastic scheduling problems, single-server queueing networks and deterministic dynamic programs.
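
The preview does not reproduce the paper's formal definition of a greedy policy, so the following is only a minimal sketch under an assumed reading: a greedy policy picks, in each state, the action with the largest immediate expected reward, and its shortfall is measured against the optimal finite-horizon value obtained by backward induction. The toy model (`P`, `R`, three states, two actions, horizon 4) and all function names are hypothetical and chosen purely for illustration; they are not taken from the article.

```python
# Hypothetical toy control model (not from the article).
# P[s][a] is a list of (next_state, probability); R[s][a] is the one-step reward.
P = {
    0: {0: [(0, 1.0)], 1: [(1, 1.0)]},
    1: {0: [(1, 1.0)], 1: [(2, 1.0)]},
    2: {0: [(2, 1.0)], 1: [(2, 1.0)]},
}
R = {
    0: {0: 1.0, 1: 0.0},
    1: {0: 1.0, 1: 0.0},
    2: {0: 5.0, 1: 5.0},
}


def expected(values, transitions):
    """Expected value of `values` under a list of (next_state, probability)."""
    return sum(prob * values[nxt] for nxt, prob in transitions)


def greedy_policy(R):
    """Assumed definition: maximize the immediate reward in every state."""
    return {s: max(R[s], key=R[s].get) for s in R}


def policy_value(policy, P, R, horizon):
    """Total expected reward of a stationary policy over `horizon` steps."""
    v = {s: 0.0 for s in P}
    for _ in range(horizon):
        v = {s: R[s][policy[s]] + expected(v, P[s][policy[s]]) for s in P}
    return v


def optimal_value(P, R, horizon):
    """Optimal total expected reward via backward induction."""
    v = {s: 0.0 for s in P}
    for _ in range(horizon):
        v = {s: max(R[s][a] + expected(v, P[s][a]) for a in P[s]) for s in P}
    return v


HORIZON = 4
greedy_v = policy_value(greedy_policy(R), P, R, HORIZON)
optimal_v = optimal_value(P, R, HORIZON)

# Per-state loss incurred by acting greedily instead of optimally.
gap = {s: optimal_v[s] - greedy_v[s] for s in P}
print(gap)
```

In this toy model the greedy policy keeps collecting the small reward in states 0 and 1 and never moves toward the high-reward state 2, so the gap is positive there. This is the kind of situation in which an error bound matters; when the gap is zero in every state, the greedy policy is optimal for the model at hand.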

Author information

Authors and Affiliations

Department of Mathematics VII, University of Ulm, 89069, Ulm, Germany
Thilo Liebig & Ulrich Rieder

About this article

Cite this article

Liebig, T., Rieder, U. Optimal greedy policies for stochastic control models. Mathematical Methods of Operations Research 44, 115–133 (1996). https://doi.org/10.1007/BF01246332

Received: 15 August 1995
Issue Date: February 1996
DOI: https://doi.org/10.1007/BF01246332