Markov renewal decision processes with finite horizon

Schellhaas, H.

doi:10.1007/BF01720156

Markov renewal decision processes with finite horizon

Theoretical Papers
Published: 03 May 2005

Volume 2, pages 33–40, (1980)
Cite this article

Operations-Research-Spektrum Aims and scope Submit manuscript

H. Schellhaas¹

62 Accesses
8 Citations
Explore all metrics

Summary

We investigate Markov renewal decision processes with finite horizon, countable state space, general action space and unbounded rewards. Under rather weak restrictions we derive the optimality equation and state conditions ensuring the convergence of successive approximations and the existence of optimal stationary policies. Strengthening the conditions we prove uniqueness of the solution of the optimality equation. Finally we discuss some numerical aspects including extrapolations using an equivalent optimality equation.

Zusammenfassung

Wir untersuchen Semi-Markoffsche Entscheidungsprozesse mit endlichem Horizont, abzählbarem Zustandsraum, allgemeinem Aktionenraum und unbeschränkten Erträgen. Unter schwachen Voraussetzungen leiten wir die Optimalitätsgleichung her und geben hinreichende Bedingungen für die Konvergenz der sukzessiven Approximation und die Existenz optimaler stationärer Politiken. Unter schärferen Voraussetzungen zeigen wir die Eindeutigkeit der Lösung der Optimalitätsgleichung. Schließlich diskutieren wir einige numerische Aspekte einschließlich einer Extrapolation basierend auf einer äquivalenten Optimalitätsgleichung.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Feller W (1971) An introduction to probability theory and its applications, vol. 2, 2nd ed. J Wiley, New York
Google Scholar
Hinderer K (1971) Instationäre dynamische Optimierung bei schwachen Voraussetzungen über die Gewinnfunktionen. Abh Math Sem Univ Hamburg 36:208–223
Article Google Scholar
Hinderer K (1970) Foundations of non-stationary dynamic programming with discrete time parameter. Springer, Berlin Heidelberg New York
Book Google Scholar
Hinderer K (1978) On approximate solutions of finite-stage dynamic programs. In: Puterman ML (ed) Dynamic Programming and its Applications, Proc. Internat. Conference on Dynamic Programming, Vancouver 1977. Academic Press, New York, pp 289–317
Chapter Google Scholar
Hinderer K, Hübner G (1977) On exact and approximate solutions of unstructured finite-stage dynamic programs. Proc. Advanced Seminar on Markov Decision Theory, Amsterdam 1976. Math Centre Tracts 93:57–76
Google Scholar
Jewell WS (1963) Markov-renewal programming. I: Formulation, finite return models. II: Infinite return models, example. Oper Res 11:938–971
Article Google Scholar
Lembersky MR (1974) On maximal rewards andε-optimal policies in continuous time Markov decision chains. Ann Statist 2:159–169
Article Google Scholar
Lembersky MR (1974) Preferred rules in continuous time Markov decision processes. Manage Sci 21:348–357
Article Google Scholar
Porteus E (1975) Bounds and transformations for discounted finite Markov decision chains. Oper Res 23:761–784
Article Google Scholar
Rieder U (1976) On dynamic programming with unbounded reward functions. Report, Inst. f. Math. Stochastik, University of Hamburg
Schäl M (1975) Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal. Z Wahrsch Verw. Gebiete 32:179–196
Article Google Scholar
Schellhaas H (1974) Zur Extrapolation in Markoffschen Entscheidungsmodellen mit Diskontierung. Z Oper Res 18:91–104
Google Scholar
Schellhaas H (1979) Über Semi-Markoffsche Entscheidungsprozesse mit endlichem Horizont. Proc. in Operat. Res. Vol. 8. Gaede KW et al. (eds) Physica-Verlag, Würzburg Wien, pp 122–129
Google Scholar
Stidham S On the convergence of successive approximations in dynamic programming with non-zero terminal reward. NCSU Technical report No. 78-9
Waldman K-H (1978) A natural extension of the MacQueen extrapolation. Preprint Nr. 436, Fachbereich Mathematik, Technische Hochschule Darmstadt

Download references

Author information

Authors and Affiliations

Fachbereich Mathematik, Technische Hochschule Darmstadt, Schloßgartenstraße 7, D-6100, Darmstadt, Germany
H. Schellhaas

Authors

H. Schellhaas
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Schellhaas, H. Markov renewal decision processes with finite horizon. OR Spektrum 2, 33–40 (1980). https://doi.org/10.1007/BF01720156

Download citation

Received: 10 November 1979
Accepted: 16 May 1980
Published: 03 May 2005
Issue Date: March 1980
DOI: https://doi.org/10.1007/BF01720156

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Markov renewal decision processes with finite horizon

Summary

Zusammenfassung

Access this article

Similar content being viewed by others

Risk-sensitive continuous-time Markov decision processes with unbounded rates and Borel spaces

Finite horizon risk-sensitive continuous-time Markov decision processes with unbounded transition and cost rates

Finite horizon continuous-time Markov decision processes with mean and variance criteria

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Markov renewal decision processes with finite horizon

Summary

Zusammenfassung

Access this article

Similar content being viewed by others

Risk-sensitive continuous-time Markov decision processes with unbounded rates and Borel spaces

Finite horizon risk-sensitive continuous-time Markov decision processes with unbounded transition and cost rates

Finite horizon continuous-time Markov decision processes with mean and variance criteria

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation