Zur Extrapolation in Markoffschen Entscheidungsmodellen mit Diskontierung

Schellhaas, H.

doi:10.1007/BF01949684

Published: July 1974

Volume 18, pages 91–104, (1974)

Zeitschrift für Operations Research

H. Schellhaas¹

Abstract (translated from the German)

A unified method is developed for Markovian decision models (discrete Markov chains, semi-Markov processes, regenerative processes) with finite state and decision spaces and discounting. From the iterates of value iteration or overrelaxation, it yields upper and lower bounds for the optimal value of the objective function. Finally, some numerical results for the resulting algorithms are given.

Summary

The paper deals with Markovian decision models (discrete Markov chains, semi-Markov processes, regenerative processes) with finite state and action spaces in the case of discounted future rewards. A unified method is derived to obtain upper and lower bounds for the optimal objective function based on iterates of value iteration or successive overrelaxation. Finally, some numerical tests for the resulting algorithms are given.
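To make the idea of bounds from value-iteration iterates concrete, here is a minimal sketch for the simplest case, a discrete Markov chain model with discount factor `beta`. It uses the classical MacQueen-type extrapolation bounds, which are an assumption on our part: the paper's own unified method is not reproduced on this page and may differ. The function names, the nested-list data layout, and the stopping rule are all hypothetical choices for illustration.

```python
def bellman_update(v, rewards, transitions, beta):
    """One value-iteration step:
    (Tv)(s) = max_a [ r(s, a) + beta * sum_t p(t | s, a) * v(t) ]."""
    new_v = []
    for s in range(len(v)):
        best = max(
            rewards[s][a]
            + beta * sum(p * v[t] for t, p in enumerate(transitions[s][a]))
            for a in range(len(rewards[s]))
        )
        new_v.append(best)
    return new_v


def value_iteration_with_bounds(rewards, transitions, beta, tol=1e-8, max_iter=10_000):
    """Iterate v_n = T v_{n-1} and return (lower, upper) bounds on the optimal value.

    Assumed (MacQueen-type) bounds, with d_n = v_n - v_{n-1} and 0 < beta < 1:
        v_n + beta/(1-beta) * min(d_n) <= v* <= v_n + beta/(1-beta) * max(d_n)
    """
    v = [0.0] * len(rewards)
    scale = beta / (1.0 - beta)
    lower, upper = list(v), list(v)
    for _ in range(max_iter):
        new_v = bellman_update(v, rewards, transitions, beta)
        diffs = [n - o for n, o in zip(new_v, v)]
        lower = [n + scale * min(diffs) for n in new_v]
        upper = [n + scale * max(diffs) for n in new_v]
        v = new_v
        # Stop once the bounds are closer together than the tolerance.
        if scale * (max(diffs) - min(diffs)) <= tol:
            break
    return lower, upper


# Hypothetical two-state, two-action example (all numbers are made up).
rewards = [[1.0, 0.0], [2.0, 0.5]]
transitions = [
    [[0.9, 0.1], [0.2, 0.8]],  # state 0: rows are actions, entries are p(t | s, a)
    [[0.5, 0.5], [0.1, 0.9]],  # state 1
]
lower, upper = value_iteration_with_bounds(rewards, transitions, beta=0.9)
```

The bounds usually close much faster than the iterates themselves converge, which is why extrapolation of this kind can shorten value iteration considerably; the midpoint of `lower` and `upper` is a natural estimate of the optimal value.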

Author information

Authors and Affiliations

Fachbereich Mathematik, Technische Hochschule, 61 Darmstadt
H. Schellhaas

About this article

Cite this article

Schellhaas, H. Zur Extrapolation in Markoffschen Entscheidungsmodellen mit Diskontierung. Zeitschrift für Operations Research 18, 91–104 (1974). https://doi.org/10.1007/BF01949684

Received: 01 August 1973
Issue Date: July 1974
DOI: https://doi.org/10.1007/BF01949684
