Abstract
The functional equations of infinite-horizon discounted Markov renewal programming are v* = Tv*, where T is a monotone contraction operator. This paper shows how to accelerate convergence of the value-iteration scheme v^(n+1) = Tv^(n) by a block-scaling step, whereby all states in a given group have their values v_i^(n) scaled by a common factor. A similar method exists when the relative values v*_i − v*_1 are computed iteratively. In both cases, the block-scaling factors solve a set of functional equations whose structure resembles that of a discounted Markov renewal program, and this set can itself be solved by successive approximation, policy iteration, or linear programming.
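To make the scheme concrete, here is a minimal sketch of value iteration for a single fixed policy, v ← r + βPv, with a block-scaling step in which every state in a group shares one scale factor. The factors below are chosen so that the scaled iterate satisfies the fixed-point equation summed over each group (an iterative-aggregation heuristic); this choice is an illustrative stand-in and is not the paper's functional equations for the optimal factors.

```python
import numpy as np

def block_scaled_value_iteration(P, r, beta, groups, tol=1e-10, max_iter=1000):
    """Value iteration v <- r + beta*P@v, accelerated by block scaling.

    groups: list of index lists partitioning the states; each group's
    values are multiplied by a single common factor per iteration.
    The factors solve a small G x G linear system requiring the scaled
    iterate to satisfy the fixed-point equation aggregated over each
    group (a heuristic stand-in for the paper's optimal factors).
    """
    v = np.ones(len(r))                      # positive start keeps sums nonzero
    for _ in range(max_iter):
        tv = r + beta * (P @ v)              # one sweep of the operator T
        # Aggregated quantities: S_g = sum of tv over group g,
        # R_g = sum of rewards, A[g,h] = sum_{i in g, j in h} P[i,j]*tv[j].
        S = np.array([tv[g].sum() for g in groups])
        R = np.array([r[g].sum() for g in groups])
        A = np.array([[(P[np.ix_(g, h)] @ tv[h]).sum() for h in groups]
                      for g in groups])
        # Solve (diag(S) - beta*A) c = R for the per-group factors c.
        c = np.linalg.solve(np.diag(S) - beta * A, R)
        w = tv.copy()
        for k, g in enumerate(groups):
            w[g] = c[k] * tv[g]              # common scale factor per block
        if np.max(np.abs(w - v)) < tol:
            return w
        v = w
    return v
```

Note that if the fixed point v* is reached, the factors c = 1 solve the aggregated system, so v* remains a fixed point of the scaled iteration; with singleton groups the correction step recovers v* exactly in one solve.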
Cite this article
Schweitzer, P.J. Block-scaling of value-iteration for discounted Markov renewal programming. Ann Oper Res 29, 603–630 (1991). https://doi.org/10.1007/BF02283616