Abstract
This paper deals with discrete-time, infinite-horizon stochastic decision processes under various reward criteria. Sufficient conditions are obtained under which the value of a class of strategies equals the value of the subclass of non-randomized strategies within that class.
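The flavor of this result can be sketched in a hypothetical one-step decision problem (an illustration only, not taken from the paper): the expected reward of a randomized strategy is a convex combination of the pure-action rewards, so its supremum is attained by a non-randomized strategy.

```python
# Toy illustration (hypothetical example, not from the paper): in a
# one-step decision problem with two actions, any randomized strategy's
# expected reward is a convex combination of the pure-action rewards,
# so the best randomized value is achieved by a pure (non-randomized) action.

rewards = {"a": 1.0, "b": 3.0}  # hypothetical pure-action rewards

def value(p):
    """Expected reward of the randomized strategy choosing 'a' with probability p."""
    return p * rewards["a"] + (1 - p) * rewards["b"]

# Scan mixing probabilities on a grid of randomized strategies.
randomized_values = [value(p / 100) for p in range(101)]
best_randomized = max(randomized_values)
best_pure = max(rewards.values())

# The supremum over randomized strategies is attained by a pure action.
assert abs(best_randomized - best_pure) < 1e-12
```

The paper's contribution lies in identifying conditions under which this equality of values extends to infinite-horizon processes with general reward criteria, where it is far from automatic.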
Feinberg, E.A. Non-randomized strategies in stochastic decision processes. Ann Oper Res 29, 315–332 (1991). https://doi.org/10.1007/BF02283603