Abstract
We examine a finite-horizon Markov decision process that admits an unbounded Bellman function. The optimality equation is analyzed, and necessary and sufficient conditions for optimal Markov A-strategies are obtained. Optimal synthesis under an entropy criterion is also considered.
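For a finite-horizon process, the optimality (Bellman) equation is solved by backward induction from the terminal stage. The sketch below is a minimal illustration on a toy finite MDP; all states, actions, rewards, and transition probabilities are hypothetical, and the bounded toy setting deliberately does not capture the unbounded Bellman functions studied in the paper.

```python
def backward_induction(P, r, T):
    """Finite-horizon backward induction on a toy finite MDP (illustrative only).

    P[a][s][s2] -- probability of moving from state s to s2 under action a
    r[a][s]     -- one-step reward for action a in state s
    T           -- horizon (number of decision stages)
    Returns the stage-0 value function and a stage-dependent Markov policy.
    """
    n = len(r[0])                 # number of states
    A = len(r)                    # number of actions
    V = [0.0] * n                 # terminal condition: V_T == 0
    policy = []                   # policy[t][s] = optimal action at stage t

    for _ in range(T):
        # One-step lookahead: q[s][a] = r(a, s) + E[V(next state)]
        q = [[r[a][s] + sum(P[a][s][s2] * V[s2] for s2 in range(n))
              for a in range(A)] for s in range(n)]
        pi = [max(range(A), key=lambda a: q[s][a]) for s in range(n)]
        V = [q[s][pi[s]] for s in range(n)]
        policy.insert(0, pi)      # earlier stages go to the front
    return V, policy


if __name__ == "__main__":
    # Hypothetical 2-state, 2-action example: action 0 stays put, action 1 switches states.
    P = [[[1, 0], [0, 1]],        # action 0: stay
         [[0, 1], [1, 0]]]        # action 1: switch
    r = [[0, 2], [1, 0]]          # staying in state 1 pays 2; switching from state 0 pays 1
    V, pol = backward_induction(P, r, 2)
    print(V, pol)                 # -> [3.0, 4.0] [[1, 0], [1, 0]]
```

The recursion runs the horizon backwards: at each stage the maximizing action defines the Markov strategy, and the resulting value function feeds the preceding stage.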
Additional information
Translated from Kibernetika, No. 3, pp. 82–90, May–June, 1991.
Cite this article
Piunovskii, A.B., Khametov, V.M. New exactly solvable examples for controlled discrete-time Markov chains. Cybern Syst Anal 27, 420–433 (1991). https://doi.org/10.1007/BF01068323