Optimal control problem regularization for the Markov process with finite number of states and constraints

Miller, B. M.; Miller, G. B.; Semenikhin, K. V.

doi:10.1134/S0005117916090071

Optimal control problem regularization for the Markov process with finite number of states and constraints

Stochastic Systems, Queueing Systems
Published: 10 September 2016

Volume 77, pages 1589–1611, (2016)
Cite this article

Automation and Remote Control Aims and scope Submit manuscript

B. M. Miller¹,
G. B. Miller² &
K. V. Semenikhin³

66 Accesses
8 Citations
Explore all metrics

Abstract

The optimal control problem is considered for a system given by the Markov chain with integral constraints. It is shown that the solution to the optimal control problem on the set of all predictable controls satisfies Markov property. This optimal Markov control can be obtained as a solution of the corresponding dual problem (in case if the regularity condition holds) or (in other case) by means of proposed regularization method. The problems arising due to the system nonregularity along with the way to cope with those problems are illustrated by an example of optimal control problem for a single channel queueing system.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Integro-differential optimality equations for the risk-sensitive control of piecewise deterministic Markov processes

Article 07 January 2021

Stochastic Control of a Class of Dynamical Systems via Path Limits

Article 14 July 2021

Continuous-Time Controlled Jump Markov Processes on the Finite Horizon

References

Low, S.H., Paganini, F., and Doyle, J.C., Internet Congestion Control, IEEE Control Syst. Mag., 2002, vol. 22, no. 1, pp. 28–43.
Article MATH Google Scholar
Vasenin, V.A. and Simonova, G.I., Mathematical Models of Traffic Control in Internet. New Approaches Based of TCP/AQM Schemes, Autom. Remote Control, 2005, vol. 66, no. 8, pp. 1274–1286.
Article MathSciNet MATH Google Scholar
Kelly, F.P., Maulloo, A., and Tan, D., Rate Control in Communication Networks. Shadow Prices, Proportional Fairness and Stability, J. Oper. Res. Soc., 1998, vol. 49, pp. 237–252.
Article MATH Google Scholar
Hordijk, A. and Spieksma, F., Constrained Admission Control to a Queueing System, Adv. Appl. Probab., 1989, vol. 21, no. 2, pp. 409–431.
Article MathSciNet MATH Google Scholar
Kitaev, M.Y. and Rykov, V.V., Controlled Queueing Systems, Boca Raton: CRC, 1995.
MATH Google Scholar
Hordijk, A. and van der Duyn Schouten, F.A., Average Optimal Policies in Markov Decision Drift Processes with Application to a Queueing and a Replacement Model, Adv. Appl. Probab., 1983, vol. 15, pp. 274–303.
Article MATH Google Scholar
Yeh, L., A Repair Replacement Model, Adv. Appl. Probab., 1990, vol. 22, no. 2, pp. 494–497.
Article MathSciNet MATH Google Scholar
Piunovskiy, A.B., Bicriteria Optimization of a Queue with a Controlled Input Stream, Queueing Syst., 2004, vol. 48, pp. 159–184.
Article MathSciNet MATH Google Scholar
Abramov, V., Optimal Control of Large Dam, J. Appl. Probab., 2007, vol. 44, pp. 249–258.
Article MathSciNet MATH Google Scholar
Nandalal, K.D.W. and Bogardi, J.J., Dynamic Programming Based Operation of Reservoirs. Applicability and Limits. International Hydrology Series, Cambridge: Cambridge Univ. Press, 2007.
Google Scholar
Miller, A., Miller, B., Popov, A., and Stepanyan, K., Towards the Development of Numerical Procedure for Control of Connected Markov Chains, in Proc. 5th Australian Control Conf. (AUCC), Gold Coast, 2015, 5–6 November, pp. 336–341.
Google Scholar
Altman, E., Constrained Markov Decision Processes, Boca Raton: Chapman, 1999.
MATH Google Scholar
Bertsekas, D.P., and Sreve, S.E., Stochastic Optimal Control, New York: Academic, 1978. Translated under the title Stokhasticheskoe optimal’noe upravlenie, Moscow: Nauka, 1984.
Google Scholar
Fainberg, E.A., Nonrandomized Markov and Semi-Markov Strategies in Dynamic Programming, Theory Probab. Appl., 1982, vol. 27, no. 1, pp. 116–126.
Article MathSciNet Google Scholar
Serfozo, R., Optimal Control of Random Walks, Birth and Death Processes, and Queues, Adv. Appl. Probab., 1981, vol. 13, no. 1, pp. 61–83.
Article MathSciNet MATH Google Scholar
Howard, R., Dynamic Programming and Markov Processes, New York: Wiley, 1960.
MATH Google Scholar
Miller, B.L., Finite State Continuous Time Markov Decision Process with a Finite Planning Horizon, SIAM J. Control, 1968, vol. 6, no. 2, pp. 266–279.
Article MathSciNet MATH Google Scholar
Pliska, S., Controlled Jump Processes, Stoch. Process Appl., 1975, vol. 3, pp. 259–282.
Article MathSciNet MATH Google Scholar
Yushkevich, A.A., Controlled Markov Models with Countable State Space and Continuous Time, Theory Probab. Appl., 1977, vol. 22, no. 2, pp. 215–235.
Article MathSciNet MATH Google Scholar
Elliott, R.J., Aggoun, L., and Moore, J.B., Hidden Markov Models. Estimation and Control, New York: Springer, 1995.
MATH Google Scholar
Liptser, R.Sh. and Shiryaev, A.N., Statistics of Random Processes, New York: Springer, 1979.
MATH Google Scholar
Davis, M.H.A. and Elliott, R.J., Optimal Control of Jump Process, Z. Wahrscheinlichkeitstheorie verw. Gebiete, 1977, vol. 40, pp. 183–202.
Article MathSciNet MATH Google Scholar
Wan, C.B. and Davis, M.H.A., Existence of Optimal Control for Stochastic Jump Processes, SIAM J. Control Optim., 1979, vol. 17, no. 4, pp. 511–524.
Article MathSciNet MATH Google Scholar
Tidball, M.M., Lombardi, A., Pourtaller, O., et al., Continuity of Optimal Values and Solutions for Control of Markov Chains with Constraints, SIAM J. Control Optim., 2000, vol. 38, no. 4, pp. 1204–1222.
Article MathSciNet MATH Google Scholar
Piunovskii, A.B., Controllable Jump Model with Discounting under Constraints, Theory Probab. Appl., 1997, vol. 42, no. 1, pp. 51–72.
Article MathSciNet Google Scholar
Piunovskiy, A.B., Optimal Control of Random Sequences in Problems with Constraints, Dordrecht: Kluwer, 1997.
Book MATH Google Scholar
Mendiondo, M. and Stockbridge, R., Approximation of Infinite-dimensional Linear Programming Problems Which Arise in Stochastic Control, SIAM J. Control Optim., 1998, vol. 36, no. 4, pp. 1448–1472.
Article MathSciNet MATH Google Scholar
Miller, B.M., Miller, G.B., and Semenikhin, K.V., Methods to Design Optimal Control of Markov Process with Finite State Set in the Presence of Constraints, Autom. Remote Control, 2011, vol. 72, no. 2, pp. 323–341.
Article MathSciNet MATH Google Scholar
Gikhman, I.I. and Skorokhod, A.V., Controllable Random Processes, Kiev: Naukova Dumka, 1977.
MATH Google Scholar
Davis, M.H.A., Markov Models and Optimization, London: Chapman, 1993.
Book MATH Google Scholar
Elliott, R.J., A Partially Observed Control Problem for Markov Chains, Appl. Math. Optim., 1992, vol. 25, pp. 151–169.
Article MathSciNet MATH Google Scholar
Fleming, W.H. and Rishel, R.W., Deterministic and Stochastic Optimal Control, New York: Springer, 1975. Translated under the title Optimal’noe upravlenie determinirovannymi i stokhasticheskimi sistemami, Moscow: Mir, 1978.
Book MATH Google Scholar
Lee, E.B. and Marcus, L., Foundation of Optimal Control Theory, New York: Wiley, 1975.
Google Scholar
Bremaud, P., Optimal Thinning of a Point Process, SIAM J. Control Optim., 1979, vol. 17, no. 2, pp. 222–230.
Article MathSciNet MATH Google Scholar
Miller, B., Optimization of Stochastic Networks via Stochastic Controls, in Proc. Int. Conf. “System Identification and Control Problems” (SICPRO’08), Moscow, 2008, January 28–31.
Google Scholar
Miller, B., Access and Service Rate Control in Queueing Systems, in Proc. IFAC World Congr., Seoul, 2008, July 6–11.
Google Scholar
Gonzalez-Hernandez, J. and Hernandez-Lerma, O., Extreme Points of Sets of Randomized Strategies in Constrained Optimization and Control Problems, SIAM J. Optim., 2005, vol. 15, no. 4, pp. 1085–1104.
Article MathSciNet MATH Google Scholar
Ioffe, A.D. and Tikhomirov, V.M., Teoriya ekstremal’nykh zadach (Theory of Extremal Problems), Moscow: Nauka, 1974.
Google Scholar
Polyak, B.T., Vvedenie v optimizatsiyu, Moscow: Nauka, 1983. Translated into English under the title Introduction to Optimization, New York: Optimization Software, 1987.
MATH Google Scholar
Miller, A.B., Using Methods of Stochastic Control to Prevent Overloads in Data Transmission Networks, Autom. Remote Control, 2010, vol. 71, no. 9, pp. 1804–1815.
Article MathSciNet MATH Google Scholar
Miller, B.M., Myasnikov, D.V., and Semenikhin, K.V., Designing the Optimal Stochastic Control of a Queuing System with Regard for Constraints, in Proc. 37th School-Conf. of Young Researchers and Experts “Information Technologies and Systems,” Kaliningrad, 2013, September 1–6.
Google Scholar
Myasnikov, D.V. and Semenikhin, K.V., Control of M|M|1|N Queue Parameters under Constraints, J. Comput. Syst. Sci. Int., 2016, vol. 55, no. 1, pp. 59–78.
Article Google Scholar
Rockafellar, R.T., Convex Analysis, Princeton: Princeton Univ. Press, 1970. Translated under the title Vypuklyi analiz, Moscow: Mir, 1973.
MATH Google Scholar
Zangwill, W.I., Nonlinear Programming. A Unified Approach, Englewood Cliffs: Prentice-Hall, 1969. Translated under the title Nelineinoe programmirovanie. Edinyi podkhod, Moscow: Sovetskoe Radio, 1973.
MATH Google Scholar
Berkovitz, L., Convexity and Optimization in Rⁿ, New York: Wiley, 2002.
Book MATH Google Scholar

Download references

Author information

Authors and Affiliations

Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
B. M. Miller
Institute of Informatics Problems, Federal Research Center “Computer Science and Control” of Russian Academy of Sciences, Moscow, Russia
G. B. Miller
Moscow State Aviation Institute, Moscow, Russia
K. V. Semenikhin

Authors

B. M. Miller
View author publications
You can also search for this author in PubMed Google Scholar
G. B. Miller
View author publications
You can also search for this author in PubMed Google Scholar
K. V. Semenikhin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to B. M. Miller.

Additional information

Original Russian Text © B.M. Miller, G.B. Miller, K.V. Semenikhin, 2016, published in Avtomatika i Telemekhanika, 2016, No. 9, pp. 96–123.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Miller, B.M., Miller, G.B. & Semenikhin, K.V. Optimal control problem regularization for the Markov process with finite number of states and constraints. Autom Remote Control 77, 1589–1611 (2016). https://doi.org/10.1134/S0005117916090071

Download citation

Received: 26 April 2015
Published: 10 September 2016
Issue Date: September 2016
DOI: https://doi.org/10.1134/S0005117916090071

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Optimal control problem regularization for the Markov process with finite number of states and constraints

Abstract

Access this article

Similar content being viewed by others

Integro-differential optimality equations for the risk-sensitive control of piecewise deterministic Markov processes

Stochastic Control of a Class of Dynamical Systems via Path Limits

Continuous-Time Controlled Jump Markov Processes on the Finite Horizon

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Navigation

Optimal control problem regularization for the Markov process with finite number of states and constraints

Abstract

Access this article

Similar content being viewed by others

Integro-differential optimality equations for the risk-sensitive control of piecewise deterministic Markov processes

Stochastic Control of a Class of Dynamical Systems via Path Limits

Continuous-Time Controlled Jump Markov Processes on the Finite Horizon

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation