Abstract
The optimal control problem is considered for a system given by the Markov chain with integral constraints. It is shown that the solution to the optimal control problem on the set of all predictable controls satisfies Markov property. This optimal Markov control can be obtained as a solution of the corresponding dual problem (in case if the regularity condition holds) or (in other case) by means of proposed regularization method. The problems arising due to the system nonregularity along with the way to cope with those problems are illustrated by an example of optimal control problem for a single channel queueing system.
Similar content being viewed by others
References
Low, S.H., Paganini, F., and Doyle, J.C., Internet Congestion Control, IEEE Control Syst. Mag., 2002, vol. 22, no. 1, pp. 28–43.
Vasenin, V.A. and Simonova, G.I., Mathematical Models of Traffic Control in Internet. New Approaches Based of TCP/AQM Schemes, Autom. Remote Control, 2005, vol. 66, no. 8, pp. 1274–1286.
Kelly, F.P., Maulloo, A., and Tan, D., Rate Control in Communication Networks. Shadow Prices, Proportional Fairness and Stability, J. Oper. Res. Soc., 1998, vol. 49, pp. 237–252.
Hordijk, A. and Spieksma, F., Constrained Admission Control to a Queueing System, Adv. Appl. Probab., 1989, vol. 21, no. 2, pp. 409–431.
Kitaev, M.Y. and Rykov, V.V., Controlled Queueing Systems, Boca Raton: CRC, 1995.
Hordijk, A. and van der Duyn Schouten, F.A., Average Optimal Policies in Markov Decision Drift Processes with Application to a Queueing and a Replacement Model, Adv. Appl. Probab., 1983, vol. 15, pp. 274–303.
Yeh, L., A Repair Replacement Model, Adv. Appl. Probab., 1990, vol. 22, no. 2, pp. 494–497.
Piunovskiy, A.B., Bicriteria Optimization of a Queue with a Controlled Input Stream, Queueing Syst., 2004, vol. 48, pp. 159–184.
Abramov, V., Optimal Control of Large Dam, J. Appl. Probab., 2007, vol. 44, pp. 249–258.
Nandalal, K.D.W. and Bogardi, J.J., Dynamic Programming Based Operation of Reservoirs. Applicability and Limits. International Hydrology Series, Cambridge: Cambridge Univ. Press, 2007.
Miller, A., Miller, B., Popov, A., and Stepanyan, K., Towards the Development of Numerical Procedure for Control of Connected Markov Chains, in Proc. 5th Australian Control Conf. (AUCC), Gold Coast, 2015, 5–6 November, pp. 336–341.
Altman, E., Constrained Markov Decision Processes, Boca Raton: Chapman, 1999.
Bertsekas, D.P., and Sreve, S.E., Stochastic Optimal Control, New York: Academic, 1978. Translated under the title Stokhasticheskoe optimal’noe upravlenie, Moscow: Nauka, 1984.
Fainberg, E.A., Nonrandomized Markov and Semi-Markov Strategies in Dynamic Programming, Theory Probab. Appl., 1982, vol. 27, no. 1, pp. 116–126.
Serfozo, R., Optimal Control of Random Walks, Birth and Death Processes, and Queues, Adv. Appl. Probab., 1981, vol. 13, no. 1, pp. 61–83.
Howard, R., Dynamic Programming and Markov Processes, New York: Wiley, 1960.
Miller, B.L., Finite State Continuous Time Markov Decision Process with a Finite Planning Horizon, SIAM J. Control, 1968, vol. 6, no. 2, pp. 266–279.
Pliska, S., Controlled Jump Processes, Stoch. Process Appl., 1975, vol. 3, pp. 259–282.
Yushkevich, A.A., Controlled Markov Models with Countable State Space and Continuous Time, Theory Probab. Appl., 1977, vol. 22, no. 2, pp. 215–235.
Elliott, R.J., Aggoun, L., and Moore, J.B., Hidden Markov Models. Estimation and Control, New York: Springer, 1995.
Liptser, R.Sh. and Shiryaev, A.N., Statistics of Random Processes, New York: Springer, 1979.
Davis, M.H.A. and Elliott, R.J., Optimal Control of Jump Process, Z. Wahrscheinlichkeitstheorie verw. Gebiete, 1977, vol. 40, pp. 183–202.
Wan, C.B. and Davis, M.H.A., Existence of Optimal Control for Stochastic Jump Processes, SIAM J. Control Optim., 1979, vol. 17, no. 4, pp. 511–524.
Tidball, M.M., Lombardi, A., Pourtaller, O., et al., Continuity of Optimal Values and Solutions for Control of Markov Chains with Constraints, SIAM J. Control Optim., 2000, vol. 38, no. 4, pp. 1204–1222.
Piunovskii, A.B., Controllable Jump Model with Discounting under Constraints, Theory Probab. Appl., 1997, vol. 42, no. 1, pp. 51–72.
Piunovskiy, A.B., Optimal Control of Random Sequences in Problems with Constraints, Dordrecht: Kluwer, 1997.
Mendiondo, M. and Stockbridge, R., Approximation of Infinite-dimensional Linear Programming Problems Which Arise in Stochastic Control, SIAM J. Control Optim., 1998, vol. 36, no. 4, pp. 1448–1472.
Miller, B.M., Miller, G.B., and Semenikhin, K.V., Methods to Design Optimal Control of Markov Process with Finite State Set in the Presence of Constraints, Autom. Remote Control, 2011, vol. 72, no. 2, pp. 323–341.
Gikhman, I.I. and Skorokhod, A.V., Controllable Random Processes, Kiev: Naukova Dumka, 1977.
Davis, M.H.A., Markov Models and Optimization, London: Chapman, 1993.
Elliott, R.J., A Partially Observed Control Problem for Markov Chains, Appl. Math. Optim., 1992, vol. 25, pp. 151–169.
Fleming, W.H. and Rishel, R.W., Deterministic and Stochastic Optimal Control, New York: Springer, 1975. Translated under the title Optimal’noe upravlenie determinirovannymi i stokhasticheskimi sistemami, Moscow: Mir, 1978.
Lee, E.B. and Marcus, L., Foundation of Optimal Control Theory, New York: Wiley, 1975.
Bremaud, P., Optimal Thinning of a Point Process, SIAM J. Control Optim., 1979, vol. 17, no. 2, pp. 222–230.
Miller, B., Optimization of Stochastic Networks via Stochastic Controls, in Proc. Int. Conf. “System Identification and Control Problems” (SICPRO’08), Moscow, 2008, January 28–31.
Miller, B., Access and Service Rate Control in Queueing Systems, in Proc. IFAC World Congr., Seoul, 2008, July 6–11.
Gonzalez-Hernandez, J. and Hernandez-Lerma, O., Extreme Points of Sets of Randomized Strategies in Constrained Optimization and Control Problems, SIAM J. Optim., 2005, vol. 15, no. 4, pp. 1085–1104.
Ioffe, A.D. and Tikhomirov, V.M., Teoriya ekstremal’nykh zadach (Theory of Extremal Problems), Moscow: Nauka, 1974.
Polyak, B.T., Vvedenie v optimizatsiyu, Moscow: Nauka, 1983. Translated into English under the title Introduction to Optimization, New York: Optimization Software, 1987.
Miller, A.B., Using Methods of Stochastic Control to Prevent Overloads in Data Transmission Networks, Autom. Remote Control, 2010, vol. 71, no. 9, pp. 1804–1815.
Miller, B.M., Myasnikov, D.V., and Semenikhin, K.V., Designing the Optimal Stochastic Control of a Queuing System with Regard for Constraints, in Proc. 37th School-Conf. of Young Researchers and Experts “Information Technologies and Systems,” Kaliningrad, 2013, September 1–6.
Myasnikov, D.V. and Semenikhin, K.V., Control of M|M|1|N Queue Parameters under Constraints, J. Comput. Syst. Sci. Int., 2016, vol. 55, no. 1, pp. 59–78.
Rockafellar, R.T., Convex Analysis, Princeton: Princeton Univ. Press, 1970. Translated under the title Vypuklyi analiz, Moscow: Mir, 1973.
Zangwill, W.I., Nonlinear Programming. A Unified Approach, Englewood Cliffs: Prentice-Hall, 1969. Translated under the title Nelineinoe programmirovanie. Edinyi podkhod, Moscow: Sovetskoe Radio, 1973.
Berkovitz, L., Convexity and Optimization in Rn, New York: Wiley, 2002.
Author information
Authors and Affiliations
Corresponding author
Additional information
Original Russian Text © B.M. Miller, G.B. Miller, K.V. Semenikhin, 2016, published in Avtomatika i Telemekhanika, 2016, No. 9, pp. 96–123.
Rights and permissions
About this article
Cite this article
Miller, B.M., Miller, G.B. & Semenikhin, K.V. Optimal control problem regularization for the Markov process with finite number of states and constraints. Autom Remote Control 77, 1589–1611 (2016). https://doi.org/10.1134/S0005117916090071
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1134/S0005117916090071