Abstract
The self-tuning scheme for the adaptive control of a diffusion process is studied with long-run average cost criterion and maximum likelihood estimation of parameters. Asymptotic optimality under a suitable identifiability condition is established under two alternative sets of hypotheses—a Lyapunov-type stability criterion and a condition on cost which penalizes instability.
Similar content being viewed by others
References
D. G. Aronson, Bounds for the fundamental solution of a parabolic equation, Bull. Amer. Math. Soc. 73 (1967), pp. 890–896.
K. Aström, B. Wittenmark, On self-tuning regulators, Automatica 9 (1973), pp. 185–199.
V. E. Beneš, Existence of optimal strategies based on specified information, for a class of stochastic decision problems, SIAM J. Control 8 (1970), pp. 179–188.
A. Bensoussan, Stochastic Control of Functional Analysis Methods, North-Holland, Amsterdam, 1982.
R. N. Bhattacharya, Asymptotic behaviour of several dimensional diffusions, in Stochastic Nonlinear Systems, L. Arnold and R. Lefever, eds., Springer-Verlag, New York, 1981, pp. 86–99.
V. S. Borkar, A topology for Markov controls, Appl. Math. Optim. 20 (1989), pp. 55–62.
V. S. Borkar, A. Bagchi, Parameter estimation in continuous-time stochastic processes, Stochastics 8 (1982), pp. 193–212.
V. S. Borkar, M. K. Ghosh, Ergodic control of multidimensional diffusions, I: the existence results, SIAM J. Control Optim. 26 (1988), pp. 112–126.
V. S. Borkar, P. Varaiya, Identification and adaptive control of Markov chains, SIAM J. Control Optim. 20 (1982), pp. 470–489.
R. M. Cox, Stationary and discounted control of diffusion processes, Ph.D. Thesis, Columbia University, 1984.
W. H. Fleming, Generalized solutions in stochastic control, in Differential Games and Control Theory III, E. Roxin, P. T. Liu and R. L. Sternberg, eds., Marcel Dekker, New York, 1977, pp. 147–165.
D. Gilbarg, N. S. Trudinger, Elliptic Partial Differential Equations of Second Order, 2nd edn., Springer-Verlag, New York, 1983.
P. Grisvard, Elliptic Problems in Non-Smooth Domains, Pitman, Boston, 1965.
N. Ikeda, S. Watanabe, Stochastic Differential Equations and Diffusion Processes, North-Holland Kodansha, Amsterdam, 1981.
R. Z. Khas'Minskii, Ergodic properties of recurrent diffusion processes and stabilization of the solution to the Cauchy problem of parabolic equations, Theory Probab. Appl. 2 (1960), pp. 179–196.
N. V. Krylov, Controlled Diffusion Processes, Springer-Verlag, New York, 1980.
P. R. Kumar, Survey of results in stochastic adaptive control, SIAM J. Control Optim. 23 (1985), pp. 329–380.
P. R. Kumar, W. Lin, Optimal adaptive controllers for unknown Markov chains, IEEE Trans. Automat. Control (1982), pp. 765–774.
H. J. Kushner, Existence results for optimal stochastic control, J. Optim. Theory Appl. 15 (1975), pp. 347–359.
O. A. Ladyzhenskaya, N. N. Ural'Tseva, Linear and Quasilinear Elliptic Equations, Academic Press, New York, 1968.
P. L. Lions, On the Hamilton-Jacobi-Bellman equations, Acta Appl. Math. 1 (1983), pp. 17–41.
R. S. Lipster, A. N. Shiryayev, Statistics of Random Processes I, Springer-Verlag, New York, 1977.
P. Mandl, Estimation and control of Markov chains, Adv. in Appl. Probab. 6 (1974), pp. 40–60.
P. Mandl, Self-optimizing control of Markov processes and Markov potential theory, Proceedings of ICM, Warsaw, 1983, vol. 2, pp. 1097–1105.
M. Schäl, Estimation and control in discounted dynamic programming, Stochastics (1987), pp. 51–71.
A. Ju. Veretennikov, On strong solutions and explicit formulas for solutions of stochastic integral equations, Math. USSR-Sb. 39 (1981), pp. 387–403.
Author information
Authors and Affiliations
Additional information
Communicated by S. K. Mitter
Rights and permissions
About this article
Cite this article
Borkar, V.S., Ghosh, M.K. Ergodic control of multidimensional diffusions, II: Adaptive control. Appl Math Optim 21, 191–220 (1990). https://doi.org/10.1007/BF01445163
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/BF01445163