References
R. Bellman, Dynamic Programming [in Russian], IL, Moscow (1960).
D. Blackwell, “Discrete dynamic programming,” Ann. Math. Stat.,33, 712–726 (1962).
E. B. Dynkin and A. A. Yushkevich, Controlled Markov Processes [in Russian], Nauka, Moscow (1975).
H. Mine and S. Osaki, Markovian Decision Processes, Elsevier, New York (1970).
D. Bertsekas and S. Shreve, Stochastic Optimal Control: The Discrete Time Case, Academic Press, New York (1978).
R. A. Howard, Dynamic Programming and Markov Processes, MIT Press, Cambridge, MA (1960).
V. V. Baranov, “Optimization methods of successive approximations in Markov decision processes,” Kibernetika, No. 4, 103–111 (1985).
V. V. Baranov, “Computational methods of optimal stochastic control. Optimality principle and optimization scheme for successive approximations,” Zh. Vychisl. Mat. Mat. Fiz., No. 5, 663–680 (1991).
G. Hubner, “A unified approach to adaptive control of average reward Markov decision processes,” OR Spectrum,10, 161–166 (1988).
V. V. Baranov, “On recursive adaptive control algorithms in stochastic systems,” Kibernetika, No. 6, 88–94 (1981).
J. Kemeny and J. Snell, Finite Markov Chains, Van Nostrand, Princeton, NJ (1967).
Additional information
Translated from Kibernetika i Sistemnyi Analiz, No. 6, pp. 100–120, November–December, 1992.
Rights and permissions
About this article
Cite this article
Baranov, V.V. Sequential identification and adaptive control in stochastic systems. Cybern Syst Anal 28, 890–904 (1992). https://doi.org/10.1007/BF01291293
Received:
Issue Date:
DOI: https://doi.org/10.1007/BF01291293