Stochastic Approximation Algorithms

Bhatnagar, S.; Prasad, H.; Prashanth, L.

doi:10.1007/978-1-4471-4285-0_3

S. Bhatnagar⁴,
H. Prasad⁴ &
L. Prashanth⁴

Part of the book series: Lecture Notes in Control and Information Sciences ((LNCIS,volume 434))

3026 Accesses
4 Citations

Abstract

Stochastic approximation algorithms have been one of the main focus areas of research on solution methods for stochastic optimization problems. The Robbins-Monro algorithm [17] is a basic stochastic approximation scheme that has been found to be applicable in a variety of settings that involve finding the roots of a function under noisy observations. We first review in this chapter the Robbins-Monro algorithm and its convergence. In cases where one is interested in optimizing the steady-state system performance, i.e., the objective is a long-run average cost function, multi-timescale variants of the Robbins-Monro algorithm have been found useful. We also review multi-timescale stochastic approximation in this chapter since many of the schemes presented in the later chapters shall involve such algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Benaim, M.: A dynamical systems approach to stochastic approximations. SIAM Journal on Control and Optimization 34(2), 437–472 (1996)
Article MathSciNet MATH Google Scholar
Benveniste, A., Métivier, M., Priouret, P.: Adaptive Algorithms and Stochastic Approximations. Springer, Berlin (1990)
Book MATH Google Scholar
Benveniste, A., Priouret, P., Métivier, M.: Adaptive algorithms and stochastic approximations. Springer-Verlag New York, Inc. (1990)
Google Scholar
Borkar, V.S.: Stochastic approximation with two timescales. Systems and Control Letters 29, 291–294 (1997)
Article MathSciNet MATH Google Scholar
Borkar, V.S.: Stochastic Approximation: A Dynamical Systems Viewpoint. Cambridge University Press and Hindustan Book Agency (Jointly Published), Cambridge and New Delhi (2008)
Google Scholar
Borkar, V.S., Meyn, S.P.: The O.D.E. method for convergence of stochastic approximation and reinforcement learning. SIAM Journal of Control and Optimization 38(2), 447–469 (2000)
Article MathSciNet MATH Google Scholar
Chen, H.: Stochastic approximation and its applications, vol. 64. Kluwer Academic Pub. (2002)
Google Scholar
Chen, H.F., Duncan, T.E., Pasik-Duncan, B.: A Kiefer-Wolfowitz algorithm with randomized differences. IEEE Trans. Auto. Cont. 44(3), 442–453 (1999)
Article MathSciNet MATH Google Scholar
Dai, J.G.: On positive Harris recurrence for multiclass queueing networks: A unified approach via fluid limit models. Annals of Applied Probability 5, 49–77 (1995)
Article MathSciNet MATH Google Scholar
Dai, J.G., Meyn, S.P.: Stability and convergence of moments for multiclass queueing networks via fluid limit models. IEEE Transactions on Automatic Control 40, 1889–1904 (1995)
Article MathSciNet MATH Google Scholar
Duflo, M.: Random iterative models, vol. 34. Springer (1997)
Google Scholar
Kushner, H.J., Clark, D.S.: Stochastic Approximation Methods for Constrained and Unconstrained Systems. Springer, New York (1978)
Book Google Scholar
Kushner, H.J., Yin, G.G.: Stochastic Approximation Algorithms and Applications. Springer, New York (1997)
MATH Google Scholar
Ljung, L.: Analysis of recursive stochastic algorithms. IEEE Transactions on Automatic Control AC-22, 551–575 (1977)
Article MathSciNet Google Scholar
Polyak, B.T., Juditsky, A.B.: Acceleration of stochastic approximation by averaging. SIAM J. Control and Optim. 30(4), 838–855 (1992)
Article MathSciNet MATH Google Scholar
Renotte, C., Wouwer, A.V., Remy, M.: Neural modeling and control of a heat exchanger based on SPSA techniques. In: Proceedings of the American Control Conference, Chicago, IL, pp. 3299–3303 (2000)
Google Scholar
Robbins, H., Monro, S.: A stochastic approximation method. Ann. Math. Statist. 22, 400–407 (1951)
Article MathSciNet MATH Google Scholar
Spall, J.C., Cristion, J.A.: Nonlinear adaptive control using neural networks: estimation with a smoothed form of simultaneous perturbation gradient approximation. Statistica Sinica 4, 1–27 (1994)
MathSciNet MATH Google Scholar
Wouwer, A.V., Renotte, C., Remy, M.: Application of stochastic approximation techniques in neural modelling and control. International Journal of Systems Science 34, 851–863 (2003)
Article MATH Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Automation, Indian Institute of Science, 560012, Bangalore, India
S. Bhatnagar, H. Prasad & L. Prashanth

Authors

S. Bhatnagar
View author publications
You can also search for this author in PubMed Google Scholar
H. Prasad
View author publications
You can also search for this author in PubMed Google Scholar
L. Prashanth
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to S. Bhatnagar .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Bhatnagar, S., Prasad, H., Prashanth, L. (2013). Stochastic Approximation Algorithms. In: Stochastic Recursive Algorithms for Optimization. Lecture Notes in Control and Information Sciences, vol 434. Springer, London. https://doi.org/10.1007/978-1-4471-4285-0_3

Download citation

DOI: https://doi.org/10.1007/978-1-4471-4285-0_3
Publisher Name: Springer, London
Print ISBN: 978-1-4471-4284-3
Online ISBN: 978-1-4471-4285-0
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics