Skip to main content

Part of the book series: Lecture Notes in Control and Information Sciences ((LNCIS,volume 434))

Abstract

Stochastic approximation algorithms have been one of the main focus areas of research on solution methods for stochastic optimization problems. The Robbins-Monro algorithm [17] is a basic stochastic approximation scheme that has been found to be applicable in a variety of settings that involve finding the roots of a function under noisy observations. We first review in this chapter the Robbins-Monro algorithm and its convergence. In cases where one is interested in optimizing the steady-state system performance, i.e., the objective is a long-run average cost function, multi-timescale variants of the Robbins-Monro algorithm have been found useful. We also review multi-timescale stochastic approximation in this chapter since many of the schemes presented in the later chapters shall involve such algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Benaim, M.: A dynamical systems approach to stochastic approximations. SIAM Journal on Control and Optimization 34(2), 437–472 (1996)

    Article  MathSciNet  MATH  Google Scholar 

  2. Benveniste, A., Métivier, M., Priouret, P.: Adaptive Algorithms and Stochastic Approximations. Springer, Berlin (1990)

    Book  MATH  Google Scholar 

  3. Benveniste, A., Priouret, P., Métivier, M.: Adaptive algorithms and stochastic approximations. Springer-Verlag New York, Inc. (1990)

    Google Scholar 

  4. Borkar, V.S.: Stochastic approximation with two timescales. Systems and Control Letters 29, 291–294 (1997)

    Article  MathSciNet  MATH  Google Scholar 

  5. Borkar, V.S.: Stochastic Approximation: A Dynamical Systems Viewpoint. Cambridge University Press and Hindustan Book Agency (Jointly Published), Cambridge and New Delhi (2008)

    Google Scholar 

  6. Borkar, V.S., Meyn, S.P.: The O.D.E. method for convergence of stochastic approximation and reinforcement learning. SIAM Journal of Control and Optimization 38(2), 447–469 (2000)

    Article  MathSciNet  MATH  Google Scholar 

  7. Chen, H.: Stochastic approximation and its applications, vol. 64. Kluwer Academic Pub. (2002)

    Google Scholar 

  8. Chen, H.F., Duncan, T.E., Pasik-Duncan, B.: A Kiefer-Wolfowitz algorithm with randomized differences. IEEE Trans. Auto. Cont. 44(3), 442–453 (1999)

    Article  MathSciNet  MATH  Google Scholar 

  9. Dai, J.G.: On positive Harris recurrence for multiclass queueing networks: A unified approach via fluid limit models. Annals of Applied Probability 5, 49–77 (1995)

    Article  MathSciNet  MATH  Google Scholar 

  10. Dai, J.G., Meyn, S.P.: Stability and convergence of moments for multiclass queueing networks via fluid limit models. IEEE Transactions on Automatic Control 40, 1889–1904 (1995)

    Article  MathSciNet  MATH  Google Scholar 

  11. Duflo, M.: Random iterative models, vol. 34. Springer (1997)

    Google Scholar 

  12. Kushner, H.J., Clark, D.S.: Stochastic Approximation Methods for Constrained and Unconstrained Systems. Springer, New York (1978)

    Book  Google Scholar 

  13. Kushner, H.J., Yin, G.G.: Stochastic Approximation Algorithms and Applications. Springer, New York (1997)

    MATH  Google Scholar 

  14. Ljung, L.: Analysis of recursive stochastic algorithms. IEEE Transactions on Automatic Control AC-22, 551–575 (1977)

    Article  MathSciNet  Google Scholar 

  15. Polyak, B.T., Juditsky, A.B.: Acceleration of stochastic approximation by averaging. SIAM J. Control and Optim. 30(4), 838–855 (1992)

    Article  MathSciNet  MATH  Google Scholar 

  16. Renotte, C., Wouwer, A.V., Remy, M.: Neural modeling and control of a heat exchanger based on SPSA techniques. In: Proceedings of the American Control Conference, Chicago, IL, pp. 3299–3303 (2000)

    Google Scholar 

  17. Robbins, H., Monro, S.: A stochastic approximation method. Ann. Math. Statist. 22, 400–407 (1951)

    Article  MathSciNet  MATH  Google Scholar 

  18. Spall, J.C., Cristion, J.A.: Nonlinear adaptive control using neural networks: estimation with a smoothed form of simultaneous perturbation gradient approximation. Statistica Sinica 4, 1–27 (1994)

    MathSciNet  MATH  Google Scholar 

  19. Wouwer, A.V., Renotte, C., Remy, M.: Application of stochastic approximation techniques in neural modelling and control. International Journal of Systems Science 34, 851–863 (2003)

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to S. Bhatnagar .

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag London

About this chapter

Cite this chapter

Bhatnagar, S., Prasad, H., Prashanth, L. (2013). Stochastic Approximation Algorithms. In: Stochastic Recursive Algorithms for Optimization. Lecture Notes in Control and Information Sciences, vol 434. Springer, London. https://doi.org/10.1007/978-1-4471-4285-0_3

Download citation

  • DOI: https://doi.org/10.1007/978-1-4471-4285-0_3

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-4471-4284-3

  • Online ISBN: 978-1-4471-4285-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics