Skip to main content

Stochastic Approximation with Averaging and Feedback: Faster Convergence

  • Conference paper

Part of the book series: The IMA Volumes in Mathematics and its Applications ((IMA,volume 74))

Abstract

Consider the stochastic approximation algorithm (*) % MathType!MTEF!2!1!+- % feaagaart1ev2aaatCvAUfKttLearuqr1ngBPrgarmWu51MyVXgatC % vAUfeBSjuyZL2yd9gzLbvyNv2CaeHbd9wDYLwzYbItLDharyavP1wz % ZbItLDhis9wBH5garqqtubsr4rNCHbGeaGqiVu0Je9sqqrpepC0xbb % L8F4rqqrFfpeea0xe9Lq-Jc9vqaqpepm0xbba9pwe9Q8fs0-yqaqpe % pae9pg0FirpepeKkFr0xfr-xfr-xb9adbaqaaeGaciGaaiaabeqaam % aaeaqbaaGcbiGacWefca0fcqWGybawdaWgaaWcbaGaemOBa4Maey4k % aSIaeGymaedabeaakiabg2da9iabdIfaynaaBaaaleaacqWGUbGBae % qaaOGaey4kaSIaemyyae2aaSbaaSqaaiabd6gaUbqabaGccqWGNbWz % cqGGOaakcqWGybawdaWgaaWcbaGaemOBa4gabeaakiabcYcaSiabe6 % 7a4jabcMcaPiabc6caUaaa!5136! \[ X_{n + 1} = X_n + a_n g(X_n ,\xi ). \]

The problem of selecting the gain or step size sequences a n has been a serious handicap in applications. In a fundamental paper, Polyak and Juditsky [17] showed that (loosely speaking) if the coefficients a n go to zero slower than O(l/n), then the averaged sequence % MathType!MTEF!2!1!+- % feaagaart1ev2aaatCvAUfKttLearuqr1ngBPrgarmWu51MyVXgatC % vAUfeBSjuyZL2yd9gzLbvyNv2CaeHbd9wDYLwzYbItLDharyavP1wz % ZbItLDhis9wBH5garqqtubsr4rNCHbGeaGqiVu0Je9sqqrpepC0xbb % L8F4rqqrFfpeea0xe9Lq-Jc9vqaqpepm0xbba9pwe9Q8fs0-yqaqpe % pae9pg0FirpepeKkFr0xfr-xfr-xb9adbaqaaeGaciGaaiaabeqaam % aaeaqbaaGcbiGacWefca0fdaaeWaqaaiabdIfaybWcbaGaemyAaKMa % eyypa0JaeGymaedabaGaemOBa4ganiabggHiLdGccqWGPbqAcqGGVa % WlcqWGUbGBaaa!472B! \[ \sum\nolimits_{i = 1}^n X i/n \] converged to its limit at an optimum rate, for any coefficient sequence. This result implies that we should use “larger” than usual” gains, and let the off line averaging take care of the increased noise effects, with substantial overall improvement. Here we give a simpler proof under weaker conditions. Basically, it is shown that the averaging works whenever there is a “classical” rate of convergence theorem. I.e., results of this type are generic to stochastic approximation. Intuitive insight is provided by relating the behavior to that of a two time scale discrete algorithm. The value of the method has been supported by simulations. Since the averaged estimate is “off line,” it is not the actual value used in the SA iteration (*) itself. We show how the averaged value can be partially fed back into the actual operating algorithm for improved performance. Numerical data are presented to support the theoretical conclusions. An error in the tightness part of the proof in [14] is corrected.

Supported by AFOSR Contract F 49620-92-0081 and NSF grant ECS-8913351.

Supported by AFOSR Contract F 49620-92-0081.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. A. Benveniste, M. Metivier, AND P. Prioret, Adaptive Algorithms and Stochastic Approximation, Springer-Verlag, New York, Berlin 1990.

    Book  Google Scholar 

  2. P. Billingsley, Convergence of Probability Measures, John Wiley, New York 1968.

    MATH  Google Scholar 

  3. G. Blankenship AND G.C. Papanicolaou, Stability and control of systems with wide band noise disturbances, SIAM J. Appl. Math., 34 (1978), pp. 437–476.

    Article  MathSciNet  MATH  Google Scholar 

  4. B. Deylon AND A. Juditsky, Stochastic optimization with averaging of trajectories, Stochastics, 39 (1992), pp. 107–118.

    Google Scholar 

  5. S.N. Ethier AND T.G. Kurtz, Markov Processes: Characterization and Convergence, Wiley, New York 1986.

    Book  MATH  Google Scholar 

  6. H.J. Kushner AND Hai Huang, Averaging methods for the asymptotic analysis of learning and adaptive systems, SIAM J. on Control and Optimization, 19 (1981), pp. 635–650.

    Article  MathSciNet  MATH  Google Scholar 

  7. H.J. Kushner, Stochastic approximation with discontinuous dynamics and state dependent noise, J. Math. Analysis and Applications, 82 (1981), pp. 527–542.

    Article  MathSciNet  MATH  Google Scholar 

  8. H.J. Kushner, Approximation and Weak Convergence Methods for Random Processes with Applications to Stochastic System Theory, MIT Press, Cambridge, MA 1984.

    Google Scholar 

  9. H.J. Kushner, Weak Convergence Methods and Singularly Perturbed Stochastic Control and Filtering Problems, Volume 3 of Systems and Control, Birkhauser, Boston 1990.

    Google Scholar 

  10. H.J. Kushner AND D.S. Clark, Stochastic Approximation for Constrained and Unconstrained Syetems, Springer-Verlag, Berlin and New York 1978.

    Book  Google Scholar 

  11. H.J. Kushner AND A. Shwartz, An invariant measure approach to the convergence of stochastic approximations with state dependent noise, SIAM J. on Optimization and Control, 22 (1984), pp. 13–27.

    Article  MathSciNet  MATH  Google Scholar 

  12. H.J. Kushner AND J. Yang, A montecario method for the sensitivity analysis and parametric optimization of nonlinear stochastic systems, SIAM J. on Control and Optimization, 29 (1992), pp. 1216–1249.

    MathSciNet  Google Scholar 

  13. H.J. Kushner AND J. Yang, Stochastic approximation with averaging and feedback: rapidly convergent “on line” algorithms, and applications to adaptive systems, (Technical report) Brown University, Lefschetz Center for Dynamical Systems 1992.

    Google Scholar 

  14. H.J. Kushner AND J. Yang, Stochastic approximation with averaging: optimal asymptotic rates of convergence for general processes, SIAM J. on Control and Optimization 31 (1993), pp. 1045–1062.

    Article  MathSciNet  MATH  Google Scholar 

  15. H.J. Kushner AND G. Yin, Asymptotic properties of distributed and communicating stochastic approximation algorithms, SIAM J. on Control and Optimization, 25 (1987), pp. 1266–1290.

    Article  MathSciNet  MATH  Google Scholar 

  16. B.T. Polyak, New stochastic approximation type procedures, Autom. i Telemekh, 7 (1990), pp. 98–107.

    MathSciNet  Google Scholar 

  17. B.T. Polyak AND A.B. Juditsky, Acceleration of stochastic approximation by averaging, SIAM J. on Control and Optimization, 30 (1992), pp. 838–855.

    Article  MathSciNet  MATH  Google Scholar 

  18. D. Ruppbrt, Efficient estimators from a slowly convergent robbins-munro process, Technical Report 787, School of Operations Research and Industrial Engineering, Cornell University 1988.

    Google Scholar 

  19. G. Yin, On extensions of polyak’s averaging approach to stochastic approximation, Stochastics, 36 (1992), pp. 245–264.

    Google Scholar 

  20. G. Yin, Stochastic approximation via averaging: Polyak’s approach revisited, in Lecture Notes in Economics and Mathematical Systems 374 (G. Pflug AND U. Dieter, editors), Springer-Verlag, Berlin 1992, pp. 119–134.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1995 Springer Science+Business Media New York

About this paper

Cite this paper

Kushner, H.J., Yang, J. (1995). Stochastic Approximation with Averaging and Feedback: Faster Convergence. In: Åström, K.J., Goodwin, G.C., Kumar, P.R. (eds) Adaptive Control, Filtering, and Signal Processing. The IMA Volumes in Mathematics and its Applications, vol 74. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-8568-2_9

Download citation

  • DOI: https://doi.org/10.1007/978-1-4419-8568-2_9

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-1-4612-6439-2

  • Online ISBN: 978-1-4419-8568-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics