Skip to main content
Log in

On nearly self-optimizing strategies for a discrete-time uniformly ergodic adaptive model

  • Published:
Applied Mathematics and Optimization Aims and scope Submit manuscript

Abstract

We control a discrete-time uniformly ergodic system, which depends on an unknown parameter α0 εA, a compact set. Our purpose is to minimize the long-run average-cost functional. We estimate the unknown parameter using the biased maximum likelihood estimator and apply the control which is almost optimal for the value of estimation. This way we construct strategies such that the value of the cost functional can be arbitrarily close to the optimal value obtained for α0.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Borkar V (1990) The Kumar-Becker-Lin scheme revisited. J Optim Theory Appl 66:289–309

    Google Scholar 

  2. Borkar V (to appear) Self-tuning control of diffusions without identifiability condition. J Optim Theory Appl

  3. Borkar V, Bagchi A (1982) Parameter estimation in continuous time stochastic processes. Stochastics 8:193–212

    Google Scholar 

  4. Doob JL (1953) Stochastic Processes. Wiley, New York

    Google Scholar 

  5. Gubenko LG, Shtatland ES (1972) On discrete time Markov decision processes. Probab Theory Math Statist 7:51–64

    Google Scholar 

  6. Hernandez-Lerma O (1989) Adaptive Markov Control Processes. Springer-Verlag, Berlin

    Google Scholar 

  7. Kartashow NW (1984) Criteria for uniform ergodicity and strong stability of Markov chains in general state space. Probab Theory Math Statist 30:65–81

    Google Scholar 

  8. Kumar PR, Becker A (1982) A new family of optimal adaptive controllers for Markov chains. IEEE Trans Automat Control 27:137–146

    Google Scholar 

  9. Ueno T (1957) Some limit theorems for temporally discrete Markov processes. J Fac Sci Univ Tokyo 7:449–462

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Stettner, Ł. On nearly self-optimizing strategies for a discrete-time uniformly ergodic adaptive model. Appl Math Optim 27, 161–177 (1993). https://doi.org/10.1007/BF01195980

Download citation

  • Accepted:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF01195980

Key words

AMS classification

Navigation