On nearly self-optimizing strategies for a discrete-time uniformly ergodic adaptive model

Stettner, Łukasz

doi:10.1007/BF01195980

Published: March 1993

Volume 27, pages 161–177, (1993)

Applied Mathematics and Optimization

Abstract

We control a discrete-time uniformly ergodic system that depends on an unknown parameter α⁰ ∈ A, where A is a compact set. Our purpose is to minimize the long-run average-cost functional. We estimate the unknown parameter using the biased maximum likelihood estimator and apply a control that is almost optimal for the current estimate. In this way we construct strategies for which the value of the cost functional can be made arbitrarily close to the optimal value obtained for α⁰.
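
The estimate-then-control loop described in the abstract can be illustrated with a toy sketch. This is not the paper's construction: the two-state model, the finite candidate set `ALPHAS` standing in for the compact set A, the costs, and the `sqrt(n)` bias weight are all assumptions chosen for illustration, and the sketch applies the exactly optimal policy for the estimate rather than an almost optimal one. The bias term favours parameters with lower optimal cost, which may help in this toy model because action 1 reveals nothing about the parameter.

```python
import itertools
import math
import random

ALPHAS = (0.2, 0.8)   # hypothetical finite stand-in for the compact set A
STATES = (0, 1)
ACTIONS = (0, 1)


def p_one(alpha, x, u):
    """Probability of moving to state 1 (toy model, an assumption).

    Action 0 makes the transition depend on alpha; action 1 does not,
    so it carries no information about the parameter.
    """
    return alpha if u == 0 else 0.5


def cost(x, u):
    """One-step cost: state 1 is expensive, action 1 has a small fee."""
    return float(x) + 0.1 * u


def average_cost(alpha, policy):
    """Long-run average cost of a stationary policy (policy[x] = action)."""
    p01 = p_one(alpha, 0, policy[0])
    p11 = p_one(alpha, 1, policy[1])
    pi1 = p01 / (p01 + 1.0 - p11)  # stationary probability of state 1
    return (1.0 - pi1) * cost(0, policy[0]) + pi1 * cost(1, policy[1])


def optimal(alpha):
    """Best deterministic stationary policy for a known alpha, by enumeration."""
    policies = itertools.product(ACTIONS, repeat=len(STATES))
    best = min(policies, key=lambda pol: average_cost(alpha, pol))
    return average_cost(alpha, best), best


# Optimal average cost and policy for each candidate parameter.
OPT = {a: optimal(a) for a in ALPHAS}


def simulate(true_alpha, steps, seed=0):
    """Run the adaptive loop for steps >= 1; return (average cost, last estimate)."""
    rng = random.Random(seed)
    loglik = {a: 0.0 for a in ALPHAS}
    x, total, estimate = 0, 0.0, ALPHAS[0]
    for n in range(1, steps + 1):
        # Biased ML estimate (assumed form): log-likelihood minus an
        # o(n) penalty proportional to the optimal cost of the candidate.
        estimate = max(ALPHAS, key=lambda a: loglik[a] - math.sqrt(n) * OPT[a][0])
        u = OPT[estimate][1][x]          # control that is optimal for the estimate
        total += cost(x, u)
        x_next = 1 if rng.random() < p_one(true_alpha, x, u) else 0
        for a in ALPHAS:                 # update each candidate's log-likelihood
            p = p_one(a, x, u)
            loglik[a] += math.log(p if x_next == 1 else 1.0 - p)
        x = x_next
    return total / steps, estimate


if __name__ == "__main__":
    avg, est = simulate(true_alpha=0.8, steps=5000)
    print("empirical average cost:", avg, "final estimate:", est)
```

Comparing the returned empirical average cost with `OPT[true_alpha][0]` gives a rough sense of how close the adaptive strategy comes to the optimal cost in this toy setting.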

Author information

Authors and Affiliations

Institute of Mathematics, Polish Academy of Sciences, Śniadeckich 8, 00-950, Warsaw, Poland
Łukasz Stettner

About this article

Cite this article

Stettner, Ł. On nearly self-optimizing strategies for a discrete-time uniformly ergodic adaptive model. Appl Math Optim 27, 161–177 (1993). https://doi.org/10.1007/BF01195980

Accepted: 22 October 1991
Issue Date: March 1993
DOI: https://doi.org/10.1007/BF01195980
