Robust Dynamics and Control of a Partially Observed Markov Chain

Elliott, R. J.; Malcolm, W. P.; Moore, J. P.

doi:10.1007/s00245-007-9007-8

Robust Dynamics and Control of a Partially Observed Markov Chain

Published: 28 August 2007

Volume 56, pages 303–311, (2007)
Cite this article

Applied Mathematics and Optimization Submit manuscript

R. J. Elliott¹,
W. P. Malcolm² &
J. P. Moore²

88 Accesses
2 Citations
Explore all metrics

Abstract

In a seminal paper, Martin Clark (Communications Systems and Random Process Theory, Darlington, 1977, pp. 721–734, 1978) showed how the filtered dynamics giving the optimal estimate of a Markov chain observed in Gaussian noise can be expressed using an ordinary differential equation. These results offer substantial benefits in filtering and in control, often simplifying the analysis and an in some settings providing numerical benefits, see, for example Malcolm et al. (J. Appl. Math. Stoch. Anal., 2007, to appear).

Clark’s method uses a gauge transformation and, in effect, solves the Wonham-Zakai equation using variation of constants. In this article, we consider the optimal control of a partially observed Markov chain. This problem is discussed in Elliott et al. (Hidden Markov Models Estimation and Control, Applications of Mathematics Series, vol. 29, 1995). The innovation in our results is that the robust dynamics of Clark are used to compute forward in time dynamics for a simplified adjoint process. A stochastic minimum principle is established.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Bismut, J.-M.: An introductory approach to duality in optimal stochastic control. SIAM Rev. 20(1), 62–78 (1978)
Article MATH MathSciNet Google Scholar
Clark, J.M.C.: The design of robust approximations to the stochastic differential equations for nonlinear filtering. In: Skwirzynski, J.K. (ed.) Communications Systems and Random Process Theory, Darlington, 1977, pp. 721–734. Sijthoff and Noorhoff, Alphen aan den Rijn (1978)
Google Scholar
Davis, M.H.A.: Martingale methods in stochastic control. In: Stochastic Control Theory and Stochastic Differential Systems, pp. 85–117. Springer, New York (1979)
Chapter Google Scholar
Elliott, R.J.: A stochastic minimum principle. Bull. Am. Math. Soc. 82, 944–946 (1976)
Article MATH Google Scholar
Elliott, R.J.: Stochastic Calculus and its Applications. Springer, Berlin (1982)
Google Scholar
Elliott, R.J.: A partially observed control problem for Markov chains. Appl. Math. Optim. 25, 151–169 (1992)
Article MATH MathSciNet Google Scholar
Elliott, R.J., Moore, J.P.: Adjoint processes for Markov chains observed in Gaussian Noise. In: 26th Asilomar Conference on Signals Systems and Computing, vol. 1, pp. 396–399. California, USA, October 1992
Elliott, R.J., Aggoun, L., Moore, J.P.: Hidden Markov Models Estimation and Control. Applications of Mathematics Series, vol. 29. Springer, Berlin (1995)
MATH Google Scholar
Fleming, W.H.: Optimal continuous-parameter stochastic control. SIAM Rev. 11(4) (October 1969)
Haussmann, U.G.: Some examples of optimal stochastic controls or: the stochastic maximum principle at work. SIAM Rev. 23(3), 292–307 (1981)
Article MATH MathSciNet Google Scholar
James, M.R., Krishnamurthy, V., Le Gland, F.: Time discretization of continuous-time filters and smoothers for HMM parameter estimation. IEEE Trans. Inform. Theory 42(2), 593–605 (1996)
Article MATH Google Scholar
Malcolm, W.P., Elliott, R.J., van der Hoek, J.: A deterministic discretisation-step upper bound for state estimation via Clark transformations. J. Appl. Math. Stoch. Anal. 371–384 (2004)
Malcolm, W.P., Elliott, R.J., van der Hoek, J.: On the numerical stability of time-discretised state estimation via Clark transformations. In: 42nd IEEE Conference on Decision and Control, pp. 1406–1412, Mauii, Hawaii, December 2003

Download references

Author information

Authors and Affiliations

Haskayne School of Business, Scurfield Hall, University of Calgary, 2500 University Drive NW, Calgary, AB, Canada, T2N 1N4
R. J. Elliott
National ICT Australia, Locked Bag 8001, Canberra, ACT, 2601, Australia
W. P. Malcolm & J. P. Moore

Authors

R. J. Elliott
View author publications
You can also search for this author in PubMed Google Scholar
W. P. Malcolm
View author publications
You can also search for this author in PubMed Google Scholar
J. P. Moore
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to R. J. Elliott.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Elliott, R.J., Malcolm, W.P. & Moore, J.P. Robust Dynamics and Control of a Partially Observed Markov Chain. Appl Math Optim 56, 303–311 (2007). https://doi.org/10.1007/s00245-007-9007-8

Download citation

Published: 28 August 2007
Issue Date: December 2007
DOI: https://doi.org/10.1007/s00245-007-9007-8

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Robust Dynamics and Control of a Partially Observed Markov Chain

Abstract

Access this article

Similar content being viewed by others

Parameter Estimation Problems in Markov Random Processes

Uncertainty and filtering of hidden Markov models in discrete time

Optimal Linear Responses for Markov Chains and Stochastically Perturbed Dynamical Systems

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Robust Dynamics and Control of a Partially Observed Markov Chain

Abstract

Access this article

Similar content being viewed by others

Parameter Estimation Problems in Markov Random Processes

Uncertainty and filtering of hidden Markov models in discrete time

Optimal Linear Responses for Markov Chains and Stochastically Perturbed Dynamical Systems

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation