Consistency of Feature Markov Processes

Sunehag, Peter; Hutter, Marcus

doi:10.1007/978-3-642-16108-7_29

Consistency of Feature Markov Processes

Peter Sunehag²³ &
Marcus Hutter²³

Conference paper

1131 Accesses
6 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6331))

Abstract

We are studying long term sequence prediction (forecasting). We approach this by investigating criteria for choosing a compact useful state representation. The state is supposed to summarize useful information from the history. We want a method that is asymptotically consistent in the sense it will provably eventually only choose between alternatives that satisfy an optimality property related to the used criterion. We extend our work to the case where there is side information that one can take advantage of and, furthermore, we briefly discuss the active setting where an agent takes actions to achieve desirable outcomes.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Baum, L.E., Petrie, T.: Statistical inference for probabilistic functions of Finite State Markov chains. The Annals of Mathematical Statistics 37(6), 1554–1563 (1966)
Article MATH MathSciNet Google Scholar
Cappé, O., Moulines, E., Rydenp, T.: Inference in Hidden Markov Models. Springer Series in Statistics. Springer, New York (2005)
MATH Google Scholar
Csiszr, I., Shields, P.C.: The consistency of the bic markov order estimator (2000)
Google Scholar
Ephraim, Y., Merhav, N.: Hidden Markov processes. IEEE Transactions on Information Theory 48(6), 1518–1569 (2002)
Article MATH MathSciNet Google Scholar
Finesso, L., Liu, C., Narayan, P.: The optimal error exponent for markov order estimation. IEEE Trans. Inform. Theory 42, 1488–1497 (1996)
Article MATH MathSciNet Google Scholar
Gassiat, E., Boucheron, S.: Optimal error exponents in hidden Markov models order estimation. IEEE Transactions on Information Theory 49(4), 964–980 (2003)
Article MATH MathSciNet Google Scholar
Hutter, M.: Feature reinforcement learning: Part I: Unstructured MDPs. Journal of Artificial General Intelligence 1, 3–24 (2009)
Google Scholar
Mahmud, M.M.: Constructing states for reinforcement learning. In: The 27:th International Conference on Machine Learning, ICML 2010 (2010)
Google Scholar
McCallum, A.K.: Reinforcement learning with selective perception and hidden state. PhD thesis, The University of Rochester (1996)
Google Scholar
Petrie, T.: Probabilistic functions of Finite State Markov chains. The Annals of Mathematical Statistics 40(1), 97–115 (1969)
Article MATH MathSciNet Google Scholar
Rissanen, J.: A universal data compression system. IEEE Transactions on Information Theory 29(5), 656–663 (1983)
Article MATH MathSciNet Google Scholar
Rissanen, J.: Complexity of strings in the class of Markov sources. IEEE Transactions on Information Theory 32(4), 526–532 (1986)
Article MATH MathSciNet Google Scholar
Russell, S., Norvig, P.: Artificial Intelligence: A Modern Approach, 3rd edn. Prentice-Hall, Englewood Cliffs (2010)
Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction (Adaptive Computation and Machine Learning). MIT Press, Cambridge (March 1998)
Google Scholar
Singer, Y.: Adaptive mixtures of probabilistic transducers. Neural Computation 9, 1711–1733 (1996)
Article Google Scholar
Vidal, E., Thollard, F., de la Higuera, C., Casacuberta, F., Carrasco, R.C.: Probabilistic finite-state machines – Part I. IEEE Transactions on Pattern Analysis and Machine Intelligence 27(7), 1013–1025 (2005a)
Article Google Scholar

Download references

Author information

Authors and Affiliations

RSISE@Australian National University and SML@NICTA, Canberra, ACT, 0200, Australia
Peter Sunehag & Marcus Hutter

Authors

Peter Sunehag
View author publications
You can also search for this author in PubMed Google Scholar
Marcus Hutter
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Research School of Information Sciences and Engineering, Australian National University and NICTA, 0200, Canberra, ACT, Australia
Marcus Hutter
Department of Mathematics, National University of Singapore, Block S17, 10 Lower Kent Ridge Road, 119076, Singapore, Republic of Singapore
Frank Stephan
Department of Computer Science, University of London, Royal Holloway, TW20 0EX, Egham, Surrey, UK
Vladimir Vovk
Division of Computer Science, Hokkaido University, , ,, N-14, W-9, Sapporo, 060-0814, Japan
Thomas Zeugmann

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sunehag, P., Hutter, M. (2010). Consistency of Feature Markov Processes. In: Hutter, M., Stephan, F., Vovk, V., Zeugmann, T. (eds) Algorithmic Learning Theory. ALT 2010. Lecture Notes in Computer Science(), vol 6331. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16108-7_29

Download citation

DOI: https://doi.org/10.1007/978-3-642-16108-7_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16107-0
Online ISBN: 978-3-642-16108-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics