Abstract
In this paper we obtain non-uniform exponential upper bounds for the rate of convergence of a version of the algorithm Context, when the underlying tree is not necessarily bounded. The algorithm Context is a well-known tool to estimate the context tree of a Variable Length Markov Chain. As a consequence of the exponential bounds we obtain a strong consistency result. We generalize in this way several previous results in the field.
This work is part of PRONEX/FAPESP’s project Stochastic behavior, critical phenomena and rhythmic pattern identification in natural languages (grant number 03/09930-9) and CNPq’s projects Stochastic modeling of speech (grant number 475177/2004-5) and Rhythmic patterns, prosodic domains and probabilistic modeling in Portuguese Corpora (grant number 485999/2007-2). AG is partially supported by a CNPq fellowship (grant 308656/2005-9) and FL is supported by a FAPESP fellowship (grant 06/56980-0).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
P. Bühhmann and A. J. Wyner. Variable length Markov chains. Ann. Statist., 27:480–513, 1999.
F. Comets, R. Fernández, and P. Ferrari. Processes with long memory: Regenerative construction and perfect simulation. Ann. Appl. Probab., 12(3):921–943, 2002.
I. Csiszár and Z. Talata. Context tree estimation for not necessarily finite memory processes, via BIC and MDL. IEEE Trans. Inform. Theory, 52(3):1007–1016, 2006.
J. Dedecker and P. Doukhan. A new covariance inequality and applications. Stochastic Process. Appl., 106(1):63–80, 2003.
J. Dedecker and C. Prieur. New dependence coefficients, examples and applications to statistics. Probab. Theory Related Fields, 132:203–236, 2005.
D. Duarte, A. Galves, and N. L. Garcia. Markov approximation and consistent estimation of unbounded probabilistic suffix trees. Bull. Braz. Math. Soc., 37(4):581–592, 2006.
R. Fernández and A. Galves. Markov approximations of chains of infinite order. Bull. Braz. Math. Soc., 33(3):295–306, 2002.
F. Ferrari and A. Wyner. Estimation of general stationary processes by variable length Markov chains. Scand. J. Statist., 30(3):459–480, 2003.
L. Finesso, C-C. Liu, and P. Narayan. The optimal error exponent for Markov order estimation. IEEE Trans. Inform. Theory, 42(5):1488–1497, 1996.
A. Galves, V. Maume-Deschamps, and B. Schmitt. Exponential inequalities for VLMC empirical trees. ESAIM Prob. Stat. (accepted), 2006.
J. Rissanen. A universal data compression system. IEEE Trans. Inform. Theory, 29(5):656–664, 1983.
D. Ron, Y. Singer, and N. Tishby. The power of amnesia: Learning probabilistic automata with variable memory length. Machine Learning, 25(2–3):117–149, 1996.
M. J. Weinberger, J. Rissanen, and M. Feder. A universal finite memory source. IEEE Trans. Inform. Theory, 41(3):643–652, 1995.
F.M. Willems, Y. M. Shtarkov, and T. J. Tjalkens. The context-tree weighting method: basic properties. IEEE Trans. Inform. Theory, IT-44:653–664, 1995.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Birkhäuser Verlag Basel/Switzerland
About this chapter
Cite this chapter
Galves, A., Leonardi, F. (2008). Exponential Inequalities for Empirical Unbounded Context Trees. In: Sidoravicius, V., Vares, M.E. (eds) In and Out of Equilibrium 2. Progress in Probability, vol 60. Birkhäuser Basel. https://doi.org/10.1007/978-3-7643-8786-0_12
Download citation
DOI: https://doi.org/10.1007/978-3-7643-8786-0_12
Publisher Name: Birkhäuser Basel
Print ISBN: 978-3-7643-8785-3
Online ISBN: 978-3-7643-8786-0
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)