Data-driven model selection for same-realization predictions in autoregressive processes

Kamila, Kare

doi:10.1007/s10463-022-00855-1

Data-driven model selection for same-realization predictions in autoregressive processes

Published: 27 November 2022

Volume 75, pages 567–592, (2023)
Cite this article

Annals of the Institute of Statistical Mathematics Aims and scope Submit manuscript

Kare Kamila¹

114 Accesses
Explore all metrics

Abstract

This paper is about the one-step ahead prediction of the future of observations drawn from an infinite-order autoregressive AR(\(\infty \)) process. It aims to design penalties (fully data driven) ensuring that the selected model verifies the efficiency property but in the non-asymptotic framework. We show that the excess risk of the selected estimator enjoys the best bias-variance trade-off over the considered collection. To achieve these results, we needed to overcome the dependence difficulties by following a classical approach which consists in restricting to a set where the empirical covariance matrix is equivalent to the theoretical one. We show that this event happens with probability larger than \(1-c_0/n^2\) with \(c_0>0\). The proposed data-driven criteria are based on the minimization of the penalized criterion akin to the Mallows’s \(C_p\).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Evaluating time series forecasting models: an empirical study on performance estimation methods

Article 13 October 2020

Machine Learning Strategies for Time Series Forecasting

Causal inference for time series analysis: problems, methods and evaluation

Article 23 November 2021

References

Baraud, Y., Comte, F., Viennet, G. (2001a). Model selection for (auto-) regression with dependent data. ESAIM: Probability and Statistics, 5, 33–49.
Article MathSciNet MATH Google Scholar
Baraud, Y., Comte, F., Viennet, G., et al. (2001b). Adaptive estimation in autoregression or-mixing regression via model selection. The Annals of Statistics, 29(3), 839–875.
Article MathSciNet MATH Google Scholar
Bardet, J.-M., Wintenberger, O. (2009). Asymptotic normality of the quasi-maximum likelihood estimator for multidimensional causal processes. The Annals of Statistics, 37(5B), 2730–2759.
Article MathSciNet MATH Google Scholar
Birgé, L., Massart, P. (2001). Gaussian model selection. Journal of the European Mathematical Society, 3(3), 203–268.
Article MathSciNet MATH Google Scholar
Birgé, L., Massart, P. (2007). Minimal penalties for gaussian model selection. Probability theory and related fields, 138(1–2), 33–73.
Article MathSciNet MATH Google Scholar
Comte, F., Dedecker, J., Taupin, M.-L. (2008). Adaptive density deconvolution with dependent inputs. Mathematical methods of Statistics, 17(2), 87.
Article MathSciNet MATH Google Scholar
Comte, F., Genon-Catalot, V. (2020). Regression function estimation as a partly inverse problem. Annals of the Institute of Statistical Mathematics, 72(4), 1023–1054.
Article MathSciNet MATH Google Scholar
Dedecker, J., Prieur, C. (2005). New dependence coefficients. examples and applications to statistics. Probability Theory and Related Fields, 132(2), 203–236.
Article MathSciNet MATH Google Scholar
Doukhan, P., Wintenberger, O. (2008). Weakly dependent chains with infinite memory. Stochastic Processes and their Applications, 118(11), 1997–2013.
Article MathSciNet MATH Google Scholar
Goldenshluger, A., Zeevi, A. (2001). Nonasymptotic bounds for autoregressive time series modeling. Annals of Statistics, 29, 417–444.
Article MathSciNet MATH Google Scholar
Hsu, D., Kakade, S. M., Zhang, T. ( 2011). An analysis of random design linear regression. arXiv preprint arXiv:1106.2363 .
Ing, C.-K., Wei, C.-Z. (2003). On same-realization prediction in an infinite-order autoregressive process. Journal of Multivariate Analysis, 85(1), 130–155.
Article MathSciNet MATH Google Scholar
Ing, C.-K., Wei, C.-Z., et al. (2005). Order selection for same-realization predictions in autoregressive processes. The Annals of Statistics, 33(5), 2423–2474.
Article MathSciNet MATH Google Scholar
Klein, T., Rio, E., et al. (2005). Concentration around the mean for maxima of empirical processes. The Annals of Probability, 33(3), 1060–1077.
Article MathSciNet MATH Google Scholar
Lebarbier, E., Mary-Huard, T. (2004). Le critère BIC: fondements théoriques et interprétation, PhD thesis, INRIA.
Lerasle, M., et al. (2011). Optimal model selection for density estimation of stationary data under various mixing conditions. The Annals of Statistics, 39(4), 1852–1877.
Article MathSciNet MATH Google Scholar
Shibata, R. (1980). Asymptotically efficient selection of the order of the model for estimating parameters of a linear process. The Annals of Statistics, 8(1), 147–164.
Article MathSciNet MATH Google Scholar
Van de Geer, S. A. (2002). On hoeffding’s inequality for dependent random variables. In Empirical process techniques for dependent data (pp. 161–169), Springer.

Download references

Acknowledgements

The author thanks William KENGNE and Jean-Marc BARDET for proofreads and helpful discussions. Kare Kamila has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement No 754362.

Author information

Authors and Affiliations

SAMM, Université Paris 1, Panthéon-Sorbonne, 90, rue de Tolbiac, 75634, Paris, France
Kare Kamila

Authors

Kare Kamila
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kare Kamila.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Kare Kamila has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement No 754362.

About this article

Cite this article

Kamila, K. Data-driven model selection for same-realization predictions in autoregressive processes. Ann Inst Stat Math 75, 567–592 (2023). https://doi.org/10.1007/s10463-022-00855-1

Download citation

Received: 11 April 2021
Revised: 13 September 2022
Accepted: 11 October 2022
Published: 27 November 2022
Issue Date: August 2023
DOI: https://doi.org/10.1007/s10463-022-00855-1

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Data-driven model selection for same-realization predictions in autoregressive processes

Abstract

Access this article

Similar content being viewed by others

Evaluating time series forecasting models: an empirical study on performance estimation methods

Machine Learning Strategies for Time Series Forecasting

Causal inference for time series analysis: problems, methods and evaluation

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

About this article

Cite this article

Keywords

Navigation

Data-driven model selection for same-realization predictions in autoregressive processes

Abstract

Access this article

Similar content being viewed by others

Evaluating time series forecasting models: an empirical study on performance estimation methods

Machine Learning Strategies for Time Series Forecasting

Causal inference for time series analysis: problems, methods and evaluation

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

About this article

Cite this article

Share this article

Keywords

Search

Navigation