Skip to main content
Log in

Data-driven model selection for same-realization predictions in autoregressive processes

  • Published:
Annals of the Institute of Statistical Mathematics Aims and scope Submit manuscript

Abstract

This paper is about the one-step ahead prediction of the future of observations drawn from an infinite-order autoregressive AR(\(\infty \)) process. It aims to design penalties (fully data driven) ensuring that the selected model verifies the efficiency property but in the non-asymptotic framework. We show that the excess risk of the selected estimator enjoys the best bias-variance trade-off over the considered collection. To achieve these results, we needed to overcome the dependence difficulties by following a classical approach which consists in restricting to a set where the empirical covariance matrix is equivalent to the theoretical one. We show that this event happens with probability larger than \(1-c_0/n^2\) with \(c_0>0\). The proposed data-driven criteria are based on the minimization of the penalized criterion akin to the Mallows’s \(C_p\).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Baraud, Y., Comte, F., Viennet, G. (2001a). Model selection for (auto-) regression with dependent data. ESAIM: Probability and Statistics, 5, 33–49.

    Article  MathSciNet  MATH  Google Scholar 

  • Baraud, Y., Comte, F., Viennet, G., et al. (2001b). Adaptive estimation in autoregression or-mixing regression via model selection. The Annals of Statistics, 29(3), 839–875.

    Article  MathSciNet  MATH  Google Scholar 

  • Bardet, J.-M., Wintenberger, O. (2009). Asymptotic normality of the quasi-maximum likelihood estimator for multidimensional causal processes. The Annals of Statistics, 37(5B), 2730–2759.

    Article  MathSciNet  MATH  Google Scholar 

  • Birgé, L., Massart, P. (2001). Gaussian model selection. Journal of the European Mathematical Society, 3(3), 203–268.

    Article  MathSciNet  MATH  Google Scholar 

  • Birgé, L., Massart, P. (2007). Minimal penalties for gaussian model selection. Probability theory and related fields, 138(1–2), 33–73.

    Article  MathSciNet  MATH  Google Scholar 

  • Comte, F., Dedecker, J., Taupin, M.-L. (2008). Adaptive density deconvolution with dependent inputs. Mathematical methods of Statistics, 17(2), 87.

    Article  MathSciNet  MATH  Google Scholar 

  • Comte, F., Genon-Catalot, V. (2020). Regression function estimation as a partly inverse problem. Annals of the Institute of Statistical Mathematics, 72(4), 1023–1054.

    Article  MathSciNet  MATH  Google Scholar 

  • Dedecker, J., Prieur, C. (2005). New dependence coefficients. examples and applications to statistics. Probability Theory and Related Fields, 132(2), 203–236.

    Article  MathSciNet  MATH  Google Scholar 

  • Doukhan, P., Wintenberger, O. (2008). Weakly dependent chains with infinite memory. Stochastic Processes and their Applications, 118(11), 1997–2013.

    Article  MathSciNet  MATH  Google Scholar 

  • Goldenshluger, A., Zeevi, A. (2001). Nonasymptotic bounds for autoregressive time series modeling. Annals of Statistics, 29, 417–444.

    Article  MathSciNet  MATH  Google Scholar 

  • Hsu, D., Kakade, S. M., Zhang, T. ( 2011). An analysis of random design linear regression. arXiv preprint arXiv:1106.2363 .

  • Ing, C.-K., Wei, C.-Z. (2003). On same-realization prediction in an infinite-order autoregressive process. Journal of Multivariate Analysis, 85(1), 130–155.

    Article  MathSciNet  MATH  Google Scholar 

  • Ing, C.-K., Wei, C.-Z., et al. (2005). Order selection for same-realization predictions in autoregressive processes. The Annals of Statistics, 33(5), 2423–2474.

    Article  MathSciNet  MATH  Google Scholar 

  • Klein, T., Rio, E., et al. (2005). Concentration around the mean for maxima of empirical processes. The Annals of Probability, 33(3), 1060–1077.

    Article  MathSciNet  MATH  Google Scholar 

  • Lebarbier, E., Mary-Huard, T. (2004). Le critère BIC: fondements théoriques et interprétation, PhD thesis, INRIA.

  • Lerasle, M., et al. (2011). Optimal model selection for density estimation of stationary data under various mixing conditions. The Annals of Statistics, 39(4), 1852–1877.

    Article  MathSciNet  MATH  Google Scholar 

  • Shibata, R. (1980). Asymptotically efficient selection of the order of the model for estimating parameters of a linear process. The Annals of Statistics, 8(1), 147–164.

    Article  MathSciNet  MATH  Google Scholar 

  • Van de Geer, S. A. (2002). On hoeffding’s inequality for dependent random variables. In Empirical process techniques for dependent data (pp. 161–169), Springer.

Download references

Acknowledgements

The author thanks William KENGNE and Jean-Marc BARDET for proofreads and helpful discussions. Kare Kamila has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement No 754362.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kare Kamila.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Kare Kamila has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement No 754362.

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kamila, K. Data-driven model selection for same-realization predictions in autoregressive processes. Ann Inst Stat Math 75, 567–592 (2023). https://doi.org/10.1007/s10463-022-00855-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10463-022-00855-1

Keywords

Navigation