Prediction error criterion for selecting variables in a linear regression model

Fujikoshi, Yasunori; Kan, Tamio; Takahashi, Shin; Sakurai, Tetsuro

doi:10.1007/s10463-009-0233-5

Published: 30 April 2009

Volume 63, pages 387–403, (2011)

Annals of the Institute of Statistical Mathematics

Abstract

Several criteria, such as CV, C_p, AIC, CAIC, and MAIC, are used for selecting variables in linear regression models. It might be noted that C_p has been proposed as an estimator of the expected standardized prediction error, although the target risk function of CV might be regarded as the expected prediction error R_PE. On the other hand, the target risk function of AIC, CAIC, and MAIC is the expected log-predictive likelihood. In this paper, we propose a prediction error criterion, PE, which is an estimator of the expected prediction error R_PE. Consequently, it is also a competitor of CV. Results of this study show that PE is an unbiased estimator when the true model is contained in the full model. The property is shown without the assumption of normality. In fact, PE is demonstrated as more faithful for its risk function than CV. The prediction error criterion PE is extended to the multivariate case. Furthermore, using simulations, we examine some peculiarities of all these criteria.
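To make the comparison in the abstract concrete, the following minimal sketch computes two of the established criteria it names, leave-one-out CV and Mallows' C_p, for every subset of candidate variables. Everything here is an illustrative assumption rather than the authors' procedure: the synthetic data, the helper names (`ols`, `loo_cv`, `mallows_cp`, `design`), and the scaling of CV as a plain sum of squared leave-one-out errors. The PE criterion itself is not implemented, because its formula is not given in the abstract.

```python
import random
from itertools import combinations

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(M[r][k] * x[k] for k in range(r + 1, n))
        x[r] = (M[r][n] - s) / M[r][r]
    return x

def ols(X, y):
    """Least-squares coefficients from the normal equations X'X b = X'y."""
    k = len(X[0])
    XtX = [[sum(row[a] * row[b] for row in X) for b in range(k)] for a in range(k)]
    Xty = [sum(row[a] * yi for row, yi in zip(X, y)) for a in range(k)]
    return solve(XtX, Xty)

def predict(row, beta):
    return sum(r * b for r, b in zip(row, beta))

def rss(X, y):
    """Residual sum of squares of the least-squares fit."""
    beta = ols(X, y)
    return sum((yi - predict(row, beta)) ** 2 for row, yi in zip(X, y))

def loo_cv(X, y):
    """Sum of squared leave-one-out prediction errors (refits n times)."""
    total = 0.0
    for i in range(len(y)):
        beta = ols(X[:i] + X[i + 1:], y[:i] + y[i + 1:])
        total += (y[i] - predict(X[i], beta)) ** 2
    return total

def mallows_cp(rss_sub, k_sub, rss_full, k_full, n):
    """Mallows' C_p, with the error variance estimated from the full model."""
    sigma2 = rss_full / (n - k_full)
    return rss_sub / sigma2 + 2 * k_sub - n

def design(data, cols):
    """Design matrix with an intercept and the selected columns."""
    return [[1.0] + [row[c] for c in cols] for row in data]

# Synthetic data (assumption): only the first of three variables matters.
random.seed(0)
n, p = 40, 3
data = [[random.gauss(0, 1) for _ in range(p)] for _ in range(n)]
y = [1.0 + 2.0 * row[0] + random.gauss(0, 1) for row in data]

full_cols = tuple(range(p))
k_full = p + 1  # intercept plus all candidate variables
rss_full = rss(design(data, full_cols), y)

results = {}
for size in range(p + 1):
    for cols in combinations(range(p), size):
        X = design(data, cols)
        r = rss(X, y)
        results[cols] = {
            "rss": r,
            "cv": loo_cv(X, y),
            "cp": mallows_cp(r, len(cols) + 1, rss_full, k_full, n),
        }

for cols, v in results.items():
    print(cols, round(v["cv"], 3), round(v["cp"], 3))
```

The sketch refits the model n times for CV because that is the most transparent form; in practice the same quantity can be obtained from a single fit using the hat-matrix diagonal. For the full model, C_p equals the number of fitted parameters by construction, which gives a quick sanity check on the implementation.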

Author information

Authors and Affiliations

Department of Mathematics, Graduate School of Science and Engineering, Chuo University, 1-13-27 Kasuga, Bunkyo-ku, Tokyo, 112-8551, Japan
Yasunori Fujikoshi & Tetsuro Sakurai
Esumi Co. Ltd., Nakano F bldg. 8F, 4-44-18 Honcho, Nakano-ku, Tokyo, 164-0012, Japan
Tamio Kan & Shin Takahashi

Corresponding author

Correspondence to Yasunori Fujikoshi.

About this article

Cite this article

Fujikoshi, Y., Kan, T., Takahashi, S. et al. Prediction error criterion for selecting variables in a linear regression model. Ann Inst Stat Math 63, 387–403 (2011). https://doi.org/10.1007/s10463-009-0233-5

Received: 12 September 2007
Revised: 01 September 2008
Published: 30 April 2009
Issue Date: April 2011
DOI: https://doi.org/10.1007/s10463-009-0233-5
