Skip to main content
Log in

Corrected versions of cross-validation criteria for selecting multivariate regression and growth curve models

  • Cross-validation
  • Published:
Annals of the Institute of Statistical Mathematics Aims and scope Submit manuscript

Abstract

This paper is concerned with cross-validation (CV) criteria for choice of models, which can be regarded as approximately unbiased estimators for two types of risk functions. One is AIC type of risk or equivalently the expected Kullback-Leibler distance between the distributions of observations under a candidate model and the true model. The other is based on the expected mean squared error of prediction. In this paper we study asymptotic properties of CV criteria for selecting multivariate regression models and growth curve models under the assumption that a candidate model includes the true model. Based on the results, we propose their corrected versions which are more nearly unbiased for their risks. Through numerical experiments, some tendency of the CV criteria will be also pointed.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Akaike, H. (1973). Information theory and an extension of the maximum likelihood principle,2nd International Symposium on Information Theory (eds. B. N. Petrov and F. Csáki), 267–281, Akadémia Kiado, Budapest.

    Google Scholar 

  • Anderson, T. W. (1984).An Introduction to Multivariate Analysis, 2nd ed., John Wiley & Sons, New York.

    Google Scholar 

  • Bedrick, E. J. and Tsai, C. L. (1994). Model selection for multivariate regression in small samples,Biometrics,76, 226–231.

    Article  Google Scholar 

  • Fujikoshi, Y. and Satoh, K. (1994). Modified AIC andC p in multivariate linear regression,Biometrika,84, 707–716.

    Article  MathSciNet  Google Scholar 

  • Gleser, L. J. and Olkin, I. (1970). Linear models in multivariate analysis,Essays in Probability and Statistics (eds. R. C. Bose, I. M. Chakravarti, P. C. Mahalanobis, C. R. Rao and K. J. C. Smith, University of North Carolina Press, Chapel Hill, North Carolina.

    Google Scholar 

  • Mallows, C. L. (1973). Some comments onC p,Technometrics,15, 661–675.

    Article  MATH  Google Scholar 

  • Potthoff, R. F. and Roy, S. N. (1964). A generalized multivariate analysis of variance model useful especially for growth curve problems,Biometrika,51, 313–325.

    Article  MATH  MathSciNet  Google Scholar 

  • Satoh, K., Kobayashi, M. and Fujikoshi, Y. (1997). Variable selection for the growth curve model,J. Multivariate Anal.,60, 277–292.

    Article  MATH  MathSciNet  Google Scholar 

  • Siotani, M., Hayakawa, T. and Fujikoshi, Y. (1985).Modern Multivariate Statistical Analysis: A Graduate Course and Handbook, American Sciences Press, Columbus, Ohio.

    Google Scholar 

  • Stone, M. (1974). Cross-validation and multinomial prediction,Biometrika,61, 509–515.

    Article  MATH  MathSciNet  Google Scholar 

  • Stone, M. (1977). An asymptotic equivalence of choice of model by cross-validation and Akaike's criterion,J. Roy. Statist. Soc. Ser. B,39, 44–47.

    MATH  MathSciNet  Google Scholar 

  • Sugiura, N. (1978). Further analysis of the data by Akaike's information criterion and the finite corrections,Comm. Statist. Theory Methods,7, 13–26.

    MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

About this article

Cite this article

Fujikoshi, Y., Noguchi, T., Ohtaki, M. et al. Corrected versions of cross-validation criteria for selecting multivariate regression and growth curve models. Ann Inst Stat Math 55, 537–553 (2003). https://doi.org/10.1007/BF02517806

Download citation

  • Received:

  • Revised:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF02517806

Key words and phrases

Navigation