Abstract
We consider selecting a regression model, using a variant of the general-to-specific algorithm in PcGets, when there are more variables than observations. We look at the special case where the variables are single impulse dummies, one defined for each observation. We show that this setting is unproblematic if tackled appropriately, and obtain the asymptotic distribution of the mean and variance in a location-scale model, under the null that no impulses matter. Monte Carlo simulations confirm the null distributions and suggest extensions to highly non-normal cases.
Similar content being viewed by others
References
Campos J, Hendry DF, Krolzig H-M (2003) Consistent model selection by an automatic gets approach. Oxf Bull Econ Stat 65:803–819
Campos J, Ericsson NR, Hendry DF (2005) Editor’s introduction. In: Campos J, Ericsson NR, Hendry DF (eds) Readings on general-to-specific modelling. Edward Elgar, Cheltenham (forthcoming)
Doornik JA (2001) OX, an object oriented matrix programming language, 4th edn. Timberlake Consultants Press, London
Foster DP, Stine RA (2004) Honest confidence intervals for the error variance in stepwise regression, Mimeo. Statistics Department, Wharton School, University of Pennsylvania
Granger CWJ, Hendry DF (2003) A dialogue concerning a new instrument for econometric modelling. Unpublished Paper, Department of Economics, University of Oxford
Hendry DF (1995) Dynamic econometrics. Oxford University Press, Oxford
Hendry DF, Krolzig H-M (2001) Automatic econometric model selection using PcGets. Timberlake Consultants Press, London
Hendry DF, Krolzig H-M (2003) New developments in automatic general-to-specific modelling. In: Stigum BP (ed) Econometrics and the philosophy of economics. Princeton University Press, Princeton, pp 379–419
Hendry DF, Krolzig H-M (2004) Model selection with more variables than observations. Unpublished Paper, Department of Economics, University of Oxford
Hendry DF, Krolzig H-M (2005) The properties of automatic GETS modelling. Econ J 115:C32–C61
Hendry DF, Santos C (2005) Regression models with data-based indicator variables. Oxf Bull Econ Stat 67:571–595
Hendry DF, Santos C (2006) Automatic tests for super exogeneity. Unpublished Paper, Department of Economics, University of Oxford
Hoover KD, Perez SJ (1999) Data mining reconsidered: encompassing and the general-to-specific approach to specification search. Econometrics J 2:167–191
Hoover SJ, Perez SJ (2004) Truth and robustness in cross-country growth regressions. Oxf Bull Econ Stat 66(5):765–798
Krolzig H-M, Hendry DF (2001) Computer automation of general-to-specific model selection procedures. J Econ Dyn Control 25:831–866
Salkever DS (1976) The use of dummy variables to compute predictions, prediction errors, and confidence intervals. J Econometrics 4:393–397
White H (1984) Asymptotic theory for econometricians. Academic, London
Author information
Authors and Affiliations
Corresponding author
Additional information
An erratum to this article can be found at http://dx.doi.org/10.1007/s00180-008-0112-1
Rights and permissions
About this article
Cite this article
Santos, C., Hendry, D.F. & Johansen, S. Automatic selection of indicators in a fully saturated regression. Computational Statistics 23, 317–335 (2008). https://doi.org/10.1007/s00180-007-0054-z
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00180-007-0054-z