Abstract
We propose a method for selecting terms to be included into a regression model, when a part of the primary candidates is specified (e.g., the main effects), and discuss related experimental design problems. A distinctive feature here is a deficit in the admissible number of experiments in comparison with a much larger number of candidate terms. We apply a large number of random projections of candidate terms to eliminate spurious terms. The design problem is solved for a linear regression with a very large number of interactions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
The model (1) resembles a model that was proposed by Cook and Weisberg (2004). A fundamental difference is that here s is selected at random and only β is estimated, while in Cook and Weisberg (2004) both β and s are estimated. See Skubalska-Rafajłowicz and Rafajłowicz (2012) for further discussion and references.
- 2.
Strictly speaking, \(\mathbf{M}_{\mathbb{S}}\) is the Fisher information matrix for fixed \(\mathbb{S}\) subject to the hypothesis that \(b_{j}^{0}=0\), j=1,2,…,K.
- 3.
One can consider the Plackett and Burman designs for estimating main effects, but in our opinion they share the same drawback.
References
Agboto, V., Li, W., Nachtsheim, C.: Screening designs for model discrimination. J. Stat. Plan. Inference 140, 766–780 (2010)
Atkinson, A.C., Donev, A.N., Tobias, R.D.: Optimum Experimental Designs, with SAS. Oxford University Press, Oxford (2007)
Cook, R.D., Weisberg, S.: Partial one-dimensional regression models. Am. Stat. 58, 110–116 (2004)
Dean, A.M., Lewis, S.M.: Screening methods for experimentation. In: Industry, Drug Discovery, and Genetics. Springer, New York (2006)
Dette, H., Kwiecien, R.: Finite sample performance of sequential designs for model identification. J. Stat. Comput. Simul. 75, 477–495 (2005)
Donoho, D., Jin, J.: Higher criticism for detecting sparse heterogeneous mixtures. Ann. Stat. 32, 962–994 (2004)
Du Mouchel, W., Jones, B.: A simple Bayesian modification of D-optimal designs to reduce dependence on an assumed model. Technometrics 36, 37–47 (1994)
Konishi, S., Kitagawa, G.: Information Criteria and Statistical Modeling. Springer, New York (2008)
Lewis, S.M., Dean, A.M.: Detection of interactions in experiments on large numbers of factors (with discussion). J. R. Stat. Soc. B 63, 633–672 (2001)
Lopez-Fidalgo, J., Dette, H., Zhu, W.: Optimal designs for discriminating between heteroscedastic models. In: Proc. of the 5th St Petersburg Workshop on Simulation, pp. 429–436. NII Chemistry Univ. Publ., St. Petersburg (2005)
Morris, M.D.: An overview of group factor screening. In: Dean, A.M., Lewis, S.M. (eds.) Screening Methods for Experimentation in Industry, Drug Discovery, and Genetics, pp. 191–207. Springer, New York (2006)
Niederreiter, H.: Random Number Generation and Quasi-Monte Carlo Methods. SIAM, Philadelphia (1992)
Rafajłowicz, E., Schwabe, R.: Halton and Hammersley sequences in multivariate nonparametric regression. Stat. Probab. Lett. 76, 803–812 (2006)
Satterthwaite, F.E.: Random balance experimentation. Technometrics 1, 111–137 (1959)
Skubalska-Rafajłowicz, E.: Experiments with Neural Network for Modeling of Nonlinear Dynamical Systems: Design Problems. Lecture presented at the Isaac Newton Institute, Cambridge (2011). Available. http://www.newton.ac.uk/programmes/DAE/seminars/072010301.html
Skubalska-Rafajłowicz, E., Rafajłowicz, E.: Appendix to Random projections in regression model selection and corresponding experiment design problems. Technical Report (2012). Available http://swing.ict.pwr.wroc.pl/~ewa/raf/index.html
Tibshirani, R.: Regression analysis and selection via the Lasso. J. R. Stat. Soc. B 58, 267–288 (1996)
Titterington, D.M.: Optimal design in flexible models, including feed-forward networks and nonparametric regression (with discussion). In: Atkinson, A.C., Bogacka, B., Zhigljavsky, A. (eds.) Optimum Design 2000, pp. 261–273. Kluwer, Dordrecht (2000)
Tsai, P.-W., Gilmour, S.G., Mead, R.: Projective three-level main effects designs robust to model uncertainty. Biometrika 87, 467–475 (2000)
Uciński, D., Bogacka, B.: T-optimum designs for discrimination between two multivariate dynamic models. J. R. Stat. Soc. B 67, 3–18 (2005)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer International Publishing Switzerland
About this paper
Cite this paper
Skubalska-Rafajłowicz, E., Rafajłowicz, E. (2013). Random Projections in Model Selection and Related Experimental Design Problems. In: Ucinski, D., Atkinson, A., Patan, M. (eds) mODa 10 – Advances in Model-Oriented Design and Analysis. Contributions to Statistics. Springer, Heidelberg. https://doi.org/10.1007/978-3-319-00218-7_27
Download citation
DOI: https://doi.org/10.1007/978-3-319-00218-7_27
Publisher Name: Springer, Heidelberg
Print ISBN: 978-3-319-00217-0
Online ISBN: 978-3-319-00218-7
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)