Abstract
In many medical and biological investigations, including genetics, it is typical to handle high dimensional data which can be viewed as a set of values of some factors and a binary response variable. For instance, the response variable can describe the state of a patient health and one often assumes that it depends only on some part of factors. An important problem is to determine collections of significant factors. In this regard we turn to the MDR-method introduced by M. Ritchie and coauthors. Our recent paper provided the necessary and sufficient conditions for strong consistency of estimates of the prediction error employing the K-fold cross-validation and an arbitrary penalty function. Here we introduce the regularized versions of the mentioned estimates and prove for them the multidimensional CLT. Statistical variants of the CLT involving self-normalization are discussed as well.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Arlot, S., Celisse, A.: A survey of cross-validation procedures for model selection. Stat. Surv. 4, 40–79 (2010)
Bradley-Smith, G., Hope, S., Firth, H.V., Hurst, J.A.: Oxford Handbook of Genetics. Oxford University Press, New York (2009)
Bulinski, A.V.: On foundation of the dimesionality reduction method for explanatory variables. Zapiski Nauchnyh Seminarov POMI. 408, 84–101 (2012) (In Russian; English translation in Journal of Mathematical Sciences 199(2), 113–122 (2014))
Bulinski, A., Butkovsky, O., Sadovnichy, V., Shashkin, A., Yaskov, P., Balatskiy, A., Samokhodskaya, L., Tkachuk, V.: Statistical methods of SNP data analysis and applications. Open J. Stat. 2(1), 73–87 (2012)
Edwards, T.L., Torstenson, E.S., Martin, E.M., Ritchie, M.D.: A cross-validation procedure for general pedigrees and matched odds ratio fitness metric implemented for the multifactor dimensionality reduction pedigree disequilibrium test MDR-PDT and cross-validation: power studies. Genet. Epidemiol. 34(2), 194–199 (2010)
Golland, P., Liang, F., Mukherjee, S., Panchenko, D.: Permutation tests for classification. In: COLT’05 Proceedings of the 18th Annual Conference on Learning Theory, pp. 501–515. Springer, Berlin (2005)
Gui, J., Andrew, A.S., Andrews, P., Nelson, H.M., Kelsey, K.T., Karagas, M.R., Moore, J.H.: A robust multifactor dimensionality reduction method for detecting gene-gene interactions with application to the genetic analysis of bladder cancer susceptibility. Ann. Hum. Genet. 75(1), 20–28 (2011)
He, H., Oetting, W.S., Brott, M.J., Basu, S.: Power of multifactor dimensionality reduction and penalized logistic regression for detecting gene-gene interaction in a case-control study. BMC Med. Genet. 10, 127 (2009)
Mei, H., Cuccaro, M.L., Martin, E.R.: Multifactor dimensionality reduction-phenomics: a novel method to capture genetic heterogeneity with use of phenotypic variables. Am. J. Hum. Genet. 81(6), 1251–1261 (2007)
Namkung, J., Elston, R.C., Yang, J.M., Park, T.: Identification of gene-gene interactions in the presence of missing data using the multifactor dimensionality reduction method. Genet. Epidemiol. 33(7), 646–656 (2009)
Niu, A., Zhang, S., Sha, Q.: A novel method to detect gene-gene interactions in structured populations: MDR-SP. Ann. Hum. Genet. 75(6), 742–754 (2011)
Oh, S., Lee, J., Kwon, M-S., Weir, B., Ha, K., Park, T.: A novel method to identify high order gene-gene interactions in genome-wide association studies: gene-based MDR. BMC Bioinformatics 13(Suppl 9), S5 (2012)
Ritchie, M.D., Hahn, L.W., Roodi, N., Bailey, R.L., Dupont, W.D., Parl, F.F., Moore, J.H.: Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. Am. J. Hum. Genet. 69(1), 138–147 (2001)
Ritchie, M.D., Motsinger, A.A.: Multifactor dimensionality reduction for detecting gene-gene and gene-environment interactions in pharmacogenomics studies. Pharmacogenomics 6(8), 823–834 (2005)
Schwender, H., Ruczinski, I., Ickstadt K.: Testing SNPs and sets of SNPs for importance in association studies. Biostatistics 12(1), 18–32 (2011)
Taylor, R.L., Hu, T.-C.: Strong laws of large numbers for arrays of row-wise independent random elements. Int. J. Math. Math. Sci. 10(4), 805–814 (1987)
Velez, D.R., White, B.C., Motsinger, A.A., Bush, W.S., Ritchie, M.D., Williams, S.M., Moore, J.H.: A balanced accuracy function for epistasis modeling in imbalanced datasets using multifactor dimensionality reduction. Genet. Epidemiol. 31(4), 306–315 (2007)
Winham, S.J., Slater, A.J., Motsinger-Reif, A.A.: A comparison of internal validation techniques for multifactor dimensionality reduction. BMC Bioinformatics 11, 394 (2010)
Acknowledgements
The author is grateful to Organizing Committee for invitation to participate in the Fields Institute International Symposium on Asymptotic Methods in Stochastics, in Honour of Miklós Csörgő’s Work. Special thanks are due to Professor Csörgő and his colleagues for hospitality.
The work is partially supported by RFBR grant 13-01-00612.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer Science+Business Media New York
About this chapter
Cite this chapter
Bulinski, A. (2015). Central Limit Theorem Related to MDR-Method. In: Dawson, D., Kulik, R., Ould Haye, M., Szyszkowicz, B., Zhao, Y. (eds) Asymptotic Laws and Methods in Stochastics. Fields Institute Communications, vol 76. Springer, New York, NY. https://doi.org/10.1007/978-1-4939-3076-0_7
Download citation
DOI: https://doi.org/10.1007/978-1-4939-3076-0_7
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4939-3075-3
Online ISBN: 978-1-4939-3076-0
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)