Abstract
Variable selection is a common problem in regression modelling with a myriad of applications. This paper proposes a new feature ranking algorithm (DEPTH) for variable selection in parametric regression based on permutation statistics and stability selection. DEPTH is: (i) applicable to any parametric regression task, (ii) designed to be run in a parallel environment, and (iii) adapts naturally to the correlation structure of the predictors. DEPTH was applied to a genome-wide association study of breast cancer and found evidence that there are variants in a pathway of candidate genes that are associated with a common subtype of breast cancer, a finding which would not have been discovered by conventional analyses.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Manolio, T.A.: Genomewide association studies and assessment of the risk of disease. The New England Journal of Medicine 363(2), 166–176 (2010)
Dudoit, S., Shaffer, J.P., Boldrick, J.C.: Multiple hypothesis testing in microarray experiments. Statistical Science 18(1), 71–103 (2003)
Miller, A.J.: Selection of subsets of regression variables. Journal of the Royal Statistical Society (Series A) 147(3), 389–425 (1984)
Dite, G., Jenkins, M., Southey, M., Hocking, J., Giles, G., McCredie, M., Venter, D., Hopper, J.: Familial risks, early-onset breast cancer, and BRCA1 and BRCA2 germline mutations. J. Natl. Cancer Inst. 95, 448–457 (2003)
Odefrey, F., Gurrin, L., Byrnes, G., Apicella, C., Dite, G.: Common genetic variants associated with breast cancer and mammographic density measures that predict disease. Cancer Research 70, 1449–1458 (2010)
Weale, M.: Quality control for genome-wide association studies. Methods Mol. Biol. 628, 341–372 (2010)
Consortium, I.H.: A second generation human haplotype map of over 3.1 million snps. Nature 449, 851–861 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer International Publishing Switzerland
About this paper
Cite this paper
Makalic, E., Schmidt, D.F., Hopper, J.L. (2013). DEPTH: A Novel Algorithm for Feature Ranking with Application to Genome-Wide Association Studies. In: Cranefield, S., Nayak, A. (eds) AI 2013: Advances in Artificial Intelligence. AI 2013. Lecture Notes in Computer Science(), vol 8272. Springer, Cham. https://doi.org/10.1007/978-3-319-03680-9_9
Download citation
DOI: https://doi.org/10.1007/978-3-319-03680-9_9
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-03679-3
Online ISBN: 978-3-319-03680-9
eBook Packages: Computer ScienceComputer Science (R0)