Efficient and Accurate Multiple-Phenotypes Regression Method for High Dimensional Data Considering Population Structure
- First Online:
- Cite this paper as:
- Joo J.W.J. et al. (2015) Efficient and Accurate Multiple-Phenotypes Regression Method for High Dimensional Data Considering Population Structure. In: Przytycka T. (eds) Research in Computational Molecular Biology. RECOMB 2015. Lecture Notes in Computer Science, vol 9029. Springer, Cham
A typical GWAS tests correlation between a single phenotype and each genotype one at a time. However, it is often very useful to analyze many phenotypes simultaneously. For example, this may increase the power to detect variants by capturing unmeasured aspects of complex biological networks that a single phenotype might miss. There are several multivariate approaches that try to detect variants related to many phenotypes, but none of them consider population structure and each may result in a significant number of false positive identifications. Here, we introduce a new methodology, referred to as GAMMA, that could both simultaneously analyze many phenotypes as well as correct for population structure. In a simulated study, GAMMA accurately identifies true genetic effects without false positive identifications, while other methods either fail to detect true effects or result in many false positive identifications. We further apply our method to genetic studies of yeast and gut microbiome from mouse and show that GAMMA identifies several variants that are likely to have a true biological mechanism.