Fast empirical Bayesian LASSO for multiple quantitative trait locus mapping
The Bayesian shrinkage technique has been applied to multiple quantitative trait loci (QTLs) mapping to estimate the genetic effects of QTLs on quantitative traits from a very large set of possible effects including the main and epistatic effects of QTLs. Although the recently developed empirical Bayes (EB) method significantly reduced computation comparing with the fully Bayesian approach, its speed and accuracy are limited by the fact that numerical optimization is required to estimate the variance components in the QTL model.
We developed a fast empirical Bayesian LASSO (EBLASSO) method for multiple QTL mapping. The fact that the EBLASSO can estimate the variance components in a closed form along with other algorithmic techniques render the EBLASSO method more efficient and accurate. Comparing with the EB method, our simulation study demonstrated that the EBLASSO method could substantially improve the computational speed and detect more QTL effects without increasing the false positive rate. Particularly, the EBLASSO algorithm running on a personal computer could easily handle a linear QTL model with more than 100,000 variables in our simulation study. Real data analysis also demonstrated that the EBLASSO method detected more reasonable effects than the EB method. Comparing with the LASSO, our simulation showed that the current version of the EBLASSO implemented in Matlab had similar speed as the LASSO implemented in Fortran, and that the EBLASSO detected the same number of true effects as the LASSO but a much smaller number of false positive effects.
The EBLASSO method can handle a large number of effects possibly including both the main and epistatic QTL effects, environmental effects and the effects of gene-environment interactions. It will be a very useful tool for multiple QTL mapping.
- Carlborg, Ö, Haley, CS (2004) Epistasis: too often neglected in complex trait studies?. Nat Rev Genet 5: pp. 618-625 CrossRef
- Schwarz, G (1978) Estimating the dimension of a model. Ann Stat 6: pp. 461-464 CrossRef
- Hoerl, AE, Kennard, RW (1970) Ridge regression: Biased estimation for nonorthogonal problems. Technometrics 12: pp. 55-67 CrossRef
- Tibshirani, R (1996) Regression Shrinkage and Selection Via the Lasso. J R Statistical Soc Ser B 58: pp. 267-288
- O'Hara, RB, Sillanpää, MJ (2009) A review of Bayesain variable selection methods: what, how and which. Bayesian Analysis 4: pp. 85-118 CrossRef
- Xu, S (2003) Estimating polygenic effects using markers of the entire genome. Genetics 163: pp. 789-801
- Wang, H, Zhang, YM, Li, X, Masinde, GL, Mohan, S, Baylink, DJ, Xu, S (2005) Bayesian shrinkage estimation of quantitative trait loci parameters. Genetics 170: pp. 465-480 CrossRef
- Hoti, F, Sillanpää, MJ (2006) Bayesian mapping of genotype × expression interactions in quantitative and qualitative traits. Heredity 97: pp. 4-18 CrossRef
- Huang, H, Eversley, CD, Threadgill, DW, Zou, F (2007) Bayesian Multiple Quantitative Trait Loci Mapping for Complex Traits Using Markers of the Entire Genome. Genetics 176: pp. 2529-2540 CrossRef
- Yi, N, Xu, S (2008) Bayesian LASSO for quantitative trait loci mapping. Genetics 179: pp. 1045-1055 CrossRef
- Robert, CR, Casella, G (2004) Monte Carlo Statistical Methods.
- Xu, S (2007) An Empirical Bayes Method for Estimating Epistatic Effects of Quantitative Trait Loci. Biometrics 63: pp. 513-521 CrossRef
- Xu, S (2010) An expectation maximization algorithm for the Lasso estimation of quantitative trait locus effects. Heredity 2010: pp. 1-12
- Nelder, JA, Mead, R (1965) A Simplex Method for Function Minimization. Comput J 7: pp. 308-313
- Tipping, ME (2001) Sparse Bayesian learning and the relevance vector machine. J Mach Learn Res 1: pp. 211-244 CrossRef
- Tipping, ME, Faul, AC (2003) Fast Marginal Likelihood Maximisation for Sparse Bayesian Models. Proc 9th International Workshop on Artificial Intelligence and Statistics. Key West, FL
- Park, T, Casella, G (2008) The Bayesian Lasso. J Amer Stat Assoc 103: pp. 681-686 CrossRef
- Yi, N, Banerjee, S (2009) Hierachical Generalized Linear Models for Multiple Quantitative Trait Locus Mapping. Genetics 181: pp. 1101-1133 CrossRef
- Friedman, J, Hastie, T, Tibshirani, R (2010) Regularization Paths for Generalized Linear Models via Coordinate Descent. J Stat Softw 33: pp. 1-22
- Luo, ZW, Potokina, E, Druka, A, Wise, R, Waugh, R, Kearsey, MJ (2007) SEP genotyping from Affymetrix arrays is robust but largely detects cis-acting expression regulators. Genetics 176: pp. 789-800 CrossRef
- Zhang, YM, Xu, S (2005) A penalized maximum likelihood method for estimating epistatic effects of QTL. Heredity 95: pp. 96-104 CrossRef
- George, EI, McCulloch, RE (1993) Variable Selection Via Gibbs Sampling. Journal of the American Statistical Association 88: pp. 881-889 CrossRef
- Yi, N, George, V, Allison, DB (2003) Stochastic Search Variable Selection for Identifying Multiple Quantitative Trait Loci. Genetics 164: pp. 1129-1138
- Fast empirical Bayesian LASSO for multiple quantitative trait locus mapping
- Open Access
- Available under Open Access This content is freely available online to anyone, anywhere at any time.
- Online Date
- May 2011
- Online ISSN
- BioMed Central
- Additional Links
- Industry Sectors