Abstract
An extension of interval mapping is presented that incorporates all intervals on the linkage map simultaneously. The approach uses a working model in which the sizes of putative QTL for all intervals across the genome are random effects. An outlier detection method is used to screen for possible QTL. Selected QTL are subsequently fitted as fixed effects. This screening and selection approach is repeated until the variance component for QTL sizes is not statistically significant. A comprehensive simulation study is conducted in which map uncertainty is included. The proposed method is shown to be superior to composite interval mapping in terms of power of detection of QTL. There is an increase in the rate of false positive QTL detected when using the new approach, but this rate decreases as the population size increases. The new approach is much simpler computationally. The analysis of flour milling yield in a doubled haploid population illustrates the improved power of detection of QTL using the approach, and also shows how vital it is to allow for sources of non-genetic variation in the analysis.
Similar content being viewed by others
References
Broman KW, Speed TP (2002) A model selection approach for the identification of quantitative trait loci in experimental crosses. J R Stat Soc B64:641–656
Broman KW, Wu H, with ideas from Gary Churchill, Sen S, & contributions from Brian Yandell (2005) qtl: Tools for analyzing QTL experiments. R package version 1.01-9
Butler DG, Cullis BR, Gilmour AR, Gogel BJ (2007) ASReml-R, reference manual. Technical report, Queensland Department of Primary Industries
Cook RD, Holschuh N, Weisberg S (1982) A note on an alternative outlier model. J R Stat Soc B44:370–376
Crainiceanu C, Ruppert D (2004) Likelihood ratio tests in linear mixed models with one variance component. J R Stat Soc B66:165–185
Diggle PJ (1990) Time series analysis: a biostatistical approach. Oxford University Press, Oxford
Eckermann PJ, Verbyla AP, Cullis BR, Thompson R (2001) The analysis of quantitative traits in wheat mapping populations. Aust J Agric Res 52:1195–1206
Foster SD, Verbyla AP, Pitchford WS (2007) Incorporating LASSO effects the linear mixed model for the detection of QTL. J Agric Biol Environ Stat 12:300–314
Gianola D, Perez-Enciso M, Toro MA (2003) On marker-assisted prediction of genetic value: beyond the ridge. Genetics 163:347–365
Gilmour AR, Gogel BJ, Cullis BR, Thompson R (2007) ASReml Users Guide. VSN International Ltd., Release 2.0
Gogel BJ (1997) Spatial analysis of multi-environment variety trials. PhD thesis, Department of Statistics, The University of Adelaide
Gogel BJ, Welham SJ, Verbyla AP, Cullis BR (2001) Outlier detection in linear mixed effects: summary of research. Technical Report P106, The University of Adelaide, Biometrics
Haley CS, Knott SA (1992) A simple regression method for mapping quantitative trait loci in line crosses using flanking markers. Heredity 69:315–324
Henderson CR (1950) Estimation of genetic parameters (abstract). Ann Math Stat 21:309–310
Jansen RC (1994) Controlling the type I and type II errors in mapping quantitative trait loci. Genetics 138:871–881
Kiiveri HT (2004) A Bayesian approach to variable selection when the number of variables is very large. In: Science and statistics: a Festchrift for Terry speed. Lecture Notes. Institute of Mathematical Statistics, pp 127–144
Lander ES, Botstein D (1989) Mapping Mendelian factors underlying quantitative traits using RFLP linkage maps. Genetics 121:185–199
Lander ES, Green P (1987) Construction of multilocus genetic linkage maps in humans. Proc Natl Acad Sci USA 84:2363–2367
Lehmensiek A, Eckermann PJ, Verbyla AP, Appels R, Sutherland MW, Daggard GE (2005) Curation of wheat maps to improve map accuracy and QTL detection. Aust J Agric Res 56:1347–1354
Lehmensiek A, Eckermann PJ, Verbyla AP, Appels R, Sutherland MW, Martin D, Daggard GE (2006) Flour yield QTLs in three Australian doubled haploid wheat populations. Aust J Agric Res 57:1115–1122
Martinez O, Curnow RN (1992) Estimating the locations and the sizes of the effects of quantitative trait loci using flanking markers. Theor Appl Genetics 85:480–488
Martinez O, Curnow RN (1994) Missing markers when estimating quantitative trait loci using regression mapping. Heredity 73:198–206
Moreau L, Monod H, Charcosset A, Gallais A (1999) Marker-assisted selection with spatial analysis of unreplicated field trials. Theor Appl Genetics 98:234–242
Patterson HD, Thompson R (1971) Recovery of interblock information when block sizes are unequal. Biometrika 58:545–554
Piepho H-P (2000) A mixed-model approach to mapping quantitative trait loci in barley on the basis of multiple environment data. Genetics 156:2043–2050
R Development Core Team (2006) R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. ISBN 3-900051-07-0
Robinson GK (1991) That BLUP is a good thing: the estimation of random effects. Stat Sci 6:15–51
Smith AB, Cullis BR, Appels R, Campbell AW, Cornish GB, Martin D, Allen HM (2001) The statistical analysis of quality traits in plant improvement programs with application to the mapping of milling yield in wheat. Aust J Agric Res 52:1207–1219
Smith AB, Lim P, Cullis BR (2006) The design and analysis of multi-phase quality trait experiments. J Agric Sci (Cambridge) 144:393–409
Stram DO, Lee JW (1994) Variance components testing in the longitudinal mixed effects model. Biometrics 50:1171–1177
Thompson R (1985) A note on restricted maximum likelihood estimation with an alternative outlier model. J R Stat Soc B47:53–55
Thompson R, Cullis B, Smith A, Gilmour A (2003) A sparse implementation of the average information algorithm for factor analytic and reduced rank variance models. Aust N Z J Stat 45:445–459
Tibshirani R (1996) Regression shrinkage and selection via the lasso. J R Stat Soc B58:267–288
Trow A (1913) Forms of reproduction: primary and secondary. J Genetics 2:313–324
Verbyla AP, Eckermann PJ, Thompson R, Cullis BR (2003) The analysis of quantitative trait loci in multi-environment trials using a multiplicative mixed model. Aust J Agric Res 54:1395–1408
Welham SJ (2006) Smoothing spline methods within the mixed model framework. PhD thesis, London School of Hygiene and Tropical Medicine, The University of London
Whittaker JC, Thompson R, Visscher PM (1996) On the mapping of QTL by regression of phenotype on marker-type. Heredity 77:23–32
Whittaker JC, Thompson R, Denham MC (2000) Marker-assisted selection using ridge regression. Genet Res Camb 75:249–252
Yi N, George V, Allison DB (2003) Stochastic search variable selection for identifying multiple quantitative trait loci. Genetics 164:1129–1138
Zeng Z-B (1994) Precision mapping of quantitative trait loci. Genetics 136:1457–1468
Acknowledgments
We gratefully acknowledge the Grains Research and Development Corporation (GRDC) for support through Key Programme 3 of their National Statistics Project. We thank the Australian Winter Cereals Molecular Marker Program and it’s predecessor the National Wheat Molecular Marker Program, both funded by GRDC, for the flour milling yield data analysed in this paper. We are grateful to Simon Diffey, New South Wales Department of Primary Industries, for his excellent implementation of the approach using R and the qtl package. Lastly, we thank the Associate Editor and the referees whose comments have led to substantial improvements and clarifications being incorporated into the paper.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by J.-L. Jannink.
Appendix
Appendix
The expectation result (12) relies on Haldane’s mapping function (4). In terms of recombination frequencies,
Thus for example, on substituting for θ k;j in (7)
and hence assuming d k;j ∼ U[0, d k;j,j+1] we find (using x as the dummy variable for integration of the distance)
as given in (12). The result for λ k;j,j follows by symmetry or by repeating the integration process explicitly using (6).
Rights and permissions
About this article
Cite this article
Verbyla, A.P., Cullis, B.R. & Thompson, R. The analysis of QTL by simultaneous use of the full linkage map. Theor Appl Genet 116, 95–111 (2007). https://doi.org/10.1007/s00122-007-0650-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00122-007-0650-x