Abstract
We consider the likelihood ratio test (LRT) process related to the test of the absence of QTL (a QTL denotes a gene with quantitative effect on a trait) on the interval [0, T] representing a chromosome. The observation is the trait and the composition of the genome at some locations called “markers”. We focus on the interference phenomenon, i.e. a recombination event inhibits the formation of another recombination event nearby. We give the asymptotic distribution of the LRT process under the null hypothesis that there is no QTL on [0, T] and under local alternatives with a QTL at \(t^{\star}\) on [0, T]. We show that the LRT process is asymptotically the square of a “linear interpolated and normalized process”. We prove that under the null hypothesis, the distribution of the maximum of the LRT process is the same for a model with or without interference. However, the powers of detection are totally different between the two models.
Similar content being viewed by others
References
Azaïs JM, Cierco-Ayrolles C (2002) An asymptotic test for quantitative gene detection. Ann Inst Henri Poincaré (B) 38(6):1087–1092
Azaïs JM, Delmas C, Rabier CE (2012) Likelihood ratio test process for quantitative trait locus detection. Statistics. doi:10.1080/02331888.2012.760093
Azaïs JM, Gassiat E, Mercadier C (2006) Asymptotic distribution and local power of the likelihood ratio test for mixtures. Bernoulli 12(5):775–799
Azaïs JM, Gassiat E, Mercadier C (2009) The likelihood ratio test for general mixture models with possibly structural parameter. ESAIM 13:301–327
Azaïs JM, Wschebor M (2009) Level sets and extrema of random processes and fields. Wiley, New York
Chang MN, Wu R, Wu SS, Casella G (2009) Score statistics for mapping quantitative trait loci. Stat Appl Genet Mol Biol 8(1):16
Cierco C (1998) Asymptotic distribution of the maximum likelihood ratio test for gene detection. Statistics 31:261–285
Davies RB (1977) Hypothesis testing when a nuisance parameter is present only under the alternative. Biometrika 64:247–254
Davies RB (1987) Hypothesis testing when a nuisance parameter is present only under the alternative. Biometrika 74:33–43
Foss E, Lande R, Stahl FW, Steinberg CM (1993) Chiasma interference as a function of genetic distance. Genetics 133:681–691
Gassiat E (2002) Likelihood ratio inequalities with applications to various mixtures. Ann Inst Henri Poincaré (B) 6:897–906
Genz A (1992) Numerical computation of multivariate normal probabilities. J Comput Graph Stat 1:141–149
Haldane JBS (1919) The combination of linkage values and the calculation of distance between the loci of linked factors. J Genet 8:299–309
Hayes B (2007) A short-course organized by the department of Animal Science, Iowa State University
Hillers KJ, Villeneuve AM (2003) Chromosome-wide control of meiotic crossing over in C. elegans. Curr Biol 13:1641–1647
Huang N, Parco A, Mew T, Magpantay G, McCouch S, Guiderdoni E, Xu J, Subudhi P, Angeles ER, Khush GS (1997) RFLP mapping of isozimes, RAPD and QTLs for grain shape, brown planthopper resistance in a doubled haploid rice population. Mol Breed 3:105–113
Karlin S, Liberman U (1979) A natural class of multilocus recombination processes and related measures of crossover interference. Adv Appl Prob 11:479–501
King JS, Mortimer RK (1990) A polymerization model of chiasma interference and corresponding computer simulation. Genetics 126:1127–1138
Lander ES, Botstein D (1989) Mapping mendelian factors underlying quantitative traits using RFLP linkage maps. Genetics 138:235–240
Lobo I, Shaw K (2008) Thomas Hunt Morgan, genetic recombination, and gene mapping. Nat Educ 1:1
Martini E, Diaz LD, Hunter N, Keeney S (2006) Crossover homeostasis in yeast meiosis. Cell 126:285–295
McPeek MS, Speed TP (1995) Modeling interference in genetic recombination. Genetics 139:1031–1044
Muller HJ (1916) The mechanism of crossing-over. Am Nat 50:193–221 (pp 284–305, 350–366, 421–434)
Rebaï A, Goffinet B, Mangin B (1994) Approximate thresholds of interval mapping tests for QTL detection. Genetics 138:235–240
Rebaï A, Goffinet B, Mangin B (1995) Comparing power of different methods for QTL detection. Biometrics 51:87–99
Siegmund D, Yakir B (2007) The statistics of gene mapping. Springer, New York
Stam P (1979) Interference in genetic crossing over and chromosome mapping. Genetics 92:573-594
Sturtevant AH (1915) The behavior of the chromosomes as studied through linkage. Z Indukt Abstammungs Vererbungsl 13:234–287
Van der Vaart AW (1998) Asymptotic statistics. Cambridge series in statistical and probabilistic mathematics
Youlds J, Boulton S (2011) The choice in meiosis-defining the factors that influence crossover or non-crossover formation. J Cell Sci 124:501–513
Wu R, Ma CX, Casella G (2007) Statistical genetics of quantitative traits. Springer, New York
Acknowledgments
I thank Jean-Marc Azaïs, Céline Delmas, Jean-Michel Elsen and Brigitte Mangin for fruitful discussions. I also thank the associate editor and the reviewers who helped me to improve the paper. This work has been supported by the Animal Genetic Department of the French National Institute for Agricultural Research, SABRE, and the National Center for Scientific Research (CNRS).
Author information
Authors and Affiliations
Corresponding author
Appendix: Proofs of theoretical results
Appendix: Proofs of theoretical results
1.1 Proof of Theorem 1
As mentioned before, we consider values of t and \(t^{\star}, \) distinct of marker locations and the result can be prolonged by continuity on markers.
1.1.1 Study of the score process under the null hypothesis
The study is based on the key lemma:
Lemma 2
with \( \alpha(t)= \frac{t_{2}-t}{t_{2}-t_{1}}\) and \(\beta(t)= \frac{t-t_{1}}{t_{2}-t_{1}}. \)
To prove this lemma, use formula (5) and check that both coincide whatever the value of X(t 1), X(t 2) is. Now using formula (7), we have
this proves the interpolation. On the other hand
and a direct application of central limit theorem implies that these two variables have a limit distribution which is Gaussian centered distribution with variance
This proves the expression of the covariance. The weak convergence of the score process, S n (.), is then a direct consequence of (10), the convergence of (S n (t 1),S n (t 2)) and the continuous mapping theorem.
1.1.2 Study of the score process under the local alternative
Under the alternative
The second term has the same distribution as under the null hypothesis and the first one gives the expectation. We have
According to Lemma 2, we have:
So, we need now to calculate \({\mathbb{E}\left\{X(t_{1})U(t^*)\right\}}\) and \({\mathbb{E}\left\{U(t^*)X(t_{2})\right\}. }\) We have
As a consequence,
As a result,
In the same way, we obtain
This gives the result.
1.1.3 About the LRT process
Since the model with t fixed is regular, it is easy to prove that for fixed t
under the null hypothesis.
Let us consider a local alternative defined by t* and \(q = a/\sqrt{n}. \) The model with t* fixed is differentiable in quadratic mean, this implies that the alternative defines a contiguous sequence of alternatives. By Le Cam’s first Lemma, relation (11) remains true under the alternative. This gives the result for the convergence of finite-dimensional distribution. Concerning the study of the supremum of the LRT process, the proof is exactly the same as in Azaïs et al. (2012) which is based on results of Azaïs et al. (2006), (2009) and Gassiat (2002). □
1.2 Proof of Theorem 2
We recall that we consider values t or \(t^{\star}\) of the parameters that are distinct of the markers positions, and the result will be prolonged by continuity at the markers positions.
The proof of the theorem is the same as the proof of Theorem 1 as soon as we can limit our attention to the interval (t ℓ, t r) when considering a unique instant t. So, under H 0, the result is straightforward. However, under the local alternative, the proof is more complicated than the proof of Theorem 1. Indeed, the location \(t^{\star}\) of the QTL and the location t, can belong to a different marker interval.
According to the proof of Theorem 1, under the alternative
As previously, the second term has the same distribution as under the null hypothesis and the first one gives the expectation. We have
We notice that we have \({u(t^{\star})=\mathbb{E}\left\{U(t^{\star})\mid X(t^{\star \ell})X(t^{\star r})\right\}. }\) Besides, u(t) is a function of X(t ℓ) and X(t r). As a consequence, by the properties of conditional expectancy, we have
According to Lemma 2,
In order to obtain \({\mathbb{E}\left\{ u(t^*) u(t^{\ell}) \right\}, }\) we just have to use the dominated convergence theorem. As a result
To conclude the proof, we just have to notice that
In order to obtain \({\mathbb{E}\left\{ u(t^*) u(t^{r})\right\}, }\) we just have to replace t ℓ by t r. This gives the result. □
Rights and permissions
About this article
Cite this article
Rabier, CE. On quantitative trait locus mapping with an interference phenomenon. TEST 23, 311–329 (2014). https://doi.org/10.1007/s11749-013-0349-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11749-013-0349-z