On quantitative trait locus mapping with an interference phenomenon

Rabier, Charles-Elie

doi:10.1007/s11749-013-0349-z

On quantitative trait locus mapping with an interference phenomenon

Original Paper
Published: 12 December 2013

Volume 23, pages 311–329, (2014)
Cite this article

TEST Aims and scope Submit manuscript

Charles-Elie Rabier^1,2,3

99 Accesses
3 Citations
Explore all metrics

Abstract

We consider the likelihood ratio test (LRT) process related to the test of the absence of QTL (a QTL denotes a gene with quantitative effect on a trait) on the interval [0, T] representing a chromosome. The observation is the trait and the composition of the genome at some locations called “markers”. We focus on the interference phenomenon, i.e. a recombination event inhibits the formation of another recombination event nearby. We give the asymptotic distribution of the LRT process under the null hypothesis that there is no QTL on [0, T] and under local alternatives with a QTL at $t^{\star}$ on [0, T]. We show that the LRT process is asymptotically the square of a “linear interpolated and normalized process”. We prove that under the null hypothesis, the distribution of the maximum of the LRT process is the same for a model with or without interference. However, the powers of detection are totally different between the two models.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Chi-square processes for gene mapping in a population with family structure

Article 31 October 2016

Novel Algorithm for Multiple Quantitative Trait Loci Mapping by Using Bayesian Variable Selection Regression

Statistical Analysis of Genomic Data

References

Azaïs JM, Cierco-Ayrolles C (2002) An asymptotic test for quantitative gene detection. Ann Inst Henri Poincaré (B) 38(6):1087–1092
Article MATH Google Scholar
Azaïs JM, Delmas C, Rabier CE (2012) Likelihood ratio test process for quantitative trait locus detection. Statistics. doi:10.1080/02331888.2012.760093
Azaïs JM, Gassiat E, Mercadier C (2006) Asymptotic distribution and local power of the likelihood ratio test for mixtures. Bernoulli 12(5):775–799
Article MATH MathSciNet Google Scholar
Azaïs JM, Gassiat E, Mercadier C (2009) The likelihood ratio test for general mixture models with possibly structural parameter. ESAIM 13:301–327
Article MATH Google Scholar
Azaïs JM, Wschebor M (2009) Level sets and extrema of random processes and fields. Wiley, New York
Book MATH Google Scholar
Chang MN, Wu R, Wu SS, Casella G (2009) Score statistics for mapping quantitative trait loci. Stat Appl Genet Mol Biol 8(1):16
MathSciNet Google Scholar
Cierco C (1998) Asymptotic distribution of the maximum likelihood ratio test for gene detection. Statistics 31:261–285
Article MATH MathSciNet Google Scholar
Davies RB (1977) Hypothesis testing when a nuisance parameter is present only under the alternative. Biometrika 64:247–254
Article MATH MathSciNet Google Scholar
Davies RB (1987) Hypothesis testing when a nuisance parameter is present only under the alternative. Biometrika 74:33–43
MATH MathSciNet Google Scholar
Foss E, Lande R, Stahl FW, Steinberg CM (1993) Chiasma interference as a function of genetic distance. Genetics 133:681–691
Google Scholar
Gassiat E (2002) Likelihood ratio inequalities with applications to various mixtures. Ann Inst Henri Poincaré (B) 6:897–906
Article Google Scholar
Genz A (1992) Numerical computation of multivariate normal probabilities. J Comput Graph Stat 1:141–149
Google Scholar
Haldane JBS (1919) The combination of linkage values and the calculation of distance between the loci of linked factors. J Genet 8:299–309
Article Google Scholar
Hayes B (2007) A short-course organized by the department of Animal Science, Iowa State University
Hillers KJ, Villeneuve AM (2003) Chromosome-wide control of meiotic crossing over in C. elegans. Curr Biol 13:1641–1647
Article Google Scholar
Huang N, Parco A, Mew T, Magpantay G, McCouch S, Guiderdoni E, Xu J, Subudhi P, Angeles ER, Khush GS (1997) RFLP mapping of isozimes, RAPD and QTLs for grain shape, brown planthopper resistance in a doubled haploid rice population. Mol Breed 3:105–113
Article Google Scholar
Karlin S, Liberman U (1979) A natural class of multilocus recombination processes and related measures of crossover interference. Adv Appl Prob 11:479–501
Article MATH MathSciNet Google Scholar
King JS, Mortimer RK (1990) A polymerization model of chiasma interference and corresponding computer simulation. Genetics 126:1127–1138
Google Scholar
Lander ES, Botstein D (1989) Mapping mendelian factors underlying quantitative traits using RFLP linkage maps. Genetics 138:235–240
Google Scholar
Lobo I, Shaw K (2008) Thomas Hunt Morgan, genetic recombination, and gene mapping. Nat Educ 1:1
Google Scholar
Martini E, Diaz LD, Hunter N, Keeney S (2006) Crossover homeostasis in yeast meiosis. Cell 126:285–295
Article Google Scholar
McPeek MS, Speed TP (1995) Modeling interference in genetic recombination. Genetics 139:1031–1044
Google Scholar
Muller HJ (1916) The mechanism of crossing-over. Am Nat 50:193–221 (pp 284–305, 350–366, 421–434)
Article Google Scholar
Rebaï A, Goffinet B, Mangin B (1994) Approximate thresholds of interval mapping tests for QTL detection. Genetics 138:235–240
Google Scholar
Rebaï A, Goffinet B, Mangin B (1995) Comparing power of different methods for QTL detection. Biometrics 51:87–99
Article MATH Google Scholar
Siegmund D, Yakir B (2007) The statistics of gene mapping. Springer, New York
MATH Google Scholar
Stam P (1979) Interference in genetic crossing over and chromosome mapping. Genetics 92:573-594
MathSciNet Google Scholar
Sturtevant AH (1915) The behavior of the chromosomes as studied through linkage. Z Indukt Abstammungs Vererbungsl 13:234–287
Google Scholar
Van der Vaart AW (1998) Asymptotic statistics. Cambridge series in statistical and probabilistic mathematics
Youlds J, Boulton S (2011) The choice in meiosis-defining the factors that influence crossover or non-crossover formation. J Cell Sci 124:501–513
Article Google Scholar
Wu R, Ma CX, Casella G (2007) Statistical genetics of quantitative traits. Springer, New York
MATH Google Scholar

Download references

Acknowledgments

I thank Jean-Marc Azaïs, Céline Delmas, Jean-Michel Elsen and Brigitte Mangin for fruitful discussions. I also thank the associate editor and the reviewers who helped me to improve the paper. This work has been supported by the Animal Genetic Department of the French National Institute for Agricultural Research, SABRE, and the National Center for Scientific Research (CNRS).

Author information

Authors and Affiliations

Université de Toulouse, Institut de Mathématiques de Toulouse, U.P.S, Toulouse, France
Charles-Elie Rabier
INRA UR631, Station d’Amélioration Génétique des Animaux, Auzeville, France
Charles-Elie Rabier
University of Wisconsin-Madison, Statistic Department, Medical Science Center, 1300 University Avenue, Madison, WI, 53706-1532, USA
Charles-Elie Rabier

Authors

Charles-Elie Rabier
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Charles-Elie Rabier.

Appendix: Proofs of theoretical results

1.1 Proof of Theorem 1

As mentioned before, we consider values of t and $t^{\star}, $ distinct of marker locations and the result can be prolonged by continuity on markers.

1.1.1 Study of the score process under the null hypothesis

The study is based on the key lemma:

Lemma 2

$$ u(t)=\alpha(t) X(t_1) + \beta(t) X(t_2) $$

with $ \alpha(t)= \frac{t_{2}-t}{t_{2}-t_{1}}$ and $\beta(t)= \frac{t-t_{1}}{t_{2}-t_{1}}. $

To prove this lemma, use formula (5) and check that both coincide whatever the value of X(t ₁), X(t ₂) is. Now using formula (7), we have

$$ \frac{\partial l^n_t}{ \partial q}\mid_{\theta_{0}} = \sum_{j=1}^n \frac{Y_j -\mu }{ \sigma^2 } u_j(t) = 1/\sigma \sum_{j=1}^n \varepsilon_j u_j(t) = \frac{\alpha(t)}{\sigma} \sum_{j=1}^n \varepsilon_j X_j(t_1) +\frac{ \beta(t) }{\sigma} \sum_{j=1}^n \varepsilon_j X_j(t_2) $$

(10)

this proves the interpolation. On the other hand

$$ S_n(t_k) = \sum_{j=1} ^n \frac{ \varepsilon_j X_j(t_k) }{\sqrt{n}} \quad k=1,2 $$

and a direct application of central limit theorem implies that these two variables have a limit distribution which is Gaussian centered distribution with variance

$$ \left( \begin{array}{ll} 1 & \exp\,( -2|t_2-t_1|) \\ \exp\,( -2|t_2-t_1|) & 1\\ \end{array} \right). $$

This proves the expression of the covariance. The weak convergence of the score process, S _n(.), is then a direct consequence of (10), the convergence of (S _n(t ₁),S _n(t ₂)) and the continuous mapping theorem.

1.1.2 Study of the score process under the local alternative

Under the alternative

$$ S_n(t) = \frac{a}{n \sigma} \sum_{j=1} ^n \frac{ U_j(t^*) u_j(t)}{\sqrt{\hbox{Var}\left\{u(t)\right\}}} + \frac{1}{\sqrt{n}} \sum_{j=1} ^n \varepsilon_j \frac{ u_j(t)}{\sqrt{\hbox{Var}\left\{u(t)\right\}}}. $$

The second term has the same distribution as under the null hypothesis and the first one gives the expectation. We have

$$ {\mathbb{E}} \left\{S_n(t)\right\} = \frac{a\,{\mathbb{E}}\left\{ U(t^*) u(t) \right\}}{\sigma\,\sqrt{\hbox{Var}\left\{u(t)\right\}}}. $$

According to Lemma 2, we have:

$$ {\mathbb{E}} \left\{U(t^*)u(t)\right\}= \alpha(t)\,{\mathbb{E}}\left\{X(t_{1})U(t^*)\right\}\,+\,\beta(t)\,{\mathbb{E}}\left\{U(t^*)X(t_{2})\right\}. $$

So, we need now to calculate ${\mathbb{E}\left\{X(t_{1})U(t^*)\right\}}$ and ${\mathbb{E}\left\{U(t^*)X(t_{2})\right\}. }$ We have

$$ \begin{aligned} {\mathbb{P}}\left\{X(t_{1})U(t^{\star})=-1\right\}&= {\mathbb{P}}\left\{U(t^{\star})=1\mid X(t_{1})=-1,X(t_{2})=1\right\}{\mathbb{P}}\left\{X(t_{1})=-1,X(t_{2})=1\right\}\\ &+ {\mathbb{P}}\left\{U(t^{\star})=1\mid X(t_{1})=-1,X(t_{2})=-1\right\}{\mathbb{P}}\left\{X(t_{1})=-1,X(t_{2})=-1\right\}\\ &+ {\mathbb{P}}\left\{U(t^{\star})=-1\mid X(t_{1})=1,X(t_{2})=1\right\}{\mathbb{P}}\left\{X(t_{1})=1,X(t_{2})=1\right\}\\ &+ {\mathbb{P}}\left\{U(t^{\star})=-1\mid X(t_{1})=1,X(t_{2})=-1\right\}{\mathbb{P}}\left\{X(t_{1})=1,X(t_{2})=-1\right\}\\ &= \frac{\beta(t^{\star})r(t_1,t_2)}{2} + 0 + 0 + \frac{\beta(t^{\star})r(t_1,t_2)}{2} = \beta(t^{\star})r(t_1,t_2). \end{aligned} $$

As a consequence,

$$ {\mathbb{P}}\left\{X(t_{1})U(t^{\star})=1\right\}= 1-\beta(t^{\star})r(t_1,t_2). $$

As a result,

$$ {\mathbb{E}}\left\{X(t_{1})U(t^{\star})\right\}= 1-2\beta(t^{\star})r(t_1,t_2)= \alpha(t^{\star})+\beta(t^{\star}) \rho(t_1,t_2)\quad \hbox {with}\;\rho(t_1,t_2)= e^{-2\mid t_1-t_2 \mid}. $$

In the same way, we obtain

$$ {\mathbb{E}}\left\{U(t^{\star})X(t_{2})\right\}= \alpha(t^{\star})\rho(t_1,t_2)+\beta(t^{\star}) . $$

This gives the result.

1.1.3 About the LRT process

Since the model with t fixed is regular, it is easy to prove that for fixed t

$$ \Uplambda_n (t) = S_n^2(t) + o_{P} (1) $$

(11)

under the null hypothesis.

Let us consider a local alternative defined by t* and $q = a/\sqrt{n}. $ The model with t* fixed is differentiable in quadratic mean, this implies that the alternative defines a contiguous sequence of alternatives. By Le Cam’s first Lemma, relation (11) remains true under the alternative. This gives the result for the convergence of finite-dimensional distribution. Concerning the study of the supremum of the LRT process, the proof is exactly the same as in Azaïs et al. (2012) which is based on results of Azaïs et al. (2006), (2009) and Gassiat (2002). □

1.2 Proof of Theorem 2

We recall that we consider values t or $t^{\star}$ of the parameters that are distinct of the markers positions, and the result will be prolonged by continuity at the markers positions.

The proof of the theorem is the same as the proof of Theorem 1 as soon as we can limit our attention to the interval (t ^ℓ, t ^r) when considering a unique instant t. So, under H ₀, the result is straightforward. However, under the local alternative, the proof is more complicated than the proof of Theorem 1. Indeed, the location $t^{\star}$ of the QTL and the location t, can belong to a different marker interval.

According to the proof of Theorem 1, under the alternative

$$ S_n(t) = \frac{a}{n \sigma} \sum_{j=1} ^n \frac{ U_j(t^*) u_j(t)}{\sqrt{\hbox{Var}\left\{u(t)\right\}}} + \frac{1}{\sqrt{n}} \sum_{j=1} ^n \varepsilon_j \frac{ u_j(t)}{\sqrt{\hbox{Var}\left\{u(t)\right\}}}. $$

As previously, the second term has the same distribution as under the null hypothesis and the first one gives the expectation. We have

$$ {\mathbb{E}} \left\{S_n(t)\right\} = \frac{a\,{\mathbb{E}}\left\{ U(t^*) u(t) \right\}}{\sigma\,\sqrt{\hbox{Var}\left\{u(t)\right\}}}. $$

We notice that we have ${u(t^{\star})=\mathbb{E}\left\{U(t^{\star})\mid X(t^{\star \ell})X(t^{\star r})\right\}. }$ Besides, u(t) is a function of X(t ^ℓ) and X(t ^r). As a consequence, by the properties of conditional expectancy, we have

$$ {\mathbb{E}}\left\{ U(t^*) u(t) \right\}= {\mathbb{E}}\left\{ u(t^*) u(t) \right\}. $$

According to Lemma 2,

$$ \begin{aligned} {\mathbb{E}}\left\{ u(t^*) u(t) \right\}&= \alpha(t^{\star})\alpha(t) {\mathbb{E}}\left\{X(t^{\star\ell})X(t^{\ell})\right\} +\beta(t^{\star})\alpha(t) {\mathbb{E}}\left\{X(t^{\star r})X(t^{\ell})\right\}\\ &+ \alpha(t^{\star})\beta(t) {\mathbb{E}}\left\{X(t^{\star\ell})X(t^{r})\right\} +\beta(t^{\star})\beta(t){\mathbb{E}}\left\{X(t^{\star r})X(t^{r})\right\}\\ &= \alpha(t^{\star})\alpha(t)\rho(t^{\ell},t^{\star \ell}) +\beta(t^{\star})\alpha(t) \rho(t^{\ell},t^{\star r})\\ &+ \alpha(t^{\star})\beta(t)\rho(t^{\star \ell},t^{r}) +\beta(t^{\star})\beta(t)\rho(t^{r},t^{\star r}). \end{aligned} $$

In order to obtain ${\mathbb{E}\left\{ u(t^*) u(t^{\ell}) \right\}, }$ we just have to use the dominated convergence theorem. As a result

$$ {\mathbb{E}}\left\{ u(t^*) u(t^{\ell})\right\}= \alpha(t^{\star})\rho(t^{\ell},t^{\star \ell})+\beta(t^{\star})\rho(t^{\ell},t^{\star r}) . $$

To conclude the proof, we just have to notice that

$$ \begin{aligned} {\mathbb{E}}\left\{ u(t^*) u(t^{\ell})\right\}&= \rho(t^{\ell},t^{\star \ell}) \left\{\alpha(t^{\star}) + \beta(t^{\star}) \rho(t^{\star \ell},t^{\star r})\right\}\quad\hbox {if} \; t^\star>t^{\ell}\\ &= \rho(t^{\ell},t^{\star r}) \left\{\alpha(t^{\star})\rho(t^{\star r},t^{\star \ell}) + \beta(t^{\star}) \right\}\quad\hbox {if}\; t^\star<t^{\ell}. \end{aligned} $$

In order to obtain ${\mathbb{E}\left\{ u(t^*) u(t^{r})\right\}, }$ we just have to replace t ^ℓ by t ^r. This gives the result. □

Rights and permissions

Reprints and permissions

About this article

Cite this article

Rabier, CE. On quantitative trait locus mapping with an interference phenomenon. TEST 23, 311–329 (2014). https://doi.org/10.1007/s11749-013-0349-z

Download citation

Received: 17 January 2013
Accepted: 24 November 2013
Published: 12 December 2013
Issue Date: June 2014
DOI: https://doi.org/10.1007/s11749-013-0349-z

Keywords

Mathematics Subject Classification

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

On quantitative trait locus mapping with an interference phenomenon

Abstract

Access this article

Similar content being viewed by others

Chi-square processes for gene mapping in a population with family structure

Novel Algorithm for Multiple Quantitative Trait Loci Mapping by Using Bayesian Variable Selection Regression

Statistical Analysis of Genomic Data

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendix: Proofs of theoretical results

1.1 Proof of Theorem 1

1.1.1 Study of the score process under the null hypothesis

Lemma 2

1.1.2 Study of the score process under the local alternative

1.1.3 About the LRT process

1.2 Proof of Theorem 2

Rights and permissions

About this article

Cite this article

Keywords

Mathematics Subject Classification

Navigation

On quantitative trait locus mapping with an interference phenomenon

Abstract

Access this article

Similar content being viewed by others

Chi-square processes for gene mapping in a population with family structure

Novel Algorithm for Multiple Quantitative Trait Loci Mapping by Using Bayesian Variable Selection Regression

Statistical Analysis of Genomic Data

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendix: Proofs of theoretical results

Appendix: Proofs of theoretical results

1.1 Proof of Theorem 1

1.1.1 Study of the score process under the null hypothesis

Lemma 2

1.1.2 Study of the score process under the local alternative

1.1.3 About the LRT process

1.2 Proof of Theorem 2

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation