A General Test for Gene–Environment Interaction in Sib Pair-based Association Analysis of Quantitative Traits

van der Sluis, Sophie; Dolan, Conor V.; Neale, Michael C.; Posthuma, Danielle

doi:10.1007/s10519-008-9201-8

A General Test for Gene–Environment Interaction in Sib Pair-based Association Analysis of Quantitative Traits

Original Research
Open access
Published: 04 April 2008

Volume 38, pages 372–389, (2008)
Cite this article

Download PDF

You have full access to this open access article

Behavior Genetics Aims and scope Submit manuscript

A General Test for Gene–Environment Interaction in Sib Pair-based Association Analysis of Quantitative Traits

Download PDF

Sophie van der Sluis¹,
Conor V. Dolan²,
Michael C. Neale^1,3 &
…
Danielle Posthuma¹

1747 Accesses
20 Citations
Explore all metrics

Abstract

Several association studies support the hypothesis that genetic variants can modify the influence of environmental factors on behavioral outcomes, i.e., G × E interaction. The case-control design used in these studies is powerful, but population stratification with respect to allele frequencies can give rise to false positive or false negative associations. Stratification with respect to the environmental factors can lead to false positives or false negatives with respect to environmental main effects and G × E interaction effects as well. Here we present a model based on Fulker et al. (1999) and Purcell (2002) for the study of G × E interaction in family-based association designs, in which the effects of stratification can be controlled. Simulations illustrate the power to detect genetic and environmental main effects, and G × E interaction effects for the sib pair design. The power to detect interaction was studied in eight different situations, both with and without the presence of population stratification, and for categorical and continuous environmental factors. Results show that the power to detect genetic and environmental main effects, and G × E interaction effects, depends on the allele frequencies and the distribution of the environmental moderator. Admixture effects of realistic effect size lead only to very small stratification effects in the G × E component, so impractically large numbers of sib pairs are required to detect such stratification.

Genotype-Environment Correlation in the Era of DNA

Article Open access 07 September 2014

Minor Allele Frequency Changes the Nature of Genotype by Environment Interactions

Article 22 April 2016

Fitting Procedures for Novel Gene-by-Measured Environment Interaction Models in Behavior Genetic Designs

Article 04 March 2015

Introduction

Several studies have demonstrated that genetic variants may modify the influence of environmental factors on behavioral outcomes, or, equivalently, that environmental factors modify the effects of genes (e.g., Caspi et al. 2002, 2003; Foley et al. 2004; Huizinga et al. 2004; Yaffe et al. 2000). Recently, for example, Lasky-Su et al. (2007) reported SNP-by-socioeconomic status interaction with respect to attention hyperactivity deficit (ADHD) symptom count in and around the BDNF gene. Although some of these studies may be subject to methodological limitations (Eaves 2006), gene by environment interaction (G × E) should be considered in genetic association studies.

Most genetic association studies are based on a case-control design. While case-control designs for genetic association are powerful, they suffer from potential effects of population stratification, leading to false positives or negatives (e.g., Cardon and Bell 2001; Posthuma et al. 2004). Family-based designs, which compare genetically related subjects, are therefore preferred. Fulker et al. (1999) proposed a design for association analysis of quantitative traits in sib pair data using maximum-likelihood variance-components procedures. They showed that the design is robust against spurious association stemming from population stratification, because the association effect is decomposed into a within-family effect and a between-family effect. The within-family effect is free of the potential effects of population stratification, because sibling pairs are drawn from the same family, and thus from the same genetic stratum. This design was extended by Neale et al. (1999) to include covariates, and by Abecasis et al. (2000a) to include multiple sibs, and parental information. The Fulker model now forms the basis for widely used statistical packages such as QTDT (Abecasis et al. 2000a, b).

Just like the association between genotypes and phenotypes, the associations between the environment and a phenotype, and between the G × E interaction and a phenotype are susceptible to the effects of population stratification. If two populations with (a) different allele frequencies, (b) different environmental frequencies (categorical environmental measure) or different environmental means (continuous environmental measure), and (c) different phenotypic means, are mixed, spurious environmental effects and spurious interaction effects can result, in addition to spurious allelic effects. In the sib pair design, it is therefore expedient to decompose into orthogonal between- and within-family effects (1) the allelic association; (2) the main effect of the environment; and (3) the G × E interaction.

In the present paper, we extend the sib pair model proposed by Fulker et al. (1999) to include environmental main effects and G × E interaction effects. The measured environmental variable may be either categorical or continuous. We report simulations carried out to investigate the statistical power to detect the presence of environmental main effects and G × E interaction effects for different effect sizes, different allele frequencies, and different environmental frequencies or means. In addition, we examine the statistical power to detect spurious G × E interaction due to population stratification.

Sib pair-based association including environmental effects and G × E interaction

We assume a diallelic marker with allele A₁ with frequency p, and allele A₂ with frequency 1 − p = q, and genotypes A₁A₁, A₁A₂ and A₂A₂ with genotypic effects a, d and −a, respectively, (Fisher 1918; Falconer and Mackay 1996). For simplicity we assume throughout the paper that the marker under study is the actual quantitative trait locus (QTL), i.e., recombination fraction θ is zero. In reality, the genotypic value of a marker is unequal to zero only if the marker is the QTL, or if the marker is in linkage disequilibrium (LD) with the QTL. We assume that the observed trait value of an individual is a function of a major gene effect (QTL), an additive polygenic genetic background effect, a shared familial or ‘common environmental’ effect, and an unshared, unique environmental effect (which also includes measurement error). Furthermore we assume that the effects of the additive polygenic genetic background, the common and unique environment, and the QTL are mutually uncorrelated, and that the additive polygenic genetic background effect and the environmental effects are normally distributed with mean zero.

If data from sibling pairs are available, the additive and dominance QTL effects may be partitioned into between- and within-family effects, as specified in Fulker et al. (1999), Abecasis et al. (2000a, b), and Posthuma et al. (2004). We will now introduce parameters for the effects of the environment, and for the interaction between genotype and environment.

Categorical environment

We assume that there are two environmental levels or conditions. The probability of being in either one of the environmental conditions is assumed not to depend on one’s genotype, i.e., the correlation between genotype and environmental status (rGE) is zero. We also assume that the probability of being in either one of the environmental conditions is independent of the environmental condition of other family members.

We adopt the notation of van den Oord (1999), and model the environmental main effect (e) as the difference in the phenotypic means of environmental Conditions 1 and 2. To model the interaction effect, we assign interaction effect i to subjects with genotype A₁A₁ in Condition 2, interaction effect −i to subjects with genotypes A₂A₂ in Condition 2, and interaction effect c to the heterozygotes A₁A₂ in Condition 2. Modeled as such, the interaction parameter i represents the difference between genotypic value a in Condition 2, and genotypic value a in Condition 1, after the main effect of the environmental condition has been taken into account. Similarly, interaction parameter −i represents the difference between genotypic value −a in Condition 2 and genotypic value −a in Condition 1, after accounting for the environmental main effect. The interaction parameter c represents the difference between the dominance effect in Condition 2 and in Condition 1, once the main effect of the environment has been accounted for (see Mather and Jinks 1977, for a similar parameterization). For the purpose of illustration, the expected phenotypic means $ \hat{y}_{{kg}} $ (i.e., the expected score of subjects in condition k with genotype g) are presented in Table 1 for the case of an environment with three levels.

Table 1 Expected phenotypic means for genotypic groups distinguished with respect to three environmental conditions

Full size table

Note that in the case of sib pair data (or data including multiple siblings and parents), various combinations of these means models are likely to be observed. When family-data are available, the effects of the QTL, the environmental measure under study, and their interaction, may be further partitioned into between- and within-family effects. To illustrate, for sib pairs and a dichotomous environment, all possible combinations are presented in Tables 2–4.

Table 2 Both sibs in environmental Condition 1

Full size table

Table 3 Both sibs in environmental Condition 2

Full size table

Table 4 Sibs in different environmental conditions: sib1 in Condition 1 and sib2 in Condition 2

Full size table

In the case of the sib pair association design, the phenotypic score y _ijkg (i.e., the observed score y of subject j from family i in condition k with genotype g) is modeled as:

$$ \hat{y}_{{ijkg}} = \tau _{i} + a_{b} A_{{bi}} + a_{w} A_{{wij}} + d_{b} D_{{bi}} + d_{w} D_{{wij}} + e_{{bk}} E_{k} + e_{{wk}} E_{k} + i_{{bkg}} I_{{kg}} + i_{{wkg}} I_{{kg}} + \varepsilon _{{ij}} , $$

(1)

where τ_i is the family-specific intercept, and ɛ _ij the residual term, i.e., the part of the phenotypic score y _ijkg that is not explained by the measured QTL, the environmental measure, or the interaction between these two, and which may be due to background genetic, or background environmental effects, unmodeled interactions, or measurement error. The parameters a _b and a _w are the estimated between- and within-family additive genetic effects of the marker, which are weighted by the derived coefficients A _bi and A _wij, respectively. These coefficients are either −1, −½, 0, ½ or 1, as calculated in the 7th and 8th column of Tables 2–4 (see Fulker et al. 1999). Parameters d _b and d _w are the estimated between- and within-family dominance genetic effects of the marker, which are weighted by the derived coefficients D _bi and D _wij, respectively. These coefficients are either 0, ½ or 1, as calculated in the 9th and 10th column of Tables 2–4 (see Posthuma et al. 2004). Similarly, the parameters e _bk and e _wk represent the between- and within-family effects of environmental condition k, which are weighted by the derived coefficient E _k. This coefficient is either −½, 0, ½ or 1, as calculated in the 11th and 12th column of Tables 2–4. The parameters i _bkg and i _wkg represent the between- and within-family effects of the interaction of genotype g and environmental Condition k, which are weighted by the derived coefficient I _kg. This coefficient is either −½, 0, ½ or 1, as calculated in the 7th to 10th column of Tables 2–4.

Continuous environment

If the environmental measure is continuous in nature, rather than categorical, the model as presented in Eq. 1, is altered as follows. The between and within-family environmental parameters e _b and e _w are simply weighted by the subject’s score on the continuous environmental measure, E _j, just as the genotype-dependent between and within-family interaction parameters i _bg and i _wg. The continuous environmental measure is now modeled as a continuous moderator, in the manner proposed by Purcell (2002). In the case of a continuous environmental measure, the phenotypic score y _ijg is modeled as:

$$ \hat{y}_{{ijg}} = \tau _{i} + a_{b} A_{{bi}} + a_{w} A_{{wij}} + d_{b} D_{{bi}} + d_{w} D_{{wij}} + e_{b} E_{j} + e_{w} E_{j} + i_{{bg}} E_{j} + i_{{wg}} E_{j} + \varepsilon _{{ij}} . $$

(2)

Given these additional effects of the environment and the G × E interaction, the variance-covariance matrix for siblings j and m of the ith family, Σ _i, is given as:

$$ {\sum\nolimits_i { = {\left\{ {\begin{array}{*{20}c} {{\begin{array}{*{20}l} {{\sigma ^{2}_{{QTL{\text{ - }}A}} + \sigma ^{2}_{{QTL{\text{ - }}D}} + \sigma ^{2}_{{ENV}} + \sigma ^{2}_{{INT}} + \sigma ^{2}_{s} + \sigma ^{2}_{u} } \hfill} \\ {{\hat{\pi }_{{ijk}} \sigma ^{2}_{{QTL{\text{ - }}A}} + \hat{Z}_{{ijk}} \sigma ^{2}_{{QTL{\text{ - }}D}} + \sigma ^{2}_{s} } \hfill} \\ \end{array} }} & {{\begin{array}{*{20}c} {{{\text{if}}\,j = m}} \\ {{{\text{if}}\,j \ne m}} \\ \end{array} }} \\ \end{array} } \right\}}} } $$

(3)

where σ ²_QTL-A is the variance due to the additive genetic effect of the marker, σ ²_QTL-D is the variance due to the dominance effects of the marker, σ ²_ENV is the variance due to the measured environmental indicator, and σ ²_INT is the variance due to the interaction between the marker and the environmental measure. After all these measured effects are accounted for, σ ²_s denotes the residual sibling resemblance, which is due to shared alleles other than the QTL alleles under study, shared environmental effects other than the measured environmental variable under study, or covariance between these two sources. Finally, σ ²_u denotes all variance that is not shared by siblings from the same family, and which is due to unshared alleles, and unshared environmental effects. The covariance between the phenotypic scores of siblings equals the additive and dominance QTL variance, weighted by $ \hat{\pi }_{{ijk}} $ (the estimated proportion of alleles that siblings j and m from family i share IBD, i.e. p(IBD = 2) + ½ p(IBD = 1)) and $ \hat{Z}_{{ijk}} $ (the probability of complete IBD sharing between siblings j and m, i.e., p(IBD = 2)), respectively. Because we assumed the environmental effect under study to be unrelated to genotype (i.e. rGE = 0) or to family membership, the environmental effect and the interaction effect only contribute to the variance through σ ²_ENV and σ ²_INT , but not to the covariance between siblings j and m. Note that in practice, σ ²_ENV , σ ²_INT , and σ ²_u cannot be estimated individually (i.e., only the sum of them can be estimated). Note also that when the marker under study is indeed the actual QTL, as is assumed throughout this paper, and the environmental measure is an accurate reflection of the true environmental moderator, the expected variance–covariance matrix Σ _i reduces to

$$ {\sum\nolimits_i { = {\left\{ {\begin{array}{*{20}c} {{\begin{array}{*{20}l} {{\sigma ^{2}_{s} + \sigma ^{2}_{u} } \hfill} \\ {{\sigma ^{2}_{s} } \hfill} \\ \end{array} }} & {{\begin{array}{*{20}c} {{{\text{if}}\,j = m}} \\ {{{\text{if}}\,j \ne m}} \\ \end{array} }} \\ \end{array} } \right\}}} } $$

(4)

(Fulker et al. 1999), because the family variance–covariance matrices Σ _i are formed conditionally on the marker genotypes of the siblings, and conditionally on their environmental status. Conditionally on the marker genotype and the environmental status of the siblings, there no longer is any variation in marker genotype or environmental status, so these variables no longer explain any variance. The effects of the marker and the environment are then modeled via the mean structure only, per Eqs. 1 and 2.

In the variance-components approach, the means and variances of related individuals are modeled simultaneously, as a function of the set of parameters θ which equals θ = {τ_i, a _b, a _w, d _b, d _w, e _b, e _w, i _bg, i _wg, σ ²_s , σ ²_u }, if the marker under study is the QTL. Maximum likelihood estimation can be used to obtain parameter estimates, and likelihood ratio tests can be used to test specific constraints on the parameters (Azzelini 1996). For example, one can test whether the regression weight for the between-family additive genetic marker effect, a _b, is equal to the within-family additive genetic marker effect, a _w, the idea being that a _b only differs from a _w when population stratification significantly influences the results of the test for genetic association.

Sib pair association models including a measured environment and G × E interaction effects can readily be implemented in the Mx software package^{Footnote 1} (Neale et al. 2003). Appendices I and II include example Mx-scripts for the case of sib pair data and a dichotomous environment or a continuous environment, respectively. Adaptation of these scripts for the modeling of more than two siblings, or categorical environments with multiple levels is quite straightforward. Extension of these scripts to include data from nuclear families (parents and offspring; Abecasis et al. 2000a) requires some modifications which are spelled out in the Mx-script provided by Posthuma et al. (2004) in their Appendix II. An example script for the modeling of data from monozygotic and dizygotic twin pairs is available online.^{Footnote 2} Note that whereas sib pair data only allow distinction between σ ²_s and σ ²_u , twin data allow a more detailed decomposition of the background variance into variance due to additive genetic effects (σ ²_A ), common environmental effects (σ ²_C ) or dominance genetic effects (σ ²_D ), and unshared environmental effects (σ ²_E ).

Power calculations for the G × E model

We performed a series of simulation studies to investigate the power of the extended sib pair model to detect the G × E interaction effects. We considered both a dichotomous environmental measure and a continuous environmental measure. All simulations were based on simulated sibling pairs only, and the simulated marker was assumed to be the actual QTL. The power analyses are thus limited to the detection of effects on the means (association), not the variances (linkage).

Procedures

Simulations involved a diallelic marker locus with frequency p of the increaser allele A₁ being .5 or .2. Except where noted, QTL dominance effects were absent. In the case of a dichotomous environment, the frequency b ₁ of Condition 1 was either .5 or .2. The continuous environmental measure was standard normally distributed, i.e., Environment ∼ N(0,1). Simulated environmental values were uncorrelated to the simulated genotypes (e.g., rGE = 0). The continuous phenotype was standard normally distributed when all measured allelic and environmental effects were zero. When these effects are not zero, the phenotypic mean and variance deviate from 0 and 1, respectively. The degree of deviation depends on their effect size.

The QTL effect, the environmental effect, and the interaction were manipulated so that in isolation, these factors each explained 1%, 2.5% or 5% of the total phenotypic variance in the total sample. In the simulations with a dichotomous environment, these effect sizes were determined for the case that both environmental conditions and alleles were evenly distributed (i.e., b ₁ = b ₂ = .5 and p = q = .5). Note, however, that the percentage of explained variance depends on the allele frequencies and the distribution of the environmental variable. For instance, if the parameters representing the genotypic effect of the QTL locus are chosen such that the locus explains 5% of the variance in the case that p = q = .5, this same locus (i.e., same genotypic values) only explains 3.3% of the variance in the case that p = .2. Likewise, an environmental effect that explains 5% of the variance if b ₁ = b ₂ = .5, explains only 3.3% of the variance if b ₁ = .2. For the simulations including a continuous environment, effect sizes were determined for the case that p =q = .5.

Where noted, population stratification was generated by mixing two samples (A and B) of equal proportions, with different phenotypic means (μ_A and μ_B), and different marker allele frequencies (p _A = .7, p _B = .3). In the case of a dichotomous environment, environmental frequencies differed between samples A and B (b _A1 = .3, b _B1 = .7). In the case of a continuous environment, the environmental means differed between samples A and B (μ_envA = 0, μ_envB = 2). The phenotypic means of samples A and B were selected such that admixture accounted for 20% of the total phenotypic variance in the combined population, i.e., (μ_A − μ_B)²/4σ ²_TOT = .20 (see Abecasis et al. 2000a).^{Footnote 3} The mixture of these two samples with different phenotypic means, different allele frequencies, and different environmental frequencies or means, results in spurious allelic, spurious environmental, and spurious interaction effects. The emphasis in these simulations is thus on the detection of false positives, but false negatives are theoretically possible (e.g., Posthuma et al. 2004; Neale et al. 1999).

For all simulations, background variance was modeled such that, after accounting for the QTL-effect, the environmental main effect, and the interaction, 30% of the remaining variance was attributable to additive polygenic genetic effects (A), and 70% was due to non-shared environmental effects (E). Covariance between the sibs due to shared environmental components (C) was fixed to zero, so all resemblance between the sibs was due to genetic factors only (i.e., the QTL and other unidentified genes). Because A and C cannot be distinguished unless the sample includes monozygotic twins, in addition to regular siblings or dizygotic twins, the term σ ²_s will include all the siblings’ resemblance due to shared genes other than the QTL, and common environmental influences. The term σ ²_u then includes all variance due to unidentified non-shared genes and non-shared environmental effects. Note that, in general, the power to detect the effects of interest increases as the residual sibling resemblance σ ²_s increases, even if the exact nature of resemblance (genes or environment) cannot be distinguished. This is because, as a result of increasing σ ²_s , the non-shared component σ ²_u decreases, and less unshared variance implies less “noise” (i.e., unexplained variance), which increases statistical power. The choice to fix shared environmental effects to zero in all simulations, thus results in conservative estimations of the power to detect the effects of interest.

All data simulations were performed in the R program,^{Footnote 4} and exact data simulation was used for all analyses (van der Sluis et al. 2008; Bollen and Stine 1993; Dolan et al. 2005). Exact data simulation can be used when sufficient summary statistics are available in theory, i.e., when all information present in the raw data can be summarized sufficiently in the variance covariance matrix Σ, and the means vector μ. Exact data simulation implies the simulation of raw data that are transformed to fit the true model exactly. Consequently, when the true model is fitted to these data, all parameter estimates used to simulate the data are recovered exactly. Subsequently, the constrained, nested (wrong) model is fitted to the data, in which parameters of interest are fixed to zero, or constrained to be equal. Minus twice the difference in the log-likelihoods of the true model and the nested model asymptotically equals the non-centrality parameter λ of the non-central χ²-distribution, with df equal to the difference in the number of parameters estimated. This non-centrality parameter can subsequently be used to calculate the sample size N required for a chosen power level, given a chosen critical value α (Saris and Satorra 1993).^{Footnote 5}

The results of power analyses based on exact data simulation equal exactly the results obtained through the analysis of (population or expected) summary statistics Σ and μ. Also, as in power calculations based on summary statistics, these results are asymptotically similar to results obtained through Monte Carlo simulation (depending on the number of runs, and the sample size N used in the Monte Carlo procedure). In contrast to Monte Carlo simulation, however, exact data simulation obviates the requirement to replicate the analyses in different runs because the quasi-randomly generated data are transformed to fit the true model exactly. Exact data simulation is therefore not only easy to perform but also computationally light compared to Monte Carlo simulation, which is why we chose to use exact data simulation here. We refer to Van der Sluis et al. (2008) for an extensive discussion on exact data simulation.

Given non-centrality parameter λ, the Mx program computes the total sample size that would be required, given the reported proportion of subjects in each distinguishable group, to reject the tested hypothesis at various power levels, ranging from .25 to .99. Here, we focus on the conditions required for a power of 80%. For all statistical tests, α was chosen to equal .05.

Patterns of G × E interaction

The power to detect G × E interaction was studied given eight different patterns of interactions (see also Van den Oord 1999; Khoury et al. 1988, 1993). These eight designs are illustrated in Fig. 1 for a dichotomous environment. Design _(i) concerns the situation that all effects are zero except the interaction effect for the homozygotes. As a result, the phenotypic means are equal across genotypes in Condition 1, but they are increased or decreased in the homozygotes in the second environmental condition. Design _(i = c) represents a variation on Design_(i); here the interaction effect in the heterozygotes is also assumed to be non-zero. More specifically, the interaction effect in the heterozygotes is set to equal the effect in the A₁ homozygotes (i.e., ‘complete interaction dominance’). The phenotypic mean of the heterozygotes (A₁A₂) therefore equals the phenotypic mean of the group with genotype A₁A₁ in both the first and the second environmental condition. Design _(i,e) applies when the environmental main effect and the interaction effect in the homozygotes are greater than zero. Design _(i,a) is a function of a non-zero allelic effect (A₁ being the increaser allele), and a non-zero interaction effect. As a result, the phenotypic means of the three genotypic groups differ in Condition 1, and fan out even more in Condition 2. Design _(i,a,d) is a variation on Design_(i,a), with the difference that complete genetic dominance is present under environmental Condition 1, while the interaction effect in the heterozygotes remains zero. As a consequence, the phenotypic means in the groups with genotype A₁A₁ and A₁A₂ are equal in Condition 1, but differ in Condition 2 due to different interaction effects. In Design _(i,a,e), allelic effects, environmental main effects and interaction effects are non-zero, and dominance is absent for all effects. Design _(−i,a) is a variation on Design_(i,a), where both allelic effects and interaction effects are non-zero. For Design_(−i,a), however, the signs of the interaction effects are reversed, resulting in crossing lines. As a consequence, the group with the highest phenotypic mean in environmental Condition 1, has the lowest phenotypic mean in environmental Condition 2, and vice versa. Design _(−i,a,e) resembles Design_(−i,a), except that in addition, environmental main effects are non-zero as well.

Results

All tables with results of power analyses (Tables 5–7) show the number of sib pairs required for a power of 80% given α = .05; non-centrality parameters are not reported here but are available online.^{Footnote 6}

Table 5 Number of sib pairs required to detect main effects of QTL and environment, and G × E interaction effects of different effect sizes, in the context of different allele frequencies, and different types of environments (categorical versus continuous) for power of .80 with α = .05 when all other effects are 0

Full size table

To start with, we studied the power to detect specific effects in the situation where all other effects are zero. The simulated data included either a main effect for the QTL, or a main effect of the environment, or a G × E interaction effect (i.e., interaction in the absence of main effects). Within this context, we studied the effects on the power of allele frequencies, the scale of the environmental measures (dichotomous or continuous), and in the case of a dichotomous environment, the frequencies of the environmental conditions. Knowledge of the power to detect isolated effects of given effect sizes, provides a useful guide to subsequent analyses, where interaction effects are tested in the presence of other effects. Data were simulated such that the specific effects explained 1%, 2.5% or 5% of the variance when p = .5 and, if applicable, b ₁ = .5. Note that these simulations included no population stratification. All between and within parameters could thus be constrained to be equal without loss of fit (given the exact data simulation, this implies χ² = 0 for all tests concerning admixture effects). Recall that the background variance (i.e., the variance not due to the marker under study, the environmental measure under study, or their interaction) was simulated to consist of 30% additive polygenic genetic effects (σ ²_s ) and 70% environmental effects not-shared by the siblings (σ ²_u ). In addition, note that in determining the power to detect the effects of interest, we first fitted the full model, i.e., the model including all effects, both zero and non-zero effects. Subsequently we fitted the model in which only the parameters of interest were constrained to zero.

The results are presented in the first three columns of Table 5. With respect to the main effects of the QTL, all tests have 2 degrees of freedom (df), as parameters for both additive and dominance allelic effects are constrained to zero. The power is greatest when p = q = .5, and when the environment is a continuous measure. A more uneven distribution of alleles is detrimental to the power to detect allelic effects, as is an uneven distribution of environmental conditions in the case of a dichotomous environmental measure. Interestingly, the distribution of the environmental variable influences the power to detect the QTL main effect, even though association between the environment and the phenotype is absent. These results are consistent with those in Table 6 of Neale et al. (1999).

All tests for environmental main effects have one degree of freedom. As can be seen from the first three columns of Table 5, the power to detect main effects of the environment is somewhat lower when the environmental effects are continuous, compared to a dichotomous environment with equally distributed conditions. The power to detect environmental main effects is lowest when both the alleles and the environmental conditions are unevenly distributed. Evidently, the allele frequencies influence the power to detect the environmental main effect when the genotypic effects are estimated freely, even though association between the QTL and the phenotype is absent.

All tests for interaction effects are 2 df-tests as both the interaction effects in the homozygotes and the heterozygotes are constrained to zero. The first three columns of Table 5 show that the power to detect interaction effects is greatest when both the allele frequencies and the environmental frequencies are evenly distributed. The power to detect interaction in the context of a continuous environment is only slightly lower.

In conclusion, if alleles are approximately evenly distributed, representative samples of about 200–400 sibling pairs are sufficient to detect main effects for the QTL or the environment, or interaction effects with effect sizes as small as 2.5–5% of the variance.

For illustrative purposes, the last three columns of Table 5 show the sample sizes required to detect the isolated effects with a power of 80% when all zero-effects are actually fixed to zero. As knowledge about which effects are actually zero is usually absent in practice, this is not a realistic situation. It does however illustrate two interesting points. First, the power to detect the effects of interest is much better in the context of a more constrained model. Practically, this implies that the order in which constraints are imposed on the model, may determine the probability to detect effects. This is something to bear in mind when deciding on model fitting procedures. Second, we previously noted that the power to detect effects (e.g., a QTL main effect, an environmental main effect) depends on the distribution of other variables in the model (e.g., the environmental variable, allele frequencies), even when these other variables are not actually associated with the phenotype under study. Naturally, this effect disappears when these zero-effects are excluded from the model.

Subsequently, we examined the power to detect genuine G × E interaction effects in the eight different designs distinguished by van den Oord (1999, see Fig. 1). For these simulations, parameter values for all non-zero effects were chosen such that in isolation, these effects would explain 2.5%. However, in the case of a dichotomous environment, the presence of other effects influences the percentage of variance explained by the G × E interaction. Using the same parameter values, the actual percentage of variance explained by the G × E interaction varied from 2.1% for Design_(i,a,e), to 3.5% for Design_(i = c). Also, as is well known in the context of ANOVA analysis, interaction effects can show up as main effects. In this case, the interaction effects show up as allelic main effects when the environment is dichotomous. Consequently, for Design_(i) through Design_(i,a,e), the main effects of the QTL deviated from zero, with effect sizes ranging from 2.5% (Design_(i)) to 8.6% (Design_(i,a)). For Design_(−i,a) and Design_(−i,a,e) on the other hand, the QTL main effect explained 0% of the variance as the actual effect of the QTL was nullified entirely by the reversed interaction effect. The G × E interaction only turned up as a main environmental effect in Design_(i = c). In all cases that the main effect of the environment was specifically modeled to be larger than zero (Design_(i,e), Design_(i,a,e) and Design_(−i,a,e)), the effect was slightly lower than 2.5% (2.3, 2.0 and 2.4, respectively) due to the presence of the G × E interaction effect. Again, the background variance was simulated to consist of 30% additive polygenic genetic effects (σ ²_s ), and 70% environmental effects not-shared by the siblings (σ ²_u ) in all conditions. These simulations included no population stratification, so all between and within parameters could be constrained to be equal without loss of fit.

The results of these simulations are in Table 6. All tests are 2 df-tests, as both interaction effects for the homozygotes and the heterozygotes are constrained to zero. Note that, irrespective of the allele frequencies, and the measurement scale of the environment, the power to detect interaction effects is higher for Design_(i = c) than for Design_(i). This makes sense, because the heterozygous group only contributes to the power to detect G × E interaction if the heterozygous interaction effect deviates from zero (Design_(i = c) and not Design_(i)). Note also that the power to detect the interaction in the context of complete interaction dominance (Design_(i = c)) is higher given p = .2 than given p = .5. This is because the distribution of the informative genotypic groups is more even in the case of p = .2 (i.e., A₁A₁+A₁A₂ vs. A₂A₂: .51:.49) than in the case of p = .5 (i.e., A₁A₁+A₁A₂ vs. A₂A₂: .75:.25), which increases the power to detect the effects of interest.

Table 6 Number of sib pairs required to detect G × E interaction effects in eight different conditions (see Fig. 1) for power of .80 with α = .05^a

Full size table

The power to detect the interaction effect is not influenced by the presence or absence of an environmental main effect (Design_(i,e) versus Design_(i), and Design_(i,a,e) and Design_(−i,a,e) versus Design_(i,a), Design_(i,a,d) and Design_(−i,a)). This is understandable, given that the environmental main effect only influences the phenotypic means of the genotypic groups, but not the differences in phenotypic means between the genotypic groups. The environmental main effect may thus be viewed as a constant, which does not influence the power to detect interaction.

The presence or absence of a main effect of the QTL also has no influence on the power to detect G × E interaction. (To assure that this finding was not due to the size of the allelic effect, additional analyses including a larger allelic effect, explaining 10% and 20% of the variance rather than 2.5%, were run, which showed similar results.)

Finally, we studied the power to detect population stratification with respect to the interaction component of the model. As described above, we mixed two subsamples of equal proportions, which differed with respect to allele frequencies (p _A= .7, p _B= .3), and environmental distribution (in case of a dichotomous environmental measure, b _A1=.3, b _B1=.7; in case of a continuous environmental measure, μ_envA= 0, μ_envB= 2), choosing phenotypic means of the subsamples such that the admixture accounted for 20% of the total phenotypic variance in the combined sample. When these admixture proportions were used to simulate data in which the actual effects (allelic, environmental and interaction) were zero, spurious allelic, environmental, and interaction effects were observed in the combined sample due to the admixture. For the dichotomous environment, the between family effects deviated from the within family effects, with the stratification effect being largest for the allelic effects (N = 184 for 80% power), intermediate for the environmental effects (N = 1,465 for 80% power), and smallest for the interaction effects (N = 7,028 for 80% power). For the continuous environment, the between family effects also deviated from the within family effects, with the stratification effect being largest for the environmental effects (N = 278 for 80% power), medium for the allelic effects (N = 576 for) and smallest for the interaction effects (N = 3,233 for 80% power). It is clear that very large numbers of sib pairs are required to detect stratification effects in the interaction component. It is also noteworthy that the allele frequencies in the subsamples determine how the spurious G × E interaction is expressed. With the present admixture settings (p _A= .7, p _B= .3, i.e., contrasting allele frequencies), spurious G × E is only apparent with respect to the interaction parameter for the heterozygous group, while the interaction parameter for the homozygous group obtained in the combined sample does not deviate from its actual value in the subsamples. However, if the allele frequencies in the subsamples are not contrasting (e.g., p _A= .3, p _B= .5), both interaction parameters for the heterozygous and homozygous groups are informative about spurious interaction.

Given population stratification, we again considered the eight different interaction designs (Fig. 1) to study (a) the power to detect stratification effects with respect to the interaction component of the model (tests with 2 df as both homozygote and heterozygote interaction effects are constrained to be equal within and across families) and (b) the power to detect genuine interaction effects (tests with 2 df as both homozygote and heterozygote interaction effects are constrained to be zero within families, while the between-family effects are freely estimated). For all conditions, the background variance was again simulated to consist of 30% additive polygenic genetic effects (σ ²_s ), and 70% environmental effects not-shared by the siblings (σ ²_u ).

The results are presented in Table 7. Besides confirming the observation that prohibitively large samples of sib pairs are required to detect spurious interaction (B = W), it is shown that the power to detect the spurious interaction due to population admixture varies across the eight differentiated subtypes. Overall, the power to detect spurious interaction is somewhat higher when the environment is continuous in nature, but the sample sizes required to detect stratification with respect to the interaction effect are prohibitively large in all simulated scenarios.

Table 7 Number of sib pairs required to detect spurious (H1:B = W vs. H0: B≠W) and genuine (H1:B≠W = 0 vs. H0: B≠W≠0) G × E interaction effects in eight different conditions for power of .80 with α = .05^a

Full size table

An indication of the power to detect the genuine interaction effect is obtained by freely estimating the between-family effect, while the within-family effect is constrained to be zero (B≠W = 0). The results in Table 7 show that the power to detect G × E effects on the means is about as large as one would expect given the previous results presented in Table 6, and the distribution of the genotypes in the mixed population (i.e., freq(A₁A₁) = (.7²+.3²)/2 = .29; freq(A₁A₂) = ((2 * .3 * .7) + (2 * .3 * .7))/2 = .42; freq(A₂A₂) = (.3²+.7²)/2 = .29).

Discussion

In this paper, the family-based association design was extended to include G × E interaction effects and environmental main effects. Power calculation showed that allele frequencies, and characteristics of environment (e.g., measurement level, and in the case of a categorical environment, the frequencies of the environmental conditions) affect the power to detect G × E interaction. Relatively small interaction effects, explaining 2.5–5% of the phenotypic variance in the total sample, can be detected with reasonably small sample sizes (200–400 sib pairs, respectively), if alleles are evenly distributed. The power to detect main effects and interaction effects generally is reasonable, particularly when all zero-effects are removed from the model first.

Throughout the paper, we assumed that the marker locus under study is the actual QTL. In practice, this will often not be the case and markers will usually be more or less strongly in LD with the QTL. Also, a criterion level α of .05 was employed in the simulation studies. Often, however, one will not test for association in one, but several marker loci, and α will be adjusted downwards to control for Type I errors. The power results presented here thus concern the most favorable conditions, and in practice, larger sample sizes may be required to obtain a power of 80%.

Modeling measured environmental effects in association studies is standard (e.g., Caspi et al. 2002, 2003; Foley et al. 2004; Huizinga et al. 2006; Lasky-Su et al. 2007; Yaffe et al. 2000). The use of the extended sib pair model in such studies has the advantage of controlling for population stratification, and excluding spurious main effects of the QTL and the environment, and, given sufficiently large sample size, spurious interaction effects. This extension can be implemented readily in packages such as Mx (Appendices I and II), or, in case of a categorical environmental factor, in SPSS (Beem and Boomsma 2006).

Some caveats are in order. First, it has often been shown that non-normality can result in spurious interaction effects (e.g., Boomsma and Martin 2002; Martin 1999; Purcell 2002; van den Berg et al. 2007; van der Sluis et al. 2006). However, the actual presence of G × E can also render the distribution non-normal (e.g., Purcell 2002; van der Sluis et al. 2006), resulting in the problem that non-normality of the data can either indicate the presence of G × E (i.e., G × E being the source of the non-normality) or mimic the presence of G × E (i.e., non-normality due to e.g. censoring or poor scaling of the phenotypic measure). The model presented here is equally susceptible to this phenomenon.

Although there is no ready solution to this problem, researchers should at least investigate alternative reasons for the non-normality of their data than the presence of G × E (e.g., poor measurement scale, selective sampling, etc.). As has been argued before (e.g., Martin 1999; van der Sluis et al. 2006), transformation of the data is no solution as it will remove both spurious and genuine G × E from the data.

Here we presented a model with measured genotypes and a measured environment. If these measured variables are indeed the ones involved in the G × E interaction, and thus the ones causing the heteroscedasticity, then accounting for these measures (i.e., modeling their effects) should render the remaining variance (as summarized in Eq. 4) homoscedastic. In a previous paper (van der Sluis et al. 2006), marginal maximum likelihood showed to be useful in the detection of heteroscedasticity. If heteroscedasticity is present before modeling the genotypic and environmental effects, but absent when these effects are controlled for, then this can be taken to indicate that the heteroscedasticity was due to the interaction between the locus and environment under study. Yet, if the heteroscedasticity is still present, this can mean (a) that the heteroscedasticity is caused by scaling problems in the instrument used to measure the phenotype, or (b) that G × E interaction is present but the genes and environment controlled for are not the ones involved in this interaction, or only ‘rough approximations’ of the actual gene/environment involved (e.g., a poorly designed environmental measure, or a marker that is only slightly in LD with the actual QTL). Important in this context is an issue discussed by Eaves (1984) in the light of plant studies, that the genes that control average performance (i.e., main effects) may not be the genes that control the sensitivity to the environment (i.e., the genes involved in the interaction effect, giving rise to the heteroscedasticity, see Berg et al. 1989, for a similar distinction between ‘level’ and ‘variability’ genes). Within a design as discussed here, where both genes and environment are measured entities, level and variability genes can be distinguished. This distinction may be important in understanding the biological basis of the G × E interaction.

Second, throughout this paper, we assumed that the environmental measure is independent of genotype and family membership. Using so-called family-level environmental measures, i.e., environmental measures that are by definition equal for all siblings within a family, is problematic in the sib pair design discussed here, because the decomposition in between and within family environmental effects (e _b vs. e _w) depends on siblings that are discordant with respect to the environmental measure (see Tables 2–4). The use of family-level environmental measures thus excludes the possibility to test for stratification effects in family-level environmental components, such as socioeconomic status, divorce status of the parents, domestic violence, and loss of a parent. However, stratification with respect to the allelic effects and the interaction effect can still be controlled for, and one can still test the significance of the interaction effect, and allelic and environmental main effects. In this context it is important to note that there is ample debate about whether genuine family-level environmental measures actually exist. For example, the fact that divorce status of the parents is necessarily equal for siblings from the same family does not necessarily imply that this event has similar effects on the siblings, or is experienced in exactly the same manner by all siblings. We refer to Turkheimer et al. (2005) for an extensive discussion of this subject.

Third, the model presented so far does not account for the presence of gene-environment correlation (rGE). rGE represents the genetic liability to experience different environmental events, or the genetic control of exposure to different environments (e.g., Kendler and Eaves 1986; Plomin et al. 1977). Genetic factors have been found to substantially influence individual differences in, for example, the likelihood of experiencing stressful life events, lack of social support, participation in leisure activities, martial status, and age of first sexual intercourse (see Rutter and Silberg 2002 for a review). The finding that so many diverse ‘environmental’ measures are under genetic control, suggests that the present sib pair model may prove to be of limited use. Extension of this model to include the possibility to test and account for rGE is therefore desirable. For now, we advise researchers to test for the presence of rGE before they proceed. For instance, one can test whether the genotypic groups differ with respect to their environmental mean (ANOVA), or, if the environmental measure is categorical, with respect to the distribution of subjects across environmental conditions (χ² test for equal frequencies). If differences with respect to the environmental measure are absent, one can proceed with the extended sib pair model as presented here.

Gene by environment interaction studies are relatively new and such studies are often characterized by difficulties concerning measurement and modeling (e.g., Eaves 2006). In general however, researchers seem to agree that studies aimed at revealing the sources of individual differences in specific qualities need to take G × E interaction into account, in order to arrive at a full account of individual differences (e.g., Caspi et al. 2006, Moffitt et al. 2005, 2006). Tests for G × E interaction are thus likely to become standard in future (association) studies.

Notes

The Mx program is freely available at http://www.vcu.edu/mx/.
www.psy.vu.nl/u/s.van.der.sluis.
Effect size is defined as the variance explained by the effect, divided by the total variance, i.e., [½(μ_A − μ_G)² + ½(μ_B − μ_G)²]/σ ²_TOT , where μ_A and μ_B are the means of the different subpopulations, and μ_G is the mean of the combined population, which is defined as (μ_A + μ_B)/2 = ½μ_A + ½μ_B. Substitution of μ_G with ½μ_A + ½μ_B gives (μ_A − μ_B)²/4σ ²_TOT .
R is a freely accessible software environment for statistical computing and graphics, see http://www.r-project.org/.
A small R-program is available online (www.psy.vu.nl/u/s.van.der.sluis), which can be used to calculate sample size N required to obtain a chosen level of power, given non-centrality parameter λ, and critical value α.
http://www.psy.vu.nl/u/s.van.der.sluis.

References

Abecasis GR, Cardon LR, Cookson WOC (2000a) A general test of association for quantitative traits in nuclear families. Am J Human Genet 66:279–292
Article CAS Google Scholar
Abecasis GR, Cookson WOC, Cardon LR (2000b) Pedigree tests of transmission disequilibrium. Eur J Human Genet 8:545–551
Article CAS Google Scholar
Azzelini A (1996) Statistical inference based on the likelihood. Chapman and Hall, London
Google Scholar
Beem AL, Boomsma DI (2006) Implementation of a combined association-linkage model for quantitative traits in Linear Mixed Model procedures of statistical packages. Twin Res Human Genet 9(3):325–333
Article Google Scholar
Berg K, Kondo I, Drayna D, Lawn R (1989) ‘Variability gene’ effect of cholesteryl ester transfer protein (CEPT) genes. Clin Genet 35:437–445
PubMed CAS Google Scholar
Bollen KA, Stine R (1993) Bootstrapping goodness of fit measures in structural equation models. In: Bollen KA, Long JS (eds) Testing structural equation models. Sage, Newbury Park, CA, pp 111–135
Google Scholar
Boomsma DI, Martin NG (2002) Gene-environment interactions. In: D’haenen H, den Boer JA, Willner P (eds) Biological psychiatry. John Wiley & Sons, London, pp 181–187
Chapter Google Scholar
Cardon LR, Bell JI (2001) Association study designs for complex diseases. Nat Genet 2:91–99
Article CAS Google Scholar
Caspi A, McClay J, Moffitt TE, Mill J, Martin J, Craig IW, Taylor A, Poulton R (2002) Role of genotype[e in the cycle of violence in maltreated children. Science 297(5582):851–854
Article PubMed CAS Google Scholar
Caspi A, Sugden K, Moffitt TE, Taylor A, Craig IW, Harrington H et al (2003) Influence of life stress on depression: Moderation by a polymorphism in the 5-htt gene. Science 301(5631):386–389
Article PubMed CAS Google Scholar
Caspi A, Moffitt TE (2006) Gene-environment interactions in psychiatry: joining forces with neuroscience. Nat Neurosci 7:583–590
CAS Google Scholar
Dolan CV, van der Sluis S, Grasman R (2005) A note on normal theory power calculation in SEM with data missing completely at random. Struct Equat Model 12(2):245–262
Article Google Scholar
Eaves LJ (1984) The resolution of genotype-environment interaction in segregation analysis of nuclear families. Genet Epidemiol 1:215–228
Article PubMed CAS Google Scholar
Eaves LJ (2006) Genotype × environment interaction in psychopathology: fact or artifact? Twin Res Human Genet 9(1):1–8
Article Google Scholar
Falconer DS, Mackay TFC (1996) Introduction to quantitative genetics, 4th edn. Pearson Education Ltd., Essex, England
Google Scholar
Fisher RA (1918) The correlation between relatives on the supposition of Mendelian inheritance. Trans R Soc Edinb 52:399–433
Google Scholar
Foley DL, Eaves LJ, Wormley B, Silberg JL, Maes HH, Kuhn J, Riley B (2004) Childhood adversity, monoamine oxidase A genotype, and risk for conduct disorder. Arch Genet Psychiatry 61:738–744
Article CAS Google Scholar
Fulker DW, Cherny SS, Sham PC, Hewitt JK (1999) Combined linkage and association sib-pair analysis for quantitative traits. Am J Human Genet 64:259–267
Article CAS Google Scholar
Huizinga D, Haberstick BC, Smolen A, Menard S, Young SE, Corley RP, Stallings MC, Grotpeter J, Hewitt JK (2006) Childhood maltreatment, subsequent antisocial behavior, and the role of monoamine oxidase A genotype. Biol Psychiatry 60:677–683
Article PubMed CAS Google Scholar
Kendler K, Eaves L (1986) Models for the joint effect of genotype and environment on liability to psychiatric illness. Am J Psychiatry 143:279–289
PubMed CAS Google Scholar
Khoury MJ, Adams MJ, Flanders WD (1988) AN epidemiologic approach to ecogenetics. Am J Human Genet 42:89–95
CAS Google Scholar
Khoury MJ, James LM (1993) Population and family relative risks of disease associated with environmental factors in the presence of gene-environment interaction. Am J Epidemiol 137:1241–1250
PubMed CAS Google Scholar
Lasky-Su J, Faraone SV, Lange C, Tsuang MT, Doyle AE, Smoller JW, Laird NM, Biedermand J (2007) A study of how socioeconomic status moderates the relationship between SNPs encompassing BDNF and ADHD symptom count in ADHD families. Behav Genet 37:487–497
Article PubMed CAS Google Scholar
Mather K, Jinks JL (1977) Introduction to biometrical genetics. Chapman and Hall, London
Google Scholar
Martin N (1999) Gene-environment interaction and twin studies. In: Spector TD, Snieder H, MacGregor AJ (eds) Advances in twin and sib-pairanalysis. Greenwich Medical Media, London
Google Scholar
Moffitt TE, Caspi A, Rutter M (2005) Strategy for investigating interactions between measured genes and measured environments. Arch Genet Psychiatry 62:473–481
Article CAS Google Scholar
Moffitt TE, Caspi A, Rutter M (2006) Measured gene-environment interactions in psychopathology: concepts, research strategies, and implications for research, intervention, and public understanding of genetics. Perspect Psychol Sci 1(1):5–27
Article Google Scholar
Neale MC, Cherny SS, Sham PC, Whitfield JB, Heath AC, Birley AJ, Martin NG (1999) Distinguishing population stratification from genuine allelic effects with Mx: association of ADH2 with alcohol consumption. Behav Genet 29(4):233–243
Article Google Scholar
Neale MC, Boker SM, Xie G, Maes HH (2003) Mx: statistical modeling, 6th edn. Department of Psychiatry, Richmond, VA
Google Scholar
Plomin R, DeFries JC, Loehlin JC (1977) Genotype-environment interaction and correlation in the analysis of human behavior. Psychol Bull 84:309–322
Article PubMed CAS Google Scholar
Posthuma D, de Geus EJC, Boomsma DI, Neale MC (2004) Combined linkage and association tests in Mx. Behav Genet 34(2):179–196
Article PubMed CAS Google Scholar
Purcell S (2002) Variance components models for gene-environment interaction in twin analysis. Twin Res 5:554–571
Article PubMed Google Scholar
Rutter M, Silberg JL (2002) Gene-environment interplay in relation to emotional and behavioral disturbance. Ann Rev Psychol 53:463–490
Article Google Scholar
Saris WE, Satora A (1993) Power evaluations in structural equation models. In: Bollen KA, Long JS (eds) Testing structural equation models. Sage, Newbury Park, CA, pp 181–204
Google Scholar
Turkheimer E, D’Onofrio BM, Meas HH, Eaves LJ (2005) Analysis and interpretation of twin studies including measures of the shared environment. Child Dev 76(6):1217–1233
PubMed Google Scholar
Van den Berg SM, Glas CAW, Boomsma DI (2007) Variance decomposition using an IRT measurement model. Behav Genet 37:604–616
Article PubMed Google Scholar
Van den Oord EJCG (1999) Method to detect genotype-environment interactions for quantitative trait loci in association studies. Am J Epidemiol 150(11):1179–1187
PubMed Google Scholar
Van der Sluis S, Dolan CV, Neale MC, Posthuma D (2006) Detecting genotype-environment interaction in monozygotic twin data: comparing the Jinks and Fulker test and a new test based on marginal maximum likelihood estimation. Twin Res Human Genet 9(3):377–392
Google Scholar
Van der Sluis S, Dolan CV, Neale MC, Posthuma D (2008) Power calculations using exact data simulation: a useful tool for genetic study designs. Behav Genet 38:202–211
Article Google Scholar
Yaffe K, Haan M, Byers A, Tangen C, Kuller L (2000) Estrogen, apoe, and cognitive decline: evidence for gene-environment interaction. Neurology 54:1949–1954
PubMed CAS Google Scholar

Download references

Acknowledgements

Preparation of this manuscript was financially supported by NWO/MbGW VIDI-016-065-318, MH-65322, and DA-18673.

Open Access

This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

Author information

Authors and Affiliations

Biological Psychology, Vu University Amsterdam, Van der Boechorststraat 1, Room 2B-37, Amsterdam, 1081 BT, The Netherlands
Sophie van der Sluis, Michael C. Neale & Danielle Posthuma
Department of Psychology, FMG, University of Amsterdam, Roeterstraat 15, Amsterdam, 1018 WB, The Netherlands
Conor V. Dolan
Departments of Psychiatry and Human Genetics, Virginia Institute of Psychiatric and Behavioral Genetics, Virginia Commonwealth University, Richmond, VA, USA
Michael C. Neale

Authors

Sophie van der Sluis
View author publications
You can also search for this author in PubMed Google Scholar
Conor V. Dolan
View author publications
You can also search for this author in PubMed Google Scholar
Michael C. Neale
View author publications
You can also search for this author in PubMed Google Scholar
Danielle Posthuma
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sophie van der Sluis.

Additional information

Edited by Stacey Cherny.

Appendices

Appendix 1

Appendix 2

Rights and permissions

Open Access This is an open access article distributed under the terms of the Creative Commons Attribution Noncommercial License (https://creativecommons.org/licenses/by-nc/2.0), which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

Reprints and permissions

About this article

Cite this article

van der Sluis, S., Dolan, C.V., Neale, M.C. et al. A General Test for Gene–Environment Interaction in Sib Pair-based Association Analysis of Quantitative Traits. Behav Genet 38, 372–389 (2008). https://doi.org/10.1007/s10519-008-9201-8

Download citation

Received: 19 June 2007
Accepted: 04 March 2008
Published: 04 April 2008
Issue Date: July 2008
DOI: https://doi.org/10.1007/s10519-008-9201-8

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

A General Test for Gene–Environment Interaction in Sib Pair-based Association Analysis of Quantitative Traits

Abstract

Similar content being viewed by others

Genotype-Environment Correlation in the Era of DNA

Minor Allele Frequency Changes the Nature of Genotype by Environment Interactions

Fitting Procedures for Novel Gene-by-Measured Environment Interaction Models in Behavior Genetic Designs

Introduction