A General Test for Gene–Environment Interaction in Sib Pair-based Association Analysis of Quantitative Traits
- First Online:
- Received:
- Accepted:
DOI: 10.1007/s10519-008-9201-8
- Cite this article as:
- van der Sluis, S., Dolan, C.V., Neale, M.C. et al. Behav Genet (2008) 38: 372. doi:10.1007/s10519-008-9201-8
Abstract
Several association studies support the hypothesis that genetic variants can modify the influence of environmental factors on behavioral outcomes, i.e., G × E interaction. The case-control design used in these studies is powerful, but population stratification with respect to allele frequencies can give rise to false positive or false negative associations. Stratification with respect to the environmental factors can lead to false positives or false negatives with respect to environmental main effects and G × E interaction effects as well. Here we present a model based on Fulker et al. (1999) and Purcell (2002) for the study of G × E interaction in family-based association designs, in which the effects of stratification can be controlled. Simulations illustrate the power to detect genetic and environmental main effects, and G × E interaction effects for the sib pair design. The power to detect interaction was studied in eight different situations, both with and without the presence of population stratification, and for categorical and continuous environmental factors. Results show that the power to detect genetic and environmental main effects, and G × E interaction effects, depends on the allele frequencies and the distribution of the environmental moderator. Admixture effects of realistic effect size lead only to very small stratification effects in the G × E component, so impractically large numbers of sib pairs are required to detect such stratification.
Keywords
Gene by environment interactionGenetic associationSib pair-based designsIntroduction
Several studies have demonstrated that genetic variants may modify the influence of environmental factors on behavioral outcomes, or, equivalently, that environmental factors modify the effects of genes (e.g., Caspi et al. 2002, 2003; Foley et al. 2004; Huizinga et al. 2004; Yaffe et al. 2000). Recently, for example, Lasky-Su et al. (2007) reported SNP-by-socioeconomic status interaction with respect to attention hyperactivity deficit (ADHD) symptom count in and around the BDNF gene. Although some of these studies may be subject to methodological limitations (Eaves 2006), gene by environment interaction (G × E) should be considered in genetic association studies.
Most genetic association studies are based on a case-control design. While case-control designs for genetic association are powerful, they suffer from potential effects of population stratification, leading to false positives or negatives (e.g., Cardon and Bell 2001; Posthuma et al. 2004). Family-based designs, which compare genetically related subjects, are therefore preferred. Fulker et al. (1999) proposed a design for association analysis of quantitative traits in sib pair data using maximum-likelihood variance-components procedures. They showed that the design is robust against spurious association stemming from population stratification, because the association effect is decomposed into a within-family effect and a between-family effect. The within-family effect is free of the potential effects of population stratification, because sibling pairs are drawn from the same family, and thus from the same genetic stratum. This design was extended by Neale et al. (1999) to include covariates, and by Abecasis et al. (2000a) to include multiple sibs, and parental information. The Fulker model now forms the basis for widely used statistical packages such as QTDT (Abecasis et al. 2000a, b).
Just like the association between genotypes and phenotypes, the associations between the environment and a phenotype, and between the G × E interaction and a phenotype are susceptible to the effects of population stratification. If two populations with (a) different allele frequencies, (b) different environmental frequencies (categorical environmental measure) or different environmental means (continuous environmental measure), and (c) different phenotypic means, are mixed, spurious environmental effects and spurious interaction effects can result, in addition to spurious allelic effects. In the sib pair design, it is therefore expedient to decompose into orthogonal between- and within-family effects (1) the allelic association; (2) the main effect of the environment; and (3) the G × E interaction.
In the present paper, we extend the sib pair model proposed by Fulker et al. (1999) to include environmental main effects and G × E interaction effects. The measured environmental variable may be either categorical or continuous. We report simulations carried out to investigate the statistical power to detect the presence of environmental main effects and G × E interaction effects for different effect sizes, different allele frequencies, and different environmental frequencies or means. In addition, we examine the statistical power to detect spurious G × E interaction due to population stratification.
Sib pair-based association including environmental effects and G × E interaction
We assume a diallelic marker with allele A_{1} with frequency p, and allele A_{2} with frequency 1 − p = q, and genotypes A_{1}A_{1}, A_{1}A_{2} and A_{2}A_{2} with genotypic effects a, d and −a, respectively, (Fisher 1918; Falconer and Mackay 1996). For simplicity we assume throughout the paper that the marker under study is the actual quantitative trait locus (QTL), i.e., recombination fraction θ is zero. In reality, the genotypic value of a marker is unequal to zero only if the marker is the QTL, or if the marker is in linkage disequilibrium (LD) with the QTL. We assume that the observed trait value of an individual is a function of a major gene effect (QTL), an additive polygenic genetic background effect, a shared familial or ‘common environmental’ effect, and an unshared, unique environmental effect (which also includes measurement error). Furthermore we assume that the effects of the additive polygenic genetic background, the common and unique environment, and the QTL are mutually uncorrelated, and that the additive polygenic genetic background effect and the environmental effects are normally distributed with mean zero.
If data from sibling pairs are available, the additive and dominance QTL effects may be partitioned into between- and within-family effects, as specified in Fulker et al. (1999), Abecasis et al. (2000a, b), and Posthuma et al. (2004). We will now introduce parameters for the effects of the environment, and for the interaction between genotype and environment.
Categorical environment
We assume that there are two environmental levels or conditions. The probability of being in either one of the environmental conditions is assumed not to depend on one’s genotype, i.e., the correlation between genotype and environmental status (rGE) is zero. We also assume that the probability of being in either one of the environmental conditions is independent of the environmental condition of other family members.
Expected phenotypic means for genotypic groups distinguished with respect to three environmental conditions
Genotype | Mean | ||
---|---|---|---|
Condition 1 | Condition 2 | Condition 3 | |
A_{1}A_{1} | τ_{i} + a | τ_{i} + a + e_{1} + i_{1} | τ_{i} + a + e_{2} + i_{2} |
A_{1}A_{2} | τ_{i} + d | τ_{i} + d + e_{1} + c_{1} | τ_{i} + d + e_{2} + c_{2} |
A_{2}A_{2} | τ_{i} − a | τ_{i} − a + e_{1} − i_{1} | τ_{i} − a + e_{2} − i_{2} |
Both sibs in environmental Condition 1
Genotype | Genotypic effect | Environmental effect | Additive and interaction | Dominance and interaction | Environment | Partitioned effects | |||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Sib1 | Sib2 | Sib1 | Sib2 | Sib1 | Sib2 | Between mean | Within diff/2 | Between mean | Within diff/2 | Between mean | Within diff/2 | Sib1 (B + W) | Sib2 (B − W) |
A_{1}A_{1} | A_{1}A_{1} | a | a | 0 | 0 | a_{b} | 0 | 0 | 0 | 0 | 0 | a_{b} | a_{b} |
A_{1}A_{1} | A_{1}A_{2} | a | d | 0 | 0 | ½ a_{b} | ½ a_{w} | ½ d_{b} | −½ d_{w} | 0 | 0 | ½[a_{b }+ a_{w}] + ½[d_{b} − d_{w}] | ½[a_{b} − a_{w}] + ½[d_{b }+ d_{w}] |
A_{1}A_{1} | A_{2}A_{2} | a | −a | 0 | 0 | 0 | a_{w} | 0 | 0 | 0 | 0 | a_{w} | −a_{w} |
A_{1}A_{2} | A_{1}A_{1} | d | a | 0 | 0 | ½ a_{b} | −½ a_{w} | ½ d_{b} | ½ d_{w} | 0 | 0 | ½[a_{b} − a_{w}] + ½[d_{b }+ d_{w}] | ½[a_{b }+ a_{w}] + ½[d_{b} − d_{w}] |
A_{1}A_{2} | A_{1}A_{2} | d | d | 0 | 0 | 0 | 0 | d_{b} | 0 | 0 | 0 | d_{b} | d_{b} |
A_{1}A_{2} | A_{2}A_{2} | d | −a | 0 | 0 | −½ a_{b} | ½ a_{w} | ½ d_{b} | ½ d_{w} | 0 | 0 | ½[−a_{b }+ a_{w}] + ½[d_{b }+ d_{w}] | −½[a_{b }+ a_{w}] + ½[d_{b} − d_{w}] |
A_{2}A_{2} | A_{1}A_{1} | −a | a | 0 | 0 | 0 | −a_{w} | 0 | 0 | 0 | 0 | −a_{w} | a_{w} |
A_{2}A_{2} | A_{1}A_{2} | −a | d | 0 | 0 | −½ a_{b} | −½ a_{w} | ½ d_{b} | −½ d_{w} | 0 | 0 | −½[a_{b }+ a_{w}] + ½[d_{b} − d_{w}] | ½[−a_{b }+ a_{w}] + ½[d_{b }+ d_{w}] |
A_{2}A_{2} | A_{2}A_{2} | −a | −a | 0 | 0 | −a_{b} | 0 | 0 | 0 | 0 | 0 | −a_{b} | −a_{b} |
Both sibs in environmental Condition 2
Genotype | Genotypic effect | Environmental effect | Additive and interaction | Dominance and interaction | Environment | Partitioned effects | |||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Sib1 | Sib2 | Sib1 | Sib2 | Sib1 | Sib2 | Between mean | Within diff/2 | Between mean | Within diff/2 | Between mean | Within diff/2 | Sib1 (B + W) | Sib2 (B − W) |
A_{1}A_{1} | A_{1}A_{1} | a + i | a + i | e | e | a_{b }+ i_{b} | 0 | 0 | 0 | e_{b} | 0 | [a_{b }+ i_{b}] + e_{b} | [a_{b }+ i_{b}] + e_{b} |
A_{1}A_{1} | A_{1}A_{2} | a + i | d + c | e | e | ½(a_{b }+ i_{b}) | ½(a_{w }+ i_{w}) | ½(d_{b }+ c_{b}) | −½(d_{w }+ c_{w}) | e_{b} | 0 | ½[a_{b }+ a_{w }+ i_{b }+ i_{w}] + ½[d_{b} − d_{w }+ c_{b} − c_{w})] + e_{b} | ½[a_{b} − a_{w }+ i_{b} − i_{w}] + ½[d_{b }+ d_{w }+ c_{b }+ c_{w}] + e_{b} |
A_{1}A_{1} | A_{2}A_{2} | a + i | −a − i | e | e | 0 | a_{w }+ i_{w} | 0 | 0 | e_{b} | 0 | [a_{w }+ i_{w}] + e_{b} | −[a_{w }+ i_{w}] + e_{b} |
A_{1}A_{2} | A_{1}A_{1} | d + c | a + i | e | e | ½(a_{b }+ i_{b}) | −½(a_{w }+ i_{w}) | ½(d_{b }+ c_{b}) | ½(d_{w }+ c_{w}) | e_{b} | 0 | ½[a_{b} − a_{w }+ i_{b} − i_{w}] + ½[d_{b }+ d_{w }+ c_{b }+ c_{w}] + e_{b} | ½[a_{b }+ a_{w }+ i_{b }+ i_{w}] + ½[d_{b} − d_{w }+ c_{b} − c_{w}] + e_{b} |
A_{1}A_{2} | A_{1}A_{2} | d + c | d + c | e | e | 0 | 0 | d_{b }+ c_{b} | 0 | e_{b} | 0 | [d_{b }+ c_{b}] + e_{b} | [d_{b }+ c_{b}] + e_{b} |
A_{1}A_{2} | A_{2}A_{2} | d + c | −a − i | e | e | −½(a_{b }+ i_{b}) | ½(a_{w }+ i_{w}) | ½(d_{b }+ c_{b}) | ½(d_{w }+ c_{w}) | e_{b} | 0 | −½[a_{b} − a_{w }+ i_{b} − i_{w}] + ½[d_{b }+ d_{w }+ c_{b }+ c_{w}] + e_{b} | −½[a_{b }+ a_{w }+ i_{b }+ i_{w}] + ½[d_{b} − d_{w }+ c_{b} − c_{w})] + e_{b} |
A_{2}A_{2} | A_{1}A_{1} | −a − i | a + i | e | e | 0 | −(a_{w }+ i_{w}) | 0 | 0 | e_{b} | 0 | −[a_{w }+ i_{w}] + e_{b} | [a_{w }+ i_{w}] + e_{b} |
A_{2}A_{2} | A_{1}A_{2} | −a − i | d + c | e | e | −½(a_{b }+ i_{b}) | −½(a_{w }+ i_{w}) | ½(d_{b }+ c_{b}) | −½(d_{w }+ c_{w}) | e_{b} | 0 | −½[a_{b }+ a_{w }+ i_{b }+ i_{w}] + ½[d_{b} − d_{w }+ c_{b} − c_{w}] + e_{b} | −½[a_{b} − a_{w }+ i_{b} − i_{w}] + ½[d_{b }+ d_{w }+ c_{b }+ c_{w})] + e_{b} |
A_{2}A_{2} | A_{2}A_{2} | −a − i | −a − i | e | e | −(a_{b }+ i_{b}) | 0 | 0 | 0 | e_{b} | 0 | −[a_{b }+ i_{b}] + e_{b} | −[a_{b }+ i_{b}] + e_{b} |
Sibs in different environmental conditions: sib1 in Condition 1 and sib2 in Condition 2
Genotype | Genotypic effect | Environmental effect | Additive and interaction | Dominance and interaction | Environment | Partitioned effects | |||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Sib1 | Sib2 | Sib1 | Sib2 | Sib1 | Sib2 | Between mean | Within diff/2 | Between mean | Within diff/2 | Between mean | Within diff/2 | Sib1 (B + W) | Sib2 (B − W) |
A_{1}A_{1} | A_{1}A_{1} | a | a + i | 0 | e | a_{b }+ ½ i_{b} | −½ i_{w} | 0 | 0 | ½ e_{b} | −½ e_{w} | [a_{b }+ ½ i_{b} − ½ i_{w}] + ½[e_{b} − e_{w}] | [a_{b }+ ½ i_{b }+ ½ i_{w}] + ½[e_{b }+ e_{w}] |
A_{1}A_{1} | A_{1}A_{2} | a | d + c | 0 | e | ½ a_{b} | ½ a_{w} | ½ d_{b }+ ½ c_{b} | −½ d_{w} − ½ c_{w} | ½ e_{b} | −½ e_{w} | ½[a_{b }+ a_{w}] + ½[d_{b} − d_{w }+ c_{b} − c_{w}] + ½[e_{b} − e_{w}] | ½[a_{b} − a_{w}] + ½[d_{b }+ d_{w }+ c_{b }+ c_{w}] + ½[e_{b }+ e_{w}] |
A_{1}A_{1} | A_{2}A_{2} | a | −a − i | 0 | e | −½ i_{b} | a_{w }+ ½ i_{w} | 0 | 0 | ½ e_{b} | −½ e_{w} | [a_{w} − ½ i_{b }+ ½i_{w}] + ½[e_{b} − e_{w}] | [−a_{w }− ½ i_{b }− ½ i_{w}] + ½[e_{b }+ e_{w}] |
A_{1}A_{2} | A_{1}A_{1} | d | a + i | 0 | e | ½ a_{b} + ½ i_{b} | −½ a_{w} − ½ i_{w} | ½ d_{b} | ½ d_{w} | ½ e_{b} | −½ e_{w} | ½[a_{b} − a_{w }+ i_{b} − i_{w}] + ½[d_{b }+ d_{w}] + ½[e_{b} − e_{w}] | ½[a_{b }+ a_{w }+ i_{b }+ i_{w}] + ½[d_{b} − d_{w}] + ½[e_{b }+ e_{w}] |
A_{1}A_{2} | A_{1}A_{2} | d | d + c | 0 | e | 0 | 0 | d_{b }+ ½ c_{b} | −½ c_{w} | ½ e_{b} | −½ e_{w} | [d_{b }+ ½ c_{b} − ½ c_{w}] + ½[e_{b} − e_{w}] | [d_{b }+ ½ c_{b }+ ½ c_{w}] + ½[e_{b }+ e_{w}] |
A_{1}A_{2} | A_{2}A_{2} | d | −a − i | 0 | e | −½ a_{b} − ½ i_{b} | ½ a_{w }+ ½ i_{w} | ½ d_{b} | ½ d_{w} | ½ e_{b} | −½ e_{w} | −½[a_{b} − a_{w }+ i_{b} − i_{w}] + ½[d_{b }+ d_{w}] + ½[e_{b} − e_{w}] | −½[a_{b }+ a_{w }+ i_{b }+ i_{w}] + ½[d_{b} − d_{w}] + ½[e_{b }+ e_{w}] |
A_{2}A_{2} | A_{1}A_{1} | −a | a + i | 0 | e | ½ i_{b} | −a_{w} − ½ i_{w} | 0 | 0 | ½ e_{b} | −½ e_{w} | −[a_{w} − ½ i_{b }+ ½ i_{w}] + ½[e_{b} − e_{w}] | [a_{w }+ ½ i_{b }+ ½ i_{w}] + ½[e_{b }+ e_{w}] |
A_{2}A_{2} | A_{1}A_{2} | −a | d + c | 0 | e | −½ a_{b} | −½ a_{w} | ½ d_{b }+ ½ c_{b} | −½ d_{w} − ½ c_{w} | ½ e_{b} | −½ e_{w} | −½[a_{b }+ a_{w}] + ½[d_{b} − d_{w }+ c_{b} − c_{w}] + ½[e_{b} − e_{w}] | −½[a_{b} − a_{w}] + ½[d_{b }+ d_{w }+ c_{b }+ c_{w}] + ½[e_{b }+ e_{w}] |
A_{2}A_{2} | A_{2}A_{2} | −a | −a − i | 0 | e | −a_{b} − ½ i_{b} | ½ i_{w} | 0 | 0 | ½ e_{b} | −½ e_{w} | −[a_{b }+ ½ i_{b} − ½ i_{w}] + ½[e_{b} − e_{w}] | [−a_{b }− ½ i_{b }− ½ i_{w}] + ½[e_{b }+ e_{w}] |
Continuous environment
In the variance-components approach, the means and variances of related individuals are modeled simultaneously, as a function of the set of parameters θ which equals θ = {τ_{i}, a_{b}, a_{w}, d_{b}, d_{w}, e_{b}, e_{w}, i_{bg}, i_{wg}, σ_{s}^{2}, σ_{u}^{2}}, if the marker under study is the QTL. Maximum likelihood estimation can be used to obtain parameter estimates, and likelihood ratio tests can be used to test specific constraints on the parameters (Azzelini 1996). For example, one can test whether the regression weight for the between-family additive genetic marker effect, a_{b}, is equal to the within-family additive genetic marker effect, a_{w}, the idea being that a_{b} only differs from a_{w} when population stratification significantly influences the results of the test for genetic association.
Sib pair association models including a measured environment and G × E interaction effects can readily be implemented in the Mx software package^{1} (Neale et al. 2003). Appendices I and II include example Mx-scripts for the case of sib pair data and a dichotomous environment or a continuous environment, respectively. Adaptation of these scripts for the modeling of more than two siblings, or categorical environments with multiple levels is quite straightforward. Extension of these scripts to include data from nuclear families (parents and offspring; Abecasis et al. 2000a) requires some modifications which are spelled out in the Mx-script provided by Posthuma et al. (2004) in their Appendix II. An example script for the modeling of data from monozygotic and dizygotic twin pairs is available online.^{2} Note that whereas sib pair data only allow distinction between σ_{s}^{2} and σ_{u}^{2}, twin data allow a more detailed decomposition of the background variance into variance due to additive genetic effects (σ_{A}^{2}), common environmental effects (σ_{C}^{2}) or dominance genetic effects (σ_{D}^{2}), and unshared environmental effects (σ_{E}^{2}).
Power calculations for the G × E model
We performed a series of simulation studies to investigate the power of the extended sib pair model to detect the G × E interaction effects. We considered both a dichotomous environmental measure and a continuous environmental measure. All simulations were based on simulated sibling pairs only, and the simulated marker was assumed to be the actual QTL. The power analyses are thus limited to the detection of effects on the means (association), not the variances (linkage).
Procedures
Simulations involved a diallelic marker locus with frequency p of the increaser allele A_{1} being .5 or .2. Except where noted, QTL dominance effects were absent. In the case of a dichotomous environment, the frequency b_{1} of Condition 1 was either .5 or .2. The continuous environmental measure was standard normally distributed, i.e., Environment ∼ N(0,1). Simulated environmental values were uncorrelated to the simulated genotypes (e.g., rGE = 0). The continuous phenotype was standard normally distributed when all measured allelic and environmental effects were zero. When these effects are not zero, the phenotypic mean and variance deviate from 0 and 1, respectively. The degree of deviation depends on their effect size.
The QTL effect, the environmental effect, and the interaction were manipulated so that in isolation, these factors each explained 1%, 2.5% or 5% of the total phenotypic variance in the total sample. In the simulations with a dichotomous environment, these effect sizes were determined for the case that both environmental conditions and alleles were evenly distributed (i.e., b_{1} = b_{2} = .5 and p = q = .5). Note, however, that the percentage of explained variance depends on the allele frequencies and the distribution of the environmental variable. For instance, if the parameters representing the genotypic effect of the QTL locus are chosen such that the locus explains 5% of the variance in the case that p = q = .5, this same locus (i.e., same genotypic values) only explains 3.3% of the variance in the case that p = .2. Likewise, an environmental effect that explains 5% of the variance if b_{1} = b_{2} = .5, explains only 3.3% of the variance if b_{1} = .2. For the simulations including a continuous environment, effect sizes were determined for the case that p =q = .5.
Where noted, population stratification was generated by mixing two samples (A and B) of equal proportions, with different phenotypic means (μ_{A} and μ_{B}), and different marker allele frequencies (p_{A} = .7, p_{B} = .3). In the case of a dichotomous environment, environmental frequencies differed between samples A and B (b_{A1} = .3, b_{B1} = .7). In the case of a continuous environment, the environmental means differed between samples A and B (μ_{envA} = 0, μ_{envB} = 2). The phenotypic means of samples A and B were selected such that admixture accounted for 20% of the total phenotypic variance in the combined population, i.e., (μ_{A} − μ_{B})^{2}/4σ_{TOT}^{2} = .20 (see Abecasis et al. 2000a).^{3} The mixture of these two samples with different phenotypic means, different allele frequencies, and different environmental frequencies or means, results in spurious allelic, spurious environmental, and spurious interaction effects. The emphasis in these simulations is thus on the detection of false positives, but false negatives are theoretically possible (e.g., Posthuma et al. 2004; Neale et al. 1999).
For all simulations, background variance was modeled such that, after accounting for the QTL-effect, the environmental main effect, and the interaction, 30% of the remaining variance was attributable to additive polygenic genetic effects (A), and 70% was due to non-shared environmental effects (E). Covariance between the sibs due to shared environmental components (C) was fixed to zero, so all resemblance between the sibs was due to genetic factors only (i.e., the QTL and other unidentified genes). Because A and C cannot be distinguished unless the sample includes monozygotic twins, in addition to regular siblings or dizygotic twins, the term σ_{s}^{2} will include all the siblings’ resemblance due to shared genes other than the QTL, and common environmental influences. The term σ_{u}^{2} then includes all variance due to unidentified non-shared genes and non-shared environmental effects. Note that, in general, the power to detect the effects of interest increases as the residual sibling resemblance σ_{s}^{2} increases, even if the exact nature of resemblance (genes or environment) cannot be distinguished. This is because, as a result of increasing σ_{s}^{2}, the non-shared component σ_{u}^{2} decreases, and less unshared variance implies less “noise” (i.e., unexplained variance), which increases statistical power. The choice to fix shared environmental effects to zero in all simulations, thus results in conservative estimations of the power to detect the effects of interest.
All data simulations were performed in the R program,^{4} and exact data simulation was used for all analyses (van der Sluis et al. 2008; Bollen and Stine 1993; Dolan et al. 2005). Exact data simulation can be used when sufficient summary statistics are available in theory, i.e., when all information present in the raw data can be summarized sufficiently in the variance covariance matrix Σ, and the means vector μ. Exact data simulation implies the simulation of raw data that are transformed to fit the true model exactly. Consequently, when the true model is fitted to these data, all parameter estimates used to simulate the data are recovered exactly. Subsequently, the constrained, nested (wrong) model is fitted to the data, in which parameters of interest are fixed to zero, or constrained to be equal. Minus twice the difference in the log-likelihoods of the true model and the nested model asymptotically equals the non-centrality parameter λ of the non-central χ^{2}-distribution, with df equal to the difference in the number of parameters estimated. This non-centrality parameter can subsequently be used to calculate the sample size N required for a chosen power level, given a chosen critical value α (Saris and Satorra 1993).^{5}
The results of power analyses based on exact data simulation equal exactly the results obtained through the analysis of (population or expected) summary statistics Σ and μ. Also, as in power calculations based on summary statistics, these results are asymptotically similar to results obtained through Monte Carlo simulation (depending on the number of runs, and the sample size N used in the Monte Carlo procedure). In contrast to Monte Carlo simulation, however, exact data simulation obviates the requirement to replicate the analyses in different runs because the quasi-randomly generated data are transformed to fit the true model exactly. Exact data simulation is therefore not only easy to perform but also computationally light compared to Monte Carlo simulation, which is why we chose to use exact data simulation here. We refer to Van der Sluis et al. (2008) for an extensive discussion on exact data simulation.
Given non-centrality parameter λ, the Mx program computes the total sample size that would be required, given the reported proportion of subjects in each distinguishable group, to reject the tested hypothesis at various power levels, ranging from .25 to .99. Here, we focus on the conditions required for a power of 80%. For all statistical tests, α was chosen to equal .05.
Patterns of G × E interaction
Results
Number of sib pairs required to detect main effects of QTL and environment, and G × E interaction effects of different effect sizes, in the context of different allele frequencies, and different types of environments (categorical versus continuous) for power of .80 with α = .05 when all other effects are 0
Effect size | Zero effects freely estimated | Zero effects fixed to zero | |||||
---|---|---|---|---|---|---|---|
Environment | Environment | ||||||
b_{1} = .2 | b_{1} = .5 | Continuous | b_{1} = .2 | b_{1} = .5 | Continuous | ||
Main effect QTL^{a} | |||||||
1% | p = .2 | 3,607 | 1,441 | 737 | 600 | 600 | 600 |
2.5% | 1,400 | 561 | 288 | 235 | 235 | 235 | |
5% | 664 | 266 | 142 | 116 | 116 | 116 | |
1% | p = .5 | 2,290 | 921 | 473 | 385 | 385 | 385 |
2.5% | 890 | 359 | 185 | 151 | 151 | 151 | |
5% | 422 | 171 | 92 | 75 | 75 | 75 | |
Main effect environment^{b} | |||||||
1% | p = .2 | 3,864 | 2,468 | 2,581 | 600 | 384 | 386 |
2.5% | 1,509 | 964 | 1,010 | 228 | 146 | 152 | |
5% | 716 | 458 | 490 | 110 | 71 | 75 | |
1% | p = .5 | 1,157 | 743 | 770 | 600 | 384 | 386 |
2.5% | 452 | 291 | 302 | 228 | 146 | 152 | |
5% | 216 | 139 | 147 | 110 | 71 | 75 | |
Interaction^{a} | |||||||
1% | p = .2 | 1,111 | 710 | 725 | 149 | 186 | 278 |
2.5% | 415 | 266 | 289 | 57 | 70 | 111 | |
5% | 186 | 120 | 142 | 27 | 33 | 55 | |
1% | p = .5 | 706 | 454 | 463 | 117 | 296 | 378 |
2.5% | 265 | 171 | 186 | 44 | 70 | 151 | |
5% | 119 | 77 | 92 | 21 | 33 | 75 |
To start with, we studied the power to detect specific effects in the situation where all other effects are zero. The simulated data included either a main effect for the QTL, or a main effect of the environment, or a G × E interaction effect (i.e., interaction in the absence of main effects). Within this context, we studied the effects on the power of allele frequencies, the scale of the environmental measures (dichotomous or continuous), and in the case of a dichotomous environment, the frequencies of the environmental conditions. Knowledge of the power to detect isolated effects of given effect sizes, provides a useful guide to subsequent analyses, where interaction effects are tested in the presence of other effects. Data were simulated such that the specific effects explained 1%, 2.5% or 5% of the variance when p = .5 and, if applicable, b_{1} = .5. Note that these simulations included no population stratification. All between and within parameters could thus be constrained to be equal without loss of fit (given the exact data simulation, this implies χ^{2} = 0 for all tests concerning admixture effects). Recall that the background variance (i.e., the variance not due to the marker under study, the environmental measure under study, or their interaction) was simulated to consist of 30% additive polygenic genetic effects (σ_{s}^{2}) and 70% environmental effects not-shared by the siblings (σ_{u}^{2}). In addition, note that in determining the power to detect the effects of interest, we first fitted the full model, i.e., the model including all effects, both zero and non-zero effects. Subsequently we fitted the model in which only the parameters of interest were constrained to zero.
The results are presented in the first three columns of Table 5. With respect to the main effects of the QTL, all tests have 2 degrees of freedom (df), as parameters for both additive and dominance allelic effects are constrained to zero. The power is greatest when p = q = .5, and when the environment is a continuous measure. A more uneven distribution of alleles is detrimental to the power to detect allelic effects, as is an uneven distribution of environmental conditions in the case of a dichotomous environmental measure. Interestingly, the distribution of the environmental variable influences the power to detect the QTL main effect, even though association between the environment and the phenotype is absent. These results are consistent with those in Table 6 of Neale et al. (1999).
All tests for environmental main effects have one degree of freedom. As can be seen from the first three columns of Table 5, the power to detect main effects of the environment is somewhat lower when the environmental effects are continuous, compared to a dichotomous environment with equally distributed conditions. The power to detect environmental main effects is lowest when both the alleles and the environmental conditions are unevenly distributed. Evidently, the allele frequencies influence the power to detect the environmental main effect when the genotypic effects are estimated freely, even though association between the QTL and the phenotype is absent.
All tests for interaction effects are 2 df-tests as both the interaction effects in the homozygotes and the heterozygotes are constrained to zero. The first three columns of Table 5 show that the power to detect interaction effects is greatest when both the allele frequencies and the environmental frequencies are evenly distributed. The power to detect interaction in the context of a continuous environment is only slightly lower.
In conclusion, if alleles are approximately evenly distributed, representative samples of about 200–400 sibling pairs are sufficient to detect main effects for the QTL or the environment, or interaction effects with effect sizes as small as 2.5–5% of the variance.
For illustrative purposes, the last three columns of Table 5 show the sample sizes required to detect the isolated effects with a power of 80% when all zero-effects are actually fixed to zero. As knowledge about which effects are actually zero is usually absent in practice, this is not a realistic situation. It does however illustrate two interesting points. First, the power to detect the effects of interest is much better in the context of a more constrained model. Practically, this implies that the order in which constraints are imposed on the model, may determine the probability to detect effects. This is something to bear in mind when deciding on model fitting procedures. Second, we previously noted that the power to detect effects (e.g., a QTL main effect, an environmental main effect) depends on the distribution of other variables in the model (e.g., the environmental variable, allele frequencies), even when these other variables are not actually associated with the phenotype under study. Naturally, this effect disappears when these zero-effects are excluded from the model.
Subsequently, we examined the power to detect genuine G × E interaction effects in the eight different designs distinguished by van den Oord (1999, see Fig. 1). For these simulations, parameter values for all non-zero effects were chosen such that in isolation, these effects would explain 2.5%. However, in the case of a dichotomous environment, the presence of other effects influences the percentage of variance explained by the G × E interaction. Using the same parameter values, the actual percentage of variance explained by the G × E interaction varied from 2.1% for Design_{(i,a,e)}, to 3.5% for Design_{(i = c)}. Also, as is well known in the context of ANOVA analysis, interaction effects can show up as main effects. In this case, the interaction effects show up as allelic main effects when the environment is dichotomous. Consequently, for Design_{(i)} through Design_{(i,a,e)}, the main effects of the QTL deviated from zero, with effect sizes ranging from 2.5% (Design_{(i)}) to 8.6% (Design_{(i,a)}). For Design_{(−i,a)} and Design_{(−i,a,e)} on the other hand, the QTL main effect explained 0% of the variance as the actual effect of the QTL was nullified entirely by the reversed interaction effect. The G × E interaction only turned up as a main environmental effect in Design_{(i = c)}. In all cases that the main effect of the environment was specifically modeled to be larger than zero (Design_{(i,e)}, Design_{(i,a,e)} and Design_{(−i,a,e)}), the effect was slightly lower than 2.5% (2.3, 2.0 and 2.4, respectively) due to the presence of the G × E interaction effect. Again, the background variance was simulated to consist of 30% additive polygenic genetic effects (σ_{s}^{2}), and 70% environmental effects not-shared by the siblings (σ_{u}^{2}) in all conditions. These simulations included no population stratification, so all between and within parameters could be constrained to be equal without loss of fit.
Number of sib pairs required to detect G × E interaction effects in eight different conditions (see Fig. 1) for power of .80 with α = .05^{a}
Environment | ||||||
---|---|---|---|---|---|---|
b_{1} = .2 | b_{1} = .5 | Continuous | ||||
p = .2 | p = .5 | p = .2 | p = .5 | p = .2 | p = .5 | |
Design_{(i)} | 410 | 261 | 263 | 169 | 289 | 185 |
Design_{(i = c)} | 144 | 175 | 93 | 113 | 102 | 124 |
Design_{(i,e)}; Design_{(i,a)}; Design_{(i,a,d)}; Design_{(i,a,e)}; Design_{(−i,a)}; Design_{(−i,a,e)} | 410 | 261 | 263 | 169 | 289 | 185 |
The power to detect the interaction effect is not influenced by the presence or absence of an environmental main effect (Design_{(i,e)} versus Design_{(i)}, and Design_{(i,a,e)} and Design_{(−i,a,e)} versus Design_{(i,a)}, Design_{(i,a,d)} and Design_{(−i,a)}). This is understandable, given that the environmental main effect only influences the phenotypic means of the genotypic groups, but not the differences in phenotypic means between the genotypic groups. The environmental main effect may thus be viewed as a constant, which does not influence the power to detect interaction.
The presence or absence of a main effect of the QTL also has no influence on the power to detect G × E interaction. (To assure that this finding was not due to the size of the allelic effect, additional analyses including a larger allelic effect, explaining 10% and 20% of the variance rather than 2.5%, were run, which showed similar results.)
Finally, we studied the power to detect population stratification with respect to the interaction component of the model. As described above, we mixed two subsamples of equal proportions, which differed with respect to allele frequencies (p_{A }= .7, p_{B }= .3), and environmental distribution (in case of a dichotomous environmental measure, b_{A1 }=_{ }.3, b_{B1 }=_{ }.7; in case of a continuous environmental measure, μ_{envA }= 0, μ_{envB }= 2), choosing phenotypic means of the subsamples such that the admixture accounted for 20% of the total phenotypic variance in the combined sample. When these admixture proportions were used to simulate data in which the actual effects (allelic, environmental and interaction) were zero, spurious allelic, environmental, and interaction effects were observed in the combined sample due to the admixture. For the dichotomous environment, the between family effects deviated from the within family effects, with the stratification effect being largest for the allelic effects (N = 184 for 80% power), intermediate for the environmental effects (N = 1,465 for 80% power), and smallest for the interaction effects (N = 7,028 for 80% power). For the continuous environment, the between family effects also deviated from the within family effects, with the stratification effect being largest for the environmental effects (N = 278 for 80% power), medium for the allelic effects (N = 576 for) and smallest for the interaction effects (N = 3,233 for 80% power). It is clear that very large numbers of sib pairs are required to detect stratification effects in the interaction component. It is also noteworthy that the allele frequencies in the subsamples determine how the spurious G × E interaction is expressed. With the present admixture settings (p_{A }= .7, p_{B }= .3, i.e., contrasting allele frequencies), spurious G × E is only apparent with respect to the interaction parameter for the heterozygous group, while the interaction parameter for the homozygous group obtained in the combined sample does not deviate from its actual value in the subsamples. However, if the allele frequencies in the subsamples are not contrasting (e.g., p_{A }= .3, p_{B }= .5), both interaction parameters for the heterozygous and homozygous groups are informative about spurious interaction.
Given population stratification, we again considered the eight different interaction designs (Fig. 1) to study (a) the power to detect stratification effects with respect to the interaction component of the model (tests with 2 df as both homozygote and heterozygote interaction effects are constrained to be equal within and across families) and (b) the power to detect genuine interaction effects (tests with 2 df as both homozygote and heterozygote interaction effects are constrained to be zero within families, while the between-family effects are freely estimated). For all conditions, the background variance was again simulated to consist of 30% additive polygenic genetic effects (σ_{s}^{2}), and 70% environmental effects not-shared by the siblings (σ_{u}^{2}).
Number of sib pairs required to detect spurious (H1:B = W vs. H0: B≠W) and genuine (H1:B≠W = 0 vs. H0: B≠W≠0) G × E interaction effects in eight different conditions for power of .80 with α = .05^{a}
Dichotomous environment | Continuous environment | |||
---|---|---|---|---|
B = W | B≠W = 0 | B = W | B≠W = 0 | |
Design_{(i)} | 4,745 | 274 | 2,543 | 184 |
Design_{(i = c)} | 4,129 | 192 | 3,277 | 126 |
Design_{(i,e)} | 3,832 | 274 | 4,000 | 184 |
Design_{(i,a)} | 3,470 | 274 | 2,071 | 184 |
Design_{(i,a,d)} | 3,658 | 274 | 2,146 | 184 |
Design_{(i,a,e)} | 2,914 | 274 | 3,045 | 184 |
Design_{(−i,a)} | 6,922 | 274 | 3,235 | 184 |
Design_{(−i,a,e)} | 5,318 | 274 | 5,563 | 184 |
An indication of the power to detect the genuine interaction effect is obtained by freely estimating the between-family effect, while the within-family effect is constrained to be zero (B≠W = 0). The results in Table 7 show that the power to detect G × E effects on the means is about as large as one would expect given the previous results presented in Table 6, and the distribution of the genotypes in the mixed population (i.e., freq(A_{1}A_{1}) = (.7^{2 }+^{ }.3^{2})/2 = .29; freq(A_{1}A_{2}) = ((2 * .3 * .7) + (2 * .3 * .7))/2 = .42; freq(A_{2}A_{2}) = (.3^{2 }+^{ }.7^{2})/2 = .29).
Discussion
In this paper, the family-based association design was extended to include G × E interaction effects and environmental main effects. Power calculation showed that allele frequencies, and characteristics of environment (e.g., measurement level, and in the case of a categorical environment, the frequencies of the environmental conditions) affect the power to detect G × E interaction. Relatively small interaction effects, explaining 2.5–5% of the phenotypic variance in the total sample, can be detected with reasonably small sample sizes (200–400 sib pairs, respectively), if alleles are evenly distributed. The power to detect main effects and interaction effects generally is reasonable, particularly when all zero-effects are removed from the model first.
Throughout the paper, we assumed that the marker locus under study is the actual QTL. In practice, this will often not be the case and markers will usually be more or less strongly in LD with the QTL. Also, a criterion level α of .05 was employed in the simulation studies. Often, however, one will not test for association in one, but several marker loci, and α will be adjusted downwards to control for Type I errors. The power results presented here thus concern the most favorable conditions, and in practice, larger sample sizes may be required to obtain a power of 80%.
Modeling measured environmental effects in association studies is standard (e.g., Caspi et al. 2002, 2003; Foley et al. 2004; Huizinga et al. 2006; Lasky-Su et al. 2007; Yaffe et al. 2000). The use of the extended sib pair model in such studies has the advantage of controlling for population stratification, and excluding spurious main effects of the QTL and the environment, and, given sufficiently large sample size, spurious interaction effects. This extension can be implemented readily in packages such as Mx (Appendices I and II), or, in case of a categorical environmental factor, in SPSS (Beem and Boomsma 2006).
Some caveats are in order. First, it has often been shown that non-normality can result in spurious interaction effects (e.g., Boomsma and Martin 2002; Martin 1999; Purcell 2002; van den Berg et al. 2007; van der Sluis et al. 2006). However, the actual presence of G × E can also render the distribution non-normal (e.g., Purcell 2002; van der Sluis et al. 2006), resulting in the problem that non-normality of the data can either indicate the presence of G × E (i.e., G × E being the source of the non-normality) or mimic the presence of G × E (i.e., non-normality due to e.g. censoring or poor scaling of the phenotypic measure). The model presented here is equally susceptible to this phenomenon.
Although there is no ready solution to this problem, researchers should at least investigate alternative reasons for the non-normality of their data than the presence of G × E (e.g., poor measurement scale, selective sampling, etc.). As has been argued before (e.g., Martin 1999; van der Sluis et al. 2006), transformation of the data is no solution as it will remove both spurious and genuine G × E from the data.
Here we presented a model with measured genotypes and a measured environment. If these measured variables are indeed the ones involved in the G × E interaction, and thus the ones causing the heteroscedasticity, then accounting for these measures (i.e., modeling their effects) should render the remaining variance (as summarized in Eq. 4) homoscedastic. In a previous paper (van der Sluis et al. 2006), marginal maximum likelihood showed to be useful in the detection of heteroscedasticity. If heteroscedasticity is present before modeling the genotypic and environmental effects, but absent when these effects are controlled for, then this can be taken to indicate that the heteroscedasticity was due to the interaction between the locus and environment under study. Yet, if the heteroscedasticity is still present, this can mean (a) that the heteroscedasticity is caused by scaling problems in the instrument used to measure the phenotype, or (b) that G × E interaction is present but the genes and environment controlled for are not the ones involved in this interaction, or only ‘rough approximations’ of the actual gene/environment involved (e.g., a poorly designed environmental measure, or a marker that is only slightly in LD with the actual QTL). Important in this context is an issue discussed by Eaves (1984) in the light of plant studies, that the genes that control average performance (i.e., main effects) may not be the genes that control the sensitivity to the environment (i.e., the genes involved in the interaction effect, giving rise to the heteroscedasticity, see Berg et al. 1989, for a similar distinction between ‘level’ and ‘variability’ genes). Within a design as discussed here, where both genes and environment are measured entities, level and variability genes can be distinguished. This distinction may be important in understanding the biological basis of the G × E interaction.
Second, throughout this paper, we assumed that the environmental measure is independent of genotype and family membership. Using so-called family-level environmental measures, i.e., environmental measures that are by definition equal for all siblings within a family, is problematic in the sib pair design discussed here, because the decomposition in between and within family environmental effects (e_{b} vs. e_{w}) depends on siblings that are discordant with respect to the environmental measure (see Tables 2–4). The use of family-level environmental measures thus excludes the possibility to test for stratification effects in family-level environmental components, such as socioeconomic status, divorce status of the parents, domestic violence, and loss of a parent. However, stratification with respect to the allelic effects and the interaction effect can still be controlled for, and one can still test the significance of the interaction effect, and allelic and environmental main effects. In this context it is important to note that there is ample debate about whether genuine family-level environmental measures actually exist. For example, the fact that divorce status of the parents is necessarily equal for siblings from the same family does not necessarily imply that this event has similar effects on the siblings, or is experienced in exactly the same manner by all siblings. We refer to Turkheimer et al. (2005) for an extensive discussion of this subject.
Third, the model presented so far does not account for the presence of gene-environment correlation (rGE). rGE represents the genetic liability to experience different environmental events, or the genetic control of exposure to different environments (e.g., Kendler and Eaves 1986; Plomin et al. 1977). Genetic factors have been found to substantially influence individual differences in, for example, the likelihood of experiencing stressful life events, lack of social support, participation in leisure activities, martial status, and age of first sexual intercourse (see Rutter and Silberg 2002 for a review). The finding that so many diverse ‘environmental’ measures are under genetic control, suggests that the present sib pair model may prove to be of limited use. Extension of this model to include the possibility to test and account for rGE is therefore desirable. For now, we advise researchers to test for the presence of rGE before they proceed. For instance, one can test whether the genotypic groups differ with respect to their environmental mean (ANOVA), or, if the environmental measure is categorical, with respect to the distribution of subjects across environmental conditions (χ^{2} test for equal frequencies). If differences with respect to the environmental measure are absent, one can proceed with the extended sib pair model as presented here.
Gene by environment interaction studies are relatively new and such studies are often characterized by difficulties concerning measurement and modeling (e.g., Eaves 2006). In general however, researchers seem to agree that studies aimed at revealing the sources of individual differences in specific qualities need to take G × E interaction into account, in order to arrive at a full account of individual differences (e.g., Caspi et al. 2006, Moffitt et al. 2005, 2006). Tests for G × E interaction are thus likely to become standard in future (association) studies.
Effect size is defined as the variance explained by the effect, divided by the total variance, i.e., [½(μ_{A} − μ_{G})^{2} + ½(μ_{B} − μ_{G})^{2}]/σ_{TOT}^{2}, where μ_{A} and μ_{B} are the means of the different subpopulations, and μ_{G} is the mean of the combined population, which is defined as (μ_{A} + μ_{B})/2 = ½μ_{A} + ½μ_{B}. Substitution of μ_{G} with ½μ_{A} + ½μ_{B} gives (μ_{A} − μ_{B})^{2}/4σ_{TOT}^{2}.
R is a freely accessible software environment for statistical computing and graphics, see http://www.r-project.org/.
A small R-program is available online (www.psy.vu.nl/u/s.van.der.sluis), which can be used to calculate sample size N required to obtain a chosen level of power, given non-centrality parameter λ, and critical value α.
Acknowledgements
Preparation of this manuscript was financially supported by NWO/MbGW VIDI-016-065-318, MH-65322, and DA-18673.
Open Access
This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.