Gain of power of the general regression model compared to Cochran-Armitage Trend tests: simulation study and application to bipolar disorder

Dizier, Marie-Hélène; Demenais, Florence; Mathieu, Flavie

doi:10.1186/s12863-017-0486-6

Gain of power of the general regression model compared to Cochran-Armitage Trend tests: simulation study and application to bipolar disorder

Research article
Open access
Published: 10 March 2017

Volume 18, article number 24, (2017)
Cite this article

Download PDF

You have full access to this open access article

BMC Genetics Aims and scope Submit manuscript

Gain of power of the general regression model compared to Cochran-Armitage Trend tests: simulation study and application to bipolar disorder

Download PDF

Marie-Hélène Dizier¹,
Florence Demenais¹ &
Flavie Mathieu ORCID: orcid.org/0000-0002-3386-0299²

2270 Accesses
8 Citations
7 Altmetric
1 Mention
Explore all metrics

Abstract

Background

Most genome-wide association studies assumed an additive model of inheritance which may result in significant loss of power when there is a strong departure from additivity. The General Regression Model (GRM), which allows performing an assumption-free test for association by testing for both additive effect and deviation from additive effect, may be more appropriate for association tests. Additionally, GRM allows testing the underlying genetic model. We compared the power of GRM association test to additive and other Cochran-Armitage Trend (CAT) tests through simulations and by applying GRM to a large case/control sample, the bipolar Welcome Trust Case Control Cohort data. Simulations were performed on two sets of case/control samples (1000/1000 and 2000/2000), using a large panel of genetic models. Four association tests (GRM and additive, recessive and dominant CAT tests) were applied to all replicates.

Results

We showed that GRM power to detect association was similar or greater than the additive CAT test, in particular in case of recessive inheritance, with up to 67% gain in power. GRM analysis of genome-wide bipolar disorder Welcome Trust Consortium data (1998 cases/3004 controls) showed significant association in the 16p12 region (rs420259; P = 3.4E-7) which has not been identified using the additive CAT test. As expected, rs42025 fitted a non-additive (recessive) model.

Conclusions

GRM provides increased power compared to the additive CAT test for association studies and is easily applicable.

Analysis of Genetic Association Studies Incorporating Prior Information of Genetic Models

Article 10 March 2015

Optimal Trend Tests for Genetic Association Studies of Heterogeneous Diseases

Article Open access 09 June 2016

Relative performance of gene- and pathway-level methods as secondary analyses for genome-wide association studies

Article Open access 08 April 2015

Background

During the last decades, numerous genetic association studies for diseases or traits have been applied to large panels of SNPs (for single-nucleotide polymorphism), either at the genome-wide level (Genome Wide Association Studies (GWAS)) or in candidate regions. To limit the multiple testing problem, association studies were usually based on a single association test statistic between each SNP and disease. However, it is not obvious which test should be used. The simplest association test is allele-based and requires the strong assumption of Hardy Weinberg (HW) equilibrium. Model-based tests such as the Cochran-Armitage Trend (CAT) tests [1] have the advantage of not requiring this assumption and have thus been recommended for association studies [2]. CAT tests have been designed for different genetic models of the SNP effect on disease: additive (CAT_ADD), dominant (CAT_DOM) and recessive (CAT_REC), depending on the coding scheme assigned to the three genotypes. As the true genetic model is often unknown, CAT_ADD test is commonly used as it can represent an intermediate test between recessive and dominant tests. A major disadvantage of CAT tests is their sensitivity to model misspecification, as they are model-based. In case of deviation from additivity, the power of CAT_ADD test to detect association may be decreased [3–5]. A new likelihood-based method, which compares allelic frequencies between cases and controls and does not require specification of the genetic model or HW equilibrium assumption has recently been proposed. However the power of this approach did not exceed that of the CAT_ADD test [6].

Other tests, such as the “maximin efficiency robusts tests” (MERT and MAX), which are based on efficiency robustness theory, have relatively high power for any of the three commonly used genetic models (additive, recessive and dominant) [4]. The MERT test is a linear combination of the standardized optimal tests (additive, recessive and dominant) while the MAX test is the maximum of the standardized optimal tests. However, these tests are computationally intensive.

When the underlying genetic model is unknown, the General Regression Model (GRM), which includes both a term for additive effect and a term for deviation from additivity (dominance term) may be more appropriate for association tests. The GRM allows to first testing for association without making assumption on the mode of transmission and then testing for the underlying genetic model. The goal of this study was to compare the power of the GRM test for association with those of the most commonly used CAT_ADD test as well as CAT_DOM and CAT_REC tests through a simulation study that considered a large panel of genetic models. We then applied GRM and CAT_ADD tests to the bipolar disorder Wellcome Trust Cases-Controls Cohort’ data (WTCCC), in order to assess whether GRM was able to replicate CAT_ADD test results and to detect additional loci.

Methods

Association tests

CAT tests

The CAT tests can be applied to different genetic models. They are based on a logistic regressive model such that: logit(P) = α + β (X), where X is equal to 0, 1 and 2 for each of three SNP genotypes (AA, Aa, aa respectively) in case of an additive model (CAT_ADD); 0, 0, 1 for a recessive model (CAT_REC) and 0, 1, 1, for a dominant model (CAT_DOM) (see Table 1 for details on the coding scheme). The association test (β = 0 under H0) is a likelihood-ratio test which asymptotically follows a Chi-square distribution with one degree of freedom (df).

Table 1 Coding scheme of each genotype used for each CAT model and GRM

Full size table

General regression model

The General Regression Model (GRM), which includes two terms, an additive term and a dominance term (deviation from additivity), as proposed by Fisher and Wilson [7], allows testing for association without making assumption on the genetic model. The logistic regression model is written as:

$$ \mathrm{logit}\left(\mathrm{P}\right)=\upalpha +{\upbeta}_{\mathrm{Add}}\left(\mathrm{Add}\right)+{\upbeta}_{\mathrm{DomDev}}\left(\mathrm{Domdev}\right) $$

where β_Add is the regression coefficient for the additive effect (coded as 0, 1, 2 for the three genotypes AA, Aa, aa, see Table 1) and β_DomDev is the regression coefficient for the dominance term (coded as 0, 1, 0, see Table 1). The test for association (β_Add = β_DomDev = 0 under H0) is a likelihood-ration test which is assumed to follow a chi-square distribution with 2 df.

If there is significant evidence for association, the following genetic models can then be examined: by setting β_DomDev = 0 for the additive model, β_DomDev = β_Add for the dominant model β_DomDev = - β_Add for the recessive model. The decision tree is shown in Fig. 1.

The underlying genetic model is only tested if the association test is significant. First the additive model (Under H0, β_DomDev = 0) is tested. If H0 is not rejected, the additive model is retained. If the additive model is rejected, the dominant and recessive models are then tested: 1/if (β_DomDev = β_Add) is not rejected, the dominant model is retained and 2/if (β_DomDev = -β_Add) is not rejected, the recessive model is retained (see Fig. 1).

Simulation studies

A total of 200 000 or 1.0E8 replicates (for power or type 1 error estimation respectively) of samples of 1000 cases and 1000 controls were simulated. A binary trait was generated, using three different prevalence of disease (1%, 5 and 10%). We considered three genetic models (additive, dominant and recessive) for the causal variant. For each of these models, the minor allele frequency (MAF) was set at 0.1, 0.2, 0.3 or 0.4, and, for each MAF, the Odds-Ratios (OR) were varied between 1.0 and 3.2 (with a step of 0.2). Association analyses were performed for all simulated replicates using GRM and the CAT tests (CAT_ADD, CAT_DOM and CAT_REC). Thresholds of 1.0E-5 and 1.0E-7 were used to declare significance as currently used in association studies of large panels of markers.

Type one error rate

To estimate the type one error rate, simulations were done under the null hypothesis of no association (OR = 1.0 under H0). The type one error rate was estimated by the proportion of replicates showing significant association using either GRM or the CAT tests, for three significance thresholds: 5, 1% and 1.0E-5.

Comparison of power of association tests

Empirical power of each statistical test was estimated by the proportion of simulated replicates showing significant association.

Test of genetic model

For each simulated model (additive, dominant or recessive), the proportion of replicates retaining the true model was estimated among all replicates showing significant association.

Sample size effect

To assess the sensitivity of our results to sample size, samples of 2000 cases and 2000 controls were also generated for all genetic models and combinations of parameter values (MAF, ORs).

Results

Type one error rate

Under the null hypothesis of no association, the estimated type I error rate was equal or close to the three theoretical thresholds considered of 5, 1% and 1.0E-5. Results are provided in Table 2.

Table 2 Type one error rate

Full size table

Comparison of power of association tests

Results were similar for the three disease prevalence (1%, 5 and 10%). For sake of simplicity, only results obtained for a prevalence of 5% are provided. Results for simulated samples of 1000 cases/1000 controls are shown in Fig. 2 for MAFs of 0.2 and 0.4 and in Tables 3 and 4 for all MAFs.

Table 3 Power of GRM and CAT tests to detect association for a P-value threshold of 1.0E-5 using a sample size of 1000 cases/1000 controls

Full size table

Table 4 Power of GRM and CAT tests to detect association for a P-value threshold of 1.0E-7 using a sample size of 1000 cases/1000 controls

Full size table

When the simulated model was additive, the power of GRM and CAT_ADD tests to detect association were similar, for both critical thresholds of 1.0E-5 and 1.0E-7. For ORs less than or equal to 1.8, the CAT_ADD was slightly more powerful than GRM only for a few situations, with an increase in power never exceeding 15%, for all MAFs and P-value thresholds. For highest ORs, there was no difference as all power estimates reached 1.

When the simulated model was dominant, the GRM test was as powerful as the CAT_ADD test for a MAF of 0.2. For a MAF of 0.4, GRM was slightly more powerful, with highest gains in power reaching 18% for OR = 1.6 and significance threshold of 1.0E-5 or 25% for OR = 1.8 and threshold of 1.0E-7. As expected the CAT_DOM test had always the highest power when the simulated model was dominant, but the difference with the GRM never exceeded 12%.

When the simulated model was recessive, the GRM test was always more powerful than the CAT_ADD test, especially for SNP allele frequency of 0.2, with a gain in power of 52% (for OR = 2.6 and P =1.0E-5) or 59% (for OR = 3.2 and P =1.0E-7). When the MAF was 0.4, the gains in power were smaller but were obtained for lower ORs (30% for OR = 1.8 and P =1.0E-5 or 35% for OR = 2 and P =1.0E-7). As expected, the CAT_REC test also had the highest power when the simulated model was recessive, but the difference in power with respect to GRM never exceeded 22%. For ORs less than 1.4, there was no difference as all power estimates were close to 0 for all tests.

Using a larger sample size of 2000 cases/2000 controls (results provided in Fig. 2, Additional file 1: Table S1 and S2), similar conclusions could be drawn for the power comparison between GRM and CAT_ADD tests, for all simulated model. However, the strongest gain in power of GRM test versus CAT_ADD test increased and was obtained for smaller ORs. For example, for a MAF of 0.2 the highest gain in power with a recessive simulated model reached 67% (OR = 2.4 and P =1.0E-7) and, when the MAF was 0.4, the power gain reached 40% (OR = 1.6 and P =1.0E-7).

Tests of genetic model

Results for both simulated sample sizes are provided in Fig. 3. The genetic model was tested only for SNP(s) significantly associated with the disease at the critical threshold of 1.0E-5. The test of the genetic model was based on a less stringent threshold of 0.01, as it only applies to SNP(s) showing significant association. When the power to detect association was less than 1%, tests of genetic models were not performed to avoid a bias in the estimation of the true model detection. For a sample of 1000 cases/1000 controls, when data were simulated under an additive model, the true model was retained in most replicates. As expected, the proportion of replicates retaining the true model was close to [1 - type 1 error] ranging between 98 and 99%.

When data were simulated under a dominant model, the true model was retained in most replicates; for an OR greater than 2, the proportion of replicates retaining the true model ranged between 62 and 87%. For an OR less than or equal to 2, this proportion was smaller and depended on the MAF: ranging between 10 and 48% for a MAF of 0.2 and between 45 and 81% for a MAF of 0.4.

When data were simulated under a recessive model, the true model was retained by GRM in more than 70% of replicates (ranging between 72 and 99%) for an OR greater than or equal to 1.6, for all MAFs.

When the data were generated in a larger sample size of 2000 cases/2000 controls, the proportion of replicates retaining the true model was increased for all simulated models (see Fig. 3).

We can notice that not concluding to the true model, when it was dominant or recessive, was mostly due to lack of power to reject an additive model (β_DomDev = 0, see Additional file 2: Figure S1). This lack of power was observed for smallest ORs and decreased when the sample size increased.

Application to the WTCCC Bipolar data

Sample description

We obtained approval for using the raw genotype and phenotypic data for the original WTCCC bipolar disorder (BD) data set. The dataset consisted of 1998 BD cases and 3004 controls genotyped using the Affymetrix 500K array (see WTCCC 2007 [8] for details). We applied similar quality control (QC) filtering as the original WTCCC 2007 study, i.e. 1) individual samples excluded in case of missing data across all SNPs >3% or genome-wide heterozygosity greater than 30% or lower than 23%, 2) SNPs excluded in case of MAF < 5% or significant deviation from HW equilibrium in controls (P <5.7E-7) or between the two controls groups (P <5.7E-7). A total of 371 137 SNPs were retained for analysis.

Test of association

For a critical threshold of 5.0E-7 (as used in the original WTCCC 2007) the GRM test showed significant association of BD with one SNP located in the 16p12 region: rs420259 (P =3.4E-7) (see Table 5 for details), whereas the CAT_ADD test did not (P =9.3E-4). Note that no other SNP was detected by either GRM or CAT_ADD test.

Table 5 Results of GRM association test in bipolar disorder WTCCC case-control sample (WTCCC 20007)

Full size table

Using a less stringent threshold (5.0E-5) to detect “suggestive” association, 10 SNPs (in addition to rs420259) were detected by GRM test. Results are detailed in Table 5a. Among them, 9 SNPs were detected by both CAT_ADD and GRM tests and 1 SNP was detected only by the GRM test.

Test of genetic model

For the SNP rs420259 significantly associated to BD using GRM, the additive model was rejected (P =1.6E-7) and the recessive model was retained (i.e. β_ADD = -β_DomDev was not rejected). A lower risk was observed for the risk allele homozygote carriers, with an Odds-ratio of 0.75 IC (95%) = [0.67 - 0.84]) (see Table 5b for details).

Discussion

Genetic association studies are usually conducted using the CAT_ADD test which is model based and known to be sensitive to model misspecification. Indeed, when there is departure from additivity, this test may lead to decrease in power to detect association [3–5].

Our simulation study showed that the GRM test, which does not make any assumption on the genetic model, is as powerful as or even more powerful to detect association than the CAT_ADD test. An important finding is that GRM and CAT_ADD tests had similar power when the true model was additive. In the latter situation, the decrease in power never exceeded 15%, although the GRM test has an additional degree of freedom as compared to the CAT_ADD test. We also showed that the GRM association test may be more powerful than the CAT_ADD test when the true model was dominant and even more when it was recessive. The gain in power reached 67% for a recessive model when using a significance threshold of 1.0E-7, as currently done in GWAS. This increase in power was higher for increased sample size, especially for low ORs. Thus, the advantage of GRM test over CAT_ADD test will be particularly important for multifactorial diseases where most associated variants have small ORs and which require large sample sizes to detect association.

The two maximin efficiency robust tests which were developed by Freidlin et al [4] to have relatively high power for any of the three additive, dominant and recessive models, are computationally very intensive because of permutation testing. The MAX test which is generally more powerful but even more computationally intensive than MERT [4], has been extended to derive the exact and/or the asymptotic distribution of the test statistic to be less computationally intensive [9]. Note however that this test remains twice as computationally intensive as the logistic regression-based test [10]. Moreover, MAX test is very sensitive to allele frequency: for a frequency lower than 0.3, it has smallest power than CAT_ADD under dominant and additive models [10] whereas GRM has similar power as CAT_ADD. Under other models, MAX test is always less powerful than the genotypic test [10] and consequently than the GRM test, as the genotypic and GRM tests have similar power, as expected (personal data). Based on these findings, we can argue that the power of the MAX test never exceeds the power of the GRM test. Moreover, a power comparison between MAX and GRM tests for a few number of models showed similar or higher power of GRM comparing to MAX (results not shown).

A major advantage of the GRM test is that it allows to test the underlying genetic model in the same modelling framework, whereas the genotypic test, CATs and the MAX tests do not. GRM might also be further developed to estimate and test more complex models, as it has already been done in case of gene x gene interaction [11]. GRM can be applied to association studies of large panels of markers but can also be used to perform gene-based or pathway-based analyses.

Re-analysis of WTCCC cases-controls bipolar disorder data illustrates the gain in power of GRM association test as compared to CAT_ADD test, especially when there is deviation from additivity. Using the classical GWAS threshold of 5.0E-7, the GRM test detected one SNP, significantly associated with BP, whereas CAT_ADD test did not. As expected, deviation from additivity was observed for this SNP and the recessive model was retained.

Ten additional SNPs showed suggestive association at the threshold of 5.0E-5, 9 of these SNPs were detected by both GRM and CAT_ADD tests and one SNP was detected by GRM test only. This shows once again that GRM can not only replicate results of CAT_ADD test but also allows detecting additional SNPs.

Association of BD with the rs420259 SNP, as found here using GRM test, has been initially reported by the Welcome Trust Consortium by applying the genotypic test [8], which represents a general modeling framework as GRM and genotypic tests has similar power. Interestingly, association of the same SNP with BD was also reported by applying either the MAX test [12] or a score-based nonparametric test [13] to the same WTCCC case-control BD data. Moreover, a meta-analysis (including WTCCC, STEP-BD, Iceland and Scandinavia samples; n = 5547 BD cases and 20241 controls) [14] suggested association between rs420259 and BD (P =1.2E-5). However, such association was not further reported by GWAS in extended datasets ([15], see Craddock and Sklar for review [16]), which were based on the CAT_ADD test.

The rs420259 is located in an intron of PALB2 gene which is involved in tumor suppression. Interestingly, the DCTN5 gene is in the immediate vicinity of the PALB2 gene. DCTN5 is known to be involved in intracellular transport, and its knockdown in vitro leads to an abnormal hyper-activity and disrupted development of neural networks [17]. DCTN5 also interacts with DISC1 gene (Disrupted in schizophrenia 1), a gene associated with bipolar disorder in several studies [18].

Conclusions

Overall, the GRM modeling framework is a user-friendly and powerful approach which allows testing for association with disease and for the underlying genetic model. This association test is easy and quick to apply and thus particularly appropriate for association studies of large panels of markers in simple and complex situations.

Abbreviations

BD:: Bipolar disorder
CAT:: Cochran-Armitage trend
CAT_ADD:: Cochran-Armitage additive trend test
CAT_DOM:: Cochran-Armitage dominant trend test
CAT_REC:: Cochran-Armitage recessive trend test
DCTN5:: Dynactin Subunit 5
Df:: Degree of freedom
DISC1:: Disrupted In Schizophrenia 1
GRM:: General regression model
GWAS:: Genome wide association studies
H0:: Null hypothesis
HW:: Hardy-Weinberg
MAF:: Minor allele frequency
MAX:: Maximum of the standardized optimal tests
MERT:: Maximin efficiency robust test
NS:: Non significant
OR:: Odds-ratio
PALB2:: Partner and localizer of BRCA2
QC:: Quality control
S:: Significant
SNP:: Single-nucleotide polymorphism
WTCCC:: Welcome trust case control cohort

References

Armitage P. Tests for linear trends in proportions and frequencies. Biometrics. 1955;11:375–86.
Article Google Scholar
Sasieni PD. From genotypes to genes: Doubling the sample size. Biometrics. 1997;53:1253–61.
Article CAS PubMed Google Scholar
Slager SL, Schaid DJ. Case-control studies of genetic markers: Power and sample size approximations for Armitage’s test for trend. Hum Hered. 2001;52:149–53.
Article CAS PubMed Google Scholar
Freidlin B, Zheng G, Li ZH & Gastwirth J: L.Trend tests for case–control studies of genetic markers: power. sample size and robustness. Hum. Hered 2002 53: 146–152
Schaid DJ, McDonnell SK, Hebbring SJ, Cunningham JM, Thibodeau SN. Nonparametric tests of association of multiple genes with human disease. Am J Hum Genet. 2005;76(5):780–93.
Article CAS PubMed PubMed Central Google Scholar
Wang K. Statistical tests of genetic association for case-control study designs. Biostatistics. 2012;13(4):724–33.
Article PubMed Google Scholar
Wilson SR. A note on the correct definition of additive deviation and dominance deviation. Ann Hum Genet Lond. 1980;44:113.
Article CAS Google Scholar
Wellcome Trust Case Control Consortium. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature. 2007;447:661–78.
Article Google Scholar
So HC, Sham PC. Robust association tests under different genetic models, allowing for binary or quantitative traits and covariates. Behav Genet. 2011;41:768–75.
Article PubMed PubMed Central Google Scholar
Loley C, König IR, Hothorn L, Ziegler A. A unifying framework for robust association testing, estimation, and genetic model selection using the generalized linear model. Eur J Hum Genet. 2013;21(12):1442–8.
Article PubMed PubMed Central Google Scholar
Cordell HJ. Epistasis: what it means, what it doesn't mean, and statistical methods to detect it in humans. Hum Mol Genet. 2002;11(20):2463–8.
Article CAS PubMed Google Scholar
Joo J, Kwak M, Ahn K, Zheng G. A robust genome-wide scan statistic of the wellcome trust case-control consortium. Biometrics. 2009;65:1115–22.
Article PubMed Google Scholar
Jiang Y, Zhang H. Propensity scored-based nonparametric test revealing genetic variants underlying bipolar disorder. Genet Epidemiol. 2011;35:125–32.
Article PubMed PubMed Central Google Scholar
Tesli M, Athanasiu L, Mattingsdal M, et al. Association analysis of PALB2 and BRCA2 in bipolar disorder and schizophrenia in a Scandinavian case-control sample. Am J Med Genet B Neuropsychiatr Genet. 2010;153B(7):1276–82.
Article CAS PubMed Google Scholar
Mühleisen TW, Leber M, Schulze TG, Strohmaier J, et al. Genome-wide association study reveals two new risk loci for bipolar disorder. Nat Commun. 2014;5:3339.
Article PubMed Google Scholar
Craddock N, Sklar P. Genetics of bipolar disorder. Lancet. 2013;381:1654–62.
Article CAS PubMed Google Scholar
MacLaren EJ, Charlesworth P, Coba MP, Grant SG. Knockdown of mental disorder susceptibility genes disrupts neuronal network physiology in vitro. Mol Cell Neurosci. 2011;47(2):93–9.
Article CAS PubMed PubMed Central Google Scholar
Serretti A, Mandelli L. The genetics of bipolar disorder: genome ‘hot regions’, genes, new potential candidates and future directions. Mol Psychiatry. 2008;13(8):742–71.
Article CAS PubMed Google Scholar
Purcell S, Neale B, Todd-Brown K, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Gene. 2007;81(3):559–75.
Article CAS Google Scholar

Download references

Acknowledgements

This study makes use of data generated by the Wellcome Trust Case-Control Consortium. A full list of the investigators who contributed to the generation of the data is available from www.wtccc.org.uk. We thank Cécile Julier from U958 for critical review and for helpful discussions. We are grateful to the INRA MIGALE bioinformatics platform (http://migale.jouy.inra.fr) for providing computational resources.

Funding

Not applicable

Availability of supporting data

As simulated data represent a very large volume of data (around 120 Giga), script used to generate 200000 replicates for the 396 genetic models for each sample sizes (N = 1000 and N = 2000 cases and controls samples) is provided in Additional file 3: Appendix S1.

The data that support the findings of this study are available from the Wellcome Trust Case-Control Consortium but restrictions apply to the availability of these data, which were used under license for the current study, and so are not publicly available. Data are however available from the authors upon reasonable request and with permission of the Wellcome Trust Case-Control Consortium. The Wellcome Trust Case-Control Consortium authors may be contacted at http://www.wtccc.org.uk/.”

Authors’ contributions

FM and MHD jointly conceived the study and designed the simulation models. FM implemented simulation study and conducted all data analyses. FM and MHD interpreted the results and wrote the paper. FD gave conceptual advice. All authors discussed the results and implications and commented on the manuscript at all stages. All authors gave approval of the version to be published.

Competing interests

The authors declared that they have no competing interests.

Consent for publication

Not applicable

Ethics approval and consent to participate

Not applicable

Author information

Authors and Affiliations

Genetic Variation and Human Diseases Unit, UMR-946, Inserm, Université Paris Diderot, Université Sorbonne Paris Cité, Paris, France
Marie-Hélène Dizier & Florence Demenais
Inserm Siège, Université Paris Diderot, Université Sorbonne Paris Cité, Paris, France
Flavie Mathieu

Authors

Marie-Hélène Dizier
View author publications
You can also search for this author in PubMed Google Scholar
Florence Demenais
View author publications
You can also search for this author in PubMed Google Scholar
Flavie Mathieu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Flavie Mathieu.

Additional files

Additional file 1:

Table S1. and Table S2. reported the power of GRM and CAT tests to detect association for a P-value threshold of 1.0E-5 (Table S1) or 1.0E-7 (Table S2) using a sample size of 2000 cases/2000 controls. Table S1. GRM and CAT tests’ powers to detect association (P-value threshold ≤1.0E-5; N = 2000 cases/2000 controls). Table S2. GRM and CAT tests’ powers to detect association (P-value threshold ≤1.0E-7; N = 2000 cases/2000 controls) (ZIP 301 kb)

Additional file 2:

Figure S1. reported the proportion of replicates rejecting the additive model using a threshold P-value of 1.0E-5. Figure S1. Proportion of replicates rejecting the additive model (P = 0.01) among replicates showing significant association (P-value threshold =1.0E-5). (PDF 189 kb)

Additional file 3:

Appendix S1. and Appendix S2. reported shell and Pearl scripts which included PLINK commands [19]) used to analyze real dataset and to perform all simulations, computation of type I error and power estimations. (ZIP 33 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Dizier, MH., Demenais, F. & Mathieu, F. Gain of power of the general regression model compared to Cochran-Armitage Trend tests: simulation study and application to bipolar disorder. BMC Genet 18, 24 (2017). https://doi.org/10.1186/s12863-017-0486-6

Download citation

Received: 18 October 2016
Accepted: 02 March 2017
Published: 10 March 2017
DOI: https://doi.org/10.1186/s12863-017-0486-6

Gain of power of the general regression model compared to Cochran-Armitage Trend tests: simulation study and application to bipolar disorder

Abstract

Background

Results

Conclusions

Similar content being viewed by others

Analysis of Genetic Association Studies Incorporating Prior Information of Genetic Models

Optimal Trend Tests for Genetic Association Studies of Heterogeneous Diseases

Relative performance of gene- and pathway-level methods as secondary analyses for genome-wide association studies

Background

Methods

Association tests

CAT tests

General regression model

Simulation studies

Type one error rate

Comparison of power of association tests

Test of genetic model

Sample size effect

Results

Type one error rate

Comparison of power of association tests

Tests of genetic model

Application to the WTCCC Bipolar data

Sample description

Test of association

Test of genetic model

Discussion

Conclusions

Abbreviations

References

Acknowledgements

Funding

Availability of supporting data

Authors’ contributions

Competing interests

Consent for publication

Ethics approval and consent to participate

Author information

Authors and Affiliations

Corresponding author

Additional files

Additional file 1:

Additional file 2:

Additional file 3:

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation