Skip to main content

Advertisement

Log in

Population structure, differential bias and genomic control in a large-scale, case-control association study

  • Letter
  • Published:

From Nature Genetics

View current issue Submit your manuscript

Abstract

The main problems in drawing causal inferences from epidemiological case-control studies are confounding by unmeasured extraneous factors, selection bias and differential misclassification of exposure1. In genetics the first of these, in the form of population structure, has dominated recent debate2,3,4. Population structure explained part of the significant +11.2% inflation of test statistics we observed in an analysis of 6,322 nonsynonymous SNPs in 816 cases of type 1 diabetes and 877 population-based controls from Great Britain. The remainder of the inflation resulted from differential bias in genotype scoring between case and control DNA samples, which originated from two laboratories, causing false-positive associations. To avoid excluding SNPs and losing valuable information, we extended the genomic control method2,3,4,5 by applying a variable downweighting to each SNP.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Figure 1: Signal intensity plots.
Figure 2: Quantile-quantile plots of Cochran-Armitage test statistics.
Figure 3
Figure 4
Figure 5

Similar content being viewed by others

References

  1. Breslow, N.E. & Day, N.E. Statistical Methods in Cancer Research Vol. I. The Analysis of Case-Control Studies (International Agency for Research on Cancer, Lyon, 1980).

    Google Scholar 

  2. Devlin, B., Bacanu, S.A. & Roeder, K. Genomic control to the extreme. Nat. Genet. 36, 1129–1130; author reply 1131 (2004).

    Article  CAS  Google Scholar 

  3. Freedman, M.L. et al. Assessing the impact of population stratification on genetic association studies. Nat. Genet. 36, 388–393 (2004).

    Article  CAS  Google Scholar 

  4. Marchini, J., Cardon, L.R., Phillips, M.S. & Donnelly, P. The effects of human population structure on large genetic association studies. Nat. Genet. 36, 512–517 (2004).

    Article  CAS  Google Scholar 

  5. Devlin, B. & Roeder, K. Genomic control for association studies. Biometrics 55, 997–1004 (1999).

    Article  CAS  Google Scholar 

  6. Vella, A. et al. Localization of a type 1 diabetes locus in the IL2RA/CD25 region by use of tag single-nucleotide polymorphisms. Am. J. Hum. Genet. 76, 773–779 (2005).

    Article  CAS  Google Scholar 

  7. Lowe, C.E. et al. Cost-effective analysis of candidate genes using htSNPs: a staged approach. Genes Immun. 5, 301–305 (2004).

    Article  CAS  Google Scholar 

  8. Wang, W.Y., Barratt, B.J., Clayton, D.G. & Todd, J.A. Genome-wide association studies: theoretical and practical concerns. Nat. Rev. Genet. 6, 109–118 (2005).

    Article  CAS  Google Scholar 

  9. Hardenbol, P. et al. Multiplexed genotyping with sequence-tagged molecular inversion probes. Nat. Biotechnol. 21, 673–678 (2003).

    Article  CAS  Google Scholar 

  10. Hardenbol, P. et al. Highly multiplexed molecular inversion probe genotyping: over 10,000 targeted SNPs genotyped in a single tube assay. Genome Res. 15, 269–275 (2005).

    Article  CAS  Google Scholar 

  11. Ueda, H. et al. Association of the T-cell regulatory gene CTLA4 with susceptibility to autoimmune disease. Nature 423, 506–511 (2003).

    Article  CAS  Google Scholar 

  12. The International HapMap Consortium. The International HapMap Project. Nature 426, 789–796 (2003).

  13. Armitage, P. Test for linear trend in proportions and frequencies. Biometrics II, 375–386 (1955).

    Article  Google Scholar 

  14. Mantel, N. Chi-square tests with one degree of freedom: extensions of the Mantel-Haenszel procedure. J. Am. Stat. Assoc. 58, 690–700 (1963).

    Google Scholar 

  15. Nelder, J. & Wedderburn, R. Generalised linear models. J. R. Statist. Soc. A 135, 370–384 (1972).

    Article  Google Scholar 

  16. Moorhead, M. et al. Optimal genotype determination in highly multiplexed SNP data. Eur. J. Hum. Genet. (in the press).

Download references

Acknowledgements

We thank the individuals with T1D and control individuals for their participation; G. Coleman, S. Field, T. Mistry, K. Bourget, S. Clayton, M. Hardy, P. Lauder, M. Maisuria, W. Meadows and S. Wood for preparing DNA samples; D. Strachan, R. Jones, S. Ring and W. McArdle for providing DNA from the 1958 British Birth Cohort collection; and A. Long, N. Naclerio, T. Cormier, K. Tran, C. Bruckner and S. Picton for genotyping and technical assistance. We acknowledge use of DNA from the 1958 British Birth Cohort collection, funded by the Medical Research Council and the Wellcome Trust. We thank the Juvenile Diabetes Research Foundation, the Wellcome Trust, Diabetes UK and the Medical Research Council for financial support. D.G.C. is a Juvenile Diabetes Research Foundation and Wellcome Trust Principal Research Fellow.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to David G Clayton or John A Todd.

Ethics declarations

Competing interests

M. Faham, M.M., H.B.J., M. Falkowski, P.H. and T.D.W. are currently employed by ParAllele Bioscience.

Supplementary information

Rights and permissions

Reprints and permissions

About this article

Cite this article

Clayton, D., Walker, N., Smyth, D. et al. Population structure, differential bias and genomic control in a large-scale, case-control association study. Nat Genet 37, 1243–1246 (2005). https://doi.org/10.1038/ng1653

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/ng1653

  • Springer Nature America, Inc.

This article is cited by

Navigation