Abstract
Association studies are the most powerful method available for identifying modest gene effects in complex disorders, but they often produce inconsistent results. With the rapidly growing SNP databases, haplotype maps and high throughput genotyping, the use of association studies is expected to increase; therefore, it is critical and timely that the problems with study design are identified and fixed. We questioned if unrecognized allele and genotype frequency variations in controls could be responsible for some of the inconsistent association findings. We performed a population genetic study of apolipoprotein E (APOE) and cytochrome P450 2D6 (CYP2D6) in 1,748 individuals ranging in age from newborns to centenarians. Although APOE and CYP2D6 are two of the most commonly used candidate genes, this is the first study to examine age- and gender-specific frequency distributions over the entire age spectrum, using a large, ethnically and geographically uniform population. We found significant, previously unrecognized variations in APOE allele frequencies, and deviations from Hardy-Weinberg expectations in CYP2D6 genotype frequencies starting at birth. The allele frequency variations within controls were larger than some reported case-control differences. We demonstrate that unrecognized frequency fluctuations in controls are a serious and potentially common confounder whose impact on association studies has not been appreciated, and one that can be addressed with proper study design. We recommend that population genetic studies be performed on commonly used candidate markers and that rigorous standards be applied for case-control matching.
Similar content being viewed by others
References
Bathum L, Andersen-Ranberg K, Boldsen J, Brosen K, Jeune B (1998) Genotypes for the cytochrome P450 enzymes CYP2D6 and CYP2C19 in human longevity Role of CYP2D6 and CYP2C19 in longevity. Eur J Clin Pharmacol 54:427–430
Bernardo J, Smith A (2000) Bayesian theory. Wiley, Chichester
Botstein D, Risch N (2003) Discovering genotypes underlying human phenotypes: past successes for mendelian disease, future approaches for complex disease. Nat Genet 33(Suppl):228–237
Davignon J (1994) Apolipoprotein E polymorphism and atherosclerosis. Current Science, London
Davignon J, Bouthillier D, Nestruck AC, Sing CF (1988) Apolipoprotein E polymorphism and atherosclerosis: insight from a study in octogenarians. Trans Am Clin Climatol Assoc 99:100–110
Devlin B, Roeder K, Bacanu SA (2001) Unbiased methods for population-based association studies. Genet Epidemiol 21:273–284
Farrer L, Cupples A, Haines J, Hyman B, Kukull W, Mayeux R, Pericack-vance M, Risch N, van Duijn C (1997) Effects of age, sex and ethnicity on the association between apolipoprotein E genotype and Alzheimer disease. JAMA 278:1349–1356
Freedman ML, Reich D, Penney KL, McDonald GJ, Mignault AA, Patterson N, Gabriel SB, Topol EJ, Smoller JW, Pato CN, Pato MT, Petryshen TL, Kolonel LN, Lander ES, Sklar P, Henderson B, Hirschhorn JN, Altshuler D (2004) Assessing the impact of population stratification on genetic association studies. Nat Genet 36:388–393
Hinds DA, Stuve LL, Nilsen GB, Halperin E, Eskin E, Ballinger DG, Frazer KA, Cox DR (2005) Whole-genome patterns of common DNA variation in three human populations. Science 307:1072–1079
Hirschhorn JN, Lohmueller K, Byrne E, Hirschhorn K (2002) A comprehensive review of genetic association studies. Genet Med 4:45–61
Montgomery DC (1996) Introduction to statistical quality control. Wiley, New York
Payami H, Monte KR, Kaye JA, Wijsman EM, Bird T, Yu C, Heston LL, Schellenberg GD (1995) The Apolipoprotein E E4 allele and sex specific risk of Alzheimer’s disease. JAMA 273:374
Pritchard JK, Donnelly P (2001) Case-control studies of association in structured or admixed populations. Theor Popul Biol 60:227–237
Rea IM, Mc Dowell I, McMaster D, Smye M, Stout R, Evans A (2001) Apolipoprotein E alleles in nonagenarian subjects in the Belfast Elderly Longitudinal Free-living Ageing Study (BELFAST). Mech Ageing Dev 122:1367–1372
Risch N, Merikangas K (1996) The future of genetic studies of complex human diseases. Science 273:1516–1517
Rostami-Hodjegan A, Lennard MS, Woods HF, Tucker GT (1998) Meta-analysis of studies of the CYP2D6 polymorphism in relation to lung cancer and Parkinson’s disease. Pharmacogenetics 8:227–238
Sache C, Brockmoller J, Bauer S, Roots I (1997) Cytochrome P450 2D6 variants in a Caucasian population: allele frequencies and phenotypic consequences. Am J Hum Genet 60:284–295
Schachter F, Faure-Delanef L, Guenot F, Rouger H, Froguel P, Lesueur-Ginot L, Cohen D (1994) Genetic associations with human longevity at the APOE and ACE loci. Nat Genet 6:29–32
Sell SM, Ren K (1997) Automated capillary electrophoresis in the genotyping of apolipoprotein E. Genomics 46:163–164
Wilson PWF, Myers RH, Larson MG, Ordovas JM, Wolf PA, Schaefer EJ (1994) Apolipoprotein E alleles, dyslipidemia, and coronary heart disease. JAMA 272:1666–1671
Zondervan KT, Cardon LR (2004) The complex interplay among factors that influence allelic association. Nat Rev Genet 5:89–100
Acknowledgements
We thank the subjects who participated in this study, and the Oregon Newborn Screening Program for providing the anonymous blood spots. This study was funded in part by the National Institutes of Health grants NS R01-36960 and AG 08017, and by institutional support from the New York State Department of Health Wadsworth Center.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Payami, H., Zhu, M., Montimurro, J. et al. One step closer to fixing association studies: evidence for age- and gender-specific allele frequency variations and deviations from Hardy-Weinberg expectations in controls. Hum Genet 118, 322–330 (2005). https://doi.org/10.1007/s00439-005-0057-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00439-005-0057-1