Abstract
The genetic influences on complex disease traits generally depend on the joint effects of multiple genetic variants, environmental factors, as well as their interplays. Gene × environment (G × E) interactions play vital roles in determining an individual’s disease risk, but the underlying genetic machinery is poorly understood. Traditional analysis assuming linear relationship between genetic and environmental factors, along with their interactions, is commonly pursued under the regression-based framework to examine G × E interactions. This assumption, however, could be violated due to nonlinear responses of genetic variants to environmental stimuli. As an extension to our previous work on continuous traits, we proposed a flexible varying-coefficient model for the detection of nonlinear G × E interaction with binary disease traits. Varying coefficients were approximated by a non-parametric regression function through which one can assess the nonlinear response of genetic factors to environmental changes. A group of statistical tests were proposed to elucidate various mechanisms of G × E interaction. The utility of the proposed method was illustrated via simulation and real data analysis with application to type 2 diabetes.
Similar content being viewed by others
Abbreviations
- BIC:
-
Bayesian information criterion
- BMI:
-
Body mass index
- G × E:
-
Gene–environment interaction
- GENVEA:
-
Gene, Environment Association Studies Consortium
- GWAS:
-
Genome-wide association study
- HPFS:
-
Health Professionals Follow-up Study
- LM:
-
Linear predictor model
- LM-I:
-
Linear predictor model with interaction
- MAF:
-
Minor allele frequency
- NHS:
-
Nurses’ Health Study
- SNP:
-
Single nucleotide polymorphism
- T2D:
-
Type 2 diabetes mellitus
- VC:
-
Varying-coefficient
References
Cai Z, Fan J, Li R (2000) Efficient estimation and inferences for varying-coefficient models. J Am Stat Assoc 95:888–902
Carey VJ, Walters EE, Colditz GA, Caren G, Solomon et al (1997) Body fat distribution and risk of non-insulin-dependent diabetes mellitus in women. The Nurses’ Health Study. Am J Epidemiol 145:614–619
Chan JM, Rimm EB, Colditz GA, Stampfer MJ, Willett WC (1994) Obesity, fat distribution, and weight gain as risk factors for clinical diabetes in men. Diabetes Care 17:961–969
Colditz GA, Hankinson SE (2005) The Nurse’s Health Study: lifestyle and health among women. Nat Rev Cancer 5:388–396
Cornelis MC, Agrawal A, Cole JW, Hansel NN, Barnes KC et al (2010) The Gene, Environment Association Studies consortium (GENEVA): maximizing the knowledge obtained from GWAS by collaboration across studies of multiple conditions. Genet Epidemiol 34:364–372
Cornelis MC, Tchetgen Tchetgen EJ, Liang L, Qi L, Chatterjee N, Hu FB, Kraft P (2011) Gene–environment interactions in genome-wide association studies: a comparative study of tests applied to empirical studies of type 2 diabetes. Am J Epidemiol 175:191–202. doi:10.1093/aje/kwr368
Fan J, Zhang W (2008) Statistical methods with varying coefficient models. Stat Interface 1:179–195
Feinberg AP (2004) Phenotypic plasticity and the epigenetics of human disease. Nature 447:433–440
Feinberg AP, Irizarry RA (2010) Stochastic epigenetic variation as a driving force of development, evolutionary adaptation, and disease. Proc Natl Acad Sci USA 107:1757–1764
Grant SF, Thorleifsson G, Reynisdottir I, Benediktsson R, Manolescu A et al (2006) Variant of transcription factor 7-like 2 (TCF7L2) gene confers risk of type 2 diabetes. Nat Genet 38:320–323
Holbrook TL, Barrett-Connor E, Wingard DL (1989) The association of lifetime weight and weight control patterns with diabetes among men and women in an adult community. Int J Obes 13:723–729
Huang JZ, Wu CO, Zhou L (2002) Varying-coefficient models and basis function approximations for the analysis of repeated measurements. Biometrika 89:111–128
Huang J, Wu C, Zhou L (2004) Polynomial spline estimation and inference for varying coefficient models with longitudinal data. Stat Sin 14:763–788
Jin T, Liu L (2008) The Wnt signaling pathway effector TCF7L2 and type 2 diabetes mellitus. Mol Endocrinol 22:2383–2392
Liu L, Li Y, Tollefsbol TO (2008) Gene–environment interactions and epigenetic basis of human diseases. Curr Issues Mol Biol 10:25–36
Laitala VS, Kaprio J, Silventoinen K (2008) Genetics of coffee consumption and its stability. Addiction 103:2054–2061
Ma SJ, Yang LJ, Romero R, Cui YH (2011) Varying coefficient model for gene–environment interaction: a non-linear look. Bioinformatics 27(15):2119–2126
Gamboa-Melndez MA, Huerta-Chagoya A, Moreno-Macas H, Vzquez-Crdenas P et al (2012) Contribution of common genetic variation to the risk of type 2 diabetes in the Mexican Mestizo population. Diabetes 61:3314–3321. doi:10.2337/db11-0550
Martinez JA, Corbalan MS, Sanchez-Villegas A et al (2003) Obesity risk is associated with carbohydrate intake in women carrying the Gln27Glu beta2-adrenoceptor polymorphism. J Nutr 133:2549–2554
McCarthy MI (2010) Genomics, type 2 diabetes, and obesity. N Engl J Med 363:2339–2350
Mukherjee B, Ahn J, Gruber SB, Chatterjee N (2012) Testing gene–environment interaction in large-scale case–control association studies: possible choices and comparisons. Am J Epidemiol 175:177–190
Patel CJ, Chen R, Kodama K, Ioannidis JP, Butte AJ (2013) Systematic identification of interaction effects between genome- and environment-wide associations in type 2 diabetes mellitus. Hum Genet 132:495–508. doi:10.1007/s00439-012-1258-z
Peacock M, Turner CH, Econs MJ, Foroud T (2002) Genetics of osteoporosis. Endocr Rev 23:303–326
Perry JRB, Voight BF, Yengo L, Amin N, Dupuis J et al (2012) Stratifying type 2 diabetes cases by BMI identifies genetic risk variants in LAMA1 and enrichment for risk variants in lean compared to obese cases. PLoS Genet 8(5):e1002741. doi:10.1371/journal.pgen.1002741
Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, Reich D (2006) Principal components analysis corrects for stratification in genome-wide association. Nat Genet 38:904–909
Qi L, Cho YA (2008) Gene–environment interaction and obesity. Nutr Rev 66:684–694
Qi L, Cornelis MC, Kraft P et al (2010) Genetic variants at 2q24 are associated with susceptibility to type 2 diabetes. Hum Mol Genet 19:2706–2715
Rimm EB, Giovannucci EL, Willett WC, Colditz GA, Ascherio A, Rosner B, Stampfer MJ (1991) Prospective study of alcohol consumption and risk of coronary disease in men. Lancet 338:464–468
Sparrow DB et al (2012) A mechanism for gene–environment interaction in the etiology of congenital scoliosis. Cell 149:295–306
Vaccaro O, Boemi M, Cavalot F, De Feo P, Miccoli R, Patti L, Rivellese AA, Trovati M, Ardigo D, Zavaroni I (2008) The clinical reality of guidelines for primary prevention of cardiovascular disease in type 2 diabetes in Italy. Atherosclerosis 198:396–402
Zimmet P, Alberti KGMM, Shaw J (2001) Global and societal implications of the diabetes epidemic. Nature 414:782–787. doi:10.1038/414782a
Acknowledgments
The authors wish to thank three anonymous referees for their constructive comments that greatly improved the manuscript. This work was partially supported by NSF grant DMS-1209112 and by National Natural Science Foundation of China grant 31371336. Funding support for the GWAS of Gene and Environment Initiatives in Type 2 Diabetes was provided through the NIH Genes, Environment and Health Initiative [GEI] (U01HG004399). The datasets used for the analyses described in this manuscript were obtained from dbGaP at http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000091.v2.p1, through dbGaP accession number phs000091.v2.p1.
Conflict of interest
The authors declare no conflict of interest.
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Wu, C., Cui, Y. A novel method for identifying nonlinear gene–environment interactions in case–control association studies. Hum Genet 132, 1413–1425 (2013). https://doi.org/10.1007/s00439-013-1350-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00439-013-1350-z