Skip to main content
Log in

A novel method for identifying nonlinear gene–environment interactions in case–control association studies

  • Original Investigation
  • Published:
Human Genetics Aims and scope Submit manuscript

Abstract

The genetic influences on complex disease traits generally depend on the joint effects of multiple genetic variants, environmental factors, as well as their interplays. Gene × environment (G × E) interactions play vital roles in determining an individual’s disease risk, but the underlying genetic machinery is poorly understood. Traditional analysis assuming linear relationship between genetic and environmental factors, along with their interactions, is commonly pursued under the regression-based framework to examine G × E interactions. This assumption, however, could be violated due to nonlinear responses of genetic variants to environmental stimuli. As an extension to our previous work on continuous traits, we proposed a flexible varying-coefficient model for the detection of nonlinear G × E interaction with binary disease traits. Varying coefficients were approximated by a non-parametric regression function through which one can assess the nonlinear response of genetic factors to environmental changes. A group of statistical tests were proposed to elucidate various mechanisms of G × E interaction. The utility of the proposed method was illustrated via simulation and real data analysis with application to type 2 diabetes.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Abbreviations

BIC:

Bayesian information criterion

BMI:

Body mass index

G × E:

Gene–environment interaction

GENVEA:

Gene, Environment Association Studies Consortium

GWAS:

Genome-wide association study

HPFS:

Health Professionals Follow-up Study

LM:

Linear predictor model

LM-I:

Linear predictor model with interaction

MAF:

Minor allele frequency

NHS:

Nurses’ Health Study

SNP:

Single nucleotide polymorphism

T2D:

Type 2 diabetes mellitus

VC:

Varying-coefficient

References

  • Cai Z, Fan J, Li R (2000) Efficient estimation and inferences for varying-coefficient models. J Am Stat Assoc 95:888–902

    Article  Google Scholar 

  • Carey VJ, Walters EE, Colditz GA, Caren G, Solomon et al (1997) Body fat distribution and risk of non-insulin-dependent diabetes mellitus in women. The Nurses’ Health Study. Am J Epidemiol 145:614–619

  • Chan JM, Rimm EB, Colditz GA, Stampfer MJ, Willett WC (1994) Obesity, fat distribution, and weight gain as risk factors for clinical diabetes in men. Diabetes Care 17:961–969

    Article  PubMed  CAS  Google Scholar 

  • Colditz GA, Hankinson SE (2005) The Nurse’s Health Study: lifestyle and health among women. Nat Rev Cancer 5:388–396

    Article  PubMed  CAS  Google Scholar 

  • Cornelis MC, Agrawal A, Cole JW, Hansel NN, Barnes KC et al (2010) The Gene, Environment Association Studies consortium (GENEVA): maximizing the knowledge obtained from GWAS by collaboration across studies of multiple conditions. Genet Epidemiol 34:364–372

    Article  PubMed  Google Scholar 

  • Cornelis MC, Tchetgen Tchetgen EJ, Liang L, Qi L, Chatterjee N, Hu FB, Kraft P (2011) Gene–environment interactions in genome-wide association studies: a comparative study of tests applied to empirical studies of type 2 diabetes. Am J Epidemiol 175:191–202. doi:10.1093/aje/kwr368

    Google Scholar 

  • Fan J, Zhang W (2008) Statistical methods with varying coefficient models. Stat Interface 1:179–195

    Article  PubMed  Google Scholar 

  • Feinberg AP (2004) Phenotypic plasticity and the epigenetics of human disease. Nature 447:433–440

    Article  Google Scholar 

  • Feinberg AP, Irizarry RA (2010) Stochastic epigenetic variation as a driving force of development, evolutionary adaptation, and disease. Proc Natl Acad Sci USA 107:1757–1764

    Article  PubMed  CAS  Google Scholar 

  • Grant SF, Thorleifsson G, Reynisdottir I, Benediktsson R, Manolescu A et al (2006) Variant of transcription factor 7-like 2 (TCF7L2) gene confers risk of type 2 diabetes. Nat Genet 38:320–323

    Article  PubMed  CAS  Google Scholar 

  • Holbrook TL, Barrett-Connor E, Wingard DL (1989) The association of lifetime weight and weight control patterns with diabetes among men and women in an adult community. Int J Obes 13:723–729

    PubMed  CAS  Google Scholar 

  • Huang JZ, Wu CO, Zhou L (2002) Varying-coefficient models and basis function approximations for the analysis of repeated measurements. Biometrika 89:111–128

    Article  Google Scholar 

  • Huang J, Wu C, Zhou L (2004) Polynomial spline estimation and inference for varying coefficient models with longitudinal data. Stat Sin 14:763–788

    Google Scholar 

  • Jin T, Liu L (2008) The Wnt signaling pathway effector TCF7L2 and type 2 diabetes mellitus. Mol Endocrinol 22:2383–2392

    Article  PubMed  CAS  Google Scholar 

  • Liu L, Li Y, Tollefsbol TO (2008) Gene–environment interactions and epigenetic basis of human diseases. Curr Issues Mol Biol 10:25–36

    PubMed  CAS  Google Scholar 

  • Laitala VS, Kaprio J, Silventoinen K (2008) Genetics of coffee consumption and its stability. Addiction 103:2054–2061

    Article  PubMed  Google Scholar 

  • Ma SJ, Yang LJ, Romero R, Cui YH (2011) Varying coefficient model for gene–environment interaction: a non-linear look. Bioinformatics 27(15):2119–2126

    Article  PubMed  CAS  Google Scholar 

  • Gamboa-Melndez MA, Huerta-Chagoya A, Moreno-Macas H, Vzquez-Crdenas P et al (2012) Contribution of common genetic variation to the risk of type 2 diabetes in the Mexican Mestizo population. Diabetes 61:3314–3321. doi:10.2337/db11-0550

    Google Scholar 

  • Martinez JA, Corbalan MS, Sanchez-Villegas A et al (2003) Obesity risk is associated with carbohydrate intake in women carrying the Gln27Glu beta2-adrenoceptor polymorphism. J Nutr 133:2549–2554

    PubMed  CAS  Google Scholar 

  • McCarthy MI (2010) Genomics, type 2 diabetes, and obesity. N Engl J Med 363:2339–2350

    Google Scholar 

  • Mukherjee B, Ahn J, Gruber SB, Chatterjee N (2012) Testing gene–environment interaction in large-scale case–control association studies: possible choices and comparisons. Am J Epidemiol 175:177–190

    Article  PubMed  Google Scholar 

  • Patel CJ, Chen R, Kodama K, Ioannidis JP, Butte AJ (2013) Systematic identification of interaction effects between genome- and environment-wide associations in type 2 diabetes mellitus. Hum Genet 132:495–508. doi:10.1007/s00439-012-1258-z

    Google Scholar 

  • Peacock M, Turner CH, Econs MJ, Foroud T (2002) Genetics of osteoporosis. Endocr Rev 23:303–326

    Article  PubMed  CAS  Google Scholar 

  • Perry JRB, Voight BF, Yengo L, Amin N, Dupuis J et al (2012) Stratifying type 2 diabetes cases by BMI identifies genetic risk variants in LAMA1 and enrichment for risk variants in lean compared to obese cases. PLoS Genet 8(5):e1002741. doi:10.1371/journal.pgen.1002741

  • Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, Reich D (2006) Principal components analysis corrects for stratification in genome-wide association. Nat Genet 38:904–909

    Article  PubMed  CAS  Google Scholar 

  • Qi L, Cho YA (2008) Gene–environment interaction and obesity. Nutr Rev 66:684–694

    Article  PubMed  Google Scholar 

  • Qi L, Cornelis MC, Kraft P et al (2010) Genetic variants at 2q24 are associated with susceptibility to type 2 diabetes. Hum Mol Genet 19:2706–2715

    Article  PubMed  CAS  Google Scholar 

  • Rimm EB, Giovannucci EL, Willett WC, Colditz GA, Ascherio A, Rosner B, Stampfer MJ (1991) Prospective study of alcohol consumption and risk of coronary disease in men. Lancet 338:464–468

    Article  PubMed  CAS  Google Scholar 

  • Sparrow DB et al (2012) A mechanism for gene–environment interaction in the etiology of congenital scoliosis. Cell 149:295–306

    Article  PubMed  CAS  Google Scholar 

  • Vaccaro O, Boemi M, Cavalot F, De Feo P, Miccoli R, Patti L, Rivellese AA, Trovati M, Ardigo D, Zavaroni I (2008) The clinical reality of guidelines for primary prevention of cardiovascular disease in type 2 diabetes in Italy. Atherosclerosis 198:396–402

    Article  PubMed  CAS  Google Scholar 

  • Zimmet P, Alberti KGMM, Shaw J (2001) Global and societal implications of the diabetes epidemic. Nature 414:782–787. doi:10.1038/414782a

    Article  PubMed  CAS  Google Scholar 

Download references

Acknowledgments

The authors wish to thank three anonymous referees for their constructive comments that greatly improved the manuscript. This work was partially supported by NSF grant DMS-1209112 and by National Natural Science Foundation of China grant 31371336. Funding support for the GWAS of Gene and Environment Initiatives in Type 2 Diabetes was provided through the NIH Genes, Environment and Health Initiative [GEI] (U01HG004399). The datasets used for the analyses described in this manuscript were obtained from dbGaP at http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000091.v2.p1, through dbGaP accession number phs000091.v2.p1.

Conflict of interest

The authors declare no conflict of interest.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yuehua Cui.

Electronic supplementary material

Below is the link to the electronic supplementary material.

PDF (224 KB)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wu, C., Cui, Y. A novel method for identifying nonlinear gene–environment interactions in case–control association studies. Hum Genet 132, 1413–1425 (2013). https://doi.org/10.1007/s00439-013-1350-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00439-013-1350-z

Keywords

Navigation