Background

Fifteen replicated gene variants for type 2 diabetes (T2D) have been assessed [1]. While more variants remain to be uncovered, challenge exists in better understanding of the interplay among these variants underneath complex T2D pathophysiology and pathogenesis. These variants were either common with negligible effect or rare with relatively larger effect size [1]. In reality, it is very likely that the individual main effect of each variant is undetectable, but collectively their effect emerges. Concurrent increased triglyceride (TG) and decreased high-density lipoprotein (HDL) were characteristics of subjects with insulin resistance, mostly seen in patients with T2D. The inverse relationship and combined information of these two measures, TG/HDL ratio, represents a single inherited phenotype [2] as a surrogate for insulin resistance [3]. In this analysis, we hypothesized that multiple variants jointly confer T2D susceptibility, and we wanted to examine main and interacting effects of the reported variants in the Offspring Cohort of the Framingham Heart Study (FHS) using different statistical analysis approaches.

Methods

Subjects

All participants in the FHS were Caucasians. Their characteristics were described elsewhere [4]. In brief, participants with diabetes comprised 7.59% (28/369), 9.91% (242/2,441), and 2.38% (95/3,997) of the Original, Offspring, and Children Cohorts, respectively. Average age of diagnosis of diabetes at study visit were 66, 57, and 46 years in the three cohorts, respectively. From a total of 6,807 members in 1,157 families, 228 families had members with diabetes (3,217 members, 366 diabetes cases); 46% (1,489/3,217) were men. In the Offspring Cohort, 155 families had members with diabetes (727 members) who had valid fasting TG and HDL measures at 4 time points. In the Offspring Cohort families with member with diabetes, the average age was 33.8, 46.4, 53.3, and 60.2 years, average BMI was 25.2, 26.2, 27.6, and 28.3 kg/m2, average TG was 83.7, 108.5, 134.0, and 135.5 mg/dl, average HDL was 51.6, 52.6, 51.6, and 54.5 mg/dl, and average TG/HDL ratio was 1.9, 2.5, 3.1, and 3.0, at Visit 1, 3, 5, and 7, respectively. Every study subject provided written informed consent. The study was approved by Boston University Institutional Review Board.

Genotyping

Affymetrix 100 k single-nucleotide polymorphisms (SNPs) and genotype annotation resources were described elsewhere [4]. Identical variants or variants in the same intron/exon regions were identified using Affymetrix 500 k SNPs.

Statistical analysis

Covariates included age, age2, and sex. BIMBAM (Bayesian imputation-based association mapping) [5] was used to assess single-SNP effect according to Bayes Factor (BF) and p-value (based on 10,000 permutations). This was followed by logistic-GEE (generalized estimating equations) single-SNP test under general and additive assumptions. BIMBAM output corrected p for multiple testing. BIMBAM was further used to identify the best multiple-SNP combination with the highest BF. LOGREG [6] was also used to find the best multiple SNP combination as well as parameter estimates. Findings from these two approaches were compared. The best interacting variants may be tested for main and interacting effects using the logistic-GEE approach. Further, TG/HDL ratios at the four visits in the families in the Offspring Cohort with member with diabetes were used to derive TG/HDL ratio change (slope) via the mixed growth curve model. The mixed sandwich estimator approach was used to assess associations between the identified interacting variants and the slope phenotype.

Results

Single-SNP test results are presented in Table 1 (BIMBAM) and Table 2 (logistic-GEE, general model). Three variants were found to significant under additive assumption: CDKN2B rs10811661 (p = 0.0011), TCF7L2 rs4506565 (p = 0.0043), and JAZF1 rs864745 (p = 0.0402). Multiple SNP test results are also given in Table 1. LOGREG results, consistent with the BIMBAM finding, suggested almost the same best interacting variants. Interaction between CDKN2B rs10811661 and TCF7L2 rs4506565 was significantly detected using the logistic-GEE model (OR = 3.0, p = 0.02 and 0.01 and 0.05 for main and interacting effects). For the slope trait, single-SNP test results were non-significant for CDKN2B rs10811661 (p = 0.4621), but significant for TCF7L2 rs4506565 (p = 0.0379) and JAZF1 rs864745 (p = 0.0349) in the mixed sandwich estimator additive model. Interaction between CDKN2B rs10811661 and TCF7L2 rs4506565 variants were significant (β = 1.6667, p = 0.0002, 0.0341, and 0.0012 for main and interacting effects), and it was associated with high slope values, which may be expressed as an increase in insulin resistance across ages.

Table 1 Search results for best single and multiple SNP associations with T2D status
Table 2 Single-SNP association test results for T2D status confirmed by regression analysis

Discussion

Our finding of TCF7L2 intron 3 variant for T2D is in line with a previous FHS genome-wide association study report [7] and a good number of replicating reports across large and independent study cohorts. Evidence of CDKN2B near/promoter and JAZF1 intron 1 variants for T2D risk was also found. We thought the T2D families in FHS may contain an enriched genetic component for SNP association assessment of T2D gene variants. Currently, functions of these two genes are yet not clear, but as is the case for TCF7L2 and other confirmed T2D genes [1], they may be directly or indirectly involved in pancreatic β-cell failure, insulin resistance, and lipid toxicity. Our finding of interactions among the three variants for T2D risk was interesting and novel, which may help disentangle underlying gene pathways and further our understanding of T2D etiology and pathophysiology.

Correction for multiplicity to minimize the false-positive declaration rate seems to be less sensitive for this analysis for two reasons. First, we aimed to identify and evaluate the best combination of SNPs from confirmed T2D variants. Second, the FHS T2D status information was obtained at the initial visit of the study, and 'new' T2D cases may have remained in the 'non-diabetic' control group after decades of follow-up. This would bias our association findings towards reduction of type 1 error. The results of BIMBAM were consistent with those of logistic-GEE analyses for the single-SNP main effect; likewise, BIMBAM and logic regression results for multiple-SNP joint/interacting effect were also consistent. These synthetic approaches apparently worked well in association detection for both main and joint effects of candidate gene variants with prior knowledge.

This 500 k SNP genotype data allowed us to examine 15 of the 18 confirmed exact variants (genotypic imputation was not applied). While reproducibility may be cohort-specific and sensitive to sample heterogeneity and size, it was a bit surprising that we could only identify three meaningful variants. It is very possible that this was due to the presence in the control groups of persons with diabetes who had not received a clinical diagnosis. We did not use a transmission-disequilibrium test-like FBAT program for this analysis simply because we wanted to maximize the sample size by using all available family information. We did not correct the data for potential effect of body max index because obesity coexisting with T2D is very likely rooted in the same pathway genes among most T2D patients, at least in most Western countries.

We were excited to find that the identified interacting variants also affected the TG/HDL ratio. Using the ROC/AUC (receiver operator characteristic/area under the curve) model, we found that it was an imperfect predictor of T2D with best sensitivity and specificity of approximately 75%. Applying a mixed growth curve approach to model longitudinal multiple measures of the TG/HDL ratio proved useful. The derived slope phenotype represented the TG/HDL ratio change across decades of observation time. This approach is advantageous in handling missing data points, as well as remote values, especially in intervention and interaction studies. Genetic heritability of the slope phenotype was 20%, and these three variants were individually and interactively associated with an increase of the slope phenotype. Previously, it was thought that the TCF7L2 variant was only associated with β-cell function failure but not insulin resistance [1]. Our current finding suggests that TCF7L2, together with JAZF1 variants, may indirectly impart insulin resistance through joint modulation of lipid metabolism.

In conclusion, using the FHS T2D family data and different statistical approaches, we found variants in the TCF7L2, CDKN2B, and JAZF1 genes were independently and interactively associated with increased T2D risk and the TG/HDL ratio change across ages. For the remaining T2D risk conferring gene variants, we did not find significant association evidence in this analysis.