Introduction

Type 2 diabetes affects ~400 million people globally [1]. The prevalence of type 2 diabetes has increased substantially in Asian populations, and in China it is estimated that 100 million adults (~11% of the adult population) are affected [2]. Lifestyle factors (e.g. physical inactivity), nutrition transitions and increased adiposity are the chief determinants of type 2 diabetes, but genetic factors also play an important role.

Genome-wide association studies (GWAS) and large-scale genotyping studies (e.g. MetaboChip and ExomeChip genotyping arrays) have identified more than 90 type 2 diabetes associated risk loci [328]. GWAS and replication studies conducted in a range of ancestry groups have revealed that most common-variant susceptibility loci are shared across ethnic groups [24, 29, 30]. While many type 2 diabetes susceptibility variants identified in Europeans have been successfully replicated in East Asians, failure to replicate (e.g. at ADCY5, NOTCH2 and PRC1) likely arises from poor coverage by genotyping arrays, ethnic differences in allele frequency, variable linkage disequilibrium (LD) and limited statistical power.

As the number of type 2 diabetes associated variants has increased, so has the value in including genetic data in models to predict type 2 diabetes risk, weighting individual genetic variants according to their reported effect size [31]. However, effect estimates obtained from GWAS using case–control studies are often inflated due to spectrum bias and/or ‘winner’s curse’ [32, 33]. Spectrum bias describes the overestimation of test performance that can arise from studying ‘clear-cut’ cases or extremes of the underlying distribution (so-called ‘extreme phenotypes’) [32]. ‘Winner’s curse’ refers to the upward bias in the estimated effect of a newly identified variant, particularly when there is limited power to detect the true association [33]. To avoid these biases, large-scale population-based studies are required to obtain robust population-specific estimates of both individual and joint effects of GWAS-identified variants. The availability of such data remains limited, especially in non-European populations, and this motivated us to obtain population-based estimates of effect size in the China Kadoorie Biobank (CKB) study.

In addition, we constructed genetic risk scores (GRSs) to investigate the separate genetic effects on diabetes of SNPs that have been associated with beta cell dysfunction or insulin resistance (IR). The association between certain genetic variants and type 2 diabetes risk has been reported to vary according to obesity status [34], which could impact on the utility of predictive models. Therefore, we assessed whether associations of these GRSs with diabetes were modified in individuals with different degrees of adiposity.

Methods

Study population

The study sample consisted of 93,131 individuals with genotype data, randomly selected from the CKB study (www.ckbiobank.org), a prospective cohort of 512,891 Chinese adults. Details of the study design, protocol, procedures and characteristics of CKB have been described elsewhere [35]. Briefly, the baseline survey took place from June 2004 to July 2008 in ten geographically defined areas (5 urban, 5 rural) across China. In each study area, permanent residents were identified through official residential records, and invited to participate in the study. All participants are prospectively followed up for cause-specific mortality, morbidity and hospitalisation, using China CDC’s Disease Surveillance Points and linkages to the national health insurance claim databases [35]. Information about socio-demographic, lifestyle, medical history and current medication were collected by laptop-based questionnaires. Physical measurements were recorded including height, weight, waist and hip circumferences, and bio-impedance (Tanita BC-418MA, Tokyo, Japan). Except in one study area, where the protocol included fasting by all participants, initial screening for hyperglycaemia involved immediate on-site testing of non-fasting blood glucose using the SureStep Plus meter (LifeScan, Milpitas, CA, USA). Participants with non-fasting glucose levels ≥7.8 and <11.1 mmol/l were invited to return for a fasting blood glucose test the next day.

Diabetes was defined either as a self-report of physician diagnosis of diabetes or screen-detected diabetes, as previously reported [36]. For self-reported diabetes, those with an onset under age 30 and currently treated with insulin were considered as type 1 diabetes, and were excluded from the present analyses. Screen-detected diabetes was defined as no prior history of diabetes with a blood glucose level meeting any one of the following criteria, if applicable: (1) a random blood glucose level ≥7.0 mmol/l and a fasting time >8 h; (2) a random blood glucose level ≥11.1 mmol/l and a fasting time <8 h; (3) a fasting blood glucose level ≥7.0 mmol/l. For the current analysis, follow-up data were collected up to 31 December 2013 (Snapshot Database Release 9, April 2015). We combined all cases of prevalent (5,483) and incident (1,626) diabetes to give a total of 7,109 cases and 86,022 non-diabetes controls. All participants provided written informed consent for follow-up and long-term storage of biological samples.

Genotyping

A panel of 384 single nucleotide polymorphisms (SNPs), selected on the basis of prior association with cardiovascular disease, risk factors and related phenotypes, were genotyped in 95,680 randomly selected individuals from CKB on the Illumina Golden Gate platform at the BGI laboratory in Shenzhen, China. A total of 93,131 individuals aged 30–79 years passed quality control criteria (call rate ≥98%, no sex mismatch, heterozygosity F statistic SD score <5). SNPs with low call rate (<95%) or Hardy–Weinberg disequilibrium (p < 0.05/384 = 1.3 × 10−4) were excluded. Mean genotyping concordance was 99.98% (range 98.66–100%) based on 2,063 duplicate samples included for quality control (QC) purposes. The SNP panel included 59 GWAS-identified type 2 diabetes risk variants reported by October 2012, of which five were originally reported in South Asians, 15 in East Asians and 36 in Europeans. These lead SNPs were selected based on the available association data from East Asian populations and/or fine-mapping data in Europeans at the time of array design and manufacturing. Since HNF1A rs12427353 is monomorphic in East Asians and genotyping of two variants (PEPD rs3786897, KCNK16 rs3734618) failed QC, data were available for 56 variants (Fig. 1). The majority of SNPs were successfully genotyped in all selected samples except for four (WFS1 rs10010131, DGKB rs2191349, RASGRP1 rs7403531 and GRK5 rs10886471), which were genotyped only in batches comprising subsets of the cohort (49%, 80%, 90% and 90% of participants, respectively). Estimates of relatedness based on 235 independent SNPs, using the R package SNPRelate [37], identified that 19% of participants had at least one first-degree relative among those genotyped. We present the results for the full dataset, but exclusion of 10,654 participants to eliminate first-degree relationships had no appreciable effect on individual results or our overall conclusions. Genomic inflation was estimated at λ 1000 = 1.06–1.08, based on associations for 235 independent SNPs.

Fig. 1
figure 1

Flow chart of the analyses. T2D, type 2 diabetes

GRSs

The risk variants at DUSP8/INS, DUSP9 and WFS1 were not included in GRS calculations because of parent-of-origin-specific effects, location on the X-chromosome and low genotyping rate, respectively. RBMS1 rs7593730 was also excluded as it was associated with type 2 diabetes only in Europeans. The remaining 52 variants were selected for the overall GRS (GRS-T) (Fig. 1 and Electronic Supplementary Material [ESM] Tables 1, 2). Five types of weighted GRS (using weights derived from the natural logarithm of the per-allele OR) were calculated, using data from: (1) DIAGRAMv3 GWAS meta-analysis (DIAGRAMv3) [15]; (2) GWAS meta-analysis in East Asians (Asian Genetic Epidemiology Network-Type 2 Diabetes Consortium [AGEN-T2D]) [14]; (3) DIAGRAM Metabochip meta-analysis (Metabochip) [15]; (4) a trans-ethnic type 2 diabetes GWAS meta-analysis (TransEthnic) [24]; and (5) a combined meta-analysis of the CKB and trans-ethnic GWAS studies (TransEthnic + CKB) (ESM Fig. 1).

Type 2 diabetes risk variants were classified, based on previously published data concerning their pathophysiological mechanism, as being predominantly related to beta cell dysfunction, IR or neither (ESM Table 1). We updated the strategy proposed by Vassy et al [38] by including more lines of genetic and physiological evidence [15, 3941]. Beta cell dysfunction related SNPs were identified by: (1) association with decreased HOMA of beta cell function (HOMA-B; p < 0.05, β for HOMA-B < 0 for risk allele) in non-diabetic individuals [15]; (2) association with one of the beta cell function indices during an OGTT (p < 0.05, β < 0 for risk allele) [40, 41]; (3) presence in a locus influencing beta cell function according to cluster analysis [40]; and/or (4) the existence of rare variants responsible for forms of monogenic diabetes characterised by insulin secretory failure (such as neonatal diabetes and MODY). IR-related SNPs were identified by: (1) association with increased HOMA-IR (p < 0.05, β > 0 for risk allele) in non-diabetic individuals [41] or decreased insulin sensitivity index (p < 0.05, β < 0 for risk allele) [15]; (2) association with fasting insulin (p < 0.05, β > 0) [39]; (3) presence in a locus influencing insulin sensitivity according to cluster analysis [40]; (4) association with increased triacylglycerol or other IR-related traits [39]; and (5) not acting primarily through obesity (FTO rs9939609, MC4R rs12970134) [15]. Thus, GRSs were constructed from 25 beta cell dysfunction related SNPs (GRS-BC) and seven IR-related SNPs (GRS-IR) (ESM Table 2). Missing genotypes were imputed by assigning the mean genotype for that participant’s regional centre. To make the weighted GRSs easier to interpret and more directly comparable to the unweighted score, values were rescaled as follows: GRS′ = GRS × total number of the risk alleles/(2 × sum of weights). Each point of the rescaled GRS thus corresponded to, on average, one additional risk allele.

Statistical analysis

Departure from Hardy–Weinberg equilibrium was assessed using a 1-df χ2 test. For the primary outcome, logistic regression was used to estimate ORs and 95% CIs of individual variants and GRSs for combined prevalent/incident diabetes, adjusting for age, sex and regional centre. Comparison of effect sizes (loge ORs) between CKB and previous studies was performed by inverse-variance weighted least squares regression through the origin. To combine our results with those from AGEN-T2D [14] or the TransEthnic meta-analysis [24], fixed effects meta-analysis was performed by inverse-variance weighting. We carefully checked the region of recruitment of the studies contributing to AGEN-T2D and found no evidence of overlap with CKB. Floating absolute risks were used to provide estimates of variance across GRS quartiles [42]. BMI cut-point categories were defined according to Asian criteria proposed by the WHO: normal weight (BMI < 23 kg/m2); overweight (23 ≤ BMI < 27.5 kg/m2); obese (BMI ≥ 27.5 kg/m2) [43]. Strata of waist circumference (WC), WHR and percentage body fat (PBF) were defined by sex-specific tertiles. Tests for interaction between adiposity and GRSs used logistic regression models including GRS, adiposity variable of interest and GRS × variable interaction term, with additional adjustment for age, sex and regional centre. Given that all SNPs were previously identified at GWAS significance for type 2 diabetes in Europeans and/or Asians, conventional Bonferroni correction would be overly conservative; we used the Holm–Bonferroni method or permutation procedures to control the family-wise error rate. For completeness, we also present findings using a 5% false discovery rate (Benjamini–Hochberg). In the meta-analyses, Cochran’s Q test was used to assess between-study heterogeneity and Bonferroni correction was used to account for multiple testing (p < 0.05/[55 × 3] = 3.0 × 10−4). The discriminative abilities of unweighted and weighted GRSs for risk of diabetes were assessed by receiver operating characteristic (ROC) curve analysis and compared using the DeLong test [44]. For 1000-fold cross-validation, weights were derived from a repeated analysis excluding randomly selected sets of 0.1% of the CKB sample, and using the resulting weights for the excluded individuals. The proportion of variance in phenotype explained for each SNP or GRS was calculated according to Shim et al [45] using previously reported means and SEs for SNP effect sizes [15]. We estimated power using ORs reported in the original GWAS and sample size and risk allele frequencies of our study with Quanto software (http://biostats.usc.edu/Quanto.html) (ESM Table 3). We investigated regional LD patterns among East Asians (CHB + JPT panel) and Europeans (CEU panel) from HapMap release 27 using the varLD algorithm [46], and presented results as Monte-Carlo p values from 10,000 iterations. All reported p values are nominal and 2-sided. Association analyses were performed using R software version 3.0.2 (www.r-project.org).

Results

Participant characteristics

Among the 93,131 CKB participants, there were 7,109 (7.6%) diabetes cases comprising 2,903 (3.1%) self-reported and 2,580 (2.8%) screen-detected at baseline, and 1,626 (1.7%) incident cases of diabetes that occurred during a mean (SD) of 7.1 (1.3) years follow-up (Table 1). A total of 86,022 participants without diabetes were considered controls. The overall mean BMI was 23.6 kg/m2. Women had slightly higher BMI than men, and also had higher prevalence and incidence of diabetes.

Table 1 Selected characteristics at baseline among 93,131 genotyped participants in CKB

Association with individual variants in the CKB study

Table 2 shows the associations of 56 variants that passed QC with diabetes risk, together with the corresponding values from AGEN-T2D and a meta-analysis of CKB and AGEN-T2D. Risk allele frequencies observed in CKB were comparable with those in the 1000 Genomes Project Phase 3 CHB + CHS population. Sensitivity analysis showed that there was no evidence of heterogeneity between regional centres in the association of loci and risk of diabetes (ESM Tables 4-6). All SNPs were common in CKB (minor allele frequency, MAF > 0.05) except for variants at five loci: (NOTCH2 rs10923931 [MAF = 0.032]; THADA rs7578597 [0.007]; ADCY5 rs11708067 [0.003]; TCF7L2 rs7901695 [0.031]; and PRC1 rs8042680 [0.010]). Among the 56 variants, 48 had effects directionally consistent with those in the original reports (binomial test, p = 2.3 × 10−8) (ESM Tables 7, 8). Five SNPs reached GWAS significance (p < 5 × 10−8) and another 14 variants showed statistically significant association after multiple-testing correction (Holm–Bonferroni, p < 0.05); alternatively, association was replicated for 30 SNPs at 5% false discovery rate (Benjamini–Hochberg). All eight risk loci that were identified in East Asian and Chinese Han GWAS [14, 18] showed consistent effect directions. Among them, MAEA and GLIS3 loci were significantly associated with type 2 diabetes after correction for multiple testing.

Table 2 Associations of previously identified type 2 diabetes susceptibility variants with risk of type 2 diabetes in CKB and meta-analysis in East Asian populations

Meta-analysis of CKB and AGEN-T2D studies

Meta-analysis combining the results of the present study with those from AGEN-T2D [14], providing a total of 32,188 cases and 115,633 controls, further improved concordance of effect estimates: after excluding variants identified in AGEN-T2D or its contributing cohorts, 37 of 40 variants were directionally consistent with European populations (binomial test p = 9.73 × 10−9). Ten variants identified in GWAS studies of Europeans, plus nine variants reported in East Asian GWAS studies, were genome-wide significant (Table 2, ESM Tables 7, 8). With the exception of ZFAND3, we found no heterogeneity for the associations at these loci across CKB and AGEN-T2D (Table 2).

Consistency of effect sizes between East Asians and Europeans

Allelic ORs estimated in CKB were highly correlated with those from Europeans (Fig. 2, r = 0.81, p = 2.1 × 10−36). However, there was a clear trend towards lower effect sizes in this population-based study of Chinese than in the predominantly case–control samples of European descent included in GWAS discovery studies, with a mean proportional reduction in loge OR of 19% (95% CI 6, 32; p = 4.8 × 10−3). A very similar reduction in effect size was observed when comparing CKB and AGEN-T2D, which also comprised predominantly case–control cohorts (22%; p = 3.4 × 10−3, ESM Fig. 2a). Effect sizes were also strongly correlated when comparing the meta-analysis of CKB and AGEN-T2D with Europeans (r = 0.85, p = 7.6 × 10−37; proportional reduction of 15%, p = 0.026) (ESM Fig. 2b).

Fig. 2
figure 2

Comparison estimates of effect size (loge (OR)) between Chinese and Europeans. The solid line represents the line of equality, and the result of regression is shown as a broken line with 95% CI

Heterogeneity in effect size was observed at RBMS1 rs7593730 and GCC1-PAX4 rs6467136 (p < 10–4) (ESM Table 9), whose associations with diabetes were significant only in Europeans and East Asians, respectively. This potentially reflects the different LD patterns between East Asians and Europeans at these loci (p < 0.0002 for both, ESM Table 10). A further large difference in estimated effect size between CKB (or AGEN-T2D + CKB) and Europeans, for ADCY5 rs11708067 (OR [95% CI]: 1.92 [1.28, 2.88] vs 1.10 [1.06, 1.15]), likely reflects low power and uncertainty in effect size in CKB: neither the difference nor the diabetes association itself was significant after correction for multiple testing. This SNP shows large differences in MAF (0.003 and 0.217 in Chinese and Europeans, respectively). In general, however, risk allele frequencies were similar in CKB and Europeans (CEU) (ESM Fig. 3, r = 0.62, p = 2.9 × 10−7).

GRSs and type 2 diabetes risk prediction

ROC analysis to assess prediction of diabetes in CKB by GRS-T based on 52 type 2 diabetes risk variants genotyped in the majority of samples showed that, compared with the unweighted risk score (C statistic [95% CI]: 0.574 [0.567, 0.580]), there were significant improvements in discrimination when using risk scores based on weights from previous meta-analyses, TransEthnic in particular (0.590 [0.583, 0.597], p = 3.6 × 10−20, TransEthnic vs unweighted). There was a further small but significant improvement in diabetes prediction by GRS-T using weights from a meta-analysis including CKB (TransEthnic + CKB) (0.593 [0.586, 0.600]; p = 3.0 × 10−12, TransEthnic + CKB vs TransEthnic) (Fig. 3 and ESM Table 11). Although somewhat reduced, there remained an improvement following 1000-fold cross-validation to minimise ‘over-fitting’ (0.591 [0.584, 0.598], p = 1.8 × 10−3). Thus, in terms of diabetes prediction/discrimination this TransEthnic + CKB meta-analysis (ESM Tables 2, 11) provides the best-performing currently available estimates of effect size for these type 2 diabetes associated SNPs.

Fig. 3
figure 3

ROC curves for type 2 diabetes GRS-Ts predicting type 2 diabetes in CKB

As expected, both unweighted and weighted GRS robustly associated with risk of diabetes (ESM Table 12). Individuals in the highest quartile of GRS-T had an OR of 2.34 (2.25, 2.45) compared with the lowest quartile. Note that, for this and subsequent analyses, we present the results of analyses employing TransEthnic-weighted GRSs, which represent the best-performing external weights—i.e. which avoid potential over-fitting and, therefore, do not require unnecessarily complex cross-validation analyses. Conclusions were not materially affected by using unweighted risk scores or other externally weighted scores (ESM Table 12).

To investigate the genetic contribution to diabetes related to beta cell dysfunction or IR, two separate GRSs (GRS-BC and GRS-IR) were calculated based on 25 variants predominantly associated with beta cell dysfunction and seven variants with IR, respectively. Assessments of the proportion of variances in HOMA-B and HOMA-IR explained by these GRSs confirmed that they successfully targeted the phenotype of interest (ESM Table 13). We identified associations of both GRS-BC and GRS-IR with diabetes (ESM Table 12). The ORs for diabetes were 2.17 (2.08, 2.26) and 1.19 (1.14, 1.25) when comparing the extreme quartiles of TransEthnic-weighted GRS-BC and GRS-IR, respectively (p for trend = 4.82 × 10−111 and 1.68 × 10−7) (ESM Fig. 4).

Effect modification by adiposity

Stratified analyses were performed to investigate the possible modifying effects of adiposity on the associations of GRSs with diabetes. Both GRS-T and GRS-BC showed significant interactions with strata for each of BMI, WC, WHR and PBF (p interaction < 1 × 10−4) (Table 3, ESM Table 15). In each case, per unit GRS score effects were greater in participants who were leaner. We also observed a strong inverse association of GRS-BC with BMI, WC and PBF, but not with WHR (ESM Table 15). In contrast, we found no evidence of interaction between GRS-IR and any of BMI, WC, WHR or PBF (p interaction ≥ 0.11).

Table 3 Association of GRSs with type 2 diabetes risk in CKB, overall and by levels of adiposity

Discussion

We tested associations of 56 variants with risk of diabetes in a large-scale population-based study of Chinese adults. The effect sizes for the majority of diabetes loci were broadly similar between Chinese and European populations. However, there was an overall tendency towards lower effect sizes in our unselected population, likely the consequence of reduced bias (spectrum bias and ‘winner’s curse’). Similar reductions in effect size were observed when comparing with previous data from East Asians, so this is unlikely to be a reflection of differences in patterns of LD.

Improved estimates of SNP effect sizes enabled construction of more accurate weighted GRS for disease prediction. Although GRSs alone remain relatively poor predictors of diabetes risk compared with traditional risk models, the increasing numbers of associated SNPs nevertheless afford improvements for risk prediction [31]. Optimally, integration of genotyping data into type 2 diabetes risk prediction models requires reliable, unbiased, population-specific estimates of the effect of risk variants. Most current effect size estimates have been derived from gene discovery studies largely involving case–control samples and may suffer from ‘winner’s curse’ and disease spectrum bias, the latter not being overcome by ever-larger non-population-based cohort studies. Re-estimation in population-based cohort studies of the effects of GWAS-identified loci limits such biases [32, 33]. Thus, the results from this study can be extrapolated to the Chinese general population and used for inclusion of genetic data in type 2 diabetes risk prediction models.

Previous studies have reported that the majority of common variants are shared across different ethnic groups [24, 30, 47, 48]. We have provided further evidence for shared genetic architecture of type 2 diabetes between East Asian and European populations. Further meta-analysis of CKB with published data from AGEN-T2D improved our statistical power to replicate associations originally reported in other ancestries: the number of variants that achieved genome-wide significance was increased from four to ten, and eight additional SNPs showed significant association after multiple-testing correction (ESM Tables 7, 8).

There are several potential reasons for failure to replicate variants or inconsistencies of effect sizes. First, differences in allele frequency between the original discovery population and the replicating studies in other ethnic groups may affect power for replication. Second, as effect sizes of more-recently identified type 2 diabetes loci become smaller (facilitated by ever-larger sample sizes for discovery), a correspondingly larger sample size is needed for replication. Thus, 19 out of the 26 SNPs with the largest effect sizes (OR ≥ 1.08) but none of the remainder reached genome-wide significance in the CKB-AGEN-T2D meta-analysis. Third, differing patterns of LD may mean that genotyped SNPs are less effective proxies for the underlying causal variant in East Asians than in Europeans: at most loci, the SNPs investigated in the present study were identified in European studies. However, this is likely to apply to only a small subset of loci since common-variant associations map to common haplotypes that are broadly shared between Europeans and East Asians [24, 30, 48]. Thus, at loci where there was apparent non-concordance of effects (e.g. RBMS1 rs7593730 and GCC1-PAX4 rs6467136), it remains possible that the same causal variant is present in the two populations and has similar effects. Fourth, these may represent rare instances of ethnic differences in the occurrence of particular causal variants. This may be the case in Europeans for GCC1-PAX4 rs6467136, for which locus there are no SNPs with a type 2 diabetes association at even p < 10−3 within 500 kb [15]. This may be elucidated by future fine-mapping and sequencing studies [30].

We further investigated the genetic influence on disease risk by stratifying on genetic variants related primarily to either beta cell function or insulin sensitivity. While variants identified in early GWAS are mainly implicated in beta cell function, more recent studies have identified several variants with a primary impact on IR [15, 40, 41]. Stratification of type 2 diabetes cases according to the separate contributions of genetic effects on beta cell function and IR has the potential to be informative for so-called ‘precision medicine’.

Consistent with previous reports [10, 34, 49], we identified interactions of GRS with measures of adiposity: GRS-BC had a larger effect size on diabetes among individuals with lower BMI, WC, WHR or PBF. Since we observed a strong inverse association of GRS-BC with BMI, WC and PBF (ESM Table 15), it remains possible that the observed interactions with these measures of adiposity are an artefact of the dual effects of GRS-BC on both diabetes risk and adiposity, for instance due to ‘collider bias’ whereby analyses stratified by a potential mediator can induce new relationships and introduce confounding. However, this is not relevant for the interaction with WHR, which displayed a similar magnitude of interaction despite not showing association with GRS-BC. Therefore, we conclude that these observed effects of adiposity of GRS-BC effect size are likely to be genuine. By contrast, we found no evidence for interactions between GRS-IR and adiposity measures, which may reflect limited power (owing to GRS-IR comprising one-third the number of SNPs compared with GRS-BC and being less strongly associated with diabetes), but our findings are in line with previous findings that IR scores are associated with incident type 2 diabetes independent of body size [50].

In conclusion, we report estimates, expected to be largely free of ‘winner’s curse’ and spectrum bias, of the effect sizes of diabetes risk variants in a general population cohort of Chinese adults. We thereby identify the extent to which previous GWAS based on predominantly case–control studies are affected by these biases. In addition to their utility for improvements in type 2 diabetes risk prediction, these more accurate effect size estimates promise to be a powerful resource for future Mendelian randomisation studies in Chinese cohorts.