Background

The prevalence of type 2 diabetes (T2D) is not equal among race-ethnic groups in the United States, with a prevalence of 12.8% in non-Hispanic blacks (NHB), 8.4% in Mexican Americans (MA), and 6.6% in non-Hispanic whites (NHW) aged 20 yrs or older [1]. Diabetes-related complications also differ between race-ethnicities [2] and there is greater impact of diabetes on life-years in minority groups [3]. Race-ethnic differences in environmental exposures and health care experiences [4] likely influence different outcomes for people with diabetes, but genetic differences may also play an important role. Despite recent advances in the study of T2D genetics, relatively little is known about how race-ethnic genetic differences contribute to inter-race variability in diabetes risk or diabetes-related traits.

Percent HbA1c (glycated hemoglobin) is an informative trait for diabetes diagnosis and management. It is accurate in quantifying chronic glycemic exposure of erythrocytes for the preceding 2–3 months, and there is a robust correlation between HbA1c levels and occurrence of diabetes complications [5, 6]. Recently, MAGIC (Meta-analyses of Glucose and Insulin related traits Consortium) identified ten genetic loci associated with HbA1c[7]. The ten loci associated included three loci in or near genes likely involved in glycemic control pathways: G6PC2 (glucose-6-phosphatase catalytic subunit 2, MIM 608058), GCK (glucokinase, maturity onset diabetes of the young (MODY) 2, MIM 138079), and MTNR1B (melatonin receptor 1B, MIM 600804) and seven loci in or near genes likely to be involved in erythrocyte biology, including SPTA1 (spectrin alpha erythrocytic 1, MIM 182860), HFE (hemochromatosis, MIM 235200), ANK1 (ankyrin 1, MIM 612641), HK1 (hexokinase 1, MIM 142600), APT11A (ATPase Class VI, type 11A, MIM 605868), FN3K (fructosamine 3 kinase, MIM 608425), and TMPRSS6 (transmembrane protease serine 6, MIM 609862). Since MAGIC only included individuals of European ancestry, nothing is known about the impact of these risk alleles on HbA1c levels in non-European-ancestry populations.

Given the selection pressure by infectious diseases such as malaria on some erythrocyte-related genes in African populations [810] and the influence of erythrocyte genes on HbA1c[7, 11], we hypothesized that risk alleles at HbA1c-associated loci may have substantial race-ethnic frequency variation and that associations with HbA1c levels may also differ by race. Furthermore, since elevated HbA1c is associated with risk of cardiovascular disease or mortality [1219], we hypothesized that an association between HbA1c-associated SNPs and mortality may exist and there may be race-ethnic differences in this association. Using 11 confirmed HbA1c-associated SNPs at ten loci [7], we compared NHB, MA, and NHW individuals from NHANES (National Health and Nutrition Examination Survey) III to test the hypotheses that there is significant race-ethnic variation in HbA1c risk (HbA1c-raising) allele frequency, risk-allele association with HbA1c levels and risk-allele association with mortality.

Methods

Study subjects from the third national health and nutrition examination survey

NHANES III was a nationally representative sample of the non-institutionalized civilian U.S. population collected using stratified multistage probability sampling. NHANES participants underwent a physical examination, phlebotomy, and a household interview [20]. This study was limited to non-diabetic patients (aged 20 or older) with 8–23 hours of fasting prior to blood sampling. Blood from NHANES III Phase II (1991–1994) participants aged 12 or older were used to generate Epstein-Barr transformed lymphocyte cell lines for DNA extraction. Mortality data (death within a mean of 13.5 years of follow-up) were merged from the NHANES III mortality-linked data file. Race-ethnic group was assigned based on self-report. The survey asked each subject to categorize his/her race as “white,” “black,” or “other” and his/her ethnicity as “Mexican-American,” “other Hispanic,” or “not Hispanic.” Of 3,894 individuals with complete data for analysis, we excluded 149 who were not of NHB, MA or NHW race-ethnicity and 704 with diabetes (293 NHW, 167 NHB and 244 MA), leaving 901 NHB, 909 MA, and 1,231 NHW individuals in the analysis. Written informed consent was obtained from all subjects and this study was approved by the National Center for Health Statistics (NCHS) Ethics Review Board.

Diabetes definition and HbA1c measures

Individuals with diabetes were excluded to avoid the confounding effects of treatment on HbA1c. We defined diabetes as a fasting plasma glucose ≥ 7.0 mmol/L, report of a diagnosis of diabetes or use of hypoglycemic medications. HbA1c levels were measured using HPLC (Bio-Rad DIAMAT glycosylated hemoglobin analyzer system) [21].

SNP genotyping and allele frequencies

Genotyping was performed using Sequenom iPLEX. We genotyped 11 SNPs at ten loci shown among white non-diabetic individuals in MAGIC to have genome-wide significant association with HbA1c.[7] We used SNP rs282606 as a proxy for ATP11A rs7998202 (CEU r2 = 1.0), SNP rs10830956 as a proxy for MTNR1B rs1387153 (CEU r2 = 1.0), and rs2022003 as a proxy for SPTA1 rs2779116 (CEU r2 = 0.927) [r2 for ASW and MEX populations not available]. The minimum call rate for genotyping was 95%. Allele frequencies of all SNPs were in Hardy Weinberg Equilibrium (HWE) based on National Center for Health Statistics standards (HWE rejected if p < 0.01 in ≥ 2 or more race-ethnic groups). We compared NHANES observed allele frequencies with those available from HapMap (http://hapmap.ncbi.nlm.nih.gov/, Release 27, Phases II and III, NCBI build 36), comparing NHW with CEU (Utah residents with Northern and Western European ancestry from the CEPH collection), NHB with ASW (African ancestry in Southwest USA), and MA with MEX (Mexican ancestry in Los Angeles, California).

Genotype risk score

We calculated a genotype risk score to test the collective association with HbA1c of 11 SNPs at 10 loci (2 uncorrelated SNPs at ANK1). We assumed that each SNP was associated with HbA1c based on previous association results in whites, despite potential ancestral differences in NHB or MA in linkage disequilibrium (LD) patterns [22]. Since we did not know the effect size of the MAGIC SNPs in non-white populations, we did not apply SNP-specific weights to account for SNP-specific differences in effect on HbA1c, but simply summed the presence of 0, 1, or 2 risk alleles carried by individuals at each SNP. In addition to the 11-SNP GRS, we also performed a secondary analysis using an eight SNP “non-glycemic” risk score by excluding the three glycemic loci (G6PC2, GCK, MTNR1B) for score calculation.

Statistical analyses of association

We stratified the analysis by race-ethnicity (NHB, MA, and NHW) and to estimate rates and proportions within groups used weights to account for sampling probabilities using methods previously described [23]. P-values for differences across race-ethnic groups were calculated using Satterthwaite adjusted- F statistics for continuous variables and chi-square tests for categorical variables. To estimate the significance of differences in allele frequencies across groups we used Fisher’s Exact tests.

To investigate the relationship between SNPs and HbA1c level we used linear regression and an additive genetic model adjusted for age and sex. We included one SNP at a time in the models for individual SNP associations with HbA1c, with genotypes coded as 0, 1 or 2 depending on the number of HbA1c-raising alleles present. To study the collective effect of the 11 SNPs on HbA1c we used linear regression adjusted for age and sex, totaled the number of risk alleles at all 11 SNPs to calculate a risk score, and tested associations of a per-risk-allele increase in genotype risk score with HbA1c. We calculated the adjusted model R2 with and without the genotype risk score for each group to determine the percent variance in HbA1c explained by genetic effects. The same procedure was carried out for the 8 SNP “non-glycemic” risk score, as well as for genetic associations with mortality (percent dead as of 13.5 years post-baseline exam). To determine if a significant genetic risk score x ethnicity interaction effect on HbA1c exists, we also applied the following linear regression model on the whole sample: Hba1c level (outcome) = sex, age, genetic risk score, ethnicity, genetic risk score x ethnicity interaction. For tests of association with mortality we used logistic regression to estimate the odds of mortality with per-risk-allele increase in HbA1c. For analysis of mortality, Cox models yielded similar results to logistic regression, so Cox model results are not shown. We also applied the following logistic regression model on the whole sample: mortality (outcome) = sex, age, GRS, ethnicity, GRS x ethnicity interaction. For the analyses we used SUDAAN (version 10.0) [24] and SAS (version 9.2, SAS Institute Inc, Cary, NC). We considered p values less than 0.05 to indicate statistical significance, based on one test per previously established SNP at each locus for each hypothesis (SNP is associated with HbA1c; SNP is associated with mortality).

Linkage disequilibrium, signatures of population differentiation and natural selection at HbA1c-associated loci

To evaluate inter-ethnic differences in LD near the SNPs, we examined 500 kb around each SNP (HapMap Release 27, Build 36, phases II and III) for four populations (CEU, YRI, ASW, and MEX). Using Haploview version 4.2, [25] we counted the number of “Gabriel” LD regions (based on confidence intervals) [26] in that region for each population. We investigated natural selection around the ten loci using Haplotter [27] and HapMap Phase II data. Standardized Integrated Haplotype Score (iHS) (a statistic based on differential LD around positively selected alleles that compares haplotype length with ancestral allele versus derived allele to detect positive selection) [27], Fay and Wu’s H + statistic (a measure used to scan a region for allele frequencies that are skewed from the neutral model) [28] and the Fixation Index (FST) (a statistic using allele frequencies to measure genetic divergence between subpopulations) [29] were obtained through Haplotter SNP queries spanning 2 Mb regions at each locus.

Results

Characteristics of participants

NHW individuals were older, had lower BMI and lower mean HbA1c than did NHB and MA individuals (global p values all <0.0001, Table 1).

Table 1 Characteristics of participants by race-ethnicity, Third National Health and Nutrition Examination Survey (NHANES III)

Risk allele frequencies of HbA1c-associated variants

Risk allele frequencies across the 11 loci varied widely within the three race-ethnic groups ( Additional file 1 Table S1). Six out of 11 HbA1c–associated SNPs had risk allele frequencies that differed significantly across race-ethnic groups (Fisher’s p <0.0002). At five of these six loci, risk allele frequency of NHB was most divergent, including SNPs near ANK1 (two uncorrelated SNPs), MTNR1B, ATP11A/TUBGCP3 and TMPRSS6. At the SNP near SPTA1, risk allele frequency differed most in MA. The HbA1c -raising allele was the minor (less frequent) allele in all three ethnic groups for SNPs near SPTA1, GCK, MTNR1B, FN3K, and TMPRSS6. The HbA1c –raising allele was the major (more frequent) allele at SNPs near ABCB11, HFE, ANKI (rs6474359) and HK1. At two loci, ATP11A, and ANK1 (rs4737009), the HbA1c –raising allele was the minor allele in NHW and MA, but the major allele in NHB ( Additional file 1: Table S1). Risk allele frequencies observed in this study and those available from HapMap were generally similar, although at some loci minor dissimilarity with HapMap was observed in NHB and MA cohorts (Figure 1; Additional file 1: Table S2).

Figure 1
figure 1

Risk allele frequencies of 11 HbA 1c -associated SNPs. NHB: Non-Hispanic black NHANES III; MA: Mexican American NHANES III; NHW: Non-Hispanic white NHANES III; ASW: African ancestry in Southwest USA HapMap; MEX: Mexican ancestry in Los Angeles, California HapMap; CEU: Utah residents with Northern and Western European ancestry from the CEPH collection HapMap. *ASW frequencies were missing at ANK1 (rs6474359), HK1 and MTNR1B loci and YRI frequencies were substituted (YRI: Yoruba in Ibadan, Nigeria). MEX frequencies were missing at HK1 and MTNR1B loci.

SNP associations with HbA1c

Though single-SNP associations are underpowered ( Additional file 1: Table S3), we did observe that in NHW, eight of the 11 SNPs in NHW were consistent with Soranzo et al. (2010) in having a positive risk effect on HbA1c levels, with three of the SNPs used in our analysis (rs282606 [ATP11A], rs10830956 [MTNR1B], and rs2022003 [SPTA1]) serving as proxies for those in the MAGIC study. Beta coefficients were negative for three, three and four of the 11 SNPs in NHW, NHB and MA groups, respectively, but corresponding SNPs did not generate significant associations (Table 2). Three out of 11 HbA1c-associated SNPs had nominally significant (p < 0.05) associations with HbA1c levels in at least one of the three race-ethnic groups, but altogether only four of the 33 possible associations (11 SNPs x three race-ethnic groups) were significant (p < 0.05). No significant associations were observed in NHB. Two HbA1c-SNPs produced a significant association only in NHW (both SNPs at ANK1), and one produced a significant association in both NHW and MA (rs855791 near TMPRSS6).

Table 2 Regression coefficients of 11 HbA 1c -associated SNPs on HbA 1c levels by race-ethnicity, Third National Health and Nutrition Examination Survey (NHANES III)

Combined associations of 11 HbA1c SNPS with HbA1c

The mean 11-SNP genotype scores (actual scores ranged from 1–18) were 11.0 (± 0.09 [SE]) in NHB, 10.7 (± 0.08) in MA and 10.4 (± 0.07) in NHW, (p value for global difference across race-ethnicity < 0.0001, Table 3). Median genetic risk scores (unweighted) were 11.0 (SD =2.2), 11.0 (SD =2.3) and 11.0 (SD =2.0) in NHW, NHB and MA, respectively, with distributions of genetic risk scores negatively skewed toward a lower score in all three ethnic groups. The per-risk allele increase in the score was significantly associated with HbA1c levels in NHW and MA, but not NHB. When comparing the top and bottom 10% of the genotype score distribution for each race-ethnic group, the smallest difference in HbA1c was observed in NHW (NHW: 0.49%; NHB: 0.56%; MA: 0.54%). The genotype score explained very little of the variance in HbA1c levels in NHB (0.0005%) compared with NHW (0.0016%) and MA (0.0121%). Variance explained in NHW is comparable to the previously published value [7]. We observed no significant genetic risk score x ethnicity interaction on HbA1c level (p = 0.68).

Table 3 Association of HbA 1c with the 11 SNP genetic risk score by race-ethnicity, Third National Health and Nutrition Examination Survey (NHANES III)

Combined associations of eight non-glycemic SNPs with HbA1c

The mean “non-glycemic” 8-SNP genotype scores (actual scores ranged from 4–15) were 8.80 (± 0.06[SE]) in NHB, 8.72 (± 0.06) in MA and 8.41(± 0.06) in NHW, (p value for global difference across race-ethnicity < 0.0001) ( Additional file 1: Table S4). The per-risk allele increase in the score was significantly associated with HbA1c levels in NHW, but not in NHB and MA.

Association of 11 HbA1c SNPs with mortality

Mortality rates differed between race-ethnic groups (Table 4) with a higher mortality rate observed in NHB (19.4%) compared with NHW (12.8%) and MA (14.5%). The 11-SNP genotype score was not associated with mortality in any race-ethnic group. We observed no significant genetic risk score x ethnicity interaction on mortality (p=0.62). Power calculations for the mortality analysis are provided in Additional file 1: Table S5.

Table 4 Association of 11 SNPs and the Genotype Score with mortality by race-ethnicity, Third National Health and Nutrition Examination Survey (NHANES III)

Linkage disequilibrium at HbA1c-associated loci

There were consistently fewer LD regions in the CEU population compared to YRI at every locus (YRI:CEU): SPTA1 42:25; ABCB11/G6PC2 49:27; HFE 31:17; GCK 28:21; ANK1 (rs41668351) 35:29; ANK1 (rs41749562) 30:25; HK1 45:38; MTNR1B 39:22; FN3K 42:27; TMPRSS6 78:52; ATP11A/TUBGCP3 49:31 ( Additional file 1: Table 6). ASW, which represents a population with African ancestry in the southwestern United States, only had higher numbers of LD regions compared to CEU in two out of 11 regions, possibly due to lower coverage of ASW compared to CEU (and YRI) in HapMap Release 27.

Evidence of population differentiation and natural selection at HbA1c-associated loci

Fay and Wu’s H + was highly skewed at two loci (HK1 and ATP11A) in CEU ( Additional file 1: Table S7). Integrated haplotype scores (iHS) were not highly negative or positive at these SNPs, as would be characteristic for regions undergoing recent natural selection. FST, a measure of the amount of allelic fixation due to drift, was greater than 15% at ANK1 and ATP11A in both CEU and YRI, suggesting population differentiation at these loci [29]. Haplotter queries by gene did not reveal evidence of natural selection directly at the genes queried, but evidence of natural selection was observed within a 2 Mb region of ABC11/G6PC2 and TMPRSS6 for CEU and YRI, respectively.

Discussion

Genome-wide association studies of HbA1c levels in cohorts of white individuals of European ancestry revealed a combination of glycemic and non-glycemic biological influences on HbA1c, with three loci associated with HbA1c in or near genes likely involved in glycemic control pathways and seven loci associated with HbA1c in or near genes likely to be involved in erythrocyte biology [7]. In this study we found that in the nationally representative NHANES III sample of US adults, heterogeneity in risk allele frequencies exists across race-ethnic groups for six of these HbA1c-associated SNPs. Five SNP risk allele frequencies in NHB were significantly lower or higher than the other two groups. Risk allele frequencies observed in NHANES III were generally consistent with frequencies of comparable populations available in HapMap, suggesting that HapMap and NHANES III can be considered representative of each other at these SNPs at least with respect to white, African American and Mexican American race-ethnic populations. An 11-HbA1c- associated SNP genotype score was subtly different by race-ethnicity and was associated with increase in HbA1c in NHW and MA but not NHB. The 11-SNP genotype score was not significantly associated with mortality in any group.

There are several potential sources for the inter-race-ethnic heterogeneity of SNP and genotype risk score associations with HbA1c that we observed. One potential source of heterogeneity is race-specific selection acting on erythrocyte-related loci that influence HbA1c. Variants in the β- hemoglobin gene (HBB), for example, produce abnormal erythrocytes that can affect HbA1c levels [30] but are protective against malaria and are thus maintained in populations and found at highest frequencies in regions historically exposed to this disease like Africa and India [31]. Rare mutations in many loci associated with HbA1c (SPTA1 ANK1 HK1 TMPRSS6) are known to cause hereditary red blood cell disorders [7] and common variants at several loci (SPTA1, HFE, ANK1 HK1 TMPRSS6) are associated with hematological traits like hemoglobin concentration and mean corpuscular volume [3234]. Adjustment of models of these common variants predicting HbA1c levels for levels of hemoglobin concentration or mean corpuscular volume attenuate SNP-HbA1c relationships, suggesting mediation of HbA1c varation by elements of erythrocyte biology [7]. Further, a recent genetic association study showed some differences in the genetic regulation of hematological traits in Europeans compared with Africans [35]. Our analyses of differentiation and selection suggest that there may be some selection pressure at the ANK1 HK1 ATP11A, TMPRSS6 and ABC11/G6PC2 loci, the first four of which are erythrocyte-related loci. However, in the present study, race-ethnic differences in association with HbA1c by SNP were observed at only two of these loci (ANK1 [rs4737009] and TMPRSS6). We also examined inter-population allele frequency differences of trait-associated SNPs which may indicate that selection is operating on the trait [36]. While frequencies of some disease-associated alleles have been reported as largely heterogeneous between race-ethnicities [3639], other data suggest no greater differentiation than would be expected from a random set of SNPs [40]. We found heterogeneous inter-race-ethnic risk allele frequencies at six of the HbA1c-associated SNPs and three of these (SNPs near ANK1 [both SNPs] and TMPRSS6) showed inter-race heterogeneity in SNP association with HbA1c.

We found modest race-ethnic differences in the association of individual or collective HbA1c-associated SNPs and levels of HbA1c. We found nominally significant associations with an HbA1c-associated SNP genotype score and levels of HbA1c in NHW, as expected, and also in MA, but not in NHB individuals. Ancestral variation in LD probably accounts for some of this difference in association. LD is more fine-grained in genomes of African individuals [22], so some of the HbA1c-associated SNPs may be more tightly linked to putative functional alleles in NHW and MA than in NHB. Modest power given the relatively small sample size of NHANES III could also account for the relatively weak association of HbA1c SNPs with HbA1c in each race-ethnic group ( Additional file 1: Table S3). No significant interactions were observed, also possibly due to low power. T2D diagnosis was based on fasting glucose with no OGTT, which may have introduced misclassification in T2D status of study subjects. Furthermore, greater heterogeneity exists in NHB, and this heterogeneity may have influenced variability in HbA1c levels. Since there are no ancestry markers available in NHANES to evaluate genetic heterogeneity within populations, we were unable to evaluate substructure within ethnic groups and, for the purposes of this study, assumed little to no intra-population substructure.

Despite previous epidemiological associations of HbA1c levels with mortality or cardiovascular disease [1219] and race-ethnic variation in mortality rates in NHANES III, we did not see any evidence of an association of HbA1c-associated loci with mortality in any race-ethnic group. If HbA1c is associated with mortality, it is likely to be mediated through HbA1c’s association with hyperglycemia and insulin resistance, but many HbA1c-associated loci are associated with erythrocyte biology and not hyperglycemia. A lack of association of the HbA1c-associated SNPs studied here and cardiovascular disease events has also been shown previously in white cohorts [7]. This unlinking of hyperglycemia from HbA1c biology also has bearing on diabetes screening and diagnosis. Another explanation for a lack of association of the HbA1c genetic risk score with mortality is the lack of statistical power due to small sample size within each ethnicity ( Additional file 1: Table S5). When pooling the entire sample and carrying out an interaction model we also observed no significant genetic risk score x ethnicity interaction on mortality.

Race-ethnic differences in HbA1c levels were observed in the present study and have been shown previously [4146]. Population differences in HbA1c levels are partly attributable to variability in non-biological factors including race-ethnic differences in lifestyle, socioeconomics, health insurance access or screening intensity [41, 44]. Further, there are likely race ethnic differences in non-glycemic biological factors including glycemic level, hemoglobinopathies [30, 4749], iron deficiency anemias [21, 48, 5054], and erythrocyte survival [48, 55, 56]. The data suggest that glycemic control is not the only root cause of inter-race-ethnic differences in HbA1c. Although the clinical impact of HbA1c genetics on diabetes detection appears to be modest in whites, at least , whether race-ethnic heterogeneity in HbA1c genetics influences diabetes diagnosis in other race-ethnic groups requires further investigation.

The major strengths of this study include genotyping of all 11 known HbA1c-associated SNPs in the nationally representative, multi-race-ethnic NHANES III cohort. The heterogeneity of HbA1c–associated SNP frequencies across race-ethnic groups and the limited impact of these SNPs on HbA1c level in NHB individuals underscore the importance of extending association studies and the discovery of causal variants to diverse populations for a comprehensive understanding of HbA1c genetic architecture. As diverse populations become increasingly incorporated into genetic studies for variant detection, inter-race-ethnic variation will likely continue to be revealed, necessitating careful investigation of its sources and significance.

Conclusions

In NHANES III there is substantial RAF race-ethnic heterogeneity at many HbA1c loci. An 11-HbA1c- associated SNP genotype score was subtly different by race-ethnicity and was associated with increase in HbA1c in NHW and MA but not NHB. While the numerous potential sources for this race-ethnic heterogeneity in association with HbA1c require further exploration, the data underscore the importance of extending genetic analysis to non-white populations, especially where they may have impact on guidelines for disease screening, diagnosis or management.

Disclaimer

The findings and conclusions in this report are those of the author(s) and do not necessarily represent the official position of the Centers for Disease Control and Prevention.