Abstract
Type 2 diabetes (T2D) is caused by both genetic and environmental factors and is associated with an increased risk of cardiorenal complications and mortality. Though disproportionately affected by the condition, African Americans (AA) are largely underrepresented in genetic studies of T2D, and few estimates of heritability have been calculated in this race group. Using genome-wide association study (GWAS) data paired with phenotypic data from ~ 19,300 AA participants of the Reasons for Geographic and Racial Differences in Stroke (REGARDS) study, Genetics of Hypertension Associated Treatments (GenHAT) study, and the Electronic Medical Records and Genomics (eMERGE) network, we estimated narrow-sense heritability using two methods: Linkage-Disequilibrium Adjusted Kinships (LDAK) and Genome-Wide Complex Trait Analysis (GCTA). Study-level heritability estimates adjusting for age, sex, and genetic ancestry ranged from 18% to 34% across both methods. Overall, the current study narrows the expected range for T2D heritability in this race group compared to prior estimates, while providing new insight into the genetic basis of T2D in AAs for ongoing genetic discovery efforts.
Similar content being viewed by others
Introduction
Diabetes mellitus is a heterogeneous group of metabolic and health conditions characterized by glucose dysregulation and defects in insulin secretion and/or insulin action1. Chronic hyperglycemia has been associated with long-term damage and dysfunction of the kidneys, heart, and blood vessels2. Diabetes is a major risk factor for cardiovascular disease (CVD), particularly coronary heart disease and stroke3. In the United States, type 2 diabetes (T2D) is the most common form of diabetes in adults, constituting > 90% of cases4 and is growing in prevalence among adolescents5. T2D is more often associated with increased age6; however, the T2D epidemic can largely be attributed to a worldwide increase in obesity7. Since lifestyle intervention focused on obesogenic behaviors is effective at preventing T2D8, identifying populations with increased genetic susceptibility could lower disease morbidity.
It is well-established that T2D is a complex disease, and the risk of developing T2D depends on environmental and genetic factors. Further supporting the genetic background of the disease is the high concordance in monozygotic twins compared to dizygotic twins in familial studies9,10,11,12. While traditional twin studies have been considered the “gold standard” for measuring the heritability of a trait, more recent literature has suggested that twin studies may overestimate heritability due to shared environment and non-additive genetic effects creating “phantom heritability”13.
While broad sense heritability of a trait is the proportion of phenotypic variation attributed to genetics, the narrow-sense heritability (h2) is the proportion attributable to the additive gene effects14. These additive effects of variants underlying a trait is of particular importance because they constitute the genetic variation component transferred from parent to offspring 15. With advances in genotyping technologies, the generation of genome-wide common genetic variant data has enabled approaches that provide an alternative to twin or family heritability studies16,17. These methods estimate heritability in unrelated individuals via correlation between genetic and phenotypic sharing, similar to family-based studies. However, instead of using the theoretic estimates of genetic sharing (i.e., Mendel’s Laws), single nucleotide polymorphism (SNP)-based heritability analyses allow for empirical estimates of genetic sharing to be directly obtained from the genotype data17. Thus, by extending the models to utilize the contributions from all common genetic variants, these methods can detect considerable shares of the additive genetic effect13.
While previous h2 estimates of T2D and related clinical traits (e.g., fasting glucose, fasting insulin) have varied between 25% and 80%, they have either excluded or underrepresented individuals of non-European ancestry, particularly African Americans (AAs)2,18,19,20,21,22. Importantly, known disparities exist in T2D, where AAs have a higher prevalence of T2D, increased mortality rates, and an increased risk of T2D complications compared to individuals of European descent23,24. Further, there is a concern that AAs and African ancestry groups may benefit less from genetic research, potentially exacerbating health disparities of common chronic conditions, such as T2D25. Given these trends, additional heritability studies are warranted for these populations. For example, accurate heritability estimates are needed to understand the utility of polygenic risk scores (PRS) in multi-ancestral populations with regard to the maximum trait variance expected to be explained by a PRS.
In the present study, we seek to capture the T2D additive genetic variation (i.e., h2) from ~ 19,300 AA participants in the Reasons for Geographic and Racial Differences in Stroke (REGARDS) study, the Genetics of Hypertension Associated Treatments (GenHAT) study, and the Electronic Medical Records and Genomics (eMERGE) network. Each study used an overlapping subset of 8.2 million imputed genetic variants to estimate the h2 using two common approaches, Genome-Wide Complex Trait Analysis (GCTA) and Linkage-Disequilibrium Adjusted Kinships (LDAK), making this one of the most extensive studies to estimate T2D heritability in AAs.
Results
Descriptive statistics for 7957 AA T2D cases and 11,378 AA controls are presented in Table 1. On average, cases were slightly older (65 years versus 63 years for controls, 66 years versus 66 years for controls, and 68 years versus 67 years in controls for REGARDS, GenHAT, and eMERGE, respectively). Furthermore, T2D cases were more likely to be men in REGARDS (59%), but women in GenHAT (61%) and eMERGE (65%).
Study-level heritability estimates are provided in Table 2. The base model (Model 1) h2 estimates using LDAK were 19% in REGARDS, 19% in GenHAT, and 33% in eMERGE. Similar trends were observed when estimates were calculated using GCTA, ranging from 21% in GenHAT to 31% in eMERGE. Upon age, sex, and genetic ancestry adjustment (Model 3), h2 estimates from LDAK were similar to Model 1 (18% in GenHAT, 18% in REGARDS, and 33% for eMERGE). In the fully-adjusted (Model 3) using GCTA, h2 estimates for REGARDS, GenHAT, and eMERGE were 24%, 21%, and 32%, respectively. In a sensitivity analysis, the study site was included as a covariate for the eMERGE cohort and results remained similar with and without adjustment. Further, when adjusting for the AA-specific population prevalence of T2D (heritability liability, h2liab), estimates increased marginally across all models and methods (Supplemental Table 1).
Discussion
African American populations continue to be underrepresented in genomic research. In the era of genomic medicine, this could unintentionally create new disparities in prevention and prediction of T2D. By leveraging genome-wide SNP array data from unrelated participants belonging to well-characterized cohort studies the current study provides insight into the additive genetic factors that contribute to the phenotypic variance of T2D in this non-European population. This type of common genetic variant heritability study is less prone to the confounding effects of shared environment observed in family studies. Therefore, the results presented may give a more precise estimation of the genetic contribution to T2D which may inform how genome-wide data is used in the future for the treatment and prevention of the condition.
We estimated the h2 of T2D using two well-characterized estimation approaches, LDAK and GCTA. Upon covariate adjustment, we observed h2 estimates from GCTA ranging from 21 to 32%. LDAK provided more conservative fully-adjusted estimates, ranging from 18 to 33%. This is most likely due to the inclusion of LD and allele frequency patterns into the estimation, correcting uneven LD patterns across the genome26. Subsequent h2liab estimates, aimed to measure heritability independent of disease prevalence, ranged from 19% to 34%.
As described, we observed variable estimates across studies. This leads us to believe that age and sex were confounders, justifying the need for a fully adjusted model and age-matching cases and controls in eMERGE to be representative of what was present in GenHAT and REGARDS. The h2liab estimates also showed slight inflation in REGARDS across both methods and we speculate that could be a result of the smaller case proportion (~ 30%) compared to eMERGE and GenHAT (~ 50%). It is also important to note that h2liab could potentially be biased, as previously reported in Golan et al., as it is a function of the prevalence in a given sample population27.
Prior reports of heritability of T2D and T2D-related traits (ranging 25–80%) have been described largely in populations or families of European descent. A 2005 familial aggregation study in ~ 2400 non-diabetic AA participants, described heritability of 29% ± 9% for fasting glucose and 28% ± 8% for fasting insulin measures19, similar to our findings. Importantly, over 700 unique genetic loci associated with T2D have been reported by GWAS28,29,30,31,32, explaining almost 20% of T2D heritability in a multi-ancestry sample31. The majority of these loci were identified in European or Asian populations33, though > 25 loci were discovered in AAs31,34,35,36. Given the wide-range of reported heritability estimates it’s difficult to decipher how much missing heritability remains after accounting for discovered significant loci. Therefore, more precise estimates of heritability such as that presented in our study are needed to inform future GWAS discovery and precision medicine applications of GWAS data.
Our study is subjected to several limitations. First, heterogeneity in the estimates could result from heterogeneity in the T2D phenotype, heterogeneity in study designs, and population admixture. To account for these limitations, we adjusted for genetic principal components in each study, as well as the study site in sensitivity models in eMERGE. We followed the same validated T2D phenotyping as a previously published report37, attempting to harmonize the phenotype between the population-based longitudinal study (REGARDS), randomized clinical trial (GenHAT/ALLHAT), and electronic medical records (EMR) from eMERGE. While attempting to harmonize the genetic variants, it is important to note the differences in the imputation reference panel between REGARDS and GenHAT (TOPMed release 2) versus eMERGE (HRC), which did result in the exclusion of nearly 7 million variants from REGARDS and GenHAT. However, this number of variants is still compatible with other publications that have used these approaches38. Lastly, while the use of the contemporary arrays and methodologies (e.g., Illumina MEGA array and TOPMed reference panel) allows for interrogation of more African-specific variants, the potential genetic contribution of rare minor allele frequency (MAF) < 1% or structural variants was not investigated in this study. This is important since rare variants are thought to harbor more deleterious effects than common variants, as well as playing an important role in accurately calculating the heritability of complex diseases39.
In conclusion, we conducted one of the largest heritability estimates of T2D to date utilizing genetic array data and to the best of our knowledge, the largest study of h2 in AAs, comprising three independent studies with sizable AA populations and T2D prevalence. All three studies have well-described and extensive phenotyping, allowing for appropriate covariate adjustment and previously validated T2D case and control definitions. The use of imputed data allows for more consistency across studies, with the inclusion of more than 8.2 million overlapping genetic variants. Though we observed similar trends using LDAK and GCTA, the considerations of LD resulted in a more conservative h2 range across studies using LDAK. Our h2 estimates are lower with smaller confidence intervals than most familial studies on T2D that have been predominately described in European populations; however, they are similar to a family study on fasting insulin and glucose levels in AAs. Future work in determining the heritability of complex diseases, including T2D, is warranted in order to advance the availability of genomic resources for non-European populations.
Methods
Study populations
Three cohorts, REGARDS, GenHAT, and eMERGE contributed genetic data for this study, comprising 7,957 T2D cases and 11,378 controls of African descent (Fig. 1). Descriptive statistics for each study are described in Table 1. Signed informed consent was collected from all participants in each study. All studies were reviewed and approved by the institutional review boards of the participating institutions and study sites.
Reasons for geographic and racial differences in stroke (REGARDS) study
REGARDS is a national, longitudinal study of incident stroke and associated risk factors, enrolling over 30,000 Black and white adults aged 45 years or older from all 48 contiguous US states and the District of Columbia40. Participants completed a computer-assisted telephone interview (CATI) and an in-home visit where blood and urine were collected, as well as blood pressure measurements and a medicine review. Participants are contacted at 6 month intervals to obtain information regarding incident stroke or secondary outcomes. A subset of 8916 Black REGARDS participants underwent genotyping using Illumina Infinium AMR/AFR (MEGA) BeadChip arrays. Quality control procedures have been previously described41, but briefly, participants were excluded based on sex mismatches, internal duplicates, or HapMap controls. Variants were excluded if they were located on sex chromosomes, had ambiguous strands, were not bi-allelic, were in violation of Hardy Weinberg (p < 1.00e-12 for REGARDS), had MAF < 5%, and/or had a missing rate > 10%. Imputation was performed using the Trans-omics for Precision Medicine (TOPMed) release 2 (Freeze 8) reference panel42.
Genetics of hypertension associated treatments (GenHAT) study
GenHAT is an ancillary study to the Antihypertensive and Lipid Lowering Treatment to Prevent Heart Attack Trial (ALLHAT). ALLHAT was a randomized, double-blind multicenter clinical trial that enrolled over 42,000 high-risk individuals 55 years or older with hypertension and at least one additional risk factor for cardiovascular disease43,44. The GenHAT study (N = 39,114) evaluated the interaction between candidate hypertensive genetic variants and antihypertensive treatments to modify the risk of CVD outcomes. A subset of 7711 Black adults with hypertension was genotyped on Illumina MEGA BeadChip arrays. Similar to GenHAT, QC procedures have been previously described41. Participants were excluded based on sex mismatches, internal duplicates, or HapMap controls, while variants were excluded if they were located on sex chromosomes, had ambiguous strands, were not bi-allelic, were in violation of Hardy Weinberg (p < 1.00e-05), had MAF < 5%, and/or had a missing rate > 10%. Imputation was performed using the Trans-omics for Precision Medicine (TOPMed) release 2 (Freeze 8) reference panel42.
Electronic medical records and genomics (eMERGE) network
The eMERGE network combines DNA biorepositories with electronic medical records (EMR) for the purpose of research focused on advancing efforts in genomic medicine. The eMERGE III cohort was initiated in 2015 and culminated in over 100,000 GWAS samples across the network, while eMERGE IV aimed to develop and disseminate methodologies for genetic risk assessment, integrate genetics into routine medical practice to identify individuals at high risk for common disease, and recommend interventions37,45. For the current study, genetic data from eight sites (Cincinnati Children’s Hospital Medical Center, Children’s Hospital of Philadelphia, Columbia University, Mass General Brigham, Mayo Clinic, Icahn School of Medicine at Mount Sinai, Northwestern University, and Vanderbilt University Medical Center)46 was imputed against the Haplotype Reference Consortium (HRC) panel47.
To age-match cases and controls from eMERGE to the REGARDS and GenHAT studies, samples were selected from the age range 30–100. Individuals were divided into deciles and numbers of cases and controls were matched in each decile (Supplemental Fig. 1).
Definition of type-2 diabetes status
T2D status was defined independently across all three studies. In the REGARDS study, T2D cases were classified based on fasting glucose ≥ 126 mg/dL (7 mmol/L), non-fasting glucose ≥ 200 mg/dL (11.1 mmol/L), or the use of diabetes medications (e.g., oral hypoglycemic pills or insulin). The ALLHAT/GenHAT definition of T2D was described as a fasting glucose ≥ 140 mg/dL or use of diabetes medication43,44. Therefore, in the current study we excluded controls that had baseline fasting glucose ≥ 126 mg/dL or missing a fasting glucose measure. In eMERGE, a revised EMR-based phenotyping algorithm based on ICD9/ICD10 codes was applied across the participants37. Further information regarding all T2D definitions has been previously described in detail37.
Statistical analysis
To estimate the h2 for T2D, we employed two widely used methodologies, GCTA and LDAK, using an overlapping subset of 8,240,835 imputed genetic variants (Fig. 1). Three statistical models were fit: a base model without covariates (Model 1), a model adjusting for genetic ancestry through principal component analysis48 (Model 2), and a model adjusting for genetic ancestry, age, and sex (Model 3). In addition to the stated Model 3, a sensitivity analysis further adjusting for the study site to account for potential heterogeneity in sample characteristics across sites in the eMERGE cohort was performed, and the results remained consistent.
Genome-wide complex trait analysis (GCTA) method
Both genotypes and imputed variants that passed quality control (QC) filters were used to construct a genomic relationship matrix (GRM) using the GCTA tool, as previously described using the genome-based restricted maximum likelihood (GREML)49,50. The GRM reflects allele sharing (\({A}_{ij}\)) between two individuals (i and j) across variants with entries
where m is the number of variants, xik and xjk are the genotypes coded as 0, 1, or 2 of individuals i and j, respectively, at the kth locus, and pk is the MAF of the kth locus. The variance of T2D was calculated as
where the variance explained by the genetic variants (\({\sigma }_{v}^{2}\)) corresponding to GRM and residual error variance (\({\sigma }_{e}^{2}\)) were estimated using restricted maximum likelihood (REML), A is an n × n matrix with elements Aij, and I is an n × n identity matrix. The proportion of the variance of T2D explained by all the genetic variants (h2) on the observed scale was then calculated as:
We removed one individual from relative pairs with estimated genetic relatedness greater than 0.025 to ensure no closely-related individuals were included in heritability estimates (e.g., parent-offspring, siblings, cousins).
Linkage-disequilibrium adjusted kinships (LDAK) method
Knowing that African ancestral populations have greater haplotype diversity and, in turn, shorter segments of linked alleles51, we utilized LDAK, which incorporates linkage disequilibrium (LD), as an additional approach. LDAK can be used as an alternative method of generating a GRM by weighting genetic variants based on local LD patterns52. As previously described, the genetic variance of variants in high LD with a causal variant is typically overestimated in GCTA, while the genetic variance is underestimated in lower LD regions52, thus demonstrating the importance of accounting for LD in the construction of the LD-weighted GRM. Therefore LD-weighting eliminates the overestimation of heritability in high LD regions and underestimation of heritability in low LD regions by giving smaller weights to markers in the high-LD regions and large weights to markers in low LD regions53.
The GRM for LDAK is constructed as follows:
where W is the diagonal matrix with elements representing the LD-weight for each variant, N is the total number of variants, and X is a matrix with the general term
with \({p}_{j}\) being the frequency of a given allele at variant j and \({m}_{ij}\) being the genotype for the j-th variant in the i-th individual (represented by 0, 1, or 2).
When estimating heritability, LDAK assumes:
where \(E\left[{h}_{j}^{2}\right]\) is the expected heritability contribution of genetic variant j and \({f}_{i}\) is its observed MAF. The parameter α determines the assumed relationship between heritability and MAF. The genetic variant weighs ( \({\varpi }_{j}\)) are based on the local level of LD and tend to be higher for variants in low-LD regions; thus, LDAK assumes that these variants contribute more than those in the high-LD areas. \({r}_{j}\) \(\epsilon \left[\text{0,1}\right]\) is an information score measuring genotype certainty, where LDAK assumes higher-quality variants contribute more than lower-quality ones54.
Liability scale
In order to account for the inflated proportion of cases in case–control designs, the heritability estimation on the observed scale was transformed to that on the liability as
where \(K\) is the population prevalence of the T2D in AAs, P is the sample prevalence of T2D, and z is the height of the standard normal probability density function at the truncation threshold t, as previously described55. The AA-specific T2D prevalence of 12.5% was extracted from recent literature56.
Data availability
The REGARDS (Study Accession: phs002719.v1.p1), GenHAT (Study Accession: phs002716.v1.p1) and eMERGE (Study Accession: phs001584.v2.p2) phenotypic and genetic data are available on dbGaP.
References
Virani, S. S. et al. Heart disease and stroke statistics-2021 update: A report from the american heart association. Circulation 143, e254–e743. https://doi.org/10.1161/CIR.0000000000000950 (2021).
Prasad, R. B. & Groop, L. Genetics of type 2 diabetes-pitfalls and possibilities. Genes 6, 87–123. https://doi.org/10.3390/genes6010087 (2015).
Factors, E. R. et al. Diabetes mellitus, fasting blood glucose concentration, and risk of vascular disease: A collaborative meta-analysis of 102 prospective studies. Lancet 375, 2215–2222. https://doi.org/10.1016/S0140-6736(10)60484-9 (2010).
Mayer-Davis, E. J. et al. Incidence trends of type 1 and type 2 diabetes among youths, 2002–2012. N. Engl. J. Med. 376, 1419–1429. https://doi.org/10.1056/NEJMoa1610187 (2017).
Lawrence, J. M. et al. Trends in prevalence of type 1 and type 2 diabetes in children and adolescents in the US, 2001–2017. JAMA 326, 717–727. https://doi.org/10.1001/jama.2021.11165 (2021).
Stankov, K., Benc, D. & Draskovic, D. Genetic and epigenetic factors in etiology of diabetes mellitus type 1. Pediatrics 132, 1112–1122. https://doi.org/10.1542/peds.2013-1652 (2013).
Lyssenko, V. et al. Clinical risk factors, DNA variants, and the development of type 2 diabetes. N. Engl. J. Med. 359, 2220–2232. https://doi.org/10.1056/NEJMoa0801869 (2008).
Galaviz, K. I., Narayan, K. M. V., Lobelo, F. & Weber, M. B. Lifestyle and the prevention of type 2 diabetes: A status report. Am. J. Lifestyle Med. 12, 4–20. https://doi.org/10.1177/1559827615619159 (2018).
Kaprio, J. et al. Concordance for type 1 (insulin-dependent) and type 2 (non-insulin-dependent) diabetes mellitus in a population-based cohort of twins in Finland. Diabetologia 35, 1060–1067. https://doi.org/10.1007/BF02221682 (1992).
Newman, B. et al. Concordance for type 2 (non-insulin-dependent) diabetes mellitus in male twins. Diabetologia 30, 763–768. https://doi.org/10.1007/BF00275741 (1987).
Poulsen, P., Kyvik, K. O., Vaag, A. & Beck-Nielsen, H. Heritability of type II (non-insulin-dependent) diabetes mellitus and abnormal glucose tolerance–a population-based twin study. Diabetologia 42, 139–145. https://doi.org/10.1007/s001250051131 (1999).
Medici, F., Hawa, M., Ianari, A., Pyke, D. A. & Leslie, R. D. Concordance rate for type II diabetes mellitus in monozygotic twins: Actuarial analysis. Diabetologia 42, 146–150. https://doi.org/10.1007/s001250051132 (1999).
Chen, X. et al. Dominant genetic variation and missing heritability for human complex traits: Insights from twin versus genome-wide common SNP models. Am. J. Hum. Genet. 97, 708–714. https://doi.org/10.1016/j.ajhg.2015.10.004 (2015).
Wang, Y., Vik, J. O., Omholt, S. W. & Gjuvsland, A. B. Effect of regulatory architecture on broad versus narrow sense heritability. PLoS Comput. Biol. 9, e1003053. https://doi.org/10.1371/journal.pcbi.1003053 (2013).
Karavolias, N. G. et al. Low additive genetic variation in a trait under selection in domesticated rice. G3 (Bethesda) 10, 2435–2443. https://doi.org/10.1534/g3.120.401194 (2020).
Genomes Project, C. et al. 2012 An integrated map of genetic variation from 1,092 human genomes. Nature 491, 56-65, https://doi.org/10.1038/nature11632 (2012).
Hall, J. B. & Bush, W. S. Analysis of heritability using genome-wide data. Curr. Protoc. Hum. Genet. https://doi.org/10.1002/cphg.25 (2016).
Miljkovic-Gacic, I. et al. Genetic determination of adiponectin and its relationship with body fat topography in multigenerational families of African heritage. Metabolism 56, 234–238. https://doi.org/10.1016/j.metabol.2006.09.019 (2007).
Freedman, B. I. et al. Genome-wide scans for heritability of fasting serum insulin and glucose concentrations in hypertensive families. Diabetologia 48, 661–668. https://doi.org/10.1007/s00125-005-1679-5 (2005).
Poveda, A. et al. The heritable basis of gene-environment interactions in cardiometabolic traits. Diabetologia 60, 442–452. https://doi.org/10.1007/s00125-016-4184-0 (2017).
Vattikuti, S., Guo, J. & Chow, C. C. Heritability and genetic correlations explained by common SNPs for metabolic syndrome traits. PLoS Genet. 8, e1002637. https://doi.org/10.1371/journal.pgen.1002637 (2012).
Almgren, P. et al. Heritability and familiality of type 2 diabetes and related quantitative traits in the Botnia Study. Diabetologia 54, 2811–2819. https://doi.org/10.1007/s00125-011-2267-5 (2011).
Mansour, O., Golden, S. H. & Yeh, H. C. Disparities in mortality among adults with and without diabetes by sex and race. J. Diabetes Complicat. 34, 107496. https://doi.org/10.1016/j.jdiacomp.2019.107496 (2020).
Lanting, L. C., Joung, I. M., Mackenbach, J. P., Lamberts, S. W. & Bootsma, A. H. Ethnic differences in mortality, end-stage complications, and quality of care among diabetic patients: A review. Diabetes Care 28, 2280–2288. https://doi.org/10.2337/diacare.28.9.2280 (2005).
Horowitz, C. R. et al. Race, genomics and chronic disease: what patients with African ancestry have to say. J. Health Care Poor Underserved 28, 248–260. https://doi.org/10.1353/hpu.2017.0020 (2017).
Srivastava, A. K., Williams, S. M. & Zhang, G. Heritability estimation approaches utilizing genome-wide data. Curr. Protoc. 3, e734. https://doi.org/10.1002/cpz1.734 (2023).
Golan, D., Lander, E. S. & Rosset, S. Measuring missing heritability: Inferring the contribution of common variants. Proc. Natl. Acad. Sci. USA 111, E5272-5281. https://doi.org/10.1073/pnas.1419064111 (2014).
Goodarzi, M. O. & Rotter, J. I. Genetics insights in the relationship between type 2 diabetes and coronary heart disease. Circ. Res. 126, 1526–1548. https://doi.org/10.1161/CIRCRESAHA.119.316065 (2020).
Morris, A. P. Progress in defining the genetic contribution to type 2 diabetes susceptibility. Curr. Opin. Genet. Dev. 50, 41–51. https://doi.org/10.1016/j.gde.2018.02.003 (2018).
Mahajan, A. et al. Fine-mapping type 2 diabetes loci to single-variant resolution using high-density imputation and islet-specific epigenome maps. Nat. Genet. 50, 1505–1513. https://doi.org/10.1038/s41588-018-0241-6 (2018).
Vujkovic, M. et al. Discovery of 318 new risk loci for type 2 diabetes and related vascular outcomes among 1.4 million participants in a multi-ancestry meta-analysis. Nat. Genet. 52, 680–691. https://doi.org/10.1038/s41588-020-0637-y (2020).
Mahajan, A. et al. Multi-ancestry genetic study of type 2 diabetes highlights the power of diverse populations for discovery and translation. Nat. Genet. 54, 560–572. https://doi.org/10.1038/s41588-022-01058-3 (2022).
DeForest, N. & Majithia, A. R. Genetics of type 2 diabetes: Implications from large-scale studies. Curr. Diab. Rep. 22, 227–235. https://doi.org/10.1007/s11892-022-01462-3 (2022).
Ng, M. C. et al. Meta-analysis of genome-wide association studies in African Americans provides insights into the genetic architecture of type 2 diabetes. PLoS Genet. 10, e1004517. https://doi.org/10.1371/journal.pgen.1004517 (2014).
Palmer, N. D. et al. A genome-wide association search for type 2 diabetes genes in African Americans. PLoS One 7, e29202. https://doi.org/10.1371/journal.pone.0029202 (2012).
Chen, J. et al. Genome-wide association study of type 2 diabetes in Africa. Diabetologia 62, 1204–1211. https://doi.org/10.1007/s00125-019-4880-7 (2019).
Ge, T. et al. Development and validation of a trans-ancestry polygenic risk score for type 2 diabetes in diverse populations. Genome Med. 14, 70. https://doi.org/10.1186/s13073-022-01074-2 (2022).
Jung, H. U. et al. Gene-environment interaction explains a part of missing heritability in human body mass index. Commun. Biol. 6, 324. https://doi.org/10.1038/s42003-023-04679-4 (2023).
Manolio, T. A. Genomewide association studies and assessment of the risk of disease. N. Engl. J. Med. 363, 166–176. https://doi.org/10.1056/NEJMra0905980 (2010).
Howard, V. J. et al. The reasons for geographic and racial differences in stroke study: Objectives and design. Neuroepidemiology 25, 135–143. https://doi.org/10.1159/000086678 (2005).
Armstrong, N. D. et al. Genetic contributors of incident stroke in 10,700 African Americans With hypertension: A meta-analysis from the genetics of hypertension associated treatments and reasons for geographic and racial differences in stroke studies. Front. Genet. 12, 781451. https://doi.org/10.3389/fgene.2021.781451 (2021).
Das, S. et al. Next-generation genotype imputation service and methods. Nat. Genet. 48, 1284–1287. https://doi.org/10.1038/ng.3656 (2016).
Major cardiovascular events in hypertensive patients randomized to doxazosin vs chlorthalidone: the antihypertensive and lipid-lowering treatment to prevent heart attack trial (ALLHAT). ALLHAT Collaborative Research Group. JAMA 283, 1967–1975 (2000).
Arnett, D. K. et al. Pharmacogenetic approaches to hypertension therapy: Design and rationale for the genetics of hypertension associated treatment (GenHAT) study. Pharmacogenomics J. 2, 309–317. https://doi.org/10.1038/sj.tpj.6500113 (2002).
Consortium, e. Lessons learned from the eMERGE Network: balancing genomics in discovery and practice. HGG Adv 2, 100018, https://doi.org/10.1016/j.xhgg.2020.100018 (2021).
Stanaway, I. B. et al. The eMERGE genotype set of 83,717 subjects imputed to ~40 million variants genome wide and association with the herpes zoster medical record phenotype. Genet. Epidemiol. 43, 63–81. https://doi.org/10.1002/gepi.22167 (2019).
McCarthy, S. et al. A reference panel of 64,976 haplotypes for genotype imputation. Nat. Genet. 48, 1279–1283. https://doi.org/10.1038/ng.3643 (2016).
Price, A. L. et al. Principal components analysis corrects for stratification in genome-wide association studies. Nat. Genet. 38, 904–909. https://doi.org/10.1038/ng1847 (2006).
Yang, J., Lee, S. H., Goddard, M. E. & Visscher, P. M. GCTA: A tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 88, 76–82. https://doi.org/10.1016/j.ajhg.2010.11.011 (2011).
Evans, L. M. et al. Narrow-sense heritability estimation of complex traits using identity-by-descent information. Heredity (Edinb) 121, 616–630. https://doi.org/10.1038/s41437-018-0067-0 (2018).
Charles, B. A., Shriner, D. & Rotimi, C. N. Accounting for linkage disequilibrium in association analysis of diverse populations. Genet. Epidemiol. 38, 265–273. https://doi.org/10.1002/gepi.21788 (2014).
Speed, D., Hemani, G., Johnson, M. R. & Balding, D. J. Improved heritability estimation from genome-wide SNPs. Am. J. Hum. Genet. 91, 1011–1021. https://doi.org/10.1016/j.ajhg.2012.10.010 (2012).
Ren, D. et al. Impact of linkage disequilibrium heterogeneity along the genome on genomic prediction and heritability estimation. Genet. Sel. Evol. 54, 47. https://doi.org/10.1186/s12711-022-00737-3 (2022).
Ma, Y. et al. Excess heritability contribution of alcohol consumption variants in the “Missing Heritability” of type 2 diabetes mellitus. Int. J. Mol. Sci. https://doi.org/10.3390/ijms222212318 (2021).
Lee, S. H., Wray, N. R., Goddard, M. E. & Visscher, P. M. Estimating missing heritability for disease from genome-wide association studies. Am. J. Hum. Genet. 88, 294–305. https://doi.org/10.1016/j.ajhg.2011.02.002 (2011).
Wang, L. et al. Trends in prevalence of diabetes and control of risk factors in diabetes among US adults, 1999–2018. JAMA https://doi.org/10.1001/jama.2021.9883 (2021).
Acknowledgements
The authors thank all the eMERGE sites for providing genomic and health information. Additionally, the authors thank the other investigators, the staff, and the participants of the REGARDS study for their valuable contributions. A full list of participating REGARDS investigators and institutions can be found at: https://www.uab.edu/soph/regardsstudy/.
Funding
The eMERGE Network was funded by the National Human Genome Research Institute (NHGRI) through the following grants: U01HG006828 (Cincinnati Children's Hospital Medical Center and Boston Children’s Hospital); U01HG006830 (Children’s Hospital of Philadelphia); U01HG006389 (Essentia Institute of Rural Health, Marshfield Clinic Research Foundation, and Pennsylvania State University); U01HG006382 (Geisinger Clinic); U01HG006375 (Group Health Cooperative and the University of Washington); U01HG006379 (Mayo Clinic); U01HG006380 (Icahn School of Medicine at Mount Sinai); U01HG006388 (Northwestern University); U01HG006378 (Vanderbilt University Medical Center); and U01HG006385 (Vanderbilt University Medical Center serving as the Coordinating Center). The eMERGE IV Mass General Brigham site was funded by the NHGRI through U01HG008685, the Columbia University site was funded through U01HG008680, and the University of Alabama at Birmingham site was funded through U01HG011167. The REGARDS (R01HL136666, MRI, LAL) and GenHAT (R01HL123782, MRI) genetic studies were supported by the National Heart, Lung, and Blood Institute (NHLBI). The parent REGARDS study was supported by cooperative agreement U01 NS041588, co-funded by the National Institute of Neurological Disorders and Stroke (NINDS) and the National Institute on Aging (NIA).The content is solely the responsibility of the authors and does not necessarily represent the official views of the NINDS or the NIA. Representatives of the NINDS were involved in the review of the manuscript but were not directly involved in the collection, management, analysis, or interpretation of the data. Other funding sources include NHLBI T32HL072757 (N.D.A.), UM1 DK078616 (J.B.M.) R01HL151855 (J.B.M.), R01HL092173 (N.A.L.), and R00AG054573 (T.G.).
Author information
Authors and Affiliations
Contributions
NDA, NAL, MRI, and HKT contributed to the concept and design of the study. AP and VS performed quality control of the genomics data. AP performed the heritability analysis. TG, EWK, MRI, and HKT provided insight into methodology. LAL, LK, BN, ASS, LJR-T, GPJ, JBM, provided interpretation of the results. MRI and HKT supervised the study. NDA, MRI, and HKW drafted the manuscript. All authors provided critical edits on subsequent versions of the manuscript and approved the final version.
Corresponding author
Ethics declarations
Competing interests
JBM is an Academic Associate with Quest Diagnostics R&D. The remaining authors declare that they have no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Armstrong, N.D., Patki, A., Srinivasasainagendra, V. et al. Variant level heritability estimates of type 2 diabetes in African Americans. Sci Rep 14, 14009 (2024). https://doi.org/10.1038/s41598-024-64711-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-024-64711-3
- Springer Nature Limited