INTRODUCTION

Reports of hospitalized patients with COVID-19 during the early months of the global pandemic noted a high proportion of patients with diabetes, and also that diabetes was an independent risk factor for progression to acute respiratory distress syndrome and death.1,2,3,4 Later studies have reaffirmed these findings. For instance, in the COVID-19-Associated Hospitalization Surveillance Network, 33% of hospitalized patients with COVID-19 had diabetes compared with 11% of the US population,5 and diabetes was an independent risk factor for intensive care unit admission and death.6 The excess mortality related to diabetes and COVID-19 has been staggering. In England, deaths in the first half of 2020 compared with historical figures were 51% higher among individuals with type 1 diabetes and 64% higher among individuals with type 2 diabetes; two-thirds of these excess deaths were attributed to COVID-19 on death certificates.7

Although it is now well established that diabetes is a potent risk factor for COVID-19 mortality, there are major gaps in our understanding of this relationship. Nearly all studies of diabetes and COVID-19 have included only hospitalized patients,8,9 which is likely to introduce selection bias that can result in spurious associations or obscure genuine ones. This study design also precludes an examination of diabetes as a potential risk factor for COVID-19 infection, rather than complications such as critical illness or mortality.10 In addition, little is known about factors that might explain variability in COVID-19 risk among individuals with diabetes, such as glycemic control and obesity. A review of the epidemiology of diabetes and COVID-19 at the outset of the pandemic noted the lack of population- or community-based studies addressing these questions.11 More than 2 years into the pandemic, they remain largely unanswered.12

In the setting of three large integrated healthcare systems with comprehensive information on diagnoses, hospitalizations, laboratory test results, and out-of-hospital deaths, we conducted a population-based cohort study to investigate diabetes and measures of diabetes severity as risk factors for two outcomes: COVID-19 infection and severe COVID-19. In contrast with previous studies, which have studied intensive care unit admission as a surrogate for severe COVID-19, we used a validated algorithm for invasive mechanical ventilation and state death certificate data to define this outcome. Because of the important role obesity plays in the development of both diabetes and COVID-19 complications, we also investigated the independent relationship between body mass index (BMI) and study outcomes in patients with diabetes after adjustment for several diabetes severity measures.

METHODS

Population

This study included adults aged 18 years or greater who were members of Kaiser Permanente (KP)—an integrated health care delivery system providing both health care and insurance coverage—in one of three KP regions: Colorado, Washington, and Northwest (includes Oregon and southwest Washington). Cohort inclusion criteria were continuous enrollment for at least 6 months and no COVID-19 prior to the cohort entry date of February 29, 2020. Data sources included electronic health records (EHRs) and laboratory test results from outpatient, emergency department, and inpatient facilities; health system administrative and billing information; and state death certificates. Individuals were followed until the time of their first COVID-19 outcome, loss of enrollment, death, or end of follow-up on February 28, 2021. KP Institutional Review Boards approved the study.

Outcomes

The two outcomes in this study were COVID-19 infection and severe COVID-19. COVID-19 was defined as the first occurrence of a (1) positive nucleic acid antigen test (NAAT) for SARS-CoV-2, (2) hospitalization with an International Classification Diseases version 10 (ICD-10) diagnosis code for COVID-19 (B342, B9721, B9729, U071, U072), or (3) death certificate with COVID-19 listed as a cause of death. Severe COVID-19 was defined as the first occurrence of a COVID-19 hospitalization involving invasive mechanical ventilation or a COVID-19 death. COVID-19 hospitalizations included hospitalizations with an ICD-10 code for COVID-19 and any hospitalization within 28 days of the initial COVID-19 diagnosis. COVID-19 death was defined as a COVID-19 hospitalization with a discharge status of death, a death with COVID-19 listed as a cause of death, or any death within 28 days of an initial COVID-19 diagnosis. In a previous publication, we reported results of a validation study (n = 76): the positive predictive (PPV) value for confirmed COVID-19 infection from hospitalizations with COVID-19 diagnosis codes was 96% (95% confidence interval [CI] 89–99%), and the PPV for invasive mechanical ventilation identified through diagnosis and procedure codes was 100% (95% CI 86–100%).13

Exposures

The exposures of interest were diabetes, glycemic control, treatment intensity, presence of diabetes complications, and BMI. The definition of diabetes, assessed prior to cohort entry and updated at the start of each month of follow-up, was adapted from a previous study: an ICD-9 or ICD-10 diagnosis code for diabetes mellitus (250.x, E10, E11, E13) in an inpatient or at least 2 outpatient claims, a laboratory measure of hemoglobin A1c (HbA1c) ≥ 6.5%, or a pharmacy fill of a diabetes medication.14 Type 1 diabetes was defined as the presence of an ICD-9 (250. × 1 or 250. × 3) or ICD-10 code (E10) for type 1 diabetes; all other diabetes was considered type 2. Diabetes with complications was defined using ICD-9 and ICD-10 codes from the Charlson index (250.4–250.63, E10.2-E10.5, E10.61-E10.619, E11.2-E11.5, E11.61-E11.619, E13.2-E13.4, E13.61-E13.619).15 HbA1c was assessed each month based on the most recent measure in the prior 12 months and categorized as < 7%, 7– < 9%, ≥ 9%, or missing. Current treatment (insulin, non-insulin diabetes medication only, no medication) was determined each month based on whether the days’ supply overlapped with the start of the month for non-insulin medications, and whether the medication was filled in the prior 6 months for insulin.

Covariates

As described previously,13 demographic variables at cohort entry were ascertained from the EHR: age, sex, self-reported race/ethnicity, and neighborhood deprivation index (NDI).16 Smoking status was categorized as current, former, or never (including individuals with no smoking information recorded). BMI at cohort entry was categorized as underweight, normal, overweight, and obese class I, II, and III based on the ranges < 18.5, 18.5– < 25, 25– < 30, 30– < 35, 35– < 40, and ≥ 40 kg/m2. The following comorbidities were ascertained from ICD-10 codes in the 12 months prior to cohort entry: hypertension, myocardial infarction, peripheral vascular disease, cerebrovascular disease, heart failure, atrial fibrillation and flutter, liver disease, asthma, chronic obstructive pulmonary disease, rheumatological disease, dementia, and cancer.13,15 Pharmacy records were used to assess time-varying measures of medication use including angiotensin-converting enzyme inhibitors, angiotensin receptor blockers, statins, and systemic corticosteroids. All covariates, as well as KP region and month of follow-up, were included as adjustment variables in all statistical models. Age was parameterized with natural cubic splines; other adjustment covariates were categorical.

Statistical Analysis

We estimated odds ratios (ORs) and 95% CIs from a series of covariate-adjusted logistic regression models relating COVID-19 outcomes in each month of follow-up to exposure variables assessed at the start of that month. These models incorporated inverse probability weights to account for missing data on race/ethnicity, NDI, and BMI, and to account for censoring due to disenrollment during follow-up. We estimated logistic regression model parameters using weighted generalized estimating equations with a working correlation matrix and with standard errors calculated via the sandwich estimator to account for the missing data and censoring weights and within-person correlation across time.17,18 We first fit models for the outcome of COVID-19 infection and then repeated the same models for the outcome of severe COVID-19. All models were estimated in the entire study population and compared individuals with the exposure of interest (diabetes or category of severity measure) with a referent group of individuals without diabetes. HbA1c levels were also fit with natural cubic splines to allow for estimation of a smooth, potentially non-linear relationship between HbA1c and odds of study outcomes. For analyses of BMI, we restricted our sample to individuals with diabetes and estimated covariate-adjusted models relating COVID-19 outcomes to BMI categories, before and after adjustment for all diabetes severity measures. For analyses of the outcome of severe COVID-19, because of the limited number of events, we used fewer age spline parameters and combined two race/ethnicity categories.

Before June 2020, NAAT testing for SARS-CoV-2 was not widely available in all geographic regions included in this study. To evaluate the impact of testing bias, we conducted sensitivity analyses in which observation for COVID-19 outcomes began on June 1, 2020, and individuals with COVID-19 before that date were excluded. We also evaluated a definition of type 1 diabetes that excluded individuals who used non-insulin glucose-lowering medications and we estimated associations for severe COVID-19 restricted to individuals with COVID-19 infection, an approach that has been used often in previous studies but may be susceptible to collider bias.10 All analyses were conducted using R, version 4.0.2.

RESULTS

Of 1,086,918 eligible individuals (33.2% Colorado, 38.3% Northwest, 28.5% Washington), 53.2% were female, 13.1% had diabetes, and 92.7% were enrolled for at least 12 months at cohort entry. Compared with individuals who did not have diabetes, individuals with diabetes were older (mean age 62.8 vs 48.7 years), more likely to be Black (5.8 vs 3.6%) or Hispanic/Latino (12.2 vs 8.7%), and more likely to live in the highest quartile of NDI (29.8 vs 23.6%), indicating greater social deprivation. Individuals with diabetes also had a higher mean BMI (33.0 vs 28.4 kg/m2) and a greater burden of comorbidities (Table 1). Among individuals with diabetes, 5.9% were classified as type 1 diabetes, 35.7% had diabetes complications, the mean HbA1c was 7.1%, and 16.8% did not have information on HbA1c. Most received some form of diabetes treatment (36.7% non-insulin drugs only, 20.8% insulin). Among the 907,263 (83.5%) individuals with complete data on race/ethnicity, NDI, and BMI, 560,946 NAAT tests for COVID-19 were done; the prevalence of diabetes among individuals with positive tests was 16.4%, and among those with only negative tests it was 15.0%. Censoring due to loss of enrollment or non-COVID death occurred in 112,248 (12.4%) individuals, 30,935 (3.4%) had COVID-19 including 3906 hospitalizations, and 996 (0.1%) met criteria for severe COVID-19 including 752 COVID-19 deaths.

Table 1 Characteristics of study population at cohort entry

Compared with the reference group with no diabetes, diabetes was associated with an increased risk of COVID-19 (OR of 1.27; 95% CI, 1.23–1.31, Table 2). Odds ratios for adjustment variables are displayed in Appendix Table 1. There was strong statistical evidence the ORs for COVID-19 infection differed based on the presence of diabetes complications, treatment intensity, and HbA1c levels (P < 0.001 for each measure), although the differences in these ORs were not large in magnitude. For instance, individuals receiving insulin treatment had a greater risk of COVID-19 (OR 1.43; 95% CI, 1.34–1.52) than individuals receiving only non-insulin treatment (OR 1.26; 95% CI, 1.20–1.33) and untreated individuals (OR 1.24; 95% CI, 1.18–1.29). The relationship between COVID-19 risk and HbA1c category was dose-dependent; cubic spline modeling suggested a linear relationship up to an HbA1c of approximately 8% and then a plateau (Fig. 1a).

Table 2 Associations of diabetes and diabetes severity measures with COVID-19 of any severity
Fig. 1
figure 1

Association of hemoglobin A1c levels with a COVID-19 of any severity, and b severe COVID-19. Modeled with cubic splines, confidence intervals shaded in gray. The reference group is individuals without diabetes

Associations with diabetes and diabetes severity measures were greater in magnitude for severe COVID-19. Compared with the reference group with no diabetes, diabetes was associated with nearly twofold increased risk of severe COVID-19 (OR 1.85; 95% CI, 1.59–2.14, Table 3). Odds ratios for adjustment variables are displayed in Appendix Table 2. Type 1 diabetes was associated with a greater increased risk (OR 2.87; 95% CI, 1.99–4.15) than type 2 diabetes (OR 1.80; 95% CI, 1.55–2.09). Insulin use was associated with a greater increased risk (OR 2.65; 95% CI, 2.13–3.28) than use of only non-insulin drugs (OR 1.69; 95% CI, 1.36–2.09) and no treatment (OR 1.68; 95% CI, 1.41–2.01), which had similar risks. There was a strong, dose-dependent relationship between A1c category and risk of severe COVID-19. Cubic spline modeling suggested a linear relationship between an HbA1c of approximately 6 to 8%, with slightly greater risks below 6% and a plateau above 8% (Fig. 1b).

Table 3 Associations of diabetes and diabetes severity measures with severe COVID-19

In analyses restricted to individuals with diabetes, we evaluated associations between BMI categories and COVID-19 outcomes. Compared with the reference group of normal BMI (18 to < 25 kg/m2), there was a graded relationship between BMI category and COVID-19 risk (Wald P < 0.001, Table 4), ranging from OR 1.10 (95% CI, 1.00–1.22) for overweight (BMI 25 to < 30 kg/m2) to OR 1.32 (95% CI, 1.19–1.47) for obese III (BMI 40 ≥ kg/m2). BMI was also associated with an increased risk of severe COVID-19 in a dose-dependent manner (P < 0.001), and the associations were greater in magnitude, ranging from an OR 1.03 (95% CI, 0.76–1.41) for overweight to OR 2.30 (95% CI, 1.59–3.33) for obese III. When analyses were repeated adjusting for all diabetes severity measures, associations were unchanged for both outcomes.

Table 4 Associations of body mass index (BMI) with COVID-19 outcomes among individuals with diabetes

Sensitivity analyses that excluded the period when NAAT testing for SARS-CoV-2 was not widely available yielded estimates that were slightly attenuated but statistically significant for COVID-19 infection, and estimates that were slightly greater in magnitude for severe COVID-19 (Appendix Tables 3 and 4). Excluding individuals classified as type 1 diabetes who used non-insulin glucose-lowering medications (n = 961) resulted in similar results (Appendix Table 5). Analyses of severe COVID-19 that included only individuals with COVID-19 infection yielded slightly attenuated associations compared to the primary analysis (Appendix Tables 6 and 7).

DISCUSSION

In this community-based study of COVID-19 outcomes conducted in three large integrated healthcare systems, diabetes was not only a strong risk factor for severe COVID-19 but also independently associated with COVID-19 infection of any severity, after adjustment for demographic factors, neighborhood measures of socioeconomic status, BMI, and comorbidities. Among individuals with diabetes, several diabetes markers of severity were associated with substantial increases in risk, especially for the outcome of severe COVID-19. Even individuals with good glycemic control (HbA1c < 7%) according to current treatment guidelines19 had an increased risk of COVID-19 and severe COVID-19 compared to individuals without diabetes. We also identified a strong dose-dependent relationship between BMI and COVID-19 outcomes among individuals with diabetes after adjusting for all diabetes severity measures, supporting evidence from Mendelian randomization experiments that obesity is an important etiologic factor for adverse COVID-19 outcomes.20,21

Our results are consistent with findings from previous studies of severe COVID-19, all of which included data from early in the global pandemic (through mid-2020). Two population-based studies from England found evidence of a graded relationship between HbA1c levels and the risk of COVID-19 death.7,22 A study of national health records from Scotland found that diabetes, HbA1c levels, and number of diabetes medications prescribed were associated with increased risks of fatal or intensive care unit (ICU)-treated COVID-19.23 Among veterans receiving healthcare from the Department of Veterans Affairs with a positive NAAT test for SARS-CoV-2, diabetes was associated with increased risks of hospitalization, ICU admission, and death within 30 days, and the risks were greater among individuals receiving insulin treatment.24 In contrast with earlier reports that severe COVID-19 outcomes are similar among individuals with type 1 and type 2 diabetes,23,25 we found that type 1 diabetes was associated with 2.9-fold (95% CI 2.0–4.2) risk of severe COVID-19 compared with a 1.8-fold (95% CI 1.6–2.1) risk for type 2 diabetes (P = 0.01 for difference).

Our study is one of the few to evaluate diabetes and diabetes severity measures as risk factors for a validated severe COVID-19 outcome, without conditioning on an initial COVID-19 diagnosis or hospitalization, which can generate biased estimates of association.10 In the absence of a consensus definition of severe COVID-19,26,27 previous studies have included hospitalizations or ICU admissions, criteria which may vary considerably across geographic locations and even by hospital. Criteria for providing these levels of care have also varied over time during the pandemic. Because of concerns about the reproducibility and relevance of these subjective outcomes, we created and validated an algorithm for severe COVID-19 that included death or treatment with invasive mechanical ventilation, which is more likely to reflect respiratory and end organ failure.

Several potential mechanisms have been proposed that may link diabetes with an increased susceptibility to COVID-19 infection and severe outcomes after an infection, including a direct effect of elevated glucose levels SARS-CoV-2 replication, upregulation of harmful immune and inflammatory responses, hypercoagulability, and activation of the renin–angiotensin–aldosterone system.28 In addition, other comorbidities that associate with diabetes and the cardiovascular complications that results from diabetes are also potent risk factors for adverse COVID-19 outcomes. Whether or not diabetes has a direct biological effect on COVID-19 outcomes, it is clear that diabetes and measures of the severity of diabetes are important risk factors that can be used to identify the most patients most susceptible to infection and severe outcomes.

Obesity has been identified as a key risk factor for COVID-19-related mortality. A study of people enrolled in Kaiser Permanente Southern California and diagnosed with COVID-19 identified a strong relationship between BMI and COVID-19 death, even after adjustment for diabetes and other comorbidities.29 This finding has been replicated in the UK.30 Because of the causal role obesity plays in the development of diabetes,31,32 studies that evaluate the relationship between BMI and COVID-19 outcomes in populations with diabetes can help to disentangle the effects of these cardiometabolic risk factors on disease risk. In the national study of diabetes and COVID-19 mortality from Scotland, there was evidence of a J-shaped relationship; BMI < 20 kg/m2 and all BMI categories greater than the reference category of 20 to < 25 kg/m2 were associated with an increased risk of COVID-19 death.23 We found evidence of a strong, graded relationship between BMI and both COVID-19 infection of any severity and severe COVID-19. Because of the small number of individuals in the underweight category (BMI < 18 kg/m2), our study was underpowered to detect differences in risk compared to individuals with a normal BMI.

Our study is one of only a few to evaluate diabetes and diabetes severity measures as risk factors for developing COVID-19 of any severity. We used rich EHR data from integrated healthcare systems that provide both healthcare and insurance coverage to members, with long periods of prior enrollment and near-complete capture of COVID-19 outcomes. This rigorous study design allowed us to estimate population-based associations of diabetes risk factors with severe COVID-19, without conditioning on the presence of a COVID-19 infection or hospitalization as previous studies have done. Detailed laboratory and pharmacy data allowed for ascertainment of several diabetes severity measures. Additional strengths of this study include the community-based design representing three geographic regions of the USA, the validated definition of severe COVID-19, and adjustment for comorbidities and several sociodemographic factors as potential confounding variables.

Our study also had limitations. Although we included more recent data than previous studies of diabetes and COVID-19, widespread vaccination against COVID-19 had not yet occurred and information on the delta and omicron variants is lacking.33 Also, a true physiologic endpoint of severe COVID-19 based on oxygenation levels in large-scale observational studies remains elusive. Residual confounding from factors associated with diabetes and COVID-19 outcomes and were not measured is a possibility, and associations may not reflect causal risk estimates.20,21 For example, individual-level social determinants such as income and lack of access to effective medical care may underlie some of the risk attributed to diabetes in this study. Because autoantibody testing results and age at onset of diabetes were not available, some individuals with type 2 diabetes may have been misclassified as having type 1 diabetes, and vice versa. Also, many asymptomatic and mildly symptomatic cases of COVID-19 that did not result in diagnostic testing were most likely not captured.

Patients with diabetes suffer a large proportion of the severe morbidity and mortality associated with COVID-19. Within this heterogeneous population, we have demonstrated that the magnitude of risk differs by type and severity of diabetes. Our study confirms several observations from previous population-based studies of risk factors for COVID-19 mortality and provides new evidence about how diabetes, diabetes severity measures, and obesity related to the risk of developing COVID-19 of any severity. As new SARS-CoV-2 variants emerge and new treatments and risk mitigation interventions are developed,34,35,36 these findings may help to allocate scarce resources among individuals at greatest risk and with the greatest potential to benefit from these interventions. Additional community- or population-based studies are needed to extend these findings to time periods when different SARS-CoV-2 variants predominate.