Introduction

The first reported case of a pneumonia-like illness later identified as COVID-19 occurred in December 2019 in Wuhan, China [1]. Since January 2020, there have been over 770 million cases of COVID-19 across the world, with more than 100 million confirmed cases in the U.S. [2]. These cases have resulted in over 6 million COVID-19 hospitalizations, and over 1 million COVID-19 deaths in the U.S. to date [3]. Recent studies on patient care and outcomes of COVID-19 patients are providing insight into the national burden that COVID-19 had in the U.S. [4,5,6,7,8,9]. As with many medical conditions, healthcare and socioeconomic disparities disproportionately affect marginalized populations and put them at higher risk for morbidity and mortality [10,11,12,13]. As highlighted by the CDC, interpersonal and structural racism negatively affects marginalized groups and is pervasive among all aspects of their health and potential [14]. Published research finds Black, Native American, and Hispanic patients were at higher risk for adverse outcomes and mortality related to COVID-19 [4,5,6,7,8,9]. However, as populations in different race and ethnic groups may have varying age distributions, the risk of disease and associated outcomes can differ by age [15]. In particular, marginalized race and ethnic groups in the U.S. tend to have younger age distributions than their White counterparts [16, 17]. Thus, appropriately age-standardized national estimates are still needed in order to remove confounding related to age. This allows for more accurate comparisons of COVID-19 hospitalization and mortality rates, as well as other outcomes and complications, between different race and ethnic groups.

To better understand the national health impact of the first year of the COVID-19 pandemic and health equity in treatment access and outcomes, we use nationally representative data standardized to the 2000 U.S. standard population to evaluate age-adjusted racial and ethnic disparities in COVID-19 hospitalizations, comorbid risk factors, hospital procedures performed, and inpatient outcomes. Our analysis adds to the current literature by providing in-depth detail regarding appropriately age-standardized outcomes, procedures, and complications of COVID-19 hospitalizations among distinct race and ethnic groups.

Methods

Study Design and Database

This study is a cross-sectional observational study using nationally representative data from the 2020 National Inpatient Sample (NIS). As part of the Healthcare Research and Quality Healthcare Cost and Utilization Project (HCUP), the NIS is a publicly available de-identified database containing information on healthcare utilization, cost, and outcomes for inpatient hospitalization. The NIS database samples inpatient data from 20% of participating hospitals across the nation, and is able to estimate > 97% of all U.S. hospitalizations [18]. All estimations and analyses were performed using the recommended NIS trend weights [19]. NIS data from the year 2020 is the first nationally representative all-payer database to provide a perspective on events and outcomes related to the COVID-19 pandemic in U.S. hospitals. The NIS does not account for readmissions and each observation is coded as a unique hospitalization. HCUP databases, including the Clinical Classifications Software Refined and Elixhauser Comorbidity Software Refined databases, were used to group ICD-10 diagnostic and procedure codes into meaningful categories for analysis (Supplementary Table 1) [20, 21].

Study Population

All COVID-19 hospitalizations for children and adults ages 0 to 124 from the NIS database from the year 2020 were included. COVID-19 hospitalizations were further characterized as primary COVID-19 hospitalizations if the first listed discharge code (ICD-10 Code U07.1) was for COVID-19, and secondary COVID-19 hospitalizations if COVID-19 was included in any hospitalization code excluding the primary position. COVID-19 hospitalizations were categorized as primary versus secondary hospitalizations in an attempt to distinguish hospitalizations that were primarily affected by COVID-19 as opposed to another primary diagnosis in which the patient was concurrently diagnosed with COVID-19. NIS race and ethnicity categories used include non-Hispanic White, non-Hispanic Black, Hispanic, non-Hispanic Asian-American and Pacific Islander (AAPI), non-Hispanic Native American, and non-Hispanic Other. As defined by HCUP, other race and ethnicity include patients who identify with multiple races or race and ethnicity other than White, Black, Hispanic, AAPI, and Native American. If data on race and ethnicity were available for each observation in the NIS database, ethnicity took precedence over race. Data from missing race and ethnicity was excluded from the analysis [22].

The CDC created a COVID-19 hospitalization surveillance network (COVID-NET) to track numbers and rates of COVID-19 hospitalizations on March 1, 2020 [1, 23]. We excluded COVID-19 codes prior to March 1, 2020, as they were likely coding errors.

Outcomes

The primary outcome of this study was to estimate disparities in the national age-standardized COVID-19 hospitalization rates, complications, medical procedures, and mortality rates by race and ethnicity.

Secondary outcomes included investigating disparities in baseline patient and hospital characteristics, comorbid conditions, and disposition among COVID-19 hospitalizations. Data on median income was estimated by neighborhood zip code provided by the NIS database. To estimate hospitalization costs, we utilized the total charges and adjusted by cost-to-charge ratio, as supplied by HCUP. All endpoints assessed in this study were analyzed for COVID-19 hospitalizations utilizing ICD-10 diagnostic and procedure codes for outcomes of interest. All ICD-10 codes utilized in this study are listed in Supplementary Table 1.

Statistical Analysis

Descriptive statistics are reported for patient and hospital characteristics, common comorbidities, and inpatient mortality among COVID-19 hospitalizations. Direct standardization with single-year of life age adjustments was used to age-adjust rates relative to the 2000 U.S. standard population, as recommended by the National Center for Health Statistics. Race and ethnicity data from the 2020 United States Census Bureau was further used to calculate crude and age-standardized hospitalization rates per 100,000 people. The race and ethnicity categories extracted from the Census Bureau correlated to those defined by HCUP, and included Hispanic, non-Hispanic White, non-Hispanic Black, non-Hispanic AAPI, non-Hispanic Native American, and non-Hispanic Other, which consists of patients of multiple races or race/ethnicity other than White, Black, Hispanic, AAPI, and Native American [24]. Standardized hospitalization rate ratios and excess rates for the race and ethnic subgroups were estimated, utilizing White patients as the reference group. In particular, rate ratio was defined as the ratio of age-standardized hospitalization rate of the race/ethnic subgroup, compared to White patients. Excess hospitalization rate was defined as the absolute difference of age-standardized hospitalization rate between the race/ethnic subgroup, compared to White patients. Direct age-standardization was calculated to remove confounding related to age, allowing for more accurate comparisons between different race and ethnic groups. We used multivariable logistic regression using a restricted cubic spline model to adjust for age to evaluate the independent association of prespecified risk factors, such as insurance status, median income level, race and ethnicity, hospital location/division, and comorbid conditions, on inpatient mortality among COVID-19 hospitalizations [25]. All estimations were performed with appropriate NIS survey weights and cluster adjustment to account for the sampling strategy. All statistical analyses were done in Stata 18.0 (StataCorp, College Station, TX).

The UCLA institutional review board granted an oversight exemption given the use of de-identified, public data.

Results

Demographic and Clinical Characteristics Associated with COVID-19 Hospitalizations

In the 2020 NIS, there were 1,058,815 primary COVID-19 hospitalizations in children and adults ages 0 to 124, of which 8775 were pediatric hospitalizations, and 1,050,040 were adult hospitalizations. There were 1030 primary COVID hospitalizations that were excluded due to missing demographic data. Of those hospitalized with primary COVID-19, 47.2% were female. With respect to race and ethnicity, 52.5% of patients were White, 20.7% Hispanic, 18.5% Black, 4.1% other race, 3.2% AAPI, and 1.0% Native American. The median age was 66 (IQR: 54, 77). White patients admitted for primary COVID-19 tended to be older than those from other race and ethnic groups. Overall, most primary COVID-19 hospitalizations occurred in the south (Table 1). Native American and Black patients had lower median income compared to other race and ethnic groups. Among the race and ethnic groups, AAPI patients with primary COVID-19 had the highest proportion of patients with private insurance, whereas White and Native American patients with primary COVID-19 had the highest proportion of patients with public insurance (Table 1). The most common comorbidities included hypertension, diabetes, and dyslipidemia. Heart failure, atrial fibrillation/flutter, coronary artery disease, chronic kidney disease, and chronic obstructive pulmonary disease (COPD) were also prevalent. Among the different race and ethnic groups, prevalence of these comorbidities varied (Table 1).

Table 1 Baseline patient characteristics/demographics and hospital characteristics for primary COVID-19, by race and ethnicity

Characteristics of Hospital Stays for COVID-19

The mean length of stay was 7.5 days (95% CI 7.4–7.5). Hispanic, Native American, and Other patients had longer mean lengths of stay (Table 2). Hispanic and Other patients admitted for primary COVID-19 had the highest mean cost of stay (Table 2). Total inpatient cost for primary COVID-19 hospitalizations from March 2020 through December 2020 was $20,033,334,264. Only 54.9% of all hospitalized patients were discharged home, with 16.6% being transferred to another care facility such as a skilled nursing facility, and 13.3% requiring home healthcare. Age-adjusted rates used for comparison between race and ethnic groups showed that Native American patients had the least age-adjusted proportion of discharges home, the greatest age-adjusted proportion of discharges to a short-term hospital, and the greatest age-adjusted proportion of discharges against medical advice. AAPI patients had the greatest age-adjusted proportion of discharges home and the lowest age-adjusted proportion of discharges against medical advice. White patients had the greatest age-adjusted proportion of discharges to a skilled nursing facility. Black patients had the greatest age-adjusted rates of patients requiring home healthcare (Table 2).

Table 2 Hospital procedures, complications, and disposition for primary COVID-19 by race and ethnicity, unadjusted (age-adjusted)a

Hospital Complications and Procedures in COVID-19 Hospitalizations

Acute renal failure, acute thromboembolic events, and acute respiratory failure, requiring noninvasive and/or mechanical ventilation, were the most common inpatient complications. Black patients with primary COVID-19 had the highest age-adjusted rates for acute renal failure, acute cerebrovascular event, and acute thromboembolic event. Native American patients had the highest age-adjusted rates of acute myocardial infarction (MI), cardiac arrest, and acute respiratory failure. AAPI patients had the lowest age-adjusted rates of acute liver failure, acute thromboembolic event, and shock (Table 2). Native American patients with primary COVID-19 also had higher age-adjusted procedure rates, including mechanical ventilation, mechanical circulatory support/extracorporeal membrane oxygenation (MCS/ECMO), cardioversion, hemodialysis, and continuous renal replacement therapy (CRRT) compared to other race and ethnic groups. Black patients had the highest age-adjusted rates of cardiopulmonary resuscitation (CPR) and blood transfusions. AAPI patients had the highest age-adjusted rates of vasopressor infusion (Table 2).

Age-Adjusted Incidence of Hospitalization and In-Hospital Mortality

Throughout 2020, there was a general uptrend in the number of COVID-19 admissions and downtrend in inpatient deaths with hospitalizations occurring in early 2020 exhibiting higher rates of inpatient mortality (Supplementary Fig. 1). Males had a higher age-adjusted COVID-19 hospitalization rate, at 313.3 per 100,000 than females with 237.8 per 100,000. Black, Hispanic, and Native American patients had higher crude and age-adjusted primary COVID-19 hospitalization rates compared to White patients (Table 3).

Table 3 Disparity measures of the age-standardized hospitalization rate for primary COVID-19 by sex, race, and ethnicity

Overall inpatient mortality rate was 11.1% (Table 2). Patients of Native American, Hispanic, and other race and ethnicity had higher age-adjusted mortality rates compared to their counterparts, at 8.7%, 4.7%, and 4.9%, respectively (Table 2). Black and AAPI primary COVID-19 patients had similar age-adjusted inpatient mortality rates despite the lower age-adjusted primary COVID-19 hospitalization rate among AAPI patients (Tables 2 and 3). A sub-analysis of crude COVID-19 mortality rates was performed, stratified by median income among the different race and ethnic groups (Supplementary Table 2). Among COVID-19 patients in the lowest median income quartile, Black and Native American patients were found to have the highest proportion of inpatient deaths, while AAPI patients had the lowest proportion of deaths (Supplementary Table 2).

Multivariable logistic regression models using a restricted cubic spline model to adjust for age were created to assess the independent association of prespecified risk factors with inpatient mortality (Table 4). Factors associated with inpatient mortality include male sex, heart failure, atrial fibrillation/flutter, ventricular tachycardia, cerebrovascular disease, coagulopathy, obesity, COPD, and chronic liver disease. Patients with lower income and public health insurance were at increased odds of inpatient death (Table 4). Native American patients were at the highest odds of inpatient mortality relative to White patients after adjusting for other factors and comorbidities (Table 4). Although Black patients had higher age-adjusted primary COVID-19 hospitalization rates compared to White patients, they were at decreased odds of inpatient mortality (Tables 3 and 4). There was a non-linear increasing risk for inpatient mortality with age (Supplementary Fig. 2).

Table 4 Patient factors associated with inpatient mortality for primary COVID-19 hospitalizations

Secondary COVID-19 Hospitalizations

There was a total of 620,180 secondary COVID-19 hospitalizations in children and adults ages 0 to 124 in 2020, with increased mortality rates early on in 2020 (Supplementary Fig. 3). Of those 620,180 secondary COVID-19 hospitalizations, 11,185 were pediatric hospitalizations and 608,995 were adult hospitalizations. There were 1565 secondary COVID hospitalizations that were excluded due to missing demographic data. Total inpatient costs for secondary COVID-19 hospitalizations from March 2020 through December 2020 were $15,773,031,618. Similar trends in patient and hospital characteristics, as well as hospital procedures, complications, and outcomes, were observed for secondary COVID-19 (Supplementary Table 3 for baseline patient/hospital characteristics, Supplementary Table 4 for procedures and outcomes, Supplementary Table 5 for age-standardized hospitalization rates and Supplementary Table 6 for risk factors associated with inpatient mortality).

Discussion

There have been many early studies, including multi-center studies and systemic reviews, that have reported disparities in COVID-19 hospitalizations and death in non-White patients, particularly among Black and Hispanic patients [26, 27]. Recent publications utilizing the 2020 NIS database have been insightful in describing racial and ethnic disparities using nationally representative data. However, these studies focused primarily on adult hospitalizations and did not utilize direct standardization to adjust for age when comparing different race and ethnic groups [4, 6]. Thus, any confounding bias attributed to age may skew the perceived disparities reported by these studies. Using the COVID-NET database, CDC estimates have utilized direct standardization to report greater age-standardized COVID-19 hospitalization rates among Black, Hispanic, and Native American adult patients [28,29,30]. However, these estimations were standardized to the 2019 U.S. intercensal population, and not the 2000 U.S. standard population, which is the standard population recommended by the National Center for Health Statistics. To help address this gap in the literature, our study implements direct-standardization of rates among all-age pediatric and adult COVID-19 hospitalizations to estimate age-adjusted rates standardized to the 2000 U.S. standard population. When appropriately standardizing COVID-19 hospitalization rates by race and ethnic national populations, we found Black, Hispanic, and Native American patients had over double the hospitalization rate when compared to the national White population. AAPI patients had roughly four-fifths of the primary COVID-19 hospitalization rate compared to the White national population.

Compared to previous studies, our study also allows a more accurate and detailed comparison of inpatient procedures, complications, and outcomes between race and ethnic groups by adjusting for age [5,6,7, 27,28,29,30,31,32,33]. While crude inpatient mortality was highest for White and Native American patients, age-adjusted mortality rates found White patients had the lowest mortality rate and Native American patients had the highest risk. With appropriate age adjustment, we find that Black, Hispanic, and Native American patients not only had higher hospitalization rates, but also worse outcomes. Many prior studies utilized race groups that combined AAPI, Native American, and other races into one joint group, losing the granularity of detail for outcomes in each of these specific race groups [6, 27]. For example, using data from the 2020 NIS database, Vardar et al. report crude rates of 59.1%, 51.5%, 59.8%, and 60.3% for acute respiratory failure in White, Black, Hispanic, and Other adult patients, respectively [6]. Our study reports similar crude rates of acute respiratory failure for White, Black, Hispanic, AAPI, Native American, and Other patients. However, after adjusting for age, we find that Native American patients were at highest risk of acute respiratory failure at 64.5% compared to 45.7%, 45.2%, 40.9%, 48.6%, 49.6%, and 47.2% for White, Black, Hispanic, AAPI, and Other patients, respectively. We report similar disparities for other age-adjusted inpatient complications and procedures among marginalized communities with COVID-19, as described in Table 2.

As emphasized by the CDC, interpersonal, structural, and systemic racism greatly contribute to health inequities that impact non-White persons in the U.S. Marginalized populations are at increased risk for adverse health outcomes [10,11,12,13,14]. They are at increased risk for chronic comorbid conditions, lower access to healthcare, and lower working and neighborhood living conditions relative to White populations in the U.S. [8]. The large difference in COVID-19 hospitalization burden that we noted in our study likely relates to increased transmission related to employment differences and household and neighborhood characteristics. Working adults without flexible sick leave are more likely to work while sick, and spread transmissible illness among their coworkers, clients, and/or customers [34]. Prior research has suggested that essential workers are at increased risk for COVID-19 infection and transmission due to occupational exposures and the nature of their work. Among non-White individuals, Black and Hispanic individuals are more likely to hold essential worker positions and have been disproportionately impacted by COVID-19 [35,36,37]. In many marginalized communities, multiple generations live in a single household, putting the young and elderly at increased risk of COVID-19 infection [38]. Access to and usage of high-quality personal protective equipment may also contribute to differences in infection rates between national populations.

Moreover, socioeconomic status is a multi-dimensional concept that is strongly influenced by the social construct of race and racism. Differences in socioeconomic status among different race and ethnic groups in turn play a major role in health disparities [39, 40]. Various studies have suggested that the number of deaths due to social determinants of health, including low education, racial segregation, and income equality, is comparable to the total number of deaths from leading medical conditions, such as acute MI, cerebrovascular disease, and lung cancer [41, 42]. In one study, poverty was demonstrated to be the strongest socioeconomic factor for heart failure mortality and was likely mediated by chronic illnesses such as diabetes and obesity [43]. Although a thorough analysis of socioeconomic factors could not be assessed in our study due to the inherent limitations of the NIS database, we did find that differences in neighborhood household income and Medicaid insurance status were associated with higher odds of inpatient mortality. In our study, AAPI patients were noted to live in higher neighborhood income quartiles, had higher rates of private insurance, and had the lowest age-adjusted rates of inpatient mortality. Furthermore, among COVID-19 patients in the lowest median income quartile, Black, Hispanic, and Native American patients had the highest proportion of deaths, while AAPI patients had the lowest proportion of deaths in this income quartile.

Among the COVID-19 healthcare disparities noted in our study, of particular note is the marked discrepancies demonstrated in Native American patients. As highlighted in previous studies, medical mistrust and lower satisfaction are prevalent among Native American patients in the U.S. [44]. This is partly due to the unfortunate history of displacement of indigenous people from their lands, as well as unethical research among members of their community [45, 46]. These factors may very well be contributing to the marked disparities in COVID-19 hospitalizations demonstrated in our study. As such, an understanding of the history and culture of indigenous patients, as well as further research, is encouraged to address these health inequities.

In conclusion, the large disparities in hospitalization and mortality among different race and ethnic groups during the COVID-19 pandemic were likely driven by many of the same factors of systemic and structural racism that have afflicted marginalized communities for centuries [10,11,12,13, 47,48,49,50]. Developing more direct pandemic response for marginalized populations is encouraged for all pandemic responses and communicable disease response. Programs and policies should aim at investigating and addressing the root causes for health inequities among vulnerable groups. It is imperative to take into account the history of racism and its contributions in creating structural barriers to education, employment, and health and how this impacts the wellness of individuals and communities as a whole. Future research into the status and availability of COVID-19 vaccinations and therapies among marginalized populations may also be insightful.

Limitations

This study was performed using nationally representative claims data. Inaccurate administrative coding may weaken the estimates we have provided. Administrative codes do not indicate whether a secondary condition was present on admission or developed during hospitalization. This may overestimate the prevalence of certain inpatient complications or baseline comorbid conditions. In addition, data from the NIS does not account for readmissions. Each observation is coded as a unique hospitalization and may affect the outcomes analyzed in this study. Furthermore, the collection and reporting of race and ethnicity data from the NIS can vary greatly by hospital, from self-identification to presumed identity. Data for the AAPI and other race/ethnic categories as defined by NIS and utilized in this study are limited by the problem of data aggregation of very different populations. Thus, reported differences may be difficult to quantify in a meaningful way. Furthermore, as the sample size for each race and ethnic group assessed in this study varied significantly, outcomes analyzed may be at increased risk of false positives. As the NIS database only provides information on inpatient factors and outcomes, residual measured and unmeasured confounding factors may bias estimates and/or limit the conclusions that can be made from this study. As a result, our study may not fully reflect the potential systemic and structural racism factors that affect COVID-19 health outcomes. Further research into detailed social determinant of health factors affecting COVID-19 outcomes would be beneficial. This study was also completed using data early in the COVID-19 pandemic when patterns of care were rapidly changing and there was limited and shifting evidence on what interventions and treatments were successful.

We tried to identify the impact that COVID-19 had on patients with primary or secondary COVID-19. Initially, we had hypothesized that patients with primary COVID-19 diagnoses may have been more significantly affected by the infection, leading to COVID-19 as the primary diagnosis of their hospitalization. On the other hand, patients who were admitted for a different primary diagnosis and had secondary COVID-19 diagnoses may have experienced more complications and mortality due to underlying acute disease. However, given the variability with administrative coding, our reported data on primary and secondary COVID-19 diagnoses may not be truly reflective of differences between these hospitalizations.

Conclusions

In summary, marginalized populations in the U.S. faced the largest age-adjusted hospitalization and mortality burden related to the COVID-19 pandemic. Social differences, likely driven by intrinsic, systemic, and structural racism, led to large observable differences in disease transmission and hospitalization outcomes during a novel viral pandemic. Future pandemic planning should strive to integrate direct strategies to better protect vulnerable populations to disrupt viral transmission while new treatments and vaccines are being developed.

Supplementary Information

Supplementary Fig. 1 Figure showing number of primary COVID-19 admissions and mortality rate, by month.

aMortality rate defined as the number of primary COVID-19 deaths/total number of primary COVID-19 admissions.

Supplementary Fig. 2 Age spline model, showing non-linear relationship between age and odds of inpatient mortality.

Supplementary Fig. 3 Figure showing number of secondary COVID-19 admissions and mortality rate, by month.

aMortality rate defined as the number of secondary COVID-19 deaths/total number of secondary COVID-19 admissions.