Introduction

New York City (NYC) was an early United States (U.S.) epicenter of the coronavirus disease 2019 (COVID-19) pandemic [1]. By April 30, 2020, 174,501 confirmed COVID-19 cases, 47,779 hospitalizations, and 14,905 deaths occurred among NYC residents [2]. During this time, NYC public health messaging encouraged the public to stay home [3,4,5,6,7], possibly contributing to delays in obtaining emergency care [8, 9].

Among persons hospitalized with COVID-19 in the U.S., underlying conditions, including chronic kidney disease, cardiovascular disease, immunosuppression, chronic lung disease, neurologic disorders, diabetes, and obesity, have been identified as risk factors for mortality [10,11,12,13]. NYC-based COVID-19 in-hospital mortality studies have primarily used data from single hospital systems or focused on older persons, for whom elevated mortality rates have been observed [12,13,14,15,16,17,18,19]. However, many deaths in NYC were reported among persons aged 21–64 years. By April 30, 2020, such persons accounted for nearly 26% of COVID-19–related mortality [20]. In the U.S., COVID-19 diagnoses and hospitalizations have disproportionately impacted communities of color [1, 11, 21,22,23]. While age-adjusted COVID-19 hospitalization and mortality rates in NYC have been higher among Hispanic/Latino (1060 and 298 per 100,000, respectively) and non-Hispanic Black (988 and 274 per 100,000, respectively) persons compared with non-Hispanic White (531 and 155 per 100,000, respectively) and non-Hispanic Asian (493 and 140 per 100,000, respectively) persons, higher in-hospital mortality risk among persons of color in NYC has not been previously reported [12, 13, 18, 19, 24].

We examined underlying conditions, stratified by race/ethnicity and age, to assess risk factors associated with death among NYC residents aged 21–64 years hospitalized with COVID-19. These findings can inform clinicians and public health professionals in allocating resources and tailoring preventive measures during the COVID-19 pandemic.

Methods

Data Sources

Laboratories are required to electronically report SARS-CoV-2 test results for NYC residents to the NY State Electronic Clinical Laboratory Reporting System [25]. To obtain hospitalization status, the NYC Department of Health and Mental Hygiene (DOHMH) matched patient identifiers from laboratory reports of confirmed COVID-19 cases with emergency department syndromic surveillance, the NY State Hospital Emergency Response Data System, regional health information organizations, NYC public hospitals, and DOHMH’s death registry [1]. On March 26, 2020, the DOHMH requested remote access to electronic health records (EHRs) for patients hospitalized with COVID-19 in selected NY metropolitan area hospitals. Eighty hospitals and two temporary hospitals agreed, representing facilities where 89% of decedents between February 22 and April 9, 2020, were first hospitalized with COVID-19.

Investigators used a medical chart abstraction guidance document and an investigation form to abstract patients’ demographics, substance use, occupation, past medical history, and clinical course. Substance use and occupation were inconsistently documented in EHRs and omitted from analyses. Presence of underlying conditions/symptoms was collected as “Indicated” when documented in EHRs, otherwise “Not indicated,” because absence of documentation could not be assumed to mean absence of those conditions/symptoms. Categories of underlying conditions were created for (1) conditions prevalent among persons with COVID-19 [17, 21, 26] and (2) conditions commonly abstracted by investigations that did not fit existing categories. Data quality was supported by two independent investigators performing each abstraction. Study patients were matched [27] with the tuberculosis, HIV/AIDS, and hemoglobin A1c DOHMH registries [28] using standard key deterministic algorithms, and with the hepatitis B and C registries using a patient-level unique identifier. This activity was reviewed by the CDC and was conducted consistent with applicable federal law and CDC policy [1]. The DOHMH Institutional Review Board determined this activity to be non-research public health surveillance.

Analytic Sample

We conducted a matched case-control study of NYC residents aged 21–64 years who were reported to the DOHMH with nucleic acid amplification test–confirmed COVID-19 [29] during March 13–April 9, 2020, and were hospitalized ≥1 time for reasons other than labor and delivery in facilities for which the DOHMH had EHR access. Sample size calculations for tests for two correlated proportions in a matched case-control design [30] were conducted using PASS 2019 (v19.0.3, NCSS, LLC, Kaysville, UT) (Appendix 1).

Among 15,097 eligible persons who met case-patient eligibility criteria described above, 1698 (11.2%) died in an emergency department (ED) or hospital by April 13. Of these, 45 (2.7%) were excluded because they died at an unknown location or outside of the hospital. Of the remaining 1653 eligible decedents, 350 were randomly selected, and medical chart abstraction was completed for 343 case-patients.

Control-patients met laboratory and hospitalization eligibility criteria described above, and were discharged alive from hospitalization(s). Matching was performed to reduce variance of effect estimates [31]. Control-patients were matched 2:1 to case-patients on age (± 3 years, within the bounds of 21–64) and neighborhood of residence, as defined by the NYC United Hospital Fund [32]. Control-patients who left hospitals against medical advice or who were discharged to nursing homes, group homes, or step-down facilities were considered “discharged” and were included if they met the other inclusion criteria. Those discharged to hospice or still receiving inpatient hospital care at the time of medical chart abstraction were excluded and substituted with other matched control-patients. Among 13,399 eligible persons, medical chart abstractions were completed for 686 randomly selected, matched control-patients. Through a citywide death registry match, no known deaths among control-patients were documented as of July 1, 2020, when data collection was completed.

Data Analysis

To assess generalizability, given lack of EHR access to all NY metropolitan hospitals, we calculated relative risks (RRs) and 95% confidence intervals (CIs) to compare whether key demographics of case-patients differed significantly from other decedents who died in hospitals or EDs (including hospitals for which the DOHMH did not have EHR access) not selected for the study.

All case-patients with identical matching criteria and their matched controls were pooled, totaling 260 strata. We conducted conditional logistic regression analyses to evaluate associations between selected underlying conditions and odds of death and presented matched adjusted odds ratios (aORs) and 95% CIs. All models included one condition of a priori interest and adjusted for sex (male, female) and two covariates thought to be potential confounders and significant in bivariate analyses: insurance status (private insurance, public insurance, uninsured) and marital status (married, not married).

Patient residence was assigned a neighborhood poverty level, defined as the percent of the population living in a given census tract whose household income was below the federal poverty level (FPL) per American Community Survey 2014–2018, with low poverty being <10.0% below FPL, medium poverty 10.0–19.9% below FPL, high poverty 20.0–29.9% below FPL, and very high poverty ≥30.0% below FPL [33]. Obesity, a key exposure of interest, was considered in multiple ways. Body mass index (BMI) was calculated when height and weight were known. A five-level categorical variable for obesity was created based on National Institutes of Health (NIH) criteria: “Underweight” (<18.5 kg/m2), “Normal weight” (18.5–<25.0 kg/m2), “Overweight” (25.0–<30.0 kg/m2), “Obese” (30.0–<40.0 kg/m2), and “Morbidly obese” (≥40.0 kg/m2) [34]. A two-level categorical obesity variable (“Indicated” vs. “Not indicated”) was also created; obesity was “Indicated” if BMI was ≥ 30 kg/m2 or BMI was unknown, but obesity was documented in the study patient’s chart.

We described clinical characteristics of study patients, including presentation of symptoms, diagnoses of pneumonia and acute respiratory distress syndrome, use of respiratory support, initiation of dialysis, admission to the intensive care unit, and use of inpatient treatments. We evaluated combinations of comorbidities as risk factors for death by assessing the odds of death among patients with two specified underlying conditions. Conditions chosen for these analyses were prevalent (occurring among approximately ≥10% of case-patients) and significantly associated with death when adjusting for sex, marital status, and insurance status, and for which, directionality of causation is not always clear [35,36,37,38,39]. We analyzed combinations of conditions as four-level categorical variables, with patients having neither, one, or both conditions.

To assess racial/ethnic disparities in associations between selected underlying conditions and odds of death, we stratified analyses by the following racial/ethnic groups: non-Hispanic Asian (Asian), non-Hispanic Black (Black), Hispanic/Latino (Hispanic), non-Hispanic White (White), and non-Hispanic (Other), which included non-Hispanic patients of other or multiple races. Patients of other races were included in regression models, but effect estimates for this group are not presented because of sparsity. We present one aOR for each racial/ethnic group per condition, comparing patients with the condition against patients for whom the condition was not indicated. To determine which conditions to include, we used the following criteria: any significant overall aOR with approximately ≥10% of case-patients having the condition, or, if not statistically significant, an overall aOR ≥1.5 and ≥15% of case-patients having the condition. Because sparse data within some strata could contribute to underpowered analyses, we do not present stratified aORs for specific conditions that were part of grouped categories (e.g., hypertension as a subset of heart disease). To help explain racial/ethnic disparities in COVID-19 deaths in NYC, we assessed prevalence of underlying conditions by race/ethnicity among control-patients. Given minimal data collected during the pandemic on underlying condition prevalence among non-hospitalized COVID-19 patients, control-patients were the best available proxy for NYC’s population to understand risk factors for in-hospital mortality with COVID-19. Similarly, to assess differences in associations between selected conditions and odds of death among those aged <50 years and ≥50 years, we calculated aORs restricted to each age group.

All analyses were conducted using SAS version 9.4 (Cary, NC).

Results

Distributions of age, sex, borough of residence, and neighborhood-level poverty for case-patients (n = 343) were similar to other decedents who died in hospitals or EDs (including hospitals for which the DOHMH did not have EHR access) but were not selected for the study (n = 1310). Asian race/ethnicity was the only demographic characteristic that was significantly differentially distributed between case-patients (9.1%) and decedents not selected for the study (5.0%) (RR, 1.52 [95% CI: 1.04, 2.23]).

Among study patients, the median age of both case-patients and control-patients was 56 years (range: 23–64 years) (Table 1); 95.2% of control-patients matched case-patients on age exactly (± 0 years). Overall, patients were predominantly male (65.5%), Hispanic (37.6%) or Black (29.1%), and residents of Queens (37.1%) or the Bronx (29.8%), and resided in medium poverty neighborhoods (37.0%); these characteristics were similarly distributed among case-patients and control-patients, as expected from matching on residential neighborhood. However, case-patients were more likely to be unmarried (54.9% vs. 44.6%) and less likely to have private health insurance (26.8% vs. 41.8%).

Table 1 Matched odds ratios (mOR) comparing demographic information of study patients (N = 1029), by patient status—New York City, March 13–April 9, 2020

Hospitalization Course

Most patients were known to be symptomatic prior to or during hospitalization (99.8%); the most common symptoms were fever (90.5%), shortness of breath (87.9%), and cough (86.5%) (Table 2). The median numbers of days from symptom onset to diagnosis and from symptom onset to first hospitalization were both 5 days for case-patients and 7 days for control-patients. Among case-patients, 95.3% and 58.9% had a diagnosis of pneumonia and acute respiratory distress syndrome, respectively, compared with 91.0% and 9.6% of control-patients. Most case-patients (86.3%) and control-patients (83.8%) received supplemental oxygen, and 78.7% of case-patients and 9.8% of control-patients were intubated and placed on a mechanical ventilator. Among case-patients (90.1%) and control-patients (96.5%) who did not report being on dialysis at time of hospitalization, dialysis was initiated for 13.1% and 2.5% of case-patients and control-patients respectively. ICU admission occurred for 66.7% of case-patients and 14.9% of control-patients. The most common inpatient treatments administered to case-patients and control-patients were hydroxychloroquine (88.1% and 81.5%, respectively), azithromycin (85.7% and 72.2%), corticosteroids (30.0% and 18.8%), and ascorbic acid (21.9% and 23.0%).

Table 2 Clinical information for study patients (N = 1029), by patient status—New York City, March 13–April 9, 2020

Underlying Conditions

Most case-patients (96.5%) and control-patients (86.4%) had ≥1 underlying condition (Table 3). After adjusting for sex, marital status, and insurance status, having ≥1 selected underlying condition was strongly associated with death (aOR, 4.45 [95% CI: 2.33, 8.49]). Increased odds of death were observed for patients with diabetes (aOR, 2.09 [95% CI: 1.56, 2.78]) and obesity (aOR,1.62 [95% CI: 1.22, 2.14]), particularly morbid obesity (versus patients with normal BMI (aOR, 3.07 [95% CI: 1.59, 5.90]). Heart disease was associated with an increased odds of death (aOR, 2.16 [95% CI: 1.55, 3.01]); specifically, history of heart attack, congestive heart failure, coronary artery disease, and hypertension were each significantly associated with increased odds of death. Lung disease was associated with an increased odds of death (aOR, 1.52 [95% CI: 1.08, 2.14]). Specifically, having chronic obstructive pulmonary disease/chronic bronchitis/emphysema was significantly associated with an increased odds of death (aOR, 2.40 [95% CI: 1.39, 4.16]). Asthma was not significantly associated with death (aOR, 1.22 [95% CI: 0.82, 1.81]). Patients with kidney disease had 2.51 (95% CI: 1.72, 3.68) times the odds of death compared with those without kidney disease. Other underlying conditions significantly associated with death included cancer, neurologic or neurodevelopmental condition(s), mental health condition(s), anemia, and HIV/AIDS.

Table 3 Matched adjusted odds ratios (aOR) of death with COVID-19 among study patients (N = 1029) with specific underlying medical conditions—New York City, March 13–April 9, 2020

Combinations of Underlying Conditions

Compared with having neither condition, having both heart disease and diabetes, obesity and diabetes, or hypertension and diabetes was significantly associated with an approximately threefold increased odds of death (Table 4). Patients with both heart disease and diabetes had 3.13 times (95% CI: 2.11, 4.62) the odds of death compared with those with neither heart disease nor diabetes. Patients with heart disease only had 1.97 times (95% CI: 1.30, 2.97) the odds of death and patients with diabetes only had 2.57 times (95 % CI: 1.33, 4.98) the odds of death compared with patients with neither condition. This pattern—patients with both conditions having increased odds of death compared with those having one or neither—was observed for all combinations considered.

Table 4 Matched adjusted odds ratios (aOR) of death with COVID-19 for study patients (N = 1029) with combinations of underlying medical conditions—New York City, March 13–April 9, 2020

Race/Ethnicity and Age Stratification

The eight selected underlying conditions included in our stratified analysis had aOR point estimates >1 for death among all race/ethnicities, except lung disease among White patients (aOR, 0.85 [95% CI: 0.36, 1.98]) (Table 5). Five of these conditions were most prevalent among Black control-patients, including heart disease (74.9%), obesity (61.5%), diabetes (43.8%), lung disease (24.6%), and kidney disease (13.4%), compared with control-patients of other races/ethnicities. Magnitudes of association for conditions in groups aged <50 and ≥50 years were similar (Appendix 2).

Table 5 Matched adjusted odds ratios (aOR) of death with COVID-19 among study patients (N = 1029) with specific underlying medical conditions, stratified by race/ethnicity—New York City, March 13–April 9, 2020

Discussion

This case-control study assessed risk factors for death among approximately 1000 persons aged 21–64 years hospitalized with COVID-19 across 82 NY metropolitan area hospitals. We found having any of the underlying conditions examined was associated with an over 4-fold increase in odds of death.

Black and Hispanic populations in NYC are at higher risk of COVID-19 compared with other racial and ethnic groups [13, 18, 19, 24]. Additionally, COVID-19 mortality rates in NYC are highest among Black and Hispanic persons [20]. In our study, 67% of case-patients were Black or Hispanic. Despite increased diagnosis and mortality rates, risk factors associated with greater in-hospital mortality among Black and Hispanic patients with COVID-19 have not been reported [13, 18, 19, 24]. We could not fully assess the association between race/ethnicity or poverty and in-hospital mortality because matching by neighborhood likely contributed to case-patients and control-patients having similar demographic and neighborhood-level characteristics. Race/ethnicity was not independently associated with mortality among study patients, yet racial/ethnic disparities in deaths were observed in NYC. Notably, the prevalence of five conditions was highest among Black control-patients. A similar pattern has been observed among NYC residents aged ≥18 years; the prevalence of hypertension, obesity, diabetes, and asthma among Black and Latino persons is higher compared with White and Asian persons [40]. Increased prevalence of conditions demonstrated to be associated with death likely contributed to the greater COVID-related mortality burden among Black and Latino persons in NYC.

In the U.S., White persons have a lower prevalence of diabetes and chronic kidney disease and lower rates of associated mortality compared with Black and Hispanic persons [41,42,43]. Racial/ethnic disparities associated with underlying condition prevalence and severe health outcomes can be explained by disparities in access to resources and opportunities, including health care [44, 45], housing [46], food [47], employment [44, 45], and education [48]. Further studies are needed to assess the associations between racial/ethnic disparities in these resources, SARS-CoV-2 infection, and severe outcomes. Additionally, research is needed to examine the associations between severe outcomes and (1) mitigation and severity of underlying conditions and (2) time to care.

Our findings may help guide the public health and medical communities’ efforts to prevent COVID-19 using health communication campaigns encouraging vaccine uptake, social distancing, face coverings, and handwashing. While citywide COVID-19 cases, hospitalizations, and deaths decline, and vaccination rates rise, racial/ethnic disparities persist. As of June 2021, only 30% of Black and 39% of Hispanic NYC adult residents were fully vaccinated, compared with 47% of White and 67% of Asian NYC adult residents [49]. These observed racial/ethnic disparities, including those in underlying condition prevalence, underscore the importance of prioritizing and increasing COVID-19 prevention efforts in these communities. Such efforts may include provision of additional social services, increased health resources, and especially vaccination efforts for racial and ethnic minority groups, given the increased prevalence of underlying conditions that are risk factors for death with COVID-19 and the increased burden of overall mortality experienced by Black and Hispanic populations both during and after our study period [1].

Our study has limitations. First, the multiple EHR systems did not consistently capture information. Although known to be associated both with certain underlying conditions [50, 51] and death [52, 53], substance use and occupation could not be analyzed given incomplete data, which is a known limitation to EHR data [54]. Additionally, BMI was missing from 22% of study patients, potentially limiting our ability to distinguish risk among BMI categories. Second, clinicians may have had fewer opportunities to ascertain complete medical histories for case-patients before their deaths. By combining patients who did not have certain conditions with those for whom information was not captured, our observed effect sizes are likely more conservative than the true associations. Third, while control-patients were discharged and not known to have died as of July 1, 2020, some might have been discharged to locations providing a higher level of care than they received prior to hospital admission for COVID-19 and/or might have experienced persistent or new symptoms related to COVID-19; such patients would be less likely to represent healthy controls, and observed associations between underlying conditions and death might be more conservative than true estimates. Fourth, stratified analyses by race/ethnicity were underpowered, and lack of observed significant associations should not be interpreted as no association. Fifth, we did not account for severity or control of underlying conditions, nor did we collect dosing information about inpatient treatments, so we cannot assess differences in association and death by these factors. Finally, generalizability of our results might be limited; our study was restricted to one city and hospitalized patients, and conducted early in the pandemic when the DOHMH had temporary surge staffing resources and remote EHR access, and when clinical management and treatment guidance was rapidly evolving. Despite resource constraints, which precluded us from extending the study period, findings nonetheless help explain why COVID-related health inequities persist within NYC.

Following the study period, COVID-19 mortality rates declined in NYC [20]. COVID-19 treatment guidelines [55] have evolved since the study period, which likely contributed to improved clinical decision-making and inpatient management. These circumstances reinforced our decision to only describe, and not conduct statistical tests on, the clinical characteristics of our study patients. Additionally, public health messaging in NYC shifted since the beginning of the pandemic, encouraging patients with severe symptoms to promptly seek care [56]. Obtaining comprehensive medical histories for hospitalized patients aged 21–64 years with COVID-19 is critical and may inform clinical decision-making since underlying conditions, particularly combinations of conditions, are risk factors for in-hospital mortality. Finally, observed racial/ethnic disparities in underlying condition prevalence, COVID-19 diagnosis, hospitalization, and mortality suggest the importance of increasing COVID-19 prevention efforts, especially among racial/ethnic minorities.