Introduction

The novel coronavirus, SARS-CoV-2 (COVID-19), pandemic has devastated countries worldwide. In the United States (US), as of late June 2021, approximately 48,000,000 cases had been reported, with more than 750,000 cumulative deaths [1]. As data became available and the mechanisms and factors governing community spread were elucidated, it became known that rates of COVID-19 infection were not distributed homogeneously among populations. Similar to obstetrical outcomes, risk factors for COVID-19 accumulate in discrete locations, often due to structural determinants of health, even within geographic confines. Neighborhood-level COVID-19 infection rates [New York City], provided by the New York City Department of Health and Mental Hygiene, unsurprisingly demonstrated higher positivity rates in lower-income areas—an unfortunate trend that persists [2]. Fueled by the presence of structural racism, Black and other minority communities, which have experienced disinvestment for decades, also, are disproportionately impacted by COVID-19 [1, 3,4,5,6].

Racial and ethnic disparities in infant and peripartum mortality in the US clearly pre-date the COVID-19 pandemic [7,8,9,10]. Individuals who identify as Black carry the burden of worse adverse obstetric outcomes, and this disproportion extends across the spectrum of morbidity to include preterm birth (PTB). Non-Hispanic Black individuals experience a preterm birth rate at least 50% higher than non-Hispanic white individuals [11] and rates are comparatively higher in homogeneous geographic communities [12]. Societal factors such as structural racism, influenced, for example, by the legacy of racial discrimination in housing (Home Owner’s Loan Corporation redlining), correlate with modern health inequities, preterm birth being foremost [13, 14]. With propagated structural inequities, PTB continues to remain high in certain ZIP codes.

Assessment of the geographic variation in regional PTB outcomes may provide us with the perspective to better understand communities that are disproportionally at higher risk for COVID-19 infection. Our study, thus, sought to investigate the geographical correlation between pre-COVID-19 regional PTB disparities and subsequent COVID-19 disease burden.

Materials and Methods

This study is a retrospective, ecological cohort study using the Finger Lakes Region Perinatal and Obstetric Data System. The Finger Lakes Region database encompasses a 9-county region and contains birth certificate and supplemental question data that has been collected and processed. Live births from 2004 to 2018 were identified in the city of Rochester, NY, and the eight counties to the south and east of the city, representing a total of 903 ZIP codes. ZIP codes with greater than 100 deliveries during the study period (which, given the noted time frame, excludes ZIP codes with less than 10 births per year) were stratified into cohorts. Publicly available data as described by the US Census Bureau—based on the 2017 American Community Survey [15]—were then merged into the ZIP-stratified dataset to characterize community socioeconomic data including (1) percentage of community members living below the US poverty level ($26,200 for a family of four [16]), (2) median household income for community members, and (3) percentage of community members attaining at least a high school diploma. This fusion facilitated calculation of the frequencies of patient-level obstetric outcomes in each ZIP cohort with retained association between patient-level and community-level socioeconomic information.

We chose to focus our study of COVID-19 within Monroe County, which contains the Rochester city center. The Rochester metropolitan statistical area (MSA) had a population of 1,069,644 as of the 2019 census, the third largest MSA in New York state [15]. Median household income averaged at $62,103, with 41% of household incomes less than $50,000. Approximately 12.7% of individuals have incomes below the poverty line [15]. There are two major healthcare systems encompassing 11 birthing hospitals across Monroe and adjacent counties.

The Monroe County COVID-19 dashboard [16] was utilized to associate encompassed ZIP codes with the cumulative rate of coronavirus diagnosis following the first wave of community spread (March–September 2020). The county-wide distribution of rates was analyzed to generate the median absolute deviation (MAD) of the local infection rates. Distributions of disease concentration in relationship to these local rates were then defined and assigned to “low-” (< 1 MAD), “moderate-” (within 1 MAD), and “high-” (> 1 MAD) prevalence categories of COVID-19 disease burden, normalized to regional rates of infection, rather than infection rates nationally or internationally. ZIP code cohorts were included in the statistical analysis only if they met the obstetric inclusion criteria of greater than 100 births in the study period and had a COVID-19 disease rate available in the dashboard.

The historical ZIP code level cohort of births, categorized by low, medium, and high coronavirus prevalence from March through September 2020 was analyzed for the primary outcome of PTB. Outcomes were analyzed using ANOVA, and odds ratios were calculated to assess the magnitude of disparity. Preterm birth was defined as gestational age of 36 or fewer completed weeks of pregnancy and was further divided into categories of PTB < 28 weeks gestation (i.e., births between 23w0d and 27w6d) and periviable births (i.e., births between 22w0d and 24w6d); groupings were defined by the Statewide Perinatal Data System and were not mutually exclusive. The obstetric data set was collapsed into ZIP code cohorts and was analyzed utilizing Stata 16.0 (StataCorp LLC, College Station, Texas). Graphic representations of the distributions of pandemic and obstetric data were generated in ArcGISPro 2.5.0 (Esri, Redlands, California).

Results

From 2004 until 2018, the birth data system contained detailed information for 210,984 live births in 903 regional ZIP codes. A total of 199,088 pregnancies were eligible for inclusion as being associated with a ZIP code containing greater than 100 deliveries in the study period; this resulted in 120 ZIP code cohorts. Of these births, there were 123,909 within the 38 ZIP codes with publicly available COVID-19 community resource data for Rochester, NY (Monroe County has 42 ZIP codes; 38 ZIP codes were included with both available COVID-19 data and greater than 100 deliveries within the study period, Fig. 1).

Fig. 1
figure 1

Identifying births within COVID-19 cohorts. From 2004 to 2018, the birth data system contained detailed information for 210,984 live births. A total of 199,088 pregnancies were eligible for inclusion as being associated with a ZIP code containing greater than 100 deliveries in the study period; this resulted in 120 ZIP code cohorts. Of these births, there were 123,909 within the 38 ZIP codes with publicly available COVID-19 community resource data for Rochester, New York

The rate of preterm deliveries across all study cohorts was 9.5%. The median COVID-19 infection rate among all Rochester ZIP codes was 616.5 per 100,000 as of September 30, 2020. Preterm birth and COVID-19 were positively correlated, with high-prevalence COVID-19 ZIP codes also being the areas with the highest prevalence of PTB (F = 11.06, P = 0.0002)—8% of all live births in the low-prevalence COVID-19 ZIP codes, 9% of all live births in the moderate-prevalence COVID-19 ZIP codes, and 11% of all live births in the high-prevalence COVID-19 ZIP codes were preterm.

Notably, significance was also demonstrated for PTB < 28 weeks (F = 15.87, P < 0.0001) and periviable birth (F = 16.28, P < 0.0001). Odds of PTB < 28 weeks were significantly higher in the “high-prevalence” COVID-19 cohort compared to the “low-prevalence” COVID 19 cohort (OR 3.27 (95% CI 2.42–4.42)). Percentages of all PTB, PTB < 28 weeks, and periviable birth across “low-,” “moderate-,” and “high-” prevalence COVID-19 cohorts are presented in Fig. 2.

Fig. 2
figure 2

Percentage of A all preterm birth (PTB), B PTB < 28 weeks, and C periviable PTB across low-, moderate-, and high-prevalence COVID-19 ZIP code cohorts. The percentage of PTB across all study cohorts is 9.5%

With respect to socioeconomic characteristics—specifically poverty level, median household income, and educational attainment—similar findings were observed in relationship to COVID-19 infection rates. COVID-19 prevalence was positively associated with number of individuals below poverty level and was inversely associated with median household income and educational attainment (Fig. 3).

Fig. 3
figure 3

Social determinants of health including A percent below the poverty line, B median household income in dollars, and C percent educational attainment of at least high school graduation across low-, moderate-, and high-prevalence COVID-19 ZIP code cohorts

Notably, geographic information system (GIS) mapping demonstrated ZIP code clustering in the Rochester city center, with highest rates of PTB < 28 weeks overlapping with high COVID-19 disease burden (Fig. 4).

Fig. 4
figure 4

Geographic information system mapping demonstrates ZIP code clustering. ZIP codes with high pre-COVID-19 rates of preterm birth < 28 weeks (A) and overlap ZIP codes with high COVID-19 disease burden (B), with ZIP code clustering at the urban center (rectangle)

Structured Discussion/Comment

Our data illustrate that community disparities in social determinants of health, as exemplified by preterm birth outcomes, map to geographic areas of COVID-19 disease burden in our region. Historical geographic prevalence of PTB was positively associated with 2020 cohort prevalence of COVID-19, reaching significance not only for all PTB but also for PTB < 28 weeks and periviable births. GIS mapping demonstrated this variance and showed ZIP code clustering in the urban center with highest rates of PTB < 28 weeks overlapping with high COVID-19 disease burden.

This analysis exposes the high rates of both PTB and COVID-19 infection where low-income neighborhoods are concentrated. Assessed within the last 6–7 years, over 50% of Rochester dwellers live in neighborhoods of extreme poverty [17], with individuals identifying as Black and Latinx individuals experiencing poverty at a rate more than three times that of individuals identifying as white [18]. This condensed poverty may be secondary to Rochester’s heritage as a factory town in the mid-twentieth century, which was home to close communities of immigrant blue collar workers. While white immigrants were overall more successful in securing steady employment and income, Black individuals were discriminated against both in terms of employment and housing through redlining and other forms of systemic racism. As higher earning families migrated from the urban center to the suburbs, the urban-suburban schism expanded, and associated racial and economic fissures became more profound [18]. This system of socioeconomic segregation has propagated into the twenty-first century and, likely, has contributed significantly to the specific course of community spread of COVID-19.

Available literature further supports a positive correlation between neighborhood income and physical distancing [19]—those individuals in lower-income neighborhoods were more likely required to work outside the home. State orders for cessation of non-essential work were only associated with small increases in staying home in low-income neighborhoods. COVID-19 infection rates were also found to be higher in neighborhoods that were densely populated, and predominantly Black [6, 20, 21]. Based on his New York City analysis, Whittle et al. suggested that an increase of 10,000 people per km2 was associated with a 2.4% (95% CI 0.6 to 4.2%, P = 0.011) increase in COVID-19 positivity rate [6]. Moreover, in dose–response fashion, a decrease of 10% in the white population was associated with a 1.8% (95% CI 0.8 to 2.8%, P < 0.001) increase in positivity rate, while an increase of 10% in the Black population was associated with a 1.1% (95% CI 0.3 to 1.8%, P < 0.001) increase in positivity rate [6]. These trends may suggest an explanation of why urban clustering may have occurred in the Rochester city center.

Alternatively, outliers to the above discussed trends do exist. Hamlin, for example, is a small, rural, town in the northwest of Monroe County with a widespread ZIP code area and approximately 5000 residents [22]. Rural residence has been consistently linked with elevated preterm birth rates [23], an association that remains significant even after controlling for behavioral risk factors and obstetric characteristics [24, 25]. These considerations are likely applicable to this community and correspond to the increased rates of PTB < 28 weeks observed (Fig. 4). However, a less densely populated ZIP code would, in turn, allow for social distancing in a way that likely prevented widespread COVID-19 infection.

Our analysis of socioeconomic factors demonstrated increased COVID-19 disease burden among those with lower community income levels, higher poverty levels, and lower educational attainment. The persistent association of PTB with increasing COVID-19 burden suggests the disproportionate targeting of a community demographic that is shaped by these, and possibly other, social determinants. As a global indicator of parental and neonatal health, PTB can be utilized as a vital statistic that indicates how well our communities are being served with respect to access to healthcare, among other influences. In suggesting that pre-pandemic regional PTB rates predict COVID-19 distribution, we demonstrate that community disparities unacceptably exist. Ongoing studies seek to explore the geographic correlation of PTB and COVID-19 infection rates over a wide range of communities (i.e., urban, suburban, rural) and assess the influence of other medical and obstetric comorbidities on this correlation. Collectively, these data should inspire socioeconomic policies supporting economic vibrancy to promote optimal health outcomes across all communities.

Our study is unique in that it assesses the geo-spatial association between pre-pandemic PTB rates and COVID-19 infection rates in a mid-sized US city. We included 38 of Monroe County’s 42 ZIP codes, with both available COVID-19 data and greater than 100 deliveries in the study period; inclusion of such a large number of ZIP codes provided for a robust assessment of the city’s geographic disparities. Furthermore, we used publicly available data from the US Census Bureau merged into the ZIP-stratified dataset to characterize community socioeconomic data. This fusion facilitated calculation of the frequencies of patient-level obstetric outcomes in each ZIP cohort with retained association between patient-level and community-level socioeconomic information. In this way, we sought to reduce the influence of ecological fallacy in our study.

Lastly, in choosing to associate ZIP codes with cumulative rates of coronavirus diagnoses (as opposed to current infection rates), we were able to capture a unique snapshot of community disease burden at the close of the first wave of the novel coronavirus pandemic. Analysis of this time frame highlighted in stark contrast the social disparities that emerged during the pandemic’s early months. As these disparities have not been yet fully addressed, our analysis is robust to a changing temporal scale.

Despite application of a novel geo-spatial lens to improve understanding COVID-19 spread, there are limitations to our study. Foremost, our analysis only addressed one mid-sized city. While available literature suggests that these results may be generalizable to other cities with urban centers [6, 19, 20], we are unable to definitively elucidate determinants that may be common across urban, suburban, and rural populations. Additionally, we chose to focus on a particular portion of potential socioeconomic factors that may influence the pandemic and cannot comment on how other parental or obstetric comorbidities may additionally bear on geographic variation of COVID-19 disease burden. Lastly, to clarify, we associated historical preterm birth rates with current COVID-19 disease, not the risk of PTB associated with SARS-CoV-2 infection in a current pregnancy.

In conclusion, merging of COVID-19 community ZIP code datasets into obstetric ZIP code datasets offers a unique opportunity to analyze contemporary, geographic health disparities. We demonstrate that geographic disparities in our regional preterm birth outcomes, ahead of the global pandemic, predicted community distribution of COVID-19 disease burden. Given the distinct impact of the disproportionate burden of both preterm labor and high COVID-19 infection rates, attempts to end these disparities must be multi-faceted and informed by integrated analysis of previously disparate data sets.