Urban density and COVID-19: understanding the US experience

Carozzi, Felipe; Provenzano, Sandro; Roth, Sefi

doi:10.1007/s00168-022-01193-z

Urban density and COVID-19: understanding the US experience

Original Paper
Open access
Published: 28 November 2022

Volume 72, pages 163–194, (2024)
Cite this article

Download PDF

You have full access to this open access article

The Annals of Regional Science Aims and scope Submit manuscript

Urban density and COVID-19: understanding the US experience

Download PDF

Felipe Carozzi¹,
Sandro Provenzano¹ &
Sefi Roth¹

3158 Accesses
14 Citations
1 Altmetric
Explore all metrics

Abstract

This paper revisits the debate around the link between population density and the severity of COVID-19 spread in the USA. We do so by conducting an empirical analysis based on graphical evidence, regression analysis and instrumental variable strategies borrowed from the agglomeration literature. Studying the period between the start of the epidemic and the beginning of the vaccination campaign at the end of 2020, we find that the cross-sectional relationship between density and COVID-19 deaths changed as the year evolved. Initially, denser counties experienced more COVID-19 deaths. Yet, by December, the relationship between COVID deaths and urban density was completely flat. This is consistent with evidence indicating density affected the timing of the outbreak—with denser locations more likely to have an early outbreak—yet had no influence on time-adjusted COVID-19 cases and deaths. Using data from Google, Facebook, the US Census and other sources, we investigate potential mechanisms behind these findings.

Counting COVID: Quantitative Geographical Approaches to COVID-19

Revisiting the Economic Effects of Density in the Wake of the COVID-19 Pandemic

Spatiotemporal prediction of COVID-19 cases using inter- and intra-county proxies of human interactions

Article Open access 08 November 2021

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Historically, cities have been associated with the propagation of infectious diseases.^{Footnote 1} It is therefore not surprising that the impact of density—the defining feature of cities—on the spread of COVID-19 was a frequent talking point from the very outset of the COVID-19 pandemic. As early as 22nd of March 2020, in the context of a critical outbreak in New York City, state governor Andrew Cuomo tweeted “There is a density level in NYC that is destructive. It has to stop and it has to stop now. NYC must develop an immediate plan to reduce density.”^{Footnote 2}

The notion that dense cities would be hotbeds of virus transmission prompted a flurry of academic research on the topic. Initial studies—especially those looking at the United States’ experience—suggested urban density fostered a faster spread of the disease.^{Footnote 3} Similar evidence was reported for other countries including India (Bhadra et al. 2021), Brazil (Pequeno et al. 2020) and Germany (Ehlert 2021). However, subsequent research exploring a longer time series yielded mixed findings (see for example McFarlane 2021; Kim et al. 2021; Florida et al. 2021). This prompted a more nuanced approach to the question, and subsequent work on the roles of crowding, experienced density and other more direct measures of social interactions.^{Footnote 4}

Now that massive vaccination campaigns have gradually reduced the threat of COVID-19 worldwide, can we draw any definitive conclusions about the mediating role of density in shaping the health impact of COVID-19 in cities? We turn to this question by looking at the evolution of the epidemic in the contiguous USA, in the period between the first registered cases in January 2020 and the beginning of the vaccination campaign in mid-December. By looking at the whole of 2020, we seek to understand how the results of initial studies indicating density was an important determinant of the impact of COVID-19 progressively led to more ambiguous findings as the pandemic evolved. Our empirical analysis combines descriptive evidence with an instrumental variable strategy borrowed from the agglomeration literature in economics. In doing so, our methodological approach avoids some of the pitfalls of conventional regression estimates and is close to methods that are familiar to both economists and economic geographers.

We find convincing evidence that density affected the timing of the outbreak in each county, with denser locations more likely to have an early outbreak. We show this leads to an initially positive and significant relationship between the impact of COVID-19 and population density at the county level, consistent with the results of early studies on the spread of the virus in the USA. However, after adjusting for the timing of the onset of the disease in each county, we find no evidence that population density is positively associated with the impact of COVID-19. Interestingly, we find a negative relationship between density and the spread of COVID-19 within a county at the very beginning of an outbreak, but this relationship fades completely within 2 months. We also show that, by the end of 2020, density could no longer explain the cross-sectional pattern of accumulated cases or deaths. Dense locations were hit first, but, as the pandemic evolved, they were not hit harder. These results help us frame other studies on this topic and understand how the findings in that literature changed as the pandemic developed.

The fact that—by the end of 2020—density had no effect on the local impact of COVID-19 appears counter-intuitive. The virus spreads via human contact and denser areas provide more opportunities for human interaction. Yet, this is not the only way in which density can affect the spread of disease. Several mediating factors can make the direction of this relationship theoretically ambiguous. We analyze social/behavioral factors that could explain our findings, bearing in mind that the spread of disease is a social as well as a biological phenomenon (Papageorge et al. 2020). To do so we use data from Google, Facebook, the US Census and The County Health Rankings and Roadmaps program. First, we show that density is positively associated with the reduction in work- and leisure-related activities throughout the pandemic, suggesting that compliance with social distancing measures was higher in denser locations. Second, we use our empirical strategy to illustrate the well-known fact that density is negatively associated with the share of Republican voters, which have been shown to be less engaged in social distancing and other efforts to reduce transmission (Allcott et al. 2020). Third, we show population density is positively associated with access to healthcare and income and negatively associated with inhabitants’ age. Collectively, these results yield suggestive evidence of mechanisms generating offsetting negative effects of density on the spread and severity of the COVID-19 outbreak, and help us rationalize the estimates of the overall effects reported in our main analysis.

Estimating how population density shaped the spread and severity of the COVID-19 outbreak, as well as its effects on local behavioral responses and demographics is challenging for several reasons. First, population densities are not randomly assigned and they might be correlated with unobserved confounding factors. For example, population densities can be affected by locational productive advantages, whether natural or man-made (e.g., soil quality or transportation infrastructure), that may also simultaneously affect local economic conditions. Insofar as the COVID-19 outbreak is affected by economic factors, unobservable locational advantages can confound the effect of density on the spread and severity of the disease. Second, differences in the timing of the onset of the disease can generate cross-sectional differences in the severity of the outbreak at one point in time in the absence of true differences in the local reproduction rate. Finally, data on COVID-19 cases might be reported with error due to variation in local testing strategy and capacity.

We overcome the empirical challenges mentioned above in several ways. We use two Instrumental Variable (IV) strategies borrowed from the agglomeration literature in economics to induce plausibly exogenous variation in population density without affecting COVID-19 cases and deaths directly. More specifically, in our geological IV approach, we use the presence of aquifers, earthquake risk, and soil drainage capacity to as instruments for density (as in Duranton and Turner 2018). In our historical IV strategy, we use the traditional long-lag instrument, which measures urban population density in the 1880 US Census (as in Ciccone and Hall 1996 and a large subsequent literature). We use these tools to study both how density affected the timing of the outbreak in each county and the time-adjusted number of deaths after that outbreak. We focus on the daily number of confirmed COVID-19 deaths rather than cases as our main outcome of interest since this is considered to be a more accurate indicator of local COVID-19 prevalence (Subbaraman 2020), and discuss COVID-reported cases as a robustness check. Finally, we cross-validate our COVID-19 figures with official data from the CDC to ensure reported deaths are consistent with other measures of COVID-19 mortality.

As discussed above, a number of papers have examined the link between density and COVID-19 incidence in the USA.^{Footnote 5} Alongside these studies, a vast number of papers in economics and economic geography have focused on other social determinants of differences in the spread of COVID-19 such as mobility Glaeser et al. (2020), Almagro et al. (2020), racial composition Benitez et al. (2020), Hamman (2021), social capital and institutions Ding et al. (2020), Rodríguez-Pose and Burlina (2021) as well as on the predicted long-run impact of the pandemic on cities (Florida et al. 2021), Nathan and Overman (2020). We contribute to this literature by looking specifically at density—arguably one of the first explanatory factors that attracted the attention of the field in early 2020—and its changing role throughout the US epidemic. Given that density is associated with many of the factors that were studied subsequently—mobility, race, urbanization—our findings also help interpret the results reported in the broader literature.

2 Data

Our dataset combines information on COVID-19 cases and deaths, population density, demographics, social connectedness, behavioral changes, voting behavior, healthcare provision, income and geological features at the US county level. We will use COVID data extending over the period between the 22nd of January, when the first US case was confirmed in King County, up until the 15th of December 2020, the day after the COVID vaccination campaign began in the USA. We restrict our sample to urban counties^{Footnote 6} in the contiguous USA which leaves us with 1759 counties comprising $\sim$ 93% of the total US population. When analyzing the pace of the outbreak, we further restrict the sample further to those counties that had at least one confirmed COVID-19- related death 60 days before the end of our sample period. This Outbreak Sub-sample consists of 1441 counties representing $\sim$ 89 % of the total US population (see Fig. 4). In the following, we describe the dataset and provide further information about the sources and URLs for download in “Appendix B” and descriptive statistics in Table 1.

Table 1 Descriptive statistics

Full size table

2.1 COVID-19 cases and deaths

We obtain a panel of daily confirmed COVID-19 fatalities and cases for US counties from usafacts.org.^{Footnote 7} The most intuitive indicator to monitor the COVID-19 outbreak is the daily number of confirmed cases. However, this figure is likely to be distorted by varying local testing strategy and capacity. Furthermore, the ability of the virus to spread across asymptomatic people makes the task of recording the number of infections in the community extremely difficult (Subbaraman 2020). Therefore, we mainly use the daily number of confirmed COVID-19 deaths as this is a more accurate indicator of the local COVID-19 prevalence.^{Footnote 8} In order to ensure that our COVID-19 data are reliable, we cross-validate our COVID-19 figures with official data from the Centers for Disease Control and Prevention (CDC). In the left panel of Fig. 5, we compare our total COVID-19 fatality counts by county to the latest figures on officially confirmed deaths due to COVID-19. In the right panel, we compare total fatalities to CDC excess death estimates. Both graphs exhibit strong linear relationships and support the validity of our COVID-19 data.^{Footnote 9} The evolution of daily COVID-19 fatality numbers used in this paper is illustrated in “Appendix Fig. 6.” In our analysis below, when we refer to deaths taking place in the first-wave, we refer to those taking place up to the 5th of July, which is the minimum in the moving average of deaths after April 2020.

2.2 Population density

Based on the US census for 2010, we compute two measures of population density. The first is simply the total population of a county over its total area. This will constitute the independent variable of interest throughout most of our analysis. The second variable takes the population density for all census-blocks within a county and computes the associated population-weighted mean. Population-weighted density is meant to measure average “experienced” density and was popularized in economics by Glaeser and Kahn (2004) and Rappaport (2008). It can be computed using spatially disaggregated data on the distribution of population and weighting each small unit of population density by its relative population in the county.

2.2.1 Instrumental variables

For our geological instrumental variable estimates, we use three different instruments. More specifically, we use variables measuring earthquake risks and presence of aquifers from the US Geological Survey (USGS) (also used in Duranton and Turner 2018), and data on soil drainage quality from NRCS State Soil Geographic Data Base. We match our grid cells to the geological data using grid cell centroids to spatially impute data on aquifers, earthquake risks and soil drainage quality. For our historical instrument, we use population density obtained from the 1880 US census. We impute these data on the county level using spatial matching based on the assumption of uniform population distribution within 1880 counties.^{Footnote 10}

Table 2 Cases and deaths in first COVID-19 wave in 2020: baseline OLS estimates

Full size table

Table 3 Onset of the disease and deaths after 60 days in 2020

Full size table

2.2.2 Behavioral adjustment/social distancing

To measure how much people in different counties adjusted their behavior as a response to the COVID-19 outbreak, we use the ‘COVID-19 Community Mobility Reports’ by Google (Google CMR). This database aggregates extensive anonymized mobile device GPS user data and estimates the percentage change in activities (such as work, retail or transit) by county and day. The five week period from January 3rd to February 6th before the start of the COVID-19 outbreak in the US serves as the corresponding baseline period.

2.2.3 Other variables

We obtain data on county-level demographic characteristic estimates for 2018 from the US census. Social connectedness is measured with Facebook’s Social Connectedness Index (Facebook SCI), which captures the intensity of the link between locations using the number of friend links in this social network (see Bailey et al. 2018 for further details on the SCI). Finally, data on access to healthcare and income comes from the County Health Rankings and Roadmaps program. Specifically, we use three indicators: (1) the ratio of population to primary care physicians (2) the percentage of adults under the age of 65 without health insurance and (3) median household income.

3 Empirical analysis

Our empirical analysis proceeds in two ways. We first provide a series of figures that illustrate the main results, both in terms of the relationship between density and COVID-19 deaths, the evolution of that relationship over time and the explanations behind this evolution. We then provide formal quantitative estimates for these relationships using our OLS and IV strategies. The fact that by-and-large the quantitative findings are the same regardless of the methods employed in the analysis gives us confidence on the robustness of our results to methodological decisions made in the research process.

3.1 Graphical evidence

The top-left panel of Fig. 1 illustrates the positive cross-sectional correlation between a county’s population density—calculated as the total population over the surface area—and the number of COVID-19- related deaths per capita by the end of the first wave on the 5th of July.^{Footnote 11} This is the basic fact that had been noticed in Wheaton (2020) and Dubner (2020) as early as April 2020. Similar graphs, again displaying positive relationships using population-weighted densities and number of cases, are reported in “Appendix Fig. 7.”

Naturally, these cross-sectional patterns do not constitute conclusive evidence that urban density results in faster or more deadly COVID-19 spread. There are at least two problems that could arise in this context. First, the positive correlation in the top left panel of Fig. 1 can be the result of differences in the timing of the onset of the disease across locations. Second, certain location characteristics which are correlated with both density and COVID-19 spread and severity could induce a correlation in the absence of any actual causal link. We discuss this second issue in detail in the next section.

Table 4 Density and time-adjusted deaths in 2020 for different post-onset windows

Full size table

Table 5 Suggested mechanisms: social connectedness and behavioral responses

Full size table

The top right panel of Fig. 1 illustrates the point on differences in the timing of the onset of the disease across locations by showing that the positive correlation between population density and COVID-19- related deaths observed in the first-wave becomes almost flat when we use data extending to the 15th of December 2020. We investigate the timing dimension further in the bottom left panel of Fig. 1 where we show the relationship between population density and the number of days between the 22nd of January and the first fatality in each county. The figure exhibits a clear negative relationship, indicating that dense locations experienced COVID-19 fatalities earlier than more sparsely populated locations.

We can adjust for the differences in the timing of the onset of the disease by computing the number of deaths after a fixed number of days from that onset. This is what is typically shown in cross-country comparisons of the early evolution of the pandemic. In our case, we can compute the number of COVID-19 deaths at a specified time after the outbreak started in a county. We define the start of the outbreak as the first day with 10 reported cases and compute the number of deaths 60 days after this date for all counties.^{Footnote 12} The link between this time-adjusted variable and density is illustrated in the bottom-right panel of Fig. 1. The relationship is almost flat after time-adjusting, suggesting that density does not simply translate into a higher rate of COVID-19 fatalities.

How is it possible that initial studies reported a clear positive influence of density on the impact of COVID-19, yet we report no relationship here? The answer is illustrated in Panel A of Fig. 2, where we report how the slope of the relationship between population density and accumulated deaths evolved over 2020. These are simply the coefficients of a univariate regressions of the logarithm of total accumulated deaths—up to the period in the horizontal axis—on the logarithm of a county’s population density.^{Footnote 13} Panel A of Fig. 2 shows a positive relationship between deaths and density appeared at the beginning of the US epidemic, with the positive relationship peaking by May 15th 2020. Yet in subsequent months the relationship progressively flattened, with the slopes of interest shrinking progressively until becoming statistically insignificant by November 15th. Thus, there was an apparently positive relationship at the beginning of the US epidemic, but this relationship became flat as the pandemic evolved.

Several factors could explain this result. We will turn to these in detail when we discuss mechanisms in Sect. 3.4, but consider as an illustration the role of changes in mobility across cities. Figure 3 shows the change in mobility relative to the January 2020 baseline for sparse and dense counties, with the split based on median county density.^{Footnote 14} The left panel corresponds to changes in workplace-related mobility, the middle panel corresponds to changes in mobility for leisure activities and the right panel for transit. As expected, we observe a sharp reduction in mobility starting around mid-March. Importantly, in all cases we observe that this reduction is more acute in denser counties. Glaeser et al. (2020) show reductions in mobility had a substantial effect on the spread of COVID-19 over our sample period. Therefore, a sharper reduction in mobility in denser cities could contain the spread of the disease in these locations.

3.2 Estimation

To obtain credible quantitative estimates of the relationship between time-adjusted COVID-19- related mortality and density, we also need to deal with potential confounders affecting both density and the prevalence and severity of the disease. Climate conditions, for example, can simultaneously influence household location decisions (see Glaeser et al. 2001) and COVID-19 spread.^{Footnote 15} Local amenities such as waterfronts or low precipitation levels can themselves influence travel patterns—e.g., by increasing tourist arrivals—which could in turn affect COVID-19 rates. Insofar as some of these elements are observable, we can include them as controls in our regressions. Yet, some confounders may be unobservable due to their inherent nature or lack of accurate data. For instance, locational productive advantages can simultaneously affect local economic conditions and increase local densities.^{Footnote 16} Examples range from natural factors such as fertile or irrigable lands to man-made infrastructures such as ports or highways. Insofar as COVID-19 incidence and deaths are affected by economic conditions, unobservable locational advantages can confound the effect of density on the spread and severity of the disease.

Table 6 Mechanisms: healthcare provision and demographics

Full size table

To overcome the problem posed by potential unobservable confounding factors, we borrow canonical instruments for density from the agglomeration literature Combes et al. (2011) and our previous work on the relationship between density and air pollution Carozzi and Roth (2020). Specifically, we will instrument population density with either geological factors which can affect the costs of compact urban development or a long-lags in population density.

We use three geological instruments: the fraction of the urban footprint with aquifer presence, a measure of average earthquake risks and an estimate of soil drainage quality. The rationale for the aquifer instrument is that new dwellings in the periphery of urban areas need to either to pay for a costly connection with the municipal network or to directly connect with an underwater source. Given that the option of the underwater source is only available if there is an aquifer where the dwelling is located, cities with more land over aquifers can sprawl out further, contain more sparse development and lower densities. This instrument is motivated by the work in Burchfield et al. (2006) which reports that aquifers in the urban fringe are associated with urban sprawl. The rationale for our earthquake risk instrument is the expectation that the risk of an earthquake might influence building regulations, construction practices and the space between buildings, thus also affecting urban density. We also expect this instrument to satisfy the exogeneity condition, once we condition for distance to sea, average precipitation, latitude, longitude, and state fixed effects. Finally, the soil drainage quality variable is expected to affect land suitability for building at different densities. In fully urbanized land, a significant fraction of rainfall is drained through drainage networks and sewage systems (Konrad 2003). However, at lower densities, soil drainage capacity is important to avoid stagnant water and, possibly, floods. In addition, high drainage soil is not ideal for laying down heavy infrastructure, making the task of building high density development more expensive.

We use a separate instrument for density based on historical population as recorded in the 1880 US census. Settlements in this period were in place before much of the technological revolutions in transportation that have affected location patterns in the last decades and also precede current patterns of industrial location. The use of historical population instruments for density was popularized by Ciccone and Hall (1996) and has been featured recurrently in the literature on agglomeration economies since (see Combes and Gobillon 2015 for a review).

Our main estimating equation will regress measures of COVID-19 presence on the logarithm of population density:

$$\begin{aligned} Y_{i}=\alpha _s +\beta Ln(Pop. Density)_{i}+\gamma 'X_{i}+\varepsilon _{i} \end{aligned}$$

(1)

where i indexes individual counties, $\alpha _s$ is a set of state effects and $X_{i}$ is a set of controls. In all specifications, we control for average maximum and minimum temperatures, average yearly precipitation, latitude, longitude, distance between the county centroid and the closest sea front and distance to the closest waterfront. Our outcomes include different measures of COVID-19 presence. In most of our analysis, these are either variables capturing the time it took for the disease to arrive at a county or a time-adjusted measure of COVID-19 presence - the logarithm of the number of COVID-19 fatalities in the county 60 days after the 10th case was confirmed.

Before presenting our results, it is important to highlight that our estimates of parameter $\beta$ from Eq. (1) will capture the overall effect of density on the outcome of interest. This includes the effect of geographic proximity facilitating transmission but also effects operating through the impact of density on agglomeration economies, personal behavior, local population compositions, healthcare systems, etc. After reporting estimates of the overall effect of density, we will turn to investigate the specific mediating factors behind it in Sect. 3.4 below.

3.3 Main results

We first report baseline cross-sectional correlations between population density and COVID-19 cases and deaths during the first-wave. In Table 2, we estimate Eq. (1) via Ordinary Least Squares (OLS) using the logarithm of the number of cases per 100,000 inhabitants and the logarithm of the number of deaths per 100,000 inhabitants as outcome variables. We find positive and statistically significant effects of population density on COVID-19 incidence, in line with the descriptive evidence reported in the top-left panel of Fig. 1. Specifically, when using the conventional measure of population density, we find elasticities of 22% and 13% for cases and deaths, respectively. This suggests that a 1% increase in population density increases cases and deaths per 100,000 people by 0.22% and 0.13%. When using our population-weighted measure of density, we also find very similar positive elasticities. The findings for COVID-19 cases are consistent with the evidence presented by Wheaton (2020) and Almagro and Orane-Hutchinson (2020). Yet this should not be taken as conclusive evidence that density has a causal effect on the spread of COVID-19. As argued above, potential differences in the timing of the onset of the disease across locations or the presence of potential unobservable confounders can induce substantial bias in these coefficients.

Estimates reported in Table 3 deal with these empirical issues by looking explicitly at differences in the onset of the COVID-19 epidemic across locations and incorporating our instrumental variable strategy. In panels A and B, we report estimates for the effect of density on the number of days to the first case and the number of days to the first death. These numbers are measured relative to the date of the first reported case in the USA, so that small numbers correspond to an earlier onset of an outbreak. In column 1, we report OLS estimates obtained after controlling for state effects and covariates. In columns 2 and 3, we show IV estimates obtained using our Geological and Historical instruments, respectively. Note that the first-stage F-stats lie at 25 or above and the instruments explain between 5% and 10% of the variance in population density, indicating that they are not weak. Our second-stage estimates confirm that denser areas have indeed experienced earlier onsets of the disease whether we use days to the first case or days to the first death. A one log-point increase in density reduces the time to the first case by between 4 and 6 days depending on the specification. The effect on the time to the first deaths is even larger. These estimates demonstrate the importance of adjusting for differences in the timing of the onsets across locations when estimating the relationship between population density and COVID-19 health outcomes.

In Panel C of Table 3, we examine our main outcome of interest; the effect of population density on time-adjusted COVID-19- related mortality. As mentioned previously, we focus on confirmed COVID-19- related deaths rather than cases as our main outcome of interest because it is considered to be a more accurate indicator of local COVID-19 prevalence. We provide a complementary analysis using reported cases in Sect. 3.5. In column 1, we find that the cross-sectional correlation observed in Table 2 becomes negative and statistically insignificant, suggesting that the positive link between population density and COVID-19 deaths might have been confounded by differences in the timing of the local outbreak. In columns 2 and 3, we use our instrumental variable approach to test this hypothesis more convincingly. Our second-stage results reveal a statistically insignificant relationship between population density and COVID-19- related deaths in both columns, portraying a similar picture as the OLS estimate presented in column 1. Our 2SLS results are unsurprisingly less precise, but the overall picture is clear. We find no evidence that population density is positively linked with COVID-19- related deaths.

We can use our IV strategy to reproduce the findings illustrated in Panel A of Fig. 2 showing the evolution of the cross-sectional relationship between COVID-19 deaths and population density over time. For this purpose, we estimate modified versions of Eq. (1) where the dependent variable is now the accumulated number of deaths up to the 15th day of each month in 2020 from March to December. Estimates of the different $\beta _t$ slope coefficients obtained using 2SLS are reported in Panel B of Fig. 2. In this case, we use both our geological and historical instruments as a source of exogenous variation. We observe that these results mimic those in Panel A, with an initially positive and significant relationship emerging by April 15th giving way to a progressive flatter relationship throughout 2020.

Finally, we test whether the relationship between density and time-adjusted COVID-19 deaths changes with the window used. To do this, we obtain estimates corresponding to 21, 30, and 45 day windows, all measured after the 10th case is reported in each county. The results are reported in Table 4 and show that the time-adjusted number of deaths is not positively affected by density, regardless of the window used. Interestingly, we find that at a beginning of an outbreak in a given county this relationship is in fact negative but becomes flat within two months.

On first reflection, the null (or negative) results for COVID-19 spread in this section appear surprising given that the virus spreads via human contact and denser areas can provide more opportunities for human interactions. Nevertheless, there are several mediating factors that might offset this intuitive mechanism. For example, density itself might attract younger residents who are less likely to develop significant symptoms. In addition, both behavioral and/or policy induced changes in behavior may be different in dense counties. In fact, studies on previous pandemics (e.g., the 1918 influenza pandemic) also show that population density is not necessarily linked with the spread and severity of a disease (Mills et al. 2004). In the next section, we explore potential mechanisms that can explain our reduced-form findings.

3.4 Mechanisms

Variation in density might lead to changes in several local conditions, which can themselves affect the spread and severity of the disease. These types of changes may provide mechanisms that reinforce or offset the hypothesized positive effects that have been suggested in the literature, both in terms of timing of the local onset of the pandemic and subsequent spread. We turn to study some of these mechanisms by estimating the effect of density on other determinants of COVID-19 spread and severity. To do so, we re-estimate Eq. (1) using these hypothetical mediators as outcomes. The resulting estimates do not provide definite proof regarding the mechanisms explaining the effect of density on COVID-19 incidence and mortality, but should be interpreted as suggestive evidence in this regard.

We begin by looking at possible factors explaining the early onset of the disease in denser cities and show that density is associated with higher social connectedness with other US counties. Our proxy for this variable relies on Facebook’s Social Connectedness Index (SCI).^{Footnote 17} This index is based on the relative frequency of friendship links between users of the social network, with higher index values corresponding to a larger number of friendship links. To proxy for social connectedness with other counties, we aggregate the SCI of each county with all other counties and normalize it by the own-county SCI. The resulting variable is large when inhabitants in a county are disproportionately connected to other counties. Coefficients resulting from estimating Eq. (1) using the logarithm of this proxy as an outcome variable are provided in Panel A of Table 5. As above, we report both OLS estimates (column 1) and 2SLS estimates using our geological and historical instruments (columns 2 and 3). We observe consistently positive elasticities of roughly 0.4-0.5 across columns, indicating denser counties are more intensely related to other counties in the USA.^{Footnote 18} These results provide a plausible explanation to our findings of early onsets of COVID-19 cases and deaths in denser counties illustrated in Fig. 1 and Table 3.

Next, we study how density affects behavioral responses to the pandemic (e.g., compliance with social distancing measures). We use data from the Google COVID-19 Community Mobility Reports (CMR) to measure how mobility patterns in each county have changed relative to baseline levels measured in January 2020. In Panels B and C of Table 5, we show the relationship between county density and the change in mobility to workplaces and retail activity, respectively. We find that population density is associated with a larger decline in mobility for both indicators. Doubling density reduces workplace-related mobility and retail-related activity by approximately 2.6–3.4% and 1.7–2.4%, respectively. Given the significant variation in density across US counties, these estimates are large. Insofar as social distancing reduces the spread of the disease, these differences in behavior might explain why we find limited differences in spread by location after accounting for the timing of onset of the disease and confounding factors.

Several factors could explain this difference in behavior across dense and sparse counties. One candidate that could account for both policy responses and individual differences in behavior relates to ideological or political views. Allcott et al. (2020) show that the Republican county vote share has a positive and significant association with the number of weekly visits to points of interest during the peak of the social distancing measures in April. Anecdotal evidence also reveals substantial differences in the tone of the Democratic and Republican parties when discussing the pandemic and its consequences. If density is associated with reduced support for the Republican party, residents of denser areas may be more likely to comply with the social distancing advise. In Panel D of Table 5, we estimate this link using voting data from the 2016 presidential election as a proxy for Republican support. We find that population density has a negative association with the share of Republican voters, an observation that should come as no surprise for observers of US politics.^{Footnote 19} This difference in political preferences across locations could explain, at least in part, the observed differences in the behavioral response to the pandemic illustrated in Fig. 3 and Table 5.

We can arrive at two conclusions from the results reported in Table 5. First, dense counties are more connected with other locations and this may account for earlier onset of the COVID-19 epidemic in these areas. Second, the behavioral response to the disease was larger in denser counties, with less mobility for work and leisure and reduced use of public transit in these locations.

Finally, in Table 6, we examine the effect of density on access to healthcare and demographics, as these are likely to affect COVID-19- related mortality. In Panels A and B, we examine the effect of density on access to healthcare using the ratio of population to primary care physicians and the percentage of adults under the age of 65 without health insurance as proxies. We find that density is positively associated with the former and negatively associated with the latter, suggesting that denser locations benefit from better access to healthcare. In our context, this could be an important mediating factor for two main reasons. First, access to primary healthcare might affect the presence and management of underlying health conditions which consider being risk factors for COVID-19 mortality (Zhou et al. 2020). Second, access might also affect the probability of seeking and receiving medical treatment once infected with COVID-19. Relatedly, we also examine the link between population density and income in Panel C as it is likely to affect access to healthcare and also health status more broadly. As expected, we find that the density is positively associated with median household income, offering an additional explanation for our headline results. Finally, in Panel D, we examine the effect of density on the share of the population above 60 years of age. This is of particular importance given that older age considered to be a significant risk factor (Zhou et al. 2020) and that population density is likely to affect the age structure of local areas via its impact on employment opportunities Glaeser (1999). Indeed, we find some evidence that population density is linked with a smaller share of residents above 60 years of age. In other words, dense counties are “younger” than sparse counties and this could reduce the number of deaths in these areas.

Overall, our points relating to behavioral responses, healthcare provision and demographics provide probable explanations for the surprisingly flat relationship between density and COVID-19- related mortality reported in panel C of Table 3.

3.5 Robustness checks

In this section, we provide several tests to evaluate the robustness of our main findings. We first revisit our results for the time-adjusted COVID-19 deaths by controlling for time of onset. In Panel A of “Appendix Table 7,” we test whether the null effect of density is robust to flexibly controlling by week of onset in each state. This goes beyond simply time-adjusting the outcome variable of interest as it also incorporates differences in knowledge regarding the disease or country-wide behavioral adjustments. We find that our qualitative results remain unchanged, with coefficients being insignificantly different from 0 across specifications. In panel B, we test whether our results are affected by excluding the New York metropolitan area.^{Footnote 20} In this case, we find a negative and statistically significant relationship between density and time-adjusted COVID-19 deaths in our OLS estimate but statistically insignificant effects when we use our IV methodologies. We interpret these results with caution, as we are imposing sample selection that simultaneously exclude the MSA with the largest initial outbreak and the highest density.

Much of the evidence featured in the discussion around the role of urban density in shaping the impact of COVID-19 has focused on the conventional, area-weighted definition of density (i.e., population divided by surface). In order to speak to that debate, this has been the object of our main analysis. But we can evaluate the robustness of our results to the definition of density by studying the effect of population-weighted densities. In “Appendix Table 8,” we reproduce our main results using this variable as our main independent variable of interest. Unfortunately, since our geological instruments do not provide a strong first stage for this variable, our IV analysis relies solely on our long-lag instrument. Reassuringly, we find that the overall results are qualitatively similar to those obtained in Table 3. Panels A and B show denser counties had earlier onsets of the disease compared to sparse counties. In panel C, we find a negative association between weighted density and COVID-19- related deaths when using OLS. However, our IV estimates again show a statistically insignificant elasticity. We therefore conclude that variation in density did not result in more COVID-19 incidence and deaths in the USA beyond the effect on early onset of the disease despite prior descriptive evidence. We also check the robustness of our results regarding suggested mechanisms using population-weighted density as our main regressor of interest in “Appendix Table 9.” Reassuringly, we find that the overall results are qualitatively analogous to those reported in Table 3.

Finally, we test whether density affects the time-adjusted number of reported cases of COVID-19. As argued above, the number of cases is more likely to be affected by variation in testing resources and by the presence of asymptomatic cases. This motivates our focus on number of deaths in much of the main analysis. In Table 10, we report estimates of the relationship between density and the number of cases per 100,000 inhabitants measured 21, 30, 45 and 60 days after the 10th reported case in the county. IV estimates for the effect of density on time-adjusted cases are similar to estimates reported in Table 4. We conclude that the data do not yield any evidence indicating a positive effect of density on the spread of the disease.

4 Conclusions

Urban areas are often places of intense social interaction, crowded living and close contact. Whether Justinian’s Constantinople, fourteenth century Florence or 1918 Philadelphia - cities have historically been associated with the propagation of infectious disease. In the first three months of the global COVID-19 pandemic, large, dense urban areas around the world such as New York, Madrid and London were identified as disease hotspots. Increased awareness of the risks of present and future epidemics has understandably prompted a debate about the future of cities. Did density—the defining feature of cities—promote the spread of the disease?

Our analysis of the onset of the COVID-19 pandemic in the USA raises a series of important points regarding these questions. First, density is associated with an early arrival of COVID-19, so that urban cores and superstar cities get a head start on the spread of the disease. Second, the subsequent spread—once COVID-19 has arrived—is not faster or deadlier than in smaller towns or sparsely populated peripheries. Cities get hit first, but do not get hit harder. We argue this is one of the reasons why many of the early studies of the impact of density on the impact of COVID-19 reported positive findings. A wider look at the whole period before vaccination began yields a different overall view of this relationship.

Several mechanisms may explain these findings. Large cities are intensely inter-connected with other locations, which can explain early onset. In the case of within-city spread, different offsetting forces may be at play. Crowding may promote the spread of the disease but differences in precautionary measures, access to healthcare and demographics may contain it. As a result, our findings emphasize the importance of distinguishing between differences in spread between and within locations.

Our study contributes to the understanding of how a summary feature of urban structure—population density—shapes spread of disease and deaths. The way in which other elements or urban form, cities’ transport infrastructure or housing conditions (e.g., overcrowding) shaped the impact of the COVID-19 pandemic is not addressed here and remains an active area of research (see e.g., Kamis et al. 2021; Borsati et al. 2022 and Brotherhood et al. 2022).

Notes

See Duranton and Puga (2020), Voigtländer and Voth (2013) for treatments of this relationship in economics.
The attribution of detrimental effects of density for the evolution of the epidemic was not specific to the USA. On 9 of December 2020, Michael Gove (Chancellor of the Duchy of Lancaster and Minister for the UK Cabinet Office) said on ITV’s Good Morning Britain that population density was one of the reasons why the UK had more COVID-19- related deaths in comparison to Germany.
See for example Angel et al. (2020), Whittle and Diaz-Artiles (2020), Zhang and Schwartz (2020), Wheaton (2020) and Almagro and Orane-Hutchinson (2020). For a review of the empirical literature on the topic—covering papers in urban planning, economics and medical sciences—see Teller (2021).
For example, there is evidence that a higher percentage of overcrowded households and poor housing conditions in US counties have both lead to higher mortality from COVID-19 (Ahmad et al. 2020; Krieger et al. 2020; Kamis et al. 2021).
The literature on the relationship between the 1918 Influenza pandemic (the Spanish Flu) and population density is naturally more developed and can shed light on the link between pandemics and density more broadly. Interestingly, while it may seem intuitive that the influenza pandemic was positively associated with population density as the virus spread via human contact, a review of the literature produce mixed results. For example, Garrett (2007) finds a positive relationship between mortality rates and population density in the USA. In contrast, Mills et al. (2004) find no statistical association between population density and the initial reproductive number (R) using data on 45 US cities. Chowell et al. (2008) also find no association between transmissibility, death rates and indicators of population density in England and Wales. Ferguson et al. (2006) studies the development of the 1918 pandemic and finds evidence for an early onset in dense urban cores before a more smooth development of the disease across space.
Urban counties are those that are classified as either ‘metropolitan’ or ‘micropolitan’ core-based statistical areas in the 2010 census.
These are obtained from county-level reports by local health authorities across the USA. See “Appendix B” for further details.
Recent work led by Diego Puga looks at the relationship between density and COVID-19 incidence in Spain using prevalence data obtained from randomized serological tests. Cross-sectional correlations using this information point to a flat (or weakly negative) relationship between the disease’s spread and density.
In contrast, the correlation between county-level COVID-19 fatalities and USAFacts is −0.001 and insignificant indicating that COVID-19 mortality is not simply an amplification of fatalities occurring under normal circumstances but rather follows distinct patterns that are consistently capture by our database.
Note that, while the assumption of uniform distribution is clearly a simplification which could lead to measurement error, this should not have a substantial impact on our main estimates. This is because measurement error in the instruments could affect the relevance of the instruments but should not generate bias in the coefficients of interest unless the measurement error itself is correlated with COVID-19 incidence.
We define the first wave as the period between the onset of the disease in the USA in February 2020 and the minimal daily death rate before the second rise in COVID-19 fatalities. See “Appendix Fig. 6.”
The choice of 10 cases as marking the start of an outbreak from which we take the 60-day window is taken so as to ensure that there is some degree of within-county transmission at the time the window starts. We study how results change using different post-onset time windows in Sect. 3.2.
Specifically, we estimate $Ln(\text {Acc. Deaths}^t_{i}+1)=\alpha _0+\alpha _t Ln(Pop. Dens_{i})+\varepsilon _i$, where i is an index for counties and t indicates the end period, so that $\text {Acc. Deaths}^t_{i}$ corresponds to accumulated deaths in county i from the start of the pandemic up to date t (e.g., the 15th of April).
The data are based on COVID-19 Community Mobility Reports released by Google and is based on data from portable device users in US counties.
A number of recent papers document a negative effect of temperature on COVID-19 incidence, at least in temperate weathers. See for example Prata et al. (2020), Tobías and Molina (2020).
Locational advantages increase local densities because higher land prices in these areas trigger a substitution of land for capital in the production of structures (i.e. an increase in building heights).
Kuchler et al. (2020) study how interpersonal networks provided a channel for the spread of the disease based on the SCI.
Dense counties are also candidates to have higher connectedness with locations outside of the USA.
This relationship remains highly robust upon controlling for the share of black population as well as the population above 60 years of age. In fact, when adding these additional controls, the relationship remains between -0.04 and -0.05 and significant at the 99% confidence level for all three estimation approaches.
We use the census 2010 definition corresponding to the New York-Northern New Jersey-Long Island CBSA.

References

Ahmad K, Erqou S, Shah N, Nazir U, Morrison AR, Choudhary G, Wen-Chih W (2020) Association of poor housing conditions with COVID-19 incidence and mortality across US counties. PLoS ONE 15(11):e0241327
Article Google Scholar
Allcott H, Boxell L, Conway J, Gentzkow M, Thaler M, Yang DY (2020) “Polarization and public health: Partisan differences in social distancing during the Coronavirus pandemic.” NBER Working Paper, (w26946)
Almagro M, Orane-Hutchinson A (2020) The determinants of the differential exposure to COVID-19 in New York City and their evolution over time. Vetted and Real-Time Papers, Covid Economics, p 13
Google Scholar
Almagro M, Coven J, Gupta A, Orane-Hutchinson A (2020) “Racial disparities in frontline workers and housing crowding during COVID-19: Evidence from geolocation data.” Available at SSRN 3695249
Angel S, Lamson-Hall P, Tamayo MMS et al (2020) Coronavirus and the cities: explaining variations in the onset of infection and in the number of reported cases and deaths in US metropolitan areas as of 27 March 2020. New York University, Marron Institute of Urban Management
Bailey M, Cao R, Kuchler T, Stroebel J, Wong A (2018) Social connectedness: measurement, determinants, and effects. J Econ Perspect 32(3):259–80
Article Google Scholar
Benitez J, Courtemanche C, Yelowitz A (2020) Racial and ethnic disparities in COVID-19: evidence from six large cities. J Econ Race Policy 3(4):243–261
Article Google Scholar
Bhadra A, Mukherjee A, Sarkar K (2021) Impact of population density on Covid-19 infected and mortality rate in India. Model Earth Syst Environ 7(1):623–629
Article Google Scholar
Borsati M, Nocera S, Percoco M (2022) Questioning the spatial association between the initial spread of COVID-19 and transit usage in Italy. Res Transp Econ 95:101194
Article Google Scholar
Brotherhood L, Cavalcanti T, Da Mata D, Santos C (2022) Slums and pandemics. J Dev Econ 157:102882
Article Google Scholar
Burchfield M, Overman HG, Puga D, Turner MA (2006) Causes of sprawl: a portrait from space. Quart J Econ 121(2):587–633
Article Google Scholar
Carozzi F, Roth S (2020) “Dirty density: air quality and the density of American cities.” IZA Discussion Paper
Chowell G, Bettencourt LMA, Johnson N, Alonso WJ, Viboud C (2008) The 1918–1919 influenza pandemic in England and Wales: spatial patterns in transmissibility and mortality impact. Proc R Soc B Biol Sci 275(1634):501–509
Article Google Scholar
Ciccone A, Hall RE (1996) Productivity and the density of economic activity. Am Econ Rev 86:54–70
Google Scholar
Combes P-P, Gobillon L (2015) The empirics of agglomeration economies. Handbook of regional and urban economics, vol 5. Elsevier, Amsterdam, pp 247–348
Google Scholar
Combes P-P, Duranton G, Gobillon L (2011) The identification of agglomeration economies. J Econ Geogr 11(2):253–266
Article Google Scholar
Ding W, Levine R, Lin C, Xie W (2020) Social distancing and social capital: why US counties respond differently to COVID-19. National Bureau of Economic Research
Dubner SJ (2020) “What Does Covid-19 Mean for Cities (and Marriages)?” Freakonomics Podcast Ep. 401
Duranton G, Puga D (2020) The Economics of Urban Density. J Econ Perspect 34(3):3–26
Article Google Scholar
Duranton G, Turner MA (2018) Urban form and driving: evidence from US cities. J Urban Econ 108:170–191
Article Google Scholar
Ehlert A (2021) The socio-economic determinants of COVID-19: a spatial analysis of German county level data. Socio-Econ Plan Sci 78:101083. https://doi.org/10.1016/j.seps.2021.101083
Article Google Scholar
Ferguson NM, Cummings DAT, Fraser C, Cajka JC, Cooley PC, Burke DS (2006) Strategies for mitigating an influenza pandemic. Nature 442(7101):448–452
Article Google Scholar
Florida R, Rodríguez-Pose A, Storper M (2021) Cities in a post-covid world. Urban Studies https://doi.org/10.1177/00420980211018072
Article Google Scholar
Garrett TA (2007) Economic effects of the 1918 influenza pandemic. Federal Reserve Bank of St, Louis
Google Scholar
Glaeser EL (1999) Learning in cities. J Urban Econ 46(2):254–277
Article Google Scholar
Glaeser EL, Kahn ME (2004) Sprawl and urban growth. In: Handbook of regional and urban economics. Vol 4, pp 2481–2527. Elsevier, Amsterdam
Glaeser EL, Gorback C, Redding SJ (2020) JUE insight: how much does COVID-19 increase with mobility? Evidence from New York and four other US cities. J Urban Econ 127:103292.https://doi.org/10.1016/j.jue.2020.103292
Article Google Scholar
Glaeser EL, Kolko J, Saiz A (2001) Consumer city. J Econ Geogr 1(1):27–50
Article Google Scholar
Hamman MK (2021) Disparities in COVID-19 mortality by county racial composition and the role of spring social distancing measures. Econ Hum Biol 41:100953
Article Google Scholar
Kamis C, Stolte A, West JS, Fishman SH, Brown T, Brown T, Farmer HR (2021) Overcrowding and COVID-19 mortality across US counties: are disparities growing over time? SSM-population Health 15:100845
Article Google Scholar
Kim H, Zanobetti A, Bell ML (2021) Temporal transition of racial/ethnic disparities in COVID-19 outcomes in 3108 counties of the United States: three phases from january to december 2020. Sci Total Environ 791:148167. https://doi.org/10.1016/j.scitotenv.2021.148167
Article Google Scholar
Konrad CP (2003) Effects of urban development on floods.
Krieger N, Waterman PD, Chen JT (2020) COVID-19 and overall mortality inequities in the surge in death rates by zip code characteristics: Massachusetts, January 1 to May 19, 2020. Am J Public Health 110(12):1850–1852
Article Google Scholar
Kuchler T, Russel D, Stroebel J (2020) The geographic spread of COVID-19 correlates with structure of social networks as measured by Facebook. National Bureau of Economic Research
McFarlane C (2021) Repopulating density: COVID-19 and the politics of urban value. Urban Studies. https://doi.org/10.1177/00420980211014810
Article Google Scholar
Mills CE, Robins JM, Lipsitch M (2004) Transmissibility of 1918 pandemic influenza. Nature 432(7019):904–906
Article Google Scholar
Nathan M, Overman H (2020) Will coronavirus cause a big city exodus? Environ Plan B Urban Anal City Sci 47(9):1537–1542
Article Google Scholar
Papageorge NW, Zahn MV, Belot M, van den Broek-Altenburg E, Choi S, Jamison JC, Tripodi E et al (2020) Socio-Demographic Factors Associated with Self-Protecting Behavior during the COVID-19 Pandemic. Institute of Labor Economics (IZA)
Pequeno P, Mendel B, Rosa C, Bosholn M, Souza JL, Baccaro F, Barbosa R, Magnusson W (2020) Air transportation, population density and temperature predict the spread of COVID-19 in Brazil. PeerJ 8:e9322
Article Google Scholar
Prata DN, Rodrigues W, Bermejo PH (2020) Temperature significantly changes COVID-19 transmission in (sub) tropical cities of Brazil. Sci Total Environ 729:138862. https://doi.org/10.1016/j.scitotenv.2020.138862
Article Google Scholar
Rappaport J (2008) A productivity model of city crowdedness. J Urban Econ 63(2):715–722
Article Google Scholar
Rodríguez-Pose Andrés, Burlina Chiara (2021) “Institutions and the uneven geography of the first wave of the COVID-19 pandemic.” Journal of Regional Science
Subbaraman N (2020) Why daily death tolls have become unusually important in understanding the coronavirus pandemic. Nature
Teller J (2021) Urban density and Covid-19: towards an adaptive approach. Build Cities 2(1):150–165
Article Google Scholar
Tobías A, Molina T (2020) Is temperature reducing the transmission of COVID-19? Environ Res 186:109553
Article Google Scholar
Voigtländer N, Voth H-J (2013) The three horsemen of riches: plague, war, and urbanization in early modern Europe. Rev Econ Stud 80(2):774–811
Article Google Scholar
Wheaton WC, Thompson AK (2020) The Geography of COVID-19 growth in the US: Counties and Metropolitan Areas. Available at SSRN 3570540
Whittle RS, Diaz-Artiles A (2020) An ecological study of socioeconomic predictors in detection of COVID-19 cases across neighborhoods in New York City. BMC Med 18(1):1–17
Article Google Scholar
Zhang CH, Schwartz GG (2020) Spatial disparities in coronavirus incidence and mortality in the United States: an ecological analysis as of May 2020. J Rural Health 36(3):433–445
Article Google Scholar
Zhou F, Yu T, Du R, Fan G, Liu Y, Liu Z, Xiang J, Wang Y, Song B, Gu X et al (2020) Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: a retrospective cohort study. Lancet 395(10229):1054–1062
Article Google Scholar

Download references

Acknowledgements

We would like to thank Gabriel Ahlfeldt, Steve Gibbons, Janet Kohlhase, Henry Overman and two anonymous referees for useful comments and suggestions. The work by Provenzano was supported by the Economic and Social Research Council [Grant No.: ES/P000622/1]. Many of the results in included in this paper had previously been included in the working paper circulated under the title “COVID-19 and Urban Density".

Author information

Authors and Affiliations

Department of Geography and Environment, London School of Economics, London, UK
Felipe Carozzi, Sandro Provenzano & Sefi Roth

Authors

Felipe Carozzi
View author publications
You can also search for this author in PubMed Google Scholar
Sandro Provenzano
View author publications
You can also search for this author in PubMed Google Scholar
Sefi Roth
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Felipe Carozzi.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix

A: Additional figures and tables

See Figs. 4, 5, 6, 7 and Tables 7, 8, 9, 10.

Table 7 Robustness: density and deaths

Full size table

Table 8 Weighted densities: onset of the disease and deaths after 60 days (2020)

Full size table

Table 9 Robustness: suggested mechanisms and weighted densities

Full size table

Table 10 Robustness: cases

Full size table

B: Data sources

USAfacts.org COVID-19 Data

USAFacts is a non-profit civic initiative that provides data on the US population and government and works in partnership with the Penn Wharton Budget Model and the Stanford Institute for Economic Policy Research (SIEPR). The data can be retrieved at: https://usafacts.org/visualizations/coronavirus-covid-19-spread-map/. [Last visited: December 18th 2020]
CDC Official COVID-19 Mortality Rate This database comprises confirmed or presumed COVID-19 fatalities and is limited to counties with at least 10 COVID-19 deaths. It should be noted, the dataset is incomplete because of the time lag between the death and the official certificate submitted to the National Center for Health Statistics (NCHS). For this reason, these data correspond to 514 counties only. The latest figures can be downloaded at: https://data.cdc.gov/NCHS/Provisional-COVID-19-Death-Counts-in-the-United-St/kn79-hsxy. [Last visited: December 18th 2020]
CDC Excess Mortality Excess mortality corresponds to the deviation of total deaths to average expected deaths based on the experience in past years for each state. The latest estimates can be downloaded at: https://www.cdc.gov/nchs/nvss/vsrr/covid19/excess_deaths.htm. [Last visited: December 18th 2020]
US Census contains information about demographics on the country level and can be accessed via: https://www.census.gov/data/tables/time-series/demo/popest/2010s-counties-detail.html. [Last visited: May 14th 2020]
‘COVID-19 Community Mobility Reports’ by Google

This report contains information about the behavioral activity change and social distancing in response to the COVID outbreak by county and day. For more detail on this database please visit https://www.google.com/covid19/mobility/data_documentation.html?hl=en. [Last visited: December 18th 2020]
Social Connectedness Data Obtained after presenting a brief email application for the data based on this paper’s outline to Mike Bailey and others at Facebook. April 6 2020 Release Version.
Healthcare and Income Data from The County Health Rankings and Roadmaps program contains information on healthcare access and various social and economics indicators at the country level and can be accessed via: https://www.countyhealthrankings.org. [Last visited: July 3rd 2020]

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Carozzi, F., Provenzano, S. & Roth, S. Urban density and COVID-19: understanding the US experience. Ann Reg Sci 72, 163–194 (2024). https://doi.org/10.1007/s00168-022-01193-z

Download citation

Received: 13 August 2021
Accepted: 28 October 2022
Published: 28 November 2022
Issue Date: January 2024
DOI: https://doi.org/10.1007/s00168-022-01193-z

JEL Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Urban density and COVID-19: understanding the US experience

Abstract

Similar content being viewed by others

Counting COVID: Quantitative Geographical Approaches to COVID-19

Revisiting the Economic Effects of Density in the Wake of the COVID-19 Pandemic

Spatiotemporal prediction of COVID-19 cases using inter- and intra-county proxies of human interactions

1 Introduction