Keywords

1 Introduction

By now, it has become clear that behavioural change—starting to wear a mask and keeping a social distance—is one of the most important keys to fight the COVID-19 outbreak. As the outbreak enlarges, we see that people’s readiness for such behavioural change depends on various socio-life scientific factors: Epidemiological, political scientific, economic, sociological, behavioural, among others. The present study investigates the roles of these factors by focusing on a very early stage of the pandemic (mid-April, 2020) in the US state of Florida; this focus is set to capture the pure effects of those factors without unnecessary noises. Data on 67 counties is used. Our results support the existence of a natural epidemiological factor in the way the disease spread and the importance of political leadership in containing the pandemic. Some evidence is found that suggests the positive role of social learning in guiding people to make behavioural change towards self-protection.

There are several reasons why we focus on Florida. One is that Florida is a big peninsula mostly surrounded by the sea; only the northern border is connected to other states. Moreover, the population is spread over the state with population centers at Miami (population, 5,502,400) on the southeast coast of the peninsula, Tampa (2,441,900) on the middle west coast, and Orlando (1,510,600) in the center. These geographical features are desirable to detect the natural factors that might be at work in the spread of the virus.

Another reason is that Florida offers a good proxy representing the role of political leadership. The more transparent information is, the better the decision. Despite this, however, President Trump downplayed the danger of, and overstated the government's capability of handling, the outbreak of COVID-19 through the middle of March 2020.Footnote 1 The governor of Florida, Ron DeSantis, strongly supported the president’s view. If people were more readily to believe the overly-optimistic messages from these leaders, they would tend to underprepare against the outbreak, thereby increasing the likelihood of catching the virus and spreading it to others. By investigating this scenario, we capture the role of political leadership.

This study tries to capture the social learning factor of people by the population density and number of cities of a particular region.Footnote 2 By mid-April 2020, it was clear that the instances of COVID-19 were worse in more populated urban regions on the northeast coast of the US, including New York, New Jersey, and Connecticut. If people learned from those experiences, those who lived in an urban region with a higher population density would be more careful and more readily change their behaviour to protect themselves against the virus. If so, the pandemic should spread less in a more populated urban region, thereby evidencing the presence of a social learning factor towards self-protection.

Our results show that the more core supporters of President Trump in a region, the larger number per capita of COVID-19 cases in Florida, which suggests the importance of political messages that a leader delivers in the fight against the pandemic. The more populated a region is, the smaller number per capita of COVID-19 cases. This effect is clearer if county population is adjusted by an urbanization factor of county. Except for the seven most populated counties, the more urban a county is, the smaller the number per capita of COVID-19 cases. For the seven most populated counties, the more urban a region is, the larger the number per capita. These results suggest that social learning towards self-protection takes place in more populated and more urban areas. However, if a county has too large a population, the self-protection effort is not enough effectively to fight against the virus.

Finally, the number of cases accelerates more quickly with the population of a region. This is consistent with the nature of virus transmission.

These results are likely to be special features of Florida, which is relatively isolated from other populated regions. To confirm this, I conduct similar tests for the state of Ohio, which has similar political features, and find that no relationship exists between the number per capita of COVID-19 cases and various potential determinants.

2 Results

We capture the effect of local and national politics by the behaviour of core Trump supporters, which we represent by the ratio between the number of votes for the Republican Trump-supporting governor in the 2018 election and that for the president in the 2016 election, or

$$ R = \frac{{\# Votes\,for\,~the\,Republican\,candidata\,~in~\,the~\,2018\,Gubernatorial\,~election}}{{\# Votes\,~for~\,President\,~Trump~\,in~\,the~\,2016~\,election}} $$
(1)

This choice of a proxy can be explained as follows: Florida is well known as a swing state, in which Republican and Democratic candidates have divided the total votes almost evenly between them in many past elections. In 2016, people voted for President Trump rather decisively. In 2018, Florida elected a new governor, Governor DeSantis, who has been widely regarded as a disciple of President Trump.Footnote 3 He has followed closely the policy taken by Trump. In 2018, as is well known, the president significantly lost popularity due to many scandals; the Republicans lost large numbers of seats nationwide in the 2018 election. From these facts, we may assume that the voters for Governor DeSantis in 2018 election were likely to be strong supporters for President Trump; in other words, the number of votes for Governor DeSantis in 2018 should closely be correlated to that of votes for President Trump in the 2016 election. minus the number of voters who decided no longer to support the president in 2018 election. For this reason, we assume that the ratio in a county between the number of votes for Governor DeSantis and that for President Trump captures the relative side of core Trump supporters in that county, which we call the leader index. Over the Florida counties, the index is distributed between 0.73 and 0.98.

This study conduct OLS (ordinary least squares) estimations to test a number of models that may explain the number per capita of COVID-19 cases by different explanatory variables. In those explanatory variables, the leader index may suffer from the endogeneity problem. In an election, people tend to vote for the candidate with whom they share personal philosophy; it is natural that the same personal philosophy influences how a person acts against the pandemic. As a result, the number of coronavirus cases and the governor/president ratio can be endogenously determined. To deal with this problem, the present study conducts instrumental variable estimations after deciding the best fit model specification by OLS. As the instrumental variable, we adopt the county property tax, which may be positively correlated to the leader index. Because the county property tax is pre-determined way before the current COVID-19 outbreak, it is impossible that the number per capita of COVID-19 cases influences the county property tax. Moreover, it is highly unlikely that there is a direct channel through which the property tax influences the number per capita of COVID-19 cases. For these reasons, the property tax rate can be a good candidate for an instrument for the core Trump supporter index to explain the number per capita of COVID-19 cases.

Our findings may be summarized as follows.

Observation 1

In Florida, where the local leader followed the national leader by downplaying the danger of the virus at an early stage of the outbreak:

  1. (1)

    the spread of the coronavirus accelerates with population,

  2. (2)

    the larger the number of core Trump voters, the worse the spread of the virus,

  3. (3)

    except for the seven most populated counties, the coronavirus cases per capita decrease with urbanization, and

  4. (4)

    many economic factors have little effects on the spread of the virus.

It is highly likely that these results reflect unique features of Florida. To double check this possibility, we examine the cases in Ohio, which shares several crucial political characteristics. None of the above results are observed in Ohio data.

3 Literature

There are a number of studies that, like this study, focus on the early development of the pandemic. Here, I will explain only those studies; this study contributes to the literature that is concerned with political factors as a determinant of the outbreak. Bursztyn et al. (2020) investigate the effect of misinformation on the spread of a pandemic. They focus on two Fox News shows and demonstrate that greater viewership of the more pro-Trump show (Hannity) is associated with more COVID-19 cases and deaths in the early stages of the pandemic. Both this and their studies are based on Johns Hopkins data.

The rest of the literature that is concerned with the effectiveness of social distancing. Ajzenman et al. (2020) investigate the effect on social distancing of the Brazilian president's dismissal of the coronavirus risk. They show that his dismissal reduced social distancing more in pro-government localities, captured by geo-localized mobile phone data. Adolph et al. (2020) show that Republican governors and governors from states with more Trump supporters were slower to adopt social distancing policies. Allcott et al. (2020) report significant gaps between Republicans and Democrats in beliefs about personal risk, which may influence the future path of the pandemic. Barrios and Hockberg (2020) show that as Trump vote share rises, individuals search less for information on the virus and unemployment benefits and reduce less both their daily distance traveled and their visits to non-essential businesses. Anderson (2020) shows that mandatory measures on social distancing may have been effective at reducing the frequency of visits to locations outside of one's home. Brzezinski et al. (2020) show that individuals skeptical about the human causes of climate change are less likely to comply with social distancing regardless of their political views.

More broadly, this study is related to several studies that treat the epidemiology of the coronavirus pandemic from the social scientific viewpoint. By using internet data, Fetzer et al. (2020) document a rapid surge in economic anxieties after the arrival of the coronavirus in the US and substantial heterogeneity in participants’ beliefs about mortality from and the contagiousness of the virus. Using county data on population movement derived from cell phone location data, Wright et al. (2020) build a theoretical model that shows that poverty and negative economic shocks from the US trade war will reduce compliance with social distancing.

There is a large volume of empirical studies that capture the effect of misinformation by media. For studies in that literature, see Bursztyn et al. (2020).

4 Data

We use only publicly available data. The data source for coronavirus cases is from the Johns Hopkins University Coronavirus Resource Center at https://coronavirus.jhu.edu/us-map/, which is the same as Bursztyn et al. (2020). Every day, this site refreshes, and updates, the numbers of cases and deaths in each US county. Our estimation is based on April 15, 2020, data for Florida and April 20, 2020, data for Ohio.

In addition to the leader index, \(R\), the main explanatory variables include population, area and the number of cities, which are regarded as exogenous variables in our estimation equations. Many other explanatory variables are included as control and instrumental variables. They are: urban and rural population density, water area, per capital income, education, wage, median home value, average property tax, poverty rate, and the number of hospitals. We will come back to the choice of instrumental variables in Sect. 4.4.

In place of descriptive statistics, we present the plots of vectors of main variables; the left-hand side panels are on Florida whereas the right-hand side panels are on Ohio. Figure 1 shows that in each data set one outlier exists. For Florida, Miami-Dade County, where Miami is located, has both a large number of coronavirus cases and a large population. For Ohio, the outlier is Marion county, in which more than 1000 people contracted the virus at a state prison. As is shown below, the existence of these outliers has little effect on our estimation results.

Fig. 1
figure 1

Descriptive statistics

All election data is from the New York Times site. Florida population estimates for 2019 are from the Office of Economic and Demographic Research of the Florida Legislature.Footnote 4 Ohio population estimates are from Ohio Demographics at the Cubit site.Footnote 5 County area data is from the site of Florida Association of Counties for Florida and from the USA.com site for Ohio.Footnote 6 Data on the number of cities is from the Florida Association of Counties site for Florida and from Wikipedia (list of cities in Ohio) for Ohio.Footnote 7 Data for instrumental variables are from the following sites. Education data from the Town Charts.Footnote 8 Median home value and average property tax data for Florida from the Smart Asset site.Footnote 9 Data for control variables are from the following sites. Median home value data for Ohio from the Cleveland.com site.Footnote 10 Urban and rural population density data from 2010 Census Urban and Rural Classification and Urban Area Criteria.Footnote 11 Florida water area data from the USA.com site.Footnote 12 Per capita income data from the US Department of Commerce, Bureau of Economic Analysis (2018).Footnote 13 Wage rate data from the US Bureau of Labor Statistics site.Footnote 14 Poverty data from Sandoval (2015, percentage of poor) for Florida and from the Index Mundi site for Ohio.Footnote 15 Data on the number of hospitals from Wikipedia.Footnote 16

5 Possible Determinants of the COVID-19 Spread

5.1 Acceleration Effect of Population

To capture the determinant of the COVID-19 spread, we estimate several different models. The first is the single explanatory variable model. That is, we estimate the number per capita of COVID-19 cases on one of the demographic and economic factors considered here, that is, population, population density, the number of cities in a county, per capita income, education level, median-home value, wage rate, the area of a county, the number of hospitals, and poverty level. The estimation equation is.

$$ n = c + \beta _{o} \left[ {one\,~social\,~factor} \right], $$
(2)

where the dependent variable is the number of COVID-19 cases per capita in a county, \(n = N/P\), where \(N\) and \(P\) are the number of cases in county and the population of the county, respectively.

Table 1a summarizes the estimation results. This shows that the number per capita of COVID-19 cases are significantly correlated to the socio-economic factors considered in this study. Many of these correlations are, however, absorbed by population.

Table 1a Factors contributing to the outbreak in an early stage in Florida. Dependent variable: Per capita cases in Florida counties (# of observation = 67)

That is, once we include population in addition to one of the factors, the correlation with the latter factor disappears. That is, we estimate

$$ n = c + \beta _{P} P + \beta _{o} \left[ {another\,~explanatory\,~factor} \right]. $$
(3)

The estimation results are summarized in Table 1b. The table shows that once population is included in the estimation, most of the other explanatory variables become insignificant except for population density and the leader index.

Table 1b Factors contributing to the outbreak in an early stage in Florida:. Joint effects with population. Dependent variable: Per capita cases in Florida counties (# of observation = 67)

In all estimation results, the number of per capita cases is positively correlated to population. This implies, by \(n = N/P\), that column 1 in Table 1a, for example, gives rise to

$$ N_{j} = 41.5\left( {\frac{{P_{j} }}{{10^{5} }}} \right) + 5.8\left( {\frac{{P_{j} }}{{10^{5} }}} \right)^{2} $$
(4)

This shows that, as population increases, the coronavirus cases increase with an acceleration rate of 11.6 people per 10,000 people. All the other estimations show that the coefficient on \(P^{2}\) is significant, mostly at the 1% level and positive. This is consistent with the theoretical prediction that if people would believe leaders who downplay the danger of an epidemic, the epidemic would spread in a natural manner. In the literature on epidemiology, it has been established that the spread of the outbreak accelerates with population at its early stage (see Kermack and McKendrick (1927)).

Finding 1

In Florida, the spread of COVID-19 cases accelerates with county population at an acceleration rate between 9 and 14 per 10,000 people with/without control variables included.

5.2 Pandemic Suppressing Effect of Population Density

See Table 1b. An important finding in the previous model, (3), is that the significant effects of population density \(D\) and the leader index \(R\), defined by (1). Section 2. That is, leader index \(R\) is positively correlated to the number per capita of COVID-19 cases. This suggests that the presence of core Trump supporters worsens the contagion.

Even more interesting is the finding that population density is negatively correlated to the number per capital of cases. As is discussed in the Introduction, if people realized that COVID-19 spreads more readily in a more populated area, they might put more self-protective efforts. Our finding is consistent with this possibility.

With these considerations, in this section, we add population density \(D\) and leader index \(R\) in our estimation equation, which is

$$ n = c + D + R + \beta _{P} P + \beta _{o} \left[ {another\,~explanatory\,factor} \right]. $$
(5)

Table 1c summarizes the results. An important finding is that once population, population density and the president/governor ratio are controlled, each of the other explanatory variables loses explanatory power completely. This suggests that model (5) fits our data better than models (2) and (3). At the same time, the effect of the leader index remains significant and positive.

Table 1c Factors contributing to the outbreak in an early stage in Florida: Joint effects with population, population density, and G/P ratio. Dependent variable: Per capita cases in Florida counties (# of observation = 67)

The effect of population density remains significant and negative. This sharply contrasts with the common intuition that, as population density increases, the chance of contracting an epidemic would become higher.

One possible reason might be that population density (population per square mile) may not capture how crowded a particular region actually is. With this consideration, we estimate (2) by controlling urban and rural population and water area. Table 2a shows, however, that even if they are controlled, the negative effect of population density persists whereas urban and rural population densities and water area are insignificant for all but one specification. Another possible explanation may be that the negative correlation is due to the outlier (Miami-Dade) in the dataset. As Table 2b shows, in fact, population density becomes insignificant if Miami-Dade County is excluded.

Table 2a Effects of missing geographic factors and Miami-Dade (Outlier) in Florida: without Miami
Table 2b Effects of missing geographic factors and Miami (Outlier) in Florida: With Miami

Finding 2

A higher population density leads to a fewer number per capita of COVID-19 cases.

5.3 Pandemic Suppressing Effect of Urbanization

As is discussed above, congestion might lead people to behave more carefully against the COVID-19 outbreak, thereby negatively affecting the per capita cases. This result may be reinforced by including in estimation the degree of urbanization in a county, \(K\), which is measured by the number of cities, and the augmented population by the degree of urbanization, that is, \(K \cdot P\), where \(K\) denotes the number of cities in a county. Equation (5) now becomes

$$ n = c + \beta_{K} K + \beta_{KP} K \cdot P + \beta_{D} D + \beta_{P} P + \beta_{o} $$
(6)

Columns 1 through 3 of Table 3 summarizes the estimation results for various combinations of explanatory variables, population \(P\) and effective population \(K \cdot P\). In order to check multicollinearity, we report the vif values on each coefficient in double parentheses; we report only results in which the vif values are all smaller than 10, which suggests that multicollinearity does not pose a serious problem.

Table 3 Population or urbanization

The results in Table 3 show that once effective population \(K \cdot P\) is included, neither population \(P\) nor population density \(D\) is significant any more. Columns 4 and 5 of Table 4 show that urbanization factor \(K\) alone does not have a similar effect.

Table 4 Effects of urbanization and urbanization adjusted population

These results suggest that the main factors related to the number per capita of COVID-19 cases are urbanization \(K\) and urbanization-adjusted population \(K \cdot P\). The last model we examine, therefore, includes only these variables with other control variables. The estimation equation is.

$$ n = c + \beta_{K} K + \beta_{KP} K \cdot P + \beta_{o} \left[ {control \, variables} \right] $$
(7)

Table 4 summarizes estimation results. The first four columns are with all data and the second four columns are without Miami-Dade data. With or without Miami Dade County, the coefficients on effective population, urban factor (the number of cities), and education are all significant at the 1% level. The coefficient on urban factor is negative, which we may capture the preventive effect of congestion against COVID-19 infection.

In all estimations, effective population \(K \cdot P\) is positively correlated to the number per capita of COVID-19 cases. In contrast, urbanization \(K\) is negatively correlated to it. This suggests that those who live in a county with a high level of urbanization tend to contract COVID-19 less. This suggests that people might learn from the experience of northeastern states and make more self-protection efforts.

Finding 3

A higher urbanization leads to a fewer the number per capita of COVID-19 cases.

6 Determinants of the Pandemic: Political Leadership and Self-Protection

The results summarized in Tables 3 and 4 suggest that we may replace population \(P\) and population density \(D\) by effective population \(K \cdot P\) and urbanization factor \(K\). With this consideration, we now investigate whether or not the number of coronavirus cases per capita is positively influenced by the core Trump supporters, unbanization, and urbanization adjusted population. Our estimation equation is

$$ n = c + \beta_{GP} R + \beta_{K} K + \beta_{KP} K \cdot P + \beta_{o} \left[ {control \, variables} \right] $$
(8)

Voting behaviour and self-protection behaviour against COVID-19 might be simultaneously determined by some missing variables, political beliefs, types of information that voters receive from media, among others. In order to cope with this endogeneity problem and to find a causality between the core Trump supporters and the COVID cases, we adopt the property tax of a county as an instrumental variable for the leader index. Because the property tax was determined before the COVID-19 pandemic, it is unlikely that the property tax of a county directly affects the number per capital of COVID-19 cases in the county. Thus, the property tax is a very good candidate for an instrumental variable for the leader index.

Table 5 presents an estimation result, which clears standard tests for instrumental variable estimation. As the table shows, the effects of the core Trump supporters, effective population, and urbanization are all significant; the number per capita of COVID-19 cases may be expressed as.

Table 5 Effects of urbanization and urbanization adjusted population
$$ n = - 0.0043013 - 2.83E^{ - 11} \left( {812701 - P} \right)K + 0.00570R. $$
(9)

This shows that an increase in the leader index leads to a higher number per capita of COVID-19 cases.

Finding 4

Political leadership matters in the fight against the COVID-19 pandemic.

Equation (9) captures the preventive effect of urbanization against COVID-19. That is, the number of per capita cases decreases with urbanization (an increase in \(K\)) among the counties with populations smaller than the cutoff value of \(812701\). Of the 67 Florida counties, Miami Dade County has the largest population (2,812,130), followed by Broward (1,919,644), Palm Beach (1,447,857), Hillsborough (1,444,870), Orange (1,386,080), Pinnella (978,045), Duval (970,672), and Polk (690,606). Thus, we may summarize this finding as follows:

Finding 5

Except for the seven most populated counties, the preventive effect of urbanization against COVID-19 exists; that is, the higher the degree of urbanization, the smaller the number of per capita COVID-19 cases, if other factors are kept constant.

7 Unique Environments Surrounding Florida

Our results show the importance of political leadership, in particular national political leadership, and self-protection in the fight against COVID-19. We believe it unlikely to obtain similar results with data other than Florida; there is no other industrial state in the US that is surrounded by the sea. Because Florida is surrounded almost completely by the sea, it eliminates potential noises that may dampen the effect of political and social learning factors. Moreover, Florida happened to have the gubernatorial election in 2018 and elected a Republican governor, who strongly supports President Trump, who is also a Republican. Unless these two conditions were not met, it would have been difficult to capture our results.

In order to double check this possibility, we study Ohio by the same method. Ohio elected a Republican governor in 2018, Governor Dewine. However, Governor Dewine has maintained a distance from the president since the time of his election.Footnote 17 Ohio was the first to close schools in the middle of March.Footnote 18 In April 2020, it was considered to be one of the more successful states in controlling the virus.Footnote 19

To check these differences matters, we make the same estimations with Florida data. The estimation results on models (1), (2) and (4) by using Ohio data show that almost no variables have significant effects; see Yano (2020, Tables 1 and 2) for details. This suggests that Florida is uniquely suited for the purpose of our study.

8 Conclusion

Our results show that political and demographic factors are important in determining the way in which COVID-19 spreads. In a state in which both local and national leaders downplay the danger of the virus, the number of per capita coronavirus cases increases with the number of core supporters of the national leader (Finding 4). At an early stage, the outbreak spreads in a natural manner, accelerating with population (Finding 1). Except for a handful of highly populated counties, urbanization reduces the number of per capita coronavirus cases (Finding 5). No such effects emerge in a state in which the local governor took a careful approach, as shown by Ohio data.

This result may be interpreted as follows. If both local and national leaders downplay the danger of the virus, their core supporters may be led to become less careful against the virus. At the same time, many people compensate for inadequate information and take their own self-protective measures, which may explain the preventive effect of urbanization against COVID-19 observed in most of the Florida counties (except the seven most populated counties).

Developing a cheap-talk game, Honryo and Yano (2020) show that a government can easily manipulate information if the government cares how the public perceives the government's performance. This theoretical prediction is empirically supported by our results as well as the recent work of Ajzenman et al. (2020), Bursztyn et al. (2020), and Brzezinski et al. (2020). The recent work of Blickle (2020) indicates that the future course of a society can be badly affected by a pandemic. Our result suggests the importance of transparency in the information provided by the government.