We linked several databases together for the analysis. The first database used is the central restaurant visitor data in the USA. The database is obtained from SafeGraph’s ‘Core Places’ database (SafeGraph (2020)).Footnote 2 SafeGraph tracks consumers’ mobile devices with their consent (using different sources) and tracks daily visits to hundreds of thousands of points of interest in the USA, including all kinds of restaurants. SafeGraph uses GPS pings from different mobile applications to estimate foot traffic patterns in restaurants in the USA. This detailed database provides daily consumer visit data to different restaurants by address, zip code, and 12-digit block-group FIPS codes. Using this data, we aggregated daily visits to get weekly visits for each restaurant within a 12-digit FIPS code area. We dropped duplicate observations by restaurants within a day (this was less that 0.03%). Our data is restaurant level, and each restaurant is one observation within a day. We calculated total weekly restaurant visits per 1000 population (i.e., all our restaurant visit rates are calculated per 1000 of population of the corresponding counties. This SafeGraph data also includes all curbside pickups by food delivery services. However, since workers are excluded from the visit counts, if restaurant workers did home deliveries, that would be excluded from the database.
SafeGraph acknowledges small geographical biases in the data collection due to tower tracking technological differences. However, SafeGraph measured these small geographical biases in the data and found the range of these biases to be from less than 1% to a maximum of 3% in any state.
The period of the database used in this study is from February 1, 2020 to April 5, 2020 for all counties in the USA. We selected this period to analyze changes in restaurant visits immediately before and during the COVID-19 pandemic in the USA. For this analysis, we concentrate on four different kinds of restaurants defined by the NAICS codes: (1) full-service restaurants, (2) fast-food and quick-service restaurants, including McDonald’s, Burger King, etc., (3) buffet restaurants, and (4) nonalcoholic drink bars such as Starbucks.
We added county-level income and poverty data from the U.S. Census Bureau and Small Area Income and Poverty Estimates (United States Census Bureau 2019) respectively. By merging the latter data with the block-group FIPS code for the restaurants, we added economic control variables in our model. Next, we obtained the rural–urban characteristics of counties from the United States Department of Agriculture Economic Research Service’s (2020) Rural–Urban Continuum Codes. If the code is higher than 3, the county is categorized as rural (following federal rules).
Independent variables
The independent variables were as follows: county cases (total number of cases), county deaths (total number), median income (in thousands of dollars), population below poverty level (in percentage), urban and rural differences (1 if respondent lived in an urban or suburban area; 0 otherwise), population (total population), and restaurant category (1 if restaurant matches the category full-service, fast food, buffet, or drink bar; 0 otherwise).
Table 1 presents the summary statistics for all variables. It shows that the average per week visits to any restaurant open in the urban counties is 56 for the weeks from February 1 to April 5, 2020. The corresponding average weekly visits for all open rural restaurants is 49 during the same period. For the time considered in this paper, average urban COVID-19 cases are 122, and rural cases are 1.4. Urban death rates are also higher than in the rural areas. Urban median household income is, on average, $68,395, while the average rural income is $50,365. About 13% of urban county residents are below the poverty level, whereas about 15% of rural county residents are below the poverty level. The p values representing the significance of the differences between the means are listed at the bottom of the table. The p values range from 0.001 to 0.003, and show that there are significant differences between urban and rural restaurant visits, the extent of COVID-19 spread, and resident incomes.
Table 1 Summary statistics for all variables We present the change in the total US weekly restaurant visits per 1000 population (the red line) in Fig. 1. The x-axis represents the days from February 1 to April 5. The y-axis is the total number of COVID-19 cases (i.e., the actual count) in the USA (blue line; left y-axis) and restaurant visits per 1000 population (right y-axis). According to Fig. 1, per capita weekly US restaurant visits started to decrease around the end of February. This was when media outlets started reporting the COVID-19 pandemic spread within small communities in the USA and people became more apprehensive. After COVID-19 cases increased in the USA (i.e., after March 1), there were rapid drops in restaurant visits until April 5. Figure 1 shows a clear negative connection between per capita restaurant visits and the number of COVID-19 cases.
Estimation strategy
We indicate the treatment variable as a dummy taking the value 1 for the dates the shelter-in-place order was implemented in the county, from February 1 to April 5, 2020. Urbanit and ruralit are binary variables representing the urban and rural status of county i. The control groups for the urban and rural counties are the urban and rural counties with no effective shelter-in-place orders. We estimate the difference-in-differences specification (DID) — i.e., the comparative changes in restaurant visits across urban and rural counties, and between those counties that had effective lockdowns versus those counties that did not — as:
$$ \ln {(visit)}_{jit}={\beta}_1\left({urban}_{it}\times {treatment}_{it}\right)+{\beta}_2\left({rural}_{it}\times {treatment}_{it}\right)+\alpha {X}_{it-1}+t+{\varepsilon}_{ijt.} $$
(1)
In the above Eq. (1), the dependent variable is the natural logarithm of weekly visits to restaurant j in county i for week t. ln(visit)jit represents natural logarithm. The urbanit and ruralit variables represent the urban or rural status for restaurant j’s county for week t. The slope coefficients β1 and β2 estimate average responses to state-mandated orders between the treatment and control counties.
The coronavirus spread could differentially influence the following week’s restaurant visits. To capture this effect, we include the county-level previous week’s coronavirus deaths in our first specification, as this captures the pandemic nature of the disease and public health hazard. Xit-1 represents lagged weekly county-specific variables and include county-specific coronavirus deaths during the week (t-1). The slope coefficient α estimates the lagged county-specific effects on weekly restaurant visits.
We also include contemporaneous and previous week coronavirus cases as well as state-specific cases as robustness checks in separate specifications. Median household income, percentage of people under the poverty level, and population are added as additional controls for the county. Time-fixed effects are included. The fixed effect controls for the time variation in the data. These variations include coronavirus testing rates, political partisan differences occurring on a daily basis, etc. Standard errors are clustered at the state level.