1 Introduction

The coronavirus disease 2019 (COVID-19) crisis was caused by a viral pandemic that abruptly and severely constricted human mobility and economic activity. Governments worldwide have increasingly implemented strict public health measures in response to the outbreak, such as social distancing and even complete lockdowns.

The shock of the COVID-19 crisis gives us a unique opportunity to investigate the causal effect of social distancing or a pause in human mobility and economic activities, which is generally impossible, on air quality. In this paper, we exploit the exogenous shock of COVID-19 and use a variety of high-quality data sets to study the causal effect of an unprecedented change in human mobility and economic activity on air pollution in the Chinese context.

We focus on China for two reasons. First, China suffers from severe air pollution (Chen et al. 2018; Freeman et al. 2019; Heyes and Zhu 2019; Shi and Xu 2018; Wang et al. 2018). Second, China offers a good setting for assessing the economic impact of COVID-19. It was the first country in the world to experience a large-scale outbreak, starting in January 2020, which means that there has been a sufficiently long post-outbreak period to ensure we have the statistical power for credible estimation.

We use various difference-in-differences (DID) estimation strategies to disentangle the effect of COVID-19 on AQI from other confounding effects (Fang et al. 2020). In particular, we introduce two alternative specifications of the treatment group to ensure the reliability of our estimation. In both specifications, we present strong evidence that lockdown policies significantly reduced air pollution. In our estimation, lockdown policies designed to suppress the spread of COVID-19 in China decreased air pollution by 12% on average.Footnote 1

We then check the robustness of our findings. First, we verify the identifying assumption associated with our baseline DID specification; that is, we assume that the change in air pollution in the treatment group would have followed the same trend as that in the control group if no lockdown policies had been issued. Second, we introduce a regression discontinuity design (RDD) to examine whether our findings are sensitive to the window width before and after the policy period. In both robustness checks, we got highly consistent results.

To explore the mechanism underlying our findings, we conduct three tests examining the anticipatory effects, spillover effects, and cities’ connection to Wuhan. First, we present evidence that the prior information leakage about lockdowns distorted human anticipation and behaviors, induced a significant increase in air pollution. Second, the Wuhan lockdown had significant spillover effects on other cities in terms of increased air quality, due to the effect on human mobility. Third, we construct a variable \( Connections \) to capture a city’s level of connection to Wuhan based on real-time between-city population movements data from Baidu. Our results show that connection with Wuhan significantly strengthened the effect of lockdown policies and the spillover effect of the Wuhan lockdown.

Finally, we examine the heterogeneous effects of public willingness to self-isolate, public panic, or government GDP incentives on our baseline findings, and find that the effect of lockdown policies is higher in cities that have more people willing to self-isolate, more public panic, or governments under less pressure to stimulate economic growth.

Our study is related to, but differs from, a recent paper (He et al. 2020) that also examines the effect of the COVID-19 pandemic on air pollution. First, we use two alternative settings to conduct DID estimation, to ensure that our findings are solid. Second, our rich data set based on population movements data from Baidu allows us to explore the moderating effect of human mobility and a city’s connection to Wuhan behind our baseline results. Third, based on a mechanism analysis, we explore how the anticipations of lockdown policies moderates the effect of these policies and the spillover effect of the Wuhan lockdown on other cities’ air quality. Finally, we analyze cross-sectional heterogeneity based on willingness to self-isolate, public panic, or government’s GDP incentives, which enriches our understanding of the effects of COVID-19 on air quality.

We contribute to the literature in two ways. First, we add to the rapidly growing literature about the economic consequences of the COVID-19 pandemic by showing its sizeable effects on air quality. We thus complement existing studies of consumption, labor markets, capital market violability, human mobility, and firm operations (Baker et al. 2020; Bartik et al. 2020; Chen et al. 2020; Ding et al. 2020; Hassan et al. 2020; Dingel and Neiman 2020; Fang et al. 2020; Coibion et al. 2020). We also complement policy evaluation of the unintended consequences of different policies for air quality (Barron and Torero 2017; Fu and Gu 2017; Lalive et al. 2018; Zhang et al. 2017). Second, by evaluating the determinants of air quality in China, this study complements the emerging literature on its causes and consequences (Chen et al. 2018; Freeman et al. 2019; Heyes and Zhu 2019; Wang et al. 2018), thereby offering timely implications for policymakers. In addition, we present strong evidence that human mobility, economic activities, economic connection, willingness to self-isolate, public panic, and GDP growth incentives are important driving forces of air pollution. Such findings could be taken as a benchmark for regulators aiming to improve environmental protection and reduce toxic emissions when the economy recovers from COVID-19.

The remainder of this paper is organized as follows. Section 2 describes the data set and empirical strategy. Section 3 presents the baseline results. Section 4 explores the underlying mechanisms and cross-sectional heterogeneity. Section 5 concludes the paper.

2 Data and Empirical Strategy

2.1 Data and Descriptive Statistics

The two main data sets that we use are daily air quality index (AQI) data and daily weather reports for major cities in China.

2.1.1 AQI Data

To describe the air pollution level in each city, we obtain AQI data from the website of the Ministry of Ecology and Environment of the People’s Republic of China (http://www.zhb.gov.cn/). The website provides daily information on the AQI, air quality class, and primary pollutants in 400 cities in China. According to the standard of the Ministry of Ecology and Environment of the People’s Republic of China, the \( AQI \) is based on the level of six atmospheric pollutants: sulfur dioxide (\( SO_{2} \)), \( NO_{2} \), suspended particulates smaller than 10 μm in aerodynamic diameter (\( PM_{10} \)), suspended particulates smaller than 2.5 μm in aerodynamic diameter (\( PM_{2.5} \)), carbon monoxide (\( CO \)), and ozone (\( O_{3} \)). Based on the concentrations of each pollutant, an \( Individual\,Air\;Quality\;Index \) (\( IAQI \)) score is assigned, and the final \( AQI \) is the highest of these six scores. A higher \( AQI \) means more atmospheric pollutants in air and a higher level of air pollutions.

2.1.2 Weather Data

To measure the weather conditions, we obtain data on the maximum wind speed, precipitation, average humidity, and average temperature at each station from the China Meteorological Administration (CMA) website. We then define city level weather conditions as the average of stations within the corresponding city.

2.2 Model Specifications

According to some reports, the COVID-19 virus is so dangerous that the median time between the onset of symptoms and death may be as little as 10 days. Moreover, COVID-19 can be transmitted while the carrier is still asymptomatic. These characteristics make COVID-19 a huge threat to healthcare capacity. To suppress the spread of COVID-19, China imposed an unprecedented extreme intervention by placing Hubei province and especially Wuhan city on lockdown. Wuhan was placed on lockdown at 10 a.m., January 20, 2020, and other cities in Hubei were locked down several days later. As COVID-19 spread, more and more cities issued different lockdown policies. According to Fang et al. (2020), 80 cities in 22 provinces issued different levels of lockdown policies before February 20, 2020.

Based on this setting, we use several DID estimation strategies by comparing different treatment and control groups to identify the impact of lockdown policies on air quality. The DID specification can be described as follows:

$$ \begin{array}{*{20}c} {ln\left( {AQI} \right)_{i,t} = \alpha_{i} + \beta_{1} Treat_{i,t} *Post_{i,t} + W_{i,t} + X_{i,t} + \epsilon_{i,t} } \\ \end{array} $$
(1)

where \( i \) denotes the city and \( t \) denotes the day. The dependent variable \( ln\left( {AQI} \right)_{i,t} \) is a measure of air pollution in city \( i \) on day \( t \), which equals the logarithm of the air quality index. The key variable of our DID design is \( Treat_{i,t} *Post_{i,t} \). Considering that within different specific designs we use different control groups, and the definitions of \( Treat_{i,t} \) and \( Post_{i,t} \) thus vary between research designs, we are explicit about their definitions below. According to Fu and Gu (2017), weather conditions influence emissions and air quality. For example, precipitation and wind speed are key meteorological variables influencing near-surface particulate matter within cities (Jones et al. 2010), and cloudy, cool, rainy, or windy days hinder the formulation of ozone. We also control for daily weather variables \( W_{i,t} \), a vector of daily weather for city \( i \) on day \( t \), including maximum wind speed, precipitation, average humidity, and average temperature, to eliminate the impact of weather conditions. Lastly, we control for several fixed effects. We control for city fixed effect \( \alpha_{i} \) to eliminate the impact of unobserved, time-unvarying city attributes on air quality. We control for the date fixed effect \( X_{i,t} \) to eliminate the impact of time-varying attributes on air quality, including a weekend dummy, month dummies, year 2020 dummy, and Spring Festival Holiday dummy.

Model (1) has the special cases in our different research designs. First, following the DID strategy, we can estimate Model (1) by studying the different effects of the lockdown policy on a treatment group versus a control group. Second, if we choose a small window before and after the lockdown policy, Model (1) can be estimated following the regression discontinuity design (RDD) method, because such an estimation can show whether the trend in air quality has a “jump” during and after the policy period (Fu and Gu 2017). Following Gelman and Imbens (2019), who provide three reasons why high-order polynomials should not be used in RDD, we add a linear or quadratic polynomial term \( f\left( t \right) \) in our RDD estimation. The RDD specification can be described as follows:

$$ \begin{array}{*{20}c} {ln\left( {AQI} \right)_{i,t} = \alpha_{i} + \beta_{1} Treat_{i,t} *Post_{i,t} + W_{i,t} + X_{i,t} + f\left( t \right) + \epsilon_{i,t} } \\ \end{array} $$
(2)

where \( i \) and \( t \) denote the city and day respectively, \( f\left( t \right) \) denotes a flexible polynomial time trend term controlling for unobserved confounding factors that may affect daily air quality.

3 Estimation Result

3.1 Summary Statistics

We present the summary statistics of the key variables in Table 1. We winsorize all of the continuous variables at the 1st and 99th percentiles to alleviate the concern that our results may be driven by outliers. Panel A of Table 1 presents the summary statistics for the full sample. The dependent variable \( ln\left( {AQI} \right) \) has a mean of 4.172 with a variance of 0.586. This means that during our sample period, the average air quality of China could be categorized as “good” according to the standard of the Ministry of Ecology and Environment of the People’s Republic of China. However, the 75th percentile of \( ln\left( {AQI} \right) \) is 4.566, which means that nearly a quarter of city-days should still be categorized as “lightly polluted” or worse.

Table 1 Summary statistics

Panel B presents the air quality within the pre-lockdown period of cities that implemented lockdown interventions and Panel C reports the air quality for the post-lockdown period of these cities as. Whether it is measured by \( ln\left( {AQI} \right) \), \( ln\left( {PM_{2.5} } \right) \), or \( ln\left( {NO_{2} } \right) \), the air quality after lockdown is much better than the air quality before lockdown, suggesting a reduction in air pollution due to the lockdown policy.Footnote 2

3.2 Baseline Results

In this section, we estimate Model (1) to identify the impact of lockdown policies on air pollution. Here, we implement two DID strategies that differ in the estimation sample, and the definition of the variable Treat.

First, we use the sample of 80 cities that issued lockdown policies during the coronavirus outbreak. Following Fang et al. (2020), we compare the change in air quality in these cities in 2020 to the same cities in the matching lunar calendar period in 2019. Under this setting, \( Treat_{i,t} \) takes the value of 1 for the 2020 data of city \( i \), and 0 for the 2019 data. \( Post_{i,t} \) is a dummy variable that takes the value of 1 for the sample period after city \( i \) issued a lockdown policy in 2020, or after the same lunar calendar date in 2019, and 0 otherwise.

Second, we use the sample of 2020 data for 80 locked-down cities and 284 cities that were never locked down during the coronavirus outbreak. Under this setting, \( Treat_{i,t} \) takes the value of 1 if city \( i \) issued any lockdown policy in 2020, and 0 otherwise. Thus, the control group consists of 284 cities that were not subject to lockdowns.Footnote 3\( Post_{i,t} \) takes the value of 1 for the sample period after city \( i \) issued a lockdown policy, and 0 otherwise.

We interpret the coefficient of \( Treat_{i,t} *Post_{i,t} \) as the impact of lockdowns on air quality or pollution in both of the above strategies. Panel A of Table 2 presents the results.

Table 2 Baseline results

The dependent variable of Panel A is the logarithm of AQI. Columns (1) and (2) are results of the first strategy and columns (3) and (4) are results of the second strategy. The coefficient of \( Treat *Post \) in column (1) is significantly negative and shows that in 2020 the AQI decreased by 13.5% during the lockdown period compared with the same lunar calendar period in 2019. Column (2) presents the results of Model (1) with a vector of weather variables and a vector of fixed effects. The coefficient of \( Treat *Post \) is significantly negative and is larger in magnitude. Columns (3) and (4) present consistent results.

The \( AQI \) level is based on the levels of six atmospheric pollutants. Based on the concentrations of each pollutant, an individual \( Individual\;Air\;Quality\;Index \) (\( IAQI \)) score is assigned to each and the final \( AQI \) is the highest of these six scores. Therefore, we conduct two robustness checks by investigating the impact of lockdown policies on the \( IAQI \) of \( PM_{2.5} \) and \( NO_{2} \). we define \( ln\left( {PM_{2.5} } \right) \) and \( ln\left( {NO_{2} } \right) \) as the logarithms of the \( IAQI \) s of \( PM_{2.5} \) and \( NO_{2} \), and regress them as Model (1). Panel B and C of Table 2 report the regression results.

Panel B of Table 2 presents the results of \( ln\left( {PM_{2.5} } \right) \), and shows that the coefficients of \( Treat *Post \) are all significantly negative and suggest that the level of PM2.5 decreased by 11% or 24% during the lockdown period. Panel C of Table 2 presents the result of \( ln\left( {NO_{2} } \right) \), and shows consistent results.Footnote 4

In summary, we find a significant and robust relationship between lockdown policies and air pollution with different control groups. The lockdown policies intended to suppress the spread of COVID-19 in China significantly decreased air pollution by 12% on average.Footnote 5

3.3 Robustness Checks

First, the identifying assumption associated with our baseline DID estimation specification is that the change in \( AQI \) in the treatment group would have followed the same trend as that in the control group if no lockdown policies had been issued. Thus, we examine the dynamic effect in the regression framework to address possible concerns about our identifying assumption. Specifically, we create a series of time dummy variables \( Post_{i,t}^{\eta } \) by modifying the official issue date of lockdown policies within 1 week around the real date, and interact each of them with our treatment variable using the following regression:

$$ \begin{array}{*{20}c} {ln\left( {AQI} \right)_{i,t} = \alpha_{i} + \sum\limits_{{{\text{n}} = - 4}}^{{{\text{n}} = 3}} {\beta_{n} Treat_{i,t} *Post_{i,t}^{n} + W_{i,t} + X_{i,t} + \epsilon_{i,t} } } \\ \end{array} $$
(3)

where \( Post_{i,t}^{n} \) is a dummy variable that takes the value of 1 for \( n \) days after (before) city \( i \) issues a lockdown policy in 2020, or the same lunar calendar date in 2019, and 0 otherwise. A total of seven dummies are accumulated, spanning a week around the official issue date. In this regression, we are particularly interested in the difference in the coefficients for all of the interaction terms, which we plot in Fig. 1.

Fig. 1
figure 1

Dynamic DID. Note This figure plots the difference of interaction terms’ coefficients between the period before and after the official lockdown day under a continuous treatment variable. The solid line shows the estimated coefficients over time. In each day, the dashed line surrounding the coefficient is 95% confidence intervals of it

Figure 1 presents the difference in the interaction terms’ coefficients. The left (right) graph shows the distribution of interaction terms’ coefficients when we compare locked-down cities in 2020 to the same cities in 2019 (locked-down cities to unlocked-down cities in 2020). Both these plots show that during the pre-lockdown days the coefficients are clearly insignificantly different from zero, while the coefficients during the post-lockdown days are clearly significantly negative. This figure demonstrates that the impact of lockdown policies on air pollution was not induced by time trends.

Next, to check whether the preceding results are sensitive to the window width before and after the policy period, we use the RDD method with local linear or quadratic polynomial time terms to estimate the “jump” in air pollution before and after the lockdown policies within a narrow window. Specifically, we regress \( ln\left( {AQI} \right) \) as Model (2), which adds a linear or quadratic time trend before and after the issue of each city’s lockdown policies. To perfectly capture the “jump” of AQI, we only focus on observations 7 days around the issue of lockdown policies. Table 3 reports the regression results.

Table 3 RDD estimation

Column (1) (2) and (3) present the results with no, linear and quadratic time trend respectively. All coefficients of \( Treat *Post \) are significantly negative, suggesting that air pollution did “jump” with the issuing of lockdown policies, which is consistent with Table 2.

4 Potential Mechanisms and Heterogenous Effects

4.1 Anticipatory Effects

Although the lockdown had a considerable impact on air pollution and significant effects on \( AQI \) and \( IAQI \), it may also have increased air pollution. Fang et al. (2020) show that a panic effect in reaction to the outbreak could significantly impact human behaviors. For example, before the government officially confirmed the lockdown, rumors that Wuhan would be locked down spread on the Internet. For example, the Baidu Search Index for “Wuhan lockdown” was 0 before January 20, increased to 15,649 and 21,905 in the next 2 days. Such information leakage has a significant impact on human behaviors and can cause major social chaos. It can also have had a significant effect on air pollution, such as increasing public and private transportation, or abnormal information seeking of people who anticipated the lockdown policy.

To investigate the impact of anticipated future lockdowns, we first provide visual evidence in Fig. 2 by applying the RDD method and estimating the effect of the lockdown policy on air pollution in Wuhan, the first city to be locked down in response to the COVID-19 outbreak.

Fig. 2
figure 2

AQI of Wuhan. Note This figure plots the distributions of AQI of Wuhan city against date. The left and right graphs present the result in 2020 or 2019 around the lunar calendar date of official locked down day, respectively. The lunar calendar date of official locked down day of cities in 2020 is normalized to 0

Figure 2 shows the quadratic trend of \( AQI \) within three periods: from 11 to 3 days before the issue of the lockdown policy, from 3 days before to the issue day, and from the issue day to 7 days after. The left graph presents the trend in 2020 and suggests that the air pollution of Wuhan “jumps” twice in our sample. Due to anticipatory of lockdown, air pollution increased greatly from 3 days before. After Wuhan was officially placed on lockdown, air pollution decreased much more, and even lower than the level within the period 11 days before. The right graph of Fig. 2 presents that there is no similar pattern in 2019. Figure 2 suggests that although the lockdown policy considerably decreased air pollution during the lockdown period, the anticipatory effects induced a significant increase of air pollution before the official issue of the lockdown policy, due to information leakage.

Next, we provide empirical evidence of the “anticipatory effects” of lockdown policies in terms of air pollution. We include a pre-lockdown period indicator \( Before_{i,t} \), which is a dummy variable and takes the value of 1 for the period from 3 days before the unprecedented announcement of a lockdown policy in city \( i \), and 0 otherwise. \( Before_{i,t} \) allows us to capture the anticipatory effects of lockdown policies. We re-estimate the impact of lockdown policies on air pollution using city-level data and the following DID specifications:

$$ \begin{array}{*{20}c} {ln\left( {AQI} \right)_{i,t} = \alpha_{i} + \beta_{2} Treat_{i,t} *Before_{i,t} + \beta_{2} Treat_{i,t} *Post_{i,t} + W_{i,t} + X_{i,t} + \epsilon_{i,t} } \\ \end{array} $$
(4)

where \( i \) denotes the city and \( t \) denotes the day. The key variables here are \( Treat_{i,t} *Before_{i,t} \) and \( Treat_{i,t} *Post_{i,t} \): the former captures the anticipatory effects of lockdown policies and the latter captures the normal effect of lockdown policies. Table 4 reports the regression results.

Table 4 Anticipatory effects

The coefficient of \( Treat *Before \) is significantly positive, which suggests that air pollution increased by 28% during the period from 3 days before to the official announcement date. We interpret this as the anticipatory effects of leaked information about lockdown policies. Columns (2) to (4) show that this finding is consistent when weather conditions are taken into account and when locked-down cities and unlocked-down cities in 2020 are compared.Footnote 6

4.2 Spillover Effects

Although the lockdown policy issued by the Wuhan government was only effective in Wuhan city, it had repercussions across China. People began to realize the seriousness of COVID-19, and modified their social behaviors. Many people outside Wuhan also tried to spend less time outdoors to reduce the risk of exposure to the virus. Factories and corporations began to reduce their physical interaction with firms or people from Wuhan. All of these behavioral shifts changed local economic and social life, and in turn affected air pollution.

To investigate the spillover effects of the Wuhan lockdown, we include a dummy variable \( PostWuhan_{t} \), which takes the value of 1 after Wuhan city issued its lockdown policy, or after January 23, 2020, for all cities, and 0 otherwise. \( PostWuhan_{t} \) allows us to capture the change in air pollution induced by the Wuhan lockdown outside Wuhan. Specifically, we re-estimate the impact of lockdown policies on air pollution using the following DID specification:

$$ \begin{array}{*{20}c} {ln\left( {AQI} \right)_{i,t} = \alpha_{i} + \beta_{2} Treat_{i,t} *Post_{i,t} + \beta_{2} Treat_{i,t} *PostWuhan_{i} + W_{i,t} + X_{i,t} + \epsilon_{i,t} } \\ \end{array} $$
(5)

where \( i \) denotes the city and \( t \) denotes the day. The key variables here are \( Treat_{i,t} *Post_{i,t} \) and \( Treat_{i,t} *PostWuhan_{i} \), which captures the impact of each city’s lockdown policies and the spillover effect of the Wuhan lockdown policy. Table 5 reports the regression results.

Table 5 Spillover effects

Column (1) and (2) compares the locked-down cities in 2020 to the same cities in 2019 and the locked-down cities to unlocked-down cities in 2020, respectively. For robustness, we also compare the unlocked-down cities in 2020 to the same cities in 2019 in column (3). The coefficients of \( Treat *Post \) are consistent with previous results. These coefficients are significantly negative in all columns, although they are slightly smaller than the coefficients of \( Treat *Post \) in magnitude. Furthermore, column (3) estimates the impact of the Wuhan lockdown policy on cities without any official lockdown policy in 2020 and suggests that those cities also had lower air pollution in 2020 after Wuhan was locked down. These results suggest that the Wuhan lockdown had a significant spillover effect on other cities.Footnote 7

4.3 Connections with Wuhan

In this section, we investigate how cities’ connections with Wuhan affect our findings. As a key element of virus propagation and the target of lockdown policies, the route of transmission considerably impacts people’s risk of exposure to COVID-19. It would also distort the behaviors of people and governments by increasing public panic, and encouraging people to escape from dangerous cities, promoting government intervention. Thus, it should also affect the relationship between lockdown policies and air pollution.

We examine this by adding the variable \( Connections \) to capture the effect of connections to Wuhan. We collect between-city population movements data during January 1 to 15, 2020, from the Baidu Migration data. The Baidu Migration data are based on real-time location records for every smart phone that uses the Baidu Map app or other apps imbedding the Baidu Map Software Development Kit. Baidu currently dominates the search engine market in China for both desktop computers and especially mobile phones (accounting for 79.63% of the mobile phone market in May 2019).Footnote 8 Thus, Baidu’s migration data can precisely reflect population movements. We choose the daily In-Migration Index of Wuhan city, which is the percentage of population inflow from a corresponding city into Wuhan relative to the total population inflow for Wuhan, and calculate the average In-Migration Index of each city to Wuhan. Next, we define \( Connections \) as a dummy variable takes the value of 1 if population inflow from city \( i \) to Wuhan is larger than average, and 0 otherwise, and then estimate the following model:

$$ \begin{array}{*{20}c} {ln\left( {AQI} \right)_{i,t} = \alpha_{i} + \beta_{2} Treat_{i,t} *Post_{i,t} + \beta_{2} Connections_{i} *Treat_{i,t} *Post_{i} + W_{i,t} + X_{i,t} + \epsilon_{i,t} } \\ \end{array} $$
(6)

where \( i \) denotes the city and t denotes the day. The key variable of our DID design here is \( Connections_{i} *Treat_{i,t} *Post_{i,t} \), which captures the impact of a city’s connection to Wuhan and how this connection affect the effect of lockdown. Table 6 reports the regression results.

Table 6 Connections with Wuhan

The coefficient of \( Connections *Treat *Post \) in column (1) is significantly negative, which suggests that air pollution decreased more in cities that had more connections with Wuhan. Columns (2) to (4) show that the impact of connections is consistent when weather conditions are taken into account or when locked-down cities are compared to unlocked-down cities in 2020. Note that the coefficient of Treat*Post in column (3) is insignificantly different from zero. However, when we include weather conditions in estimating the model in column (4), this coefficient becomes significantly negative again. All of these results demonstrate that connections with Wuhan significantly affected the effect of the lockdown policy.

Next, we also investigate whether this kind of connection affects the spillover effect of the Wuhan lockdown. Specifically, we add \( Connection \) to Model (6). Table 7 reports the regression results.

Table 7 Connections and spillover

The first, second, and last two columns of Table 7 compare the locked-down cities in 2020 to the same cities in 2019, locked-down cities to unlocked-down cities in 2020, and unlocked-down cities in 2020 to the same cities in 2019, respectively. We are interested in the coefficient of \( Connections*Treat*PostWuhan \), which is significantly negative. These results suggest that connections with Wuhan also affected the spillover effect of the Wuhan lockdown.

4.4 Heterogeneity

There is substantial heterogeneity in city characteristics across China which could have crucial implications for our findings. We now explore whether heterogeneity in citizens’ willingness to self-isolate, public panic, or government GDP incentives might affect our main results.

First, we capture people’s willingness to self-isolate by the daily Within-City Migration Index of Wuhan city \( Movements \), which is the logarithm of the percentage of the population traveling outdoors to the total population living in the same city. Similar to the daily in-migration index, this index is based on Baidu Migration data. We interact this variable with \( Treat *Post \) and include it in Model (1). Panel A of Table 8 reports the regression results.

Table 8 Heterogeneity

Second, we capture public panic by Internet searches for COVID-19. Specifically, we collect the number of searches for COVID-19 from Baidu Index. First, Internet users commonly use a search engine to collect information, and Baidu continues to be the favorite in China, holding an 80% share of the Chinese search engine market. Second, and more critically, a search is a revealed private behavior. If someone searches for COVID-19, then he or she is undoubtedly feeling confused and threatened by it. Therefore, the aggregate search frequency is a good proxy for public panic. We define \( BaiduSearch \) as the logarithm of the search index and adjust it by the average search during the whole period within the corresponding city, and then interact it with \( Treat *Post \). Panel B of Table 8 reports the regression results.

Third, we capture governments’ GDP incentive by comparing provincial-level GDP growth and the GDP growth target. Following Chen et al. (2020), we hand-collect the GDP data of each province from the websites of the Statistics Communiqué on National Economy, government reports, and the popular press. Then we define \( GDP \) as the difference between actual GDP growth in 2019 and the GDP growth target at the beginning of 2019, and interact it with \( Treat *Post \). Panel C of Table 8 reports the regression results.

The first two columns and second two columns of Table 8 compare the locked-down cities in 2020 to the same cities in 2019 and locked-down cities to unlocked-down cities in 2020, separately. Panel A shows that the coefficients of Movements*Treat*Post are significantly positive, suggesting that within-city population movements considerably decrease the impact of lockdown policies. This demonstrates that the effectiveness of lockdown policies is significantly reduced within cities where fewer people prefer self-isolation. Panel B shows that the coefficients of Baidu Search*Treat*Post are significantly negative, which suggests that public panic would stop people going outside. This in turn would significantly decrease air pollution induced by daily activities and transportation. This demonstrates that the effectiveness of lockdown policies would be significantly increased within cities with a greater sense of public panic. Panel C shows that the coefficients of GDP*Treat*Post are significantly positive, which suggests that governments that failed to achieve their economic target in the previous year are less effective when implementing lockdown policies.

In summary, we find that the effectiveness of lockdown policies is higher in cities that have more people willing to self-isolate, a greater sense of public panic, or a government facing less pressure to stimulate economic growth.

5 Conclusions

The COVID-19 crisis was caused by a viral pandemic that abruptly and severely constricted human mobility and economic activity. To respond to the outbreak, governments worldwide have increasingly implemented strict public health measures, such as social distancing or even complete lockdown.

We contribute to the literature by examining the causal effects of COVID-19 on air quality in the Chinese context. Using the lockdowns of different cities as exogenous shocks, we conduct DID estimations and show that lockdown policies reduced air pollution in a statistically significant manner. We further present three underlying mechanisms driving our findings: anticipatory effects, spillover effects, and the connection with Wuhan. Lastly, our findings are more pronounced in cities that have more people willing to self-isolate, greater public panic, or governments that face less pressure to stimulate economic growth.

Overall, by evaluating the unintended consequences of the COVID-19 pandemic for air quality, this study provides timely policy implications for policymakers. In particular, our results indicate that policy makers should pay more attention to the fairness and timeliness of information disclosure to avoid the anticipatory effects, and take better into account the existing connections between cities, including economic, labor or cultural connections with other areas.