Skip to main content

Advertisement

Log in

Can Building Subway Systems Improve Air Quality? New Evidence from Multiple Cities and Machine Learning

  • Published:
Environmental and Resource Economics Aims and scope Submit manuscript

Abstract

Public investments in subway systems are often motivated by improving local air quality. Recent studies, however, have reached different conclusions on the air quality benefits of subway investment. To reconcile these findings, this paper examines the air quality effects of all 359 subway line openings in China between 2013 and 2018. The machine learning method adopted in this paper substantially improves the consistency and precision of the estimates by purging seasonality, volatility, and the nonlinear effects of meteorological conditions in air quality data. The empirical results suggest an insignificant short-term effect and a significant long-term effect, which is expected as the adjustment of commuting mode takes time. Using the causal forest approach, the heterogeneity analysis find that a city that is experiencing rapid economic growth from a lower income level and currently has fewer subway lines is more likely to experience statistically significant improvements in air quality from a subway opening. These findings help reconcile the different findings in the literature and shed light on air pollution reduction as one of the objectives of public transit investment.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Notes

  1. We report the results on other pollutants (e.g., SO2, PM10, and ozone) in the Appendix, as the literature (e.g., Cole et al. 2020; Li et al. 2019) suggests that SO2 is predominately caused by industrial emissions, and the formation of ozone is a very complicated process, in which the contribution of the emissions from commuting is not clear.

  2. Source: www.urbanrail.net.

  3. The calculation varies across countries. For example, when the concentration of PM2.5 is 35 µg per cubic meter, China computes the index as 50, whereas the US computes the index as 100. For more information on the calculation, refer to https://www.cnblogs.com/tiandi/p/6158576.html.

  4. Source: Technical Regulation on Ambient Air Quality Index (GB3095-2012).

  5. We compare the standard deviations for stations reporting hourly data and those with interpolation. For example, the standard deviation of pressure reported by stations with hourly data is 104.04, which is 1.02% of the mean. For stations with interpolations, the standard deviation is 102.97, which is 1.01% of the mean.

  6. Grange et al. (2018) included relative humidity rather than dew point, but our meteorological dataset does not have relative humidity. Given that the dew point is a function of relative humidity and temperature, and our data include temperature, in the context of the random forest algorithm, relative humidity and dew point contain the same information.

  7. If a city has more than one weather station, we keep the meteorological data of all weather stations for each single air pollution monitor in the same city. This increases the meteorological information available for machine learning, which improves training performance.

  8. The number of variables that can be split at each node equals the (rounded down) square root of the number of all predictors. The minimum size of the leaf nodes is five. Grange et al. (2018) suggested that the choice of these hyperparameters does not significantly affect the training results, and the training results in this paper support this argument.

  9. The anti-fascist parade is a celebration of the 70th anniversary of the end of World War II.

  10. For example, coal-based heavy industry is one of the main industries of Shanxi province in China, and burning coal is a major source of local air pollution (Song et al. 2021). For such cities, the effect of subway opening is not necessarily greater than in other cities with low initial pollution levels.

  11. Athey and Wager (2019) further improved the causal forest approach by casting forests as adaptive locally weighted estimators rather than simply averaging the treatment effect of each tree. They also slightly adjusted the splitting rule to reduce the calculation burden.

  12. Note that in the short window of each opening, the exact opening date is assigned arbitrarily. Thus, the treatment is orthogonal to the covariates, so the unconfoundedness assumption holds.

  13. This correlation reflects the average treatment effect because the causal forest can be viewed as a residual-on-residual regression that regresses \(predicted AQI-\widehat{predicted AQI}\) on \(open-\widehat{open}\).

  14. We focus on PM2.5 when evaluating the economic benefit, as the literature documents the effect of PM2.5 on health and most major urban areas in China have high levels of PM2.5.

  15. According to the official document (document number: 000013039–2013-00176) released in 2013, the investment in a subway line in Wuhan is 2.0 billion dollars, and there are 12 stations. http://zfxxgk.ndrc.gov.cn/web/iteminfo.jsp?id=12397

References

Download references

Acknowledgements

Financial support from National Science Foundation of China (No. 72103220) and the Environment for Development (EfD) Initiative at the University of Gothenburg are gratefully acknowledged.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Haosheng Yan.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Consent for Publication

The manuscript is approved by all authors for publication. The work is original research that has not been published previously, nor that is under consideration for publication elsewhere.

Ethical Approval

All the authors listed have approved the manuscript that is enclosed.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Joshua Linn is a Senior Fellow at resources for the future.

Appendices

Appendix A

The Fig.

Fig. 6
figure 6

Policy Interventions and Predicted AQI. Panel a Predicted AQI in Beijing. Panel b Predicted AQI in Shanghai

6 shows the predicted AQI in Beijing and Shanghai as well as policy interventions, indicated by the vertical lines. For Beijing, the policies are,

  1. (1)

    Implementation of Beijing Air Pollution Prevention and Control Regulations. (2014/1/22)

  2. (2)

    APEC Leaders' 8th Informal Meeting (2014/11/3).

  3. (3)

    Anti-fascist Parades (2015/9/3).

  4. (4)

    Implementation of Beijing Stage 6 "Gasoline for Vehicles" (DB11/238-2016) and "Diesel for Vehicles" (DB11/239-2016) standards (2017/1/1).

  5. (5)

    Intensive inspection of the Blue-sky Defense Campaign (2018/6/11).

For Shanghai, the policies are:

  1. (1)

    Implementation of Emergency Plan for Heavy Air Quality Pollution (2013/12/1).

  2. (2)

    Prohibiting vehicles according to the National II standard (GB 18352.2) (2016/1/1).

  3. (3)

    Regional Air Pollution Prevention and Control Cooperation Group meeting held (2014/4/21).

  4. (4)

    National Environmental Protection Inspection (2016/11/28).

  5. (5)

    National Environmental Protection Inspection Review Works (huitoukan) (2018/5/31).

As an example, the first red line in Panel (a) refers to January 22, 2014, which is the date of the implementation of Beijing air pollution prevention and control regulations. The regulations set standards for air quality, as well as emission control standards and industry governance measures. These measures tend to reduce air pollution. The blue line in Panel (a) shows a rapid decline in AQI after that day. The other events reported in Fig. 6 are similar to the example. The figure indicates that air quality improved during these events and that the magnitude of the improvements varied across events. The air quality improvements that we detect help validate the model and suggest that the predicted air pollution data is sensitive to non-weather shocks (Fig. 

Fig. 7
figure 7

CATEs of Subway Opening on PM2.5 by the Minimum Distance between Subway and Air Pollution Monitor

7).

This figure shows how the conditional average treatment effects (CATEs) vary with the minimum distance between the subway and air pollution monitor. CATEs are estimated using causal forest algorithms (Tables 

Table 9 New subway lines opened between 2013 and 2018 in Mainland China

9,

Table 10 Effects of subway opening on other air pollutants

10,

Table 11 Estimation of the effects of subway openings using alternative time trends

11,

Table 12 Subway lines whose opening was delayed due to exogenous factors

12,

Table 13 Factors mainly driving the splitting of causal forest

13 and

Table 14 Conditional average treatment effects group of interest

14). The outcome variable was predicted PM2.5, and the covariates are those listed in Fig. 4.

Appendix B

2.1 An Illustrative Example of Predicting the AQI

To predict the counterfactual Air Quality Index (AQI), we train a random forest model to fit air pollution with predictors, where \(\widehat{AQI}=f(meteorological, temporal,human)\).

  1. 1.

    To calculate the expectation, we randomly assign meteorological and temporal factors and conduct counterfactual predictions using the trained model, obtaining the counterfactual AQI for each random assignment, denoted as \({\widehat{AQI}}_{random1}^{Jan 1st 2010}=f({meteorological}_{random1},{ temporal}_{random1},{human}^{Jan 1st 2010})\)

  2. 2.

    Repeating the second step 1000 times, we obtain \({\widehat{AQI}}_{random300}^{Jan 1st 2010}=f({meteorological}_{random300},{ temporal}_{random300},{human}^{Jan 1st 2010})\). We then consider the average of the predicted outcomes, \({\overline{\widehat{AQI}} }^{Jan 1st 2010}\) as the predicted AQI. This predicted AQI is the counterfactual AQI if the meteorological and temporal factors are typical on that day.

Appendix C

3.1 Methods of Calculating Standard Errors

In the two-stage method we employ, uncertainty arises from both the prediction and regression steps, and currently, there is no established guideline for calculating standard errors in this context.

In the main text, we utilize the bootstrap method to calculate standard errors, considering only the uncertainty from the second step. Theoretically, considering the uncertainty from the first step could lead to a larger standard error. Since our baseline result indicates an insignificant effect, this conclusion remains robust even with a potentially larger standard error.

To further affirm the robustness, following the approach suggested by Burlig et al. (2020), we attempt to capture uncertainty from both steps by employing a bootstrap method. Unfortunately, we find that performing the bootstrap is not feasible. As an alternative, we opt to conduct the robustness check using the residual approach (Burlig et al. 2020).

We employ the following procedure: First, for each air quality monitor, we sample N observations with replacement (where N equals the original number of observations). We then train the random forest model using bootstrapped data and obtain the residual (\(y-\widehat{y}\)). We repeat these steps 20 times. Next, we should sample the data from step one with replacement, generate multiple subsamples, estimate treatment effects using these subsamples, and calculate the standard deviation as the bootstrapped standard error. However, the data obtained from step one is substantial, estimated at approximately 400 GB (20 GB for the baseline dataset * sampled 20 times), exceeding the computer’s available memory for processing. Therefore, we opt for an approximate estimation. In the approximation, we exclusively employ the Air Quality Index (AQI) as the dependent variable, disregarding other indices of pollutants and restricting our analysis to the subsample representing the first occurrence of a subway opening in a city. To ensure comparability, we also apply similar approximations to the cluster regressions used for comparison. The estimates and standard errors may deviate from the baseline, primarily because we use subsamples instead of the full sample, leading to a reduction in the number of observations.

As demonstrated in Table 

Table 15 Effects of subway openings on AQI (bootstrap)

15 we observe that the standard errors, accounting for prediction errors from the first step, are quantitatively similar (albeit slightly larger) to the standard errors that only include the uncertainty from the second step. Given our assertion of an insignificant effect in the baseline estimation, a slightly larger standard error further reinforces the robustness of our baseline results.

Considering both the theoretical analysis and the bootstrap results of the residual approach, we believe that the issue of uncertainty from the first step may not pose a significant threat to our baseline results.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xie, L., Zou, T., Linn, J. et al. Can Building Subway Systems Improve Air Quality? New Evidence from Multiple Cities and Machine Learning. Environ Resource Econ 87, 1009–1044 (2024). https://doi.org/10.1007/s10640-024-00852-3

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10640-024-00852-3

Keywords

JEL Classification

Navigation