Skip to main content

Advertisement

Log in

Modeling and predicting city-level CO2 emissions using open access data and machine learning

  • Research Article
  • Published:
Environmental Science and Pollution Research Aims and scope Submit manuscript

Abstract

Globally, urban has been the major contributor to greenhouse gas (GHG) emissions and thus plays an increasingly important role in its efforts to reduce CO2 emissions. However, quantifying city-level CO2 emissions is generally a difficult task due to lacking or lower quality of energy-related statistics data, especially for some underdeveloped areas. To address this issue, this study used a set of open access data and machine learning methods to estimate and predict city-level CO2 emissions across China. Two feature selection technologies including Recursive Feature Elimination and Boruta were used to extract the important critical variables and input parameters for modeling CO2 emissions. Finally, 18 out of 31 predictor variables were selected to establish prediction models of CO2 emissions. We found that the statistical indicators of urban environment pollution (such as industrial SO2 and dust emissions per capita) are the most important variables for predicting the city-level CO2 emissions in China. The XGBoost models obtained the highest estimation accuracy with R2 > 0.98 and lower relative error (about 0.8%) than other methods. The CO2 emissions predictive accuracy can be improved modestly by combing geospatial and meteorological interpolation predictor variables (e.g., DEM, annual average precipitation, and air temperature). We also observed an S-shape relationship between urban CO2 emissions per capita and urban economic growth when the rest variables were held constant, rather than a U-shaped one. The findings presented herein provide a first proof of concept that easily available socioeconomic statistical records and geospatial data at urban areas have the potential to accurately predict city-level CO2 emissions with the aid of machine learning algorithms. Our approach can be used to generate carbon footprint maps frequently for the undeveloped regions with scarce detailed energy-related statistical data, to assist policy-makers in designing specific measures of reducing and allocating carbon emissions reduction goal.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Data availability

Data are available from the authors upon request.

References

Download references

Acknowledgments

The authors extend their appreciation to the reviewers who gave constructive comments that helped bring considerable improvements to a previous version of the manuscript.

Funding

This work was sponsored by K.C. Wong Magna Fund in Ningbo University and Philosophical and Social Science Planning Foundation of Zhejiang Province (20NDJC077YB) and National Natural Science Foundation of China (41571018 and 41871024).

Author information

Authors and Affiliations

Authors

Contributions

Ying Li and Yanwei Sun designed this study; Yanwei Sun revised the paper. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Yanwei Sun.

Ethics declarations

Competing interests

The authors declare that they have no conflict of interest.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Consent to participate

Not applicable.

Consent to publish

All authors read and approved the final manuscript.

Additional information

Responsible Editor: Marcus Schulz

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, Y., Sun, Y. Modeling and predicting city-level CO2 emissions using open access data and machine learning. Environ Sci Pollut Res 28, 19260–19271 (2021). https://doi.org/10.1007/s11356-020-12294-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11356-020-12294-7

Keywords

Navigation