Skip to main content

Using machine learning to understand the temporal morphology of the PM2.5 annual cycle in East Asia

Abstract

PM2.5 air pollution is a significant issue for human health all over the world, especially in East Asia. A large number of ground-based measurement sites have been established over the last decade to monitor real-time PM2.5 concentration. However, even this enhanced observational network leaves many gaps in characterizing the PM2.5 spatial distribution. Machine learning provides a variety of algorithms to help deal with these large spatial gaps—combining both remotely sensed and in situ observation data to estimate the global PM2.5 concentration. This study used a PM2.5 data product of six regions from the results of an unsupervised self-organizing map (SOM) with optimized ensemble learning approaches to highlight the most important meteorological and surface variables associated with PM2.5 concentration. These variables were then examined via multiple linear regression models to provide physical mechanistic insights into the morphology of the PM2.5 annual cycles.

This is a preview of subscription content, access via your institution.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

References

  • Bosilovich, M.G., Robertson, F.R., Chen, J. (2011). Global energy and water budgets in merra. Journal of Climate, 24(22), 5721–5739.

    Article  Google Scholar 

  • Breiman, L. (1996a). Bagging predictors. Machine Learning, 24(2), 123–140.

    Google Scholar 

  • Breiman, L. (1996b). Bias, variance, and arcing classifiers.

  • Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32.

    Article  Google Scholar 

  • Brook, R.D., Rajagopalan, S., Pope, C.A., Brook, J.R., Bhatnagar, A., Diez-Roux, A.V., Holguin, F., Hong, Y., Luepker, R.V., Mittleman, M.A., et al. (2010). Particulate matter air pollution and cardiovascular disease an update to the scientific statement from the american heart association. Circulation, 121(21), 2331–2378.

    CAS  Article  Google Scholar 

  • Choi, Y. -S., Ho, C. -H., Chen, D., Noh, Y. -H., Song, C. -K. (2008). Spectral analysis of weekly variation in pm10 mass concentration and meteorological conditions over china. Atmospheric Environment, 42(4), 655–666.

    CAS  Article  Google Scholar 

  • Dan, M., Zhuang, G., Li, X., Tao, H., Zhuang, Y. (2004). The characteristics of carbonaceous species and their sources in pm2. 5 in beijing. Atmospheric Environment, 38(21), 3443–3452.

    CAS  Article  Google Scholar 

  • Dickinson, E., Henderson-Sellers, A., Kennedy, J. (1993). Biosphere-atmosphere transfer scheme (bats) version 1e as coupled to the ncar community climate model.

  • Dietterich, T.G., & et al. (2000). Ensemble methods in machine learning. Multiple Classifier Systems, 1857, 1–15.

    Article  Google Scholar 

  • Dockery, D.W., Pope, C.A., Xu, X., Spengler, J.D., Ware, J.H., Fay, M.E., Ferris, Jr B.G., Speizer, F.E. (1993). An association between air pollution and mortality in six us cities. New England Journal of Medicine, 329(24), 1753–1759.

    CAS  Article  Google Scholar 

  • Dominici, F., Peng, R.D., Bell, M.L., Pham, L., McDermott, A., Zeger, S.L., Samet, J.M. (2006). Fine particulate air pollution and hospital admission for cardiovascular and respiratory diseases. Jama, 295(10), 1127–1134.

    CAS  Article  Google Scholar 

  • Elsom, D., & Chandler, T. (1967). Meteorological Controls upon ground level concentrations of smoke and sulphur dioxide in two urban areas of the united kingdom. Atmospheric Environment, 12(6-7), 1543–1554.

    Article  Google Scholar 

  • Engel-Cox, J.A., Hoff, R.M., Haymet, A. (2004). Recommendations on the use of satellite remote-sensing data for urban air quality. Journal of the Air & Waste Management Association, 54(11), 1360–1371.

    Article  Google Scholar 

  • Forster, P., Ramaswamy, V., Artaxo, P., Berntsen, T., Betts, R., Fahey, D.W., Haywood, J., Lean, J., Lowe, D.C., Myhre, G., et al. (2007). Changes in atmospheric constituents and in radiative forcing. chapter 2. In Climate Change 2007. The Physical Science Basis.

  • Franklin, M., Zeka, A., Schwartz, J. (2007). Association between pm2. 5 and all-cause and specific-cause mortality in 27 us communities. Journal of Exposure Science and Environmental Epidemiology, 17(3), 279–287.

    CAS  Article  Google Scholar 

  • Genuer, R., Poggi, J. -M., Tuleau-Malot, C. (2010). Variable selection using random forests. Pattern Recognition Letters, 31(14), 2225–2236.

    Article  Google Scholar 

  • Green, M.C., Chow, J.C., Watson, J.G., Dick, K., Inouye, D. (2015). Effects of snow cover and atmospheric stability on winter pm2. 5 concentrations in western us valleys. Journal of Applied Meteorology and Climatology, 54(6), 1191–1201.

    Article  Google Scholar 

  • Greenpeace. (2012). Study on premature deaths reveals health impact of pm2.5 in china. http://http://www.greenpeace.org/eastasia/press/releases/climate-energy/2012/air-pollution-health-economic/.

  • Hansen, J., Sato, M., Ruedy, R., et al. (1997). Radiative forcing and climate response. Journal of Geophysical Research-All Series-, 102, 6831–6864.

    CAS  Article  Google Scholar 

  • He, K., Yang, F., Ma, Y., Zhang, Q., Yao, X., Chan, C.K., Cadle, S., Chan, T., Mulawa, P. (2001). The characteristics of pm 2.5 in beijing, china. Atmospheric Environment, 35(29), 4959–4970.

    CAS  Article  Google Scholar 

  • Hien, P.D., Bac, V., Tham, H., Nhan, D., Vinh, L. (2002). Influence of meteorological conditions on pm2. 5 and pm2. 5- 10 concentrations during the monsoon season in hanoi, vietnam. Atmospheric Environment, 36(21), 3473–3484.

    CAS  Article  Google Scholar 

  • Hoff, R.M., & Christopher, S.A. (2009). Remote sensing of particulate pollution from space: have we reached the promised land? Journal of the Air & Waste Management Association, 59(6), 645–675.

    CAS  Article  Google Scholar 

  • Kampa, M., & Castanas, E. (2008). Human health effects of air pollution. Environmental Pollution, 151(2), 362–367.

    CAS  Article  Google Scholar 

  • Katsouyanni, K. (2005). Short-term effects of air pollution on total and cardiovascular mortality. Epidemiology, 16(1), 49–57.

    Article  Google Scholar 

  • Koċak, M., Mihalopoulos, N., Kubilay, N. (2007). Contributions of natural sources to high pm10 and pm2. 5 events in the eastern mediterranean. Atmospheric Environment, 41(18), 3806–3818.

    Article  Google Scholar 

  • Kohonen, T. (1982). Self-organized formation of topologically correct feature maps. Biological cybernetics, 43(1), 59–69.

    Article  Google Scholar 

  • Kohonen, T. (1998). The self-organizing map. Neurocomputing, 21(1), 1–6.

    Article  Google Scholar 

  • Lary, D.J., Faruque, F.S., Malakar, N., Moore, A., Roscoe, B., Adams, Z.L., Eggelston, Y. (2014). Estimating the global abundance of ground level presence of particulate matter (pm2. 5). Geospatial Health, 8(3), 611–630.

    Article  Google Scholar 

  • Lary, D., Lary, T., Sattler, B. (2015). Using machine learning to estimate global pm2. 5 for environmental health studies. Environmental Health Insights, 9(Suppl 1), 41.

    CAS  Google Scholar 

  • Li, H., Chen, R., Meng, X., Zhao, Z., Cai, J., Wang, C., Yang, C., Kan, H. (2015). Short-term exposure to ambient air pollution and coronary heart disease mortality in 8 chinese cities. International Journal of Cardiology, 197, 265–270.

    Article  Google Scholar 

  • Liu, Y., Sarnat, J.A., Kilaru, V., Jacob, D.J., Koutrakis, P. (2005). Estimating ground-level pm2. 5 in the eastern united states using satellite remote sensing. Environmental Science & Technology, 39 (9), 3269–3278.

    CAS  Article  Google Scholar 

  • Lyamani, H., Olmo, F., Alcántara, A., Alados-Arboledas, L. (2006). Atmospheric aerosols during the 2003 heat wave in southeastern Spain i: Spectral optical depth. Atmospheric Environment, 40 (33), 6453–6464.

    CAS  Article  Google Scholar 

  • Melin, F., Zibordi, G., Carlund, T., Holben, B.N., Stefan, S. (2013). Validation of seawifs and modis aqua/terra aerosol products in coastal regions of european marginal seas. Oceanologia, 55(1), 27–51.

    Article  Google Scholar 

  • Ministry of Environmental Protection (MEP), N. D., & Reform Commission (NDRC), M. o. F. (2012). ”twelfth five-year plan” on air pollution prevention and control in key regions. http://http://en.cleanairchina.org/product/6285.html.

  • Niu, G. -Y., Yang, Z. -L., Dickinson, R.E., Gulden, L.E. (2005). A simple topmodel-based runoff parameterization (simtop) for use in global climate models. Journal of Geophysical Research: Atmospheres, 110(D21).

  • Paciorek, C.J., & Liu, Y. (2012). Assessment and statistical modeling of the relationship between remotely sensed aerosol optical depth and pm2. 5 in the eastern united states. Research report (Health Effects Institute), (167), 5–83.

  • Pope, III., C. A., brook, R.D., burnett, R.T., dockery, D.W. (2011). How is cardiovascular disease mortality risk affected by duration and intensity of fine particulate matter exposure? an integration of the epidemiologic evidence. Air Quality, Atmosphere & Health, 4(1), 5–14.

    Article  Google Scholar 

  • Ramaswamy, V., Boucher, O., Haigh, J., Hauglustine, D., Haywood, J., Myhre, G., Nakajima, T., Shi, G.Y., Solomon, S. (2001). Radiative forcing of climate change. In Climate change 2001: the scientific basis, (Vol. 18 pp. 349–416). Cambridge University Press.

  • Remer, L.A., Kleidman, R.G., Levy, R.C., Kaufman, Y.J., Tanré, D., Mattoo, S., Martins, J.V., Ichoku, C., Koren, I., Yu, H., et al. (2008). Global aerosol climatology from the modis satellite sensors. Journal of Geophysical Research: Atmospheres, 113(D14).

  • Rienecker, M.M., Suarez, M.J., Gelaro, R., Todling, R., Bacmeister, J., Liu, E., Bosilovich, M.G., Schubert, S.D., Takacs, L., Kim, G. -K., et al. (2011). Merra: Nasa’s modern-era retrospective analysis for research and applications. Journal of Climate, 24(14), 3624–3648.

    Article  Google Scholar 

  • Schapire, R.E. (2003). The boosting approach to machine learning: An overview. In Nonlinear estimation and classification (pp. 149–171): Springer.

  • Shi, G. -L., Feng, Y. -C., Wu, J. -H., Li, X., Wang, Y. -Q., Xue, Y. -H., Zhu, T. (2009). Source identification of polycyclic aromatic hydrocarbons in urban particulate matter of tangshan, china. Aerosol and Air Quality Resarch, 9(3), 309–315.

    CAS  Article  Google Scholar 

  • Song, Y., Xie, S., Zhang, Y., Zeng, L., Salmon, L.G., Zheng, M. (2006). Source apportionment of pm2. 5 in beijing using principal component analysis/absolute principal component scores and unmix. Science of the Total Environment, 372(1), 278–286.

    CAS  Article  Google Scholar 

  • Stier, P., Seinfeld, J.H., Kinne, S., Boucher, O. (2007). Aerosol absorption and radiative forcing. Atmospheric Chemistry and Physics, 7(19), 5237–5261.

    CAS  Article  Google Scholar 

  • WHO. (2014). 7 million premature deaths annually linked to air pollution. http://www.who.int/mediacentre/news/releases/2014/air-pollution/en/. Accessed: 2016-08-29.

  • Wu, D., Zewdie, G.K., Liu, X., Kneen, M.A., Lary, D.J. (2017). Insights into the morphology of the east asia pm annual cycle provided by machine learning. Environmental Health Insights, 11, 1178630217699611.

    Article  Google Scholar 

  • Zhang, Y., Ma, G., Yu, F., Cao, D. (2013). Health damage assessment due to pm2. 5 exposure during haze pollution events in beijing-tianjin-hebei region in january 2013. Zhonghua yi xue za zhi, 93(34), 2707–2710.

    CAS  Google Scholar 

  • Zheng, M., Salmon, L.G., Schauer, J.J., Zeng, L., Kiang, C., Zhang, Y., Cass, G.R. (2005). Seasonal trends in pm2. 5 source contributions in beijing, china. Atmospheric Environment, 39 (22), 3967–3976.

    CAS  Article  Google Scholar 

  • Zhou, Z.-H., Wu, J., Tang, W. (2002). Ensembling neural networks: many could be better than all. Artificial Intelligence, 137(1-2), 239–263.

    Article  Google Scholar 

Download references

Author information

Affiliations

Authors

Corresponding author

Correspondence to Daji Wu.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article is part of the Topical Collection on Geospatial Technology in Environmental Health Applications

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Wu, D., Lary, D.J., Zewdie, G.K. et al. Using machine learning to understand the temporal morphology of the PM2.5 annual cycle in East Asia. Environ Monit Assess 191, 272 (2019). https://doi.org/10.1007/s10661-019-7424-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10661-019-7424-1

Keywords

  • PM2.5
  • Air pollution
  • Machine learning
  • Ensemble learning
  • Multiple linear regression