Abstract
Air pollution data are large-scale datasets that can be analyzed in low scales by clustering to recognize the pattern of pollution and have simpler and more comprehensible interpretations. So, this study aims to cluster the days of the year 2017 according to the hourly O3 and PM10 amounts collected from four stations of Tabriz by using spatiotemporal mixture model–based clustering (STMC). Besides, mixture model–based clustering with temporal dimension (TMC) and mixture model–based clustering without considering spatiotemporal dimensions (MC) were utilized to compare with STMC. To evaluate the efficiency of these three models, and obtain the optimal number of clusters in each model, BIC and ICL criteria were used. According to BIC and ICL, STMC outperforms TMC and MC. Three clusters for O3 and four clusters for PM10 were selected as the optimal number of clusters to fit STMC models. Regarding PM10, the average concentration was the highest in cluster 4. Regarding O3, all summer days were in cluster 3, and the average concentration of this cluster was the highest. Cluster 2 had the lowest concentration with a high difference from clusters 1 and 3, and its average temperature was the lowest. Autumn days make up about 84% of this cluster. The clustering of polluted and clean days into separate groups and observing the effect of meteorological factors on the amount of concentration in each cluster clearly prove the efficiency of the model. Results of STMC showed that the efficiency of clustering in air pollution data increases by considering both spatiotemporal dimensions.
Similar content being viewed by others
Data Availability
The datasets analyzed during the current study are available from the corresponding author on reasonable request.
Code Availability
Not applicable.
Change history
10 December 2021
A Correction to this paper has been published: https://doi.org/10.1007/s10666-021-09813-2
References
Cheam, A. S. M., Marbac, M., & McNicholas, P. D. (2017). Model-based clustering for spatiotemporal data on air quality monitoring. Environmetrics, 28(3), e2437. https://doi.org/10.1002/env.2437
Faridi, S., Shamsipour, M., Krzyzanowski, M., Künzli, N., Amini, H., Azimi, F., Malkawi, M., Momeniha, F., Gholampour, A., Hassanvand, M. S., & Naddafi, K. (2018). Long-term trends and health impact of PM2.5 and O3 in Tehran, Iran, 2006–2015. Environmental International, 114, 37–49. https://doi.org/10.1016/j.envint.2018.02.026.
Manju, A., Kalaiselvi, K., Dhananjayan, V., Palanivel, M., Banupriya, G. S., Vidhya, M. H., Panjakumar, K., & Ravichandran, B. (2018). Spatio-seasonal variation in ambient air pollutants and influence of meteorological factors in Coimbatore, Southern India. Air Quality, Atmosphere and Health., 11, 1179–1189. https://doi.org/10.1007/s11869-018-0617-x
Zhang, H., Wang, Y., Hu, J., Ying, Q., & Hu, X. M. (2015). Relationships between meteorological parameters and criteria air pollutants in three megacities in China. Environmental Research., 140, 242–254. https://doi.org/10.1016/j.envres.2015.04.004
Shukla, J. B., Misra, A. K., Sundar, S., & Naresh, R. (2008). Effect of rain on removal of a gaseous pollutant and two different particulate matters from the atmosphere of a city. Mathematical and Computer Modelling., 48, 832–844.
Goyal, S. K., & Rao, C. V. C. (2007). Assessment of atmospheric assimilation potential for industrial development in an urban environment: Kochi (India). Science of the total environment., 376, 27–39.
Owoade, O. K., Olise, F. S., Ogundele, L. T., Fawole, O. G., & Olaniyi, H. B. (2012). Correlation between particulate matter concentrations and meteorological parameters at a site in Ile-Ife, Nigeria. Ife Journal of Science no, 1(14), 83–93.
Dominick, D., Latif, M. T., Juahir, H., Aris, A. Z., & Zain, S. M. (2012). An assessment of influence of meteorological factors on PM sub (10) and NO sub (2) at selected stations in Malaysia. Sustainable Environment Research., 22, 305–315.
Islam, M. M., Afrin, S., Ahmed, T., & Ali, M. A. (2015). Meteorological and seasonal influences in ambient air quality parameters of Dhaka city. Journal of Civil Engineering, 43, 67–77.
Galindo, N., Yubero, E., Nicola, J. F., & Crespo, J. (2015). Chemical characterization of PM1 at a regional background site in the western Mediterranean. Aerosol and Air Quality Research., 16, 530–541.
Number of deaths from air pollution, 1990 to 2017. https://ourworldindata.org/. Accessed 10 November 2021.
Naddafi, K., Hassanvand, M. S., Yunesian, M., Momeniha, F., Nabizadeh, R., Faridi, S., & Gholampour, A. (2012). Health impact assessment of air pollution in megacity of Tehran, Iran. Iranian journal of environmental health science & engineering, 9, 28.
Hassanvand, M. S., Naddafi, K., Faridi, S., Arhami, M., Nabizadeh, R., Sowlat, M. H., Pourpak, Z., Rastkari, N., Momeniha, F., & Kashani, H. (2014). Indoor/outdoor relationships of PM10, PM2. 5, and PM1 mass concentrations and their water-soluble ions in a retirement home and a school dormitory. Atmospheric Environment, 82, 375–382.
Lavecchia, C., Angelino, E., Bedogni, M., Bravetti, E., Gualdi, R., Lanzani, G., Musitelli, A., & Valentini, M. (1996). The ozone patterns in the aerological basin of Milan (Italy). Environmental Software., 11, 73–80.
Saksena, S., Joshi, V., & Patil, R. S. (2003). Cluster analysis of Delhi’s ambient air quality data. Journal of Environmental monitoring., 5, 491–499.
Gramsch, E., Cereceda-Balic, F., Oyola, P., & Von Baer, D. (2006). Examination of pollution trends in Santiago de Chile with cluster analysis of PM10 and ozone data. Atmospheric environment., 40, 5464–5475.
Molinari, N. (2007). Free knot splines for supervised classification. Journal of classification., 24, 221–234.
Gabusi, V., & Volta, M. (2005). A methodology for seasonal photochemical model simulation assessment. International journal of environment and pollution., 24, 11–21.
Morlini, I. (2007). Searching for structure in measurements of air pollutant concentration. Environmetrics: The official journal of the International Environmetrics Society, 18, 823–840.
Fraley, C., & Raftery, A. E. (2002). Model-based clustering, discriminant analysis, and density estimation. Journal of the American statistical Association, 97(458), 611-631.
Vrbik, I., & Mcnicholas, P. D. (2014). Parsimonious skew mixture models for model-based clustering and classification. Computational Statistics & Data Analysis., 71, 196–210.
Murphy, K., & Murphy, T. B. (2017). Parsimonious model-based clustering with covariates. arXiv preprint arXiv:1711.05632.
Asghari, F. B., & Mohammadi, A. A. (2019). The effect of the decreasing level of Urmia Lake on particulate matter trends and attributed health effects in Tabriz, Iran. Microchemical Journal, 104434 https://doi.org/10.1016/j.microc.2019.104434.
Amini Parsa, V., Salehi, E., Yavari, A. R., & van Bodegom, P. M. (2019). Analyzing temporal changes in urban forest structure and the effect on air quality improvement. Sustainable Cities and Society., 48, 101548. https://doi.org/10.1016/j.scs.2019.101548
Barzeghar, V., Sarbakhsh, P., Hassanvand, M. S., Faridi, S., & Gholampour, A. (2020). Long-term trend of ambient air PM10, PM2. 5, and O3 and their health effects in Tabriz city, Iran, during 2006–2017. Sustainable Cities and Society, 54, 101988.
Yicun, G., Khorshiddoust, A. M., Mohammadi, G. H., & Sadr, A. H. (2020). The relationship between PM 2.5 concentrations and atmospheric conditions in severe and persistent urban pollution in Tabriz , Northwest of Iran.
Azarafza, M., & Ghazifard, A. (2016). Urban geology of Tabriz City: environmental and geological constraints. Advances in environmental research. 5, 95–108. https://doi.org/10.12989/aer.2016.5.2.095.
Kalajahi, M.J., Khazini, L., Rashidi, Y., & Heris, S.Z. (2019). Development of reduction scenarios based on urban emission estimation and dispersion of exhaust pollutants from light duty public transport: case of Tabriz, Iran. Emission Control Science and Technology, 1–19.
Barrero, M. A., G. Orza, J., Cabello, M., & Cantón, L. (2015). Categorisation of air quality monitoring stations by evaluation of PM10 variability. The Science of the total environment, 524–525C, 225–236. https://doi.org/10.1016/j.scitotenv.2015.03.138.
Song, C., He, J., Wu, L., Jin, T., Chen, X., Li, R., Ren, P., Zhang, L., & Mao, H. (2017). Health burden attributable to ambient PM2.5 in China. Environmental pollution (Barking, Essex : 1987), 223, 575–586. https://doi.org/10.1016/j.envpol.2017.01.060.
Norazian, M. N., Shukri, Y. A., Azam, R. N., & Al Bakri, A. M. M. (2008). Estimation of missing values in air pollution data using single imputation techniques. ScienceAsia, 34, 341–345 https://doi.org/10.2306/scienceasia1513-1874.2008.34.341.
Schwarz, G. (1978). Estimating the dimension of a model. The annals of statistics., 6, 461–464.
Akaike, H. (1974). A new look at the statistical model identification. IEEE transactions on automatic control, 19(6), 716-723.
Vuorenmaa, J., Augustaitis, A., Beudert, B., Bochenek, W., Clarke, N., de Wit, H. A., Dirnbock, T., Frey, J., Hakola, H., & Kleemola, S. (2018). Long-term changes (1990–2015) in the atmospheric deposition and runoff water chemistry of sulphate, inorganic nitrogen and acidity for forested catchments in Europe in relation to changes in emissions and hydrometeorological conditions. Science of the total environment., 625, 1129–1145.
Cerro, J. C., Cerda, V., & Pey, J. (2015). Trends of air pollution in the Western Mediterranean Basin from a 13-year database: A research considering regional, suburban and urban environments in Mallorca (Balearic Islands). Atmospheric Environment., 103, 138–146.
Ahmed, E., Kim, K.-H., Shon, Z.-H., & Song, S.-K. (2015). Long-term trend of airborne particulate matter in Seoul, Korea from 2004 to 2013. Atmospheric Environment., 101, 125–133.
McLachlan, G., & Peel, D. (2000). Finite mixture models, willey series in probability and statistics.
McNicholas, P. D. (2016). Mixture model-based classification. Chapman and Hall/CRC,
Mcnicholas, P. D. (2016). Model-based clustering., 373, 331–373. https://doi.org/10.1007/s0035
Same, A., Chamroukhi, F., Govaert, G., & Aknin, P. (2011). Model-based clustering and segmentation of time series with changes in regime. Advances in Data Analysis and Classification., 5, 301–321.
Jin, L., Harley, R. A., & Brown, N. J. (2011). Ozone pollution regimes modeled for a summer season in California’s San Joaquin Valley: A cluster analysis. Atmospheric environment., 45, 4707–4718.
Pandey, B., Agrawal, M., & Singh, S. (2014). Assessment of air pollution around coal mining area: Emphasizing on spatial distributions, seasonal variations and heavy metals, using cluster and principal component analysis. Atmospheric pollution research., 5, 79–86.
Huang, P., Zhang, J., Tang, Y., & Liu, L. (2015). Spatial and temporal distribution of PM2. 5 pollution in Xi’an City, China. International journal of environmental research and public health, 12, 6608–6625.
Tian, D., Fan, J., Jin, H., Mao, H., Geng, D., Hou, S., Zhang, P., & Zhang, Y. (2020). Characteristic and spatiotemporal variation of air pollution in Northern China based on correlation analysis and clustering analysis of five air pollutants. Journal of Geophysical Research: Atmospheres, 125, e2019JD031931.
Sicard, P., Serra, R., & Rossello, P. (2016). Spatiotemporal trends in ground-level ozone concentrations and metrics in France over the time period 1999–2012. Environmental research., 149, 122–144.
Zhao, S., Yu, Y., Yin, D., He, J., Liu, N., Qu, J., & Xiao, J. (2016). Annual and diurnal variations of gaseous and particulate pollutants in 31 provincial capital cities based on in situ air quality monitoring data from China National Environmental Monitoring Center. Environment international., 86, 92–106.
Carvalho, V. S. B., Freitas, E. D., Martins, L. D., Martins, J. A., Mazzoli, C. R., & de Fatima Andrade, M. (2015). Air quality status and trends over the Metropolitan Area of Sao Paulo, Brazil as a result of emission control policies. Environmental Science & Policy, 47, 68–79.
Lacressonniere, G., Foret, G., Beekmann, M., Siour, G., Engardt, M., Gauss, M., Watson, L., Andersson, C., Colette, A., & Josse, B. (2016). Impacts of regional climate change on air quality projections and associated uncertainties. Climatic Change., 136, 309–324.
Pawlak, I., & Jarosawski, J. (2015). The influence of selected meteorological parameters on the concentration of surface ozone in the central region of Poland. Atmosphere-Ocean, 53, 126–139.
Jang, E., Do, W., Park, G., Kim, M., & Yoo, E. (2017). Spatial and temporal variation of urban air pollutants and their concentrations in relation to meteorological conditions at four sites in Busan. South Korea. Atmospheric Pollution Research., 8, 89–100.
Giri, D., ADHIKARY, P. R., MURTHY, V. K. (2008). The influence of meteorological conditions on PM10 concentrations in Kathmandu Valley.
Funding
This work was supported by the Tabriz University of Medical Sciences. The Health and Environment Research Center provided financial support.
Author information
Authors and Affiliations
Contributions
Parisa Saeipourdizaj (first author): formulation and evaluation of overarching research goals and aims; setting the data in software package format; application of statistical, computational, and other formal techniques to analyze; application of available software codes; preparation (drafting, reviewing, translating, and revising the paper), and presentation of the manuscript. Saeed Musavi: statistical analysis, manuscript preparation and reviewing the paper. Akbar Gholampour: reviewing the paper. Parvin Sarbakhsh (corresponding author): formulation and evaluation of overarching research goals and aims; statistical analysis; preparation (drafting, reviewing, translating, and revising the paper), and presentation of the manuscript. All the authors have read and approved the final manuscript.
Corresponding author
Ethics declarations
Ethics Approval
This article has been extracted from the thesis submitted for MSc degree in Biostatistics which has been approved by the ethics committee of Tabriz University of Medical Sciences (Ethic number: IR.TBZMED.REC.1398.352).
Competing Interests
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Saeipourdizaj, P., Musavi, S., Gholampour, A. et al. Clustering the Concentrations of PM10 and O3: Application of Spatiotemporal Model–Based Clustering. Environ Model Assess 27, 45–54 (2022). https://doi.org/10.1007/s10666-021-09802-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10666-021-09802-5