Skip to main content

Clustering Air Monitoring Stations According to Background and Ambient Pollution Using Hidden Markov Models and Multidimensional Scaling

  • Conference paper
  • First Online:
Data Science

Abstract

In order to study the cluster of monitoring sites from an urban air quality monitoring network (AQMN) with respect to the background and ambient pollution of key pollutants, a combined methodology is proposed: firstly, to obtain the ambient and background levels of air pollution from every selected pollutant, time series obtained from the AQMN were modeled with hidden Markov models; secondly, to study the grouping of these monitoring sites according to these levels of pollution, both ambient and background pollution, multidimensional scaling (Smacof MDS) was used and the stability of these solutions obtained with a Jacknife procedure (smacof library—R software). Results show that the clustering behaviour of sites is different when studying the ambient from the background pollution. However, sites marked with a distinct pollution contribution could locate them distant from the main cluster of sites as long as they show a marked stability in the MDS solutions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 109.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 139.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Arunachalam, S., Valencia, A., Akita, Y., Serre, M.L., Omary, M., Garcia, V., Isakov, V.: A method for estimating urban background concentrations in support of hybrid air pollution modeling for environmental health studies. Int. J. Environ. Res. Public. Health 11(10), 10518–10536 (2014)

    Article  Google Scholar 

  2. Borg, I., Groenen, P.J.F., Mair, P.: Applied Multidimensional Scaling. Springer Briefs in Statistics. Springer, Berlin (2013)

    Book  Google Scholar 

  3. Cox, T.F., Cox, M.A.A.: Multidimensional Scaling, 2nd edn. Chapman & Hall/CRC, Boca Raton (2001)

    MATH  Google Scholar 

  4. de Leeuw, J., Mair, P.: Multidimensional scaling using majorization: SMACOF in R. J. Stat. Softw. 31(3), 1–30 (2009)

    Article  Google Scholar 

  5. de Leeuw, J., Meulman, J.: A special jacknife for multidimensional scaling. J. Classif. 3, 97–112 (1986)

    Article  Google Scholar 

  6. Gómez-Losada, Á., Pires, J.C.M., Pino-Mejías, R.: Characterization of background air pollution exposure in urban environments using a metric based on Hidden Markov Models. Atmos. Environ. 127, 255–261 (2016)

    Article  Google Scholar 

  7. Harte, D: HiddenMarkov: hidden Markov models, 2010. R package version 1.8-3 (2015)

    Google Scholar 

  8. Himmelmann, L.: HMM: HMM-hidden Markov models. Scientific Software Development, 2010. R package version 1.0. (2010)

    Google Scholar 

  9. IECA (Instituto de Estadística y Cartografía de Andalucía). Anuario estadístico de Andalucía. Consejería de Economía, Innovación, Ciencia y Empleo. Junta de Andalucía, Sevilla (2013)

    Google Scholar 

  10. Pires, J.C.M., Sousa, S.I.V., Pereira, M.C., Alvim-Ferraz, M.C.M., Martins, F.G.: Management of air quality monitoring using principal component and cluster analysis-Part I: SO2 and PM10. Atmos. Environ. 42, 1249–1260 (2008)

    Article  Google Scholar 

  11. Pires, J.C.M., Sousa, S.I.V., Pereira, M.C., Alvim-Ferraz, M.C.M., Martins, F.G.: Management of air quality monitoring using principal component and cluster analysis-Part II: CO, NO2 and O3. Atmos. Environ. 42, 1261–1274 (2008)

    Article  Google Scholar 

  12. R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna (2015). Available from http://www.R-project.org/.Cited26Dic2015

  13. Schwarz, G.: Estimating the dimension of a model. Ann. Stat. 6, 461–464 (1978)

    Article  MathSciNet  MATH  Google Scholar 

  14. SG (Sevilla Global). Barómetro de economía urbana 2013. Agencia Urbana de Desarrollo Integral. Ayuntamiento de Sevilla, Sevilla (2013)

    Google Scholar 

  15. Visser, I., Speekenbrink, M. depmixS4: an R package for Hidden Markov Models. J. Stat. Softw. 36(7), 1–21 (2010)

    Google Scholar 

Download references

Acknowledgements

The author is grateful to the Regional Ministry of Environment and Land Planning of Andalusia for kindly providing the air quality data.

Disclaimer The views expressed are purely those of the author and may not in any circumstances be regarded as stating an official position of the European Commission.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Álvaro Gómez-Losada .

Editor information

Editors and Affiliations

1 Appendix

The aim of this appendix is to show how the values TS Mean (M) and TS SD (SD) are calculated for the TS of Torneo site (Table 2):

$$ \displaystyle\begin{array}{rcl} M& =& \sum _{1}^{3}\pi _{ i}\,m_{i} = (0.41 \cdot 20.4) + (0.47 \cdot 33.7) + (0.12 \cdot 46.7) = 29.8 {}\\ SD& =& \Biggl [\sum _{1}^{3}(m_{ i}^{2} + sd_{ i}^{2})\,\pi _{ i} - M^{2}\Biggl ]^{1/2} {}\\ & =& \Biggl [(20.4^{2} \cdot 4.9^{2}) \cdot 0.41 + (33.7^{2} \cdot 5.3^{2}) \cdot 0.47 + (46.7^{2} \cdot 6.1^{2}) \cdot 0.12 - 29.8^{2}\Biggl ]^{1/2} {}\\ & =& 10.3 {}\\ \end{array} $$

1 Electronic supplementary material

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Gómez-Losada, Á. (2017). Clustering Air Monitoring Stations According to Background and Ambient Pollution Using Hidden Markov Models and Multidimensional Scaling. In: Palumbo, F., Montanari, A., Vichi, M. (eds) Data Science . Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-319-55723-6_10

Download citation

Publish with us

Policies and ethics