Time Analysis of Air Pollution in a Spanish Region Through k-means

  • Ángel ArroyoEmail author
  • Verónica Tricio
  • Álvaro Herrero
  • Emilio Corchado
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 527)


This study presents the application of clustering techniques to a real-life problem of studying the air quality of the Castilla y León region in Spain. The goal of this work is to analyze the level of air pollution in eight points of this Spanish region between years 2008 and 2015. The analyzed data were provided by eight acquisition stations from the regional network of air quality. The main pollutants recorded at these stations are analyzed in order to study the characterization of such stations, according to a zoning process, and their time evolution. Four cluster evaluation and a clustering technique, with the main distance measures, have been applied to the dataset under analysis.


Clustering K-means Air quality Time evolution 


  1. 1.
    Government of Spain - Aporta Project.
  2. 2.
    Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. (CSUR) 31(3), 264–323 (1999)CrossRefGoogle Scholar
  3. 3.
    Kassomenos, P., Vardoulakis, S., Borge, R., Lumbreras, J., Papaloukas, C., Karakitsios, S.: Comparison of statistical clustering techniques for the classification of modelled atmospheric trajectories. Theoret. Appl. Climatol. 102, 1–12 (2010)CrossRefGoogle Scholar
  4. 4.
    Pires, J.C.M., Sousa, S.I.V., Pereira, M.C., Alvim-Ferraz, M.C.M., Martins, F.G.: Management of air quality monitoring using principal component and cluster analysis—Part I: SO2 and PM10. Atmos. Environ. 42(6), 1249–1260 (2008)CrossRefGoogle Scholar
  5. 5.
    European Commission - Air Quality Standards.
  6. 6.
    Liu, Y., Li, Z., Xiong, H., Gao, X., Wu, J.: Understanding of internal clustering validation measures. In: IEEE International Conference on Data Mining, pp. 911–916 (2010)Google Scholar
  7. 7.
    Jain, A.K.: Data clustering: 50 years beyond K-means. Pattern Recogn. Lett. 31, 651–666 (2010)CrossRefGoogle Scholar
  8. 8.
    Barlow, H.: Unsupervised learning. Neural Comput. 1, 295–311 (1989)CrossRefGoogle Scholar
  9. 9.
    Caliński, T., Harabasz, J.: A dendrite method for cluster analysis. Commun. Stat. Theory Methods 3, 1–27 (1974)MathSciNetCrossRefGoogle Scholar
  10. 10.
    Rousseeuw, P.J.: Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20, 53–65 (1987)CrossRefGoogle Scholar
  11. 11.
    Davies, D.L., Bouldin, D.W.: A cluster separation measure. IEEE Trans. Pattern Anal. Mach. Intell. 1(2), 224–227 (1979)CrossRefGoogle Scholar
  12. 12.
    Tibshirani, R., Walther, G., Hastie, T.: Estimating the number of clusters in a data set via the gap statistic. J. Roy. Stat. Soc.: Ser. B (Stat. Methodol.) 63, 411–423 (2001)MathSciNetCrossRefGoogle Scholar
  13. 13.
    Ding, C., He, X.: K-means clustering via principal component analysis. In: Proceedings of the Twenty-First International Conference on Machine Learning, p. 29 (2004)Google Scholar
  14. 14.
    Danielsson, P.E.: Euclidean distance mapping. Comput. Graph. Image Process. 14, 227–248 (1980)CrossRefGoogle Scholar
  15. 15.
    Government of Castilla y León - Zoning of the territory in Castilla y León.
  16. 16.
    European Union Law - Directive 2008/50/EC of the European Parliament and of the Council of 21 May 2008 on ambient air quality and cleaner air for Europe.
  17. 17.
    Government of Castilla y León - Annual reports of the Air Quality.
  18. 18.
    PubChem - PubChem Compounds.
  19. 19.
    ISO - International Organization for Standardization. PM10/PM2.5.

Copyright information

© Springer International Publishing AG 2017

Open Access This chapter is licensed under the terms of the Creative Commons Attribution-NonCommercial 2.5 International License (, which permits any noncommercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.

The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

Authors and Affiliations

  • Ángel Arroyo
    • 1
    Email author
  • Verónica Tricio
    • 2
  • Álvaro Herrero
    • 1
  • Emilio Corchado
    • 3
  1. 1.Department of Civil EngineeringUniversity of BurgosBurgosSpain
  2. 2.Department of PhysicsUniversity of BurgosBurgosSpain
  3. 3.Departamento de Informática y AutomáticaUniversity of SalamancaSalamancaSpain

Personalised recommendations