Abstract
The aim is to integrate the processing and visualization of geographic data related to air pollution within a massive data architecture, to optimize processing and response times. Air pollutant data and climatic variables from Quito, Ecuador, measured by the Atmospheric Monitoring Network of the city, were used. After cleaning, Quito Air Pollution Index was calculated. The proposed architecture is open-source and is made up of a cluster with a master node and two worker-nodes. This cluster consists of a unified analysis computational system in Spark, managed by Yarn, and linked to a graphical interface provided by Zeppelin. It processes data and displays it visually through geographic maps. To verify that this architecture improves response times, a comparison was made between using the system and not using it. The geographic interpolation results were 4.52 s with a Geographical Information System, while the proposed system showed an execution time of 2.0 s, indicating a reduction of 56%. This architecture showed an improvement in the traditional interpolation and map visualization processes, and generated a new open-source alternative with resources and time optimization. In addition, this research work could contribute to the making of strategic decisions through a new way of analyzing environmental problems.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Rodríguez, A., Cuvi, N.: Air pollution and environmental justice in Quito, Ecuador. Front. J. Soc. Technol. Environ. Sci. 8(3), 13–46 (2019). https://doi.org/10.21664/2238-8869.2019v8i3.p13-46
World Health Organization (WHO), “Nine out of ten people around the world breathe polluted air.” 2018
World Health Organization (WHO), “Ambient (outdoor) air quality and health.” 2018
World Health Organization (WHO), “Ambient air pollution: A global assessment of exposure and burden of disease,” 2016
Primicias Newspaper, “Quito air exceeds the permitted limits of contamination.” 2019
Isabel, H., i Caralt, J.: Using analytics to support teacher decision-making. In: Actas de las XX JENUI, 2014, vol. 9, no. 11, pp. 83–90 (2014)
Barzaga, O., Vélez, H., Nevárez, J., Arroyo, M.: Information management and decision-making in educational organizations. Rev. Ciencias Soc. XXV(2), 120–130 (2019)
Rodríguez, Y., Pinto, M.: Information use model for strategic decision making in information organizations. Transinformacao 30(1), 51–64 (2018). https://doi.org/10.1590/2318-08892018000100005
Schmidt, D.C., Levine, D.L., Cleeland, C.: Architectures and patterns for developing high-performance, real-time ORB endsystems. Adv. Comput. 48(C), 1–118 (1999). https://doi.org/10.1016/S0065-2458(08)60018-2
Apache Spark, “Spark Overview.”
Ghaffar, A., Rahim, T.: Big data analysis: apache spark perspective. Glob. J. Comput. Sci. Technol. XV(1) (2015)
QGIS, “Spatial Analysis (Interpolation),” Documentation QGIS 2.18.
Ecological Transition and Demographic Challenge Ministry, “Air Quality Index.”
MDMQ Environment Secretary, “Quito Air Quality Index.” 2013
Romero, M., Diego, F., Álvarez, M.: Air pollution: its impact as a health problem. Rev. Cubana Hig. Epidemiol. 44(2) (2006)
Campozano, L., Sanchez, E., Aviles, A., Samaniego, E.: Evaluation of infilling methods for time series of daily precipitation and temperature: the case of the Ecuadorian Andes. Maskana 5(1) (2014)
Apache Spark, “Cluster Mode Overview.”
Buitrago, B.: What’s behind Apache Spark processing?, iWannaBeDataDriven (2020)
Dauphiné, A.: Models of basic structures: points and fields. Geogr. Model. Math. 163–197 (2017). https://doi.org/10.1016/B978-1-78548-225-0.50010-5
Delgado, E.: The map: an important means of support for the teaching of history. Rev. Mex. Investig. Educ. 7(15), 331–356 (2002)
Asgari, M., Farnaghi, M., Ghaemi, Z.: Predictive mapping of urban air pollution using apache spark on a hadoop cluster. In: ACM International Conference Proceeding Series (ICPS), pp. 89–93 (2017). https://doi.org/10.1145/3141128.3141131
U.S. Environmental Protection Agency (EPA), “Volcanoes.” 2021
National Geographic, “Climate change, droughts and floods.” 2022
Yao, Z., Zhang, J., Li, T., Ding, Y.: A trajectory big data storage model incorporating partitioning and spatio-temporal multidimensional hierarchical organization. ISPRS Int. J. Geo-Inf. 11(12) (2022). https://doi.org/10.3390/ijgi11120621
Acknowledgements
We want to thank Francisco Gallegos and Brigitte Balón for their valuable ideas in the graphical interface integration into architecture. Since this article is derived from the postgraduate thesis of one of the authors, a special thanks to Lorena Recalde PhD. For her thesis writing review.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Mora-Villacís, G., Calle-Jimenez, T. (2023). Big Data Architecture for Air Pollution Spatial Visualization: Quito, Ecuador. In: Maldonado-Mahauad, J., Herrera-Tapia, J., Zambrano-Martínez, J.L., Berrezueta, S. (eds) Information and Communication Technologies. TICEC 2023. Communications in Computer and Information Science, vol 1885. Springer, Cham. https://doi.org/10.1007/978-3-031-45438-7_5
Download citation
DOI: https://doi.org/10.1007/978-3-031-45438-7_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-45437-0
Online ISBN: 978-3-031-45438-7
eBook Packages: Computer ScienceComputer Science (R0)