Storing and Clustering Large Spatial Datasets Using Big Data Technologies

  • Alejandro Cortiñas
  • Miguel R. Luaces
  • Tirso V. Rodeiro
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10819)


In this paper we present the architecture of a system to store, query and visualize on the web large datasets of geographic information. The architecture includes a component to simulate a large number of drivers that report their position on a regular basis, an ingestion component that is generic and can acommodate three different storage technologies, a query component that aggregates the results in order to reduce the query time and the data transfered, and a web-based map viewer. In addition, we define an evaluation methodology to be used to benchmark and compare different alternatives for some components of the system, and we validate the architecture with experiments using a dataset of 40 million locations of drivers.


Spatial big data Web-based GIS Software architectures 


  1. 1.
    car2go Iberia S.L.: car2go Madrid website (2017). Consulted 29 Dec 2017
  2. 2.
    Chodorow, K.: MongoDB: The Definitive Guide. O’Reilly Media Inc., Sebastopol (2013)Google Scholar
  3. 3.
    Creelman, D.: Top Trends in Workforce Management: How Technology Provides Significant Value Managing Your People (2014). Consulted 29 Dec 2017
  4. 4.
    Crickard, P.: Leaflet.Js Essentials. Packt Publishing, Birmingham (2014)Google Scholar
  5. 5.
    Eldawy, A.: Spatialhadoop: towards flexible and scalable spatial processing using mapreduce. In: Proceedings of the 2014 SIGMOD PhD Symposium, SIGMOD 2014 PhD Symposium, pp. 46–50. ACM, New York (2014).
  6. 6.
    Henderson, C.: Mastering GeoServer. Packt Publ., Birmingham (2014)Google Scholar
  7. 7.
    Obe, R.O., Hsu, L.S.: PostgreSQL: Up and Running a Practical Introduction to the Advanced Open Source Database, 2nd edn. O’Reilly Media Inc., Sebastopol (2014)Google Scholar
  8. 8.
    Yang, F., Tschetter, E., Léauté, X., Ray, N., Merlino, G., Ganguli, D.: Druid: a real-time analytical data store. In: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, SIGMOD 2014, pp. 157–168. ACM, New York (2014).

Copyright information

© Springer International Publishing AG, part of Springer Nature 2018

Authors and Affiliations

  1. 1.Laboratorio de Bases de DatosUniversidade da CoruñaA CoruñaSpain

Personalised recommendations