Skip to main content

SQL or NoSQL? Which Is the Best Choice for Storing Big Spatio-Temporal Climate Data?

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11158))

Abstract

Management of big spatio-temporal data such as the results from large scale global climate models has long been a challenge because of the sheer vastness of the dataset. Although different data management systems like that incorporate a relational database management system have been proposed and widely used in prior studies, solutions that are particularly designed for big spatio-temporal data management have not been studied well. In this paper, we propose a general data management platform for high-dimensional spatio-temporal datasets like those found in the climate domain, where different database systems can be applied. Through this platform, we compare and evaluate several database systems including SQL database and NoSQL database from various aspects and explore the key impact factors for system performance. Our experimental results indicate advantages and disadvantages of each database system and give insight into the best system to use for big spatio-temporal data applications. Our analysis provides important insights into the understanding of performance of different data management systems, which is very useful for designing high dimensional big data applications.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Unidata: NetCDF. http://www.unidata.ucar.edu/software/netcdf/

  2. Apache: Hadoop (2011). http://hadoop.apache.org/

  3. MongoDB: Mongodb. http://www.mongodb.org/

  4. Cuzzocrea, A., Moussa, R.: A cloud-based framework for supporting effective and efficient OLAP in big data environments. In: IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, pp. 680–684 (2014)

    Google Scholar 

  5. Brezany, P., Yan, Z., Janciak, I., Chen, P., Ye, S.: An elastic OLAP cloud platform. In: IEEE Ninth International Conference on Dependable, Autonomic and Secure Computing, pp. 356–363 (2011)

    Google Scholar 

  6. Singh, H., Bawa, S.: A survey of traditional and MapReducebased spatial query processing approaches. ACM SIGMOD Rec. 46(2), 18–29 (2017)

    Article  Google Scholar 

  7. Chiang, G.T., Dove, M.T., Bovolo, C.I., Ewen, J.: Implementing a grid/cloud eScience infrastructure for hydrological sciences. In: Yang, X., Wang, L., Jie, W. (eds.) Guide to e-Science. CCN, pp. 3–28. Springer, London (2011). https://doi.org/10.1007/978-0-85729-439-5_1

    Chapter  Google Scholar 

  8. Ameri, P., Grabowski, U., Meyer, J., Streit, A.: On the application and performance of MongoDB for climate satellite data. In: IEEE International Conference on Trust, Security and Privacy in Computing and Communications, pp. 652–659 (2014)

    Google Scholar 

  9. Jern, M., Franzen, J.: “GeoAnalytics” - exploring spatio-temporal and multivariate data. In: Tenth International Conference on Information Visualization, pp. 25–31 (2006)

    Google Scholar 

  10. Lian, J., Mcguire, M.P., Moore, T.W.: FunnelCloud: a cloud-based system for exploring tornado events. Int. J. Digit. Earth 10, 1–25 (2017)

    Article  Google Scholar 

  11. Baumann, P.: Management of multidimensional discrete data. VLDB J. 3, 401–444 (1994)

    Article  Google Scholar 

  12. Baumann, P., Dehmel, A., Furtado, P., Ritsch, R., Widmann, N.: The multidimensional database system RasDaMan. In: ACM SIGMOD International Conference on Management of Data, pp. 575–577 (1998)

    Article  Google Scholar 

  13. Bimonte, S., Zaamoune, M., Beaune, P.: Conceptual design and implementation of spatial data warehouses integrating regular grids of points. Int. J. Digit. Earth 10, 1–22 (2017)

    Article  Google Scholar 

  14. Tang, W., Feng, W.: Parallel map projection of vector-based big spatial data: coupling cloud computing with graphics processing units. Comput. Environ. Urban Syst. 61(11), 187–197 (2014)

    Google Scholar 

  15. Arndt, D.S., et al.: State of the climate in 2011 special supplement to the bulletin of the American meteorological society. Bull. Am. Meteorol. Soc. 93(7), S1–S263 (2012)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jie Lian .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Lian, J., Miao, S., McGuire, M., Tang, Z. (2018). SQL or NoSQL? Which Is the Best Choice for Storing Big Spatio-Temporal Climate Data?. In: Woo, C., Lu, J., Li, Z., Ling, T., Li, G., Lee, M. (eds) Advances in Conceptual Modeling. ER 2018. Lecture Notes in Computer Science(), vol 11158. Springer, Cham. https://doi.org/10.1007/978-3-030-01391-2_32

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-01391-2_32

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-01390-5

  • Online ISBN: 978-3-030-01391-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics