Encyclopedia of Big Data Technologies

Living Edition
Editors: Sherif Sakr, Albert Zomaya

Cloud-based SQL Solutions for Big Data

  • Marcin Zukowski
Living reference work entry
DOI: https://doi.org/10.1007/978-3-319-63962-8_318-1


Cloud computing is one of the most important trends in the current software industry. The cloud has unique technical and business characteristics that enable new approaches to software design and new usage models. In this chapter, we present the key characteristics of cloud systems and discuss the opportunities and challenges that traditional database systems face when deployed on this new platform. Using a set of existing systems, we demonstrate various approaches to building cloud-based SQL Big Data solutions.


Cloud computing is possibly the largest shift in computing since the client-server model became popular. In recent years, companies of all sizes have embraced it, often for very different reasons. Architecturally, it introduces many previously unavailable features that give system and application developers significant new opportunities. At the same time, software must be carefully (re)designed to take full advantage of them.

The term “Cloud” tends to be used in...



Copyright information

© Springer International Publishing AG, part of Springer Nature 2018

Authors and Affiliations

  1. Snowflake Computing, San Mateo, USA

Section editors and affiliations

  • Yuanyuan Tian (1)
  • Fatma Özcan (2)
  1. IBM Almaden Research Center, San Jose, United States
  2. IBM Research – Almaden, San Jose, USA