SPIN: Concurrent Workload Scaling over Data Warehouses

Costa, João Pedro; Furtado, Pedro

doi:10.1007/978-3-642-40131-2_6

SPIN: Concurrent Workload Scaling over Data Warehouses

João Pedro Costa¹⁸ &
Pedro Furtado¹⁹

Conference paper

1327 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8057))

Abstract

Increasingly, data warehouse (DW) analyses are being used not only for strategic business decisions but also as valuable tools in operational daily decisions. As a consequence, a large number of queries are concurrently submitted, stressing the database engine ability to handle such query workloads without severely degrading query response times. The query-at-time model of common database engines, where each query is independently executed and competes for the same resources, is inefficient for handling large DWs and does not provide the expected performance and scalability when processing large numbers of concurrent queries. However, the query workload, which is mainly composed of aggregation star queries, frequently has to process similar data and perform similar computations. While materialized views can help in this matter, their usefulness is limited to queries and query patterns that are known in advance. The reviewed proposals on data and processing sharing suffer from memory limitations, reduced scalability and unpredictable execution times when applied to large DWs, particularly those with large dimensions. We present SPIN, a data and processing sharing model that delivers predictable execution times to aggregated star-queries even in the presence of large concurrent query loads, without the memory and scalability limitations of existing approaches. We used the TPC-H benchmark to experimentally evaluate SPIN.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Candea, G., Polyzotis, N., Vingralek, R.: A scalable, predictable join operator for highly concurrent data warehouses. Proc. VLDB Endow. 2, 277–288 (2009)
Article Google Scholar
Candea, G., Polyzotis, N., Vingralek, R.: Predictable performance and high query concurrency for data analytics. The VLDB Journal 20(2), 227–248 (2011)
Article Google Scholar
Zukowski, M., Héman, S., Nes, N., Boncz, P.: Cooperative Scans: Dynamic Bandwidth Sharing in a DBMS. In: Proceedings of the 33rd International Conference on Very Large Data Bases, pp. 723–734 (2007)
Google Scholar
Harizopoulos, S., Shkapenyuk, V., Ailamaki, A.: QPipe: A Simultaneously Pipelined Relational Query Engine. In: Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data, pp. 383–394 (2005)
Google Scholar
Costa, J.P., Cecílio, J., Martins, P., Furtado, P.: ONE: a predictable and scalable DW model. In: Proceedings of the 13th International Conference on Data Warehousing and Knowledge Discovery, Toulouse, France, pp. 1–13 (2011)
Google Scholar
Costa, J.P., Martins, P., Cecílio, J., Furtado, P.: A Predictable Storage Model for Scalable Parallel DW. In: 15th International Database Engineering and Applications Symposium (IDEAS 2011), Lisbon, Portugal (2011)
Google Scholar
PostgreSQL, http://www.postgresql.org/
TPC-H, TPC-H Benchmark (April 13, 2012), http://www.tpc.org/tpch/

Download references

Author information

Authors and Affiliations

ISEC-Institute Polytechnic of Coimbra, Portugal
João Pedro Costa
University of Coimbra, Portugal
Pedro Furtado

Authors

João Pedro Costa
View author publications
You can also search for this author in PubMed Google Scholar
Pedro Furtado
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

LIAS/ISAE-ENSMA, Téléport 2, 1 avenue Clément Ader, BP 40109, 86961, Futuroscope Chasseneuil Cedex, France
Ladjel Bellatreche
IBM India Research Lab, Block No. 1, IIT, Hauz Khas, 110016, New Delhi, India
Mukesh K. Mohania

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Costa, J.P., Furtado, P. (2013). SPIN: Concurrent Workload Scaling over Data Warehouses. In: Bellatreche, L., Mohania, M.K. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2013. Lecture Notes in Computer Science, vol 8057. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40131-2_6

Download citation

DOI: https://doi.org/10.1007/978-3-642-40131-2_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40130-5
Online ISBN: 978-3-642-40131-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics