Scaling Up Mixed Workloads: A Battle of Data Freshness, Flexibility, and Scheduling

Psaroudakis, Iraklis; Wolf, Florian; May, Norman; Neumann, Thomas; Böhm, Alexander; Ailamaki, Anastasia; Sattler, Kai-Uwe

doi:10.1007/978-3-319-15350-6_7

Scaling Up Mixed Workloads: A Battle of Data Freshness, Flexibility, and Scheduling

Iraklis Psaroudakis^15,17,
Florian Wolf^15,18,
Norman May¹⁵,
Thomas Neumann¹⁶,
Alexander Böhm¹⁵,
Anastasia Ailamaki¹⁷ &
…
Kai-Uwe Sattler¹⁸

Conference paper
First Online: 01 January 2015

1838 Accesses
5 Citations

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 8904))

Abstract

The common “one size does not fit all” paradigm isolates transactional and analytical workloads into separate, specialized database systems. Operational data is periodically replicated to a data warehouse for analytics. Competitiveness of enterprises today, however, depends on real-time reporting on operational data, necessitating an integration of transactional and analytical processing in a single database system. The mixed workload should be able to query and modify common data in a shared schema. The database needs to provide performance guarantees for transactional workloads, and, at the same time, efficiently evaluate complex analytical queries. In this paper, we share our analysis of the performance of two main-memory databases that support mixed workloads, SAP HANA and HyPer, while evaluating the mixed workload CH-benCHmark. By examining their similarities and differences, we identify the factors that affect performance while scaling the number of concurrent transactional and analytical clients. The three main factors are (a) data freshness, i.e., how recent is the data processed by analytical queries, (b) flexibility, i.e., restricting transactional features in order to increase optimization choices and enhance performance, and (c) scheduling, i.e., how the mixed workload utilizes resources. Specifically for scheduling, we show that the absence of workload management under cases of high concurrency leads to analytical workloads overwhelming the system and severely hurting the performance of transactional workloads.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 34.99; Price excludes VAT (USA)

Softcover Book: USD 44.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

1.
Available online at: http://www3.in.tum.de/research/projects/CHbenCHmark/.

References

Sap, HANA Live for SAP Business Suite (2014). http://help.sap.com/hba
Transaction processing performance council (2014). http://www.tpc.org
Alagiannis, I., Idreos, S., Ailamaki, A.: H2O: a hands-free adaptive store. In: SIGMOD (2014)
Google Scholar
Cao, T., Salles, M.A.V., Sowell, B., Yue, Y., Demers, A.J., Gehrke, J., White, W.M.: Fast checkpoint recovery algorithms for frequently consistent applications. In: SIGMOD (2011)
Google Scholar
Cole, R., Funke, F., Giakoumakis, L., Guy, W., Kemper, A., Krompass, S., Kuno, H.A., Nambiar, R.O., Neumann, T., Poess, M., Sattler, K.U., Seibold, M., Simon, E., Waas, F.: The mixed workload CH-benCHmark. In: DBTest (2011)
Google Scholar
Difallah, D.E., Pavlo, A., Curino, C., Cudré-Mauroux, P.: OLTP-Bench: an extensible testbed for benchmarking relational databases. PVLDB 7(4), 53–63 (2014)
Google Scholar
Färber, F., May, N., Lehner, W., Große, P., Müller, I., Rauhe, H., Dees, J.: The SAP HANA database - an architecture overview. IEEE Data Eng. Bull. 35(1), 28–33 (2012)
Google Scholar
Florescu, D., Kossmann, D.: Rethinking cost and performance of database systems. SIGMOD Rec. 38(1), 43–48 (2009)
Article Google Scholar
Grund, M., Krüger, J., Plattner, H., Zeier, A., Cudre-Mauroux, P., Madden, S.: HYRISE: a main memory hybrid storage engine. PVLDB 4(2), 105–116 (2010)
Google Scholar
Kemper, A., Neumann, T.: HyPer: a hybrid OLTP&OLAP main memory database system based on virtual memory snapshots. In: ICDE (2011)
Google Scholar
Lee, J., Kwon, Y.S., Färber, F., Muehle, M., Lee, C., Bensberg, C., Lee, J.Y., Lee, A.H., Lehner, W.: SAP HANA distributed in-memory database system: Transaction, session, and metadata management. In: ICDE (2013)
Google Scholar
Leis, V., Boncz, P., Kemper, A., Neumann, T.: Morsel-driven parallelism: A NUMA-aware query evaluation framework for the many-core age. In: SIGMOD (2014, to appear)
Google Scholar
Leis, V., Kemper, A., Neumann, T.: Exploiting hardware transactional memory in main-memory databases. In: ICDE (2014)
Google Scholar
Neumann, T.: Efficiently compiling efficient query plans for modern hardware. In: VLDB (2011)
Google Scholar
Nguyen, T.M., Schiefer, J., Tjoa, A.M.: Sense & response service architecture (saresa): an approach towards a real-time business intelligence solution and its use for a fraud detection application. In: Proceedings of the 8th ACM International Workshop on Data Warehousing and OLAP (2005)
Google Scholar
Olofson, C., Morris, H.: Blending transactions and analytics in a single in-memory platform: key to the real-time enterprise. Techical report, IDC, February 2013. http://www.saphana.com/docs/DOC-4132
Plattner, H.: A common database approach for OLTP and OLAP using an in-memory column database. In: SIGMOD (2009)
Google Scholar
Psaroudakis, I., Scheuer, T., May, N., Ailamaki, A.: Task scheduling for highly concurrent analytical and transactional main-memory workloads. In: ADMS (2013)
Google Scholar
Raman, V., Attaluri, G., Barber, R., Chainani, N., Kalmuk, D., Samy, V.K., Leenstra, J., Lightstone, S., Liu, S., Lohman, G.M., Malkemus, T., Mueller, R., Pandis, I., Schiefer, B., Sharpe, D., Sidle, R., Storm, A., Zhang, L.: DB2 with BLU acceleration: So much more than just a column store. In: VLDB (2013)
Google Scholar
Stonebraker, M., Cetintemel, U.: “One Size Fits All": an idea whose time has come and gone. In: ICDE (2005)
Google Scholar
Stonebraker, M., Weisberg, A.: The VoltDB main memory DBMS. IEEE Data Eng. Bull. 36(2), 21–27 (2013)
Google Scholar

Download references

Author information

Authors and Affiliations

SAP SE, 69190, Walldorf, Germany
Iraklis Psaroudakis, Florian Wolf, Norman May & Alexander Böhm
TU Munich, Munich, Germany
Thomas Neumann
EPFL, Lausanne, Switzerland
Iraklis Psaroudakis & Anastasia Ailamaki
TU Ilmenau, Ilmenau, Germany
Florian Wolf & Kai-Uwe Sattler

Authors

Iraklis Psaroudakis
View author publications
You can also search for this author in PubMed Google Scholar
Florian Wolf
View author publications
You can also search for this author in PubMed Google Scholar
Norman May
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Neumann
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Böhm
View author publications
You can also search for this author in PubMed Google Scholar
Anastasia Ailamaki
View author publications
You can also search for this author in PubMed Google Scholar
Kai-Uwe Sattler
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Iraklis Psaroudakis or Florian Wolf .

Editor information

Editors and Affiliations

Cisco Systems, Inc., San Jose, California, USA
Raghunath Nambiar
Oracle Corporation, Redwood Shores, California, USA
Meikel Poess

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Psaroudakis, I. et al. (2015). Scaling Up Mixed Workloads: A Battle of Data Freshness, Flexibility, and Scheduling. In: Nambiar, R., Poess, M. (eds) Performance Characterization and Benchmarking. Traditional to Big Data. TPCTC 2014. Lecture Notes in Computer Science(), vol 8904. Springer, Cham. https://doi.org/10.1007/978-3-319-15350-6_7

Download citation

DOI: https://doi.org/10.1007/978-3-319-15350-6_7
Published: 05 February 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-15349-0
Online ISBN: 978-3-319-15350-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics