TPC-H Analyzed: Hidden Messages and Lessons Learned from an Influential Benchmark

  • Peter Boncz
  • Thomas Neumann
  • Orri Erling
Conference paper

DOI: 10.1007/978-3-319-04936-6_5

Part of the Lecture Notes in Computer Science book series (LNCS, volume 8391)
Cite this paper as:
Boncz P., Neumann T., Erling O. (2014) TPC-H Analyzed: Hidden Messages and Lessons Learned from an Influential Benchmark. In: Nambiar R., Poess M. (eds) Performance Characterization and Benchmarking. TPCTC 2013. Lecture Notes in Computer Science, vol 8391. Springer, Cham

Abstract

The TPC-D benchmark was developed almost 20 years ago, and even though its current incarnation, TPC-H, could be considered superseded by TPC-DS, one can still learn from it. We focus on the technical level, summarizing the challenges posed by the TPC-H workload as we now understand them, which we call “choke points”. We identify 28 such choke points, grouped into six categories: Aggregation Performance, Join Performance, Data Access Locality, Expression Calculation, Correlated Subqueries and Parallel Execution. On the meta-level, we make the point that the rich set of choke points found in TPC-H sets an example of how to design future DBMS benchmarks.

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Peter Boncz (1)
  • Thomas Neumann (2)
  • Orri Erling (3)
  1. CWI, Amsterdam, The Netherlands
  2. Technical University Munich, Germany
  3. Openlink Software, United Kingdom
