Parameter Curation for Benchmark Queries

Gubichev, Andrey; Boncz, Peter

doi:10.1007/978-3-319-15350-6_8

Andrey Gubichev¹⁵ &
Peter Boncz¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 8904))

Included in the following conference series:

Technology Conference on Performance Evaluation and Benchmarking

1446 Accesses
4 Citations

Abstract

In this paper we consider the problem of generating parameters for benchmark queries so these have stable behavior despite being executed on datasets (real-world or synthetic) with skewed data distributions and value correlations. We show that uniform random sampling of the substitution parameters is not well suited for such benchmarks, since it results in unpredictable runtime behavior of queries. We present our approach of Parameter Curation with the goal of selecting parameter bindings that have consistently low-variance intermediate query result sizes throughout the query plan. Our solution is illustrated with IMDB data and the recently proposed LDBC Social Network Benchmark (SNB).

Partially supported by EU project LDBC (FP7-317548), see http://ldbc.eu.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 34.99; Price excludes VAT (USA)

Softcover Book: USD 44.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
See http://github.com/ldbc and http://ldbcouncil.org.

References

LDBC Benchmark. http://ldbc.eu:8090/display/TUC/Interactive+Workload
Barahmand, S., Ghandeharizadeh, S.: BG: a benchmark to evaluate interactive social networking actions. In: CIDR (2013)
Google Scholar
Boncz, P., Neumann, T., Erling, O.: TPC-H analyzed: hidden messages and lessons learned from an influential benchmark. In: Nambiar, R., Poess, M. (eds.) TPCTC 2013. LNCS, vol. 8391, pp. 61–76. Springer, Heidelberg (2014)
Chapter Google Scholar
Pham, M.-D., Boncz, P., Erling, O.: S3G2: a scalable structure-correlated social graph generator. In: Nambiar, R., Poess, M. (eds.) TPCTC 2012. LNCS, vol. 7755, pp. 156–172. Springer, Heidelberg (2013)
Chapter Google Scholar
Moerkotte, G.: Building query compilers. http://pi3.informatik.uni-mannheim.de/~moer/querycompiler.pdf
Poess, M., Stephens Jr., J.M.: Generating thousand benchmark queries in seconds. In: VLDB 2004, pp. 1045–1053 (2004)
Google Scholar
Stephens, J.M., Poess, M.: MUDD: a multi-dimensional data generator. SIGSOFT Softw. Eng. Notes 29(1), 104–109 (2004)
Article Google Scholar

Download references

Author information

Authors and Affiliations

TU Munich, Munich, Germany
Andrey Gubichev
CWI, Amsterdam, The Netherlands
Peter Boncz

Authors

Andrey Gubichev
View author publications
You can also search for this author in PubMed Google Scholar
Peter Boncz
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Andrey Gubichev .

Editor information

Editors and Affiliations

Cisco Systems, Inc., San Jose, California, USA
Raghunath Nambiar
Oracle Corporation, Redwood Shores, California, USA
Meikel Poess

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gubichev, A., Boncz, P. (2015). Parameter Curation for Benchmark Queries. In: Nambiar, R., Poess, M. (eds) Performance Characterization and Benchmarking. Traditional to Big Data. TPCTC 2014. Lecture Notes in Computer Science(), vol 8904. Springer, Cham. https://doi.org/10.1007/978-3-319-15350-6_8

Download citation

DOI: https://doi.org/10.1007/978-3-319-15350-6_8
Published: 05 February 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-15349-0
Online ISBN: 978-3-319-15350-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics