SPLODGE: Systematic Generation of SPARQL Benchmark Queries for Linked Open Data

  • Olaf Görlitz
  • Matthias Thimm
  • Steffen Staab
Conference paper

DOI: 10.1007/978-3-642-35176-1_8

Volume 7649 of the book series Lecture Notes in Computer Science (LNCS)
Cite this paper as:
Görlitz O., Thimm M., Staab S. (2012) SPLODGE: Systematic Generation of SPARQL Benchmark Queries for Linked Open Data. In: Cudré-Mauroux P. et al. (eds) The Semantic Web – ISWC 2012. ISWC 2012. Lecture Notes in Computer Science, vol 7649. Springer, Berlin, Heidelberg

Abstract

The distributed and heterogeneous nature of Linked Open Data requires flexible, federated techniques for query evaluation. To evaluate current federated querying approaches, a general methodology for conducting benchmarks is needed. In this paper, we present a classification methodology for federated SPARQL queries. Developers of federated querying approaches can use this methodology to compose a set of benchmark tests that covers diverse query characteristics and allows for comparability. We further develop a heuristic called SPLODGE for the automatic generation of benchmark queries; it is based on this methodology and takes into account the number of sources to be queried and several complexity parameters. We evaluate the adequacy of our methodology and the query generation strategy by applying them to the 2011 Billion Triple Challenge data set.

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Olaf Görlitz (1)
  • Matthias Thimm (1)
  • Steffen Staab (1)
  1. Institute for Web Science and Technology, University of Koblenz-Landau, Germany