ParlBench: A SPARQL Benchmark for Electronic Publishing Applications

  • Tatiana Tarasova
  • Maarten Marx
Conference paper

DOI: 10.1007/978-3-642-41242-4_2

Part of the Lecture Notes in Computer Science book series (LNCS, volume 7955)
Cite this paper as:
Tarasova T., Marx M. (2013) ParlBench: A SPARQL Benchmark for Electronic Publishing Applications. In: Cimiano P., Fernández M., Lopez V., Schlobach S., Völker J. (eds) The Semantic Web: ESWC 2013 Satellite Events. ESWC 2013. Lecture Notes in Computer Science, vol 7955. Springer, Berlin, Heidelberg

Abstract

ParlBench is an RDF benchmark modelling a large-scale electronic publishing scenario. The benchmark offers large collections of Dutch parliamentary proceedings, together with information about members of parliament and political parties. The data is real but free of intellectual property rights issues. Several application benchmarks, as well as targeted micro-benchmarks, can be developed on top of the benchmark data sets. This paper describes the benchmark data sets and 19 analytical queries covering a wide range of SPARQL constructs. The potential use of ParlBench is demonstrated by executing the queries for 8 different scalings of the benchmark data sets on the Virtuoso RDF store. Measured on a standard laptop, data loading times varied from 43 seconds (for 1% of the data set) to 48 minutes (for the complete data set), and execution of the complete set of queries (570 queries in total) varied from 9 minutes to 13 hours.

Keywords

SPARQL, RDF benchmark, parliamentary proceedings

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Tatiana Tarasova (1)
  • Maarten Marx (1)
  1. ISLA, University of Amsterdam, Amsterdam, The Netherlands
