ParlBench: A SPARQL Benchmark for Electronic Publishing Applications
Cite this paper as: Tarasova T., Marx M. (2013) ParlBench: A SPARQL Benchmark for Electronic Publishing Applications. In: Cimiano P., Fernández M., Lopez V., Schlobach S., Völker J. (eds) The Semantic Web: ESWC 2013 Satellite Events. ESWC 2013. Lecture Notes in Computer Science, vol 7955. Springer, Berlin, Heidelberg
ParlBench is an RDF benchmark modelling a large-scale electronic publishing scenario. The benchmark offers large collections of the Dutch parliamentary proceedings together with information about members of the parliament and political parties. The data is real, but free of intellectual property rights issues. Several application benchmarks, as well as targeted micro-benchmarks, can be developed on top of the benchmark data sets. This paper describes the benchmark data sets and 19 analytical queries covering a wide range of SPARQL constructs. The potential use of ParlBench is demonstrated by executing the queries on 8 different scalings of the benchmark data sets on the Virtuoso RDF store. Measured on a standard laptop, data loading times varied from 43 seconds (for 1% of the data set) to 48 minutes (for the complete data set), and execution of the complete set of queries (570 queries in total) varied from 9 minutes to 13 hours.
Keywords: SPARQL, RDF benchmark, parliamentary proceedings