Abstract
Benchmarking is indispensable for assessing technologies with respect to their suitability for given tasks. While several benchmarks and benchmark generation frameworks have been developed to evaluate triple stores, they mostly provide a one-size-fits-all solution to the benchmarking problem. This approach is, however, unsuitable for evaluating the performance of a triple store for a given application with particular requirements. We address this drawback by presenting FEASIBLE, an automatic approach for generating benchmarks from the query history of applications, i.e., from query logs. The generation is achieved by selecting prototypical queries of a user-defined size from the input set of queries. We evaluate our approach on two query logs and show that the benchmarks it generates are accurate approximations of the input query logs. Moreover, we compare four different triple stores using benchmarks generated with our approach and show that they behave differently depending on the data they contain and the types of queries posed. Our results suggest that FEASIBLE generates better sample queries than the state of the art. In addition, the improved query selection and the larger set of query types lead to triple store rankings that partly differ from those reported in previous work.
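The abstract's core idea, selecting a user-defined number of prototypical queries from a query log, can be illustrated with a small sketch. Here each query is reduced to a numeric feature vector (e.g., number of triple patterns, presence of FILTER, result size); the greedy farthest-point selection used below is an illustrative stand-in for diversity-aware sampling, not the authors' actual FEASIBLE algorithm, and the feature choices are assumptions for the example.

```python
# Illustrative sketch only: FEASIBLE-style benchmark generation picks a
# user-defined number of representative queries from a query log. We assume
# each query has already been mapped to a feature vector; the selection
# strategy here (greedy farthest-point sampling) is a stand-in for the
# paper's method, chosen because it favors diverse exemplars.
from math import dist

def select_benchmark(features, k):
    """Pick k diverse exemplar indices from `features` (list of tuples)."""
    if k <= 0 or not features:
        return []
    n, d = len(features), len(features[0])
    # Seed with the query closest to the centroid of the log ...
    centroid = tuple(sum(f[i] for f in features) / n for i in range(d))
    chosen = [min(range(n), key=lambda i: dist(features[i], centroid))]
    # ... then greedily add the query farthest from the current selection.
    while len(chosen) < min(k, n):
        next_i = max(
            (i for i in range(n) if i not in chosen),
            key=lambda i: min(dist(features[i], features[j]) for j in chosen),
        )
        chosen.append(next_i)
    return sorted(chosen)

# Toy log: (num. triple patterns, uses FILTER?, log10 of result size)
log = [(1, 0, 0.3), (1, 0, 0.4), (5, 1, 2.1), (2, 1, 1.0), (8, 0, 3.5)]
picked = select_benchmark(log, 3)
```

In practice the feature vectors would be normalized, since otherwise features on larger scales (such as result size) dominate the distance computation.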
Copyright information
© 2015 Springer International Publishing Switzerland
Cite this paper
Saleem, M., Mehmood, Q., Ngonga Ngomo, A.-C.: FEASIBLE: a feature-based SPARQL benchmark generation framework. In: Arenas, M., et al. (eds.) The Semantic Web – ISWC 2015. LNCS, vol. 9366. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-25007-6_4
Print ISBN: 978-3-319-25006-9
Online ISBN: 978-3-319-25007-6