DLUBM: A Benchmark for Distributed Linked Data Knowledge Base Systems

  • Felix Leif KeppmannEmail author
  • Maria Maleshkova
  • Andreas Harth
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10574)


Linked Data is becoming a stable technology alternative and is no longer only an innovation trend. More and more companies are looking into adapting Linked Data as part of the new data economy. Driven by the growing availability of data sources, solutions are constantly being newly developed or improved in order to support the necessity for data exchange both in web and enterprise settings. Unfortunately, currently the choice whether to use Linked Data is more an educated guess than a fact-based decision. Therefore, the provisioning of open benchmarking tools and reports, which allow developers to assess the fitness of existing solutions, is key for pushing the development of better Linked Data-based approaches and solutions. To this end we introduce a novel Linked Data benchmark – Distributed LUBM, which enables the reproducible creation and deployment of distributed interlinked LUBM datasets. We provide a system architecture for distributed Linked Data benchmark environments, accompanied by guiding design requirements. We instantiate the architecture with the actual DLUBM implementation and evaluate a Linked Data query engine via DLUBM.


Linked Data Linked Data benchmarking Distributed benchmarking LUBM DLUBM 


  1. 1.
    Abele, A., McCrae, J.P., Buitelaar, P., Jentzsch, A., Cyganiak, R.: Linking Open Data cloud diagram, March 2017,
  2. 2.
    Aluç, G., Hartig, O., Özsu, M.T., Daudjee, K.: Diversified stress testing of RDF data management systems. In: Mika, P., Tudorache, T., Bernstein, A., Welty, C., Knoblock, C., Vrandečić, D., Groth, P., Noy, N., Janowicz, K., Goble, C. (eds.) ISWC 2014. LNCS, vol. 8796, pp. 197–212. Springer, Cham (2014). doi: 10.1007/978-3-319-11964-9_13 Google Scholar
  3. 3.
    Angles, R., Boncz, P.A., Larriba-Pey, J.L., Fundulaki, I., Neumann, T., Erling, O., Neubauer, P., Martinez-Bazan, N., Kotsev, V., Toma, I: The Linked data benchmark council: a graph and RDF industry benchmarking effort. SIGMOD Rec. (2014)Google Scholar
  4. 4.
    Armstrong, T.G., Ponnekanti, V., Borthakur, D., Callaghan, M.: LinkBench: a database benchmark based on the facebook social graph. In: Proceedings of the SIGMOD International Conference on Management of Data (2013)Google Scholar
  5. 5.
    Atzori, L., Iera, A., Morabito, G.: The internet of things: a survey. Comput. Netw. (2010)Google Scholar
  6. 6.
    Bagan, G., Bonifati, A., Ciucanu, R., Fletcher, G.H.L., Lemay, A., Advokaat, N.: gMark: schema-driven generation of graphs and queries. IEEE Trans. Knowl. Data Eng. (2016)Google Scholar
  7. 7.
    Barahmand, S., Ghandeharizadeh, S.: BG: a benchmark to evaluate interactive social networking actions. In: Proceedings of the Conference on Innovative Data Systems Research (2013)Google Scholar
  8. 8.
    Bizer, C., Heath, T., Berners-Lee, T.: Linked data - the story so far. Int. J. Semant. Web Inf. Syst. (2009)Google Scholar
  9. 9.
    Bizer, C., Schultz, A.: The Berlin SPARQL benchmark. Int. J. Semant. Web Inf. Syst. (2009)Google Scholar
  10. 10.
    Blum, D., Cohen, S.: Grr: generating random RDF. In: Antoniou, G., Grobelnik, M., Simperl, E., Parsia, B., Plexousakis, D., De Leenheer, P., Pan, J. (eds.) ESWC 2011. LNCS, vol. 6644, pp. 16–30. Springer, Heidelberg (2011). doi: 10.1007/978-3-642-21064-8_2 CrossRefGoogle Scholar
  11. 11.
    Dominguez-Sal, D., Martinez-Bazan, N., Muntes-Mulero, V., Baleta, P., Larriba-Pey, J.L.: A discussion on the design of graph database benchmarks. In: Nambiar, R., Poess, M. (eds.) TPCTC 2010. LNCS, vol. 6417, pp. 25–40. Springer, Heidelberg (2010). doi: 10.1007/978-3-642-18206-8_3 Google Scholar
  12. 12.
    Duan, S., Kementsietsidis, A., Srinivas, K., Udrea, O.: Apples and oranges: a comparison of RDF benchmarks and real RDF datasets. In: Proceedings of the SIGMOD International Conference on Management of Data. ACM (2011)Google Scholar
  13. 13.
    Duquennoy, S., Grimaud, G., Vandewalle, J.J.: The Web of Things: interconnecting devices with high usability and performance. In: Proceedings of the International Conference on Embedded Software and Systems (2009)Google Scholar
  14. 14.
    Erling, O., Averbuch, A., Larriba-Pey, J., Chafi, H., Gubichev, A., Prat, A., Pham, M.D., Boncz, P.: The LDBC social network benchmark: Interactive workload. In: Proceedings of the SIGMOD International Conference on Management of Data (2015)Google Scholar
  15. 15.
    Guo, Y., Pan, Z., Heflin, J.: LUBM: A benchmark for OWL knowledge base systems. Web Semantics: Science, Services and Agents on the World Wide Web (2005)Google Scholar
  16. 16.
    Harth, A., Hose, K., Karnstedt, M., Polleres, A., Sattler, K.U., Umbrich, J.: Data summaries for on-demand queries over Linked Data. In: Proceedings of the International Conference on World Wide Web (2010)Google Scholar
  17. 17.
    Hartig, O., Bizer, C., Freytag, J.-C.: Executing SPARQL queries over the web of linked data. In: Bernstein, A., Karger, D.R., Heath, T., Feigenbaum, L., Maynard, D., Motta, E., Thirunarayan, K. (eds.) ISWC 2009. LNCS, vol. 5823, pp. 293–309. Springer, Heidelberg (2009). doi: 10.1007/978-3-642-04930-9_19 CrossRefGoogle Scholar
  18. 18.
    Huppler, K.: The art of building a good benchmark. In: Proceedings of the TPC Technology Conference on Performance Evaluation & Benchmarking (2009)Google Scholar
  19. 19.
    Jara, A.J., Olivieri, A.C., Bocchi, Y., Jung, M., Kastner, W., Skarmeta, A.F.: Semantic web of things: an analysis of the application semantics for the IoT moving towards the IoT convergence. Int. J. Web Grid Serv. (2014)Google Scholar
  20. 20.
    Joshi, A.K., Hitzler, P., Dong, G.: LinkGen: multipurpose linked data generator. In: Groth, P., Simperl, E., Gray, A., Sabou, M., Krötzsch, M., Lecue, F., Flöck, F., Gil, Y. (eds.) ISWC 2016. LNCS, vol. 9982, pp. 113–121. Springer, Cham (2016). doi: 10.1007/978-3-319-46547-0_12 CrossRefGoogle Scholar
  21. 21.
    Keppmann, F.L., Harth, A.: Adaptable interfaces, interactions, and processing for linked data platform components. In: Proceedings of the SEMANTiCS Conference (2017)Google Scholar
  22. 22.
    Schmidt, M., Hornung, T., Lausen, G., Pinkel, C.: SP2Bench: a SPARQL performance benchmark. In: Proceedings of the International Conference on Data Engineering (2009)Google Scholar
  23. 23.
    Stadtmüller, S., Speiser, S., Harth, A., Studer, R.: Data-fu: a language and an interpreter for interaction with read/write Linked Data. In: Proceedings of the International World Wide Web Conference (2013)Google Scholar
  24. 24.
    Weithöner, T., Liebig, T., Luther, M., Böhm, S.: What’s wrong with OWL benchmarks? In: Proceedings of the International Workshop on Scalable Semantic Web Knowledge Base Systems (2006)Google Scholar

Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  • Felix Leif Keppmann
    • 1
    Email author
  • Maria Maleshkova
    • 1
  • Andreas Harth
    • 1
  1. 1.Karlsruhe Institute of TechnologyKarlsruheGermany

Personalised recommendations