DLUBM: A Benchmark for Distributed Linked Data Knowledge Base Systems
Abstract
Linked Data is becoming a stable technology alternative and is no longer only an innovation trend. More and more companies are looking into adopting Linked Data as part of the new data economy. Driven by the growing availability of data sources, solutions are constantly being developed or improved to support the need for data exchange in both web and enterprise settings. Unfortunately, the choice of whether to use Linked Data is currently more an educated guess than a fact-based decision. Therefore, the provision of open benchmarking tools and reports, which allow developers to assess the fitness of existing solutions, is key to pushing the development of better Linked Data-based approaches and solutions. To this end, we introduce a novel Linked Data benchmark, Distributed LUBM (DLUBM), which enables the reproducible creation and deployment of distributed interlinked LUBM datasets. We provide a system architecture for distributed Linked Data benchmark environments, accompanied by guiding design requirements. We instantiate the architecture with the actual DLUBM implementation and evaluate a Linked Data query engine via DLUBM.
Keywords
Linked Data, Linked Data benchmarking, Distributed benchmarking, LUBM, DLUBM