MULDER: Querying the Linked Data Web by Bridging RDF Molecule Templates

Endris, Kemele M.; Galkin, Mikhail; Lytra, Ioanna; Mami, Mohamed Nadjib; Vidal, Maria-Esther; Auer, Sören

doi:10.1007/978-3-319-64468-4_1

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 10438)

Included in the following conference series: International Conference on Database and Expert Systems Applications

Abstract

The growing number of RDF data sources that expose Linked Data through Web services forms the basis for federated SPARQL query processing. Federated SPARQL query engines provide a unified view of a federation of RDF data sources and rely on source descriptions to select the data sources over which a query will be executed. Although efficient, existing federated SPARQL query engines usually ignore the meaning of the data accessible from a source and describe sources only in terms of the vocabularies they use. Such coarse source descriptions may lead to the erroneous selection of data sources for a query, which degrades the performance of query processing over the federation. We tackle the problem of federated SPARQL query processing and devise MULDER, a query engine for federations of RDF data sources. MULDER describes data sources in terms of RDF molecule templates, i.e., abstract descriptions of entities belonging to the same RDF class, and uses these templates for source selection, query decomposition, and query optimization. We empirically study the performance of MULDER on existing benchmarks and compare it with state-of-the-art federated SPARQL query engines. The experimental results suggest that RDF molecule templates enable MULDER to select RDF data sources in a way that not only reduces execution time but also increases answer completeness.
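
To make the idea of template-based source selection concrete, the following is a minimal sketch and not MULDER's actual implementation. It assumes that a molecule template can be reduced to an RDF class, the set of predicates its entities carry, and the endpoints that serve them. The names `MoleculeTemplate`, `select_sources`, the `ex:` terms, and the `example.org` endpoint URLs are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MoleculeTemplate:
    """Hypothetical, simplified view of an RDF molecule template."""
    rdf_class: str          # class shared by the described entities
    predicates: frozenset   # predicates those entities carry
    endpoints: frozenset    # endpoints that serve such entities


def select_sources(star_predicates, templates):
    """Return the endpoints of every template covering all predicates
    of a star-shaped subquery (triple patterns sharing one subject)."""
    needed = set(star_predicates)
    selected = set()
    for template in templates:
        if needed <= template.predicates:
            selected |= template.endpoints
    return selected


# Illustrative federation of two sources (made-up data).
TEMPLATES = [
    MoleculeTemplate(
        rdf_class="ex:Drug",
        predicates=frozenset({"rdf:type", "ex:name", "ex:target"}),
        endpoints=frozenset({"http://example.org/drugs/sparql"}),
    ),
    MoleculeTemplate(
        rdf_class="ex:Disease",
        predicates=frozenset({"rdf:type", "ex:name"}),
        endpoints=frozenset({"http://example.org/diseases/sparql"}),
    ),
]

# A subquery asking for ?s ex:name ?n . ?s ex:target ?t
drug_sources = select_sources(["ex:name", "ex:target"], TEMPLATES)

# A subquery asking only for ?s ex:name ?n
name_sources = select_sources(["ex:name"], TEMPLATES)
```

In this toy federation, a description based on vocabulary alone would consider both endpoints for the `ex:name` triple pattern. Matching the whole star against class-level templates narrows the first subquery to the drug endpoint, which is the kind of pruning the abstract attributes to RDF molecule templates.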

Notes

1. https://github.com/EIS-Bonn/MULDER

Acknowledgements

This work has been partially funded by the EU Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No. 642795 (WDAqua) and the EU H2020 programme for the project BigDataEurope (GA 644564).

Author information

Authors and Affiliations

Enterprise Information Systems (EIS), University of Bonn, Bonn, Germany
Kemele M. Endris, Mikhail Galkin, Ioanna Lytra, Mohamed Nadjib Mami & Sören Auer
Fraunhofer Institute for Intelligent Analysis and Information Systems (IAIS), Sankt Augustin, Germany
Mikhail Galkin, Ioanna Lytra, Mohamed Nadjib Mami, Maria-Esther Vidal & Sören Auer
ITMO University, Saint Petersburg, Russia
Mikhail Galkin

Corresponding author

Correspondence to Mikhail Galkin.

Editor information

Editors and Affiliations

University of Lyon, Villeurbanne, France
Djamal Benslimane
University of Milan, Milan, Italy
Ernesto Damiani
University of Michigan, Dearborn, Michigan, USA
William I. Grosky
Paul Sabatier University, Toulouse, France
Abdelkader Hameurlain
Wright State University, Dayton, Ohio, USA
Amit Sheth
Johannes Kepler University, Linz, Austria
Roland R. Wagner

Cite this paper

Endris, K.M., Galkin, M., Lytra, I., Mami, M.N., Vidal, M.-E., Auer, S. (2017). MULDER: Querying the Linked Data Web by Bridging RDF Molecule Templates. In: Benslimane, D., Damiani, E., Grosky, W., Hameurlain, A., Sheth, A., Wagner, R. (eds) Database and Expert Systems Applications. DEXA 2017. Lecture Notes in Computer Science, vol 10438. Springer, Cham. https://doi.org/10.1007/978-3-319-64468-4_1

DOI: https://doi.org/10.1007/978-3-319-64468-4_1
Published: 01 August 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-64467-7
Online ISBN: 978-3-319-64468-4
eBook Packages: Computer Science, Computer Science (R0)
