Abstract
Constructing a knowledge graph with mapping languages, such as RML or SPARQL-Generate, allows seamlessly integrating heterogeneous data by defining access-specific definitions for e.g., databases or files. However, such mapping languages have limited support for describing Web APIs and no support for describing data with varying velocities, as needed for e.g., streams, neither for the input data nor for the output RDF. This hampers the smooth and reproducible generation of knowledge graphs from heterogeneous data and their continuous integration for consumption since each implementation provides its own extensions. Recently, the Web of Things (WoT) Working Group released a set of recommendations to provide a machine-readable description of metadata and network-facing interfaces for Web APIs and streams. In this paper, we investigated (i) how mapping languages can be aligned with the newly specified recommendations to describe and handle heterogeneous data with varying velocities and Web APIs, and (ii) how such descriptions can be used to indicate how the generated knowledge graph should be exported. We extended RML’s Logical Source to support WoT descriptions of Web APIs and streams, and introduced RML’s Logical Target to describe the generated knowledge graph reusing the same descriptions. We implemented these extensions in the RMLMapper and RMLStreamer, and validated our approach in two use cases. Mapping languages are now able to use the same descriptions to define the input data but also the output RDF. This way, our work paves the way towards more reproducible workflows for knowledge graph generation.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsChange history
11 May 2021
In Chapter 10, the term “paths” was used instead of the term “operation.” This has been corrected and the term “operations” is now used throughout the paper.
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
CARML’s Stream: https://github.com/carml/carml
RMLStreamer’s RML extension: https://github.com/RMLio/rmlstreamer
Chimera’s InputStream: https://github.com/cefriel/chimera.
- 10.
- 11.
- 12.
- 13.
- 14.
- 15.
- 16.
- 17.
- 18.
- 19.
- 20.
- 21.
References
Alexander, K., Cyganiak, R., Hausenblas, M., Zhao, J.: Describing Linked Datasets with the VoID Vocabulary. Interest group note, World Wide Web Consortium (W3C) (2011). https://www.w3.org/TR/void/
Charpenay, V., Lefrançois, M., Poveda Villalón, M., Käbisch, S.: Thing Description (TD) Ontology. Working group editor’s draft, World Wide Web Consortium (W3C) (2020). https://www.w3.org/2019/wot/td
Chortaras, A., Stamou, G.: Mapping diverse data to RDF in practice. In: Vrandečić, D., et al. (eds.) ISWC 2018. LNCS, vol. 11136, pp. 441–457. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00671-6_26
Christensen, E., Curbera, F., Meredith, G., Weerawarana, S.: Web ServicesDescription Language (WSDL) 1.0 (2000)
Das, S., Sundara, S., Cyganiak, R.: R2RML: RDB to RDF Mapping Language. Working group recommendation, World Wide Web Consortium (W3C) (2012). http://www.w3.org/TR/r2rml/
Dimou, A., Sande, M.V., Colpaert, P., Verborgh, R., Mannens, E., Van de Walle, R.: RML: a generic language for integrated RDF mappings of heterogeneous data. In: Proceedings of the 7th Workshop on Linked Data on the Web. CEUR Workshop Proceedings, vol. 1184. CEUR-WS.org (2014)
Dimou, A., Verborgh, R., Sande, M.V., Mannens, E., de Walle, R.V.: Machine-interpretable dataset and service descriptions for heterogeneous data access and retrieval. In: Proceedings of the 11th International Conference on Semantic Systems - SEMANTICS 2015. ACM Press (2015)
Haesendonck, G., Maroy, W., Heyvaert, P., Verborgh, R., Dimou, A.: Parallel RDF generation from heterogeneous big data. In: Proceedings of the International Workshop on Semantic Big Data - SBD 2019. ACM Press, Amsterdam, Netherlands (2019)
Harris, S., Seaborne, A.: SPARQL 1.1 Query Language. Recommendation, World Wide Web Consortium (W3C) (2013). https://www.w3.org/TR/sparql11-query/
Kaebisch, S., Kamiya, T., McCool, M., Charpenay, V., Kovatsch, M.: Web of Things (WoT) Thing Description. Working group recommendation, World Wide Web Consortium (W3C) (2020). http://www.w3.org/TR/wot-thing-description/
Koch, J., Valesco, C.A., Ackermann, P.: HTTP Vocabulary in RDF 1.0. Working group note, World Wide Web Consortium (W3C) (2017). http://www.w3.org/TR/HTTP-in-RDF10/
Konstantinou, N., Spanos, D.E., Houssos, N., Mitrou, N.: Exposing scholarlyinformation as Linked Open Data: RDFizing DSpace contents. The ElectronicLibrary (2014)
Koster, M., Korkan, E.: Web of Things (WoT) Binding Templates. Working group note, World Wide Web Consortium (W3C) (2020). http://www.w3.org/TR/wot-binding-templates/
Lanthaler, M.: Hydra Core Vocabulary: A Vocabulary for Hypermedia-Driven Web APIs. Unofficial draft, World Wide Web Consortium (W3C) (2019). http://www.hydra-cg.com/spec/latest/core/
Lefrançois, M., Zimmermann, A., Bakerally, N.: A SPARQL extension for generating RDF from heterogeneous formats. In: Blomqvist, E., Maynard, D., Gangemi, A., Hoekstra, R., Hitzler, P., Hartig, O. (eds.) ESWC 2017. LNCS, vol. 10249, pp. 35–50. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-58068-5_3
Maali, F., Erickson, J.: Data Catalog Vocabulary (DCAT). Recommendation, World Wide Web Consortium (W3C) (2014). https://www.w3.org/TR/vocab-dcat/
Martin, D., et al.: OWL-S: Semantic Markup for Web Services. Member submission, World Wide Web Consortium (W3C) (2004). http://www.w3.org/Submission/OWL-S/
Mauri, A., et al.: TripleWave: spreading RDF streams on the web. In: Groth, P., et al. (eds.) ISWC 2016. LNCS, vol. 9982, pp. 140–149. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46547-0_15
Michel, F., Djimenou, L., Faron-Zucker, C., Montagnat, J.: Translation of heterogeneous databases into RDF, and application to the construction of a SKOS taxonomical reference. In: Monfort, V., Krempels, K.-H., Majchrzak, T.A., Turk, Ž. (eds.) WEBIST 2015. LNBIP, vol. 246, pp. 275–296. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-30996-5_14
Santipantakis, G.M., Kotis, K.I., Vouros, G.A., Doulkeridis, C.: RDF-Gen: generating RDF from streaming and archival data. In: Proceedings of the 8th International Conference on Web Intelligence, Mining and Semantics (2018)
Scrocca, M., Comerio, M., Carenini, A., Celino, I.: Turning transport data to comply with EU standards while enabling a multimodal transport knowledge graph. In: Pan, J.Z., et al. (eds.) ISWC 2020. LNCS, vol. 12507, pp. 411–429. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-62466-8_26
Tommasini, R., et al.: VoCaLS: vocabulary and catalog of linked streams. In: Vrandečić, D., et al. (eds.) ISWC 2018. LNCS, vol. 11137, pp. 256–272. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00668-6_16
Williams, G.: SPARQL 1.1 Service Description. Recommendation, World Wide Web Consortium (W3C) (2013). https://www.w3.org/TR/sparql11-service-description/
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Van Assche, D. et al. (2021). Leveraging Web of Things W3C Recommendations for Knowledge Graphs Generation. In: Brambilla, M., Chbeir, R., Frasincar, F., Manolescu, I. (eds) Web Engineering. ICWE 2021. Lecture Notes in Computer Science(), vol 12706. Springer, Cham. https://doi.org/10.1007/978-3-030-74296-6_26
Download citation
DOI: https://doi.org/10.1007/978-3-030-74296-6_26
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-74295-9
Online ISBN: 978-3-030-74296-6
eBook Packages: Computer ScienceComputer Science (R0)