Distributed Linked Data Business Communication Networks: The LUCID Endpoint

Tramp, Sebastian; Piris, Ruben Navarro; Ermilov, Timofey; Petersen, Niklas; Frommhold, Marvin; Auer, Sören

doi:10.1007/978-3-319-25639-9_30

Sebastian Tramp²⁰,
Ruben Navarro Piris²⁰,
Timofey Ermilov²⁰,
Niklas Petersen²¹,
Marvin Frommhold²⁰ &
…
Sören Auer²²

Part of the book series: Lecture Notes in Computer Science ((LNCCN,volume 9341))

Included in the following conference series:

European Semantic Web Conference

1504 Accesses
1 Citations

Abstract

With the LUCID Endpoint, we demonstrate how companies can utilize Linked Data technology to provide major data items for their business partners in a timely manner, machine readable and with open and extensible schemata. The main idea is to provide a Linked Data infrastructure which enables all partners to fetch, as well as to clone and to synchronize datasets from other partners over the network. This concept allows for building of networks of business partners much like as social network but in a distributed manner. It furthermore provides a technical infrastructure for business communication acts such as supply chain communication or master data management.

You have full access to this open access chapter, Download conference paper PDF

The Business Network Data Management Platform

Sharing

Linked Data Finland: A 7-star Model and Platform for Publishing and Re-using Linked Datasets

1 The LUCID Endpoint

The LUCID endpoint^{Footnote 1} provides the necessary technology stack to manage and publish Linked Data, as well as to consume Linked Data from other LUCID endpoints.

This includes authentication and authorization mechanisms to guarantee that data consumers access only the data that the endpoint owner explicitly allows. To ensure consumer authentication, OAuth2 [6] is used. Access control rules can be defined on a named graph level.

Once a local graph was modified, the endpoint will notify its subscribers by sending the latest change sets for inclusion^{Footnote 2}. An example of this approach is depicted in Fig. 1. By sending only the modifications instead of the complete new dataset, subscribers can easily recognize the changed triples without the need for calculation of expensive data diffs itself. Subscriber endpoints can then apply those modifications to their local dataset clones automatically. In order to describe these dataset changes, a vocabulary and exchange format is needed, which we will explain in the next section.

2 The Eccenca Revision Vocabulary

In order to both keep track of the modifications on the local quad store and notify subscribers of it about those modifications, we developed the eccenca Revision Vocabulary^{Footnote 3}. This vocabulary is modelled using OWL (OWL 2 DL profile) and extends as well as reuses several concepts of the PROV-O ontology [7].

Unlike other approaches, such as [1], which try to describe changes on higher semantic levels, our approach is based on triple (or rather quad) changes, where each revision or modification event (called commit) contains a diff representing the changed (either inserted and/or deleted) quads. This simple model enables applications to rebuild and revert each commit as well as to merge diverted evolution branches as explained in [3].

Our data modelling approach is build on top of the one proposed in [5], but instead of holding separate revision histories for each revisioned named graph, our approach keeps a unified revision history on any number of named graphs. This enables applications to track revisions across different graphs or for the whole quad store.

Figure 2 illustrates the main parts of the vocabulary: The Commit class defines an instantaneous event containing a set of graph revisions. This class contains also the meta data associated to this event such as author, date and commit message. Revisions (modelled as the Revision class) refer each to a specific named graph which was changed. Changes in an RDF store are defined either as triple insertions (deltaInsertion) or deletions (deltaDeletions) inline with the approach in [2].

Further work to support branching, commit signing and blank nodes is in progress.

3 Demonstration Use-Case: Master Data Management

Our setup for the demonstration of the LUCID endpoint deploys a very basic but pressing use case in business to business communication: master data management. Enterprise master data is the single source of basic business information used across all enterprise systems, applications and processes for an entire enterprise. This includes resources such as persons, company sites and subsidiaries as well as contact details.

Our proposed demo consists of the following parts:

Publishing of master data datasets with a browser based user interface: A LUCID endpoint provides a dataset for each account. The account owner is free to upload any data to this dataset. All resources from the dataset namespace are available as Linked Data and enabled for publish/subscribe as well as OAuth (in case the dataset is non-public). In addition to the generic access via SPARQL, the user can utilize a master data management application. This single page JavaScript application allows for creation of master data resources such as company subsidiaries and contact details. The RDF data model for these resources is based on the master data model from Odette International, a collaboration platform for the automotive supply chain.
Versioning of the dataset changes on the SPARQL endpoint backend: All changes to the user dataset are logged as part of the internal LUCID endpoint triple store. The changed triples are calculated directly by the SPARQL query processor and added to the versioning store.
Subscription to datasets of another LUCID endpoint by employing the dataset URL: All resources which are Linked Data accessible, are enabled for publish/subscribe activities as well. The user interface is able to manage subscriptions to other endpoints as well as to provide a preview for the incoming data.
A publish/subscribe mechanism which uses commit push notifications based on the eccenca revision vocabulary described in Sect. 2: For each resource, a change log dataset is available, which provides the last Commit information. In addition to that, notifications with these Commit information as payload are pushed to all subscribers in case of a change. The subscribing endpoint adds the incoming data to its dataset clone as well as hold the change log in order to provide versioning information to the user.

Figure 3 depicts two screenshots of the master data management user interface which lies on top of the versioning and OAuth2 enabled SPARQL endpoint^{Footnote 4}.

Notes

1.
Which is based on the eccenca Linked Data Suite Backend.
2.
The overall process is compatible with the PubSubHubbub working draft v0.4 [4]. The specification of a more general architecture for this kind of distributed semantic social networks is available as [8].
3.
The eccenca Revision Vocabulary is available at https://vocab.eccenca.com/revision/.
4.
An annotated demonstration video is available at http://downloads.eccenca.com/2015/03/13/eswc2015-lucid-demo.mp4.

References

Auer, S., Herre, H.: A versioning and evolution framework for RDF knowledge bases. In: Proceedings of Ershov Memorial Conference (2006)
Google Scholar
Berners-lee, T., Connolly, D.: Delta: an ontology for the distribution of differences between RDF graphs. Technical report, W3C (2004). http://www.w3.org/DesignIssues/lncs04/Diff.pdf
Cassidy, S., Ballantine, J.: Version control for RDF triple stores. In: Filipe, J., Shishkov, B., Helfert, M. (eds.) Proceedings of the Second International Conference on Software and Data Technologies, ICSOFT 2007, ISDM/EHST/DC, 22–25 July 2007, Barcelona, Spain, pp. 5–12. INSTICC Press (2007)
Google Scholar
Fitzpatrick, B., Slatkin, B., Atkins, M., Genestoux, J.: PubSubHubbub Core 0.4. Working draft, PubSubHubbub W3C Community Group (2013). https://pubsubhubbub.googlecode.com/git/pubsubhubbub-core-0.4.html
Graube, M., Hensel, S., Urbas, L.: R43ples: revisions for triples - an approach for version control in the semantic web. In: Knuth, M., Kontokostas, D., Sack, H. (eds.) Proceedings of the 1st Workshop on Linked Data Quality Co-located with 10th International Conference on Semantic Systems, LDQ@SEMANTiCS 2014, 2nd September 2014, Leipzig, Germany, vol. 1215. CEUR Workshop Proceedings. CEUR-WS.org (2014)
Google Scholar
Hardt, D.: The OAuth 2.0 Authorization Framework. RFC 6749, IETF, October 2012. https://tools.ietf.org/html/rfc6749
Lebo, T., Sahoo, S., McGuinness, D., Belhajjame, K., Cheney, J., Corsar, D., Garijo, D., Soiland-Reyes, S., Zednik, S., Zhao, J.: PROV-O: The PROV Ontology. W3C Recommendation, W3C, April 2013. http://www.w3.org/TR/prov-o/
Tramp, S., Frischmuth, P., Ermilov, T., Shekarpour, S., Auer, S.: An architecture of a distributed semantic social network. Semant. Web 5(1), 77–95 (2014)
Article Google Scholar

Download references

Acknowledgement

This work was partly supported by a grant from the German Federal Ministry of Education and Research (BMBF) in the IKT 2020 funding programme (GA no. 01IS14019) for the LUCID Project (http://lucid-project.org).

Author information

Authors and Affiliations

eccenca GmbH, Hainstr. 8, 04109, Leipzig, Germany
Sebastian Tramp, Ruben Navarro Piris, Timofey Ermilov & Marvin Frommhold
Enterprise Information Systems (EIS) at the Institute for Applied Computer Science at University of Bonn, Römerstr. 164, 53117, Bonn, Germany
Niklas Petersen
Fraunhofer Institute for Intelligent Analysis and Information Systems (IAIS), Schloss Birlinghoven, 53757, Sankt Augustin, Germany
Sören Auer

Authors

Sebastian Tramp
View author publications
You can also search for this author in PubMed Google Scholar
Ruben Navarro Piris
View author publications
You can also search for this author in PubMed Google Scholar
Timofey Ermilov
View author publications
You can also search for this author in PubMed Google Scholar
Niklas Petersen
View author publications
You can also search for this author in PubMed Google Scholar
Marvin Frommhold
View author publications
You can also search for this author in PubMed Google Scholar
Sören Auer
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sebastian Tramp .

Editor information

Editors and Affiliations

Inria, Sophia Antipolis, France
Fabien Gandon
Data Archiving and Networked Services, Den Haag, The Netherlands
Christophe Guéret
Inria - Sophia Antipolis-Méditerran, Sophia Antipolis, France
Serena Villata
Eng-3047, Engineering, National University of Ireland, Galway City, Ireland
John Breslin
Laboratoire I3S, Polytech Nice Sophia, Sophia Antipolis, France
Catherine Faron-Zucker
Ecole des Mines de Saint-Etienne, Saint-Etienne, France
Antoine Zimmermann

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tramp, S., Piris, R.N., Ermilov, T., Petersen, N., Frommhold, M., Auer, S. (2015). Distributed Linked Data Business Communication Networks: The LUCID Endpoint. In: Gandon, F., Guéret, C., Villata, S., Breslin, J., Faron-Zucker, C., Zimmermann, A. (eds) The Semantic Web: ESWC 2015 Satellite Events. ESWC 2015. Lecture Notes in Computer Science(), vol 9341. Springer, Cham. https://doi.org/10.1007/978-3-319-25639-9_30

Download citation

DOI: https://doi.org/10.1007/978-3-319-25639-9_30
Published: 09 January 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25638-2
Online ISBN: 978-3-319-25639-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Distributed Linked Data Business Communication Networks: The LUCID Endpoint

Abstract

Similar content being viewed by others