Discovering and Maintaining Links on the Web of Data

  • Julius Volz
  • Christian Bizer
  • Martin Gaedke
  • Georgi Kobilarov
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5823)


The Web of Data is built upon two simple ideas: Employ the RDF data model to publish structured data on the Web and to create explicit data links between entities within different data sources. This paper presents the Silk – Linking Framework, a toolkit for discovering and maintaining data links between Web data sources. Silk consists of three components: 1. A link discovery engine, which computes links between data sources based on a declarative specification of the conditions that entities must fulfill in order to be interlinked; 2. A tool for evaluating the generated data links in order to fine-tune the linking specification; 3. A protocol for maintaining data links between continuously changing data sources. The protocol allows data sources to exchange both linksets as well as detailed change information and enables continuous link recomputation. The interplay of all the components is demonstrated within a life science use case.


Linked data web of data link discovery link maintenance record linkage duplicate detection 


  1. 1.
    Berners-Lee, T.: Linked Data - Design Issues,
  2. 2.
    Bizer, C., Heath, T., Berners-Lee, T.: Linked Data - The Story So Far. Journal on Semantic Web and Information Systems (in press, 2009)Google Scholar
  3. 3.
    Bizer, C., Cyganiak, R., Heath, T.: How to publish Linked Data on the Web,
  4. 4.
    Bizer, C., et al.: DBpedia - A Crystallization Point for the Web of Data. Journal of Web Semantics: Sci. Serv. Agents World Wide Web (2009), doi:10.1016/j.websem.2009.07.002Google Scholar
  5. 5.
    Jentzsch, A., et al.: Enabling Tailored Therapeutics with Linked Data. In: Proceedings of the 2nd Workshop about Linked Data on the Web (2009)Google Scholar
  6. 6.
    Jaro, M.: Advances in Record-linkage Methodology as Applied to the 1985 Census of Tampa, Florida. Journal of the American Statistical Society 84(406), 414–420 (1989)Google Scholar
  7. 7.
    Winkler, W.: Overview of Record Linkage and Current Research Directions. Bureau of the Census - Research Report Series (2006)Google Scholar
  8. 8.
    Zhong, J., et al.: Conceptual Graph Matching for Semantic Search. The 2002 International Conference on Computational Science (2002)Google Scholar
  9. 9.
    Elmagarmid, A.K., Ipeirotis, P.G., Verykios, V.S.: Duplicate Record Detection: A Survey. IEEE Transactions on Knowledge and Data Engineering 19(1), 1–16 (2007)CrossRefGoogle Scholar
  10. 10.
    Euzenat, J., Shvaiko, P.: Ontology Matching. Springer, Heidelberg (2007)zbMATHGoogle Scholar
  11. 11.
    Raimond, Y., Sutton, C., Sandler, M.: Automatic Interlinking of Music Datasets on the Semantic Web. In: Proceedings of the 1st Workshop about Linked Data on the Web (2008)Google Scholar
  12. 12.
    Hassanzadeh, O., et al.: Semantic Link Discovery Over Relational Data. In: Proceedings of the 18th ACM Conference on Information and Knowledge Management (2009)Google Scholar
  13. 13.
    Nikolov, A., et al.: Integration of Semantically Annotated Data by the KnoFuss Architecture. In: 16th International Conference on Knowledge Engineering and Knowledge Management, pp. 265–274 (2008)Google Scholar
  14. 14.
    Auer, S., et al.: Triplify – Light-Weight Linked Data Publication from Relational Databases. In: Proceedings of the 18th International World Wide Web Conference (2009)Google Scholar
  15. 15.
    Haslhofer, B., Popitsch, N.: DSNotify – Detecting and Fixing Broken Links in Linked Data Sets. In: Proceedings of 8th International Workshop on Web Semantics (2009)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Julius Volz
    • 1
  • Christian Bizer
    • 2
  • Martin Gaedke
    • 1
  • Georgi Kobilarov
    • 2
  1. 1.Distributed and Self-Organizing Systems GroupChemnitz University of TechnologyChemnitzGermany
  2. 2.Web-based Systems GroupFreie Universität BerlinBerlinGermany

Personalised recommendations