When owl:sameAs Isn’t the Same: An Analysis of Identity in Linked Data

  • Harry Halpin
  • Patrick J. Hayes
  • James P. McCusker
  • Deborah L. McGuinness
  • Henry S. Thompson
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6496)

Abstract

In Linked Data, the use of owl:sameAs is ubiquitous in interlinking data-sets. There is however, ongoing discussion about its use, and potential misuse, particularly with regards to interactions with inference. In fact, owl:sameAs can be viewed as encoding only one point on a scale of similarity, one that is often too strong for many of its current uses. We describe how referentially opaque contexts that do not allow inference exist, and then outline some varieties of referentially-opaque alternatives to owl:sameAs. Finally, we report on an empirical experiment over randomly selected owl:sameAs statements from the Web of data. This theoretical apparatus and experiment shed light upon how owl:sameAs is being used (and misused) on the Web of data.

Keywords

linked data identity coreference 

References

  1. 1.
    Bechhofer, S., van Harmelen, F., Hendler, J., Horrocks, I., McGuinness, D.L., Patel-Schneider, P.F., Stein, L.A.: OWL Web Ontology Language Reference (2004)Google Scholar
  2. 2.
    Bizer, C., Cyganiak, R.: Quality-driven information filtering using the wiqa policy framework. Web Semantics: Science, Services and Agents on the World Wide Web 7(1), 1–10 (2009)CrossRefGoogle Scholar
  3. 3.
    Bizer, C., Cygniak, R., Heath, T.: How to publish Linked Data on the Web (2007), http://www4.wiwiss.fu-berlin.de/bizer/pub/LinkedDataTutorial/ (last accessed on May 28, 2008)
  4. 4.
    Bouquet, P., Stoermer, H., Giacomuzzi, D.: OKKAM: Enabling a Web of Entities. In: I3: Identity, Identifiers, Identification. Proceedings of the WWW 2007 Workshop on Entity-Centric Approaches to Information and Knowledge Management on the Web, Banff, Canada, May 8. CEUR Workshop Proceedings (2007) ISSN 1613-0073, http://CEUR-WS.org/Vol-249/submission_150.pdf
  5. 5.
    Brickley, D., Guha, R.V.: RDF Vocabulary Description Language 1.0: RDF Schema (2004)Google Scholar
  6. 6.
    Carletta, J.: Assessing agreement on classification tasks: The kappa statistic. Computational Linguistics 22, 249–254 (1996)Google Scholar
  7. 7.
    Glaser, H., Millard, I., Jaffri, A.: RKBExplorer.com: A knowledge driven infrastructure for Linked Data providers. In: Bechhofer, S., Hauswirth, M., Hoffmann, J., Koubarakis, M. (eds.) ESWC 2008. LNCS, vol. 5021, pp. 797–801. Springer, Heidelberg (2008)CrossRefGoogle Scholar
  8. 8.
    Halpin, H.: A query-driven characterization of linked data. In: Proceedings of the Linked Data Workshop at the World Wide Web Conference, Madrid, Spain (2009)Google Scholar
  9. 9.
    Halpin, H., Hayes, P.: When owl: sameas isn’t the same. In: Proceedings of the WWW 2010 Workshop on Linked Data on the Web, Raleigh, USA (April 25, 2010), http://events.linkeddata.org/ldow2010/papers/ldow2010_paper09.pdf
  10. 10.
    Jentzsch, A., Hassanzadeh, O., Bizer, C., Andersson, B., Stephens, S.: Enabling tailored therapeutics with linked data. In: Proceedings of the WWW 2009 Workshop on Linked Data on the Web, April 20th, 2010, Madrid, Spain (April 2009), http://events.linkeddata.org/ldow2009/papers/ldow2009_paper9.pdf
  11. 11.
    Leibniz, G., Loemker, L.: Philosophical papers and letters. Springer, Heidelberg (1976)CrossRefGoogle Scholar
  12. 12.
    McCusker, J., McGuinness, D.: Towards identity in linked data. In: Proceedings of OWL: Experience and Directions, San Francisco, USA (June 21-22, 2010), http://www.webont.org/owled/2010/papers/owled2010_submission_12.pdf
  13. 13.
    Mcguinness, D.L., Silva, P.P.: Explaining answers from the semantic web: The inference web approach. Journal of Web Semantics 1, 397–413 (2004)CrossRefGoogle Scholar
  14. 14.
    Miles, A., Bechhofer, S.: SKOS Simple Knowledge Organization System Reference (2009)Google Scholar
  15. 15.
    Nikolov, A., Uren, V., Motta, E.: Knofuss: a comprehensive architecture for knowledge fusion. In: Proceedings of the 4th International Conference on Knowledge Capture, K-CAP 2007, pp. 185–186. ACM, New York (2007)Google Scholar
  16. 16.
    Pipino, L.L., Lee, Y.W., Wang, R.Y.: Data quality assessment. Communications of the ACM 45(4), 211–218 (2002)CrossRefGoogle Scholar
  17. 17.
    Volz, J., Bizer, C., Gaedke, M., Kobilarov, G.: Discovering and maintaining links on the web of data. In: Bernstein, A., Karger, D.R., Heath, T., Feigenbaum, L., Maynard, D., Motta, E., Thirunarayan, K. (eds.) ISWC 2009. LNCS, vol. 5823, pp. 650–665. Springer, Heidelberg (2009)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Harry Halpin
    • 1
  • Patrick J. Hayes
    • 2
  • James P. McCusker
    • 3
  • Deborah L. McGuinness
    • 3
  • Henry S. Thompson
    • 1
  1. 1.School of InformaticsUniversity of EdinburghEdinburghUK
  2. 2.Institute for Human and Machine CognitionPensacolaUSA
  3. 3.Tetherless World Constellation, Department of Computer ScienceRensselaer Polytechnic InstituteTroyUSA

Personalised recommendations