Linked-Data Aware URI Schemes for Referencing Text Fragments

  • Sebastian Hellmann
  • Jens Lehmann
  • Sören Auer
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7603)

Abstract

The NLP Interchange Format (NIF) is an RDF/OWL-based format that aims to achieve interoperability between Natural Language Processing (NLP) tools, language resources and annotations. The motivation behind NIF is to allow NLP tools to exchange annotations about text documents in RDF. Hence, the main prerequisite is that parts of the documents (i.e. strings) are referenceable by URIs, so that they can be used as subjects in RDF statements. In this paper, we present two NIF URI schemes for different use cases and evaluate them experimentally by benchmarking the stability of both NIF URI schemes in a Web annotation scenario. Additionally, the schemes are compared with other available schemes used to address text with URIs. The String Ontology, which is the basis for NIF, fixes the referent (i.e. a string in a given text) of the URIs unambiguously for machines and thus enables the creation of heterogeneous, distributed and loosely coupled NLP applications, which use the Web as an integration platform.
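
To illustrate the prerequisite named above (a string inside a document gets its own URI, which can then serve as the subject of an RDF statement), here is a minimal sketch. It is not the NIF URI syntax itself, which the abstract does not spell out. As an assumption, it borrows the character-offset fragment form `#char=begin,end` from RFC 5147, and the predicate `http://example.org/vocab#anchorOf`, the document URI and both function names are hypothetical placeholders.

```python
from urllib.parse import urldefrag


def fragment_uri(document_uri: str, text: str, begin: int, end: int) -> str:
    """Return a URI addressing text[begin:end] by character offsets.

    Assumption: RFC 5147-style fragment ("char=begin,end"); the actual
    NIF schemes may be spelled differently.
    """
    if not 0 <= begin <= end <= len(text):
        raise ValueError("offsets out of range for the given text")
    base, _ = urldefrag(document_uri)  # drop any fragment already present
    return f"{base}#char={begin},{end}"


def anchor_triple(document_uri: str, text: str, begin: int, end: int) -> str:
    """Serialize one N-Triples statement whose subject is the fragment URI.

    The predicate below is a hypothetical placeholder, not a NIF term.
    """
    subject = fragment_uri(document_uri, text, begin, end)
    # Escape backslashes, quotes and newlines for an N-Triples literal.
    literal = (
        text[begin:end]
        .replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
    )
    return f'<{subject}> <http://example.org/vocab#anchorOf> "{literal}" .'


if __name__ == "__main__":
    doc = "Berlin is the capital of Germany."
    print(anchor_triple("http://example.org/doc.txt", doc, 0, 6))
```

A purely offset-based URI like this one stops pointing at the intended string as soon as the underlying text changes, which is the kind of stability question the benchmark mentioned in the abstract addresses.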

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Sebastian Hellmann (1)
  • Jens Lehmann (1)
  • Sören Auer (2)
  1. IFI/BIS/AKSW, Universität Leipzig, Leipzig, Germany
  2. Informatik/ISST, Technische Universität Chemnitz, Chemnitz, Germany
