Reliable and Persistent Identification of Linked Data Elements

  • Chapter
Linking Enterprise Data

Abstract

Linked Data techniques rely upon common terminology in a manner similar to a relational database's reliance on a schema. Linked Data terminology anchors metadata descriptions and facilitates navigation of information. Common vocabularies ease the human, social tasks of understanding datasets sufficiently to construct queries and help to relate otherwise disparate datasets. When using the Resource Description Framework, vocabulary terms must be grounded in URIs. A current best practice on the World Wide Web is to serve vocabulary terms as Uniform Resource Locators (URLs) and present both human-readable and machine-readable representations to the public. Linked Data terminology published to the World Wide Web may be used by others without reference or notification to the publishing party. That presents a problem: vocabulary publishers take on an implicit responsibility to maintain and publish their terms via the URLs originally assigned, regardless of the inconvenience such a responsibility may cause. Over the course of years, people change jobs, publishing organizations change Internet domain names, computers change IP addresses, and systems administrators publish old material in new ways. Clearly, a mechanism is required to manage Web-based vocabularies over the long term. This chapter places Linked Data vocabularies in context with the wider concepts of metadata in general and specifically metadata on the Web. Persistent identifier mechanisms are reviewed, with a particular emphasis on Persistent URLs, or PURLs. PURLs and PURL services are discussed in the context of Linked Data. Finally, historic weaknesses of PURLs are resolved by the introduction of a federation of PURL services to address needs specific to Linked Data.

Author information

Corresponding author

Correspondence to David Wood.

Copyright information

© 2010 Springer Science+Business Media, LLC

About this chapter

Cite this chapter

Wood, D. (2010). Reliable and Persistent Identification of Linked Data Elements. In: Wood, D. (ed.) Linking Enterprise Data. Springer, Boston, MA. https://doi.org/10.1007/978-1-4419-7665-9_8

  • DOI: https://doi.org/10.1007/978-1-4419-7665-9_8

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4419-7664-2

  • Online ISBN: 978-1-4419-7665-9

  • eBook Packages: Computer Science, Computer Science (R0)
