Abstract
During recent years an increasing number of data providers adopted the Linked Data principles for publishing and connecting structured data on the Web, thus creating a globally distributed dataspace—the Web of Data. While the execution of structured, SQL-like queries over this dataspace opens possibilities not conceivable before, query execution on the Web of Data poses novel challenges. These challenges provide great opportunities for the database community.
In this article we introduce the concept of Linked Data and discuss different approaches to query the Web of Data. Our goal is to provide a general understanding of this new research area and of the challenges and open issues that must be addressed.
Similar content being viewed by others
Notes
For current statistics we refer to the Wiki of the Semantic Web Interest Group at http://esw.w3.org/topic/TaskForces/CommunityProjects/LinkingOpenData/DataSets/Statistics.
References
Abadi DJ, Marcus A, Madden SR, Hollenbach K (2009) Sw-store: a vertically partitioned dbms for semantic web data management. VLDB J 18(2):385–406
Alexander K, Cyganiak R, Hausenblas M, Zhao J (2009) Describing linked datasets. In Proceedings of the 2nd linked data on the web workshop (LDOW) at WWW
Berners-Lee T Design issues: linked data. Online at http://www.w3.org/DesignIssues/LinkedData.html
Berners-Lee T, Chen Y, Chilton L, Connolly D, Dhanaraj R, Hollenbach J, Lerer A, Sheets D (2006) Tabulator: exploring and analyzing linked data on the semantic web. In Proceedings of the 3rd semantic web user interaction workshop (SWUI) at ISWC, Nov. 2006
Berners-Lee T, Fielding R, Masinter L (2005) Uniform resource identifier (URI): Generic syntax. RFC 3986
Bizer C, Heath T, Ayers D, Raymond Y (2007) Linking open data. In Proceedings of the poster session at the 4th European semantic web conference
Bizer C, Heath T, Berners-Lee T (2009) Linked data—the story so far. Int J Sem Web Inform Syst 5(3):1–22. Special issue on linked data
Bouquet P, Ghidini C, Serafini L (2009) Querying the web of data: a formal approach. In Proceedings of the 4th Asian semantic web conference (ASWC)
Brickley D, Guha RV (2004) RDF vocabulary description language 1.0: RDF schema. W3C Recommendation, Feb. 2004. Online at http://www.w3.org/TR/rdf-schema/
Carroll JJ, Bizer C, Hayes PJ, Stickler P (2005) Named graphs. J Web Sem 3(4):247–267
Chaudhuri S, Dayal U (1997) An overview of data warehousing and olap technology. SIGMOD Rec 26(1):65–74
Cheng G, Qu Y (2009) Searching linked objects with falcons: approach, implementation and evaluation. Int J Sem Web Inform Syst 5(3):49–70. Special issue on linked data
Fielding R, Gettys J, Mogul J, Frystyk H, Masinter L, Leach P, Berners-Lee T (1999) Hypertext transfer protocol—HTTP/1.1. RFC 2616
Hartig O, Bizer C, Freytag J-C (2009) Executing SPARQL queries over the web of linked data. In Proceedings of the 8th International semantic web conference (ISWC), Nov. 2009
Hartig O, Mühleisen H, Freytag J-C (2009) Linked data for building a map of researchers. In Proceedings of 5th workshop on scripting and development for the semantic web (SFSW) at ESWC, June 2009
Heath T (2008) How will we interact with the web of data? IEEE Internet Comput 12(5):88–91
Klyne G, Carroll JJ (2004) Resource description framework (RDF): concepts and abstract syntax. W3C Recommendation, Feb. 2004. Online at http://www.w3.org/TR/rdf-concepts/
Kobilarov G, Scott T, Raimond Y, Oliver S, Sizemore C, Smethurst M, Bizer C, Lee R (2009) Media meets semantic web—how the bbc uses dbpedia and linked data to make connections. In Proceedings of the 6th European semantic web conference (ESWC), June 2009
Kossmann D (2000) The state of the art in distributed query processing. ACM Comput Surv 32(4):422–469
Langegger A (2010) A flexible architecture for virtual information integration based on semantic web concepts. Dissertation, J. Kepler University Linz, January 2010
Langegger A, WößW (2009) RDFStats—an extensible rdf statistics generator and library. In Proceedings of the international workshop on database and expert systems applications (DEXA)
Mendelzon AO, Milo T (1998) Formal models of web queries. Inform Syst 23(8):615–637
Neumann T, Weikum G (2008) Rdf-3x: a risc-style engine for rdf. In Proceedings of the 34th international conference on very large data bases (VLDB)
Oren E, Delbru R, Catasta M, Cyganiak R, Stenzhorn H, Tummarello G (2008) Sindice.com: a document-oriented lookup index for open linked data. Int J Metadata Sem Ontol 3(1):37–52
Pérez J, Arenas M, Gutierrez C (2006) Semantics and complexity of sparql. In: Cruz I, Decker S, Allemang D, Preist C, Schwabe D, Mika P, Uschold M, Aroyo L (eds) 5th international semantic web conference, Athens, USA. LNCS, vol 4273. Springer, Berlin/Heidelberg
Prud’hommeaux E (2007) Case study: federate for drug research. Online at http://www.w3.org/2004/10/04-pharmaFederate/
Prud’hommeaux E, Seaborne A (2008) SPARQL query language for RDF. W3C Recommendation, Jan. 2008. Online at http://www.w3.org/TR/rdf-sparql-query/
Quilitz B, Leser U (2008) Querying distributed RDF data sources with SPARQL. In: Proceedings of the 5th European semantic web conference (ESWC). Lecture Notes in Computer Science, vol 5021. Springer, Berlin, pp 524–538
Sheth AP, Larson JA (1990) Federated database systems for managing distributed, heterogeneous, and autonomous databases. ACM Comput Surv 22(3):183–236
Umbrich J, Hausenblas M, Hogan A, Polleres A, Decker S (2010) Towards dataset dynamics: change frequency of linked open data sources. In Proceedings of the 3rd linked data on the web workshop (LDOW) located at the 19th international world wide web conference (WWW)
W3C OWL Working Group (2009) OWL 2 web ontology language—document overview. W3C Recommendation, Oct. 2009. Online at http://www.w3.org/TR/owl-overview
Weiss C, Karras P, Bernstein A (2008) Hexastore: sextuple indexing for semantic web data management. In Proceedings of the 34th international conference on very large data bases (VLDB)
Widom J (1995) Research problems in data warehousing. In Proceedings of the international conference on information and knowledge management (CIKM), pp 25–30
Williams GT (2010) PARQL 1.1 service description. W3C Working Draft, Jan. 2010. Online at http://www.w3.org/TR/sparql11-service-description/
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Hartig, O., Langegger, A. A Database Perspective on Consuming Linked Data on the Web. Datenbank Spektrum 10, 57–66 (2010). https://doi.org/10.1007/s13222-010-0021-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13222-010-0021-7