Datenbank-Spektrum

, Volume 13, Issue 2, pp 89–99 | Cite as

An Overview on Execution Strategies for Linked Data Queries

Schwerpunktbeitrag

Abstract

The publication of Linked Open Data on the Web has gained tremendous momentum over the last five years. This development makes possible (and interesting) the execution of queries using up-to-date data from multiple, automatically discovered data sources. As a result, we currently witness the emergence of a new research area that focuses on an online execution of Linked Data queries; i.e. queries that range over data that is made available using the Linked Data publishing principles.

This article provides a general overview on this new area. In particular, we introduce the specific challenges that need to be addressed and then focus on possible strategies for executing Linked Data queries. Furthermore, we classify approaches proposed in the literature w.r.t. these strategies.

Keywords

Linked data Query execution Survey Comparison Web of data RDF SPARQL 

References

  1. 1.
    Adida B, Birbeck M, McCarron S, Herman I (2012) RDFa core 1.1—syntax and processing rules for embedding RDF through attributes. W3C rec. Available online at: http://www.w3.org/TR/rdfa-core/
  2. 2.
    Batsakis S, Petrakis EGM, Milios E (2009) Improving the performance of focused web crawlers. Data Knowl Eng 68(10):1001–1013 CrossRefGoogle Scholar
  3. 3.
    Berners-Lee T (2006) Design issues: linked data. Available online at: http://www.w3.org/DesignIssues/LinkedData.html
  4. 4.
    Bizer C, Heath T, Berners-Lee T (2009) Linked data—the story so far. Int J Semantic Web Inf Syst 5(3):1–22 CrossRefGoogle Scholar
  5. 5.
    Bouquet P, Ghidini C, Serafini L (2009) Querying the web of data: a formal approach. In: Proc of the 4th Asian semantic web conference (ASWC) Google Scholar
  6. 6.
    Chakrabarti S, van den Berg M, Dom B (1999) Focused crawling: a new approach to topic-specific web resource discovery. Comput Netw 31(11–16):1623–1640 CrossRefGoogle Scholar
  7. 7.
    Clark KG, Feigenbaum L, Torres E (2008) SPARQL protocol for RDF. W3C rec. Available online at: http://www.w3.org/TR/rdf-sparql-protocol/
  8. 8.
    Diligenti M, Coetzee F, Lawrence S, Giles CL, Gori M (2000) Focused crawling using context graphs. In: Proc of the 26th int conf on very large data bases (VLDB) Google Scholar
  9. 9.
    Ding L, Shinavier J, Shangguan Z, McGuinness DL (2010) SameAs networks and beyond: analyzing deployment status and implications of owl: sameAs in linked data. In: Proc of the 9th international semantic web conference (ISWC) Google Scholar
  10. 10.
    Dolog P, Stuckenschmidt H, Wache H, Diederich J (2009) Relaxing RDF queries based on user and domain preferences. J Intell Inf Syst 33(3):239–260 CrossRefGoogle Scholar
  11. 11.
    Görlitz O, Staab S (2011) Federated data management and query optimization for linked open data. In: New directions in web data management 1, pp 109–137 CrossRefGoogle Scholar
  12. 12.
    Harth A, Decker S (2005) Optimized index structures for querying RDF from the web. In: Proc of the 3rd Latin American web congress (LA-web) Google Scholar
  13. 13.
    Harth A, Hose K, Karnstedt M, Polleres A, Sattler K-U, Umbrich J (2010) Data summaries for on-demand queries over linked data. In: Proc of the 19th int conf on world wide web (WWW) Google Scholar
  14. 14.
    Harth A, Speiser S (2012) On completeness classes for query evaluation on linked data. In: Proc of the 26th AAAI conference Google Scholar
  15. 15.
    Hartig O (2011) How caching improves efficiency and result completeness for querying linked data. In: Proc of the 4th linked data on the web workshop (LDOW) Google Scholar
  16. 16.
    Hartig O (2011) Zero-knowledge query planning for an iterator implementation of link traversal based query execution. In: Proc of the 8th extended semantic web conference (ESWC) Google Scholar
  17. 17.
    Hartig O (2012) SPARQL for a web of linked data: semantics and computability. In: Proc of the 9th extended semantic web conference (ESWC) Google Scholar
  18. 18.
    Hartig O, Bizer C, Freytag J-C (2009) Executing SPARQL queries over the Web of Linked Data. In: Proc of the 8th international semantic web conference (ISWC) Google Scholar
  19. 19.
    Hartig O, Freytag J-C (2012) Foundations of traversal based query execution over linked data. In: Proc of the 23rd ACM conference on hypertext and social media (HT) Google Scholar
  20. 20.
    Hartig O, Langegger A (2010) A database perspective on consuming linked data on the web. Datenbank-Spektrum 10(2) Google Scholar
  21. 21.
    Heath T, Bizer C (2011) Linked data: evolving the web into a global data space. Morgan & Claypool Google Scholar
  22. 22.
    Hogan A, Harth A, Umrich J, Kinsella S, Polleres A, Decker S (2012) Searching and browsing linked data with SWSE: the semantic web search engine. Web semantics: science, services and agents on the world wide web 9(4) Google Scholar
  23. 23.
    Hogan A, Mellotte M, Powell G, Stampouli D (2012) Towards fuzzy query-relaxation for RDF. In: Proc of the 9th extended semantic web conference (ESWC) Google Scholar
  24. 24.
    Huang H, Liu C, Zhou X (2012) Approximating query answering on RDF databases. World Wide Web J 15(1):89–114 MathSciNetCrossRefGoogle Scholar
  25. 25.
    Joshi AK, Jain P, Hitzler P, Yeh PZ, Verma K, Sheth AP, Damova M (2012) Alignment-based querying of linked open data. In: Proc of the 11th int conference on ontologies, DataBases, and applications of semantics (ODBASE) Google Scholar
  26. 26.
    Ladwig G, Tran DT (2010) Linked data query processing strategies. In: Proc of the 9th international semantic web conference (ISWC) Google Scholar
  27. 27.
    Ladwig G, Tran DT (2011) SIHJoin: querying remote and local linked data. In: Proc of the 8th extended semantic web conference (ESWC) Google Scholar
  28. 28.
    Makris K, Gioldasis N, Bikakis N, Christodoulakis S (2010) Ontology mapping and SPARQL rewriting for querying federated RDF data sources. In: Proceedings of OTM conferences Google Scholar
  29. 29.
    Mika P, Potter T (2012) Metadata statistics for a large web corpus. In: Proc of the 5th linked data on the web workshop (LDOW) Google Scholar
  30. 30.
    Miranker DP, Depena RK, Jung H, Sequeda JF, Reyna C (2012) Diamond: a SPARQL query engine, for linked data based on the Rete match. In: Proc of the workshop on artificial intelligence meets the web of data (AImWD) Google Scholar
  31. 31.
    Mühleisen H, Bizer C (2012) Web data commons—extracting structured data from two large web corpora. In: Proc of the 5th linked data on the web workshop (LDOW) Google Scholar
  32. 32.
    Neumann T, Weikum G (2008) RDF-3X: a RISC-style engine for RDF. In: Proc of the 34th int conf on very large data bases (VLDB) Google Scholar
  33. 33.
    Paret E, Van Woensel W, Casteleyn S, Signer B, De Troyer O (2011) Efficient querying of distributed RDF sources in mobile settings based on a source index model. In: Procedia CS Google Scholar
  34. 34.
    Prud’hommeaux E, Buil-Aranda C (2013) SPARQL 1.1 federated query. W3C rec. Available online at: http://www.w3.org/TR/sparql11-federated-query/
  35. 35.
    Prud’hommeaux E, Seaborne A (2008) SPARQL query language for RDF. W3C rec. Available online at: http://www.w3.org/TR/rdf-sparql-query/
  36. 36.
    Schmedding F (2011) Incremental SPARQL evaluation for query answering on linked data. In: Proc of the 2nd int workshop on consuming linked data (COLD) Google Scholar
  37. 37.
    Schneider M (2012) OWL 2 web ontology language, RDF-based semantics, 2nd edn. W3C rec. Available online at: http://www.w3.org/TR/owl2-rdf-based-semantics/
  38. 38.
    Tian Y, Umbrich J, Yu Y (2011) Enhancing source selection for live queries over linked data via query log mining. In: Proc of the joint int semantic technology conference (JIST) Google Scholar
  39. 39.
    Umbrich J, Hogan A, Polleres A, Decker S (2012) Improving the recall of live linked data querying through reasoning. In: Proc of the 6th int conference on web reasoning and rule systems (RR) Google Scholar
  40. 40.
    Umbrich J, Hose K, Karnstedt M, Harth A, Polleres A (2011) Comparing data summaries for processing live queries over linked data. World Wide Web J 14(5–6):495–544 CrossRefGoogle Scholar
  41. 41.
    Wagner A, Tran T, Ladwig G, Harth A (2012) Top-k linked data query processing. In: Proc of the 9th extended semantic web conference (ESWC) Google Scholar
  42. 42.
    Weiss C, Karras P, Bernstein A (2008) Hexastore: sextuple indexing for semantic web data management. In: Proc of the 34th international conference on very large data bases (VLDB) Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  1. 1.David R. Cheriton School of Computer ScienceUniversity of WaterlooWaterlooCanada

Personalised recommendations