Skip to main content

Storing the Semantic Web: Repositories

  • Reference work entry
Handbook of Semantic Web Technologies

Abstract

Semantic repositories are database management systems, capable of handling structured data, taking into consideration their semantics. The Semantic Web represents the next-generation Web of Data, where information is published and interlinked in a way, which facilitates both humans and machines to exploit its structure and meaning. To foster the realization of the Semantic Web, the World Wide Web Consortium (W3C) developed a series of metadata, ontology, and query languages for it. Following the enthusiasm about the Semantic Web and the wide adoption of the related standards, today, most of the semantic repositories are database engines, which deal with data represented in RDF, support SPARQL queries, and can interpret schemas and ontologies represented in RDFS and OWL. Naturally, such engines take the role of Web servers of the Semantic Web.

This chapter starts with an introduction to semantic repositories and discussion on their links to several other technology trends, including relational databases, column-stores, and expert systems. As the most distinguishing quality of the semantic repositories is reasoning, an overview of the strategies for the integration of inference in the data management life cycle is presented. An overall view of the mechanics of the engines is provided from the perspective of a conceptual framework that reveals all their tasks and activities (e.g., storage and retrieval) along with the factors that impact their performance (e.g., data size and complexity). A review of several design issues, including distribution, serves as a basis for understanding the different implementation approaches and their implications on the performance of semantic repositories. Several of the most popular benchmarks and datasets, which are often used as measuring sticks for the performance of the engines, and few of the outstanding semantic repositories, are presented along with the best published evaluation results.

The advantages and the typical applications of semantic repositories are presented focusing on two usage scenarios: reasoning with and the management of linked data (a popular trend in the Semantic Web) and enterprise data integration. The chapter ends with some considerations regarding the future development of semantic repositories and design topics like adaptive indexing and interoperability patterns.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 499.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 549.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Aasman, J.: AllegroGraph 4.0 – industry's first real time RDF store. Presentation at Semantic Technologies Conference (SemTech 2009), San Jose (2009)

    Google Scholar 

  2. Andersson, B., Momtchev, V.: LarKC requirements summary and data repository. LarKC project deliverable D7a.1.1. http://www.larkc.eu/wp-content/uploads/2008/01/larkc_d7a-11_requirements-summary-and-data-repository_m6.pdf (2008)

  3. Baader, F., Calvanese, D., McGuinness, D., Nardi, D., Patel-Scheider, P.: The Description Logic Handbook. Theory, Implementation, Applications. Cambridge University Press, Cambridge (2003)

    MATH  Google Scholar 

  4. Berners-Lee, T., Fielding R., Masinter L.: Uniform Resource Identifier (URI): generic syntax. Network Working Group, Request for comments: 3986. http://tools.ietf.org/html/rfc3986 (2005). Accessed Jan 2005

  5. Berners-Lee, T.: Design issues: linked data. http://www.w3.org/DesignIssues/LinkedData.html (2006)

  6. Bizer, Ch., Schultz, A.: Benchmarking the performance of storage systems that expose SPARQL endpoints. In: Proceedings of the Fourth International Workshop on Scalable Semantic Web knowledge Base Systems (SSWS 2008), Karlsruhe (2008)

    Google Scholar 

  7. Bizer, Ch., Schultz, A.: Berlin SPARQL benchmark results. Document version 1.2. http://www4.wiwiss.fu-berlin.de/bizer/BerlinSPARQLBenchmark/results/index.html (2009). Accessed 23 Mar 2009

  8. Bizer, Ch., Schultz, A.: BSBM results for Virtuoso, Jena TDB, BigOWLIM. http://www4.wiwiss.fu-berlin.de/bizer/BerlinSPARQLBenchmark/results/V5/index.html (2009). Accessed 30 Nov 2009

  9. Brewer, E.: Towards robust distributed systems. Keynote at Proceedings of the 19th Annual ACM Symposium on Principles of Distributed Computing (PODC 2000), Portland. http://www.cs.berkeley.edu/~brewer/cs262b-2004/PODC-keynote.pdf (2000)

  10. Brickley, D., Guha, R.V. (eds.): Resource Description Framework (RDF) schemas. W3C Recommendation. http://www.w3.org/TR/rdf-schema/ (2004). Accessed 10 Feb 2004

  11. Carroll, J.J., Bizer, B., Hayes, P., Stickler, P.: Named graphs, provenance and trust. In: Proceedings of the 14th International conference on World Wide Web (WWW 2005), Chiba. http://www2005.org/cdrom/docs/p613.pdf (2005)

  12. Chang, F., Dean, J., Ghemawat, S., Hsieh, W.C., Wallach, D.A., Burrows, M., Chandra, T., Fikes, A., Gruber, R.E.: Bigtable: a distributed storage system for structured data. In: Proceedings of the Seventh Symposium on Operating Systems Design and Implementation (OSDI 2006), Seattle. http://labs.google.com/papers/bigtable.html (2006)

  13. Bechofer, S., van Harmelen, F., Hendler, J., Horrocks, I., McGuinness, D.L., Patel-Schneider, P.F., Stein, L.A.: In: Dean, M., Schreiber, G. (eds.), OWL web ontology language reference, W3C Recommendation. http://www.w3.org/TR/owl-ref/ (Feb 2004)

  14. Erling, O.: LUBM and Virtuoso. OpenLink software product blog. http://www.openlinksw.com/dataspace/oerling/weblog/Orri%20Erling's%20Blog/1562 (2009). Accessed 24 July 2009

  15. Erling, O., Mikhailov, I.: Towards web-scale RDF. http://virtuoso.openlinksw.com/dataspace/dav/wiki/Main/VOSArticleWebScaleRDF (2009)

  16. Erling, O.: Directions and challenges for Semdata. In: Proceedings of the Semantic Data Management (SemData 2010) Workshop at the 36th International Conference on Very Large Data Bases (VLDB 2010), Singapore (2010)

    Google Scholar 

  17. Fischer, F., Keller, U., Kiryakov, A., Huang, Z., Momtchev, V., Simperl, E.: Initial knowledge representation formalism. LarKC project deliverable D1.1.3. http://www.larkc.eu/wp-content/uploads/2008/01/larkc_d113-initial-knowledge-representation-formalism_m7.pdf (2008)

  18. Franz Inc.: AllegroGraph RDFStore 4.0. AllegroGraph product information. http://www.franz.com/agraph/allegrograph/ (2010). Accessed 22 Aug 2010

  19. Franz Inc.: AllegroGraph RDFStore version 4.0 LUBM benchmark results. http://www.franz.com/agraph/allegrograph/agraph_bench_lubm.lhtml (2010). Accessed 22 Aug 2010

  20. Franz Inc.: RDFS++ dynamic materialization. http://www.franz.com/agraph/allegrograph/dynamic-materialization.lhtml (2010). Accessed 22 Aug 2010

  21. Grosof, B.N., Horrocks, I., Volz, R., Decker, S.: Description logic programs: combining logic programs with description logic. In: Proceedings of the 12th Internationl Conference on world Wide Web (WWW 2003), Budapest (2003)

    Google Scholar 

  22. Guo, Y., Pan, Z., Heflin, J.: An evaluation of knowledge base systems for large OWL datasets. J. Web Semant. 3(2), 158–182 (2004)

    Article  Google Scholar 

  23. Harris, S., Lamb, N., Shadbolt, N.: 4store: the design and implementation of a clustered RDF store. In: Proceedings of the Fifth International Workshop on Scalable Semantic Web Knowledge Base Systems (SSWS 2009) at the Eighth International Semantic Web Conference (ISWC 2009), Chantilly. Lecture Notes in Computer Science, vol. 5823. Springer, Berlin (2009)

    Google Scholar 

  24. Harth, A., Umbrich, J., Hogan, A., Decker, S.: YARS2: a federated repository for querying graph structured data from the web. In: Proceedings of the Sixth International Semantic Web Conference (ISWC 2007), Busan. Lecture Notes in Computer Science, vol. 4825, pp. 211–224. Springer, Berlin (2007)

    Google Scholar 

  25. Harth, A: SWSE/YARS@SemData Sofia 2010. SemData roundtable, Sofia. http://semdata.org/sites/default/files/harth-swse-sofia-2010.pdf (2010). Accessed 12 Mar 2010

  26. Hayes, P.: RDF semantics, W3C Recommendation. http://www.w3.org/TR/2004/REC-rdf-mt-20040210/ (Feb 2004)

  27. Hogan, A., Harth, A., Umbrich, J., Kinsella, S., Polleres, A., Decker, S.: Searching and browsing linked data with SWSE: the semantic web search engine. DERI technical report 2010-07-23. http://www.deri.ie/fileadmin/documents/DERI-TR-2010-07-23.pdf (2010)

  28. Horrocks, I., Patel-Schneider, P.F., Bechhofer, S., Tsarkov, D.: OWL rules: a proposal and prototype implementation. J. Web Semant. 3, 23–40 (2005)

    Article  Google Scholar 

  29. Jacobs, A.: The pathologies of big data. Commun. ACM 52(8), 36–44 (2009)

    Article  Google Scholar 

  30. Jupp, S., Bechhofer, S., Kostkova, P., Stevens, R., Yesilada, Y.: Document navigation: ontologies or knowledge organisation systems? In: Proceedings of the Seventh International Workshop on Network Tools and Applications in Biology (NETTAB 2007), Pisa. A Semantic Web for Bioinformatics: Goals, Tools, Systems, Applications (2007)

    Google Scholar 

  31. Kerrigan, M., Bradesko, L., Fortuna B.: Rapid prototype of the LarKC. LarKC project deliverable D5.2.1. http://www.larkc.eu/wp-content/uploads/2008/01/larkc_d521_rapid-prototype-of-the-larkc.pdf (2009)

  32. Kiryakov, A.: Measurable targets for scalable reasoning. LarKC project deliverable D5.5.1, formerly titled “Definition of validation goals for the prototyping phase”. http://www.larkc.eu/wp-content/uploads/2008/07/larkc_d551.pdf (2008)

  33. Kiryakov, A., Tashev, Z., Ognyanoff, D., Velkov, R., Momtchev, V., Balev, B., Peikov, I.: Validation goals and metrics for the LarKC platform. LarKC project deliverable D5.5.2. http://www.larkc.eu/deliverables/ (2009)

  34. Kiryakov, A., Ognyanoff, D., Velkov, R., Tashev, Z., Peikov, I.: LDSR: materialized reasonable view to the web of linked data. In: Proceedings of the Third International Symposium on Rules, Applications and Interoperability (RuleML 2009), Las Vegas. Lecture Notes in Computer Science, vol. 5858. Springer, Berlin (2009)

    Google Scholar 

  35. Kiryakov, A., Momtchev, V.: Triplesets: tagging and grouping in RDF datasets. W3C Workshop “RDF Next Steps”, Stanford (June 2010)

    Google Scholar 

  36. Kiryakov, A., Bishop, B., Ognyanoff, D., Peikov, D., Tashev, Z., Velkov, R.: The features of BigOWLIM that enabled the BBC's World Cup website. In: Proceedings of the Semantic Data Management (SemData 2010) Workshop at the 36th International Conference on Very Large Data Bases (VLDB 2010), Singapore (2010)

    Google Scholar 

  37. Kotoulas, S., Oren, E., Van Harmelen, F.: Mind the data skew: distributed inferencing by speeddating in elastic regions. In: Proceedings of the 19th International Conference on World Wide Web (WWW 2010), Raleigh (2010)

    Google Scholar 

  38. Heath. T.: Linked data – connect distributed data on the web. http://linkeddata.org/ (2007)

  39. World Wide Web Consortium (W3C): Linking open data. W3C SWEO community project. http://esw.w3.org/topic/SweoIG/TaskForces/CommunityProjects/LinkingOpenData/ (2007)

  40. Ma, L., Yang, Y., Qiu, Z., Xie, G., Pan, Y.: Towards a complete OWL ontology benchmark. In: Proceedings of the Third European Semantic Web Conference (ESWC 2006), Budva. Lecture Notes in Computer Science, vol. 4011, pp. 125–139. Springer, Berlin (2006)

    Google Scholar 

  41. Manola, F., Miller, E. (eds.): RDF primer, W3C Recommendation. http://www.w3.org/TR/rdf-primer/ (Feb 2004)

  42. McGuinness, D.L., van Harmelen, F. (eds.): OWL web ontology language. Overview. W3C Recommendation. http://www.w3.org/TR/owl-features/ (2004)

  43. Momtchev, V., Kiryakov, A.: Second generation ontology representation and data integration (ORDI) framework, specification. Ontotext technical documentation. http://www.ontotext.com/ordi/ORDI_SG/ORDI_SG_Specification.pdf (2006). Accessed 13 Oct 2006

  44. Momtchev, V., Peychev, D., Primov, T., Georgiev, G.: Expanding the pathway and interaction knowledge in linked life data. In: Proceedings of the International Semantic Web Challenge (ISWC 2009), Chantilly. Lecture Notes in Computer Science, vol. 5823. Springer, Berlin. http://challenge.semanticweb.org/ (2009)

  45. Motik, B., Cuenca Grau, B., Horrocks, I., Wu, Z., Fokoue, A., Lutz, C. (eds.): OWL 2 web ontology language profiles, W3C Candidate Recommendation. http://www.w3.org/TR/owl2-profiles/ (Feb 2009)

  46. Motik, B., Sattler, U., Studer, R.: Query answering for OWL-DL with rules. J. Web Semant. 3(1), 41–60 (2005)

    Article  Google Scholar 

  47. North, K.: The NoSQL alternative. Dr. Dobb’s J. http://www.drdobbs.com/database/224900500. Accessed 21 May 2010

  48. O'Donovan, J.: The World Cup and a call to action around linked data. BBC blog post. http://www.bbc.co.uk/blogs/bbcinternet/2010/07/the_world_cup_and_a_call_to_ac.html. Accessed 9 July 2010

  49. ONTOCORE: Ontology logic and reasoning at semantic Karlsruhe. Home page, http://logic.aifb.uni-karlsruhe.de/

  50. Ontotext Lab: RDF(S), rules and OWL dialects. http://www.ontotext.com/inference/rdfs_rules_owl.html. Accessed Dec 2009 (2009)

  51. Oren, E., Kotoulas, S., Anadiotis, G., Siebes, R., Ten Teije, A., Van Harmelen, F.: MARVIN: distributed reasoning over large-scale semantic web data. J. Web Semant. 7(4), 305–316 (2009)

    Article  Google Scholar 

  52. PricewaterhouseCoopers: Spring of the data web. Technology Forecast. A Quart. J. Spring 2009. http://www.pwc.com/techforecast/ (2009)

  53. Prud'hommeaux, E., Seaborne, A.: SPARQL query language for RDF, W3C Recommendation. http://www.w3.org/TR/rdf-sparql-query/ (Jan 2008)

  54. Rayfield, J.: BBC World Cup 2010 dynamic semantic publishing, BBC blog post. http://www.bbc.co.uk/blogs/bbcinternet/2010/07/bbc_world_cup_2010_dynamic_sem.html (2010)

  55. Reed, D.P.: Naming and synchronization in a decentralized computer system. MIT dissertation, http://portal.acm.org/citation.cfm?id=889815 (1978)

  56. Ruhloff, K., Dean, M., Emmons, I., Ryder, D., Sumner, J.: An evaluation of triple-store technologies for large data stores. In: Proceedings of the Scalable Semantic Systems Workshop (SSSW 2007), Portugal (2007)

    Google Scholar 

  57. Salvadores, M., Correndo, G., Omitola, T., Gibbins, N., Harris, S., Shadbolt, N.: 4s-reasoner: RDFS backward chained reasoning support in 4store. In: Proceedings of the 2010 International Workshop on Web-Scale Knowledge Representation, Retrieval, and Reasoning (Web-KR3 2010), Toronto (2010)

    Google Scholar 

  58. Stoilos G., Grau B.C., Horrocks I.: How incomplete is your semantic web reasoner? In: Proceedings of the 20th National Conference on Artificial Intelligence (AAAI 10), Atlanta (2010)

    Google Scholar 

  59. Swiss Institute of Bioinformatics. UniProt RDF, http://dev.isb-sib.ch/projects/uniprot-rdf/ (2009)

  60. SYSTAP LLC: Bigdata: approaching web scale for the semantic web. http://www.bigdata.com/whitepapers/bigdata_whitepaper_07-08-2009.pdf (2009)

  61. SYSTAP LLC: Bigdata overview. SemData roundtable, Sofia. http://semdata.org/sites/default/files/bigdata-sofia-roundtable-3-10-2010.pdf (2010). Accessed 12 Mar 2010

  62. ter Horst, H. J.: Combining RDF and part of OWL with rules: semantics, decidability, complexity. In: Proceedings of the Fourth International Semantic Web Conference (ISWC 2005), Galway. Lecture Notes in Computer Science, vol. 3729, pp. 668–684. Springer, Heidelberg (2005)

    Google Scholar 

  63. Thakker, D., Osman, T., Gohil, S., Lakin, P.: A pragmatic approach to semantic repositories benchmarking. In: Proceedings of the Seventh Extended Semantic Web Conference (ESWC 2010), Heraklion (2010)

    Google Scholar 

  64. Todorova, P., Kiryakov, A., Ognyanoff, D., Peikov, I., Velkov, R., Tashev, Z.: Spreading activation components. LarKC project deliverable D2.4.1 (2009)

    Google Scholar 

  65. Urbani, J., Kotoulas, S., Maassen, J., Van Harmelen, F., Bal, H.: OWL reasoning with WebPIE: calculating the closure of 100 billion triples. In: Proceedings of the Seventh Extended Semantic Web Conference (ESWC 2010), Heraklion. Lecture Notes in Computer Science, vol. 6088, pp. 213–227. Springer, Berlin (2010)

    Google Scholar 

  66. Vertica: The vertica database datasheet. Product overview. http://www.vertica.com/_pdf/VerticaDatabaseDataSheet.pdf (2010). Accessed Jan 2010

  67. White, A. (Gartner): Semantic web moving ever close to the ‘Semantic Enterprise?’ http://blogs.gartner.com/andrew_white/2009/04/30/semantic-web-moving-ever-close-to-the-semantic-enterprise/ (2009)

  68. World Wide Web Consortium (W3C): Linking open data. W3C SWEO community project home page. http://esw.w3.org/topic/SweoIG/TaskForces/CommunityProjects/LinkingOpenData (2010). Accessed Jan 2010

  69. Wu, A., Wu, Z., Kolovski, V.: An enterprise inference engine inside Oracle Database 11g Release 2. Presentation at Semantic Technology Conference (SemTech 2010), San Francisco (2010)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Atanas Kiryakov .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this entry

Cite this entry

Kiryakov, A., Damova, M. (2011). Storing the Semantic Web: Repositories. In: Domingue, J., Fensel, D., Hendler, J.A. (eds) Handbook of Semantic Web Technologies. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-92913-0_7

Download citation

Publish with us

Policies and ethics