Abstract
Semantic repositories are database management systems, capable of handling structured data, taking into consideration their semantics. The Semantic Web represents the next-generation Web of Data, where information is published and interlinked in a way, which facilitates both humans and machines to exploit its structure and meaning. To foster the realization of the Semantic Web, the World Wide Web Consortium (W3C) developed a series of metadata, ontology, and query languages for it. Following the enthusiasm about the Semantic Web and the wide adoption of the related standards, today, most of the semantic repositories are database engines, which deal with data represented in RDF, support SPARQL queries, and can interpret schemas and ontologies represented in RDFS and OWL. Naturally, such engines take the role of Web servers of the Semantic Web.
This chapter starts with an introduction to semantic repositories and discussion on their links to several other technology trends, including relational databases, column-stores, and expert systems. As the most distinguishing quality of the semantic repositories is reasoning, an overview of the strategies for the integration of inference in the data management life cycle is presented. An overall view of the mechanics of the engines is provided from the perspective of a conceptual framework that reveals all their tasks and activities (e.g., storage and retrieval) along with the factors that impact their performance (e.g., data size and complexity). A review of several design issues, including distribution, serves as a basis for understanding the different implementation approaches and their implications on the performance of semantic repositories. Several of the most popular benchmarks and datasets, which are often used as measuring sticks for the performance of the engines, and few of the outstanding semantic repositories, are presented along with the best published evaluation results.
The advantages and the typical applications of semantic repositories are presented focusing on two usage scenarios: reasoning with and the management of linked data (a popular trend in the Semantic Web) and enterprise data integration. The chapter ends with some considerations regarding the future development of semantic repositories and design topics like adaptive indexing and interoperability patterns.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Aasman, J.: AllegroGraph 4.0 – industry's first real time RDF store. Presentation at Semantic Technologies Conference (SemTech 2009), San Jose (2009)
Andersson, B., Momtchev, V.: LarKC requirements summary and data repository. LarKC project deliverable D7a.1.1. http://www.larkc.eu/wp-content/uploads/2008/01/larkc_d7a-11_requirements-summary-and-data-repository_m6.pdf (2008)
Baader, F., Calvanese, D., McGuinness, D., Nardi, D., Patel-Scheider, P.: The Description Logic Handbook. Theory, Implementation, Applications. Cambridge University Press, Cambridge (2003)
Berners-Lee, T., Fielding R., Masinter L.: Uniform Resource Identifier (URI): generic syntax. Network Working Group, Request for comments: 3986. http://tools.ietf.org/html/rfc3986 (2005). Accessed Jan 2005
Berners-Lee, T.: Design issues: linked data. http://www.w3.org/DesignIssues/LinkedData.html (2006)
Bizer, Ch., Schultz, A.: Benchmarking the performance of storage systems that expose SPARQL endpoints. In: Proceedings of the Fourth International Workshop on Scalable Semantic Web knowledge Base Systems (SSWS 2008), Karlsruhe (2008)
Bizer, Ch., Schultz, A.: Berlin SPARQL benchmark results. Document version 1.2. http://www4.wiwiss.fu-berlin.de/bizer/BerlinSPARQLBenchmark/results/index.html (2009). Accessed 23 Mar 2009
Bizer, Ch., Schultz, A.: BSBM results for Virtuoso, Jena TDB, BigOWLIM. http://www4.wiwiss.fu-berlin.de/bizer/BerlinSPARQLBenchmark/results/V5/index.html (2009). Accessed 30 Nov 2009
Brewer, E.: Towards robust distributed systems. Keynote at Proceedings of the 19th Annual ACM Symposium on Principles of Distributed Computing (PODC 2000), Portland. http://www.cs.berkeley.edu/~brewer/cs262b-2004/PODC-keynote.pdf (2000)
Brickley, D., Guha, R.V. (eds.): Resource Description Framework (RDF) schemas. W3C Recommendation. http://www.w3.org/TR/rdf-schema/ (2004). Accessed 10 Feb 2004
Carroll, J.J., Bizer, B., Hayes, P., Stickler, P.: Named graphs, provenance and trust. In: Proceedings of the 14th International conference on World Wide Web (WWW 2005), Chiba. http://www2005.org/cdrom/docs/p613.pdf (2005)
Chang, F., Dean, J., Ghemawat, S., Hsieh, W.C., Wallach, D.A., Burrows, M., Chandra, T., Fikes, A., Gruber, R.E.: Bigtable: a distributed storage system for structured data. In: Proceedings of the Seventh Symposium on Operating Systems Design and Implementation (OSDI 2006), Seattle. http://labs.google.com/papers/bigtable.html (2006)
Bechofer, S., van Harmelen, F., Hendler, J., Horrocks, I., McGuinness, D.L., Patel-Schneider, P.F., Stein, L.A.: In: Dean, M., Schreiber, G. (eds.), OWL web ontology language reference, W3C Recommendation. http://www.w3.org/TR/owl-ref/ (Feb 2004)
Erling, O.: LUBM and Virtuoso. OpenLink software product blog. http://www.openlinksw.com/dataspace/oerling/weblog/Orri%20Erling's%20Blog/1562 (2009). Accessed 24 July 2009
Erling, O., Mikhailov, I.: Towards web-scale RDF. http://virtuoso.openlinksw.com/dataspace/dav/wiki/Main/VOSArticleWebScaleRDF (2009)
Erling, O.: Directions and challenges for Semdata. In: Proceedings of the Semantic Data Management (SemData 2010) Workshop at the 36th International Conference on Very Large Data Bases (VLDB 2010), Singapore (2010)
Fischer, F., Keller, U., Kiryakov, A., Huang, Z., Momtchev, V., Simperl, E.: Initial knowledge representation formalism. LarKC project deliverable D1.1.3. http://www.larkc.eu/wp-content/uploads/2008/01/larkc_d113-initial-knowledge-representation-formalism_m7.pdf (2008)
Franz Inc.: AllegroGraph RDFStore 4.0. AllegroGraph product information. http://www.franz.com/agraph/allegrograph/ (2010). Accessed 22 Aug 2010
Franz Inc.: AllegroGraph RDFStore version 4.0 LUBM benchmark results. http://www.franz.com/agraph/allegrograph/agraph_bench_lubm.lhtml (2010). Accessed 22 Aug 2010
Franz Inc.: RDFS++ dynamic materialization. http://www.franz.com/agraph/allegrograph/dynamic-materialization.lhtml (2010). Accessed 22 Aug 2010
Grosof, B.N., Horrocks, I., Volz, R., Decker, S.: Description logic programs: combining logic programs with description logic. In: Proceedings of the 12th Internationl Conference on world Wide Web (WWW 2003), Budapest (2003)
Guo, Y., Pan, Z., Heflin, J.: An evaluation of knowledge base systems for large OWL datasets. J. Web Semant. 3(2), 158–182 (2004)
Harris, S., Lamb, N., Shadbolt, N.: 4store: the design and implementation of a clustered RDF store. In: Proceedings of the Fifth International Workshop on Scalable Semantic Web Knowledge Base Systems (SSWS 2009) at the Eighth International Semantic Web Conference (ISWC 2009), Chantilly. Lecture Notes in Computer Science, vol. 5823. Springer, Berlin (2009)
Harth, A., Umbrich, J., Hogan, A., Decker, S.: YARS2: a federated repository for querying graph structured data from the web. In: Proceedings of the Sixth International Semantic Web Conference (ISWC 2007), Busan. Lecture Notes in Computer Science, vol. 4825, pp. 211–224. Springer, Berlin (2007)
Harth, A: SWSE/YARS@SemData Sofia 2010. SemData roundtable, Sofia. http://semdata.org/sites/default/files/harth-swse-sofia-2010.pdf (2010). Accessed 12 Mar 2010
Hayes, P.: RDF semantics, W3C Recommendation. http://www.w3.org/TR/2004/REC-rdf-mt-20040210/ (Feb 2004)
Hogan, A., Harth, A., Umbrich, J., Kinsella, S., Polleres, A., Decker, S.: Searching and browsing linked data with SWSE: the semantic web search engine. DERI technical report 2010-07-23. http://www.deri.ie/fileadmin/documents/DERI-TR-2010-07-23.pdf (2010)
Horrocks, I., Patel-Schneider, P.F., Bechhofer, S., Tsarkov, D.: OWL rules: a proposal and prototype implementation. J. Web Semant. 3, 23–40 (2005)
Jacobs, A.: The pathologies of big data. Commun. ACM 52(8), 36–44 (2009)
Jupp, S., Bechhofer, S., Kostkova, P., Stevens, R., Yesilada, Y.: Document navigation: ontologies or knowledge organisation systems? In: Proceedings of the Seventh International Workshop on Network Tools and Applications in Biology (NETTAB 2007), Pisa. A Semantic Web for Bioinformatics: Goals, Tools, Systems, Applications (2007)
Kerrigan, M., Bradesko, L., Fortuna B.: Rapid prototype of the LarKC. LarKC project deliverable D5.2.1. http://www.larkc.eu/wp-content/uploads/2008/01/larkc_d521_rapid-prototype-of-the-larkc.pdf (2009)
Kiryakov, A.: Measurable targets for scalable reasoning. LarKC project deliverable D5.5.1, formerly titled “Definition of validation goals for the prototyping phase”. http://www.larkc.eu/wp-content/uploads/2008/07/larkc_d551.pdf (2008)
Kiryakov, A., Tashev, Z., Ognyanoff, D., Velkov, R., Momtchev, V., Balev, B., Peikov, I.: Validation goals and metrics for the LarKC platform. LarKC project deliverable D5.5.2. http://www.larkc.eu/deliverables/ (2009)
Kiryakov, A., Ognyanoff, D., Velkov, R., Tashev, Z., Peikov, I.: LDSR: materialized reasonable view to the web of linked data. In: Proceedings of the Third International Symposium on Rules, Applications and Interoperability (RuleML 2009), Las Vegas. Lecture Notes in Computer Science, vol. 5858. Springer, Berlin (2009)
Kiryakov, A., Momtchev, V.: Triplesets: tagging and grouping in RDF datasets. W3C Workshop “RDF Next Steps”, Stanford (June 2010)
Kiryakov, A., Bishop, B., Ognyanoff, D., Peikov, D., Tashev, Z., Velkov, R.: The features of BigOWLIM that enabled the BBC's World Cup website. In: Proceedings of the Semantic Data Management (SemData 2010) Workshop at the 36th International Conference on Very Large Data Bases (VLDB 2010), Singapore (2010)
Kotoulas, S., Oren, E., Van Harmelen, F.: Mind the data skew: distributed inferencing by speeddating in elastic regions. In: Proceedings of the 19th International Conference on World Wide Web (WWW 2010), Raleigh (2010)
Heath. T.: Linked data – connect distributed data on the web. http://linkeddata.org/ (2007)
World Wide Web Consortium (W3C): Linking open data. W3C SWEO community project. http://esw.w3.org/topic/SweoIG/TaskForces/CommunityProjects/LinkingOpenData/ (2007)
Ma, L., Yang, Y., Qiu, Z., Xie, G., Pan, Y.: Towards a complete OWL ontology benchmark. In: Proceedings of the Third European Semantic Web Conference (ESWC 2006), Budva. Lecture Notes in Computer Science, vol. 4011, pp. 125–139. Springer, Berlin (2006)
Manola, F., Miller, E. (eds.): RDF primer, W3C Recommendation. http://www.w3.org/TR/rdf-primer/ (Feb 2004)
McGuinness, D.L., van Harmelen, F. (eds.): OWL web ontology language. Overview. W3C Recommendation. http://www.w3.org/TR/owl-features/ (2004)
Momtchev, V., Kiryakov, A.: Second generation ontology representation and data integration (ORDI) framework, specification. Ontotext technical documentation. http://www.ontotext.com/ordi/ORDI_SG/ORDI_SG_Specification.pdf (2006). Accessed 13 Oct 2006
Momtchev, V., Peychev, D., Primov, T., Georgiev, G.: Expanding the pathway and interaction knowledge in linked life data. In: Proceedings of the International Semantic Web Challenge (ISWC 2009), Chantilly. Lecture Notes in Computer Science, vol. 5823. Springer, Berlin. http://challenge.semanticweb.org/ (2009)
Motik, B., Cuenca Grau, B., Horrocks, I., Wu, Z., Fokoue, A., Lutz, C. (eds.): OWL 2 web ontology language profiles, W3C Candidate Recommendation. http://www.w3.org/TR/owl2-profiles/ (Feb 2009)
Motik, B., Sattler, U., Studer, R.: Query answering for OWL-DL with rules. J. Web Semant. 3(1), 41–60 (2005)
North, K.: The NoSQL alternative. Dr. Dobb’s J. http://www.drdobbs.com/database/224900500. Accessed 21 May 2010
O'Donovan, J.: The World Cup and a call to action around linked data. BBC blog post. http://www.bbc.co.uk/blogs/bbcinternet/2010/07/the_world_cup_and_a_call_to_ac.html. Accessed 9 July 2010
ONTOCORE: Ontology logic and reasoning at semantic Karlsruhe. Home page, http://logic.aifb.uni-karlsruhe.de/
Ontotext Lab: RDF(S), rules and OWL dialects. http://www.ontotext.com/inference/rdfs_rules_owl.html. Accessed Dec 2009 (2009)
Oren, E., Kotoulas, S., Anadiotis, G., Siebes, R., Ten Teije, A., Van Harmelen, F.: MARVIN: distributed reasoning over large-scale semantic web data. J. Web Semant. 7(4), 305–316 (2009)
PricewaterhouseCoopers: Spring of the data web. Technology Forecast. A Quart. J. Spring 2009. http://www.pwc.com/techforecast/ (2009)
Prud'hommeaux, E., Seaborne, A.: SPARQL query language for RDF, W3C Recommendation. http://www.w3.org/TR/rdf-sparql-query/ (Jan 2008)
Rayfield, J.: BBC World Cup 2010 dynamic semantic publishing, BBC blog post. http://www.bbc.co.uk/blogs/bbcinternet/2010/07/bbc_world_cup_2010_dynamic_sem.html (2010)
Reed, D.P.: Naming and synchronization in a decentralized computer system. MIT dissertation, http://portal.acm.org/citation.cfm?id=889815 (1978)
Ruhloff, K., Dean, M., Emmons, I., Ryder, D., Sumner, J.: An evaluation of triple-store technologies for large data stores. In: Proceedings of the Scalable Semantic Systems Workshop (SSSW 2007), Portugal (2007)
Salvadores, M., Correndo, G., Omitola, T., Gibbins, N., Harris, S., Shadbolt, N.: 4s-reasoner: RDFS backward chained reasoning support in 4store. In: Proceedings of the 2010 International Workshop on Web-Scale Knowledge Representation, Retrieval, and Reasoning (Web-KR3 2010), Toronto (2010)
Stoilos G., Grau B.C., Horrocks I.: How incomplete is your semantic web reasoner? In: Proceedings of the 20th National Conference on Artificial Intelligence (AAAI 10), Atlanta (2010)
Swiss Institute of Bioinformatics. UniProt RDF, http://dev.isb-sib.ch/projects/uniprot-rdf/ (2009)
SYSTAP LLC: Bigdata: approaching web scale for the semantic web. http://www.bigdata.com/whitepapers/bigdata_whitepaper_07-08-2009.pdf (2009)
SYSTAP LLC: Bigdata overview. SemData roundtable, Sofia. http://semdata.org/sites/default/files/bigdata-sofia-roundtable-3-10-2010.pdf (2010). Accessed 12 Mar 2010
ter Horst, H. J.: Combining RDF and part of OWL with rules: semantics, decidability, complexity. In: Proceedings of the Fourth International Semantic Web Conference (ISWC 2005), Galway. Lecture Notes in Computer Science, vol. 3729, pp. 668–684. Springer, Heidelberg (2005)
Thakker, D., Osman, T., Gohil, S., Lakin, P.: A pragmatic approach to semantic repositories benchmarking. In: Proceedings of the Seventh Extended Semantic Web Conference (ESWC 2010), Heraklion (2010)
Todorova, P., Kiryakov, A., Ognyanoff, D., Peikov, I., Velkov, R., Tashev, Z.: Spreading activation components. LarKC project deliverable D2.4.1 (2009)
Urbani, J., Kotoulas, S., Maassen, J., Van Harmelen, F., Bal, H.: OWL reasoning with WebPIE: calculating the closure of 100 billion triples. In: Proceedings of the Seventh Extended Semantic Web Conference (ESWC 2010), Heraklion. Lecture Notes in Computer Science, vol. 6088, pp. 213–227. Springer, Berlin (2010)
Vertica: The vertica database datasheet. Product overview. http://www.vertica.com/_pdf/VerticaDatabaseDataSheet.pdf (2010). Accessed Jan 2010
White, A. (Gartner): Semantic web moving ever close to the ‘Semantic Enterprise?’ http://blogs.gartner.com/andrew_white/2009/04/30/semantic-web-moving-ever-close-to-the-semantic-enterprise/ (2009)
World Wide Web Consortium (W3C): Linking open data. W3C SWEO community project home page. http://esw.w3.org/topic/SweoIG/TaskForces/CommunityProjects/LinkingOpenData (2010). Accessed Jan 2010
Wu, A., Wu, Z., Kolovski, V.: An enterprise inference engine inside Oracle Database 11g Release 2. Presentation at Semantic Technology Conference (SemTech 2010), San Francisco (2010)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this entry
Cite this entry
Kiryakov, A., Damova, M. (2011). Storing the Semantic Web: Repositories. In: Domingue, J., Fensel, D., Hendler, J.A. (eds) Handbook of Semantic Web Technologies. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-92913-0_7
Download citation
DOI: https://doi.org/10.1007/978-3-540-92913-0_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-92912-3
Online ISBN: 978-3-540-92913-0
eBook Packages: Computer ScienceReference Module Computer Science and Engineering