Abstract
We believe that the possibility to use SPARQL as a front end to heterogeneous data without significant cost in performance or expressive power is key to RDF taking its rightful place as the lingua franca of data integration. To this effect, we demonstrate how RDF and SPARQL can tackle a mix of standard relational workload and data mining in public data sources.
We discuss extending SPARQL for business intelligence (BI) workloads and relate experiences on running SPARQL against relational and native RDF databases. We use the well known TPC H benchmark as our reference schema and workload. We define a mapping of the TPC H schema to RDF and restate the queries as BI extended SPARQL. To this effect, we define aggregation and nested queries for SPARQL.
We demonstrate that it is possible to perform the TPC H workload restated in SPARQL against an existing RDBMS without loss of performance or expressivity and without changes to the RDBMS.
Finally, we demonstrate how to combine TPC-H or XBRL financial reports with RDF data from CIA factbook and DBpedia.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
W3C RDF Data Access Working Group: SPARQL Query Language for RDF, http://www.w3.org/TR/rdf-sparql-query/
Transaction Processing Performance Council: TPC-H – a Decision Support Benchmark, http://www.tpc.org/tpch/
Linking Open Data Project, http://linkeddata.org/
DBpedia – A Community Effort to Extract Structured Information From Wikipedia, http://dbpedia.org/
XBRL - Extensible Business Reporting Language, http://www.xbrl.org/Home/
Seaborn, A.: Counting and GROUP BY in ARQ, http://seaborne.blogspot.com/2007/09/counting-and-group-by.html
Weiske, C., Auer, S.: Implementing SPARQL Support for Relational Databases and Possible Enhancements. In: Proceedings of the 1st Conference on Social Semantic Web. Leipzig (CSSW 2007), SABRE. LNI 113 GI 2007, Bonner Kollen Verlag (2007), http://www.informatik.uni-leipzig.de/~auer/publication/sparql-enhancements.pdf , ISBN 978-3-88579-207-9
Erling, O., Mikhailov, I.: Adapting an ORDBMS for RDF Storage and Mapping. In: Proceedings of the 1st Conference on Social Semantic Web. Leipzig (CSSW 2007), SABRE. LNI 113 GI 2007, Bonner Kollen Verlag (2007) ISBN 978-3-88579-207-9
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Erling, O., Mikhailov, I. (2008). Integrating Open Sources and Relational Data with SPARQL. In: Bechhofer, S., Hauswirth, M., Hoffmann, J., Koubarakis, M. (eds) The Semantic Web: Research and Applications. ESWC 2008. Lecture Notes in Computer Science, vol 5021. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68234-9_69
Download citation
DOI: https://doi.org/10.1007/978-3-540-68234-9_69
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68233-2
Online ISBN: 978-3-540-68234-9
eBook Packages: Computer ScienceComputer Science (R0)