Abstract
A growing amount of data is being published online in machine-readable formats, and Linked Open Data (LOD) has emerged as a way to share such data across Web resources. Because LOD datasets often contain numerical data, such as statistics, there is growing demand for Online Analytical Processing (OLAP) over them. To make it possible to analyze LOD with off-the-shelf OLAP systems, we propose a framework that streamlines the Extract, Transform, and Load (ETL) process from LOD to multidimensional data models for OLAP. Unlike related approaches, our framework does not require RDF vocabularies dedicated to specifying a multidimensional model for OLAP. Instead, given an LOD dataset, we exploit the relationships among entities and external information in the referenced LOD to generate an OLAP data model. In a case study, we demonstrate that our framework can extract OLAP data models from different kinds of real LOD datasets.
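To make the idea of deriving a multidimensional model from plain triples concrete, the following is a minimal sketch and not the algorithm of the paper. It assumes a simple heuristic: subjects carrying a numeric literal are treated as facts, their numeric properties as measures, and their remaining properties as dimensions, while links between entities (here a hypothetical `ex:locatedIn` predicate) supply a hierarchy level for roll-up. All names, predicates, and values are invented for illustration.

```python
from collections import defaultdict

# Hypothetical toy triples (subject, predicate, object); not taken from the paper.
TRIPLES = [
    ("ex:obs1", "ex:city", "ex:CityA"),
    ("ex:obs1", "ex:year", "ex:Y2012"),
    ("ex:obs1", "ex:population", 100),
    ("ex:obs2", "ex:city", "ex:CityB"),
    ("ex:obs2", "ex:year", "ex:Y2012"),
    ("ex:obs2", "ex:population", 200),
    ("ex:CityA", "ex:locatedIn", "ex:RegionX"),
    ("ex:CityB", "ex:locatedIn", "ex:RegionX"),
]


def is_number(value):
    """True for numeric literals (bool is excluded on purpose)."""
    return isinstance(value, (int, float)) and not isinstance(value, bool)


def build_fact_table(triples):
    """Group triples by subject and split properties into measures/dimensions.

    Assumed heuristic: a subject with at least one numeric value is a fact.
    Returns the list of facts and the subject index used to follow links.
    """
    by_subject = defaultdict(dict)
    for s, p, o in triples:
        by_subject[s][p] = o

    facts = []
    for s, props in by_subject.items():
        measures = {p: o for p, o in props.items() if is_number(o)}
        if not measures:
            continue  # entities without numeric values are not facts
        dimensions = {p: o for p, o in props.items() if not is_number(o)}
        facts.append({"id": s, "dimensions": dimensions, "measures": measures})
    return facts, by_subject


def roll_up(facts, by_subject, dimension, parent_predicate, measure):
    """Sum a measure at the parent level reached via parent_predicate."""
    totals = defaultdict(float)
    for fact in facts:
        member = fact["dimensions"].get(dimension)
        parent = by_subject.get(member, {}).get(parent_predicate)
        if parent is not None:
            totals[parent] += fact["measures"].get(measure, 0)
    return dict(totals)


facts, index = build_fact_table(TRIPLES)
by_region = roll_up(facts, index, "ex:city", "ex:locatedIn", "ex:population")
print(by_region)
```

In this sketch the city-to-region link plays the role of a dimension hierarchy discovered from entity relationships rather than declared through a dedicated OLAP vocabulary; a real pipeline would read triples from an RDF store and would need a more careful rule for deciding what counts as a fact.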
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Inoue, H., Amagasa, T., Kitagawa, H. (2013). An ETL Framework for Online Analytical Processing of Linked Open Data. In: Wang, J., Xiong, H., Ishikawa, Y., Xu, J., Zhou, J. (eds) Web-Age Information Management. WAIM 2013. Lecture Notes in Computer Science, vol 7923. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38562-9_12
DOI: https://doi.org/10.1007/978-3-642-38562-9_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38561-2
Online ISBN: 978-3-642-38562-9
eBook Packages: Computer Science, Computer Science (R0)