Optimizing SPARQL Query Processing on Dynamic and Static Data Based on Query Time/Freshness Requirements Using Materialization

Dehghanzadeh, Soheila; Parreira, Josiane Xavier; Karnstedt, Marcel; Umbrich, Juergen; Hauswirth, Manfred; Decker, Stefan

doi:10.1007/978-3-319-15615-6_19

Soheila Dehghanzadeh¹⁸,
Josiane Xavier Parreira¹⁹,
Marcel Karnstedt²⁰,
Juergen Umbrich²¹,
Manfred Hauswirth²² &
…
Stefan Decker¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8943))

Included in the following conference series:

Joint International Semantic Technology Conference

1172 Accesses
2 Citations

Abstract

To integrate various Linked Datasets, data warehousing and live query processing provide two extremes for optimized response time and quality respectively. The first approach provides very fast responses but with low-quality because changes of original data are not immediately reflected on materialized data. The second approach provides accurate responses but it is notorious for long response times. A hybrid SPARQL query processor provides a middle ground between two specified extremes by splitting the triple patterns of SPARQL query between live and local processors based on a predetermined coherence threshold specified by the administrator. Considering quality requirements while splitting the SPARQL query, enables the processor to eliminate the unnecessary live execution and releases resources for other queries. This requires estimating the quality of response provided with current materialized data, compare it with user requirements and determine the most selective sub-queries which can boost the response quality up to the specified level with least computational complexity. In this work, we propose solutions for estimating the freshness of materialized data, as one dimension of the quality, by extending cardinality estimation techniques. Experimental results show that we can estimate the freshness of materialized data with a low error rate.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bishop, B., Kiryakov, A., Ognyanov, D., Peikov, I., Tashev, Z., Velkov, R.: Factforge: A fast track to the web of data. Semantic Web 2(2), 157–166 (2011)
Google Scholar
Bizer, C., Schultz, A.: The berlin sparql benchmark. International Journal on Semantic Web and Information Systems (IJSWIS) 5, 1–24 (2009)
Google Scholar
Bouzeghoub, M.: A framework for analysis of data freshness. In: Proceedings of the 2004 International Workshop on Information Quality in Information Systems, pp. 59–67. ACM (2004)
Google Scholar
Castillo, R., Rothe, C., Ulf, L.: Idexing RDF Data for SPARQL Queries. Professoren des Inst. für Informatik, RDFMatView (2010)
Google Scholar
Dey, D., Kumar, S.: Data quality of query results with generalized selection conditions. Operations Research 61(1), 17–31 (2013)
Article MATH MathSciNet Google Scholar
Goasdoué, F., Karanasos, K., Leblay, J., Manolescu, I.: View selection in semantic web databases. Proceedings of the VLDB Endowment 5(2), 97–108 (2011)
Article Google Scholar
Goldstein, J., Per-Åke, L.: Optimizing queries using materialized views: a practical, scalable solution. In: ACM SIGMOD Record 30, pp. 331–342. ACM (2001)
Google Scholar
Hartig, O., Bizer, C., Freytag, J.-C.: Executing sparql queries over the web of linked data. In: Bernstein, A., Karger, D.R., Heath, T., Feigenbaum, L., Maynard, D., Motta, E., Thirunarayan, K. (eds.) ISWC 2009. LNCS, vol. 5823, pp. 293–309. Springer, Heidelberg (2009)
Chapter Google Scholar
Kuno, H., Graefe, G.: Deferred maintenance of indexes and of materialized views. In: Kikuchi, S., Madaan, A., Sachdeva, S., Bhalla, S. (eds.) DNIS 2011. LNCS, vol. 7108, pp. 312–323. Springer, Heidelberg (2011)
Chapter Google Scholar
Labrinidis, A., Qu, H., Xu, J.: Quality contracts for real-time enterprises. In: Bussler, C.J., Castellanos, M., Dayal, U., Navathe, S. (eds.) BIRTE 2006. LNCS, vol. 4365, pp. 143–156. Springer, Heidelberg (2007)
Chapter Google Scholar
Labrinidis, A., Roussopoulos, N.: Exploring the tradeoff between performance and data freshness in database-driven web servers. The VLDB Journal 13(3), 240–255 (2004)
Article Google Scholar
Lupei, D., Shaikhha, A., Koch, C., Nötzli, A.: Oliver Andrzej Kennedy, Milos Nikolic, and Yanif Ahmad. Higher-order delta processing for dynamic, frequently fresh views. Technical report, Dbtoaster (2013)
Google Scholar
Neumann, T., Moerkotte, G.: Characteristic sets: Accurate cardinality estimation for rdf queries with multiple joins. In: 2011 IEEE 27th International Conference on Data Engineering (ICDE), pp. 984–994. IEEE (2011)
Google Scholar
Parssian, A., Sarkar, S., Jacob, V.S.: Assessing information quality for the composite relational operation join. In: IQ, pp. 225–237 (2002)
Google Scholar
Poosala, V., Haas, P.J., Loannidis, Y.E., Shekita, E.J.: Improved histograms for selectivity estimation of range predicates. ACM SIGMOD Record 25(2), 294–305 (1996)
Article Google Scholar
Tummarello, G., Delbru, R., Oren, E.: Sindice.com: weaving the open linked data. In: Aberer, K., Choi, K.-S., Noy, N., Allemang, D., Lee, K.-I., Nixon, L.J.B., Golbeck, J., Mika, P., Maynard, D., Mizoguchi, R., Schreiber, G., Cudré-Mauroux, P. (eds.) ASWC 2007 and ISWC 2007. LNCS, vol. 4825, pp. 552–565. Springer, Heidelberg (2007)
Chapter Google Scholar
Umbrich, J., Hose, K., Karnstedt, M., Harth, A., Polleres, A.: Comparing data summaries for processing live queries over linked data. World Wide Web 14(5–6), 495–544 (2011)
Article Google Scholar
Parreira, J.X., Umbrich, J., Karnstedt, M., Hogan, A.: Hybrid sparql queries: fresh vs. fast results. In: Cudré-Mauroux, P., Heflin, J., Sirin, E., Tudorache, T., Euzenat, J., Hauswirth, M., Parreira, J.X., Hendler, J., Schreiber, G., Bernstein, A., Blomqvist, E. (eds.) ISWC 2012, Part I. LNCS, vol. 7649, pp. 608–624. Springer, Heidelberg (2012)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Insight Centre for Data Analytics, National University of Ireland, Galway, Republic of Ireland
Soheila Dehghanzadeh & Stefan Decker
Siemens AG Sterreich, Wien, Austria
Josiane Xavier Parreira
Bell Labs, Dublin, Ireland
Marcel Karnstedt
Vienna University of Economics and Business, Vienna, Austria
Juergen Umbrich
TU Berlin and Fraunhofer FOKUS, Berlin, Germany
Manfred Hauswirth

Authors

Soheila Dehghanzadeh
View author publications
You can also search for this author in PubMed Google Scholar
Josiane Xavier Parreira
View author publications
You can also search for this author in PubMed Google Scholar
Marcel Karnstedt
View author publications
You can also search for this author in PubMed Google Scholar
Juergen Umbrich
View author publications
You can also search for this author in PubMed Google Scholar
Manfred Hauswirth
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Decker
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Soheila Dehghanzadeh .

Editor information

Editors and Affiliations

National Electronics and Computer Technology Center, Pathum Thani, Thailand
Thepchai Supnithi
Keio University Fac. of Science & Technology, Yokohama, Japan
Takahira Yamaguchi
Department of Computing Science, University of Aberdeen, Aberdeen, United Kingdom
Jeff Z. Pan
Asian University, Chonburi, Thailand
Vilas Wuwongse
National Electronics Computer Tech Ctr, Sci Park, Pathumthani, Thailand
Marut Buranarach

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dehghanzadeh, S., Parreira, J.X., Karnstedt, M., Umbrich, J., Hauswirth, M., Decker, S. (2015). Optimizing SPARQL Query Processing on Dynamic and Static Data Based on Query Time/Freshness Requirements Using Materialization. In: Supnithi, T., Yamaguchi, T., Pan, J., Wuwongse, V., Buranarach, M. (eds) Semantic Technology. JIST 2014. Lecture Notes in Computer Science(), vol 8943. Springer, Cham. https://doi.org/10.1007/978-3-319-15615-6_19

Download citation

DOI: https://doi.org/10.1007/978-3-319-15615-6_19
Published: 21 February 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-15614-9
Online ISBN: 978-3-319-15615-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics