Scheduling Refresh Queries for Keeping Results from a SPARQL Endpoint Up-to-Date (Short Paper)

Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10033)

Abstract

Many datasets change over time. As a consequence, long-running applications that cache and repeatedly use query results obtained from a SPARQL endpoint may resubmit the queries regularly to ensure up-to-dateness of the results. While this approach may be feasible if the number of such regular refresh queries is manageable, with an increasing number of applications adopting this approach, the SPARQL endpoint may become overloaded with such refresh queries. A more scalable approach would be to use a middle-ware component at which the applications register their queries and get notified with updated query results once the results have changed. Then, this middle-ware can schedule the repeated execution of the refresh queries without overloading the endpoint. In this paper, we study the problem of scheduling refresh queries for a large number of registered queries by assuming an overload-avoiding upper bound on the length of a regular time slot available for testing refresh queries. We investigate a variety of scheduling strategies and compare them experimentally in terms of time slots needed before they recognize changes and number of changes that they miss.

References

  1. 1.
    Dividino, R., Gottron, T., Scherp, A.: Strategies for efficiently keeping local linked open data caches up-to-date. In: Strohmaier, M., et al. (eds.) ISWC 2015. LNCS, vol. 9367, pp. 356–373. Springer, Heidelberg (2015). doi: 10.1007/978-3-319-25010-6_24 CrossRefGoogle Scholar
  2. 2.
    Endris, K.M., Faisal, S., Orlandi, F., Auer, S., Scerri, S.: Interest-based RDF update propagation. In: Arenas, M., Corcho, O., Simperl, E., Strohmaier, M., d’Aquin, M., Srinivas, K., Groth, P., Dumontier, M., Heflin, J., Thirunarayan, K., Staab, S. (eds.) ISWC 2015. LNCS, vol. 9366, pp. 513–529. Springer, Heidelberg (2015). doi: 10.1007/978-3-319-25007-6_30 CrossRefGoogle Scholar
  3. 3.
    Garrod, C., Manjhi, A., Ailamaki, A., Maggs, B., Mowry, T., Olston, C., Tomasic, A.: Scalable query result caching for web applications. Proc. VLDB Endowment 1(1) (2008)Google Scholar
  4. 4.
    Harris, S., Seaborne, A.: SPARQL 1.1 query language. W3C Recommendation (2013). https://www.w3.org/TR/sparql11-query/
  5. 5.
    Hellmann, S., Stadler, C., Lehmann, J., Auer, S.: DBpedia live extraction. In: Meersman, R., Dillon, T., Herrero, P. (eds.) OTM 2009, Part II. LNCS, vol. 5871, pp. 1209–1223. Springer, Heidelberg (2009)CrossRefGoogle Scholar
  6. 6.
    Kjernsmo, K.: A survey of http caching implementations on the open semantic web. In: The Semantic Web. Latest Advances and New Domains, pp. 286–301. Springer (2015)Google Scholar
  7. 7.
    Knuth, M., Hartig, O., Sack, H.: Scheduling Refresh Queries for Keeping Results from a SPARQL Endpoint Up-to-Date (Extended Version). CoRR abs/1608.08130 (2016)Google Scholar
  8. 8.
    Knuth, M., Reddy, D., Dimou, A., Vahdati, S., Kastrinakis, G.: Towards linked data update notifications - reviewing and generalizing the sparqlPuSH approach. In: Proceedings of the NoISE (2015)Google Scholar
  9. 9.
    Käfer, T., Abdelrahman, A., Umbrich, J., O’Byrne, P., Hogan, A.: Observing linked data dynamics. In: Cimiano, P., Corcho, O., Presutti, V., Hollink, L., Rudolph, S. (eds.) ESWC 2013. LNCS, vol. 7882, pp. 213–227. Springer, Heidelberg (2013). doi: 10.1007/978-3-642-38288-8_15 CrossRefGoogle Scholar
  10. 10.
    Martin, M., Unbehauen, J., Auer, S.: Improving the performance of semantic web applications with SPARQL query caching. In: Aroyo, L., Antoniou, G., Hyvönen, E., ten Teije, A., Stuckenschmidt, H., Cabral, L., Tudorache, T. (eds.) ESWC 2010, Part II. LNCS, vol. 6089, pp. 304–318. Springer, Heidelberg (2010)CrossRefGoogle Scholar
  11. 11.
    Passant, A., Mendes, P.N.: sparqlPuSH: Proactive notification of data updates in RDF stores using PubSubHubbub. In: Proceedings of Scripting for the Semantic Web Workshop (2010)Google Scholar
  12. 12.
    Popitsch, N., Haslhofer, B.: DSNotify-a solution for event detection and link maintenance in dynamic datasets. J. Web Semant. 9(3), 266–283 (2011)CrossRefGoogle Scholar
  13. 13.
    Saleem, M., Ali, M.I., Hogan, A., Mehmood, Q., Ngomo, A.-C.N.: LSQ: the linked SPARQL queries dataset. In: Arenas, M., et al. (eds.) ISWC 2015. LNCS, vol. 9367, pp. 261–269. Springer, Heidelberg (2015). doi: 10.1007/978-3-319-25010-6_15 CrossRefGoogle Scholar
  14. 14.
    Tramp, S., Frischmuth, P., Ermilov, T., Auer, S.: Weaving a social data web with semantic pingback. In: Cimiano, P., Pinto, H.S. (eds.) EKAW 2010. LNCS, vol. 6317, pp. 135–149. Springer, Heidelberg (2010)CrossRefGoogle Scholar
  15. 15.
    Williams, G.T., Weaver, J.: Enabling fine-grained HTTP caching of SPARQL query results. In: Aroyo, L., Welty, C., Alani, H., Taylor, J., Bernstein, A., Kagal, L., Noy, N., Blomqvist, E. (eds.) ISWC 2011, Part I. LNCS, vol. 7031, pp. 762–777. Springer, Heidelberg (2011)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing AG 2016

Authors and Affiliations

  1. 1.Hasso Plattner InstituteUniversity of PotsdamPotsdamGermany
  2. 2.Department of Computer and Information Science (IDA)Linköping UniversityLinköpingSweden

Personalised recommendations