DAW: Duplicate-AWare Federated Query Processing over the Web of Data

  • Muhammad Saleem
  • Axel-Cyrille Ngonga Ngomo
  • Josiane Xavier Parreira
  • Helena F. Deus
  • Manfred Hauswirth
Conference paper

DOI: 10.1007/978-3-642-41335-3_36

Part of the Lecture Notes in Computer Science book series (LNCS, volume 8218)
Cite this paper as:
Saleem M., Ngonga Ngomo AC., Xavier Parreira J., Deus H.F., Hauswirth M. (2013) DAW: Duplicate-AWare Federated Query Processing over the Web of Data. In: Alani H. et al. (eds) The Semantic Web – ISWC 2013. ISWC 2013. Lecture Notes in Computer Science, vol 8218. Springer, Berlin, Heidelberg

Abstract

Over recent years, the Web of Data has developed into a large compendium of interlinked datasets from multiple domains. Due to the decentralised architecture of this compendium, several of these datasets contain duplicated data. Yet so far, little attention has been paid to the effect of duplicated data on federated querying. This work presents DAW, a novel duplicate-aware approach to federated querying over the Web of Data. DAW is based on a combination of min-wise independent permutations and compact data summaries. It can be directly combined with existing federated query engines in order to achieve the same query recall values while querying fewer data sources. We extend three well-known federated query processing engines (DARQ, SPLENDID, and FedX) with DAW and compare our extensions with the original approaches. The comparison shows that DAW can greatly reduce the number of queries sent to the endpoints while keeping query recall values high. Therefore, it can significantly improve the performance of federated query processing engines. Moreover, DAW provides a source selection mechanism that maximises the query recall when query processing is limited to a subset of the sources.
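
To make the idea of min-wise independent permutations more concrete, the following is a minimal Python sketch of how compact signatures could be used to estimate how many new results a source would contribute. It is not the authors' implementation: the function names (`minhash_signature`, `estimate_jaccard`, `select_sources`), the use of seeded BLAKE2b hashes as a stand-in for random permutations, the largest-first greedy ranking, and the `threshold` parameter are all assumptions made purely for illustration.

```python
import hashlib


def _hash(value: str, seed: int) -> int:
    # Seeded 64-bit hash; stands in for one random permutation (assumption).
    data = f"{seed}:{value}".encode("utf-8")
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "big")


def minhash_signature(items, num_hashes=128):
    # For each seeded hash function, keep only the minimum value over the set.
    items = list(items)
    return [min(_hash(x, seed) for x in items) for seed in range(num_hashes)]


def estimate_jaccard(sig_a, sig_b):
    # The fraction of matching positions estimates the Jaccard similarity.
    matches = sum(1 for a, b in zip(sig_a, sig_b) if a == b)
    return matches / len(sig_a)


def merge_signatures(sig_a, sig_b):
    # The signature of a union is the position-wise minimum.
    return [min(a, b) for a, b in zip(sig_a, sig_b)]


def select_sources(sources, threshold=1.0, num_hashes=128):
    """Greedy, hypothetical source selection.

    sources maps a source name to the set of result strings it would return.
    A source is skipped when its estimated number of new results is below
    threshold.
    """
    sigs = {n: minhash_signature(t, num_hashes) for n, t in sources.items() if t}
    sizes = {n: len(sources[n]) for n in sigs}
    order = sorted(sigs, key=lambda n: sizes[n], reverse=True)

    selected, union_sig, union_size = [], None, 0.0
    for name in order:
        if union_sig is None:
            new = float(sizes[name])
        else:
            j = estimate_jaccard(union_sig, sigs[name])
            # |A ∩ B| = J * (|A| + |B|) / (1 + J)
            overlap = j * (union_size + sizes[name]) / (1 + j)
            new = sizes[name] - overlap
        if new >= threshold:
            selected.append(name)
            union_sig = (sigs[name] if union_sig is None
                         else merge_signatures(union_sig, sigs[name]))
            union_size += new
    return selected


if __name__ == "__main__":
    a = {f"triple-{i}" for i in range(100)}
    b = set(a)                                  # exact duplicate of a
    c = {f"other-{i}" for i in range(50)}       # disjoint from a
    print(select_sources({"a": a, "b": b, "c": c}))
```

In this toy setting the duplicate source `b` adds no estimated new results and is skipped, which mirrors the abstract's claim that recall can be preserved while fewer sources are queried. A real engine would compute such signatures once, ahead of query time, and store them in its data summaries rather than rebuilding them per query.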

Keywords

federated query processing; SPARQL; min-wise independent permutations; Web of Data

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Muhammad Saleem
    • 1
  • Axel-Cyrille Ngonga Ngomo
    • 1
  • Josiane Xavier Parreira
  • Helena F. Deus
  • Manfred Hauswirth

  1. IFI/AKSW, Universität Leipzig, Leipzig, Germany
