Intra-query Concurrent Pipelined Processing for Distributed Full-Text Retrieval

  • Simon Jonassen
  • Svein Erik Bratsberg
Conference paper

DOI: 10.1007/978-3-642-28997-2_35

Part of the Lecture Notes in Computer Science book series (LNCS, volume 7224)
Cite this paper as:
Jonassen S., Bratsberg S.E. (2012) Intra-query Concurrent Pipelined Processing for Distributed Full-Text Retrieval. In: Baeza-Yates R. et al. (eds) Advances in Information Retrieval. ECIR 2012. Lecture Notes in Computer Science, vol 7224. Springer, Berlin, Heidelberg

Abstract

Pipelined query processing over a term-wise distributed inverted index has superior throughput at high query multiprogramming levels. However, due to long query latencies this approach is inefficient at lower levels. In this paper we explore two types of intra-query parallelism within the pipelined approach, parallel execution of a query on different nodes and concurrent execution on the same node. According to the experimental results, our approach reaches the throughput of the state-of-the-art method at about half of the latency. On the single query case the observed latency improvement is up to 2.6 times.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Simon Jonassen
    • 1
  • Svein Erik Bratsberg
    • 1
  1. 1.Norwegian University of Science and TechnologyTrondheimNorway

Personalised recommendations