Chapter

Advances in Information Retrieval

Volume 7224 of the series Lecture Notes in Computer Science, pp. 413-425

Intra-query Concurrent Pipelined Processing for Distributed Full-Text Retrieval

  • Simon Jonassen, Norwegian University of Science and Technology
  • Svein Erik Bratsberg, Norwegian University of Science and Technology

Abstract

Pipelined query processing over a term-wise distributed inverted index has superior throughput at high query multiprogramming levels. However, because of its long query latencies, this approach is inefficient at lower levels. In this paper we explore two types of intra-query parallelism within the pipelined approach: parallel execution of a query on different nodes and concurrent execution on the same node. According to the experimental results, our approach reaches the throughput of the state-of-the-art method at about half of the latency. In the single-query case, the observed latency improvement is up to 2.6 times.
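
As a rough illustration of the setting described in the abstract, the following toy sketch contrasts a query that visits term-partitioned nodes one after another with one that is scored on all nodes at once. It is not the authors' algorithm, which the abstract does not detail. The index contents and the names `NODES`, `node_partial_scores`, `pipelined`, and `parallel` are hypothetical, and scores are simply summed per document. Python threads show the control flow only and are not meant to demonstrate a speedup.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical toy index: each "node" owns the posting lists of some terms.
# A posting list maps a document id to that term's score contribution.
NODES = [
    {"apple": {1: 2.0, 2: 1.0}},
    {"pie": {2: 3.0, 3: 0.5}},
]


def node_partial_scores(node, terms):
    """Score the query terms whose posting lists are stored on one node."""
    partial = {}
    for term in terms:
        for doc, score in node.get(term, {}).items():
            partial[doc] = partial.get(doc, 0.0) + score
    return partial


def merge(acc, partial):
    """Add one node's partial scores into the accumulator set."""
    for doc, score in partial.items():
        acc[doc] = acc.get(doc, 0.0) + score
    return acc


def pipelined(terms):
    """Baseline: the accumulator set visits the nodes one after another."""
    acc = {}
    for node in NODES:
        acc = merge(acc, node_partial_scores(node, terms))
    return acc


def parallel(terms):
    """Assumed variant: all nodes score the query at once, then results merge."""
    with ThreadPoolExecutor(max_workers=len(NODES)) as pool:
        partials = list(pool.map(lambda n: node_partial_scores(n, terms), NODES))
    acc = {}
    for partial in partials:
        acc = merge(acc, partial)
    return acc
```

Both functions return the same document scores for a given query; in this sketch the only difference is whether per-node work is serialized or overlapped, which is the latency trade-off the abstract refers to.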