Information Retrieval

, Volume 8, Issue 1, pp 151–166

Inverted Index Compression Using Word-Aligned Binary Codes

  • Vo Ngoc Anh
  • Alistair Moffat
Article

DOI: 10.1023/B:INRT.0000048490.99518.5c

Cite this article as:
Anh, V.N. & Moffat, A. Information Retrieval (2005) 8: 151. doi:10.1023/B:INRT.0000048490.99518.5c

Abstract

We examine index representation techniques for document-based inverted files, and present a mechanism for compressing them using word-aligned binary codes. The new approach allows extremely fast decoding of inverted lists during query processing, while providing compression rates better than other high-throughput representations. Results are given for several large text collections in support of these claims, both for compression effectiveness and query efficiency.

index compressioninteger codingindex representation

Copyright information

© Kluwer Academic Publishers 2005

Authors and Affiliations

  • Vo Ngoc Anh
    • 1
  • Alistair Moffat
    • 1
  1. 1.Department of Computer Science and Software EngineeringThe University of MelbourneAustralia