Adaptive Tuple Differential Coding

  • Jean-Paul Deveaux
  • Andrew Rau-Chaplin
  • Norbert Zeh
Conference paper

DOI: 10.1007/978-3-540-74469-6_12

Part of the Lecture Notes in Computer Science book series (LNCS, volume 4653)
Cite this paper as:
Deveaux JP., Rau-Chaplin A., Zeh N. (2007) Adaptive Tuple Differential Coding. In: Wagner R., Revell N., Pernul G. (eds) Database and Expert Systems Applications. DEXA 2007. Lecture Notes in Computer Science, vol 4653. Springer, Berlin, Heidelberg

Abstract

It is desirable to employ compression techniques in Relational OLAP systems to reduce disk space requirements and increase disk I/O throughput. Tuple Differential Coding (TDC) techniques have been introduced to compress views on a tuple level by storing only the differences between consecutive ordered tuples. These techniques work well for highly regular data in which the differences between tuples are fairly constant but are less effective on real data containing either skew or outliers. In this paper we introduce Adaptive Tuple Differential Coding (ATDC), which employs optimization techniques to analyze blocks of tuples to detect large tuple differences, with the purpose of isolating them to minimize their negative effect on the compression of neighbouring tuples. Our experiments show that this new algorithm provides an increase in compression ratio of 15–30% over TDC on typical real datasets.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2007

Authors and Affiliations

  • Jean-Paul Deveaux
    • 1
  • Andrew Rau-Chaplin
    • 1
  • Norbert Zeh
    • 1
  1. 1.Faculty of Computer Science, Dalhousie University, Halifax NSCanada

Personalised recommendations