International Colloquium on Automata, Languages, and Programming

ICALP 2006: Automata, Languages and Programming pp 560-571

The Myriad Virtues of Wavelet Trees

  • Paolo Ferragina
  • Raffaele Giancarlo
  • Giovanni Manzini
Conference paper

DOI: 10.1007/11786986_49

Volume 4051 of the book series Lecture Notes in Computer Science (LNCS)
Cite this paper as:
Ferragina P., Giancarlo R., Manzini G. (2006) The Myriad Virtues of Wavelet Trees. In: Bugliesi M., Preneel B., Sassone V., Wegener I. (eds) Automata, Languages and Programming. ICALP 2006. Lecture Notes in Computer Science, vol 4051. Springer, Berlin, Heidelberg

Abstract

Wavelet Trees were introduced in [Grossi, Gupta and Vitter, SODA ’03] and were rapidly recognized as a very flexible tool for the design of compressed full-text indexes and data compressors. Although several papers have investigated the beauty and usefulness of this data structure in the full-text indexing scenario, its impact on data compression has not been fully explored. In this paper we provide a complete theoretical analysis of a wide class of compression algorithms based on Wavelet Trees. We also show how to improve their asymptotic performance by introducing a novel framework, called Generalized Wavelet Trees, that aims for the best combination of binary compressors (such as Run-Length encoders), non-binary compressors (such as Huffman and Arithmetic encoders), and Wavelet Trees of properly designed shapes. As a corollary, we prove high-order entropy bounds for the challenging combination of the Burrows-Wheeler Transform and Wavelet Trees.
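
For readers unfamiliar with the data structure, the following Python sketch may help. It builds a plain balanced binary wavelet tree and answers rank queries by walking from the root to a leaf. This is a textbook-style illustration under simplifying assumptions, not the Generalized Wavelet Trees framework of the paper: the names `WaveletTree`, `rank`, and `run_lengths` are hypothetical, the tree shape is fixed by splitting the sorted alphabet in half, and the per-node bit vectors are stored as uncompressed Python lists.

```python
from itertools import groupby


class WaveletTree:
    """Balanced binary wavelet tree (illustrative sketch, no compression)."""

    def __init__(self, text, alphabet=None):
        # Sorted list of the distinct symbols handled by this node.
        self.alphabet = sorted(set(text)) if alphabet is None else alphabet
        self.bits = []
        self.left = self.right = None
        if len(self.alphabet) <= 1:
            return  # leaf: every symbol routed here is the same
        mid = len(self.alphabet) // 2
        left_syms = set(self.alphabet[:mid])
        # One bit per symbol: 0 if it belongs to the left half, 1 otherwise.
        self.bits = [0 if c in left_syms else 1 for c in text]
        self.left = WaveletTree(
            [c for c in text if c in left_syms], self.alphabet[:mid])
        self.right = WaveletTree(
            [c for c in text if c not in left_syms], self.alphabet[mid:])

    def rank(self, symbol, i):
        """Return the number of occurrences of `symbol` in text[:i]."""
        if symbol not in self.alphabet:
            return 0
        node = self
        while node.left is not None:
            mid = len(node.alphabet) // 2
            bit = 0 if symbol in node.alphabet[:mid] else 1
            # Map position i into the child by counting matching bits.
            i = sum(1 for b in node.bits[:i] if b == bit)
            node = node.left if bit == 0 else node.right
        return i


def run_lengths(bits):
    """Lengths of the maximal runs in a bit list (input to a run-length coder)."""
    return [len(list(group)) for _, group in groupby(bits)]


wt = WaveletTree("abracadabra")
print(wt.bits)               # root bit vector
print(run_lengths(wt.bits))  # its run lengths
print(wt.rank("a", 11))      # occurrences of 'a' in the whole string
```

Each internal node holds a binary string, which is why binary compressors such as run-length encoders can be applied node by node; a practical implementation would replace the linear-time bit counting in `rank` with a succinct rank structure.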

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Paolo Ferragina (1)
  • Raffaele Giancarlo (2)
  • Giovanni Manzini (3)
  1. Dipartimento di Informatica, Università di Pisa, Italy
  2. Dipartimento di Matematica ed Applicazioni, Università di Palermo, Italy
  3. Dipartimento di Informatica, Università del Piemonte Orientale, Italy