Efficient Query Evaluation over Compressed XML Data

  • Andrei Arion
  • Angela Bonifati
  • Gianni Costa
  • Sandra D’Aguanno
  • Ioana Manolescu
  • Andrea Pugliese
Conference paper

DOI: 10.1007/978-3-540-24741-8_13

Part of the Lecture Notes in Computer Science book series (LNCS, volume 2992)
Cite this paper as:
Arion A., Bonifati A., Costa G., D’Aguanno S., Manolescu I., Pugliese A. (2004) Efficient Query Evaluation over Compressed XML Data. In: Bertino E. et al. (eds) Advances in Database Technology - EDBT 2004. EDBT 2004. Lecture Notes in Computer Science, vol 2992. Springer, Berlin, Heidelberg

Abstract

XML suffers from the major limitation of high redundancy. Even if compression can be beneficial for XML data, however, once compressed, the data can be seldom browsed and queried in an efficient way. To address this problem, we propose XQueC, an [XQue]ry processor and [C]ompressor, which covers a large set of XQuery queries in the compressed domain. We shred compressed XML into suitable data structures, aiming at both reducing memory usage at query time and querying data while compressed. XQueC is the first system to take advantage of a query workload to choose the compression algorithms, and to group the compressed data granules according to their common properties. By means of experiments, we show that good trade-offs between compression ratio and query capability can be achieved in several real cases, as those covered by an XML benchmark. On average, XQueC improves over previous XML query-aware compression systems, still being reasonably closer to general-purpose query-unaware XML compressors. Finally, QETs for a wide variety of queries show that XQueC can reach speed comparable to XQuery engines on uncompressed data.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2004

Authors and Affiliations

  • Andrei Arion
    • 1
  • Angela Bonifati
    • 2
  • Gianni Costa
    • 2
  • Sandra D’Aguanno
    • 1
  • Ioana Manolescu
    • 1
  • Andrea Pugliese
    • 3
  1. 1.INRIA Futurs, Parc Club Orsay-UniversiteOrsay CedexFrance
  2. 2.Icar-CNRRende (CS)Italy
  3. 3.DEISUniversity of CalabriaRende(CS)Italy

Personalised recommendations