Skip to main content

Updates on Grammar-Compressed XML Data

  • Conference paper
Advances in Databases (BNCOD 2011)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7051))

Included in the following conference series:

Abstract

In this paper, we present updates on CluX, a grammar-based XML compression approach based on clustering XML sub-trees. We show that updates on CluX-compressed data can be performed faster than decompressing the data, loading it into main memory and compressing it. Furthermore, we show how to support fast multiple updates, e.g. performing 100 updates in parallel is more than 70 times faster than 100 single updates.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 54.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 69.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Zhang, N., Kacholia, V., Özsu, M.: A Succinct Physical Storage Scheme for Efficient Evaluation of Path Queries in XML. In: Proceedings of the 20th International Conference on Data Engineering, ICDE 2004, Boston, MA, USA, pp. 54–65 (2004)

    Google Scholar 

  2. Ng, W., Lam, W., Wood, P., Levene, M.: XCQ: A queriable XML compression system. Knowl. Inf. Syst., 421–452 (2006)

    Google Scholar 

  3. Werner, C., Buschmann, C., Brandt, Y., Fischer, S.: Compressing SOAP Messages by using Pushdown Automata. In: 2006 IEEE International Conference on Web Services (ICWS 2006), Chicago, Illinois, USA, pp.19–28 (2006)

    Google Scholar 

  4. Buneman, P., Grohe, M., Koch, C.: Path Queries on Compressed XML. In: Proceedings of 29th International Conference on Very Large Data Bases, Berlin, Germany, pp. 141–152 (2003)

    Google Scholar 

  5. Busatto, G., Lohrey, M., Maneth, S.: Efficient Memory Representation of XML Documents. In: Bierman, G., Koch, C. (eds.) DBPL 2005. LNCS, vol. 3774, pp. 199–216. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  6. Cheney, J.: Compressing XML with Multiplexed Hierarchical PPM Models. In: Proceedings of the IEEE Data Compression Conference (DCC 2001), Snowbird, Utah, USA, p. 163 (2001)

    Google Scholar 

  7. Girardot, M., Sundaresan, N.: Millau: an encoding format for efficient representation and exchange of XML over the Web. Computer Networks 33, 747–765 (2000)

    Article  Google Scholar 

  8. Liefke, H., Suciu, D.: XMILL: An Efficient Compressor for XML Data. In: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, Dallas, Texas, USA, pp. 153–164 (2000)

    Google Scholar 

  9. Min, J.-K., Park, M.-J., Chung, C.-W.: XPRESS: A Queriable Compression for XML Data. In: Halevy, A., Ives, Z., Doan, A. (eds.) Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, San Diego, California, USA, pp. 122–133 (2003)

    Google Scholar 

  10. Böttcher, S., Hartel, R., Messinger, C.: XML Stream Data Reduction by Shared KST Signatures. In: 42st Hawaii International International Conference on Systems Science (HICSS-42 2009), Proceedings (CD-ROM and online), Waikoloa, Big Island, HI, USA, pp. 1–10 (2009)

    Google Scholar 

  11. Cheng, J., Ng, W.: XQzip: Querying Compressed XML Using Structural Indexing. In: Hwang, J., Christodoulakis, S., Plexousakis, D., Christophides, V., Koubarakis, M., Böhm, K. (eds.) EDBT 2004. LNCS, vol. 2992, pp. 219–236. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  12. Fisher, D., Maneth, S.: Structural Selectivity Estimation for XML Documents. In: Proceedings of the 23rd International Conference on Data Engineering, ICDE 2007, Istanbul, Turkey, pp. 626–635 (2007)

    Google Scholar 

  13. Bayardo Jr., R., Gruhl, D., Josifovski, V., Myllymaki, J.: An evaluation of binary XML encoding optimizations for fast stream based xml processing. In: Feldman, S., Uretsky, M., Najork, M., Wills, C. (eds.) Proceedings of the 13th International Conference on World Wide Web, New York, NY, USA, pp. 345–354 (2004)

    Google Scholar 

  14. Tolani, P., Haritsa, J.: XGRIND: A Query-Friendly XML Compressor. In: Proceedings of the 18th International Conference on Data, ICDE, San Jose, CA, pp. 225–234 (2002)

    Google Scholar 

  15. Subramanian, H., Shankar, P.: Compressing XML Documents Using Recursive Finite State Automata. In: Farré, J., Litovsky, I., Schmitz, S. (eds.) CIAA 2005. LNCS, vol. 3845, pp. 282–293. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  16. Adiego, J., Navarro, G., Fuente, P.: Lempel-Ziv Compression of Structured Text. In: Data Compression Conference, Snowbird, UT, USA, pp. 112–121 (2004)

    Google Scholar 

  17. Böttcher, S., Hartel, R., Krislin, C.: CluX - Clustering XML Sub-trees. In: ICEIS 2010 - Proceedings of the 12th International Conference on Enterprise Information Systems, Funchal, Madeira, Portugal, pp. 142–150 (2010)

    Google Scholar 

  18. Damien, F., Maneth, S.: Selectivity Estimation. Patent WO 2007/134407 A1 (May 2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bätz, A., Böttcher, S., Hartel, R. (2011). Updates on Grammar-Compressed XML Data. In: Fernandes, A.A.A., Gray, A.J.G., Belhajjame, K. (eds) Advances in Databases. BNCOD 2011. Lecture Notes in Computer Science, vol 7051. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24577-0_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-24577-0_17

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-24576-3

  • Online ISBN: 978-3-642-24577-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics