Abstract
Grammar-based compressors like e.g. CluX [1], BPLEX [2], TreeRePAIR [3] transform an XML tree X into a context-free straight-line linear tree (CSLT) grammar G and yield strong compression ratios compared to other classes of XML-specific compressors. However, CSLT grammars have the disadvantage that simulating on G update operations like inserting, deleting, or re-labeling a node V of X requires to isolate the path from X’s root to V from all the paths represented by G. Usually, this leads to an increased redundancy within G, as grammar rules are copied and modified, but the original and the modified grammar rules often differ only slightly. In this paper, we propose extended context-free straight-line tree (ECST) grammars that allow reducing the redundancy created by path isolation. Furthermore, we show how to query and how to update ECST compressed grammars.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Böttcher, S., Hartel, R., Krislin, C.: CluX - Clustering XML sub-trees. In: ICEIS 2010, Funchal, Madeira, Portugal (2010)
Busatto, G., Lohrey, M., Maneth, S.: Efficient memory representation of XML documents. In: Bierman, G., Koch, C. (eds.) DBPL 2005. LNCS, vol. 3774, pp. 199–216. Springer, Heidelberg (2005)
Lohrey, M., Maneth, S., Mennicke, R.: Tree structure compression with repair. In: DCC 2011, Snowbird, UT, USA (2011)
Buneman, P., Grohe, M., Koch, C.: Path queries on compressed XML. In: VLDB 2003, Berlin, Germany (2003)
Bätz, A., Böttcher, S., Hartel, R.: Updates on grammar-compressed XML data. In: Fernandes, A.A.A., Gray, A.J.G., Belhajjame, K. (eds.) BNCOD 2011. LNCS, vol. 7051, pp. 154–166. Springer, Heidelberg (2011)
Böttcher, S., Hartel, R., Jacobs, T.: Fast multi-update operations on compressed XML data. In: Gottlob, G., Grasso, G., Olteanu, D., Schallhart, C. (eds.) BNCOD 2013. LNCS, vol. 7968, pp. 149–164. Springer, Heidelberg (2013)
Böttcher, S., Steinmetz, R.: Evaluating XPath queries on XML data streams. In: Cooper, R., Kennedy, J. (eds.) BNCOD 2007. LNCS, vol. 4587, pp. 101–113. Springer, Heidelberg (2007)
Zhang, N., Kacholia, V., Özsu, M.: A succinct physical storage scheme for efficient evaluation of path queries in XML. In: ICDE 2004, Boston, MA, USA (2004)
Cheney, J.: Compressing XML with multiplexed hierarchical PPM models. In: DCC 2001, Snowbird, Utah, USA (2001)
Girardot, M., Sundaresan, N.: Millau: an encoding format for efficient representation and exchange of XML over the Web. Comput. Netw. 33, 747–765 (2000)
Liefke, H., Suciu, D.: XMILL: an efficient compressor for XML data. In: SIGMOD 2000, Dallas, Texas, USA (2000)
Min, J.-K., Park, M.-J., Chung, C.-W.: XPRESS: a queriable compression for XML data. In: SIGMOD 2003, San Diego, California, USA (2003)
Tolani, P., Haritsa, J.: XGRIND: a query-friendly XML compressor. In: ICDE 2002, San Jose, CA (2002)
Ng, W., Lam, W., Wood, P., Levene, M.: XCQ: a queriable XML compression system. Knowl. Inf. Syst. 10, 421–452 (2006)
Werner, C., Buschmann, C., Brandt, Y., Fischer, S.: Compressing SOAP messages by using pushdown automata. In: ICWS 2006, Chicago, Illinois, USA (2006)
Böttcher, S., Hartel, R., Messinger, C.: XML stream data reduction by shared KST signatures. In: HICSS-42 2009, Waikoloa, Big Island, HI, USA (2009)
Cheng, J., Ng, W.: XQzip: querying compressed XML using structural indexing. In: Bertino, E., Christodoulakis, S., Plexousakis, D., Christophides, V., Koubarakis, M., Böhm, K. (eds.) EDBT 2004. LNCS, vol. 2992, pp. 219–236. Springer, Heidelberg (2004)
Adiego, J., Navarro, G., Fuente, P.: Lempel-ziv compression of structured text. In: DCC 2004, Snowbird, UT, USA (2004)
Fisher, D., Maneth, S.: Structural selectivity estimation for XML documents. In: ICDE 2007, Istanbul, Turkey (2007)
Fisher, D., Maneth, S.: Selectivity Estimation. Patent WO 2007/134407 A1, May 2007
Kobayashi, N., Matsuda, K., Shinohara, A., Yaguchi, K.: Functional programs as compressed data. High.-Order Symbolic Comput. 25(1), 39–84 (2012)
Böttcher, S., Hartel, R., Jacobs, T., Maneth, S.: OnlineRePair: a recompressor for XML structures. In: Poster Paper, DCC, Snow Bird, Utah, USA (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Böttcher, S., Hartel, R., Jacobs, T., Jeromin, M. (2015). ECST – Extended Context-Free Straight-Line Tree Grammars. In: Maneth, S. (eds) Data Science. BICOD 2015. Lecture Notes in Computer Science(), vol 9147. Springer, Cham. https://doi.org/10.1007/978-3-319-20424-6_18
Download citation
DOI: https://doi.org/10.1007/978-3-319-20424-6_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-20423-9
Online ISBN: 978-3-319-20424-6
eBook Packages: Computer ScienceComputer Science (R0)