Abstract
We consider the problem of storing a grammar of size n compressing a string of size N, and a set of positions \(\{i_1,\ldots ,i_b\}\) (bookmarks) such that any substring of length l crossing one of the positions can be decompressed in O(l) time. Our solution uses space \(O((n+b)\max \{1,\log ^* n - \log ^*(\frac{n}{b} + \frac{b}{n} )\})\). Existing solutions for the bookmarking problem either require more space or a super-constant “kick-off” time to start the decompression.
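To get a feel for the space bound, the following sketch evaluates the factor \(\max \{1,\log ^* n - \log ^*(\frac{n}{b} + \frac{b}{n})\}\) for concrete values of n and b. It assumes that \(\log ^*\) is the base-2 iterated logarithm (the number of times \(\log _2\) must be applied before the value drops to at most 1). The function names are illustrative and do not come from the paper.

```python
import math


def log_star(x: float) -> int:
    """Iterated logarithm, assuming base 2: how many times log2 must be
    applied to x before the result is at most 1."""
    count = 0
    while x > 1:
        x = math.log2(x)
        count += 1
    return count


def space_factor(n: int, b: int) -> int:
    """The multiplicative factor max{1, log* n - log*(n/b + b/n)} from the
    space bound O((n + b) * factor). Hypothetical helper for illustration."""
    return max(1, log_star(n) - log_star(n / b + b / n))


if __name__ == "__main__":
    n = 65536
    for b in (256, n):
        print(f"n={n}, b={b}: factor={space_factor(n, b)}")
```

Under this assumption the factor is 1 when b is much smaller than n (for example n = 65536, b = 256), so the space is O(n + b). It grows towards \(\log ^* n\) only when b is close to n, since \(\frac{n}{b} + \frac{b}{n}\) is then close to 2.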
P.H. Cording—Supported by the Danish Research Council under the Sapere Aude Program (DFF 4005-00267).
Notes
1. The bound is in fact \(O(z+b\log ^* N)\), where z is the size of the LZ77 parse of \(S\). Since it is known that \(z\le n' \le n\) [12], where \(n'\) is the size of the smallest SLP generating \(S\), we replace z by n for clarity.
References
1. Belazzougui, D., Cording, P.H., Puglisi, S.J., Tabei, Y.: Access, rank, and select in grammar-compressed strings. In: Bansal, N., Finocchi, I. (eds.) ESA 2015. LNCS, vol. 9294, pp. 142–154. Springer, Heidelberg (2015)
2. Belazzougui, D., Gagie, T., Gawrychowski, P., Kärkkäinen, J., Ordóñez, A., Puglisi, S.J., Tabei, Y.: Queries on LZ-bounded encodings. In: DCC, pp. 83–92 (2014)
3. Bille, P., Christiansen, A.R., Cording, P.H., Gørtz, I.L.: Finger search in grammar-compressed strings (2015). CoRR arXiv:1507.02853
4. Bille, P., Landau, G.M., Raman, R., Sadakane, K., Satti, S.R., Weimann, O.: Random access to grammar-compressed strings and trees. SIAM J. Comput. 44(3), 513–539 (2015)
5. Charikar, M., Lehman, E., Liu, D., Panigrahy, R., Prabhakaran, M., Sahai, A., Shelat, A.: The smallest grammar problem. IEEE Trans. Inf. Theor. 51(7), 2554–2576 (2005)
6. Gagie, T., Gawrychowski, P., Kärkkäinen, J., Nekrich, Y., Puglisi, S.J.: LZ77-based self-indexing with faster pattern matching. In: Pardo, A., Viola, A. (eds.) LATIN 2014. LNCS, vol. 8392, pp. 731–742. Springer, Heidelberg (2014)
7. Gawrychowski, P.: Faster algorithm for computing the edit distance between SLP-compressed strings. In: Calderón-Benavides, L., González-Caro, C., Chávez, E., Ziviani, N. (eds.) SPIRE 2012. LNCS, vol. 7608, pp. 229–236. Springer, Heidelberg (2012)
8. Knuth, D.E.: Mathematics and computer science: coping with finiteness. Science 194(4271), 1235–1242 (1976)
9. Larsson, N.J., Moffat, A.: Off-line dictionary-based compression. Proc. IEEE 88(11), 1722–1732 (2000)
10. Miyazaki, M., Shinohara, A., Takeda, M.: An improved pattern matching algorithm for strings in terms of straight-line programs. In: Apostolico, A., Hein, J. (eds.) CPM 1997. LNCS, vol. 1264, pp. 1–11. Springer, Heidelberg (1997)
11. Nevill-Manning, C.G., Witten, I.H.: Identifying hierarchical structure in sequences: a linear-time algorithm. J. Artif. Intell. Res. (JAIR) 7, 67–82 (1997)
12. Rytter, W.: Application of Lempel-Ziv factorization to the approximation of grammar-based compression. Theor. Comput. Sci. 302(1), 211–222 (2003)
13. Welch, T.A.: A technique for high-performance data compression. Computer 17(6), 8–19 (1984)
14. Ziv, J., Lempel, A.: A universal algorithm for sequential data compression. IEEE Trans. Inf. Theor. 23(3), 337–343 (1977)
15. Ziv, J., Lempel, A.: Compression of individual sequences via variable-rate coding. IEEE Trans. Inf. Theor. 24(5), 530–536 (1978)
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Cording, P.H., Gawrychowski, P., Weimann, O. (2016). Bookmarks in Grammar-Compressed Strings. In: Inenaga, S., Sadakane, K., Sakai, T. (eds) String Processing and Information Retrieval. SPIRE 2016. Lecture Notes in Computer Science, vol 9954. Springer, Cham. https://doi.org/10.1007/978-3-319-46049-9_15
DOI: https://doi.org/10.1007/978-3-319-46049-9_15
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-46048-2
Online ISBN: 978-3-319-46049-9
eBook Packages: Computer Science, Computer Science (R0)