Abstract
We show that the size \(\gamma (t_n)\) of the smallest string attractor of the n-th Thue-Morse word \(t_n\) is 4 for any \(n\ge 4\), disproving the conjecture by Mantaci et al. [ICTCS 2019] that it is n. We also show that \(\delta (t_n) = \frac{10}{3+2^{4-n}}\) for \(n \ge 3\), where \(\delta (w)\) is the maximum over all \(k = 1,\ldots ,|w|\), the number of distinct substrings of length k in w divided by k, which is a measure of repetitiveness recently studied by Kociumaka et al. [LATIN 2020]. Furthermore, we show that the number \(z(t_n)\) of factors in the self-referencing Lempel-Ziv factorization of \(t_n\) is exactly 2n.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Berstel, J., Savelli, A.: Crochemore factorization of Sturmian and other infinite words. In: Královič, R., Urzyczyn, P. (eds.) MFCS 2006. LNCS, vol. 4162, pp. 157–166. Springer, Heidelberg (2006). https://doi.org/10.1007/11821069_14
Brlek, S.: Enumeration of factors in the Thue-Morse word. Discrete Appl. Math. 24(1), 83–96 (1989). https://doi.org/10.1016/0166-218X(92)90274-E
Brlek, S., Frosini, A., Mancini, I., Pergola, E., Rinaldi, S.: Burrows-Wheeler transform of words defined by morphisms. In: Colbourn, C.J., Grossi, R., Pisanti, N. (eds.) IWOCA 2019. LNCS, vol. 11638, pp. 393–404. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-25005-8_32
Burrows, M., Wheeler, D.J.: A block-sorting lossless data compression algorithm. SRC Research Report 124 (1994)
Charikar, M., et al.: The smallest grammar problem. IEEE Trans. Inf. Theory 51(7), 2554–2576 (2005). https://doi.org/10.1109/TIT.2005.850116
Chen, K.T., Fox, R.H., Lyndon, R.C.: Free differential calculus, IV. The quotient groups of the lower central series. Ann. Math. 68(1), 81–95 (1958). http://www.jstor.org/stable/1970044
Christiansen, A.R., Ettienne, M.B., Kociumaka, T., Navarro, G., Prezza, N.: Optimal-time dictionary-compressed indexes (2019). http://arxiv.org/abs/1811.12779v6
I, T., Nakashima, Y., Inenaga, S., Bannai, H., Takeda, M.: Faster Lyndon factorization algorithms for SLP and LZ78 compressed text. In: Kurland, O., Lewenstein, M., Porat, E. (eds.) SPIRE 2013. LNCS, vol. 8214, pp. 174–185. Springer, Cham (2013). https://doi.org/10.1007/978-3-319-02432-5_21
Ido, A., Melançon, G.: Lyndon factorization of the Thue-Morse word and its relatives. Discrete Math. Theor. Comput. Sci. 1(1), 43–52 (1997). http://dmtcs.episciences.org/233
Kärkkäinen, J., Kempa, D., Nakashima, Y., Puglisi, S.J., Shur, A.M.: On the size of Lempel-Ziv and Lyndon factorizations. In: Vollmer, H., Vallée, B. (eds.) 34th Symposium on Theoretical Aspects of Computer Science, STACS 2017. LIPIcs, Hannover, Germany, 8–11 March 2017, vol. 66, pp. 45:1–45:13. Schloss Dagstuhl - Leibniz-Zentrum für Informatik (2017). https://doi.org/10.4230/LIPIcs.STACS.2017.45
Kempa, D., Prezza, N.: At the roots of dictionary compression: string attractors. In: Diakonikolas, I., Kempe, D., Henzinger, M. (eds.) Proceedings of the 50th Annual ACM SIGACT Symposium on Theory of Computing (STOC 2018), pp. 827–840. ACM (2018). https://doi.org/10.1145/3188745.3188814
Kociumaka, T., Navarro, G., Prezza, N.: Towards a definitive measure of repetitiveness. In: Proceedings of the 14th Latin American Symposium on Theoretical Informatics (LATIN) (2020, to appear). https://arxiv.org/abs/1910.02151
Lempel, A., Ziv, J.: On the complexity of finite sequences. IEEE Trans. Inf. Theory 22(1), 75–81 (1976). https://doi.org/10.1109/TIT.1976.1055501
Mantaci, S., Restivo, A., Romana, G., Rosone, G., Sciortino, M.: String attractors and combinatorics on words. In: Proceedings of the 20th Italian Conference on Theoretical Computer Science (ICTCS 2019), pp. 57–71 (2019). http://ceur-ws.org/Vol-2504/paper8.pdf
Mantaci, S., Restivo, A., Sciortino, M.: Burrows-Wheeler transform and Sturmian words. Inf. Process. Lett. 86(5), 241–246 (2003). https://doi.org/10.1016/S0020-0190(02)00512-4
Morse, M.: Recurrent geodesics on a surface of negative curvature. Trans. Am. Math. Soc. 22, 84–100 (1921)
Prouhet, E.: Mémoire sur quelques relations entre les puissances des nombres. CR Acad. Sci. Paris Sér. 133, 225 (1851)
Raskhodnikova, S., Ron, D., Rubinfeld, R., Smith, A.D.: Sublinear algorithms for approximating string compressibility. Algorithmica 65(3), 685–709 (2013). https://doi.org/10.1007/s00453-012-9618-6
Storer, J.A., Szymanski, T.G.: Data compression via textual substitution. J. ACM 29(4), 928–951 (1982). https://doi.org/10.1145/322344.322346
Thue, A.: Über unendliche zeichenreihen. Norske vid. Selsk. Skr. Mat. Nat. Kl. 7, 1–22 (1906)
Urabe, Y., Nakashima, Y., Inenaga, S., Bannai, H., Takeda, M.: On the size of overlapping Lempel-Ziv and Lyndon factorizations. In: 30th Annual Symposium on Combinatorial Pattern Matching (CPM 2019), pp. 29:1–29:11 (2019). https://doi.org/10.4230/LIPIcs.CPM.2019.29
Acknowledgments
This work was supported by JSPS KAKENHI Grant Numbers JP18K18002 (YN), JP17H01697 (SI), JP16H02783, JP20H04141 (HB), JP18H04098 (MT), and JST PRESTO Grant Number JPMJPR1922 (SI).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Kutsukake, K., Matsumoto, T., Nakashima, Y., Inenaga, S., Bannai, H., Takeda, M. (2020). On Repetitiveness Measures of Thue-Morse Words. In: Boucher, C., Thankachan, S.V. (eds) String Processing and Information Retrieval. SPIRE 2020. Lecture Notes in Computer Science(), vol 12303. Springer, Cham. https://doi.org/10.1007/978-3-030-59212-7_15
Download citation
DOI: https://doi.org/10.1007/978-3-030-59212-7_15
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-59211-0
Online ISBN: 978-3-030-59212-7
eBook Packages: Computer ScienceComputer Science (R0)