Skip to main content

Compressed Suffix Array

  • Reference work entry
  • First Online:
Encyclopedia of Algorithms

Years and Authors of Summarized Original Work

  • 2000, 2003; Sadakane

  • 2000, 2005; Grossi, Vitter

  • 2000, 2005; Ferragina, Manzini

Problem Definition

Given a text string T = t1t2… t n over an alphabet \(\varSigma\) of size \(\sigma\), the suffix array A[1, n] is a permutation of the interval [1, n] that sorts the suffixes of T. More precisely, it satisfies T[A[i], n] < T[A[i + 1], n] for all 1 ≤ i < n, where “ < ” between strings is the lexicographical order. The suffix array is the canonical full-text index that allows to efficiently compute basic string matching queries on T.

The compressed suffix array (CSA) problem asks to replace A with a space-efficient data structure that is capable of efficiently computing A[i].

If a CSA does not require T to operate, and is capable of efficiently answering substring queries on T, it is called a self-index, as it can be seen as a replacement of T itself. Typical queries required from such an index are the following:

  • count(P): count how many times a...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 1,599.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 1,999.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Recommended Reading

  1. Belazzougui D, Navarro G (2011) Alphabet-independent compressed text indexing. In: ESA, Saarbrücken, pp 748–759

    MATH  Google Scholar 

  2. Burrows M, Wheeler D (1994) A block sorting lossless data compression algorithm. Technical report 124, Digital Equipment Corporation

    Google Scholar 

  3. Ferragina P, Manzini G (2005) Indexing compressed texts. J ACM 52(4):552–581

    Article  MathSciNet  MATH  Google Scholar 

  4. Ferragina P, Giancarlo R, Manzini G, Sciortino M (2005) Boosting textual compression in optimal linear time. J ACM 52(4):688–713

    Article  MathSciNet  MATH  Google Scholar 

  5. Ferragina P, Manzini G, Mäkinen V, Navarro G (2007) Compressed representations of sequences and full-text indexes. ACM Trans Algorithms 3(2):20

    Article  MathSciNet  MATH  Google Scholar 

  6. Foschini L, Grossi R, Gupta A, Vitter JS (2006) When indexing equals compression: experiments with compressing suffix arrays and applications. ACM Trans Algorithms 2(4):611–639

    Article  MathSciNet  Google Scholar 

  7. Golynski A, Raman R, Srinivasa Rao S (2008) On the redundancy of succinct data structures. In: SWAT, Gothenburg, pp 148–159

    MATH  Google Scholar 

  8. Grossi R, Vitter J (2006) Compressed suffix arrays and suffix trees with applications to text indexing and string matching. SIAM J Comput 35(2):378–407

    Article  MathSciNet  MATH  Google Scholar 

  9. Grossi R, Gupta A, Vitter J (2003) High-order entropy-compressed text indexes. In: Proceedings of the 14th annual ACM-SIAM symposium on discrete algorithms (SODA), Baltimore, pp 841–850

    Google Scholar 

  10. Kärkkäinen J, Puglisi SJ (2011) Fixed block compression boosting in fm-indexes. In: SPIRE, Pisa, pp 174–184

    Google Scholar 

  11. Mäkinen V, Navarro G (2005) Succinct suffix arrays based on run-length encoding. Nord J Comput 12(1):40–66

    MathSciNet  MATH  Google Scholar 

  12. Mäkinen V, Navarro G (2008) Dynamic entropy-compressed sequences and full-text indexes. ACM Trans Algorithms 4(3):32

    Article  MathSciNet  MATH  Google Scholar 

  13. Manber U, Myers G (1993) Suffix arrays: a new method for on-line string searches. SIAM J Comput 22(5):935–948

    Article  MathSciNet  MATH  Google Scholar 

  14. Manzini G (2001) An analysis of the Burrows-Wheeler transform. J ACM 48(3):407–430

    Article  MathSciNet  MATH  Google Scholar 

  15. Navarro G, Mäkinen V (2007) Compressed full-text indexes. ACM Comput Surv 39(1): Article 2

    Google Scholar 

  16. Sadakane K (2003) New text indexing functionalities of the compressed suffix arrays. J Algorithms 48(2):294–313

    Article  MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer Science+Business Media New York

About this entry

Cite this entry

Belazzougui, D., Mäkinen, V., Valenzuela, D. (2016). Compressed Suffix Array. In: Kao, MY. (eds) Encyclopedia of Algorithms. Springer, New York, NY. https://doi.org/10.1007/978-1-4939-2864-4_82

Download citation

Publish with us

Policies and ethics