Skip to main content

Range Non-overlapping Indexing and Successive List Indexing

  • Conference paper
Algorithms and Data Structures (WADS 2007)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4619))

Included in the following conference series:

Abstract

We present two natural variants of the indexing problem:

In the range non-overlapping indexing problem, we preprocess a given text to answer queries in which we are given a pattern, and wish to find a maximal-length sequence of occurrences of the pattern in the text, such that the occurrences do not overlap with one another. While efficiently solving this problem, our algorithm even enables us to efficiently perform so in substrings of the text, denoted by given start and end locations. The methods we supply thus generalize the string statistics problemĀ [4,5], in which we are asked to report merely the number of non-overlapping occurrences in the entire text, by reporting the occurrences themselves, even only for substrings of the text.

In the related successive list indexing problem, during query-time we are given a pattern and a list of locations in the preprocessed text. We then wish to find a list of occurrences of the pattern, such that the ith occurrence is the leftmost occurrence of the pattern which starts to the right of the ith location given by the input list.

Both problems are solved by using tools from computational geometry, specifically a variation of the range searching for minimum problem of Lenhof and SmidĀ [12], here considered over a grid, in what appears to be the first utilization of range searching for minimum in an indexing-related context.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agarwal, P., Erickson, J.: Geometric range searching and its relatives (1999)

    Google ScholarĀ 

  2. Alstrup, S., Brodal, G.S., Rauhe, T.: New data structures for orthogonal range searching. In: IEEE Symposium on Foundations of Computer Science, pp. 198ā€“207 (2000)

    Google ScholarĀ 

  3. Amir, A., Keselman, D., Landau, G.M., Lewenstein, M., Lewenstein, N., Rodeh, M.: Text indexing and dictionary matching with one error. J. AlgorithmsĀ 37(2), 309ā€“325 (2000)

    ArticleĀ  MATHĀ  MathSciNetĀ  Google ScholarĀ 

  4. Apostolico, A., Preparata, F.P.: Data structures and algorithms for the string statistics problem. AlgorithmicaĀ 15(5), 481ā€“494 (1996)

    ArticleĀ  MATHĀ  MathSciNetĀ  Google ScholarĀ 

  5. Brodal, G.S., LyngsĆø, R.B., Ɩstlin, A., Pedersen, C.N.S.: Solving the string statistics problem in time O(n log n). In: Widmayer, P., Triguero, F., Morales, R., Hennessy, M., Eidenbenz, S., Conejo, R. (eds.) ICALP 2002. LNCS, vol.Ā 2380, pp. 728ā€“739. Springer, Heidelberg (2002)

    ChapterĀ  Google ScholarĀ 

  6. Dietzfelbinger, M., Karlin, A.R., Mehlhorn, K., aufĀ der Heide, F.M., Rohnert, H., Tarjan, R.E.: Dynamic perfect hashing: Upper and lower bounds. vol. 23, pp. 738ā€“761, Philadelphia, PA, USA, Society for Industrial and Applied Mathematics (1994)

    Google ScholarĀ 

  7. Farach, M.: Optimal suffix tree construction with large alphabets. In: FOCS 1997. Proceedings of the 38th Annual Symposium on Foundations of Computer Science, Washington, DC, USA, p. 137. IEEE Computer Society Press, Los Alamitos (1997)

    Google ScholarĀ 

  8. Ferragina, P.: Dynamic text indexing under string updates. J. AlgorithmsĀ 22(2), 296ā€“328 (1997)

    ArticleĀ  MATHĀ  MathSciNetĀ  Google ScholarĀ 

  9. Ferragina, P., Muthukrishnan, S., de Berg, M.: Multi-method dispatching: A geometric approach with applications to string matching problems. In: STOC, pp. 483ā€“491 (1999)

    Google ScholarĀ 

  10. Gabow, H.N., Bentley, J.L., Tarjan, R.E.: Scaling and related techniques for geometry problems. In: STOC 1984. Proceedings of the sixteenth annual ACM symposium on Theory of computing, pp. 135ā€“143. ACM Press, New York (1984)

    ChapterĀ  Google ScholarĀ 

  11. Knuth, D., Morris, J.H., Pratt, V.: Fast pattern matching in strings. SIAM Journal on ComputingĀ 6(2), 323ā€“350 (1977)

    ArticleĀ  MATHĀ  MathSciNetĀ  Google ScholarĀ 

  12. Lenhof, H.-P., Smid, M.: Using persistent data structures for adding range restrictions to searching problems. RAIRO Theoretical Informatics and ApplicationsĀ 28, 25ā€“49 (1994)

    MATHĀ  MathSciNetĀ  Google ScholarĀ 

  13. McCreight, E.M.: A space-economical suffix tree construction algorithm. J. ACMĀ 23(2), 262ā€“272 (1976)

    ArticleĀ  MATHĀ  MathSciNetĀ  Google ScholarĀ 

  14. Ukkonen, E.: On-line construction of suffix trees. AlgorithmicaĀ 14(3), 249ā€“260 (1995)

    ArticleĀ  MATHĀ  MathSciNetĀ  Google ScholarĀ 

  15. Weiner, P.: Linear pattern matching algorithms. In: 14th Annual Symposium on Switching and Automata Theory, pp. 1ā€“11. IEEE, New York (1973)

    ChapterĀ  Google ScholarĀ 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Frank Dehne Jƶrg-RĆ¼diger Sack Norbert Zeh

Rights and permissions

Reprints and permissions

Copyright information

Ā© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Keller, O., Kopelowitz, T., Lewenstein, M. (2007). Range Non-overlapping Indexing and Successive List Indexing. In: Dehne, F., Sack, JR., Zeh, N. (eds) Algorithms and Data Structures. WADS 2007. Lecture Notes in Computer Science, vol 4619. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73951-7_54

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-73951-7_54

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-73948-7

  • Online ISBN: 978-3-540-73951-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics