I/O-Efficient Dictionary Search with One Edit Error

  • Conference paper
String Processing and Information Retrieval (SPIRE 2014)

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 8799)

Abstract

This paper studies the 1-error dictionary search problem in external memory. The input is a set D of strings whose characters are drawn from a constant-size alphabet. Given a string q, a query reports the ids of all strings in D that are within edit distance 1 of q. We give a structure occupying O(n/B) blocks that answers a query in \(O(1 + \frac{m}{wB} + \frac{k}{B})\) I/Os, where n is the total length of all strings in D, m is the length of q, k is the number of ids reported, w is the size of a machine word, and B is the number of words in a block.
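
To make the query semantics concrete, the following is a minimal in-memory sketch in Python of what a 1-error query is expected to return. It is a brute-force baseline that scans every string, not the external-memory structure described in the abstract, and it does not achieve the stated I/O bound. The function names `within_one_edit` and `search_one_error` are hypothetical, and the sketch assumes the usual edit operations (one insertion, deletion, or substitution) and that a string's id is its position in the input list.

```python
from typing import List


def within_one_edit(s: str, q: str) -> bool:
    """Return True if the edit distance between s and q is at most 1."""
    if abs(len(s) - len(q)) > 1:
        return False
    if len(s) > len(q):
        s, q = q, s  # ensure len(s) <= len(q)
    # Skip the longest common prefix of s and q.
    i = 0
    while i < len(s) and s[i] == q[i]:
        i += 1
    if len(s) == len(q):
        # Equal lengths: at most one substitution, at position i.
        return s[i + 1:] == q[i + 1:]
    # q is one character longer: q[i] must be the single extra character.
    return s[i:] == q[i + 1:]


def search_one_error(dictionary: List[str], q: str) -> List[int]:
    """Report the ids (assumed here to be list positions) of all
    dictionary strings within edit distance 1 of q, by linear scan."""
    return [i for i, s in enumerate(dictionary) if within_one_edit(s, q)]


if __name__ == "__main__":
    D = ["cat", "cart", "cut", "dog", "at", "coat"]
    print(search_one_error(D, "cat"))  # prints [0, 1, 2, 4, 5]
```

The scan touches all n characters of D on every query, so it serves only as a reference for checking the output of an indexed solution, whose cost should depend on m and k rather than on n.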

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Chung, CW., Tao, Y., Wang, W. (2014). I/O-Efficient Dictionary Search with One Edit Error. In: Moura, E., Crochemore, M. (eds) String Processing and Information Retrieval. SPIRE 2014. Lecture Notes in Computer Science, vol 8799. Springer, Cham. https://doi.org/10.1007/978-3-319-11918-2_19

  • DOI: https://doi.org/10.1007/978-3-319-11918-2_19

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-11917-5

  • Online ISBN: 978-3-319-11918-2

  • eBook Packages: Computer Science (R0)
