Skip to main content

Accelerating Boyer Moore Searches on Binary Texts

  • Conference paper
Implementation and Application of Automata (CIAA 2007)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4783))

Included in the following conference series:

Abstract

The Boyer and Moore (BM) pattern matching algorithm is considered as one of the best, but its performance is reduced on binary data. Yet, searching in binary texts has important applications, such as compressed matching. The paper shows how, by means of some pre-computed tables, one may implement the BM algorithm also for the binary case without referring to bits, and processing only entire blocks such as bytes or words, thereby significantly reducing the number of comparisons. Empirical comparisons show that the new variant performs better than regular binary BM and even than BDM.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Knuth, D.E., Morris, J.H., Pratt, V.R.: Fast pattern matching in strings. SIAM J. Comput. 6, 323–350 (1977)

    Article  MATH  MathSciNet  Google Scholar 

  2. Boyer, R.S., Moore, J.S.: A fast string searching algorithm. Commun. ACM 20, 762–772 (1977)

    Article  Google Scholar 

  3. Crochemore, M., Czumaj, A., Gasieniec, L., Jarominek, S., Lecroq, T., Plandowski, W., Rytter, W.: Speeding up two string-matching algorithms. Algorithmica 12, 247–267 (1994)

    Article  MATH  MathSciNet  Google Scholar 

  4. de Moura, E.S., Navarro, G., Ziviani, N., Baeza-Yates, R.A.: Fast and flexible word searching on compressed text. ACM Transactions on Information Systems 18, 113–139 (2000)

    Article  Google Scholar 

  5. Brisaboa, N.R., Farina, A., Navarro, G., Esteller, M.F.: (s,c)-dense coding: An optimized compression code for natural language text databases. In: Nascimento, M.A., de Moura, E.S., Oliveira, A.L. (eds.) SPIRE 2003. LNCS, vol. 2857, pp. 122–136. Springer, Heidelberg (2003)

    Google Scholar 

  6. Choueka, Y., Klein, S.T., Perl, Y.: Efficient variants of Huffman codes in high level languages. In: SIGIR 1985. Proceedings of the 8th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 122–130. ACM Press, New York (1985)

    Chapter  Google Scholar 

  7. Fredriksson, K.: Faster string matching with super-alphabets. In: Laender, A.H.F., Oliveira, A.L. (eds.) SPIRE 2002. LNCS, vol. 2476, pp. 44–57. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  8. Navarro, G., Tarhio, J.: Boyer-Moore string matching over Ziv-Lempel compressed text, pp. 166–180 (2000)

    Google Scholar 

  9. Shibata, Y., Matsumoto, T., Takeda, M., Shinohara, A., Arikawa, S.: A Boyer-Moore type algorithm for compressed pattern matching. In: Giancarlo, R., Sankoff, D. (eds.) CPM 2000. LNCS, vol. 1848, pp. 181–194. Springer, Heidelberg (2000)

    Google Scholar 

  10. Bell, T., Powell, M., Mukherjee, A., Adjeroh, D.: Searching BWT compressed text with the Boyer-Moore algorithm and binary search. In: DCC 2002. Proceedings of the Data Compression Conference (DCC 2002), pp. 112–121. IEEE Computer Society Press, Washington, DC, USA (2002)

    Google Scholar 

  11. Klein, S.T., Bookstein, A., Deerwester, S.: Storing text retrieval systems on CD-ROM: compression and encryption considerations. ACM Trans. Inf. Syst. 7, 230–245 (1989)

    Article  Google Scholar 

  12. Crochemore, M., Rytter, W.: Text algorithms. Oxford University Press, Inc., New York (1994)

    MATH  Google Scholar 

  13. Horspool, R.N.: Practical fast searching in strings. Software Practice and Experience 10, 501–506 (1980)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Jan Holub Jan Žďárek

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Klein, S.T., Kopel Ben-Nissan, M. (2007). Accelerating Boyer Moore Searches on Binary Texts. In: Holub, J., Žďárek, J. (eds) Implementation and Application of Automata. CIAA 2007. Lecture Notes in Computer Science, vol 4783. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-76336-9_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-76336-9_14

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-76335-2

  • Online ISBN: 978-3-540-76336-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics