
An algorithm for locating non-overlapping regions of maximum alignment score

  • Conference paper
Combinatorial Pattern Matching (CPM 1993)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 684))

Abstract

In this paper we present an $O(N^2 \log^2 N)$ algorithm for finding the two non-overlapping substrings of a given string of length $N$ which have the highest-scoring alignment between them. This significantly improves the previously best known bound of $O(N^3)$ for the worst-case complexity of this problem. One of the central ideas in the design of this algorithm is that of partitioning a matrix into pieces in such a way that all submatrices of interest for this problem can be put together as the union of very few of these pieces. Other ideas include the use of candidate-lists, an application of the ideas of Apostolico et al. [1] to our problem domain, and divide and conquer techniques.
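For orientation, the problem statement can be illustrated with a straightforward cubic-time baseline. This is a minimal sketch and not the algorithm of the paper. It relies on the observation that two non-overlapping substrings are always separated by some split point, so it tries every split and runs a Smith-Waterman style local alignment [6] between the prefix and the suffix. The function names and the scoring scheme (match +1, mismatch -1, gap -1) are assumptions made for illustration only.

```python
def local_alignment_score(a, b, match=1, mismatch=-1, gap=-1):
    """Best local alignment score between a and b (score only).

    Smith-Waterman style dynamic program with linear gap costs.
    The default scoring values are illustrative assumptions.
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            curr[j] = max(
                0,                  # start a new local alignment here
                prev[j - 1] + sub,  # align a[i-1] with b[j-1]
                prev[j] + gap,      # a[i-1] aligned to a gap
                curr[j - 1] + gap,  # b[j-1] aligned to a gap
            )
            best = max(best, curr[j])
        prev = curr
    return best


def best_nonoverlapping_score(s, **scoring):
    """Naive baseline: try every split point k of s.

    Any pair of non-overlapping substrings lies on opposite sides of
    some split, so the answer is the best local alignment score of
    s[:k] against s[k:] over all k. Each split costs O(N^2) time,
    giving O(N^3) overall.
    """
    best = 0
    for k in range(1, len(s)):
        best = max(best, local_alignment_score(s[:k], s[k:], **scoring))
    return best


if __name__ == "__main__":
    print(best_nonoverlapping_score("abcabc"))
```

The sketch returns only the score. Recovering the two substrings themselves would require tracking where the best-scoring cell's alignment starts and ends.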

Supported in part by NSF grant CCR-9108969.

Supported in part by NLM grant LM-04960 and NSF grant CCR-9002351.


References

  1. A. Apostolico, M.J. Atallah, L.L. Larmore, and S. McFaddin, “Efficient Parallel Algorithms for String Editing and Related Problems,” SIAM J. Comput. 19 (1990), 968–988.

  2. V.I. Levenshtein, “Binary codes capable of correcting deletions, insertions and reversals,” Soviet Phys. Dokl. 10 (1966), 707.

  3. M.G. Main and R.J. Lorentz, “An O(n log n) algorithm for finding all repetitions in a string,” J. of Algorithms 5 (1984), 422–432.

  4. W. Miller, “An Algorithm for Locating a Repeated Region,” manuscript.

  5. E.W. Myers, “An O(ND) difference algorithm and its variants,” Algorithmica 1 (1986), 251–266.

  6. T.F. Smith and M.S. Waterman, “Identification of common molecular subsequences,” J. Mol. Biol. 147 (1981), 195–197.

  7. R.A. Wagner and M.J. Fischer, “The String-to-String Correction Problem,” J. of ACM 21 (1974), 168–173.



Editor information

Alberto Apostolico, Maxime Crochemore, Zvi Galil, Udi Manber


Copyright information

© 1993 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kannan, S.K., Myers, E.W. (1993). An algorithm for locating non-overlapping regions of maximum alignment score. In: Apostolico, A., Crochemore, M., Galil, Z., Manber, U. (eds) Combinatorial Pattern Matching. CPM 1993. Lecture Notes in Computer Science, vol 684. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0029798


  • DOI: https://doi.org/10.1007/BFb0029798


  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-56764-6

  • Online ISBN: 978-3-540-47732-7

  • eBook Packages: Springer Book Archive
