Abstract
In this paper we present an O(N 2log2 N) algorithm for finding the two non-overlapping substrings of a given string of length N which have the highest-scoring alignment between them. This significantly improves the previously best known bound of O(N 3) for the worst-case complexity of this problem. One of the central ideas in the design of this algorithm is that of partitioning a matrix into pieces in such a way that all submatrices of interest for this problem can be put together as the union of very few of these pieces. Other ideas include the use of candidate-lists, an application of the ideas of Apostolico et al. [1] to our problem domain, and divide and conquer techniques.
Supported in part by NSF grant CCR-9108969.
Supported in part by NLM grant LM-04960 and NSF grant CCR-9002351.
Preview
Unable to display preview. Download preview PDF.
References
A. Apostolico, M.J. Atallah, L.L. Larmore, and S. McFaddin, “Efficient Parallel Algorithms for String Editing and Related Problems,” SIAM J. Comput. 19 (1990), 968–988.
V.I. Levenshtein, “Binary codes of correcting deletions, insertions and reversals,” Soviet Phys. Dokl. 10 (1966), 707.
M.G. Main and R.J. Lorentz, “An O(nlog n) algorithm for finding all repetitions in a string,” J. of Algorithms 5 (1984), 422–432.
W. Miller, “An Algorithm for Locating a Repeated Region”, manuscript.
E.W. Myers, “An O(ND) difference algorithm and its variants,” Algorithmica 1 (1986), 251–266.
T.F. Smith and M.S. Waterman, “Identification of common molecular sequences,” J. Mol. Biol. 147 (1981), 195–197.
R.A. Wagner and M.J. Fischer, “The String-to-String Correction Problem,” J. of ACM 21 (1974), 168–173.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1993 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kannan, S.K., Myers, E.W. (1993). An algorithm for locating non-overlapping regions of maximum alignment score. In: Apostolico, A., Crochemore, M., Galil, Z., Manber, U. (eds) Combinatorial Pattern Matching. CPM 1993. Lecture Notes in Computer Science, vol 684. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0029798
Download citation
DOI: https://doi.org/10.1007/BFb0029798
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-56764-6
Online ISBN: 978-3-540-47732-7
eBook Packages: Springer Book Archive