All Semi-local Longest Common Subsequences in Subquadratic Time
For two strings a, b of lengths m, n respectively, the longest common subsequence (LCS) problem consists in comparing a and b by computing the length of their LCS. In this paper, we define a generalisation, called “the all semi-local LCS problem”, where each string is compared against all substrings of the other string, and all prefixes of each string are compared against all suffixes of the other string. An explicit representation of the output lengths is of size Θ((m+n)²). We show that the output can be represented implicitly by a geometric data structure of size O(m+n), allowing efficient queries of the individual output lengths. The currently best all string-substring LCS algorithm by Alves et al. can be adapted to produce the output in this form. We also develop the first all semi-local LCS algorithm, running in time o(mn) when m and n are reasonably close. Compared to a number of previous results, our approach presents an improvement in algorithm functionality, output representation efficiency, and/or running time.
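To make the problem statement concrete, the following is a minimal brute-force sketch in Python that enumerates the four families of comparisons described above. It is not the algorithm of this paper, nor that of Alves et al.: it simply reruns the textbook quadratic LCS dynamic program for every pair, and it stores every output length explicitly rather than in the implicit O(m+n) representation. The function names (`lcs_length`, `semi_local_lcs_naive`) and the dictionary layout are assumptions made for illustration only.

```python
from typing import Dict, Tuple

Table = Dict[Tuple[int, int], int]


def lcs_length(a: str, b: str) -> int:
    """Textbook dynamic program: LCS length in O(len(a) * len(b)) time."""
    prev = [0] * (len(b) + 1)
    for x in a:
        cur = [0]
        for j, y in enumerate(b, start=1):
            # Extend a match diagonally, otherwise carry the best neighbour.
            cur.append(prev[j - 1] + 1 if x == y else max(prev[j], cur[j - 1]))
        prev = cur
    return prev[-1]


def semi_local_lcs_naive(a: str, b: str) -> Dict[str, Table]:
    """Explicitly tabulate all semi-local LCS lengths by brute force.

    Hypothetical layout: each table maps an index pair (i, j) to a length.
    """
    m, n = len(a), len(b)
    return {
        # a against every substring b[i:j]
        "string_substring": {
            (i, j): lcs_length(a, b[i:j])
            for i in range(n + 1) for j in range(i, n + 1)
        },
        # every substring a[i:j] against b
        "substring_string": {
            (i, j): lcs_length(a[i:j], b)
            for i in range(m + 1) for j in range(i, m + 1)
        },
        # every prefix a[:i] against every suffix b[j:]
        "prefix_suffix": {
            (i, j): lcs_length(a[:i], b[j:])
            for i in range(m + 1) for j in range(n + 1)
        },
        # every suffix a[i:] against every prefix b[:j]
        "suffix_prefix": {
            (i, j): lcs_length(a[i:], b[:j])
            for i in range(m + 1) for j in range(n + 1)
        },
    }


if __name__ == "__main__":
    tables = semi_local_lcs_naive("ab", "ba")
    # The number of stored lengths grows quadratically in m + n.
    print(sum(len(t) for t in tables.values()))
```

The total number of entries across the four tables is quadratic in m+n, which matches the explicit output size stated above and shows why an implicit representation with efficient queries is attractive.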
Keywords: Score Matrix, Longest Common Subsequence, Output Length