A Faster Algorithm for RNA Co-folding
The current pairwise RNA (secondary) structural alignment algorithms are based on Sankoff’s dynamic programming algorithm from 1985. Sankoff’s algorithm requires O(N6) time and O(N4) space, where N denotes the length of the compared sequences, and thus its applicability is very limited. The current literature offers many heuristics for speeding up Sankoff’s alignment process, some making restrictive assumptions on the length or the shape of the RNA substructures. We show how to speed up Sankoff’s algorithm in practice via non-heuristic methods, without compromising optimality. Our analysis shows that the expected time complexity of the new algorithm is O(N4ζ(N)), where ζ(N) converges to O(N), assuming a standard polymer folding model which was supported by experimental analysis. Hence our algorithm speeds up Sankoff’s algorithm by a linear factor on average. In simulations, our algorithm speeds up computation by a factor of 3-12 for sequences of length 25-250.
Availability: Code and data sets are available, upon request.
Unable to display preview. Download preview PDF.
- 2.Crochemore, M., Landau, G.M., Schieber, B., Ziv-Ukelson, M.: Re-use dynamic programming for sequence alignment: An algorithmic toolkit, pp. 19–60 (2005)Google Scholar
- 9.Giancarlo, R.: Dynamic Programming: Special Cases. Oxford University Press, Oxford (1997)Google Scholar
- 12.Tinoco, I., Borer, P.N., Dengler, B., Levine, M.D., Uhlenbeck, O.C., Crothers, D.M., Gralla, J.: Improved estimation of secondary structure in ribonucleic acids. Nature New Biology 246, 40–41 (1973)Google Scholar
- 17.Kabakcioglu, A., Stella, A.L.: A scale-free network hidden in the collapsing polymer. ArXiv Condensed Matter e-prints (September 2004)Google Scholar
- 19.Mandal, M., Breaker, R.R.: Gene regulation by riboswitches. Cell 6, 451–463 (2004)Google Scholar
- 24.Vanderzande, C.: Lattice Models of Polymers (Cambridge Lecture Notes in Physics 11). Cambridge University Press, Cambridge (1998)Google Scholar