The Trouble with Long-Range Base Pairs in RNA Folding

  • Fabian Amman
  • Stephan H. Bernhart
  • Gero Doose
  • Ivo L. Hofacker
  • Jing Qin
  • Peter F. Stadler
  • Sebastian Will
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8213)

Abstract

RNA prediction has long been struggling with long-range base pairs since prediction accuracy decreases with base pair span. We analyze here the empirical distribution of base pair spans in large collection of experimentally known RNA structures. Surprisingly, we find that long-range base pairs are overrepresented in these data. In particular, there is no evidence that long-range base pairs are systematically overpredicted relative to short-range interactions in thermodynamic predictions. This casts doubt on a recent suggestion that kinetic effects are the cause of length-dependent decrease of predictability. Instead of a modification of the energy model we advocate a modification of the expected accuracy model for RNA secondary structures. We demonstrate that the inclusion of a span-dependent penalty leads to improved maximum expected accuracy structure predictions compared to both the standard MEA model and a modified folding algorithm with an energy penalty function. The prevalence of long-range base pairs provide further evidence that RNA structures in general do not have the so-called polymer zeta property. This has consequences for the asymptotic performance for a large class of sparsified RNA folding algorithms.

Keywords

RNA folding long-range base pair prediction accuracy polymer zeta property 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer International Publishing Switzerland 2013

Authors and Affiliations

  • Fabian Amman
    • 1
  • Stephan H. Bernhart
    • 1
  • Gero Doose
    • 1
    • 2
  • Ivo L. Hofacker
    • 3
    • 5
    • 8
  • Jing Qin
    • 1
    • 4
  • Peter F. Stadler
    • 1
    • 2
    • 3
    • 4
    • 5
    • 6
    • 7
  • Sebastian Will
    • 1
  1. 1.Dept. Computer Science, and Interdisciplinary Center for BioinformaticsUniv. LeipzigLeipzigGermany
  2. 2.LIFE, Leipzig Research Center for Civilization DiseasesUniversity LeipzigLeipzigGermany
  3. 3.Dept. Theoretical ChemistryUniv. ViennaWienAustria
  4. 4.MPI Mathematics in the SciencesLeipzigGermany
  5. 5.RTHUniv. CopenhagenDenmark
  6. 6.FHI Cell Therapy and ImmunologyLeipzigGermany
  7. 7.Santa Fe InstituteSanta FeUSA
  8. 8.Bioinformatics and Computational Biology research groupUniversity of ViennaViennaAustria

Personalised recommendations