Scientific and Statistical Database Management

Volume 5566 of the series Lecture Notes in Computer Science pp 200-216

Covariant Evolutionary Event Analysis for Base Interaction Prediction Using a Relational Database Management System for RNA

  • Weijia XuAffiliated withTexas Advanced Computing Center, The University of Texas at Austin
  • , Stuart OzerAffiliated withOne Microsoft Way Redmond
  • , Robin R. GutellAffiliated withCenter of Computational Biology and Bioinformatics, The University of Texas at Austin

* Final gross prices may vary according to local VAT.

Get Access


With an increasingly large amount of sequences properly aligned, comparative sequence analysis can accurately identify not only common structures formed by standard base pairing but also new types of structural elements and constraints. However, traditional methods are too computationally expensive to perform well on large scale alignment and less effective with the sequences from diversified phylogenetic classifications. We propose a new approach that utilizes coevolutional rates among pairs of nucleotide positions using phylogenetic and evolutionary relationships of the organisms of aligned sequences. With a novel data schema to manage relevant information within a relational database, our method, implemented with a Microsoft SQL Server 2005, showed 90% sensitivity in identifying base pair interactions among 16S ribosomal RNA sequences from Bacteria, at a scale 40 times bigger and 50% better sensitivity than a previous study. The results also indicated covariation signals for a few sets of cross-strand base stacking pairs in secondary structure helices, and other subtle constraints in the RNA structure.


Biological database Bioinformatics Sequence Analysis RNA