Mining RNA Structure Elements from the Structure Data of Protein-RNA Complexes

  • Daeho Lim
  • Kyungsook Han
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2843)


Mining biological data in databases has become the subject of increasing interest over the past several years. But most research of data mining in bioinformatics is limited to the sequence data of molecules. Biological sequences are easy to understand due to their sequential nature and have many well-developed algorithms to handle them since they can be treated as strings of characters. The structure of a molecule, on the other hand, is much more complex but plays an important role since it determines the biological function of the molecule. We have developed a set of algorithms to recognize all the secondary and tertiary structure elements of RNA from the three-dimensional atomic coordinates of protein-RNA complexes. Although there have been computational methods developed for assigning secondary structure elements in proteins, similar methods have not been developed for RNA, due in part to a small number of structure data available for RNA. Therefore, extracting secondary or tertiary structure elements of RNA depends on a significant amount of manual work. This is the first attempt to extracting RNA structure elements from the atomic coordinates in structure databases. The patterns in the structure elements discovered by the algorithms will provide us with useful information for predicting the structure of RNA binding protein.


Protein Data Bank Secondary Structure Element Mouse Mammary Tumor Virus Electronegative Atom Protein Data Bank File 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Kabsch, W., Sander, C.: Dictionary of protein secondary structure: Pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22, 2577–2637 (1983)CrossRefGoogle Scholar
  2. 2.
    Frishman, D., Argos, P.: Knowledge-based protein secondary structure assignment. Proteins 23, 566–579 (1995)CrossRefGoogle Scholar
  3. 3.
    Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N., Bourne, P.E.: The Protein Data Bank. Nucleic Acids Res. 28, 235–242 (2000)CrossRefGoogle Scholar
  4. 4.
    Tinoco Jr.: The RNA World. In: Gesteland, R.F., Atkins, J.F. (eds.), pp. 603–607. Cold Spring Harbor Laboratory Press (1993)Google Scholar
  5. 5.
    McDonald, I.K., Thornton, J.M.: Satisfying Hydrogen Bonding Potential in Proteins. J. Mol.Biol. 238, 777–793 (1994)CrossRefGoogle Scholar
  6. 6.
    Han, K., Byun, Y.: PseudoViewer2: visualization of RNA pseudoknots of any type. Nucleic Acids Res. 31, 3432–3440 (2003)CrossRefGoogle Scholar
  7. 7.
    Akmaev, V.R., Kelley, S.T., Stormo, G.D.: Phylogenetically enhanced statistical tools for RNA structure prediction. Bioinformatics 16, 501–512 (2000)CrossRefGoogle Scholar
  8. 8.
    Du, X., Wang, E.-D.: Tertiary structure base pairs between D- and TΨC-loops of Escherichia coli tRNA Leu play important roles in both aminoacylation and editing. Nucleic Acids Res. 31, 2865–2872 (2003)CrossRefGoogle Scholar
  9. 9.

Copyright information

© Springer-Verlag Berlin Heidelberg 2003

Authors and Affiliations

  • Daeho Lim
    • 1
  • Kyungsook Han
    • 1
  1. 1.School of Computer Science and EngineeringInha UniversityInchonKorea

Personalised recommendations