Detecting the Dependent Evolution of Biosequences

  • Jeremy Darot
  • Chen-Hsiang Yeang
  • David Haussler
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3909)


A probabilistic graphical model is developed to detect dependent evolution between different sites in biological sequences. Given a phylogenetic tree and a multiple sequence alignment for each molecule of interest, the model can predict potential interactions within or between nucleic acids and proteins. The model is initially validated on tRNA sequence data, where it accurately identifies the secondary structure of tRNA as well as several known tertiary interactions.
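To make the idea of dependent evolution between alignment sites concrete, the following is a minimal sketch of a much simpler, tree-free baseline: the mutual information between two columns of a multiple sequence alignment. This is not the probabilistic graphical model of the paper, which also takes a phylogenetic tree as input; the function name `mutual_information` and the toy alignment are illustrative assumptions.

```python
from collections import Counter
from math import log2


def mutual_information(alignment, i, j):
    """Mutual information (in bits) between alignment columns i and j.

    Hypothetical helper for illustration only: it ignores the phylogenetic
    tree, so shared ancestry can inflate the score on real data.
    """
    # Keep only sequences with no gap at either position.
    pairs = [(s[i], s[j]) for s in alignment if s[i] != "-" and s[j] != "-"]
    n = len(pairs)
    if n == 0:
        return 0.0
    pair_counts = Counter(pairs)
    left_counts = Counter(a for a, _ in pairs)
    right_counts = Counter(b for _, b in pairs)
    mi = 0.0
    for (a, b), c in pair_counts.items():
        p_ab = c / n
        p_a = left_counts[a] / n
        p_b = right_counts[b] / n
        mi += p_ab * log2(p_ab / (p_a * p_b))
    return mi


# Made-up toy alignment: columns 0 and 3 always form a Watson-Crick pair,
# while columns 1 and 2 are fully conserved.
toy_alignment = ["GAAC", "CAAG", "AAAU", "UAAA"]

covarying = mutual_information(toy_alignment, 0, 3)
conserved = mutual_information(toy_alignment, 1, 2)
```

In this toy case the covarying pair scores higher than the conserved pair, since conserved columns carry no covariation signal. A tree-aware model is needed to separate such signal from correlations caused purely by shared ancestry.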


Keywords (machine-generated, not supplied by the authors): Molecular Entity; Nucleotide Pair; Secondary Interaction; Probabilistic Graphical Model; tRNA Sequence





Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Jeremy Darot (2, 3)
  • Chen-Hsiang Yeang (1)
  • David Haussler (1)

  1. Center for Biomolecular Science and Engineering, UC Santa Cruz
  2. Department of Applied Mathematics and Theoretical Physics, University of Cambridge
  3. EMBL – European Bioinformatics Institute
