Journal of Molecular Evolution

, Volume 39, Issue 4, pp 418–430 | Cite as

The posterior probability distribution of alignments and its application to parameter estimation of evolutionary trees and to optimization of multiple alignments

  • L. Allison
  • C. S. Wallace
Article

Abstract

How to sample alignments from their posterior probability distribution given two strings is shown. This is extended to sampling alignments of more than two strings. The result is first applied to the estimation of the edges of a given evolutionary tree over several strings. Second, when used in conjunction with simulated annealing, it gives a stochastic search method for an optimal multiple alignment.

Key words

Alignment Estimation Evolutionary tree Gibbs sampling Inductive inference MonteCarlo method Multiple alignment Sampling Stochastic 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Allison L, Wallace CS (1994) An information measure for the string to string correction problem with applications. 17th Australian Comp. Sci. Conf, Christchurch, New Zealand, pp 659–668Google Scholar
  2. Allison L, Wallace CS, Yee CN (1992a) Finite-state models in the alignment of macro-molecules. J Mol Evol 35:77–89Google Scholar
  3. Allison L, Wallace CS, Yee CN (1992b) Minimum message length encoding, evolutionary trees and multiple alignment. 25th Hawaii Int. Conf. Sys. Sci. 1:663–674Google Scholar
  4. Bishop MJ, Thompson EA (1986) Maximum likelihood alignment of DNA sequences. J Mol Biol 190:159–165Google Scholar
  5. Duan W, Achen MG, Richardson SJ, Lawrence MC, Wettenhall REH, Jaworowski A, Schreiber G (1991) Isolation, characterisation, cDNA cloning and gene expression of an avian transthyretin: implications for the evolution of structure and function of transthyretin in vertebrates Eur J Biochem 200:679–687Google Scholar
  6. Felsenstein J (1981) Evolutionary trees from DNA sequences: a maximum likelihood approach. J Mol Evol 17:368–376Google Scholar
  7. Felsenstein J (1983) Inferring evolutionary trees from DNA sequences. In: Weir BS (ed) Statistical analysis of DNA sequence data. Marcel Dekker, pp 133–150Google Scholar
  8. Hastings WK (1970) Monte Carlo sampling methods using Markov chains and their applications. Biometrika 57:97–109Google Scholar
  9. Haussler D, Krogh A, Mian S, Sjolander K (1993) Protein modelling using hidden Markov Models: Analysis of globins. 26th Hawaii Int. Conf. Sys. Sci. 1:792–802Google Scholar
  10. Hirschberg DS (1975) A linear space algorithm for computing maximal common subsequences. Comm ACM 18(6):341–343Google Scholar
  11. Huggins AS, Bannam TL, Rood JI (1992) Comparative sequence analysis of the catB gene from Clostridium butyricum. Antimicro agents Chemother 36(11):2548–2551Google Scholar
  12. Ishikawa M, Toya T, Hoshida M, Nitta K, Ogiwara A, Kanehisa M (1992) Multiple sequence alignment by parallel simulated annealing. Institute for New Generation Computing (ICOT) TR-730Google Scholar
  13. Lawrence CE, Altschul SF, Bogushki MS, Liu JS, Neuwald AF, Wooton JC (1993) Detecting subtle sequence signals: a Gibbs sampling strategy for multiple alignment. Science 262:208–214Google Scholar
  14. Sankoff D, Cedergren RJ, Lapalme G (1976) Frequency of insertion-deletion, transversion, and transition in evolution of 5S ribosomal RNA. J Mol Evol 7:133–149Google Scholar
  15. Schreiber G, Aldred AR, Duan W (1992) Choroid plexus, brain protein-homeostasis and evoluation. Today's Life Science Sept: 22–28Google Scholar
  16. Thorne JL, Kishino H, Felsenstein J (1991) An evolutionary model for maximum likelihood alignment of DNA sequences. J Mol Evol 33:114–124Google Scholar
  17. Thorne JL, Kishino H, Felsenstein J (1992) Inching towards reality: an improved likelihood model of sequence evolution. J Mol Evol 34:3–16Google Scholar
  18. Wallace CS, Boulton DM (1968) An information measure for classification. Comp J 11(2):185–194Google Scholar
  19. Wallace CS, Freeman PR (1987) Estimation and inference by compact encoding. J R Stat Soc B 49:240–265Google Scholar
  20. Yee CN, Allison L (1993) Reconstruction of strings past. Comp Appl Biosci 9(1):1–7Google Scholar

Copyright information

© Springer-Verlag New York Inc 1994

Authors and Affiliations

  • L. Allison
    • 1
  • C. S. Wallace
    • 1
  1. 1.Department of Computer ScienceMonash UniversityAustralia

Personalised recommendations