Fractal and Dynamical Language Methods to Construct Phylogenetic Tree Based on Protein Sequences from Complete Genomes
The complete genomes of living organisms have provided much information on their phylogenetic relationships. In the past few years, we proposed three alternative methods to model the noise background in the composition vector of protein sequences from a complete genome. The first method is based on the frequencies of the 20 kinds of amino acids appearing in the genome and the multiplicative model. The second method is based on the iterated function system model in fractal geometry. The last method is based on the relationship between a word and its two sub-words in the theory of symbolic dynamics. Here we introduce these methods. The complete genomes of prokaryotes and eukaryotes are selected to test these algorithms. Our distance-based phylogenetic tree of prokaryotes and eukaryotes agrees with the biologists’ “tree of life” based on the 16S-like rRNA genes in a majority of basic branches and most lower taxa.
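As an illustration of the first method, the following is a minimal Python sketch, not the authors' implementation. The abstract gives no formulas, so every detail here is an assumption: the background probability of a K-string is taken to be the product of the genome-wide frequencies of its amino acids (one reading of the "multiplicative model"), the noise-corrected component is taken to be observed/expected minus 1, and the distance between two genomes is taken to be (1 - C)/2, where C is the cosine correlation of their vectors. The function names and the toy sequences are hypothetical.

```python
import math
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids
AMINO_SET = set(AMINO_ACIDS)


def kmer_frequencies(proteins, k):
    """Observed relative frequency of each K-string over all proteins."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - k + 1):
            word = seq[i:i + k]
            # Skip windows containing non-standard residue codes (e.g. X).
            if all(c in AMINO_SET for c in word):
                counts[word] += 1
    total = sum(counts.values()) or 1  # guard against empty input
    return {w: c / total for w, c in counts.items()}


def residue_frequencies(proteins):
    """Genome-wide frequency of each of the 20 amino acids."""
    counts = Counter(c for seq in proteins for c in seq if c in AMINO_SET)
    total = sum(counts.values()) or 1
    return {a: counts[a] / total for a in AMINO_ACIDS}


def composition_vector(proteins, k=3):
    """Noise-corrected composition vector (assumed form: p / p_expected - 1)."""
    observed = kmer_frequencies(proteins, k)
    single = residue_frequencies(proteins)
    vector = []
    for letters in product(AMINO_ACIDS, repeat=k):  # fixed order, 20**k words
        # Assumed multiplicative background: product of residue frequencies.
        expected = math.prod(single[c] for c in letters)
        p = observed.get("".join(letters), 0.0)
        vector.append(p / expected - 1.0 if expected > 0 else 0.0)
    return vector


def correlation_distance(u, v):
    """Assumed distance (1 - C) / 2, with C the cosine correlation of u and v."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    if norm_u == 0 or norm_v == 0:
        return 0.5  # no correlation information available
    return (1.0 - dot / (norm_u * norm_v)) / 2.0


# Toy "proteomes" (hypothetical sequences, for illustration only).
genome_a = ["MKVLAAGIVGLKMKTAYIAKQR", "MSDNELKRAAGIV"]
genome_b = ["MTTQAPWYFHCCDE", "MPLNRTWYFHEDS"]
vec_a = composition_vector(genome_a, k=2)
vec_b = composition_vector(genome_b, k=2)
dist_ab = correlation_distance(vec_a, vec_b)
```

A matrix of such pairwise distances could then be passed to a distance-based tree-building program to obtain the tree. The sketch enumerates all 20^K words, so it is practical only for small K.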
Keywords: Protein Sequence; Complete Genome; Iterated Function System; Multiplicative Model; Simple Correlation Analysis