Applied Microbiology and Biotechnology, Volume 65, Issue 2, pp 203–210

Phylogenetic analysis based on genome-scale metabolic pathway reaction content

Genomics and Proteomics


Phylogenetic classifications based on single genes such as rRNA genes do not provide a complete and accurate picture of evolution because they do not account for evolutionary leaps caused by gene transfer, duplication, deletion and functional replacement. Here, we present a whole-genome-scale phylogeny based on metabolic pathway reaction content. From the genome sequences of 42 microorganisms, we deduced the metabolic pathway reactions and used the relatedness of these contents to construct a phylogenetic tree that represents the similarity of metabolic profiles (relatedness) as well as the extent of metabolic pathway similarity (evolutionary distance). This method accounts for horizontal gene transfer and specific gene loss by comparison of whole metabolic subpathways, and allows evaluation of evolutionary relatedness and changes in metabolic pathways. Thus, a tree based on metabolic pathway content represents both the evolutionary time scale (changes in genetic content) and the evolutionary process (changes in metabolism).
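The idea of comparing organisms by their reaction content can be illustrated with a small sketch. The abstract does not state which similarity measure or tree-building algorithm was used, so the Jaccard distance and the average-linkage (UPGMA-style) clustering below are assumptions chosen for illustration, not the authors' method. The organism names and reaction identifiers are hypothetical toy data.

```python
from itertools import combinations


def jaccard_distance(a, b):
    """Return 1 - |A & B| / |A | B|; two empty sets count as identical."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def build_tree(reactions):
    """Average-linkage clustering over reaction sets.

    `reactions` maps an organism name to its set of reaction IDs.
    Returns the tree topology as a Newick-style string (no branch lengths).
    """
    # Each cluster label maps to the tuple of organisms it contains.
    clusters = {name: (name,) for name in reactions}
    # Pairwise distances between the original organisms (assumed metric).
    dist = {
        frozenset((x, y)): jaccard_distance(reactions[x], reactions[y])
        for x, y in combinations(reactions, 2)
    }

    def average_distance(c1, c2):
        pairs = [(x, y) for x in clusters[c1] for y in clusters[c2]]
        return sum(dist[frozenset(p)] for p in pairs) / len(pairs)

    # Repeatedly merge the two closest clusters until one remains.
    while len(clusters) > 1:
        c1, c2 = min(
            combinations(sorted(clusters), 2),
            key=lambda pair: average_distance(*pair),
        )
        merged = f"({c1},{c2})"
        clusters[merged] = clusters.pop(c1) + clusters.pop(c2)
    return next(iter(clusters)) + ";"


# Hypothetical toy data: three organisms and their reaction sets.
toy = {
    "org_A": {"R1", "R2", "R3", "R4"},
    "org_B": {"R1", "R2", "R3", "R5"},
    "org_C": {"R1", "R6", "R7", "R8"},
}
tree = build_tree(toy)
print(tree)
```

In this toy case org_A and org_B share three of five distinct reactions, so they are grouped first and org_C joins last. A real analysis would derive the reaction sets from annotated genomes and would likely use an established clustering or phylogenetics library rather than this hand-written loop.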



Copyright information

© Springer-Verlag 2004

Authors and Affiliations

  1. Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical & Biomolecular Engineering and BioProcess Engineering Research Center, Korea Advanced Institute of Science and Technology, Daejeon, South Korea
  2. Department of Biosystems and Bioinformatics Research Center, Korea Advanced Institute of Science and Technology, Daejeon, South Korea
