Abstract.
A heuristic approach to searching for the maximum-likelihood (ML) phylogenetic tree, based on a genetic algorithm (GA), has been developed. It outputs the best tree as well as multiple alternative trees that are not significantly worse than the best one under the likelihood criterion. These near-optimum trees are then subjected to further statistical tests. This approach enables one to infer phylogenetic trees of over 20 taxa, taking into account rate heterogeneity among sites, within a practical time on a PC cluster. Computer simulations were conducted to compare the efficiency of the present approach with that of several likelihood-based and distance-based methods, using amino acid sequence data for relatively large numbers (5–24) of taxa. The superiority of the ML method over distance-based methods increases as the simulation conditions become more realistic (that is, when an incorrect model is assumed or many taxa are involved). The approach was applied to the inference of the universal tree based on the concatenated amino acid sequences of vertically inherited genes that are shared among all genomes whose complete sequences have been reported. The inferred tree strongly supports the hypothesis that Archaea is paraphyletic and that Eukarya is specifically related to Crenarchaeota. Apart from the paraphyly of Archaea and some minor disagreements, the universal tree based on these genes is largely consistent with the universal tree based on SSU rRNA.
Received: 4 January 2001 / Accepted: 16 May 2001
Katoh, K., Kuma, Ki. & Miyata, T. Genetic Algorithm-Based Maximum-Likelihood Analysis for Molecular Phylogeny. J Mol Evol 53, 477–484 (2001). https://doi.org/10.1007/s002390010238
DOI: https://doi.org/10.1007/s002390010238