Journal of Molecular Evolution

, Volume 18, Issue 6, pp 387–404

Accuracy of estimated phylogenetic trees from molecular data

I. Distantly Related Species

Authors

  • Yoshio Tateno
    • Center for Demographic and Population GeneticsUniversity of Texas at Houston
  • Masatoshi Nei
    • Center for Demographic and Population GeneticsUniversity of Texas at Houston
  • Fumio Tajima
    • Center for Demographic and Population GeneticsUniversity of Texas at Houston
Article

DOI: 10.1007/BF01840887

Cite this article as:
Tateno, Y., Nei, M. & Tajima, F. J Mol Evol (1982) 18: 387. doi:10.1007/BF01840887

Summary

The accuracies and efficiencies of four different methods for constructing phylogenetic trees from molecular data were examined by using computer simulation. The methods examined are UPGMA, Fitch and Margoliash's (1967) (F/M) method, Farris' (1972) method, and the modified Farris method (Tateno, Nei, and Tajima, this paper). In the computer simulation, eight OTUs (32 OTUs in one case) were assumed to evolve according to a given model tree, and the evolutionary change of a sequence of 300 nucleotides was followed. The nucleotide substitution in this sequence was assumed to occur following the Poisson distribution, negative binomial distribution or a model of temporally varying rate. Estimates of nucleotide substitutions (genetic distances) were then computed for all pairs of the nucleotide sequences that were generated at the end of the evolution considered, and from these estimates a phylogenetic tree was reconstructed and compared with the true model tree. The results of this comparison indicate that when the coefficient of variation of branch length is large the Farris and modified Farris methods tend to be better than UPGMA and the F/M method for obtaining a good topology. For estimating the number of nucleotide substitutions for each branch of the tree, however, the modified Farris method shows a better performance than the Farris method. When the coefficient of variation of branch length is small, however, UPGMA shows the best performance among the four methods examined. Nevertheless, any tree-making method is likely to make errors in obtaining the correct topology with a high probability, unless all branch lengths of the true tree are sufficiently long. It is also shown that the agreement between patristic and observed genetic distances is not a good indicator of the goodness of the tree obtained.

Key words

Nucleotide substitutionGenetic distanceSpecies treeGene treeUPGMAFitch/Margoliash methodFarris methodModified Farris method

Copyright information

© Springer-Verlag 1982