Springer Nature is making SARS-CoV-2 and COVID-19 research free. View research | View latest news | Sign up for updates

Criteria for optimising phylogenetic trees and the problem of determining the root of a tree

  • 78 Accesses

  • 19 Citations

Summary

The process of determining the optimal phylogenetic tree from amino acid sequences or comparable data is divided into six stages. Particular attention is given both to the criteria that are used when testing for the optimal tree and the problem of determining the position of the original ancestor. Four types of criteria for evaluating the optimal tree are considered: 1. parsimony (fewest total changes), 2. path lengths from an ancestor to existing species, 3. subtracting the difference between each pair of species as measured on the tree and as compared directly with the data (−excess differences−), 4. Moore Residual Coefficient.

These criteria are examined on a set of test data and some of the reasons for the differences among them are discussed. For example, the −average percent standard deviation− weights excess differences unequally in inverse proportion to the square of the observed differences. The Moore Residual Coefficient and the −excess differences− will not necessarily give a value of zero when there are no duplicated changes unless there can only be two states for each character (i.e. binary data). The path length and difference criteria (as well as the Moore Residual Coefficient) give unequal weighting to the individual branches of the tree by counting some branches more times than others. Particularly because of this some criteria will reject trees that are equally parsimonious and the criteria are said to be invalid. However the criterion of parsimony is insensitive in that it can give the same value for several basic networks and it does not specify the position of the original ancestor, the root of the tree. The importance is emphasised of stating a model and examining its predictions before a criterion is chosen to select the best network.

The number of rooted trees that can be derived from a basic network (or unrooted tree) is described in relation to how detailed a description of the original ancestor is required. Four methods are described for determining the position of the root of the tree or original ancestor. Each method depends upon some additional information to that used in constructing the basic network and the method chosen will depend on this additional knowledge.

This is a preview of subscription content, log in to check access.

References

  1. Boulter, D., Ramshaw, J.A.M., Thompson, E.W., Richardson, M., Brown, R.H. (1972). Proc.Roy.Soc.London B 181, 441

  2. Brown, R.H., Richardson, M., Boulter, D., Ramshaw, J.A.M., Jefferies, R.P.S. (1972). Biochem.J. 128. 971

  3. Camin, J.H., Sokal, R.R. (1965). Evolution 19, 311

  4. Dayhoff, M.O. (1972). Atlas of protein sequence and structure, Vol. 5. Washington: National Biomedical Research Foundation

  5. Dickerson, R.E. (1971). J.Mol.Evol. 1, 26

  6. Estabrook, G.F. (1968). J.Theoret.Biol. 21, 423

  7. Farris, J.S. (1973). System.Zool. 22, 250

  8. Felsenstein, J. (1973). System.Zool. 22, 240

  9. Fitch, W.M. (1971). System.Zool. 20, 406

  10. Fitch, W.M. (1971b). J.Mol.Evol. 1, 84

  11. Fitch, W.M. (1973). J.Mol.Evol. 2, 223

  12. Fitch, W.M., Margoliash, E. (1967). Science 155, 279

  13. Goodman, M. (1972). System.Zool. 21, 174

  14. Goodman, M., Moore, G.W., Barnabas, J. (1974). J.Mol.Evol. 3, 1

  15. Hartigan, J.A. (1973). Biometrics 29, 53

  16. Holmquist, R. (1972). J.Mol.Evol. 1, 211

  17. Kimura, M., Ohta, R. (1971). J.Mol.Evol. 1, 1

  18. Langley, C.H., Fitch, W.M. (1974). J.Mol.Evol. 3, 161

  19. Lundberg, J.G. (1972). System.Zool. 21, 398

  20. Manischewitz, J.R. (1973). System.Zool. 22, 176

  21. Moore, G.W., Goodman, M., Barnabas, J. (1973). J.Theoret.Biol. 38, 423

  22. Penny, D. (1974). J.Mol.Evol. 3, 179

  23. Ramshaw, J.A.M., Richardson, D.L., Meatyard, B.T., Brown, R.H., Richardson, M., Thompson, E.W., Boulter, D. (1972). New Phytol. 71, 773

  24. Rohlf, F.J. (1972). System.Zool. 21, 271

  25. Romer, A.S. (1966). Vertebrate paleontology, 3rd Ed. Chicago: University of Chicago Press

  26. Sneath, P.H.A., Sokal, R.R. (1973). Numerical taxonomy. San Francisco: Freeman

  27. Sokal, R.R. (1974). Science 185, 1115

  28. Sokal, R.R., Sneath, P.H.A. (1963). Principles of numerical taxonomy. San Francisco: Freeman

  29. Strydom, D.J., van der Walt, S.J., Botes, D.P. (1972). Comp.Biochem.Physiol. 43B, 21

Download references

Author information

Rights and permissions

Reprints and Permissions

About this article

Cite this article

Penny, D. Criteria for optimising phylogenetic trees and the problem of determining the root of a tree. J Mol Evol 8, 95–116 (1976). https://doi.org/10.1007/BF01739097

Download citation

Key words

  • Molecular Evolution
  • Phylogenetic Tree
  • Numerical Taxonomy
  • Networks
  • Cytochrome c