Skip to main content

Advertisement

Log in

Most Parsimonious Likelihood Exhibits Multiple Optima for Compatible Characters

  • Original Article
  • Published:
Bulletin of Mathematical Biology Aims and scope Submit manuscript

Abstract

Maximum likelihood estimators are a popular method for scoring phylogenetic trees to best explain the evolutionary histories of biomolecular sequences. In 1994, Steel showed that, given an incompatible set of binary characters and a fixed tree topology, there exist multiple sets of branch lengths that are optima of the maximum average likelihood estimator. Since parsimony techniques—another popular method of scoring evolutionary trees—tend to exhibit favorable behavior on data compatible with the tree, Steel asked if the same is true for likelihood estimators, or if multiple optima can occur for compatible sequences. We show that, despite exhibiting behavior similar to parsimony, multiple local optima can occur for compatible characters for the most parsimonious likelihood estimator. We caution that thorough understanding of likelihood criteria is necessary before they are used to analyze biological data.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
EUR 32.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or Ebook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

Notes

  1. Our goal with the use of the term “sequence” is to call back to the scientific process of obtaining observations in a continuous setting. We expect likelihood estimates to change as more samples are discovered and characterized. We do not use the term to refer to unaligned DNA sequences for a single taxon.

  2. Note that \(I_{{\mathrm{MP}}}\) yields the first lump given in Table 1.

References

  • Barry D, Hartigan J (1987) Statistical analysis of hominoid molecular evolution. Stat Sci 2:191–207

    Article  MathSciNet  Google Scholar 

  • Bininda-Emonds ORP, Gittleman JL, Steel MA (2002) The (super) tree of life. Ann Rev Ecol Syst 33:265–89

    Article  Google Scholar 

  • Charleston MA, Perkins SL (2003) Lizards, malaria, and jungles in the caribbean. In: Page RD (ed) Tangled trees: phylogeny, cospeciation, and coevolution. University of Chicago Press, Chicago, pp 65–92

    Google Scholar 

  • Chor B, Hendy MD, Holland BR, Penny D (2000) Multiple maxima of likelihood in phylogenetic trees: an analytic approach. Mol Biol Evol 17(10):1529–1541

    Article  Google Scholar 

  • Foulds LR, Graham RL (1982) The Steiner problem in phylogeny is NP-complete. Adv Appl Math 3(1):43–49

    Article  MathSciNet  Google Scholar 

  • Foundation PS (2010) Python language reference, version 2.7. http://www.python.org. Accessed 29 Apr 2019

  • Fukami K, Tateno Y (1989) On the maximum likelihood method for estimating molecular trees: uniqueness of the likelihood point. J Mol Evol 28(5):460–464

    Article  Google Scholar 

  • Gusfield D (1991) Efficient algorithms for inferring evolutionary trees. Networks 21(1):19–28

    Article  MathSciNet  Google Scholar 

  • Hillis DM, Mable BK, Moritz C (1996) Molecular systematics. Sinauer Associates, Sunderland

    Google Scholar 

  • Janies DA, Treseder T, Alexandrov B, Habibb F, Chen JJ, Ferreira R, Catalyurek U, Varon A, Wheeler WC (2011) The supramap project: linking pathogen genomes with geography to fight emergent infectious diseases. Cladistics 27:61–66

    Article  Google Scholar 

  • Roch S (2006) A short proof that phylogenetic tree reconstruction by maximum likelihood is hard. IEEE/ACM Trans Comput Biol Bioinform 3(1):92–94

    Article  Google Scholar 

  • Semple C, Steel M (2003) Phylogenetics. Oxford lecture series in mathematics and its applications, vol 24. Oxford University Press, Oxford

    Google Scholar 

  • Steel M (2011) The penny ante challenge problems: open problems from the New Zealand phylogenetics meetings. www.math.canterbury.ac.nz/bio/events/south2012/files/penny_ante_problems.pdf. Accessed 8 Aug 2019

  • Steel MA (1994) The maximum likelihood point for a phylogenetic tree is not unique. Syst Biol 43(4):560–564

    Article  Google Scholar 

  • Steel M, Penny D (2000) Parsimony, likelihood, and the role of models in molecular phylogenetics. Mol Biol Evol 17(6):839–850

    Article  Google Scholar 

  • Stein W et al (2015) Sage mathematics software (version 6.6). The Sage Development Team. http://www.sagemath.org. Accessed 8 Aug 2019

  • Stewart J (2005) Multivariable calculus: concepts and contexts. Brooks/Cole, Pacific Grove ISBN 0-534-41004-9

    Google Scholar 

  • Tuffley C, Steel M (1997) Links between maximum likelihood and maximum parsimony under a simple model of site substitution. Bull Math Biol 59(3):581–607

    Article  Google Scholar 

Download references

Acknowledgements

We would like to thank Dan Gusfield, Rob Gysel, Mike Steel, and Ward Wheeler for helpful comments and conversations. This work was partially supported by a grant from the Simons Foundation to KS.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Julia Matsieva.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Matsieva, J., St. John, K. Most Parsimonious Likelihood Exhibits Multiple Optima for Compatible Characters. Bull Math Biol 82, 10 (2020). https://doi.org/10.1007/s11538-019-00689-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11538-019-00689-8

Keywords

Navigation