Advertisement

Journal of Mathematical Biology

, Volume 77, Issue 3, pp 527–544 | Cite as

On the information content of discrete phylogenetic characters

  • Magnus Bordewich
  • Ina Maria Deutschmann
  • Mareike Fischer
  • Elisa Kasbohm
  • Charles Semple
  • Mike Steel
Article

Abstract

Phylogenetic inference aims to reconstruct the evolutionary relationships of different species based on genetic (or other) data. Discrete characters are a particular type of data, which contain information on how the species should be grouped together. However, it has long been known that some characters contain more information than others. For instance, a character that assigns the same state to each species groups all of them together and so provides no insight into the relationships of the species considered. At the other extreme, a character that assigns a different state to each species also conveys no phylogenetic signal. In this manuscript, we study a natural combinatorial measure of the information content of an individual character and analyse properties of characters that provide the maximum phylogenetic information, particularly, the number of states such a character uses and how the different states have to be distributed among the species or taxa of the phylogenetic tree.

Keywords

Phylogeny Character Information content Convexity 

Mathematics Subject Classification

05C05 (Trees) 05C30 (Enumeration in graph theory) 92D15 (Problems related to evolution) 

Notes

Acknowledgements

We thank the two anonymous reviewers for several helpful comments on an earlier version of this paper. I.D. and E.K. thank the International Office at the University of Greifswald and the German Academic Exchange Service (DAAD) for the support through the mobility program PROMOS (travel scholarship). We also thank the (former) Allan Wilson Centre for supporting this research.

References

  1. Bandelt H, Fischer M (2008) Perfectly misleading distances from ternary characters. Syst Biol 57(4):540–543CrossRefGoogle Scholar
  2. Bordewich M, Semple C, Steel M (2006) Identifying X-trees with few characters. Electron J Comb 13:R83MathSciNetzbMATHGoogle Scholar
  3. Carter M, Hendy M, Penny D, Széley L, Wormald N (1990) On the distribution of lengths of evolutionary trees. SIAM J Discrete Math 3(1):38–47MathSciNetCrossRefzbMATHGoogle Scholar
  4. Huber K, Moulton V, Steel M (2005) Four characters suffice to convexly define a phylogenetic tree. SIAM J Discrete Math 18(1):835–843MathSciNetCrossRefzbMATHGoogle Scholar
  5. Maddison D, Schulz KS, Maddison W (2007) The tree of life web project. In: Zhang ZQ, Shear W (eds) Linnaeus tercentenary: progress in invertebrate taxonomy, vol 1668. Zootaxa, Auckland, pp 19–40Google Scholar
  6. McDiarmid C, Semple C, Welsh D (2015) Counting phylogenetic networks. SIAM J Discrete Math 19:205–224MathSciNetzbMATHGoogle Scholar
  7. Schütz A (2016) Der Informationsgehalt von \(r\)-Zustands-Charactern. Bachelor’s thesis, Greifswald University, GermanyGoogle Scholar
  8. Semple C, Steel M (2003) Phylogenetics. Oxford University Press, OxfordzbMATHGoogle Scholar
  9. Sloane N (2010) The on-line encyclopedia of integer sequences. http://oeis.org
  10. Steel M, Penny D (2005) Maximum parsimony and the phylogenetic information in multi-state characters. In: Albert V (ed) Parsimony, phylogeny and genomics. Oxford University Press, OxfordGoogle Scholar
  11. Townsend J (2007) Profiling phylogenetic informativeness. Syst Biol 56:222–231CrossRefGoogle Scholar

Copyright information

© Springer-Verlag GmbH Germany, part of Springer Nature 2017

Authors and Affiliations

  • Magnus Bordewich
    • 1
  • Ina Maria Deutschmann
    • 2
  • Mareike Fischer
    • 2
  • Elisa Kasbohm
    • 2
  • Charles Semple
    • 3
  • Mike Steel
    • 3
  1. 1.Science Laboratories, School of Engineering and Computing SciencesUniversity of DurhamDurhamUK
  2. 2.Institute of Mathematics and Computer ScienceErnst-Moritz-Arndt-University GreifswaldGreifswaldGermany
  3. 3.School of Mathematics and StatisticsUniversity of CanterburyChristchurchNew Zealand

Personalised recommendations