Skip to main content

Constructing rooted supertrees using distances

Abstract

Suppose that a family of rooted phylogenetic trees T i with different sets X i of leaves is given. A supertree for the family is a single rooted tree T whose leaf set is the union of all the X i , such that the branching information in T corresponds to the branching information in all the trees T i . This paper proposes a polynomial-time method BUILD-WITH-DISTANCES that makes essential use of distance information provided by the trees T i to construct a rooted tree S 0. When a supertree also containing the distance information exists, then S 0 is a supertree. The supertree S 0 often shows increased resolution over the trees found by methods that utilize only the topology of the input trees. When no supertree exists because the input trees are incompatible, several variants of the method are described which still produce trees with provable properties.

This is a preview of subscription content, access via your institution.

References

  • Aho, A. V., Y. Sagiv, T. G. Szymanski and J. D. Ullman (1981). Inferring a tree from lowest common ancestors with an application to the optimization of relational expressions. SIAM J. Comput. 10, 405–421.

    MathSciNet  Article  Google Scholar 

  • Baum, B. R. (1992). Combining trees as a way of combining data sets for phylogenetic inference, and the desirability of combining gene trees. Taxon 41, 3–10.

    Article  Google Scholar 

  • Bininda-Emonds, O., J. L. Gittleman and M. Steel (2002). The (super)tree of life: procedures, problems, and prospects. Annu. Rev. Ecol. Syst. 33, 265–289.

    Article  Google Scholar 

  • Bininda-Emonds, O. and M. J. Sanderson (2001). Assessment of the accuracy of matrix representation with parsimony analysis supertree construction. Syst. Biol. 50, 565–579.

    Article  Google Scholar 

  • Bryant, D., C. Semple and M. Steel (2004). Supertree methods for ancestral dates and other applications, in Phylogenetic Supertrees: Combining Information to Reveal the Tree of Life, O. R. P. Bininda-Emonds (Ed.), Dordrecht, The Netherlands: Kluwer Academic, pp. 129–150.

    Google Scholar 

  • Colonius, H. and H. H. Schulze (1981). Tree structures for proximity data. British J. Math. Statist. Psych. 34, 167–180.

    MathSciNet  Google Scholar 

  • Dress, W. M. and P. L. Erdös (2003). X-trees and weighted quartet systems. Ann. Comb. 7, 155–169.

    MathSciNet  Article  Google Scholar 

  • Felsenstein, J. (1993). PHYLIP (Phylogeny Inference Package) version 3.5c. Distributed by the author. Department of Genetics, University of Washington, Seattle.

    Google Scholar 

  • Fitch, W. M. and E. Margoliash (1967). Construction of phylogenetic trees. Science 155, 279–284.

    Google Scholar 

  • Hartigan, J. A. (1967). Representation of similarity matrics by trees. J. Am. Stat. Assoc. 62, 1140–1158.

    MathSciNet  Article  Google Scholar 

  • Hubert, L. (1972). Some extensions of Johnson’s hierarchical clustering algorithms. Psychometrika 37, 261–274.

    MATH  MathSciNet  Article  Google Scholar 

  • Hubert, L. (1973). Monotone invariant clustering procedures. Psychometrika 38, 47–62.

    MATH  Article  Google Scholar 

  • Janowitz, M. F., F.-J. Lapointe, F. R. McMorris, B. Mirkin and F. S. Roberts (Eds) (2003). Bioconsensus: DIMACS Working Group Meetings on Bioconsensus, October 25–26, 2000 and October 2–5, 2001, DIMACS Series in Discrete Mathematics and Theoretical Computer Science 61, American Mathematical Society.

  • Johnson, S. C. (1967). Hierarchical clustering schemes. Psychometrika 32, 241–254.

    Article  Google Scholar 

  • Lapointe, F.-J. and G. Cucumel (1997). The average consensus procedure: combination of weighted trees containing identical or overlapping sets of taxa. Syst. Biol. 46, 306–312.

    Google Scholar 

  • Lapointe, F.-J. and C. Levasseur (2004). Everything you always wanted to know about the average consensus, and more, in Phylogenetic Supertrees: Combining Information to Reveal the Tree of Life, O. R. P. Bininda-Emonds (Ed.), Dordrecht, The Netherlands: Kluwer Academic, pp. 87–105.

    Google Scholar 

  • Lapointe, F.-J., M. Wilkinson and D. Bryant (2003). Matrix representationswith parsimony or with distances: two sides of the same coin? Syst. Biol. 52, 865–868.

    Article  Google Scholar 

  • Page, R. D. M. (2003). Modified mincut supertrees, in Algorithms in Bioinformatics, Proceedings of the Second International Workshop, WABI 2002, Rome, Italy, September 17–21, 2002, R. Guig and D. Gusfield (Eds), Lecture Notes in Computer Science 2452, Berlin: Springer, pp. 537–552.

    Google Scholar 

  • Ragan, M. A. (1992). Phylogenetic inference based on matrix representation of trees. Mol. Phylogenet. Evol. 1, 53–58.

    Article  Google Scholar 

  • Sanderson, M. J., A. Purvis and C. Henze (1998). Phylogenetic supertrees: assembling the trees of life. Trends Ecol. Evol. 13, 105–109.

    Article  Google Scholar 

  • Semple, C. and M. Steel (2000). A supertree method for rooted trees. Discrete Appl. Math. 105, 147–158.

    MathSciNet  Article  Google Scholar 

  • Semple, C. and M. Steel (2003). Phylogenetics, Oxford: Oxford University Press.

    Google Scholar 

  • Swofford, D. L. (2002). PAUP*. Phylogenetic Analysis Using Parsimony (*and Other Methods), Version 4, Sunderland, MA: Sinauer Associates.

    Google Scholar 

  • Swofford, D. L., G. J. Olsen, P. J. Waddell and D. M. Hillis (1996). Phylogenetic inference, in Molecular Systematics, 2nd edn, D. Hillis, C. Moritz and B. Mable (Eds), Sunderland, MA: Sinauer, pp. 407–514.

    Google Scholar 

  • Thorley, J. L. and R. D. M. Page (2000). RADCON: phylogenetic tree comparison and consensus. Bioinformatics 16, 486–487.

    Article  Google Scholar 

  • Wilkinson, M., F.-J. Lapointe and D. J. Gower (2003). Branch lengths and support. Syst. Biol. 52, 127–130.

    Google Scholar 

Download references

Author information

Affiliations

Authors

Rights and permissions

Reprints and Permissions

About this article

Cite this article

Willson, S.J. Constructing rooted supertrees using distances. Bull. Math. Biol. 66, 1755–1783 (2004). https://doi.org/10.1016/j.bulm.2004.04.006

Download citation

  • Received:

  • Accepted:

  • Issue Date:

  • DOI: https://doi.org/10.1016/j.bulm.2004.04.006

Keywords

  • Polynomial Time
  • Branch Length
  • Rooted Tree
  • Support Function
  • Threshold Tree