Abstract
A Maximum Agreement SubTree (MAST) is a largest subtree common to a set of trees and serves as a summary of the substructure those trees share. A single MAST can be misleading, however, since a tree set can have an exponential number of MASTs, and two MASTs for the same tree set need not share any leaves. In this paper we introduce the notion of the Kernel Agreement SubTree (KAST), which summarizes the substructure common to all MASTs, and show that it can be calculated in polynomial time (for trees with bounded degree). Suppose the input trees represent competing hypotheses for a particular phylogeny. We show the utility of the KAST both as a method to discern the common structure in which we can be confident, and as a measure of how confident we are in a given tree set.
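To make the definitions concrete, here is a minimal brute-force sketch for very small inputs. It is not the polynomial-time algorithm the abstract refers to: it tries every leaf subset and so takes exponential time. It assumes rooted trees written as nested Python tuples with string leaf labels, and it assumes the kernel can be read as the set of leaves that every MAST retains. The function names (`leaves`, `restrict`, `all_masts`, `kernel_leaves`) are hypothetical and chosen for illustration only.

```python
from itertools import combinations


def leaves(tree):
    """Return the set of leaf labels of a nested-tuple rooted tree."""
    if not isinstance(tree, tuple):
        return {tree}
    return set().union(*(leaves(child) for child in tree))


def restrict(tree, keep):
    """Restrict `tree` to the leaves in `keep`, suppressing unary nodes.

    The result is a canonical, order-independent form: a leaf label, a
    frozenset of restricted children, or None if no kept leaf remains.
    """
    if not isinstance(tree, tuple):
        return tree if tree in keep else None
    kids = [r for r in (restrict(child, keep) for child in tree) if r is not None]
    if not kids:
        return None
    if len(kids) == 1:
        return kids[0]
    return frozenset(kids)


def all_masts(trees):
    """Return the leaf sets of all maximum agreement subtrees (exhaustive search)."""
    common = sorted(set.intersection(*(leaves(t) for t in trees)))
    for size in range(len(common), 0, -1):
        found = [
            set(subset)
            for subset in combinations(common, size)
            # A subset is an agreement set if every tree restricts to the same shape.
            if len({restrict(t, set(subset)) for t in trees}) == 1
        ]
        if found:
            return found
    return []


def kernel_leaves(trees):
    """Leaves present in every MAST (assumed reading of the kernel)."""
    masts = all_masts(trees)
    return set.intersection(*masts) if masts else set()


# Two trees whose MASTs are all of size two; some pairs of MASTs share no leaf.
conflicting = [(("a", "b"), ("c", "d")), (("a", "c"), ("b", "d"))]
# Two trees that disagree only on the placement of c, d and e.
partly_agreeing = [
    (("a", "b"), ("c", ("d", "e"))),
    (("a", "b"), ("d", ("c", "e"))),
]

print(all_masts(conflicting))
print(kernel_leaves(conflicting))
print(kernel_leaves(partly_agreeing))
```

In the first tree set every pair of leaves is a MAST, so {a, b} and {c, d} are both maximum yet disjoint, and the leaves common to all MASTs form an empty set. In the second set the three MASTs each drop one of c, d, or e, and only a and b survive in all of them.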
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Swenson, K.M., Chen, E., Pattengale, N.D., Sankoff, D. (2011). The Kernel of Maximum Agreement Subtrees. In: Chen, J., Wang, J., Zelikovsky, A. (eds) Bioinformatics Research and Applications. ISBRA 2011. Lecture Notes in Computer Science, vol 6674. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-21260-4_15
DOI: https://doi.org/10.1007/978-3-642-21260-4_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-21259-8
Online ISBN: 978-3-642-21260-4
eBook Packages: Computer Science, Computer Science (R0)