Fast algorithm to reconstruct a species supertree from a set of protein trees
- 45 Downloads
The problem of reconstructing a species supertree from a given set of protein, gene, and regulatorysite trees is the subject of this study. Under the traditional formulation, this problem is proven to be NP-hard. We propose a reformulation: to seek for a supertree, most of the clades of which contribute to the original protein trees. In such a variant, the problem seems to be biologically natural and a fast algorithm can be developed for its solution. The algorithm was tested on artificial and biological sets of protein trees, and it proved to be efficient even under the assumption of horizontal gene transfer. When horizontal transfer is not allowed, the algorithm correctness is proved mathematically; the time necessary for repeating the algorithm is assessed, and, in the worst case scenario, it is of the order n 3 · |V 0|3, where n is the number of gene trees and |V 0| is the number of tree species. Our software for supertree construction, examples of computations, and instructions can be freely accessed at http://lab6.iitp.ru/ru/super3gl/. Events associated with horizontal gene transfer are not included either in this study or in any variant of the software. A general case is described in the authors’ report (journal Problems of Information Transmission, 2011).
Keywordsspecies tree species supertree new formulation of the problem of supertree reconstruction; fast algorithm to reconstruct a supertree generation of a gene set from a supertree modeling the gene evolution along a species tree
Unable to display preview. Download preview PDF.
- 2.Phylogenetic Supertrees. Combining Information to Reveal the Tree of Life. 2004. Ed. Bininda-Emonds O.R.P. Dordrecht: Kluwer.Google Scholar
- 4.Lyubetsky V.A., Gorbunov K.Yu., Rusin L.Y., V’yugin V.V. 2006. Algorithms to reconstruct evolutionary events at molecular level and infer species phylogeny. In: Bioinformatics of Genome Regulation and Structure II. Eds. Kolchanov N., Hofestaedt R., Milanesi L. Springer, pp. 189–204.Google Scholar
- 6.Gorbunov K.Yu., Lyubetsky V.A. 2010. An algorithm of reconciliation of gene and species trees and inferring gene duplications, losses and horizontal transfers. Inform. Protsesses. 10, 140–144.Google Scholar