Abstract
Gene tree/species tree reconciliation is a recent decisive progress in phylogenetic methods, accounting for the possible differences between gene histories and species histories. Reconciliation consists in explaining these differences by gene-scale events such as duplication, loss, transfer, which translates mathematically into a mapping between gene tree nodes and species tree nodes or branches. Gene conversion is a frequent and important evolutionary event, which results in the replacement of a gene by a copy of another from the same species and in the same gene tree. Including this event in reconciliation models has never been attempted because it introduces a dependency between lineages, and standard algorithms based on dynamic programming become ineffective. We propose here a novel mathematical framework including gene conversion as an evolutionary event in gene tree/species tree reconciliation. We describe a randomized algorithm that finds, in polynomial running time, a reconciliation minimizing the number of duplications, losses and conversions in the case when their weights are equal. We show that the space of optimal reconciliations includes an analog of the last common ancestor reconciliation, but is not limited to it. Our algorithm outputs any optimal reconciliation with a non-null probability. We argue that this study opens a research avenue on including gene conversion in reconciliation, and discuss its possible importance in biology.
Similar content being viewed by others
References
Arvestad L, Berglund AC, Lagergren J, Sennblad B (2004) Gene tree reconstruction and orthology analysis based on an integrated model for duplications and sequence evolution. In: Proceedings of the fifth annual international conference on computational molecular biology—RECOMB ’04. ACM Press, New York, pp 326–335. https://doi.org/10.1145/974614.974657
Bansal MS, Alm EJ, Kellis M (2012) Efficient algorithms for the reconciliation problem with gene duplication, horizontal transfer and loss. Bioinformatics 28(12):283–291. https://doi.org/10.1093/bioinformatics/bts225
Bansal MS, Alm EJ, Kellis M (2013) Reconciliation revisited: Handling multiple optima when reconciling with duplication, transfer, and loss. J Comput Biol 20(10):738–754. https://doi.org/10.1089/cmb.2013.0073
Bellott DW, Hughes JF, Skaletsky H, Brown LG, Pyntikova T, Tj Cho, Koutseva N, Zaghlul S, Graves T, Rock S, Kremitzki C, Fulton RS, Dugan S, Ding Y, Morton D, Khan Z, Lewis L, Buhay C, Wang Q, Watt J, Holder M, Lee S, Nazareth L, Rozen S, Muzny DM, Warren WC, Gibbs RA, Wilson RK, Page DC (2014) Mammalian Y chromosomes retain widely expressed dosage-sensitive regulators. Nature 508(7497):494–499. https://doi.org/10.1038/nature13206
Boni MF, Posada D, Feldman MW (2007) An exact nonparametric method for inferring mosaic structure in sequence triplets. Genetics 176(2):1035–1047. https://doi.org/10.1534/genetics.106.068874
Bourgon R, Delorenzi M, Sargeant T, Hodder AN, Crabb BS, Speed TP (2004) The serine repeat antigen (SERA) gene family phylogeny in Plasmodium: the impact of GC content and reconciliation of gene and species trees. Mol Biol Evol 21(11):2161–2171. https://doi.org/10.1093/molbev/msh228
Boussau B, Szöllősi GJ, Duret L, Gouy M, Tannier E, Daubin V (2013) Genome-scale coestimation of species and gene trees. Genome Res 23:323–330. https://doi.org/10.1101/gr.141978.112
Brooks DR, Ferrao AL (2005) The historical biogeography of co-evolution: emerging infectious diseases are evolutionary accidents waiting to happen. J Biogeogr 32(8):1291–1299. https://doi.org/10.1111/j.1365-2699.2005.01315.x
Casola C, Conant GC, Hahn MW (2012) Very low rate of gene conversion in the yeast genome. Mol Biol Evol 29(12):3817–3826. https://doi.org/10.1093/molbev/mss192
Chauve C, El-Mabrouk N (2009) New perspectives on gene family evolution: losses in reconciliation and a link with supertrees. In: Batzoglou S (ed) Research in computational molecular biology. Springer, Berlin, pp 46–58. https://doi.org/10.1007/978-3-642-02008-7_4
Chauve C, Doyon JP, El-Mabrouk N (2008) Gene family evolution by duplication, speciation, and loss. J Comput Biol 15(8):1043–1062. https://doi.org/10.1089/cmb.2008.0054
Chauve C, Rafiey A, Davín A, Scornavacca C, Veber P, Boussau B, Szollosi G, Daubin V, Tannier E (2017) Maxtic: fast ranking of a phylogenetic tree by maximum time consistency with lateral gene transfers. Recommended by PCI Evol Biol
Chen JM, Cooper DN, Chuzhanova N, Frec C, Patrinos GP (2007) Gene conversion: mechanisms, evolution and human disease. Nat Rev Genet 8:762–775. https://doi.org/10.1038/nrg2193
Dasgupta B, Ferrarini S, Gopalakrishnan U, Paryani NR (2006) Inapproximability results for the lateral gene transfer problem. J Comb Optim 11(4):387–405. https://doi.org/10.1007/s10878-006-8212-8
Davín AA, Tannier E, Williams TA, Boussau B, Daubin V, Szöllosi GJ (2018) Gene transfers can date the tree of life. Nat Ecol Evol 2(5):904–909. https://doi.org/10.1038/s41559-018-0525-3
Doyon JP, Chauve C, Hamel S (2008) Algorithms for exploring the space of gene tree/species tree reconciliations. In: Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics), vol 5267 LNBI, pp 1–13. https://doi.org/10.1007/978-3-540-87989-3_1
Doyon JP, Scornavacca C, Gorbunov KY, Szöllősi GJ, Ranwez V, Berry V (2010) An efficient algorithm for gene/species trees parsimonious reconciliation with losses, duplications and transfers. In: Tannier E (ed) Proceedings of the comparative genomics: international workshop, RECOMB-CG 2010, Ottawa, Canada, October 9—11, 2010. Springer, Berlin, pp 93–108. https://doi.org/10.1007/978-3-642-16181-0_9
Doyon JP, Ranwez V, Daubin V, Berry V (2011) Models, algorithms and programs for phylogeny reconciliation. Brief Bioinform 12(5):392–400. https://doi.org/10.1093/bib/bbr045
Drouin G, Prat F, Ell M, Clarke GD (1999) Detecting and characterizing gene conversions between multigene family members. Mol Biol Evol 16(10):1369–1390. https://doi.org/10.1093/oxfordjournals.molbev.a026047
Dufayard JF, Duret L, Penel S, Gouy M, Rechenmann F, Perriere G (2005) Tree pattern matching in phylogenetic trees: automatic search for orthologs or paralogs in homologous gene sequence databases. Bioinformatics 21(11):2596–2603. https://doi.org/10.1093/bioinformatics/bti325
Felsenstein J (2004) Inferring phylogenies. Sinauer Associates, Sunderland
Gaut BS, Clegg MT (1993) Molecular evolution of the Adh1 locus in the genus Zea. Proc Natl Acad Sci USA 90(11):5095–5099. https://doi.org/10.1073/pnas.90.11.5095
Górecki P, Tiuryn J (2006) DLS-trees: a model of evolutionary scenarios. Theor Comput Sci 359(1–3):378–399. https://doi.org/10.1016/j.tcs.2006.05.019
Groussin M, Mazel F, Sanders JG, Smillie CS, Lavergne S, Thuiller W, Alm EJ (2017) Unraveling the processes shaping mammalian gut microbiomes over evolutionary time. Nat Commun 8(14):319. https://doi.org/10.1038/ncomms14319
Halldorsson BV, Hardarson MT, Kehr B, Styrkarsdottir U, Gylfason A, Thorleifsson G, Zink F, Jonasdottir A, Jonasdottir A, Sulem P, Masson G, Thorsteinsdottir U, Helgason A, Kong A, Gudbjartsson DF, Stefansson K (2016) The rate of meiotic gene conversion varies by sex and age. Nat Genet 48(11):1377–1384. https://doi.org/10.1038/ng.3669
Hallett MT, Lagergren J (2001) Efficient algorithms for lateral gene transfer problems. In: Proceedings of the fifth annual international conference on computational biology. ACM, New York. RECOMB ’01, pp 149–156. https://doi.org/10.1145/369133.369188
Hao W (2010) OrgConv: detection of gene conversion using consensus sequences and its application in plant mitochondrial and chloroplast homologs. BMC Bioinform 11:114–114. https://doi.org/10.1186/1471-2105-11-114
Hein J (1993) A heuristic method to reconstruct the history of sequences subject to recombination. J Mol Evol 36(4):396–405. https://doi.org/10.1007/BF00182187
Hsu CH, Zhang Y, Hardison RC, Program NCS, Green ED, Miller W (2010) An effective method for detecting gene conversion events in whole genomes. J Comput Biol 17:1281–1297. https://doi.org/10.1089/cmb.2010.0103
Hu F, Lin Y, Tang J (2014) MLGO: phylogeny reconstruction and ancestral inference from gene-order data. BMC Bioinform 15:354. https://doi.org/10.1186/s12859-014-0354-6
Hughes AL (1995) Origin and evolution of hla class-I pseudogenes. Mol Biol Evol 12:247–258. https://doi.org/10.1093/oxfordjournals.molbev.a040201
Iwase M, Satta Y, Hirai H, Hirai Y, Takahata N (2010) Frequent gene conversion events between the X and Y homologous chromosomal regions in primates. BMC Evol Biol 10(1):225. https://doi.org/10.1186/1471-2148-10-225
Jakobsen IB, Easteal S (1996) A program for calculating and displaying compatibility matrices as an aid in determining reticulate evolution in molecular sequences. Comput Appl Biosci 12(4):291–295. https://doi.org/10.1093/bioinformatics/12.4.291
Ji X, Griffing A, Thorne JL (2016) A phylogenetic approach finds abundant interlocus gene conversion in yeast. Mol Biol Evol 33(9):2469–2476. https://doi.org/10.1093/molbev/msw114
Kejnovsky E, Hobza R, Kubat Z, Widmer A, Marais GAB, Vyskot B (2007) High intrachromosomal similarity of retrotransposon long terminal repeats: evidence for homogenization by gene conversion on plant sex chromosomes? Gene 390:92–97. https://doi.org/10.1016/j.gene.2006.10.007
Ko WY, Kaercher KA, Giombini E, Marcatili P, Froment A, Ibrahim M, Lema G, Nyambo TB, Omar SA, Wambebe C, Ranciaro A, Hirbo JB, Tishkoff SA (2011) Effects of natural selection and gene conversion on the evolution of human glycophorins coding for mns blood polymorphisms in malaria-endemic african populations. Am J Hum Genet 88:741–754. https://doi.org/10.1016/j.ajhg.2011.05.005
Lz Gao (2004) Very low gene duplication rate in the yeast genome. Science 306(5700):1367–1370. https://doi.org/10.1126/science.1102033
Mansai SP, Innan H (2010) The power of the methods for detecting interlocus gene conversion. Genetics 184:517–527. https://doi.org/10.1534/genetics.109.111161
Matassi G (2017) Horizontal gene transfer drives the evolution of Rh50 permeases in prokaryotes. BMC Evol Biol 17(1):2. https://doi.org/10.1186/s12862-016-0850-6
McGrath CL, Casola C, Hahn MW (2009) Minimal effect of ectopic gene conversion among recent duplicates in four mammalian genomes. Genetics 182(2):615–622. https://doi.org/10.1534/genetics.109.101428
McGuire G, Wright F, Prentice MJ (1997) A graphical method for detecting recombination in phylogenetic data sets. Mol Biol Evol 14(11):1125–1131. https://doi.org/10.1093/oxfordjournals.molbev.a025722
Menotti-Raymond M, Starmer WT, Sullivan DT (1991) Characterization of the structure and evolution of the Adh region of Drosophila hydei. Genetics 127(2):355–66
Merkle D, Middendorf M, Wieseke N (2010) A parameter-adaptive dynamic programming approach for inferring cophylogenies. BMC Bioinf 11(1):S60. https://doi.org/10.1186/1471-2105-11-S1-S60
Mirarab S, Bayzid MS, Boussau B, Warnow T (2014) Statistical binning enables an accurate coalescent-based estimation of the avian tree. Science (New York, NY) 346(1250):463. https://doi.org/10.1126/science.1250463
Nakhleh L (2013) Computational approaches to species phylogeny inference and gene tree reconciliation. Trends Ecol Evol (Amst) 28(12):719–728. https://doi.org/10.1016/j.tree.2013.09.004
Osada N, Innan H (2008) Duplication and gene conversion in the Drosophila melanogaster genome. PLoS Genet 4(12):e1000,305. https://doi.org/10.1371/journal.pgen.1000305
Page RD, Charleston MA (1998) Trees within trees: phylogeny and historical associations. Trends Ecol Evol (Amst) 13(9):356–359. https://doi.org/10.1016/S0169-5347(98)01438-4
Peneder P, Wallner B, Vogl C (2017) Exchange of genetic information between Therian X and Y chromosome gametologs in old evolutionary strata. Ecol Evol 7(20):8478–8487. https://doi.org/10.1002/ece3.3278
Planet PJ, Kachlany SC, Fine DH, DeSalle R, Figurski DH (2003) The widespread colonization island of actinobacillus actinomycetemcomitans. Nat Genet 34(2):193–198. https://doi.org/10.1038/ng1154
Sawyer SA (1989) Statistical tests for detecting gene conversion. Mol Biol Evol 6(5):526–538. https://doi.org/10.1093/oxfordjournals.molbev.a040567
Scornavacca C, Paprotny W, Berry V, Ranwez V (2013) Representing a set of reconciliations in a compact way. J Bioinform Comput Biol 11(02):1250,025. https://doi.org/10.1142/S0219720012500254
Searls DB (2003) Pharmacophylogenomics: genes, evolution and drug targets. Nat Rev Drug Discov 2(8):613–623. https://doi.org/10.1038/nrd1152
Song G, Hsu CH, Riemer C, Miller W (2011a) Evaluation of methods for detecting conversion events in gene clusters. BMC Bioinform 12(Suppl 1):S45. https://doi.org/10.1186/1471-2105-12-S1-S45
Song G, Hsu CH, Riemer C, Zhang Y, Kim HL, Hoffmann F, Zhang L, Hardison RC, Green ED, Miller W (2011b) Conversion events in gene clusters. BMC Evol Biol 11(1):226. https://doi.org/10.1186/1471-2148-11-226
Storm CE, Sonnhammer EL (2002) Automated ortholog inference from phylogenetic trees and calculation of orthology reliability. Bioinformatics 18(1):92–99. https://doi.org/10.1093/bioinformatics/18.1.92
Szöllősi GJ, Boussau B, Abby SS, Tannier E (2012) Phylogenetic modeling of lateral gene transfer reconstructs the pattern and relative timing of speciations. Proc Natl Acad Sci USA 43:17,513–17,518. https://doi.org/10.1073/pnas.1202997109
Szöllősi GJ, Rosikiewicz W, Boussau B, Tannier E, Daubin V (2013a) Efficient exploration of the space of reconciled gene trees. Syst Biol 62(6):901–912. https://doi.org/10.1093/sysbio/syt054
Szöllősi GJ, Tannier E, Lartillot N, Daubin V (2013b) Lateral gene transfer from the dead. Syst Biol 62(3):386–397. https://doi.org/10.1093/sysbio/syt003
Szöllősi GJ, Tannier E, Daubin V, Boussau B (2015) The inference of gene trees with species trees. Syst Biol 64(1):42–62. https://doi.org/10.1093/sysbio/syu048
Teshima KM (2004) The effect of gene conversion on the divergence between duplicated genes. Genetics 166(3):1553–1560. https://doi.org/10.1534/genetics.166.3.1553
Tofigh A, Hallett M, Lagergren J (2011) Simultaneous identification of duplications and lateral gene transfers. IEEE/ACM Trans Comput Biol Bioinform 8(2):517–535. https://doi.org/10.1109/TCBB.2010.14
Trombetta B, Cruciani F (2017) Y chromosome palindromes and gene conversion. Hum Genet. https://doi.org/10.1007/s00439-017-1777-8
Trombetta B, D’Atanasio E, Cruciani F (2017) Patterns of inter-chromosomal gene conversion on the male-specific region of the human y chromosome. Front Genet. https://doi.org/10.3389/fgene.2017.00054
van der Heijden RT, Snel B, van Noort V, Huynen MA (2007) Orthology prediction at scalable resolution by phylogenetic tree analysis. BMC Bioinform 8:83. https://doi.org/10.1186/1471-2105-8-83
Vanhove MPM, Pariselle A, Van Steenberge M, Raeymaekers JAM, Hablützel PI, Gillardin C, Hellemans B, Breman FC, Koblmüller S, Sturmbauer C, Snoeks J, Volckaert FAM, Huyse T (2015) Hidden biodiversity in an ancient lake: phylogenetic congruence between Lake Tanganyika tropheine cichlids and their monogenean flatworm parasites. Sci Rep 5(13):669. https://doi.org/10.1038/srep13669
Vernot B, Stolzer M, Goldman A, Durand D (2008) Reconciliation with non-binary species trees. J Comput Biol 15(8):981–1006. https://doi.org/10.1089/cmb.2008.0092
Yb Chan, Ranwez V, Scornavacca C (2015) Exploring the space of gene/species reconciliations with transfers. J Math Biol 71(5):1179–1209. https://doi.org/10.1007/s00285-014-0851-2
Yb Chan, Ranwez V, Scornavacca C (2017) Inferring incomplete lineage sorting, duplications, transfers and losses with reconciliations. J Theor Biol 432:1–13. https://doi.org/10.1016/j.jtbi.2017.08.008
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This work is funded by the Agence Nationale pour la Recherche, Ancestrome project ANR-10-BINF-01-01.
Rights and permissions
About this article
Cite this article
Hasić, D., Tannier, E. Gene tree species tree reconciliation with gene conversion. J. Math. Biol. 78, 1981–2014 (2019). https://doi.org/10.1007/s00285-019-01331-w
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00285-019-01331-w