Journal of Molecular Evolution

, Volume 69, Issue 1, pp 65–80 | Cite as

Phylogeny of Banana Streak Virus Reveals Recent and Repetitive Endogenization in the Genome of Its Banana Host (Musa sp.)



Banana streak virus (BSV) is a plant dsDNA pararetrovirus (family Caulimoviridae, genus badnavirus). Although integration is not an essential step in the BSV replication cycle, the nuclear genome of banana (Musa sp.) contains BSV endogenous pararetrovirus sequences (BSV EPRVs). Some BSV EPRVs are infectious by reconstituting a functional viral genome. Recent studies revealed a large molecular diversity of episomal BSV viruses (i.e., nonintegrated) while others focused on BSV EPRV sequences only. In this study, the evolutionary history of badnavirus integration in banana was inferred from phylogenetic relationships between BSV and BSV EPRVs. The relative evolution rates and selective pressures (dN/dS ratio) were also compared between endogenous and episomal viral sequences. At least 27 recent independent integration events occurred after the divergence of three banana species, indicating that viral integration is a recent and frequent phenomenon. Relaxation of selective pressure on badnaviral sequences that experienced neutral evolution after integration in the plant genome was recorded. Additionally, a significant decrease (35%) in the EPRV evolution rate was observed compared to BSV, reflecting the difference in the evolution rate between episomal dsDNA viruses and plant genome. The comparison of our results with the evolution rate of the Musa genome and other reverse-transcribing viruses suggests that EPRVs play an active role in episomal BSV diversity and evolution.


Badnavirus Banana (Musa sp.) Banana streak virus (BSV) dN/dS ratio Endogenous pararetrovirus (EPRV) Evolution rate Integration Selective constraints 



We are grateful to Nathalie Laboureau and Serge Galzi for technical assistance. We thank the members of our team, Elisabeth Fournier, Eric Bazin, and Nicolas Galtier for their helpful comments, and Philippe Rott for improving the manuscript. P.G. was supported by a PhD grant CIRAD – Région Languedoc Roussillon.


  1. Ben-Dov Y (1994) A systematic catalogue of the mealybugs of the world (Insecta: Homoptera: Coccoidea: Pseudococcidae and Putoidae) with data on geographical distribution, host plants, biology and economic importance. Intercept Publications, Andover, p 686Google Scholar
  2. Bouhida M, Lockhart BE, Olszewski NE (1993) An analysis of the complete sequence of a Sugarcane bacilliform virus genome infectious to banana and rice. J Gen Virol 74:15–22PubMedCrossRefGoogle Scholar
  3. Bromham L (2002) The human zoo: endogenous retroviruses in the human genome. Trends Ecol Evol 17:91–97CrossRefGoogle Scholar
  4. Daubin V, Ochman H (2004) Start-up entities in the origin of new genes. Curr Opin Genet Dev 14:616–619PubMedCrossRefGoogle Scholar
  5. Duffy S, Shackelton LA, Holmes EC (2008) Rates of evolutionary change in viruses: patterns and determinants. Nat Rev Genet 9:267–276PubMedCrossRefGoogle Scholar
  6. Fauquet CM, Mayo MA, Maniloff J, Desselberger U, Ball LA (2005) Virus taxonomy, VIII report of the ICTV. Elsevier/Academic Press, LondonGoogle Scholar
  7. Flint SJ, Enquist LW, Skalka AM (2003) Principles of virology: molecular biology, pathogenesis, and control of animal viruses, 2nd edn. ASM Press, Washington, DCGoogle Scholar
  8. Gaut BS, Wright SI, Rizzon C, Dvorak J, Anderson LK (2007) Recombination: an underappreciated factor in the evolution of plant genomes. Nat Rev Genet 8:77–84PubMedCrossRefGoogle Scholar
  9. Gawel NJ, Jarret RL (1991) A modified CTAB DNA extraction procedure for Musa and Ipomoea. Plant Mol Biol Rep 9:262–266CrossRefGoogle Scholar
  10. Gayral P, Noa-Carrazana J-C, Lescot M, Lheureux F, Lockhart BEL, Matsumoto T, Piffanelli P, Iskra-Caruana M-L (2008) A single Banana streak virus integration event in the banana genome as the origin of infectious endogenous pararetrovirus. J Virol 82:6697–6710PubMedCrossRefGoogle Scholar
  11. Geering ADW, McMichael LA, Dietzgen RG, Thomas JE (2000) Genetic diversity among Banana streak virus isolates from Australia. Phytopathology 90:921–927PubMedCrossRefGoogle Scholar
  12. Geering ADW, Olszewski NE, Dahal G, Thomas JE, Lockhart BEL (2001) Analysis of the distribution and structure of integrated Banana streak virus DNA in a range of Musa cultivars. Mol Plant Pathol 2:207–213CrossRefGoogle Scholar
  13. Geering ADW, Olszewski NE, Harper G, Lockhart BEL, Hull R, Thomas JE (2005a) Banana contains a diverse array of endogenous badnaviruses. J Gen Virol 86:511–520PubMedCrossRefGoogle Scholar
  14. Geering ADW, Pooggin MM, Olszewski NE, Lockhart BEL, Thomas JE (2005b) Characterisation of Banana streak Mysore virus and evidence that its DNA is integrated in the B genome of cultivated Musa. Arch Virol 150:787–796PubMedCrossRefGoogle Scholar
  15. Gregor W, Mette MF, Staginnus C, Matzke MA, Matzke AJM (2004) A distinct endogenous pararetrovirus family in Nicotiana tomentosiformis, a diploid progenitor of polyploid tobacco. Plant Physiol 134:1191–1199PubMedCrossRefGoogle Scholar
  16. Guindon S, Gascuel O (2003) A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol 52:696–704PubMedCrossRefGoogle Scholar
  17. Hall TA (1999) BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symp Ser 41:95–98Google Scholar
  18. Hanada K, Suzuki Y, Gojobori T (2004) A large variation in the rates of synonymous substitution for RNA viruses and its relationship to a diversity of viral infection and transmission modes. Mol Biol Evol 21:1074–1080PubMedCrossRefGoogle Scholar
  19. Hansen CN, Harper G, Heslop-Harrison JS (2005) Characterisation of pararetrovirus-like sequences in the genome of potato (Solanum tuberosum). Cytogenet Genome Res 110:559–565PubMedCrossRefGoogle Scholar
  20. Harper G, Hull R (1998) Cloning and sequence analysis of Banana streak virus DNA. Virus Genes 17:271–278PubMedCrossRefGoogle Scholar
  21. Harper G, Osuji JO, Heslop-Harrison JSP, Hull R (1999) Integration of Banana streak badnavirus into the Musa genome: molecular and cytogenetic evidence. Virology 255:207–213PubMedCrossRefGoogle Scholar
  22. Harper G, Hart D, Moult S, Hull R (2004) Banana streak virus is very diverse in Uganda. Virus Res 100:51–56PubMedCrossRefGoogle Scholar
  23. Harper G, Hart D, Moult S, Hull R, Geering A, Thomas J (2005) The diversity of Banana streak virus isolates in Uganda. Arch Virol 150:2407–2420PubMedCrossRefGoogle Scholar
  24. Hohn T, Richert-Poggeler KR, Harper G, Schwarzacher T, Teo CH, Techeney PY, Iskra-Caruana ML, Hull R (2008) Evolution of integrated plant viruses. In: Roosinck M (ed) Virus evolution. Springer, Heidelberg, chapter 4, pp 53–81Google Scholar
  25. Huelsenbeck JP, Ronquist F (2001) MRBAYES: Bayesian inference of phylogenetic trees. Bioinformatics 17:754–755PubMedCrossRefGoogle Scholar
  26. Hull R, Harper G, Lockhart B (2000) Viral sequences integrated into plant genomes. Trends Plant Sci 5:362–365PubMedCrossRefGoogle Scholar
  27. Jakowitsch J, Mette MF, van der Winden J, Matzke MA, Matzke AJM (1999) Integrated pararetroviral sequences define a unique class of dispersed repetitive DNA in plants. Proc Natl Acad Sci USA 96:13241–13246PubMedCrossRefGoogle Scholar
  28. Jaufeerally-Fakim Y, Khorugdharry A, Harper G (2006) Genetic variants of Banana streak virus in Mauritius. Virus Res 115:91–98PubMedCrossRefGoogle Scholar
  29. Jenkins GM, Rambaut A, Pybus OG, Holmes EC (2002) Rates of molecular evolution in RNA viruses: a quantitative phylogenetic analysis. J Mol Evol 54:156–165PubMedCrossRefGoogle Scholar
  30. Jones DR (2000) Diseases of banana, abacá, and enset. CABI, WallingfordGoogle Scholar
  31. Kunii M, Kanda M, Nagano H, Uyeda I, Kishima Y, Sano Y (2004) Reconstruction of putative DNA virus from endogenous rice tungro bacilliform virus-like sequences in the rice genome: implications for integration and evolution. BMC Genomics 5:80PubMedCrossRefGoogle Scholar
  32. Le Provost G, Iskra-Caruana ML, Acina I, Teycheney PY (2006) Improved detection of episomal Banana streak viruses by multiplex immunocapture PCR. J Virol Methods 137:7–13PubMedCrossRefGoogle Scholar
  33. Lescot M, Piffanelli P, Ciampi AY, Ruiz M, Blanc G, Leebens-Mack J, da Silva FR, Santos CM, D’Hont A, Garsmeur O, Vilarinhos AD, Kanamori H, Matsumoto T, Ronning CM, Cheung F, Haas BJ, Althoff R, Arbogast T, Hine E, Pappas GJ Jr, Sasaki T, Souza MT Jr, Miller RN, Glaszmann JC, Town CD (2008) Insights into the Musa genome: syntenic relationships to rice and between Musa species. BMC Genomics 9:58PubMedCrossRefGoogle Scholar
  34. Lheureux F, Laboureau N, Muller E, Lockhart BE, Iskra-Caruana ML (2007) Molecular characterization of Banana streak acuminata Vietnam virus isolated from Musa acuminata siamea (banana cultivar). Arch Virol 152:1409–1416PubMedCrossRefGoogle Scholar
  35. Lockhart B, Jones D (2000) Banana streak. In: Jones DR (ed) Diseases of banana, abaca and enset. CABI, Wallingford, pp 263–274Google Scholar
  36. Lockhart BE, Menke J, Dahal G, Olszewski NE (2000) Characterization and genomic analysis of Tobacco vein clearing virus, a plant pararetrovirus that is transmitted vertically and related to sequences integrated in the host genome. J Gen Virol 81:1579–1585PubMedGoogle Scholar
  37. Malik HS, Eickbush TH (2001) Phylogenetic analysis of ribonuclease H domains suggests a late, chimeric origin of LTR retrotransposable elements and retroviruses. Genome Res 11:1187–1197PubMedCrossRefGoogle Scholar
  38. Mette MF, Kanno T, Aufsatz W, Jakowitsch J, van der Winden J, Matzke MA, Matzke AJM (2002) Endogenous viral sequences and their potential contribution to heritable virus resistance in plants. EMBO J 21:461–469PubMedCrossRefGoogle Scholar
  39. Murad L, Bielawski JP, Matyasek R, Kovarik A, Nichols RA, Leitch AR, Lichtenstein CP (2004) The origin and evolution of geminivirus-related DNA sequences in Nicotiana. Heredity 92:352–358PubMedCrossRefGoogle Scholar
  40. Ndowora T, Dahal G, LaFleur D, Harper G, Hull R, Olszewski NE, Lockhart B (1999) Evidence that badnavirus infection in Musa can originate from integrated pararetroviral sequences. Virology 255:214–220PubMedCrossRefGoogle Scholar
  41. Noreen F, Akbergenov R, Hohn T, Richert-Poggeler KR (2007) Distinct expression of endogenous Petunia vein clearing virus and the DNA transposon dTph1 in two Petunia hybrida lines is correlated with differences in histone modification and siRNA production. Plant J 50:219–229PubMedCrossRefGoogle Scholar
  42. Pahalawatta V, Druffel K, Pappu H (2008) A new and distinct species in the genus Caulimovirus exists as an endogenous plant pararetroviral sequence in its host, Dahlia variabilis. Virology 376:253–257PubMedCrossRefGoogle Scholar
  43. Pal A, Chakrabarti A, Basak J (2007) New motifs within the NB-ARC domain of R proteins: probable mechanisms of integration of geminiviral signatures within the host species of Fabaceae family and implications in conferring disease resistance. J Theor Biol 246:564–573PubMedCrossRefGoogle Scholar
  44. Posada D, Buckley TR (2004) Model selection and model averaging in phylogenetics: advantages of Akaike information criterion and bayesian approaches over likelihood ratio tests. Syst Biol 53:793–808PubMedCrossRefGoogle Scholar
  45. Posada D, Crandall KA (1998) MODELTEST: testing the model of DNA substitution. Bioinformatics 14:817–818PubMedCrossRefGoogle Scholar
  46. Richert-Poggeler KR, Shepherd RJ (1997) Petunia vein-clearing virus: a plant pararetrovirus with the core sequences for an integrase function. Virology 236:137–146PubMedCrossRefGoogle Scholar
  47. Richert-Poggeler KR, Noreen F, Schwarzacher T, Harper G, Hohn T (2003) Induction of infectious petunia vein clearing (pararetro) virus from endogenous provirus in petunia. EMBO J 22:4836–4845PubMedCrossRefGoogle Scholar
  48. Schuermann D, Molinier J, Fritsch O, Hohn B (2005) The dual nature of homologous recombination in plants. Trends Genet 21:172–181PubMedCrossRefGoogle Scholar
  49. Staginnus C, Richert-Poggeler KR (2006) Endogenous pararetroviruses: two-faced travelers in the plant genome. Trends Plant Sci 11:485–491PubMedCrossRefGoogle Scholar
  50. Staginnus C, Gregor W, Mette MF, Teo CH, Borroto-Fernandez EG, Machado ML, Matzke M, Schwarzacher T (2007) Endogenous pararetroviral sequences in tomato (Solanum lycopersicum) and related species. BMC Plant Biol 7:24PubMedCrossRefGoogle Scholar
  51. Su L, Gao S, Huang Y, Ji C, Wang D, Ma Y, Fang R, Chen X (2007) Complete genomic sequence of Dracaena mottle virus, a distinct badnavirus. Virus Genes 35:423–429PubMedCrossRefGoogle Scholar
  52. Swofford DL (2002) PAUP* 4: phylogenetic analysis using parsimony (and other methods). Sinauer, SunderlandGoogle Scholar
  53. Thompson JD, Higgins DG, Gibson TJ (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 22:4673–4680PubMedCrossRefGoogle Scholar
  54. Ude G, Pillay M, Nwakanma D, Tenkouano A (2002) Analysis of genetic diversity and sectional relationships in Musa using AFLP markers. Theor Appl Genet 104:1239–1245PubMedCrossRefGoogle Scholar
  55. Weiss RA (2006) The discovery of endogenous retroviruses. Retrovirology 3:67PubMedCrossRefGoogle Scholar
  56. Xia X (1999) DAMBE (software package for data analysis in molecular biology and evolution) user manual. Department of Ecology and Biodiversity, University of Hong Kong, Hong KongGoogle Scholar
  57. Xia X, Xie Z (2001) DAMBE: software package for data analysis in molecular biology and evolution. J Hered 92:371–373PubMedCrossRefGoogle Scholar
  58. Xia X, Xie Z, Salemi M, Chen L, Wang Y (2003) An index of substitution saturation and its application. Mol Phylogenet Evol 26:1–7PubMedCrossRefGoogle Scholar
  59. Yang Z (1997) PAML: a programme package for phylogenetic analysis by maximum likelihood. Comput Appl Biosci 13:555–556PubMedGoogle Scholar
  60. Yang Z (1998) Likelihood ratio tests for detecting positive selection and application to primate lysozyme evolution. Mol Biol Evol 15:568–573PubMedGoogle Scholar
  61. Yang Z, Nielsen R (1998) Synonymous and nonsynonymous rate variation in nuclear genes of mammals. J Mol Evol 46:409–418PubMedCrossRefGoogle Scholar
  62. Yang Z, Yoder AD (2003) Comparison of likelihood and Bayesian methods for estimating divergence times using multiple gene Loci and calibration points, with application to a radiation of cute-looking mouse lemur species. Syst Biol 52:705–716PubMedCrossRefGoogle Scholar
  63. Yang IC, Hafner GJ, Dale JL, Harding RM (2003) Genomic characterisation of taro bacilliform virus. Arch Virol 148:937–949PubMedCrossRefGoogle Scholar
  64. Yoder AD, Yang Z (2000) Estimation of primate speciation dates using local molecular clocks. Mol Biol Evol 17:1081–1090PubMedGoogle Scholar
  65. Zhou Y, Holmes EC (2007) Bayesian estimates of the evolutionary rate and age of hepatitis B virus. J Mol Evol 65:197–205PubMedCrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media, LLC 2009

Authors and Affiliations

  1. 1.CIRADUMR Biologie et Génétique des Interactions Plante-Parasite (BGPI)Montpellier Cedex 5France

Personalised recommendations