Abstract
In this brief note, we review the taxonomic history of dahlia mosaic virus (DMV) and related viruses. DMV is the only officially recognized caulimovirus known to infect dahlia (Dahlia variabilis) plants, although this virus appears to be relatively rare as a pathogen compared to a more recently described but unclassified caulimovirus called dahlia common mosaic virus (DCMV). We have undertaken a new set of analyses to test the hypothesis that DCMV represents a new caulimovirus species whose members infect dahlia, but we ultimately reject this hypothesis. A probable sequencing error was identified in the reference genome sequence of DMV, and consequently, we recommend that an alternative virus isolate be nominated as the exemplar for this species. In accordance with the new binomial nomenclatural system, it is proposed that the virus species be called “Caulimovirus dahliae”.
Avoid common mistakes on your manuscript.
Dahlia mosaic is one of the most prevalent and damaging viral diseases of dahlia (Dahlia variabilis) in Europe and the USA [4, 19]. Dahlia mosaic virus (DMV), the putative causative agent of the disease, was one of the first officially recognized members of the genus Caulimovirus (initially called the ‘cauliflower mosaic virus group’), appearing in the first report of the International Committee on Nomenclature of Viruses, published in 1971 [23]. At this early point in the development of a classification scheme for viruses, formal taxonomic proposals were not required to recognise a new virus species. Nevertheless, there was strong evidence at the time to support the classification of DMV and cauliflower mosaic virus (CaMV) as closely related but distinct members of this virus genus. The characters that were used for classification included the size and shape of the virion, the presence of electron-dense inclusion bodies in the cytoplasm of infected cells, the mode of transmission, differences in host range, and weak serological relatedness of the viruses in gel diffusion tests [3, 4].
The diagnostic criteria used to distinguish DMV from CaMV are still relevant today, although they have been supplanted by nucleotide-sequence-based criteria to differentiate between related viruses that cause similar disease symptoms and share many other biological characteristics. Genome architecture and phylogeny are now used for classification to the taxonomic rank of genus, and the principal criterion used to demarcate different virus species in the family Caulimoviridae is less than 80% nucleotide sequence identity in the conserved reverse transcriptase and ribonuclease H coding domains of the polymerase gene (open reading frame [ORF] 5) [22].
Nicolaisen [14] provided the first sequence data for dahlia mosaic virus (DMV), obtained from a 1195-bp PCR amplicon from a dahlia plant with mosaic symptoms, which was obtained from the Dutch Flowerbulb Inspection Service. This DNA sequence, deposited in the GenBank database on 13 November 2002 under INSDC accession code AJ515906.1, contains parts of ORF 4 (capsid protein) and ORF 5 (polymerase protein).
Almost contemporaneously with the publication by Nicolaisen [14], IB Maiti and S Pattanaik deposited six sequences representing a nearly complete genome sequence of DMV in the GenBank database on 5 May 2003 (INSDC accession codes AY309479.1, AY309480.1, AY291585.1, AY291586.1, AY291587.1, and AY291588.1). These unpublished sequence records of the genome of DMV are problematic for a stable taxonomy of DMV. Firstly, metadata describing the origin of the sequences were not provided. Secondly, sequences of each open reading frame (ORF) were deposited as individual flat files and not as a contiguous genome sequence. Finally, the intergenic region containing the 35S promoter sequence was excluded from the series of sequence depositions.
A complete genome sequence of DMV was later deposited in GenBank by IB Maiti and S Pattanaik on 15 September 2015, which derived from an infected dahlia plant collected on 20 May 1980 in Portland, Oregon (INSDC accession code JX272320.1). This virus isolate has been nominated the exemplar for DMV by the International Committee on Taxonomy of Viruses (ICTV), and accordingly, the genome sequence is recognized as the reference sequence for DMV by the NCBI (RefSeq accession code NC_018616.1). The citations associated with INSDC accession code JX272320.1 were Banerjee et al. [2] and Sahoo et al. [20], but both publications describe characterisation of the 35S promoter and do not include details of the origin of the virus isolate or methods used to generate the sequence, nor do they describe essential features of the virus genome, including its episomal nature.
Pahalawatta et al. [15], Pahalawatta et al. [16], and Pappu et al. [18] described two more distinct caulimoviruses from dahlia: dahlia common mosaic virus (DCMV) and DMV-D10. In these papers, the virus from the Netherlands [14] was labelled as DMV-Holland, and that sequenced by Maiti and Pattanaik, DMV-Portland. DMV-Holland and DCMV were most alike, having 90% nucleotide (nt) sequence identity in the overlapping region of the genome sequence [18]. However, Pappu et al. [18] used DMV-Portland as the exemplar for DMV because a near-complete genome sequence was available, which allowed for more comprehensive pairwise sequence comparisons and phylogenetic analyses. When ORF 5 protein sequences were compared, DCMV had only 70−71% amino acid (aa) sequence identity to both DMV-Portland and DMV-D10, which justified recognition of a new caulimovirus species. However, this species has never been officially approved by the ICTV despite its persistence as a taxon in the literature [5, 9, 24]. Pappu et al. [18] followed the precedent set by Maiti and Pattanaik and deposited the genome sequence of DCMV, minus the intergenic region, in separate parts representing each ORF (INSDC accession codes EU090952, EU090953, EU090954, EU090955, EU090956, and EU090957). Pappu et al. [18] provided the name ‘DMV-Holland’ instead of DCMV in the sequence definition field, even though the authors had concluded they had sequenced a member of a distinct virus species.
DMV-D10 is quite enigmatic, as it has an aberrant genome architecture compared to all other caulimoviruses [16]. Specifically, ORF 3, which encodes an aphid transmission helper protein, is absent in DMV-D10, and the putative capsid protein also contains a C-terminal deletion and is fused in-frame with the polymerase protein. As a result of these large deletions, the genome of DMV-D10, at 7 kbp, is much shorter than a normal caulimovirus genome, which is about 8 kbp [22]. Southern blot hybridization and fluorescent in situ hybridization experiments suggest that DMV-D10 is integrated in the nuclear genome of dahlia [7, 15], and it is highly unlikely that this endogenous viral element is able to replicate autonomously, as it is missing the conserved zinc finger motif in the nucleocapsid domain of the capsid protein that is essential for infectivity of the virus [8]. However, the replication of DMV-D10 might be supported by a coinfecting virus such as DCMV. DMV-D10, renamed Dahlia variabilis endogenous pararetroviral sequence (DvEPRS) by Eid and Pappu [7], was detected in wild plants of four different Dahlia species (D. coccinea, D. rupicola, D. tenuicaulis and D. sherffii) growing in their natural habitats in west-central Mexico, suggesting that the integration events occurred well before the domestication of dahlia [6].
In recent years, the genomes of many more representatives of DMV, DCMV, and DMV-D10 from around the world have been sequenced in their entirety or in part, providing a much better foundation to resolve the classification of these viruses. To test the hypothesis that DCMV is a member of a distinct virus species, we have undertaken a new analysis using the complete data set available to us in April 2022. The ORF 5 protein sequences of the exemplar isolates of all officially recognized or tentative caulimovirus species, as well as DMV, DCMV, and DMV-D10, were downloaded from the NCBI Protein database and aligned using COBALT [17]. This protein alignment was then used to guide alignment of the corresponding DNA sequences using the TranslatorX server available at http://161.111.161.41/index_v4.html [1]. For reasons that are explained below, the ORF 5 sequence of DMV-Portland (JX272320.1) was added to the existing DNA alignment using the ‘realign selected sequences’ function in CLUSTAL X v. 2.1. For the phylogenetic analysis, the variable 5’ and 3’ ends of the alignment were deleted so that the alignment only represented the conserved aspartic protease (cd00303), reverse transcriptase (cd00304), and ribonuclease H1 (pfam17917) domains. This alignment was further truncated to the reverse transcriptase (RH) and ribonuclease H1 (RH1) domains (nt 4450–5661 of NC_001497.2) for pairwise sequence comparisons using MEGA v. 11 [21]. A phylogenetic tree was inferred by maximum likelihood using IQ-Tree v2 with an automatic substitution model (command -m TEST) and 10,000 replicates each of an approximate likelihood ratio test and ultrafast bootstraps [12, 13]. Relationships among the sampled populations of DCMV and DMV-D10 were visualised with SplitsTree v4 [10], and the pairwise homoplasy index was calculated as a measure of recombination within each population.
Our phylogenetic analysis identified DMV as sister to DCMV (100% support from UltraFast Bootstraps and an approximate likelihood ratio test, Fig. 1A). The sequenced isolates of DCMV have low genetic diversity (98.8–99.5% nt sequence identity in the RT and RH1 coding domains) and a signature of clonal reproduction based on a lack of reticulate branches in SplitsTree (Fig. 1B), and a high pairwise homoplasy index (≥ 0.5 significant for clonal reproduction). These results suggest that DCMV has spread around the world by trade of infected vegetative propagules (rooted cuttings or root tuber pieces) of one or a small number of dahlia cultivars. Within the conserved RT and RH1 coding domains of ORF 5, DMV-Portland and the six isolates of DCMV have 85.5–85.9% nt sequence identity, which would suggest that these viruses are conspecific. However, DMV-Portland and DCMV also share 84.0% and 82.5–82.8% nt sequence identity, respectively, with mirabilis mosaic virus (AF454635), which brings into question the universal applicability of the 80% nt identity threshold for caulimovirus species demarcation. Clearly, more sampling of the caulimovirus sequence populations in dahlia and Mirabilis jalapa is required to more accurately define the species boundaries and to determine if MiMV is also synonymous with DMV.
Pappu et al. [18] obtained a significantly more divergent amino acid (aa) sequence identity figure for the aligned polymerase protein sequences of DMV-Portland and DCMV than would be expected from our pairwise nucleotide sequence comparisons. This result can be explained by a single nucleotide deletion in ORF 5 of DMV-Portland (position 5070 in NC_018616.1) that both alters the reading frame of the sequence and causes premature termination of the ORF, leading to a large deletion at the C-terminus of the RH1 enzyme, including much of the catalytic domain [11]. If the DNA sequence downstream of the putative indel in DMV-Portland is excluded in sequence comparisons, then the conceptually translated RT and RH1 domains have 93.4% aa sequence identity. We hypothesise that the indel in the DMV-Portland genome sequence is a sequencing error, as otherwise it would be predicted to render the virus replication-defective due to deletion of several sequence motifs that are conserved across all ribonuclease H1 proteins classified in protein family pfam17917. Alternatively, it is possible that that the sequence is accurate but corresponds to a mutated endogenous caulimoviral element.
DCMV appears to be a clonal lineage of DMV, and following the current ICTV species demarcation criteria, there is no justification to recognise two different virus species. The debate as to which name to apply to the taxon has been made redundant with the introduction of Latinized binomial species names [25], as an entirely new taxon name will need to be created for the caulimovirus associated with dahlia mosaic disease. We propose that this virus species be called “Caulimovirus dahliae” and that the exemplar isolate for this species be DCMV isolate JP (INSDC accession code LC625373), as this is the only virus isolate for which Koch’s postulates of pathogenicity have been satisfied through use of an infectious clone [24]. We also recommend use of the name “dahlia common mosaic virus” when referring to the virus in a vernacular sense. While DMV-D10 is clearly distinct from DCMV, replication-defective endogenous caulimoviral elements are not yet formally classified by the ICTV, and therefore, there is not a pressing need to introduce a new taxon name.
References
Abascal F, Zardoya R, Telford MJ (2010) TranslatorX: multiple alignment of nucleotide sequences guided by amino acid translations. Nucleic Acids Res 38:W7–W13
Banerjee J, Sahoo DK, Raha S, Sarkar S, Dey N, Maiti IB (2015) A region containing an as-1 element of dahlia mosaic virus (DaMV) subgenomic transcript promoter plays a key role in green tissue- and root-specific expression in plants. Plant Mol Biol Rep 33:532–556
Brierley P (1950) Some vectors, hosts, and properties of dahlia mosaic virus. Plant Dis Rep 34:363
Brunt AA (1971) Some hosts and properties of dahlia mosaic virus. Ann Appl Biol 67:357–368
Chao HY, Chen YK (2012) Molecular characterization of a dahlia mosaic-associated Caulimovirus. Plant Pathol Bull 21:101–114
Eid S, Saar DE, Druffel KL, Pappu HR (2011) Plant pararetroviral sequences in wild Dahlia species in their natural habitats in Mexican mountain ranges. Plant Pathol 60:378–383
Eid S, Pappu HR (2014) Expression of endogenous para-retroviral genes and molecular analysis of the integration events in its plant host Dahlia variabilis. Virus Genes 48:153–159
Haas M, Bureau M, Geldreich A, Yot P, Keller M (2002) Cauliflower mosaic virus: still in the news. Mol Plant Pathol 3:419–429
Hadfield J, Linderme D, Shepherd DN, Bezuidenhout M, Lefeuvre P, Martin DP, Varsani A (2011) Complete genome sequence of a dahlia common mosaic virus isolate from New Zealand. Adv Virol 156:2297–2301
Huson DH, Bryant D (2005) Application of phylogenetic networks in evolutionary studies. Mol Biol Evol 23:254–267
Hyjek M, Figiel M, Nowotny M (2019) RNases H: structure and mechanism. DNA Repair 84:102672
Minh BQ, Nguyen MAT, von Haeseler A (2013) Ultrafast approximation for phylogenetic bootstrap. Mol Biol Evol 30:1188–1195
Minh BQ, Schmidt HA, Chernomor O, Schrempf D, Woodhams MD, von Haeseler A, Lanfear R (2020) IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol Biol Evol 37:1530–1534
Nicolaisen M (2003) Partial molecular characterization of dahlia mosaic virus and its detection by PCR. Plant Dis 87:945–948
Pahalawatta V, Druffel K, Pappu H (2008) A new and distinct species in the genus Caulimovirus exists as an endogenous plant pararetroviral sequence in its host, Dahlia variabilis. Virology 376:253–257
Pahalawatta V, Druffel KL, Wyatt SD, Eastwell KC, Pappu HR (2008) Genome structure and organization of a member of a novel and distinct species of the genus Caulimovirus associated with dahlia mosaic. Adv Virol 153:733–738
Papadopoulos JS, Agarwala R (2007) COBALT: constraint-based alignment tool for multiple protein sequences. Bioinformatics 23:1073–1079
Pappu H, Druffel K, Miglino R, van Schadewijk A (2008) Nucleotide sequence and genome organization of a member of a new and distinct Caulimovirus species from dahlia. Adv Virol 153:2145–2148
Pappu HR, Wyatt SD, Druffel KL (2005) Dahlia mosaic virus: molecular detection and distribution in dahlia in the United States. HortScience 40:697–699
Sahoo DK, Sarkar S, Raha S, Das NC, Banerjee J, Dey N, Maiti IB (2015) Analysis of dahlia mosaic virus full-length transcript promoter-driven gene expression in transgenic plants. Plant Mol Biol Rep 33:178–199
Tamura K, Stecher G, Kumar S (2021) MEGA11: Molecular Evolutionary Genetics Analysis version 11. Mol Biol Evol 38:3022–3027
Teycheney P-Y, Geering ADW, Dasgupta I, Hull R, Kreuze JF, Lockhart B, Muller E, Olszewski N, Pappu HR, Pooggin MM, Richert-Pöggeler KR, Schoelz JE, Seal S, Stavolone L, Umber M (2020) ICTV virus taxonomy profile: Caulimoviridae. J Gen Virol 101:1025–1026
Wildy P (1971) Classification and Nomenclature of Viruses: First Report of the International Committee on Nomenclature of Viruses. S. Karger, Basel
Yamamoto D, Neriya Y, Suzuki T, Nishigawa H, Natsuaki T (2021) Construction of an infectious dahlia common mosaic virus clone. Adv Virol 166:3179–3182
Zerbini FM, Siddell SG, Mushegian AR, Walker PJ, Lefkowitz EJ, Adriaenssens EM, Alfenas-Zerbini P, Dutilh BE, García ML, Junglen S, Krupovic M, Kuhn JH, Lambert AJ, Łobocka M, Oksanen HM, Robertson DL, Rubino L, Sabanadzovic S, Simmonds P, Suzuki N, Van Doorslaer K, Vandamme A-M, Varsani A (2022) Differentiating between viruses and virus species by writing their names correctly. Adv Virol 167:1231–1234
Funding
Open Access funding enabled and organized by CAUL and its Member Institutions. P-Y.T. was supported by the European Union (ERDF) and the Conseil Régional de La Réunion.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that there is no conflict of interest.
Additional information
Handling Editor: Elvira Fiallo-Olivé.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Geering, A.D.W., McTaggart, A.R. & Teycheney, PY. Untangling the taxonomy of dahlia mosaic virus. Arch Virol 167, 2325–2329 (2022). https://doi.org/10.1007/s00705-022-05567-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00705-022-05567-8