Soil-borne wheat mosaic virus (SBWMV) is a furovirus responsible for mosaic symptoms in winter wheat and was first detected in the United States of America (McKinney 1923) followed by Europe approximately 40 years later (Canova 1964). The virus is transmitted by the soil-inhabiting plasmodiophorid, Polymyxa graminis Led. and by artificial mechanical inoculation. Infections of winter wheat by SBWMV can result in considerable yield losses (Palmer and Brakke 1975; Vallega and Autonell 1985). In Poland SBWMV was detected for the first time in 2012 (Trzmiel et al.). SBWMV-Pol1 isolate was not particularly virulent, and only produced semi-latent symptoms associated with a limited concentration of virus particles in plant tissues. Additionally, SBWMV-Pol1 differed biologically from the other described isolates (Trzmiel et al. 2012).

The study reported herein was conducted to clarify the contribution of nucleotide sequence variability to the observed biological differences among isolates. Experiments were conducted simultaneously to analyze SBWMV-Pol1, SBWMV-Heddesheim and SBWMV-SH1-3 because of the lack of available information regarding European isolates in the NCBI database. The sources of the SBWMV isolates were infested soils containing viruliferous spores of the vector. The SBWMV-Pol1 isolate was taken from Szelejewo in the southern part of the Greater Poland province (Trzmiel et al. 2012). The SBWMV-Heddesheim (from Heddesheim-Baden-Württemberg) and SBWMV-SH1-3 (from Cashagen-Schleswig-Holstein) isolates were provided by Prof. Thomas Kühne from the Julius Kühn Institut (Quedlinburg, Germany). Virus isolates were propagated in triticale (X Triticosecale Wittm. ex A. Camus) cv. Twingo and wheat (Triticum aestivum L.) cv. Brilliant plants grown in infested soils in climate chambers set to 17 °C (Shirako and Brakke 1984). After 2 months, all plants were examined using an enzyme-linked immunosorbent assay (ELISA) (LOEWE, Sauerlach, Germany). Isolates from infected plants were propagated mechanically. Symptomatic leaf homogenates in 0.1 M K2 HPO4 were used for inoculation of wheat plants at 2-3-leaf stage and were maintained in climate chamber at optimal 17 °C. Mechanical inoculation effectiveness were verified by ELISA test. The results indicated 30 and 21 infected out of 84 plants tested, respectively for SBWMV-Hedesheim and -SH1-3. Their optical density (OD) value ranged from 0.3 to 0.9 in comparison to average 0.05 for healthy plants. The same analysis for SBWMV-Pol1 isolate revealed only 9 infected plants out 128 tested. The OD value was lower and did not exceed 0.4. Additionally electron microscopic studies confirmed differences between German and Polish isolates. In plant sap infected with SBWMV-Heddesheim and -SH1-3 numerous typical rod-shape virions were observed while analogical analysis for -Pol1 isolate revealed only single viral particles (data not shown). Polish isolate was hardly reproduced by mechanical inoculation therefore its propagation was conducted only by soil-transmission and German isolates were propagated mechanically. For molecular analyses SBWMV-Pol1 was collected from infected triticale cv. Twingo plants cultivated for more than 4 months and SBWMV-SH1-3 and SBWMV-Heddesheim isolates were obtained from wheat cv. Brilliant plants after mechanical passages (about 4 and 12 months after the first inoculation, respectively). Total RNA was extracted from approximately 100 mg infected plant leaves using the NucleoSpin RNA Plant Kit (Macherey-Nagel, Düren, Germany) and stored at −20 °C. First-strand cDNA was synthesized using SuperScript® Reverse Transcriptase (Thermo Fisher Scientific, Waltham, MA, USA) with random hexamer primers following the manufacturer’s instructions. The polymerase chain reaction (PCR) amplifications were conducted with a set of original primers covering most (99%) of RNA1 coding sequence and the complete RNA2 sequence (Supplementary file). The primers were designed using the Primer3 program (http://frodo.wi.mit.edu /) based on the full-length RNA1 and RNA2 nucleotide sequences of US-Nebraska wild-type (NC002041 and L07938) (Rosen and Skaletski 2000). The PCR samples consisted of 1 μl reverse transcription (RT) mixture, 1 μl 10 × AccuPrime™ PCR Buffer II, 0.2 μl primer mix (10 μmol/μl each), 0.25 μl AccuPrime™ Taq DNA Polymerase (Thermo Fisher Scientific), and sterile Milli-Q water up to a final volume of 10 μl. The PCR conditions were as follows: 94 °C for 2 min; 35 cycles of 94 °C for 30 s, 55 °C for 30 s, 68 °C for 30–90 s (depending on the expected amplicon size); 68 °C for 5 min. Amplicons of the expected size were obtained for most primer pairs, except those amplifying the REP-encoding gene on RNA1. Along with the expected 1690-bp amplicon, SBW2-F2/SBW2-R2 primer pair produced additional shorter PCR products. All amplified products were excised from the agarose gel and purified using the Wizard®SV Gel and PCR Clean-Up System (Promega, Madison, WI, USA). The purified fragments were ligated into the pGEM-T Easy vector (Promega), and then inserted into Escherichia coli TOP10 competent cells (Thermo Fisher Scientific). Plasmid DNA was isolated using the NucleoSpin® Plasmid (NoLid) kit (Macherey-Nagel). At least three clones for each fragment were sequenced at Genomed S.A. (Warsaw, Poland) using the M13-F and M13-R primers. The nucleotide sequences were analyzed using the Standard Nucleotide BLAST online tool (blastn; http://blast.ncbi.nlm.nih.gov/Blast.cgi), after which they were compiled and edited using the BioEdit program (Hall 1999). The sequences were deposited in the NCBI GenBank database. Details regarding primer usage as well as gene fragment positions and their accession numbers are presented in Fig. 1, which was generated using the SnapGene Viewer program (http://www.snapgene.com).

Fig. 1
figure 1

Schematic representation of the Soil-borne wheat mosaic virus genome. Dark boxes represent the replicase (REP) and movement protein (MP) gene sequences on RNA1, as well as the capsid read-through (CP-RT) and cysteine-rich protein (CRP) gene sequences on RNA2. Grey boxes with white asterisks represent untranslated regions. The primers used to amplify fragments of SBWMV isolates -Pol1, −SH3-1, and -Heddesheim (two-sided arrows) are indicated. Two-sided arrows with deleted regions represent different CP-RT gene mutants. Accession numbers are provided for genome fragments

The objective of this investigation was to obtain complete or almost complete genome sequence. Nucleotide sequence analysis of SBWMV RNA1 and RNA2 could not be completed because of the failure to determine the full coding sequence of the replicase (REP) gene of analyzed isolates. The experiments showed the lack of RT-PCR specific products with three out of five tested primers pairs (probably caused by incompatibility between the primers and genome sequence or by partial degradation of 5’terminus RNA1 of SBWMV). The sequencing of the amplicons for SBWMV RNA1 enabled the determination of only partial coding sequence (i.e. two fragments, 30%) of the REP gene and the complete coding sequence of the movement protein (MP) gene. The same analysis for RNA2 revealed the complete coding sequence of the genes encoding the capsid read-through (CP-RT) protein (for SBWMV-SH1-3) and cysteine-rich protein (CRP) (Fig. 1). An analysis of nucleotide sequences revealed the high frequency of deletions in the RNA2 CP-RT gene region in all studied SBWMV isolates. The spontaneous deletions are likely associated with the duration of the growth period (i.e., over 4 months) for systemically infected plants at 17 °C (Shirako and Brakke 1984) or at 25 °C (Mansournia 2008), as well as with serial passage experiments (Yamamiya et al. 2005). There are common and have been detected in CP-RT genes from Oat golden stripe virus (OGSV; genus: Furovirus), Potato mop-top virus (genus: Pomovirus), and Beet necrotic yellow vein virus (genus: Benyvirus) (Diao et al. 1999). The exact position of each deletion was determined by sequencing multiple clones using the SBWMCP-F/SBWMCP-R and SBW2-F2/SBW2-R2 primer pairs (Fig. 1). Four different deletion mutants were detected for SBWMV-SH1-3, with the dominant form being SBWMV-SH1-3-mutant1. There were two different mutants for SBWMV-Heddesheim, and only a single RNA2 mutant for SBWMV-Pol1 (Fig. 1). Additionally, our results suggest the presence of one best-adapted in-frame deletion mutant leading to the loss of 350 amino acids from the CP-RT protein for all isolates. We observed that SBWMV-Pol1 with deletions in the RNA2 sequence did not induce more severe symptoms than wild-type viral strains under field conditions. This result is consistent with those of an earlier study (Yamamiya and Shirako 2000), in which the authors concluded that variations in symptom severities were not simply due to deletions in the CP-RT gene region, as it had been previously suggested (Shirako and Brakke 1984, Chen et al. 1994).

Existing nucleotide sequences of SBWMV cover mostly RNA2 genome fragment. There are only two complete RNA1 and RNA2 genome sequences in the GenBank database. The nucleotide and amino acid sequences of the analyzed isolates were compared with the corresponding sequences of other known SBWMV isolates (Tables 1 and 2). Previous studies (Diao et al. 1999, Shirako et al. 2000) demonstrated that the RNA polymerase domain situated in 3′ REP gene was highly conserved among RNA-1 encoded proteins. Our results, like those published by Koenig et al. (2002, 2003) revealed very close relationships (97.4–100% amino acid sequence identity) within the helicase domain (REP fragment 2) of SBWMV isolates. Analogically on RNA2 the CP amino acid sequences were almost identical among all isolates (sequence identity: 98.8–100%). The results are compatibled with those received by Shirako and Wilson (1993). The genome region containing the CRP gene, which is responsible for viral pathogenicity, included mutations that resulted in some amino acid sequence variability (sequence identity: 96–100%). The genomic variability leading to amino acid sequence differences was also detected for the MP, which enables the virus to move in infected plants (sequence identity: 97.5–99.6%) (Table 1).

Table 1 Percentage of nucleotide/amino acid sequences identity of different genome regions between - Pol1 and other known SBWMV isolates
Table 2 A summary of non-synonymous (NS) and synonymous (S) mutations in nucleotide sequence of different genome regions between Polish and other known SBWMV isolates

A comparison of nucleotide sequences revealed a sequence identity of 87.4–100% for the REP gene, 89.7–99.5% for the MP gene, 86.4–100% for the CP gene, and 93.5–100% for the CRP gene (Table 1). Most of the nucleotide sequence changes were synonymous point mutations, with 100 in the REP gene, 86 in the MP gene, 71 in the CP gene, and 28 in the CRP gene. There were considerably fewer non-synonymous mutations, with 7, 10, and 5 in the REP, MP, and CRP genes, respectively (Table 2).

Phylogenetic analyses were conducted for the analyzed and previously characterized SBWMV isolates, as well as for selected SBCMV, Chinese wheat mosaic virus (CWMV), and OGSV isolates from the GenBank database. The ClustalW program was used for multiple sequence alignments (Thompson et al. 1994). The phylogenetic relationships among sequences were analyzed using the maximum likelihood algorithm, and the optimal DNA/protein models were selected with the MEGA 6.1 program (Tamura et al. 2013). Bootstrap values were calculated using 1000 random replicates. Phylogenetic trees were prepared using TreeExplorer in the MEGA 6.1 program. The tree, which was based on the nucleotide sequence of CP gene, included the following two main clades: (1) European isolates: -Pol1, −Heddesheim, −De1, and -French; American isolates: -Okl-1, −NE, and -US-lab1; (2) European isolate: -SH1-3; American isolates: -NY, −Illinois, −OH, and -Urbana (Fig. 2). It is corresponding to molecular analysis where SBWMV-Pol1 sequence was highly similar to that of three American isolates: -NE, −Nebraska US-lab1, and -Okl-1 (i.e., 97.7, 97.7, and 98.8% sequence identities, respectively). Our results were consistent with previously reported data for these isolates, which were included in the Nebraska-like group (Koenig et al. 2002; Trzmiel et al. 2012). In contrast, we observed an 87% nucleotide sequence identity between the SBWMV-SH1-3 and -Pol1 isolates. The CP gene sequence of -SH1-3 was most similar to that of -OH, −NY, −Urbana, and -Illinois isolates from the New-York-like group (i.e., 98.8, 98.6, 98.3, and 98.4% sequence identities, respectively). These findings were in agreement with those of Ziegler et al. (2013). However, our sequence comparison results for the other SBWMV genes (i.e., REP, MP, and CRP genes) revealed the -Pol1, -Heddesheim, and -SH1-3 isolates were more closely related to the isolates of the New York-like group than to the Nebraska-like isolates. Moreover, the phylogenies of the CRP (Fig. 3) and MP (data not shown) genes demonstrated that -Pol1, −Heddesheim, and -SH1-3 were clustered with -NY and -OH isolates.

Fig. 2
figure 2

Maximum likelihood tree based on the CP gene nucleotide sequences of SBWMV, SBCMV, CWMV, and OGSV isolates. The numbers at each major node indicate bootstrap values out of 1000 replicates (provided only when >50%). The scale bar represents the number of substitutions per base, and corresponds to the nucleotide diversity among taxa. The analyzed isolates are indicated (i.e., Polish isolate: red dot; German isolates: black dots)

Fig. 3
figure 3

Maximum likelihood tree based on the CRP gene nucleotide sequences of SBWMV, SBCMV, CWMV, and OGSV isolates. The numbers at each major node indicate bootstrap values out of 1000 replicates (provided only when >50%). The scale bar represents the number of substitutions per base, and corresponds to the nucleotide diversity among taxa. The analyzed isolates are indicated (i.e., Polish isolate: red dot; German isolates: black dots)

Our data revealed the significant genetic similarity among the studied isolates. The differences between SBWMV-Pol1 and -Heddesheim included one synonymous and three non-synonymous point mutations in the MP gene while the variability between -Pol1 and -SH1-3 consisted of three synonymous point mutations in fragment 1 of the REP gene, one non-synonymous and three synonymous point mutations in the MP gene and 69 synonymous point mutations in the CP gene (Table 2). Despite the considerable similarities in obtained genomic sequences among studied SBWMV isolates there are important biological differences between German isolates and Polish one. SBWMV-Pol1 is characterized by inducing only mild symptoms, low virulence and limited concentration in infected plant tissues.

Although infectious SBWMV clones have been used to study the CP-RT (Yamamiya and Shirako 2000) and CRP (Jarugula et al. 2016) genes, the pathogenicity determinants have not been identified yet. However, even single amino acid substitutions can affect viral pathogenicity as it was previously presented. For example, a point mutation in TGB3, which is associated with viral movement, can convert the mild Pepino mosaic virus pathotype into a necrotic pathotype (Hasiów-Jaroszewska et al. 2011). Kwak et al. (2016) revealed that a symptom determinant was located in the RNA2 of Broad bean wilt virus 2, which includes the MP gene coding sequence. Wieczorek and Obrępalska-Stęplowska (2016) confirmed the importance of a single point mutation within the MP gene for Tomato torrado virus infections and symptom development. Additionally, point mutations in the MP gene can induce systemic SBWMV infections at higher temperatures (e.g., 25 °C) (Mansournia 2008) and alter the type of Brome mosaic virus infection from local to systemic (Trzmiel et al. 2016).

The comparative analysis of the MP protein revealed differences for the 28th amino acid, namely an alanine, occurring in all other SBWMV isolates was replaced by a valine in SBWMV-Pol1. This difference was due to a change in the 83rd nucleotide from a cytosine to a thymine. However, elucidating the function of this point mutation requires additional research. Based on our data, investigations of pathogenicity determinants within the MP gene should be continued.