Introduction

Bacterial wilt is considered one of the most devastating bacterial diseases in the world (Mansfield et al., 2012). The bacterial wilt organism until recently was classified in the Ralstonia solanacearum species complex (RSSC), which is subdivided into 4 phylotypes based on the sequence homology of the 16S-23S rRNA internal transcribed spacer (ITS) region (Fegan and Prior, 2005). The division in phylotypes is useful since the groups have a phylogenetic base and are correlated with geographic origin. Evidence points to phylotype I originating in Asia and Oceania, phylotype II in the Americas, phylotype III in Africa, and phylotype IV in Indonesia and Asia (Prior and Fegan, 2005; Castillo and Greenberg, 2007). Recently, using genomic and proteomic methods these phylotypes were separated into three distinct species with phylotype I and III being placed into Ralstonia pseudosolanacearum, phylotype II into Ralstonia solanacearum, and phylotype IV into Ralstonia syzygii (Safni et al., 2014; Prior et al., 2016). To study subspecies populations, strains have been distinguished on the basis of the endoglucanase (egl) gene sequences into sequevars (Fegan and Prior, 2005). Sequevar classification provides information about the genomic similarity of closely related populations of the RSSC, and it may provide clues on similar host ranges or common geographic origins (Remenant et al., 2010; Wicker et al., 2012). However, so far, no classification has been able to determine a clear relationship between genotype and host range phenotype.

Each year over 4 billion propagative plant cuttings of greenhouse ornamental plants are brought into the United States (Drotleff, 2018). It is impossible to screen all these propagative cuttings for plant diseases. The systemic nature of the bacterial wilt pathogen along with the ability to remain asymptomatic under non- conducive to disease environments, lends it to easy movement between cuttings. The authors and others have documented a number of RSSC strain introductions into the United States (Norman et al., 2009; Hong et al., 2012; Tran et al., 2016; Weibel et al., 2016).

In recent years, there have been three reports of new hosts of R. pseudosolanacearum phylotype I into the United States and another in Europe. The first report in the United States was noted on the ornamental mandevilla (Ruhl et al., 2011). This population was already present in greenhouses in 2007 and has now established itself in mandevilla production in South Florida (Bocsanczy et al., 2014). In 2014 phylotype I strains were found infecting imported cuttings of Osteospermum, Petunia, Calibrachoa, Verbena, and Lobelia spp. originating from Guatemala (Weibel et al., 2016). Multiple sequevars of RSSC were found on blueberry production in Florida in 2016 (Norman et al., 2017; Norman et al., 2018). The most prevalent and destructive was caused by phylotype I strains. The same year, phylotype I strains were described on tea rose in Europe (Tjou-Tam-Sin et al., 2017; Bergsma-Vlami et al., 2018). The presence of phylotype I on new hosts could be due to changes in their genotype, to adapt to new hosts, or climatic changes that expand the geographical area of the pathogen or allow the introduction of new crops in areas where RSCC is endemic. Sarkar et al. (2006) postulated that specific pathogenicity determinants distributed heterogeneously among strains are good candidates to explain host specificity (Sarkar et al., 2006). Among these are Type 3 Effectors (T3Es), proteins with diverse functions that are translocated into host cells by the Type III Secretion System (T3SS) (Sarkar et al., 2006; Guidot et al., 2007). This system is conserved among bacteria, and it can translocate up to 35 T3Es in the host cell (Chang et al., 2005; Schechter et al., 2006; Lonjon et al., 2016). T3Es functions are related to virulence and promoting disease in the host (Khan et al., 2018; Landry et al., 2020). In some cases, T3Es are recognized by the plant, resulting in an Effector Triggered Immunity response (ETI), in that instance the effector is called an avirulence factor (Jones and Dangl, 2006). Thus, depending on the specific pathogen-host interaction, a T3E can act as a virulence factor expanding the host range of the pathogen (Angot et al., 2006; Macho and Zipfel, 2015; Macho, 2016) or it can act as an avirulence factor restricting the host range (Lavie et al., 2002; Poueymiro et al., 2009; Bocsanczy et al., 2012; Nakano and Mukaihara, 2019a). RSSC has a large repertoire of T3Es. Peeters and al. 2013 identified 72 different effectors in the model strain GMI1000. They established the Ralstonia Injected Proteins (Rip) nomenclature, which we follow in this work, as well as their prediction software (https://iant.toulouse.inra.fr/bacteria/annotation/site/prj/T3Ev3/) (Peeters et al., 2013; Sabbagh et al., 2019).

The objective of this study was to identify candidate T3Es associated with host specificity of R. pseudosolanacearum closely related phylotype I strains that had been recently isolated into the US or Europe from blueberry, rose, osteospermum and mandevilla hosts. We compared genomic sequences, specifically their T3Es repertoire, in relation to their host specificity. With this information, we identified the T3Es repertoire of four closely related phylotype I strains recently introduced in the US and Europe and candidate T3Es associated with their host specificity. This work presents a strategy based on comparative T3Es repertoire in phylogenetically closely related strains with different host specificities to reduce the number of host specificity candidates and contributes to the identification of host specificity genetic determinants in R. pseudosolanacearum phylotype I. This knowledge ultimately will aid in assessing risk of epidemics and developing management strategies for this destructive pathogen.

Materials and Methods

Strains and genome sequences

Four phylogenetically closely related strains of R. pseudosolanacearum isolated from different hosts were selected for this work: P781, isolated from mandevilla plants, UW757 from osteospermum, P824 from blueberry and PD7123 (CFBP 8587) from tea rose. As indicated in Table 1 all four strain genomes have been sequenced and annotated previously.

Table 1. Ralstonia pseudosolanacearum (phylotype I) strains and genomes used in this study

Similarity comparisons

To calculate similarity between strains, whole genome sequences similarity was calculated by Average Nucleotide Identity (ANI) index using the ANI/AAI-Matrix estimates all-vs-all distances software (http://enve-omics.ce.gatech.edu/g-matrix/ ) in a collection of 34 R. pseudosolanacearum genomes (Rodriguez-R and Konstantinidis, 2016). The software calculates the distance by building a similarity clustering. The clustering method was UPGMA, tree type was phylogram unrooted. The collection of multi-fasta sequences of the draft genomes were used as input in a zip file for the calculation of similarity.

Comparative genomics and orthologues identification

The annotated genome sequences of strains P781, P824, UW757 and PD7123, were aligned with MAUVE (Darling et al., 2004) using the option “Align with progressive MAUVE”. Since RSSC genomes (including R. pseudosolanacearum) are bipartite (chromosome and megaplasmid) and all the strains genomes are draft genomes, separated in scaffolds or contigs, the sequences were concatenated with Geneious Prime (Biomatters LTD.) version 202.0.4 before alignment with MAUVE. After alignment, the orthologues among strains were identified and extracted with the option “Export Orthologs” with 70% minimum similarity and 70% minimum coverage. The orthologues file was processed, and genes were grouped by their presence or absence in each strain or group of strains.

Type 3 effectors (T3Es) prediction

The repertoire of T3Es was predicted using Ralsto T3E (Peeters et al., 2013; Sabbagh et al., 2019) web software (https://iant.toulouse.inra.fr/bacteria/annotation/site/prj/T3Ev3/). The concatenated fasta files of each strain were used as input files. The predicted T3Es protein and DNA sequences were manually curated. They were compared with the sets of orthologues identified by MAUVE, and searched in the NCBI database using BLASTn, BLAStp, and tBLASTn (Altschul et al., 1990) with default parameters in order to verify presence or absence of individual genes and assign the annotation. T3E sets were compared and candidate T3E associated with host range were selected based on the hypotheses formulated. A Venn diagram was constructed to help visualize distribution of identified effectors with Venny 2.1.0 software (https://bioinfogp.cnb.csic.es/tools/venny/ )(Oliveros, 2007-2015).

Validation of T3Es candidate sequences

PCR primers were designed (Online Resource 1) to amplify the complete gene region of the candidates with uncertain sequences or to confirm pseudogenes. PCR reactions were prepared with the TaqMan™ Universal Master Mix Cat. 4304437 following manufacturer instructions. The PCR protocol was 95 °C 5 min, 25 x [95 °C 15 s, X annealing °C for 30 s, Y extension min], 72 °C 10 min. Annealing temperatures (X) and extension times (Y) are indicated in Online resource 1. PCR products amplified with the same primers (Online Resource 1) were sent to Psomagen USA for Sanger sequencing. Sequences were processed and analyzed with Clone Manager 9 Professional Edition.

Protein alignments

Translated proteins either from T3E predictions, tBLASTn or annotated in NCBI were aligned using the Align Multiple Sequences menu of Clone Manager 9 Professional Edition, using the multiway option (exhaustive pairwise alignments of all sequences and progressive assembly of alignment analysis of the sequences using Neighbor-Joining). Scoring matrix BLOSUM 62. Similarity for the group was taken for the minimum % of Match nucleotides.

Pathogenicity tests

Eight host plants were selected for this study: Mandevilla boliviensis ′Brides Cascade′, Osteospermum sp. ‘Margarita Purple 102/100 TY’, southern highbush blueberry Vaccinium corymbosum ′Arcadia′, hybrid tea rose Rose spp. ′Moonlight Romantica′, grandiflora rose ′American Magic′, miniature rose ′Mandarin Sunblaze′, knock out rose ‘Double Red’, and tomato Solanum lycopersicum ′Walter′. Plant propagation plugs were acquired from local providers and planted into 4” pots containing a peat based potting media (Jolly Gardener Potting Mix #2). Plants were established in a greenhouse until they were well rooted and approximately 10 cm in height. Pathogenicity screening was done in environmental chambers set at 28 °C/20 °C on a 12 h/12 h day/night cycle with a set relative humidity of 65%.

For inoculum production bacterial strains were grown on Bacto Nutrient Agar amended with 5% sucrose at 28 °C for 48 h. Bacterial strains were harvested from culture plates, suspended in sterile saline solution (8.5 g/L NaCl) and spectrophotometrically (600 nm) adjusted to 1 × 108 cfu/ml. Each plant was wounded at the base with a sterile scalpel approximately 1 cm above the soil line. Individual strains were inoculated into the wound by pipetting 10 μl of the inoculum suspension (for a total # of 1 × 106 cfu). Negative control plants were inoculated in the same manner with sterile saline solution. Plants were kept in an environmental chamber for 45 days and monitored for symptom development. Ten plants were used per treatment and experiment was replicated three times. To verify the presence of systemic bacterial infections, a 1 cm stem sample located approx. 7 cm above the site of inoculation was cut from each stem either at the end of the experiment (asymptomatic plants), or when the plant was completely wilted (symptomatic plants). Samples were surface disinfested for 3 min in a 10% Clorox solution, rinsed in sterile distilled water, ground in 300 μl of sterile distilled water, dilution streaked onto triphenyl tetrazolium chloride medium (TTC) (Kelman, 1954) and plates were incubated at 28 °C for 48 h. Presence or absence of typical R. pseudosolanacearum growth was then recorded.

Results

Genomes of phylotype I strains are highly conserved

The whole genome ANI index was calculated for 34 P. pseudosolanacearum strains selected from the NCBI database of Ralstonia genomes spanning the range of know sequevars. Clearly phylotype III strains cluster separately from phylotype I. Although the UPGMA ANI similarity tree does not cluster the four strains together (Fig. 1A), the phylotype I group shown is conserved since the lowest similarity of the 28 phylotype I strains showed in the tree is 98.84% between RS_10_244 and PD3278 and the average similarity is 99.21% (Fig. 1B). The highest similarity for the four strains analyzed in this work is between strain PD7123 (tea rose) and Rs-09-161 (99.99%) isolated from eggplant in India (Ramesh et al., 2014), and the lowest between PD7123 and P781 (99.03%). Similarities between UW757 and P781 (99.23%) were close to the average for the phylotype I group (99.21%), while similarities between P824 and P781 (99.17%); P824 and UW757 (99.15%); PD7123 and P824 (99.09%); and PD7123 and UW757 (99.05%) were below average (Fig 1B). It is important to highlight that in this ANI comparison the differences are small and the resolution of the branch placement in the similarity tree is low for Phylotype I strains.

Fig. 1
figure 1

Genome similarity for 34 R. pseudosolanacearum strains. The whole draft or complete genomes were aligned using the ANI method. a) UPGMA unrooted tree of similarities b) selected pairwise comparisons

Genomic comparison

Genome comparison of strains P781, P824, UW757 and PD7123 confirmed their phylogenetic relatedness. The pangenome contained 6833 genes with 3980 in the core genome (58.25%) (Fig.4). This is a high number compared with the average 2400 to 2800 core genes per R. solanacearum species complex genomes (Remenant et al., 2010; Remenant et al., 2011; Bocsanczy et al., 2017). The number of unique genes present in individual strains ranged from 465 (6.8%) for UW757 to 726 (10.6%) for PD7123. The higher number of unique genes in PD7123 is consistent with its higher genomic distance from the other strains.

Host Specificity

The hosts selected for this study included the original hosts species from where the four R. pseudosolanacearum strains were isolated: Mandevilla boliviensis ′Brides Cascade′, Osteospermum sp. ‘Margarita Purple 102/100 TY’, southern highbush blueberry Vaccinium corymbosum ′Arcadia′, hybrid tea rose Rose spp. ′Moonlight Romantica′. Three different species of rose were added to the host specificity study to observe differences in specificity: grandiflora rose ′American Magic′, miniature rose ′Mandarin Sunblaze′, and knock out rose ‘Double Red’. Tomato Solanum lycopersicum ′Walter′ was added as a positive control since all RSSC strains cause disease to tomato. Pathogenicity tests results on the described hosts when inoculated with P781, P824, PD7123, and UW757 are shown in Table 2. All strains were pathogenic and highly virulent on tomato ‘Walter’ as expected confirming that the strains are pathogenic. Both P824 (blueberry strain) and PD7123 (tea rose strain) were pathogenic on blueberry ‘Arcadia’ and tea rose ‘Moonlight Romantica’, while P781 (mandevilla strain) and UW757 (osteospermum strain) were not. Although phylogenetically P824 and PD7123 genomes are more dissimilar than the average ANI % for phylotype I (Fig. 1B), they had the same specificity in most of the hosts tested. Strains were more aggressive on their original hosts with a higher disease incidence (Table 2). The southern high bush cultivar ′Arcadia′ was very susceptible to both strains. Slight differences in visual symptoms on blueberry were observed between the two strains. Both produced scorch symptoms, red foliage and resulted in wilting of actively growing tips (Fig. 2A). In addition, PD7123 (tea rose strain) produced bright red vascular venial discoloration in blueberry young leaves (Fig. 2B). Asymptomatic infections occurred on knock-out roses and no infection was found with the miniature roses by any of the strains. Consistently, P781 and UW757 did not cause disease in blueberry or any of the roses tested, however were able to survive in the stem of 20% of the blueberry ‘Arcadia’ tested and 7% and 13% respectively in tea rose ‘Moonlight Romantica’. Only P781 was able to survive asymptomatically in 17% of the Grandiflora Rose ‘American Magic’ plants tested, and only UW757 was able to survive in 7% of the Knockout Rose ‘Double Red’. Only the mandevilla strain (P781) was found to infect and produce symptoms on mandevilla, although, infection rate was only 20% (Table 2). In previous experiments P781 but not UW757 did cause symptoms in mandevilla ‘Bride’s cascade’ plants at a higher rate (unpublished results). The previous experiments were performed in greenhouses where the temperature was always above 28 °C compared with the experiments for this work in controlled chambers at 28 °C/20 °C on a 12 h/12 h day/night cycle. In similar works the difference in temperature was also an important factor on disease severity; however, it did not affect pathogenicity (Tjou-Tam-Sin et al., 2017). Visual symptoms on mandevilla ‘Bride’s Cascade’ and osteospermum “Margarita Purple’ infected with P781 also differed. Symptoms on mandevilla ‘Bride’s Cascade’ start with yellowing of the leaves (Fig. 3A) becoming scorched and necrotic as the infection advances. Usually, the infection starts on one of the runners and slowly extends to the others. On osteospermum the symptoms are more like to the ones on solanaceous plants. It starts with wilting of side leaves until all the plant is wilted (Fig. 3B).

Table 2 Pathogenicity tests on eight selected hosts. Ten plants were subsequently inoculated with 106 cfu of the pathogen on an opened wound above the soil line and incubated for 45 days before scoring disease symptoms. The first column indicates the source host where each strain was isolated.
Fig. 2
figure 2

Bacterial wilt symptoms in southern highbush blueberry ‘Arcadia’ when inoculated with R. pseudosolanacearum strains (a) P824, (b) PD7123

Fig. 3
figure 3

Bacterial wilt symptoms when inoculated with R. pseudosolanacearum P781 in a) mandevilla ‘Bride’s Cascade’ b) osteospermum ‘Margarita Purple’

P781 (mandevilla), UW757 (osteospermum), and PD7123 (tea rose) were pathogenic to osteospermum. In contrast, P824 (blueberry) was non-pathogenic (Table 2).

In summary, both P824 and PD7123 were specifically pathogenic to blueberry ‘Arcadia’ and hybrid tea rose ‘Moonlight Romantica’, P781 was specifically pathogenic to mandevilla ‘Bride’s Cascade’, and P824 was specifically non-pathogenic to osteospermum ‘Margarita Purple’.

T3Es prediction and curated repertoire

In Silico predicted T3E using the Ralsto T3E web software (Peeters et al., 2013) are shown in its original form in Online Resource 2A. After a manual curation preliminary candidate virulence and avirulence genes were selected and PCR and sequencing validation were performed on the preliminary candidates as explained below to yield the curated T3E repertoire (Online Resource 2B).

The Ralsto T3E software uses the multifasta genome files as input to predict the gene sequence and translated protein sequence, making it independent of the annotation system. Since draft genomes are fragmented in scaffolds and contigs, there are non-sequenced gaps, and gene annotations could differ among genomes, a manual curation was performed on the automated Ralsto T3E predictions as described in Materials and Methods. Comparisons, between NCBI BLAST searches, the orthologues files from MAUVE alignment and the predicted proteins by Ralsto T3E were used to confirm sequences or join sequences spanning contiguous contigs.

According to the Ralsto T3E web page, 118 effector families have been identified in the RSSC. In our comparison and after manual curation there were 37 families not represented in any of the four genomes compared (Online Resource 2B), including RipC2 which was a pseudogene in all 4 strains, therefore considered not represented. Sixty-two core effector families were found in all four strains. Only 19 families were distributed heterogeneously among the strains (Fig. 5). There was one effector present only in PD7123. No unique (strain specific) effectors were present in P824 or P781, while there were two unique effectors present in UW757. Only two effector families were represented by paralog sets of effectors, RipJ and RipAC (Online Resource 2B). The disrupted or frameshifted gene sequences were annotated as pseudogenes and are not counted in the Venn diagram. The genes not annotated in the public databases, or with modified annotation are indicated in Online Resource 2B. Sequenced CDS not annotated in the genomes were deposited in the NCBI Genbank database. Modified annotations and sequenced CDS for this work accession numbers can be found in Online Resource 3.

Selection of candidate T3E that confer host specificity and validation of candidate genes sequences by PCR and Sanger sequencing

As described in the previous section, the complete list of manually curated T3E of the four strains compared are listed in Online Resource 2B. The table identifies changes in annotations due to manual curation and indicates the preliminary candidates to virulence and avirulence factors. The differences between T3Es repertoires were compared with the host specificity results in order to formulate two hypotheses for the selection of candidate T3Es. Hypothesis 1 assumes that the effectors behave as virulence factors that inhibit or overcome defense responses and expand the host range. Hypothesis 2 assumes that the effectors behave as avirulence factors and are recognized by the host restricting the host range. For host specificity to mandevilla two putative avirulence T3Es (RipAZ1 and RS_T3E_Hyp8) were identified because they are absent only in P781 (Online resource 2B). In contrast, for host specificity to osteospermum only one putative virulence T3E was identified (RipS1) because is absent only in P824. Applying the hypotheses for host specificity to blueberry and tea rose was more complicated because strains P824 and PD7123 are both pathogenic to both hosts, then two cases were considered: 1) A case where P824 and PD7123 are pathogenic due to shared effectors absent in P781 and UW757. Two candidate virulence effectors (RipG1, RipF1) were identified (Online resource 2B), whereas three candidates missing in both PD7123 and P824 for avirulence were identified (RipT, RipBE, RS_T3E_hyp14); and 2) a case where the effectors for each strain are different. RS_T3E_hyp7 only present in PD7123 may explain its pathogenicity in blueberry and rose, whereas two putative avirulence T3Es (RipE2, RipP3) only present in UW757, may explain why UW757 is non-pathogenic in blueberry and rose (Online Resource 2B).

To validate those candidates, primers were designed (Online Resource 1) and PCR was performed on the four strains. The gels confirm the presence of a truncated sequence for RipG1 (Online Resource 4-1) in P781, suggesting an interrupted gene by deletion. RipF1 was eliminated as candidate since the PCR produced a sequence for P781 not predicted because of assembly error (Online Resource 4-2). Rip BE and Rs_T3E_hyp14 were originally identified as candidates for hypothesis 2 as avirulence factors recognized by blueberry and tea rose, however unexpectedly there was a PCR product in P824 (Online Resource 4-3, and 4-4 respectively) and were eliminated as candidates, while RipT was present only in P781 and UW757 as expected (Online Resource 4-5). For mandevilla specificity, RipAZ1 was present in all the strains (Online Resource 4-6). For osteospermum specificity, RipS1 was present in all strains except P824 as expected (Online Resource 4-7). The sequenced PCR fragments for RipG1 were identical to the expected sequences retrieved from NCBI, confirming that P781 is a pseudogene P781 and UW757. The unexpected sequences for RipBE and Rs_T3E_hyp14 in P824 were identical to the corresponding sequences in P781 and UW757. For mandevilla RipAZ1, sequence in P781 was confirmed as a pseudogene and a valid avirulence candidate for mandevilla specificity. RipS1 sequences were identical to the expected sequences retrieved from NCBI in P781, UW757, and PD7123. The sequence was absent from P824, confirming RipS1 as a candidate for osteospermum specificity. The rest of the selected candidate sequences were identical to NCBI confirming their eligibility or elimination (Online Resource 4-8 to 4-12). Presence or absence or bands and sequencing results are summarized in Table 3.

Table 3. Validation of selected T3Es by PCR and sequencing: Expected size PCR met: PG=pseudogene, C=complete, A=Absent

Significant differences in protein structure can alter the function of T3E, allowing them to be recognized by the host or not. Sequences of each effector family were compared for the 4 strains studied. Protein structure was considered altered when orthologous sequences were below 70% similarity. Based on this criteria RS_T3E_hyp6 was added as a candidate virulence factor for blueberry and rose since it was absent in P781, and the sequence similarity was 65% in UW757 compared with P824 and PD7123. Alternatively, if RS_T3E_hyp6 conserves its function in UW757 despite the difference, this effector could be considered as a candidate avirulence factor for mandevilla, since it would be absent only in P781. Table 4 presents a summary with the final candidates after sequence validation

Table 4. Summary table of candidate T3E selected for host specificity

Discussion

Previous works attempted to associate the pan-T3E repertoire of diverse populations of Ralstonia solanacearum with host range, gave mixed results (Hajri et al., 2009; Baltrus et al., 2011; Cellier et al., 2012; Ailloud et al., 2015). Phylotype I strains were not included in these studies because they were not evidently associated with other than Solanaceous hosts, nor were they part of evidently identified ecotypes. Trying to identify specific strain-host (at variety level) interactions is very complex if the analysis is done at species level due to the large T3E repertoire of the species complex, the diversity of the strains, and the specificity of T3E and resistance genes interactions with different varieties of the same host species. Thus, a more specific approach where several of these variables are controlled could be most useful to associate T3E with particular hosts (host specificity). This is the first work that addresses association of T3Es with host specificity in Phylotype I strains, now included in the Ralstonia pseudosolanacearum species. In this work, we compared four genomes of phylogenetically closely related strains with different host ranges. Our working hypothesis was that phylogenetically closely related strains would have a similar T3E repertoire, reducing the number of candidates that determine their host range. We verified that phylotype I strains are very conserved evidenced for ANI % ranging from 98.84% to 99.99 for the 28 phylotype I genomes analyzed (Fig. 2C). The more distant genome of the four genomes compared was PD7123 with an ANI% of 99.03% compared with P781 (Fig. 2C), and by the high percentage of core orthologues (3980) (Fig. 4) compared with an average of 2000-2500 in previous and more diverse pan-genomic comparisons (Remenant et al., 2010; Bocsanczy et al., 2017). Diverse evolutionary mechanisms can influence the susceptibility or resistance of specific hosts to specific strains (Rohmer et al., 2004). The most influential mechanisms include horizontal transfer of T3E to new strains (Hajri et al., 2009), loss of effectors that do not serve a function in a specific environment (Dillon et al., 2019) and/or recognition by specific hosts resistance genes (Deslandes et al., 2002; Oh and Martin, 2011). Thus, comparing closely related R. pseudosolanacearum strains facilitates exploring the interaction between specific hosts and strains. Of 81 effectors present in one or more of the four strains 62 (76.5%) were core effectors (Fig. 5), confirming the closeness of their T3E repertoire. Only 19 effectors were distributed heterogeneously. Ralstonia strains can encounter new hosts in different situations. When in the field new crops are planted in endemic soils infested with the pathogen or in a close recirculation system where the pathogen is present, diseases emerge as is in the case of blueberry. Alternatively, plants grown in greenhouses such as ornamentals (e.g., rose, mandevilla, osteospermum) may encounter new strains that come in latent infections in other hosts. In both cases hosts develop the disease due to the ample repertoire of T3Es and the recent exposure of the host, which likely will not have built-in defenses. In that case, if T3Es are not recognized by the host (Angot et al., 2006; Macho, 2016) will function as virulence factors. After sequence validation, we identified RipG1 as a likely virulence factor in blueberry and rose. In P781 and UW757, RipG1 are pseudogenes due to the presence of a frameshift in position 968 after the first nucleotide of the Open Reading Frame (ORF) that truncates the proteins. RipG1 is a Gala protein 1 homolog (Angot et al., 2006). Gala is a family of seven effectors that contain both F-Box and LRR domains. It was shown that the F-Box domain of GALA proteins interacts with Skip1 subunits (part of the E3 ubiquitin ligase complex) in host plants interfering with their host ubiquitin/proteasome pathway, an important pathway for many developmental processes and stress responses (Angot et al., 2006). Recently, was shown that GALA1 (RipG1) from R. pseudosolanacearum GMI1000 expressed in Nicotiana benthamiana localized to the chloroplasts where it reduced significantly the flg22-induced calcium burst, and when expressed in Arabidopsis it changed the expression of SA-responsive genes following flg-22 treatment and increased susceptibility to Pseudomonas syringae Pto DC3000 (Medina-Puche et al., 2020) acting as a virulence factor. RipG1 protein sequence in GMI1000 similarity to the sequences in P824 and PD7123 is 99%, (only three AA in 661) supporting the hypothesis of RipG1 as virulence factor. Moreover, although there is a high redundancy of GALA proteins in the RSSC, GALA diversification contributes to R. solanacearum adaptation to diverse plant hosts (Remigi et al., 2011), Gala 7 alone was required to cause disease in Medicago truncatula plants acting as a host specificity factor (Angot et al., 2006) and it was shown that RipG7 undergoes diversification of certain AA from different strains presumably to adapt to different hosts (Wang et al., 2016). Our results show that strains P781 and UW757 are capable of invading the vascular system of blueberry and rose plants stems, but infections remain symptomless (Table 2). Alternatively, T3Es act as avirulence proteins inducing resistance in plants. Under this hypothesis we identified RipT as a candidate avirulence factor in P781 and UW757 that would restrict their host range in blueberry ‘Arcadia’ and tea rose ‘Moonlight Romantica’. RipT belongs to the YopT (Yersinia Outer protein) superfamily of cysteine proteases (Shao et al., 2002). These proteins have a triad of conserved amino acids but, the roles and targets of the different homologs of YopT are different (Figaj et al., 2019). Proteases are often recognized by resistance proteins in the host (Ade et al., 2007; Bocsanczy et al., 2012) and identified as avirulence genes. Host specificity for mandevilla seems to be determined by avirulence factors, since no candidate virulence factors were identified. RipAZ1, a likely candidate avirulence factor, is present in all the strains studied, although the gene sequence in P781 has undergone changes that alters the aminoacids structure. The loss of this protein possibly avoids recognition by the host defense system and makes P781 virulent to mandevilla. RipAZ1 was identified as an avirulence determinant in Solanum Americanum (Moon et al., 2021) strengthening our hypothesis. For osteospermum RipS1 was identified as a putative virulence factor that is not present in P824. RipS1 belongs to the RipS multigene family. It contains SKWP repeats domains in different numbers. Each SKWP repeat domain contains 42-45 aminoacids starting with Ser, Lys, Trp and Pro, hence the name (Mukaihara and Tamura, 2009). Each RipS contains 12-18 repeats in the N-terminus region. In RSSC eight RipS (1-8) proteins have been described (Peeters et al., 2013). RipS family has been associated with bacterial fitness in eggplant (Chen et al., 2018). RipS1 was shown among other effectors in strain RS1000 (phylotype I) to weakly (70%) suppress fl22-induced ROS burst when transiently expressed in N. benthamiana leaves by agroinfiltration (Nakano and Mukaihara, 2019b), and RipS1 and RipS4 were shown to contribute to virulence in eggplant (Chen et al., 2021), supporting the virulence function hypothesis.

Fig. 4
figure 4

Pangenome of strains P781, P824, UW757 and PD7123. Diagram shows the number of orthologous genes present in the pangenome of the compared strains. The outer circle indicates genes that are unique to the strain in the corresponding section. The middle circle indicates the number of genes that are present in at least two of the compared genomes and at most three genomes. The inner circle indicates the number of orthologues present in all the genomes (core genome)

Fig. 5
figure 5

Distribution of T3Es families present in the genomes of compared strains. Venn diagram shows the number and percentages (in parenthesis) of T3Es orthologs families present in one or more genomes. Diagram was produced with Venny 2.1.0 software (https://bioinfogp.cnb.csic.es/tools/venny/ )

Significant differences in protein structure could also affect the function of a T3E. We found RS_T3E_hyp6 has only 65% similarity with the orthologues in P824, and PD7123 while it is absent in P781. In this case assuming that the function is lost in UW757, the effector could be considered a virulence factor for blueberry and rose. Alternatively, if the function in UW7575 is conserved, it could be considered an avirulence factor for mandevilla. Some of the core effectors could also be candidates if their protein conformation is different, however this is an unlikely possibility since the core effectors had a sequence similarity over 98% with few exceptions, with similarities over 80% (Online Resource 2B), confirming the genomic closeness of the studied Rps strains. We tested pathogenicity on four host species. Southern highbush blueberry (Vaccinium corymbosum) cv. “Arcadia” belong to the order Ericales, tea hybrid rose to the order Rosales, Mandevilla to the order Gentianales and Osteospermum to the order Asterales. Interestingly, P824 and PD7123, two strains found in two different continents and in two plants that belong to different orders were capable of infecting both blueberry and tea rose hosts. Families of orthologous effectors can undergo changes in specific strains by degradation (pseudogene), when meet hosts that recognize the effector to adapt to their hosts/environment. When in contact with a new host, the same strains might lose virulence or become non-pathogenic due to the loss of those effectors. The progressive resistance of different rose species to P824 and PD7123 evidenced another mechanism of resistance. These strains were not able to cause disease in ‘Double Knock out’ and miniature roses and showed decreased susceptibility in Grandiflora roses. Susceptibility of rose cultivars appeared to be negatively correlated with flower size. This is similar with results previously reported for bacterial wilt resistance and size of tomato (Acosta et al., 1964; Walter, 1967; Opeña, 1990; Wang et al., 1998). It is probable that flower size could be genetically linked to resistance clusters in plants genomes. Although genomic relationships between hybrid roses is complicated due to extensive breeding of many species and different ploidy, miniature roses are completely resistant to all R. pseudosolanacearum strains tested and probably the defense responses involve Pattern Triggered Immunity (PTI) also related to non-host immunity (Jones and Dangl, 2006; Zipfel, 2014). Similarly, hybrids of mandevilla s.p. can have different susceptibilities to the same strains. The authors plan to test these hypotheses by functionally characterizing and determining the mechanisms of action of the candidate effectors identified in this work. This work presents a set of T3E curated sequences in four Rps strains, a reduced number of candidates to virulence and avirulence factors and contributes to the understanding of host specificity in Ralstonia pseudosolanacearum.