Introduction

Streptococcus suis causes invasive infections in swine1. Globally, serotypes 2, 1/2, 3, 4, 5 7, 8, 9, and 14 are the most frequently recovered from diseased pigs1,2,3. S. suis is also a zoonotic agent. Of late, especially in Southeast Asian countries, there has been a substantial increase in the number of human cases of S. suis occurring in patients who have reported close contact with one or more of infected pigs, contaminated pork-derived products, or consumption of raw pork products4,5. Among the 29 described serotypes of S. suis, serotype 2 is most associated with human infections1,6, although human disease due to serotypes 4, 5, 7, 9, 14, 16, 21, 24, and 31 has also been reported1,7,8,9,10,11. Notably, while S. suis serotype 1 has been commonly isolated from diseased pigs in Canada, Belgium, and the United States1,2, there is only one report (containing two cases) of serotype 1 human disease12 and notably, in the human report, assignment of strains to serotype 1 was based on biochemical tests only and not confirmed using antisera or polymerase chain reaction (PCR).

Herein, we sequenced the genome of a S. suis strain isolated from a true serotype 1 human disease case. By comparing the genomic traits of this strain to a porcine serotype 1 genome, we studied novel insights into the genomic characteristics, putative virulence genes, the prediction of the pathogenic capacity, and the antimicrobial gene repertoire of this important S. suis serotype.

Results and discussion

Identification of S. suis serotype 1

Using a novel multiplex PCR test13, we first retyped the 46 strains in our collection previously identified using multiplex PCR14 as serotype 1 or 14. Among them, 2 strains were confirmed as serotype 1 by the new multiplex PCR13, PCR–RFLP of the cpsK locus15 and SNP at position 483 of cpsK16 (Fig. 1). The first strain was ID38828, isolated in 2011 from the blood of a human patient with septicemia in a hospital in Eastern Thailand. The second serotype strain (ID35541) was isolated from the tonsils of a clinically healthy pig in 2010 in Northern Thailand.

Figure 1
figure 1

Alignment of cps1K (A) and cps14K (B) at the position of 483 to identify serotype 1 or 14 among the strains ID38828, ID35541, E11Q, E34W, EN191, and EN314. The alignment result indicated ID38828 and ID35541 were the serotype 1 and E11Q, E34W, EN191, and EN314 were the serotype 14.

General genomic information

The completed genomes of the two S. suis serotype 1 were 2,074,728 bp and 2,097,918 bp for strains ID38828 and ID35541, respectively. Strain no. ID38828 contained 1,963 coding sequences (CDS) and 12 rRNA and 56 tRNA genes. ID35541 had 1,987 CDS and 12 rRNA and 56 tRNA genes. No plasmids were detected in either strain using PlasmidFinder and PLACNETw.

Antimicrobial resistance

Globally available antimicrobial resistance data for S. suis showed that S. suis strains recovered from both humans and pigs have high resistance to tetracycline and moderate-to-high resistance to macrolides, such as erythromycin17,18,19,20,21,22,23,24. We identified resistance to tetracycline, erythromycin, azithromycin, and clindamycin in the two serotype 1 strains, which, on the other hand, were susceptible to penicillin, ceftriaxone, levofloxacin, and chloramphenicol.

ResFinder 4.1 identified the genes tetO and ermB which confer resistance to tetracycline and macrolide-lincosamide-streptrogramin (MLSB), respectively, in the two serotype 1 strains. Two aminoglycoside-resistance genes (ant(6)-Ia and aph(3')-III) were additionally detected in the porcine strain ID35541. The prevalence and number of antimicrobial resistance genes have been shown to be variable among the different serotypes25. In addition, several studies have shown that the genes tet(O) and erm(B) are widely observed among pig and human S. suis isolates of various serotypes worldwide18,25,26,27,28,29. The genes ant(6)-Ia and aph(3')-III have been reported in several S. suis strains isolated from pigs in Canada, China, Korea, and Thailand17,25,28,30. Three existing aminoglycoside-modifying enzyme types have been described in S. suis, consisting of aminoglycoside N-acetyltransferases encoded by aac genes, aminoglycoside O-phosphotransferases encoded by aph genes, and aminoglycoside O-nucleotidyl transferases encoded by ant genes18.

As shown in Fig. 2, the organization of the tetO and ermB genes was different in the strains ID35541 and ID38828. ID35541 had the genes ant(6)-Ia and aph(3')-III upstream of ermB and tetO; the genetic organization of these antimicrobial resistance genes was flanked by transposase genes.

Figure 2
figure 2

Genetic organization of antimicrobial resistant genes in two Streptococcus suis serotype 1 strains.

Virulence-associated genes

Two other studies have described the presence of genes encoding a copper-exporting ATPase 1, a type I restriction-modification system S protein, two hypothetical proteins (SSU_RS03100 and SSU_RS09155), and an RNA-binding protein that could be useful to define a S. suis pathogenic pathotype, while a gene encoding a putative sugar ATP-binding cassette transporter could be a marker of strains possessing a non-pathogenic pathotype31,32. Our two serotype 1 strains had all pathogenic pathotype marker genes and lacked the non-pathogenic pathotype marker gene (the putative sugar ATP-binding cassette transporter). This may suggest that both the serotype 1 strains under investigation belonged to the pathogenic pathotype (diseased-associated). Note that three pathotype markers (copper-exporting ATPase 1, type I restriction-modification system S protein and putative sugar ATP-binding cassette transporter) were evaluated on small S. suis strains collected from England and Wales region31, whereas SSU_RS03100 and, SSU_RS09155 and the RNA-binding protein gene were evaluated with only North American isolates32. Large S. suis strains with difference of regions or countries, isolation sources, serotypes and STs should be further evaluated for the useful of this pathotyping system.

The two serotype 1 genomes under investigation were screened for the presence of 99 VAGs33,34. Among these 99 VAGs, 20 were considering to be putative zoonotic virulence factors (PZVF; Table 1)34. Eight VAGs from 99 VAGs (Hhly3, NisK, NisR, salK, salR, srtG, virB4, and virD4) were absent from both the ID38828 and ID35541 genomes (Table 1). Among 20 PZVF, only Hhly3, NisK, and NisR were absent in both strain (Table 1). In addition, Streptococcal adhesin P (sadP), one of PZVF, was absent from the genome of the porcine strain ID35541. Whereas sadP1 was observed in the human strain ID38828. This sadP1 was widely distributed and correlating with the CC135, according to our serotype 1 strain which belonged in CC1. Both the serotype 1 strains in this study had a classical VAG profile (epf/sly/mrp), suggesting the potential for increased virulence. However, the swine strain ID35541 present a large variant of epf and a small variant of mrp, its classical VAG profile was epf */sly+/mrpS. Whereas, the human strain ID38828 showed no variants of epf and mrp, the profile was epf+/sly+/mrp+.

Table 1 Distribution of 99 virulence-associated genes in Streptococcus suis serotype 1.

Genomic comparison

MLST analysis assigned strain ID38828 to ST105 and strain ID35541 to ST237. Both ST105 and ST237 are included in CC1. We analyzed 428 CC1 strains download from GenBank together with our two strains. As shown in Fig. 3, the porcine serotype 1 strain ID35541 (ST237) was very closely related to the ST11 strains MNCM07 (Thailand), 812 and 832 (both from China), and ST1 strain C160 from The Netherlands. The human serotype 1 strain ID38828 (ST105) clustered with the Vietnamese strains EN314, E34W, EN191, and E11Q. These ST105 Vietnamese strains are serotype 2 (retyped as serotype 14 in this study) isolates recovered from human infections https://www.ncbi.nlm.nih.gov/biosample/SAMEA1566042; https://www.ncbi.nlm.nih.gov/biosample/SAMEA3233828; https://www.ncbi.nlm.nih.gov/biosample/SAMEA1566194; https://www.ncbi.nlm.nih.gov/biosample/SAMEA3233818. S. suis ST105 strains have previously been reported almost exclusively in Southeast Asian countries, especially Thailand and Vietnam5,36,37, https://pubmlst.org/bigsdb?order=id&designation_field1=s_1_ST&designation_value1=105&set_id=0&submit=1&designation_operator1==&page=query&db=pubmlst_ssuis_isolates, accessed on Nov 18, 2022). It appears that ST105 is an endemic strain in this region. However, ST105 strains of serotype 1 have also been documented in pigs in the UK (https://pubmlst.org/bigsdb?order=id&designation_field1=s_1_ST&designation_value1=105&set_id=0&submit=1&designation_operator1==&page=query&db=pubmlst_ssuis_isolates; accessed on Nov 18, 2022).

Figure 3
figure 3

A phylogenetic tree based on the core genome SNPs of 430 genomic sequences of S. suis clonal complex 1 was constructed using the maximum likelihood method by FastTree v2.1.10 tool and visualized using interactive Tree of Life tool. The whole genome sequence of S suis serotype 1 strains in the current study are highlighted in yellow. * These original serotype 2 strains assigned in BioSample of GenBank were retyped as serotype 14 in this study.

Analysis of the MCG group showed that the ST237 porcine isolate ID35541 was MCG group 1. This group includes the virulent CC1 strains (especially ST1 and ST7) associated with human infections, death, and outbreaks19. MCG group 1 also contains a higher number of virulent genes19. However, the human ST105 isolate ID38828 was ungroupable by MCG analysis, as were the Vietnamese ST105 strains (EN314, E34W, EN191, and E11Q) most closely related to ID38828. The MCG ungroupable cluster also contained diverse S. suis serotypes and STs, which collectively had a higher number of virulence genes than strains belonging to MCG groups 2–719. It could be that this group was also virulent because some human-infected strains fell in this group.

Pangenome analysis of the two S. suis serotype 1 strains and their closest relatives is presented in Fig. 4. Both our strains showed difference in 54 and 55 unique genes for strains ID38828 (human) and ID35541 (porcine), respectively. Interestingly, the human-derived strains ID38828, E11Q, EN191, E34W, and EN314 were highly homologous, with only 2 and 1 unique genes present in strains ID38828 and E11Q, respectively, whereas no unique gene found in the strains EN191, E34W, and EN314 (Table S1 and Fig. 4). Two unique genes of strain ID38828 encoded hypothetical proteins for both, while one unique gene of E11Q was similar to membrane protein. Comparison of our porcine strains ID35541 with strains MNCM07, C160, 812, and 832 were relatively more distantly related, with 25, 8, 36, 18, and 10 unique genes (Fig. 4 and Table S2) for ID35541, MNCM07, C160, 812, and 832, respectively.

Figure 4
figure 4

Pangenome analysis representation based on anvi'o software. (A) Pangenome reconstructed with two complete genomes of S. suis strains ID35541 and ID38828. Each ring in the graph represents an individual S. suis genome and each ray corresponds with a given gene homolog. The two inner layers are colored in black to mark gene clusters found in that genome or left with a translucent background if the gene cluster is absent from that genome. The third layer shows density plot of number of paralog genes. The outermost ring illustrates the single-copy genes (SCG; brown color) and accessary genes of strain ID35541 and ID38828 in torque and magenta highlight, respectively. Based on anvi’o pangenome analysis, there are 1871 core genes common between two strains ID38828 and ID35541. The accessory genes of strains ID38828 and ID35541 show 54 and 55 genes, respectively. (B) Pan-genome reconstructed with 5 genomes of S. suis strains ID38828, EN314, EN191, E34W, and E11Q. (C) Pan-genome reconstructed with 5 genomes of S. suis strains ID35541, MNCM07, C160, 832, and 812. Each ring in the graph (B and C) represents an individual S. suis genome and each ray corresponds to a given gene homolog. The five inner layers are in black to designate gene clusters in that genome or translucent if the gene cluster is absent. The third layer shows a density plot of number of paralog genes. The outermost ring illustrates the single-copy genes (SCG; brown). UpsetR showing the number of genes that are shared and unique between the genomes.

It is interested that the genomic comparison among the human ST105 strains revealed no capsule genes different, actually our strain ID38828 and the Vietnamese strains (E11Q, EN314, EN191, and E34W) were different serotype as mentioned above. This may suggest that they are same serotype. To clarify this hypothesis, we determined capsule specific genes (cpsI and cpsJ) of the strains E11Q, EN191, E34W, and EN314. The specific capsule genes of these four strains showed positive only cps1I, cps1J, cps14I, and cps14J, whereas cps2I, cps2J, cps1/2I, and cps1/2J were not found for in all Vietnamese strains. It indicated that these four Vietnamese strains were either serotype 1 or 14, but not serotype 2 as mention in the GenBank data. Analysis of SNP at position 483 in cps1K and cps14K to identify serotype 1 or 14 revealed that all four strains were serotype 14 (Fig. 1).

Although the human strains were of different serotypes (serotype 1 for ID38828 and serotype 14 for EN314, E34W, EN191, and E11Q), genetically, they both were ST105. This suggested that they may represent an event of capsule switching because capsular locus of serotypes 1 and 14 are highly similar along capsule locus38,39. Capsular switching has been documented among serotype 2 strains40,41. Therefore it highly seems to occur between serotypes 1 and 14. In addition, the serotype 14-ST105 were also found and circulated in humans in Thailand36,42. It may be possible that Thai ST105 isolate of serotype 1 (also serotype 14) and the Vietnamese serotype 14 strain share a common ancestor circulating in the region as demonstrated in the Fig. 3.

Conclusion

Comparison of the genome sequences of two S. suis serotype 1 strains (one human and one porcine) and the closest relatives strains revealed very few regions of difference and relatively important similarity. Based on the schematic systems of pathotyping and the VAG profile used for classifying the S. suis serotype 2 strains, the two serotype 1 strains investigated here are bona fide members of virulent groups. These strains belong to MCG-1 and MCG ungroupable for ST237 (ID35541) and ST105 (ID38828), respectively. They carry both the tet(O) and ermB genes conferring resistance to tetracycline, macrolide, and lincosamide; in addition, the porcine isolate ID35541 had aminoglycoside resistance genes (ant(6)-Ia and aph(3')-III). Resistance to tetracycline, erythromycin, azithromycin, and clindamycin were observed in the two serotype 1 strains, whereas, both were susceptible to penicillin, ceftriaxone, levofloxacin, and chloramphenicol.

Materials and methods

Bacterial strain re-typing and antimicrobial susceptibility

We used a recently described improved multiplex PCR, capable of discriminating between serotypes 1 and 1413, to re-type a total of 46 S. suis strains present in our collection; these had been assigned to serotype 14 or 1. In total, two serotype 1 strains were identified from these 46 strains. One strain (strain no. ID38828) was from a human patient, while the second was of porcine origin (strain no. ID35541). Confirmation of serotype 1 assignment was performed based on PCR–RFLP as described elsewhere15.

The broth microdilution technique was used according to the standards defined in the M100 (32nd edition) guidelines of the Clinical and Laboratory Standard Institute (CLSI) to determine the minimum inhibitory concentrations (MICs) of penicillin and ceftriaxone43. Susceptibilities to other antimicrobials (azithromycin, erythromycin, tetracycline, clindamycin, levofloxacin, and chloramphenicol) were determined using the disk diffusion technique following the 2022 CLSI-M100 guidelines43. Since there are currently no breakpoints recommended for S. suis, those for the viridans group streptococci were used, as defined in the guidelines43. The Streptococcus pneumoniae ATCC49619 strain was used for quality control purposes.

Whole-genome sequencing

Bacterial genomic DNA samples extracted using ZymoBIOMICS DNA Kits (Zymo Research, CA, USA) were sequenced using the Oxford Nanopore Technologies (ONT) and Illumina platforms as described previously44. Hybrid assemblies with the ONT and Illumina data were performed using Unicycler v0.4.845 and the genome sequences were checked for quality using QUAST v5.0.246. Genome sequences were submitted to the NCBI Prokaryotic Genome Annotation Pipeline (PGAP v4.12) for annotation. The default parameters were used for all software unless otherwise specified.

Bioinformatics analysis

The serotype of the strains was further confirmed by the occurrence of a single-nucleotide substitution at the position 483 of the cpsK locus16 differentiating reference S. suis serotypes 1 and 14 (GenBank accession no. JF273644 and AB737822, respectively) using the Center for Genomic Epidemiology’s MyDbFinder 2.0. Sequence types (ST) were identified by comparing the sequences against the PubMLST database (https://pubmlst.org/organisms/streptococcus-suis). Minimum core genome (MCG) sequence typing was performed according to the procedures described elsewhere19. We screened the genomes of the serotype 1 strains using MyDbFinder 2.0 for the presence of up to 99 genes previously described to be virulence-associated genes (VAG) (Table 1)33,34.

Antimicrobial resistance genes were detected using ResFinder 4.147. Plasmid replicons were analyzed using PlasmidFinder 2.1 and PLACNETw48,49. Pangenome analyses were performed with the anvi’o v7 workflow50. This workflow identified gene clusters and single-copy genes in study genomes, including ID38828, ID35541, EN314, EN191, E34W, E11Q, MNCM07, C160, 832, and 812. All genomes, in fasta format, were submitted to pangenome analysis using the 'anvi-run-workflow' script. Genes were annotated using anvi-run-ncbi-cogs. All genomes were added to a new anvi'o genomes storage using the 'anvi-gen-genomes-storage' application. Then, the program ‘anvi-pan-genome’ ran pan-genomic analysis on all the stored genomes using NCBI's blastp tool. We used 'anvi-import-misc-data' to import additional metadata and 'anvi-compute-genome-similarity' to compute the average nucleotide identity (ANI) using the pyANI tool (https://github.com/widdowquinn/pyani). The pangenome was visualized in anvi'o using the 'anvi-display-pan' application. The whole pangenome was divided into core and accessory bins based on gene cluster frequency.

Phylogenetic analysis

Totally, 430 CC1 genomes were included in phylogenetic analysis. S. pneumoniae genome ATCC700669 (accession no. NC_011900)51 was used as out-group to root the tree. Single-nucleotide polymorphisms (SNPs) were detected using MUMmer v3.23, and the genome sequence of S. suis strain P1/7 (accession no. NC_01292) was used as a reference52. A phylogenetic tree based on the core genome SNPs was constructed using the maximum likelihood method by FastTree v2.1.10 described in a previous study19. Bootstraps were performed with 1,000 replicates. The phylogenetic tree was visualized using the iTOL V4 software53.

Accession number

The genome sequences of the two S. suis serotype 1 strains were deposited in the NCBI GenBank under Bioproject accession number PRJNA691075 for strain numbers ID38828 and ID35541, respectively.

Ethics statement

Ethical review and approval were not required because no human specimens or data were used in the current study.