Introduction

Group B streptococcus (GBS) is an important pathogen with the potential to cause disease in adults with underlying chronic medical conditions, pregnant women, but, above all, in newborns. Early onset neonatal disease (EOD) is acquired by vertical GBS transmission during birth and is characterized by pneumonia and septicemia in the first week of life. Late onset disease (LOD) occurs in infants up to seven months of age and usually involves meningitis. GBS colonization rates up to 35% in healthy pregnant women have been reported, but only 0.5–1% of neonates born from these mothers will finally develop GBS disease [1, 2]. The identification of highly virulent GBS could contribute to our understanding of the pathogenesis of GBS disease and target prevention and intervention strategies.

GBS virulence factors, including the polysaccharide capsule and several gene regulation mechanisms, have been identified, and may all contribute to the complex pathogenesis of GBS disease [3, 4]. Based on the nature of the immunogenic polysaccharide capsule, GBS can be grouped into nine serotypes: Ia, Ib, and II–VIII. Serotype III is recovered significantly more often from cases of meningitis and LOD. At the same time, it is the most prevalent serotype in asymptomatic carriers [57]. Previous epidemiological studies have identified virulent clonal types of serotype III by multilocus enzyme electrophoresis (MLEE), restriction digestion pattern (RDP) typing, and multilocus sequence typing (MLST) [813]. In vitro studies have demonstrated that these clones have a different level of enzyme activity known to play a role in virulence [14, 15]. The clones also contain genetic material next to known or putative virulence genes that is not present in less virulent serotype III GBS [16]. So, virulence might well be associated with specific GBS subtypes, but based on current microbiological knowledge, unequivocal markers of invasiveness have not been identified [17, 18]. Most known and putative GBS virulence genes are clustered into 14 genetic islands and are considered to be potential GBS-specific potential pathogenicity islands (PPIs) [19, 20]. We studied the epidemiological prevalence of PPI-specific molecular markers and serotypes in invasive neonatal versus colonizing strains. We analyzed well-characterized GBS strains from The Netherlands (Europe) and from a geographically and racially distinct population (Taiwan, Asia).

Materials and methods

Bacterial strains

A total of 136 isolates of GBS from The Netherlands and 202 isolates from Taiwan was included. From The Netherlands, 92 strains were isolated from ante-partum recto-vaginal cultures performed between 1995 and 2004 by the Laboratory of Microbiology at the Medical Center Haaglanden, The Hague, The Netherlands. Forty-four invasive strains were supplied by the Netherlands Reference Laboratory for Bacterial Meningitis at the Academic Medical Center, Amsterdam, The Netherlands. Strains from the same geographical region and episode (1999–2004) as those of the colonizing strains were selected: eight were isolated from blood, 14 from cerebrospinal fluid (CSF), and 22 from both blood and CSF. In Taiwan, 58 strains were isolated from ante-partum recto-vaginal cultures and 43 invasive neonatal strains were collected between 2000 and 2005: 24 from blood and 19 from blood and CSF. In Taiwan, 101 invasive strains were collected from infections in adults: 50 were isolated from blood only and 51 from blood and another focus (bone, joint, CSF, urine, etc.).

Species confirmation and serotyping

Species confirmation of GBS was performed by Latex agglutination according to the manufacturer’s instructions (Streptex®, Remel Inc., USA). Briefly, five colonies from a 24-h culture at 37°C on 5% sheep blood agar (SBA) were suspended in 400 μl of extraction reagent. After 10 min of incubation at 37°C, 40 μl of the extract was added to a drop of group-specific latex suspension (A, B, C, D, F, and G). Agglutination was read after 1 min. GBS serotyping was performed using the Strep-B-Latex kit (Statens Serum Institut, Denmark). Strains were cultured for 24 h in Todd Hewitt broth. Ten microliters from this culture was mixed with serotypes Ia, Ib, and II–VIII specific latex bead suspension. Agglutination was read after 5 to 10 s.

Polymerase chain reaction

DNA was prepared with the MagNA Pure system (Roche, Lelystad, The Netherlands) using the MagNA Pure LC DNA isolation kit III. For polymerase chain reaction (PCR), primer pairs directed to regions corresponding to 26 genes from 14 putative PPIs were used (Table 1) [20]. Reaction vials were filled with a final volume of 50 μl containing 5 μl 2 mM deoxynucleotide triphosphate bases, 1.0 μl primer pairs, 0.08 μl Taq polymerase (Sigma, Germany), 50 ng/5 μl DNA, and 38.4 μl of distilled water. PCR assays were performed in an ABI thermal cycler (Applied Biosystems, Gouda, The Netherlands) under the following conditions: 25 identical cycles of denaturation at 94°C for 1 min, hybridization at 52°C for 1 min, and extension at 72°C for 1 min. Fifteen microliters of the PCR products was subjected to electrophoresis for 50 min at 100 mA on a 1% agarose gel (Hispanagar, Sphaero Q, Leiden, The Netherlands) in 1× Tris-borate-EDTA (TBE) buffer (89 mM Tris, 89 mM boric acid, 2 mM EDTA). Gels were stained with ethidium bromide and photographed during UV illumination. PCR amplicon sizes were estimated by comparison to a 100 bp DNA ladder (Sigma, Germany).

Table 1 Gene assignment of putative pathogenicity islands (PPIs) and their characteristics [20]

For the selection of PPI-specific invasion markers, all 26 PCRs were initially performed on five colonizing and five invasive strains, randomly selected from the Dutch collection. If a PCR assay generated a similar result in the two groups of five strains (uniformly positive or negative), we concluded that the contribution of this PCR to discrimination between invasive and non-invasive strains would probably be small; these PCRs were excluded from further evaluation. After this preliminary screening, 14 PCRs were selected and performed for the entire collection, establishing the presence or absence of genetic markers corresponding to the genes of nine PPIs and one transposon for each strain (Table 3).

Data analysis

Isolates were divided into five groups: Dutch neonatal invasive GBS (DNI), Dutch colonizing GBS (DC), Taiwanese neonatal invasive GBS (TNI), Taiwanese adult invasive (TAI), and Taiwanese colonizing GBS (TC). The selective PCR results were analyzed using the SPSS 15.0 software package (SPSS Inc, Chicago, IL, USA). Two by two tables were created for virulent and colonizing strains with each serotype and genetic marker. We examined the differences in the prevalence of serotypes and markers in invasive and non-invasive strains by comparing Dutch neonatal invasive with Dutch colonizing GBS, Taiwanese neonatal invasive with Taiwanese colonizing GBS, and Taiwanese adult invasive with Taiwanese colonizing GBS. Chi-quare tests and Fisher’s exact tests were used to assess differences between groups. To limit the number of variables, PCR results for markers with just a few positive results in all groups (<5) were excluded from further statistical analysis. Logistic regression was performed to assess any dependency of one variable to another and to establish the distribution of serotypes and genetic markers combined in colonizing and virulent GBS. Serotype was included as a categorical variable. The Hosmer and Lemeshow test was used to establish the statistical significance of the outcome of the regression analysis. To assess the practical value, a receiver operating characteristic (ROC) curve was calculated with the significant variables to establish the sensitivity and specificity of the model.

Results

Epidemiology of serotype and gene distribution

The distribution of serotypes and PPI markers varied widely between the five groups (Tables 2 and 3). In Dutch and Taiwanese strains, serotype III isolates were more prevalent in neonatal invasive GBS (P < 0.0001). In Dutch strains, serotype II (P = 0.009) and in Taiwanese strains serotype V (P = 0.005) were more prevalent in colonizing GBS. In adult invasive GBS from Taiwan, no serotype was associated with invasiveness or colonization (Table 2). We found two PPI markers to be associated with invasiveness: gbs1073/sag1038, corresponding to PPI IX in Dutch neonatal strains (P < 0.0001), and sag915, corresponding to a transposon (Tn916) in both neonatal (P = 0.028) and adult invasive Taiwanese strains (P = 0.048). Furthermore, marker gbs0227/sag0234 (PPI I) was less prevalent in Dutch (P = 0.001) and Taiwanese invasive neonatal strains (P < 0.001), but not in Taiwanese adult invasive strains. The presence of markers gbs1313/sag1246 (PPI XII) and gbs1987/sag2029 (PPI XIII) was associated with colonization (P < 0.0001 and P = 0.013) only in the Dutch collection of strains (Table 3). Finally, gbs1135 (PPI X) was more prevalent in Taiwanese colonizing strains. The absence of this gene was associated with virulence in Taiwanese adult invasive strains (P = 0.013). When Dutch and Taiwanese colonizing GBS were compared, serotype II, serotype IV, and marker 1306/sag1233 (PPI XII P = 0.049, P = 0.043, P = 0.0001, respectively) were more prevalent among Dutch GBS and marker 1073/sag1038 (PPI IX) among Taiwanese GBS (P < 0.0001) (Tables 2 and 3). Apparently, there are clear regional differences in the GBS epidemiology and infection potential of clones.

Table 2 Prevalence of serotypes
Table 3 Prevalence of genetic markers

Invasion markers and serotype analysis

For the Dutch strains, serotype, absence of markers gbs1987/sag2029, gbs227/sag0234, gbs1125, and gbs1313/sag1246, and the presence of marker gbs1073/sag1038 were clearly associated with invasiveness (Tables 2 and 3). The ROC curve showed a large area under the curve (AUC) of 0.963, representing a sensitivity of 97.7% and a specificity of 85.9% in indicating virulence (Fig. 1a). If the algorithm deduced from the ROC curve was used to identify Taiwanese strains, this model correctly identified 79% of the neonatal invasive strains and only 25% of adult invasive strains and 57% of colonizing strains. Therefore, to optimize the selection of markers for the Taiwanese collection of strains, a new model was constructed, comparing both neonatal and adult invasive strains to colonizing strains from the same region. For Taiwanese neonatal strains, serotype and absence of markers gbs1987/sag2029, gbs0227/sag0234, and gbs1135, and the presence of marker sag 915 were independently associated with virulence (Tables 2 and 3). With these variables, the ROC curve showed an AUC of 0.894 and the model potentially predicted invasiveness with a sensitivity of 93% and specificity of 75.9% (Fig. 1c). If the serotype was excluded, the AUC was still 0.89 for the Dutch and 0.81 for the Taiwanese neonatal strains (Fig. 1b, d). The optimal ROC curve constructed for the adult invasive Taiwanese strains generated an AUC of only 0.657 and a low sensitivity (71.4%) and specificity (53.4%) for tracing potential virulence (Fig. 1e). Only a few variables contributed to this model: the presence of marker sag 915, and the absence of markers gbs1306/sag1233 and gbs1135. In contrast to both groups of neonatal invasive strains, serotype was not a relevant variable (Table 2). So, with different sets of markers and serotypes, it was possible to construct a reliable model for the prediction of potential invasiveness for both groups of neonatal but not for adult invasive strains.

Fig. 1
figure 1

Receiver operating characteristic (ROC) curves including both sensitivity and specificity estimates. Each curve is the regression line that summarizes the overall diagnostic accuracy. AUC = area under the curve, SE = standard error of AUC. a, b Dutch neonatal invasive strains. c, d Taiwanese neonatal invasive strains. e Taiwanese adult invasive strains. In a and c, the included variables are molecular markers and serotype. To establish the influence of markers alone, serotype was left out as a variable in b and d. For adult invasive strains, only one curve was constructed (e) because the serotype was not a relevant variable. a AUC: 0.963, SE: 0.014, 95% CI: 0.935–0.99, cutoff: 0.2055, sensitivity: 97.7%, specificity: 85.9%. Included variables are serotype, gbs1987/sag2029, gbs227/sag0234, gbs1073/sag1038, gbs1125, and gbs1313/sag1246. b AUC: 0.898, SE: 0.028, 95% CI: 0.844–0.952, cutoff: 0.351, sensitivity: 90.9%, specificity: 79.3%. Included variables as in a, except serotype. c AUC: 0.894, SE: 0.035, 95% CI: 0.826–0.963, cutoff: 0.3525, sensitivity: 93%, specificity: 75.9%. Included variables are serotype, gbs1987/sag2029, gbs227/sag0234, gbs1135, and sag915. d AUC 0.823, SE: 0.044, 95% CI:0.738–0.909, cutoff: 0.388, sensitivity: 72.1%, specificity: 82.8%. Variables as in c, except serotype. e AUC: 0.657, SE: 0.046, 95% CI: 0.567–0.747, cutoff: 0.5495, sensitivity: 71.4%, specificity: 53.4%. Included variables are sag915, gbs1306/sag1233, and gbs1135

Discussion

Based on a combination of conventional serotyping and assessment of a set of PCR markers related to GBS PPIs as defined for a collection of strains from The Netherlands and Taiwan, we have been able to characterize the epidemiology of virulent isolates of GBS likely to cause neonatal disease. Previously, Herbert et al. examined the presence of these PPIs in a limited number of invasive and non-invasive strains of GBS, and used PCRs directed at several genes per island as markers [20]. They found that islands I, VI, and XII met the criteria of a true pathogenicity island, but no association was found between the presence of markers corresponding to these islands and invasiveness [20, 21]. Our present study is the first to examine combinations of these markers of PPI and serotype on a large collection of strains, and to compare strains isolated from different groups of patients from different regions [20].

Previously, not a single major GBS virulence factor related to neonatal disease has been identified, and solely comparing the presence or absence of genes encoding virulence factors can not differentiate accurately between epidemic virulent and colonizing strains. This is confirmed by two independent studies that were carried out to establish the frequency of genes coding putative and known GBS virulence factors among invasive and colonizing isolates [22, 23] (Table 4). Others used random amplified polymorphic DNA (RAPD), exploring a larger part of the genome to compare the genetic structure of colonizing and invasive GBS isolated from neonates with meningitis [2426] (Table 4). Additionally, two previous French studies using pulsed field gel electrophoresis (PFGE) revealed that a subset of isolates recovered from meningitis clustered within three PFGE groups [27, 28]. The authors suggested that it should be possible to characterize the specific markers, define primers, and construct a PCR assay to identify virulent isolates of GBS. However, such a test has not been developed yet. The PPIs we examined harbor known and putative GBS virulence factors, insertion sequences, transposons, and genes of unknown function. (Table 1). We found that in all geographically related neonatal invasive strains, gbs1987/sag2029, corresponding to a true PPI (I), was more prevalent in colonizing rather than invasive GBS isolates. Also, one marker in the Dutch neonatal strains (gbs1073/sag1038, PPI IX) and one marker for a mobile element, sag915(Tn916), in the Taiwanese strains were associated with invasiveness. This probably reflects the complex and multi-factorial nature of GBS virulence and molecular pathogenesis, which is likely to be attributable to multiple genes, polymorphism, and differential expression of these genes and regulation mechanisms [24, 25, 2932].

Table 4 Overview of known markers to potentially identify the invasive subgroup, comparing the results of our study with those of previous works

Without understanding the exact link between PPI, serotype, and virulence, our novel markers allow us to epidemiologically differentiate between invasive and colonizing GBS, more so than other methods permitted thus far. However, it has to be emphasized that epidemiological behavior and invasiveness might be closely intertwined. Analysis of GBS strains from additional geographic locales will help to resolve this issue. Optimal PPI markers for epidemiological discrimination vary between the two geographically distinct areas and in subsets of neonatal GBS strains from The Netherlands and Taiwan. After optimizing the markers, our predictive model was still less accurate for the Taiwanese strains, most likely because our initial selection was based on screening of the Dutch strains. Further research exploring the distribution of molecular markers not included in our study in different populations and regions is necessary to give insight into the dynamics of group B streptococci.