Integrated genomic epidemiology and phenotypic profiling of Clostridium difficile across intra-hospital and community populations in Colombia

Muñoz, Marina; Restrepo-Montoya, Daniel; Kumar, Nitin; Iraola, Gregorio; Camargo, Milena; Díaz-Arévalo, Diana; Roa-Molina, Nelly S.; Tellez, Mayra A.; Herrera, Giovanny; Ríos-Chaparro, Dora I.; Birchenall, Claudia; Pinilla, Darío; Pardo-Oviedo, Juan M.; Rodríguez-Leguizamón, Giovanni; Josa, Diego F.; Lawley, Trevor D.; Patarroyo, Manuel A.; Ramírez, Juan David

doi:10.1038/s41598-019-47688-2

Article
Open access
Published: 05 August 2019

Volume 9, article number 11293, (2019)

Scientific Reports

Marina Muñoz^1,2,
Daniel Restrepo-Montoya^1,3,
Nitin Kumar ORCID: orcid.org/0000-0002-2140-4923⁴,
Gregorio Iraola ORCID: orcid.org/0000-0002-6516-3404^5,6,
Milena Camargo^7,8,
Diana Díaz-Arévalo ORCID: orcid.org/0000-0003-3766-5500^7,9,10,
Nelly S. Roa-Molina¹¹,
Mayra A. Tellez¹¹,
Giovanny Herrera^1,12,
Dora I. Ríos-Chaparro¹,
Claudia Birchenall¹³,
Darío Pinilla¹³,
Juan M. Pardo-Oviedo ORCID: orcid.org/0000-0003-0084-3449¹³,
Giovanni Rodríguez-Leguizamón¹³,
Diego F. Josa¹⁴,
Trevor D. Lawley⁴,
Manuel A. Patarroyo ORCID: orcid.org/0000-0002-4751-2500^7,8 &
…
Juan David Ramírez¹

Abstract

Clostridium difficile, the causal agent of antibiotic-associated diarrhea, has a complex epidemiology that has been poorly studied in Latin America. We performed a robust genomic and phenotypic profiling of 53 C. difficile clinical isolates established from diarrheal samples from either intrahospital (IH) or community (CO) populations in central Colombia. In vitro tests were conducted to evaluate the cytopathic effect, the minimum inhibitory concentration of ten antimicrobial agents, the sporulation efficiency and the colony forming ability. Eleven different sequence types (STs) were found; in most samples a single ST was present, but in three samples two different STs were isolated. Interestingly, CO patients were infected with STs associated with hypervirulent strains (ST-1 in Clade-2). The three coexistence events (two STs simultaneously detected in the same sample) always involved ST-8 from Clade-1. A total of 2,502 genes were present in 99% of the isolates with 95% identity or more, representing a core genome of 28.6% of the 8,735 total genes identified in the set of genomes. A high cytopathic effect was observed for the isolates positive for the two main toxins but negative for binary toxin (TcdA+/TcdB+/CDT− toxin production type), found only in Clade-1. Molecular markers conferring resistance to fluoroquinolones (cdeA and gyrA) and to sulfonamides (folP) were the most frequent in the analyzed genomes. In addition, 15 other markers were found mostly in Clade-2 isolates. These results highlight the regional differences that C. difficile isolates display; in this case, the CO isolates had a greater number of accessory genes and virulence-associated factors.

Introduction

Clostridium difficile (CD) is an anaerobic Gram-positive spore-forming bacillus recognized as the causal agent of antibiotic-associated diarrhea and a wide range of gastrointestinal syndromes, including pseudomembranous colitis and toxic megacolon, that in complex cases can result in death¹. The impact of C. difficile infection (CDI) has been well documented in North America, Europe, and some regions of Asia, especially at the intrahospital (IH) level where individuals are exposed to the main risk factors for C. difficile proliferation, among them, antibiotic therapy². In Latin America, studies focused on determining the frequency of CDI and identifying the hypervirulent strain RT027/BI/NAP1 (belonging to ST-1, Clade 2) have been carried out mainly in Argentina, Colombia, Brazil, Costa Rica, Chile and Perú³.

C. difficile features two main toxins (ToxA and ToxB), which belong to a large family of clostridial toxins with glucosyltransferase activity⁴ and are responsible for the symptoms caused by damage to the epithelial tissue of the infected host⁵. Besides the main toxins, the binary toxin (composed of subunits CdtA and CdtB), which has ADP-ribosyl transferase activity, may have a role in the adherence of C. difficile by acting synergistically with other factors such as surface proteins⁶. Another factor involved in the complexity of CDI management is antibiotic resistance, which has been associated with polymorphisms and/or the presence of genes that can be transported by mobile genetic elements⁷. Additionally, the ability of C. difficile to sporulate and germinate (aspects that affect the dissemination and colonization of the microorganism), as well as its resistance to certain antimicrobials (which affects the response to treatment), contributes significantly to the impact of C. difficile on its hosts⁸.

Whole-genome sequencing of C. difficile is a prerequisite to understand the molecular and genomic basis of the phenotypic diversity involved in the wide range of clinical impacts this microorganism can cause. A detailed high-resolution genomic and phenotypic profiling of C. difficile samples collected in Bogotá (central Colombia) was thus carried out here, to obtain a broad insight into differences in CDI between community and intrahospital populations.

Results

Genome sequencing, assembly, and species allocation

Fifty-three clinical isolates were established from 17 fecal samples from patients suffering from diarrhea in whom CDI was detected as part of a previous screening⁹. A variable number of isolates (1 to 5) was recovered from each sample, as a strategy to recover multiple genotypes potentially coexisting in the same sample, as reported elsewhere¹⁰. Isolate information is described in Supplementary Table S1. The complete set of isolates was subjected to genomic and phenotypic characterization as described in Supplementary Fig. S1. Thirty of the isolates (56.6%) were from the Fundación Clínica Shaio (Bogotá, Colombia) and the remaining 23 (43.4%) were from the Hospital Universitario Mayor – Méderi (Bogotá, Colombia). Of the total isolates, 60.4% (n = 32) corresponded to health-care associated infections, collected at intra-hospital (IH) services, and the remaining 39.6% (n = 21) were community-acquired (CO) infections.

The analyzed genomes contained between 4.1 and 4.4 megabases, with an average N50 of 422,610 nt (range 156,209–857,412 nt) for the assemblies and a depth of at least 319.3× (Supplementary Data set S1). The species allocation of the genomes was verified using the average nucleotide identity (ANI)¹¹, with ANI > 95.0% as the cutoff point.
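
The two assembly checks described above (N50 and the ANI species cutoff) can be sketched as follows. This is a minimal illustration under stated assumptions, not the pipeline used in the study: the function names and the contig lengths are hypothetical, and ANI values are assumed to be expressed as percentages.

```python
def n50(contig_lengths):
    """Return the N50 of an assembly.

    N50 is the length L such that contigs of length >= L together
    cover at least half of the total assembly length.
    """
    if not contig_lengths:
        raise ValueError("at least one contig length is required")
    total = sum(contig_lengths)
    running = 0
    # Walk from the longest contig downwards until half the assembly is covered.
    for length in sorted(contig_lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


def passes_species_cutoff(ani_percent, cutoff=95.0):
    """Apply the ANI > 95.0% species cutoff stated in the text."""
    return ani_percent > cutoff


# Hypothetical contig lengths (nt), for illustration only.
example_contigs = [80, 70, 50, 40, 30, 20]
example_n50 = n50(example_contigs)
```

A strict inequality is used for the cutoff because the text states ANI > 95.0 rather than ANI ≥ 95.0.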

C. difficile Clades 1 and 2 and coexistence of multiple sequence types were found within Colombian samples

A total of 11 sequence types (STs) were identified in the set of isolates evaluated. Even though MLST phylogenetics could be an incomplete method, it is an important scheme for grouping isolates and has been the accepted strategy for intra-species classification of C. difficile¹². Therefore, it was used as a first step to identify the clades to which the STs circulating in the analyzed populations belong. All Colombian isolates are close to Clade-1 and Clade-2 members (Fig. 1A). Nevertheless, the reference sequences from the other clades included in the analysis did not form independent groups. Such is the case of RT017_M68 and RT017_CF5, which, despite belonging to Clade-3, grouped with Clade-1 strains.

In terms of ST diversity by Clade (Fig. 1B), 35 of the 53 isolates (66.0%) were in Clade-1, which contained 9 STs with ST-8 being the most abundant (20.8%), followed by ST-26 (11.3%). The remaining 18 isolates (34.0%) were grouped in Clade-2, which contained only 2 STs, ST-41 (18.9%) and ST-1 (15.1%). The discrimination of the STs by population (Fig. 1C) showed that 7 STs were found at the IH level and 5 STs at the CO level. ST-8 (Clade-1 member) was the only ST found in both populations (IH and CO). STs belonging to Clade-2 were the most frequent in each population; ST-1 (associated with hypervirulent strains) was present in the CO population only.

In 14 of the 17 samples analyzed, a single ST was identified for all isolates established, showing that many of the isolates are highly clonal. However, in three samples, two STs were identified simultaneously; these were defined as ‘coexistence events’ (Fig. 1A right panel). Two interesting findings were identified in those coexistence events: (i) ST-8 was present in all three events; and (ii) STs belonging to two different Clades were found in coexistence event 3 (ST-8 and ST-41 from Clade-1 and Clade-2, respectively).
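
The definition of a coexistence event (more than one ST among the isolates of a single sample) translates directly into a grouping step. The sketch below is illustrative only; the sample identifiers and ST assignments in the toy data are hypothetical and do not reproduce Supplementary Table S1.

```python
from collections import defaultdict


def coexistence_events(isolates):
    """Find samples whose isolates carry more than one sequence type.

    isolates: iterable of (sample_id, sequence_type) pairs, one per isolate.
    Returns {sample_id: sorted list of STs} for samples with two or more STs.
    """
    sts_by_sample = defaultdict(set)
    for sample_id, sequence_type in isolates:
        sts_by_sample[sample_id].add(sequence_type)
    return {
        sample_id: sorted(sts)
        for sample_id, sts in sts_by_sample.items()
        if len(sts) > 1
    }


# Hypothetical typing results: (sample, ST) for each isolate.
toy_isolates = [
    ("sample_A", "ST-8"), ("sample_A", "ST-8"),    # clonal sample
    ("sample_B", "ST-8"), ("sample_B", "ST-41"),   # two STs in one sample
    ("sample_C", "ST-1"),
]
toy_events = coexistence_events(toy_isolates)
```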

Clustering by clade and identification of high numbers of accessory genes by pangenome analysis

A total of 8,735 gene clusters were identified and most of them belonged to the accessory genome (71.4%). The core genome was limited to only 2,502 genes, corresponding to 28.6% of the total genes found. The phylogenetic reconstruction based on 99,527 homologous positions of core genome had better discrimination than that based on MLST, which allowed independent clustering by clade of the genomes in each of the five clades (Fig. 2a). However, when the topology of the obtained tree was compared with the population origin (IH/CO) and with the hospital where the samples were collected (Méderi/Shaio), no origin-specific clustering was found. This confirmed that C. difficile isolates that belonged to the two clades were circulating in both populations and in the two hospital centers (Fig. 2a).
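
The core/accessory split and the percentages quoted above can be checked with a short sketch. It assumes, following the abstract, that a gene is "core" when present in at least 99% of isolates; the presence/absence table, gene names, and function names are hypothetical, and the real analysis was run with dedicated pangenome software rather than this code.

```python
def split_pangenome(presence, n_isolates, core_fraction=0.99):
    """Split gene clusters into core and accessory lists.

    presence: dict mapping gene name -> set of isolates carrying it.
    A gene is core if it is carried by at least core_fraction of the isolates.
    """
    core, accessory = [], []
    for gene, carriers in presence.items():
        if len(carriers) >= core_fraction * n_isolates:
            core.append(gene)
        else:
            accessory.append(gene)
    return core, accessory


def percent(part, whole):
    """Percentage rounded to one decimal place, as reported in the text."""
    return round(100.0 * part / whole, 1)


# Figures taken from the text: 2,502 core genes out of 8,735 gene clusters.
core_pct = percent(2502, 8735)
accessory_pct = percent(8735 - 2502, 8735)

# Hypothetical four-isolate presence/absence table.
toy_presence = {
    "geneA": {"i1", "i2", "i3", "i4"},
    "geneB": {"i1", "i2"},
    "geneC": {"i3"},
}
toy_core, toy_accessory = split_pangenome(toy_presence, n_isolates=4)
```

With 53 isolates, a 99% threshold means a gene must be present in every isolate to count as core, since 0.99 × 53 exceeds 52.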

Single nucleotide polymorphism (SNP) distances among isolates were determined from the core genome alignment (2,502 core genes). The results showed that isolates belonging to the same ST and established from the same sample carried from 0 to 71 SNPs (average: 15.5 SNPs per sample); the isolates with the highest numbers of SNPs in this group came from two samples collected from the CO population (71 and 69.3 on average, with four and five isolates, respectively). An average of 215.6 SNPs was identified between isolates belonging to the same ST but isolated from different samples. Strikingly, ST-26 (in Clade-1) showed the highest SNP result (475.8 SNPs; Fig. 2b).
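
Pairwise SNP distances of this kind can be computed from an alignment by counting differing columns. The following is a minimal sketch, assuming that gap and ambiguous positions ("-" and "N") are skipped; that choice, the function names, and the toy sequences are assumptions for illustration and may differ from the tool used in the study.

```python
from itertools import combinations


def snp_distance(seq_a, seq_b, ignore="-N"):
    """Count positions where two aligned sequences differ.

    Columns in which either sequence has a gap or an ambiguous base
    (characters listed in `ignore`) are not counted.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must come from the same alignment")
    return sum(
        1
        for a, b in zip(seq_a.upper(), seq_b.upper())
        if a != b and a not in ignore and b not in ignore
    )


def pairwise_snps(alignment):
    """Return {(name_1, name_2): SNP distance} for every pair of isolates."""
    return {
        (x, y): snp_distance(alignment[x], alignment[y])
        for x, y in combinations(sorted(alignment), 2)
    }


# Hypothetical core-genome alignment fragment.
toy_alignment = {
    "iso1": "ACGTAC",
    "iso2": "ACGTTC",
    "iso3": "A-GTTN",
}
toy_distances = pairwise_snps(toy_alignment)
```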

The accessory genome phylogenetic reconstruction revealed well-supported nodes with branches longer than those found for the core genome analysis (Fig. 2c). Although the topology of the tree was maintained, mainly in the members of Clade-2, there were some incongruences in the clustering of the other genomes (Fig. 2c). These incongruences included changes in the clustering of some STs, namely ST-42 and ST-43 as well as ST-26 and ST-48, which were now found to be closely related, the first two being involved in coexistence events (Fig. 2d). The new clustering in Clade-1 in the accessory genome tree may be associated with the high number of accessory genes identified in the analyzed genomes: at least 1,045 accessory genes per genome (Fig. 2e). There was also a high number of unique genes (Fig. 2f), especially in the genome of Gcol-A84 (established from a CO individual enrolled in Fundación Clínica Shaio), which had 49 unique genes and was one of the isolates showing changes in the tree clustering. Although some of the unique genes found in Gcol-A84 corresponded to genes coding for proteins involved in biological processes or hypothetical/uncharacterized proteins, genes associated with antibiotic resistance and sporulation, as well as markers of mobile genetic elements, were also found (Supplementary Table S2).

Colombian isolates carry toxin coding genes with atypical organizations and have a cytopathic effect on Vero cells

Ariba analyses¹³ showed that 40 (75.5%) of the genomes contain toxin coding genes that were reported previously (Supplementary Fig. S2). Among them, tcdA (41.5%, n = 22) and tcdB (60.4%; n = 32), which code for the two main C. difficile toxins, are located within the pathogenicity locus (PaLoc), and cdtA and cdtB (34.0%; n = 18, present simultaneously), which encode the subunits of the binary toxin, are located within the binary toxin (CDT) locus (CdtLoc). The toxigenic profiles of the isolates were analyzed according to the clade to which they belonged using a comparative approach. We found that none of the 18 isolates within Clade-2 (associated with hypervirulent strains) were positive for tcdA, only 10 (55.6%) were positive for tcdB, and the genes associated with the binary toxin (cdtA and cdtB) were detected exclusively in this clade.

The phenotypic tests, aimed at evaluating the cytopathic effect of the culture supernatant at three dilutions (1:10, 1:100, and 1:1000) on Vero cells (a line of African green monkey kidney cells), were conducted according to the accessory genome clustering shown in Fig. 3c. This step of the phenotypic characterization revealed that only 9 isolates (17.0%) did not cause cell rounding, even at the highest evaluated concentration (1:10). These isolates showed an effect similar to that of the non-toxigenic reference C. difficile strain ATCC® 700057, which was included as the non-toxigenic control for the assay (Fig. 3a). The remaining 83.0% of the isolates (n = 44) caused cell rounding in more than 20% of the cells evaluated, including isolates Gcol-35, -37, -38, -39, and -40, in which toxin-coding genes were initially not detected (Supplementary Fig. S2). In general, although the isolates had a homogeneous effect between populations at the highest concentration (1:10), the effects were more stable through the dilutions in the CO isolates than in the IH isolates, where the cytotoxic effect was reduced at the 1:100 and 1:1000 dilutions (Fig. 3b,c). Further, the supernatants of isolates Gcol-52, -66, -81, and -89 caused the rounding of 100% of the cells even at the 1:1000 dilution, which was similar to the effect of the reference C. difficile strain ATCC® BAA-1870 that was evaluated as the toxigenic control of the assay (Fig. 3a). Although the comparative analysis by clade showed a significant increase in the mean cytopathic effect of the culture supernatant of the isolates in Clade-2 at the 1:10 dilution, this difference was lost through the dilutions, the effect being even higher in Clade-1 at the 1:1000 dilution, without any difference in the fold change per dilution (Fig. 3c). Interestingly, the comparison by population showed a marked differential pattern, with a significant increase in the mean cytopathic effect of the supernatants of the isolates established from CO and a clear difference in the fold change through the dilutions (Fig. 3d).

An exhaustive search of toxin coding genes in all the genomes analyzed was performed to clarify the genomic bases of this toxigenic effect, by mapping them against reference sequences of PaLoc and CdtLoc. The complete organization of these loci in the established isolates was analyzed by the clade to which they belonged (Fig. 3e,f). This exhaustive search showed that, when defining as positive any isolate that carried at least one gene previously associated with a toxigenic effect, the overall balance of toxigenic profiles increased to 83.0% (n = 44). This increase was accompanied by an increase in the overall frequency of detection of tcdA and tcdB to 71.7% (n = 38), these markers being present simultaneously and identified as positives in 100% of the Clade-2 isolates. The traditional toxin production types (TPTs) were identified considering the results described in Fig. 3e,f. The TcdA+/TcdB+/CDT+ TPT was detected in 34.0% of the genomes (n = 18), which corresponded to all Clade-2 isolates, whereas the TcdA+/TcdB+/CDT− TPT was detected in 41.5% of the genomes (n = 22). An additional TPT (TcdA−/TcdB−/CDT−) was detected in the genomes of 24.5% (n = 13) of isolates that had lost the coding regions for the main toxins (Fig. 3g). The combination of TPTs shown by STs found as part of coexistence event 3 (Fig. 3c) completes the toxigenic arsenal in the same patient at the IH level (Fig. 3h). In general, the isolates with the TcdA+/TcdB+/CDT− TPT (Clade-1 members) showed the highest cytopathic effect (Fig. 3i).
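
The assignment of toxin production types from gene presence can be sketched as a small classifier. The code below is illustrative: it assumes that each isolate's gene set already contains only genes judged complete (the study treated CdtLoc hits covering less than 50% of the gene length as negative), it requires both cdtA and cdtB for a CDT+ call, and it writes the minus sign as an ASCII hyphen. Isolate names and gene sets are hypothetical.

```python
from collections import Counter


def toxin_production_type(genes):
    """Return a TPT label such as 'TcdA+/TcdB+/CDT-' from a set of gene names."""
    genes = set(genes)
    tcd_a = "+" if "tcdA" in genes else "-"
    tcd_b = "+" if "tcdB" in genes else "-"
    # The binary toxin needs both subunits, CdtA and CdtB.
    cdt = "+" if {"cdtA", "cdtB"} <= genes else "-"
    return f"TcdA{tcd_a}/TcdB{tcd_b}/CDT{cdt}"


def tpt_frequencies(isolate_genes):
    """Return {TPT: (count, percent of isolates)} for a dict isolate -> genes."""
    counts = Counter(toxin_production_type(g) for g in isolate_genes.values())
    total = len(isolate_genes)
    return {tpt: (n, round(100.0 * n / total, 1)) for tpt, n in counts.items()}


# Hypothetical isolates, for illustration only.
toy_genes = {
    "iso1": {"tcdA", "tcdB", "cdtA", "cdtB"},
    "iso2": {"tcdA", "tcdB"},
    "iso3": {"tcdA", "tcdB", "cdtA"},   # only one CDT subunit -> CDT-
    "iso4": set(),
}
toy_tpts = tpt_frequencies(toy_genes)
```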

The exhaustive search of toxin coding genes led to the identification of two accessory genes additional to tcdR, tcdE, and tcdC, corresponding to hypothetical proteins, which have already been described within PaLoc, that were restricted to most genomes in Clade-1 (marked with diamonds next to tcdC in Fig. 3e). For CdtLoc, although the mapping revealed the presence of coding regions for the two binary toxin subunits, they represented less than 50% of the total length of the genes (Supplementary Fig. S3), so the encoded proteins may not be functional and were considered negative for these regions during the analysis.

Atypical organizations in two PaLoc regions were also detected. One corresponded to an increase in the number of copies of the coding region for the holin-like protein, which was identified in isolates obtained from 11 samples; however, this open reading frame was found in a single copy across the genomes (Supplementary Fig. S4a). In the four isolates established from sample 205 (obtained at IH level), the majority of the PaLoc region was absent and they were classified as TcdA−/TcdB−/CDT− TPT, yet they showed an increased depth in the holin-like protein region with respect to the average depth obtained in other PaLoc coding regions. The organization of this region in isolates obtained from sample 205 is similar to that in sample 172, which was used as a model for the description of this organization (Supplementary Fig. S5a). The other atypical organization corresponded to a loss of reads in a region close to tcdA in some isolates (marked with a red diamond in Fig. 3b). This represents a deletion towards the 3′ end of the coding region for tcdA (extended representation in Supplementary Fig. S4b), which was present in Clade-2 isolates (Supplementary Fig. S5b) and affected the detection of this region during preliminary searches of the databases (Supplementary Fig. S2).

High correlation between antimicrobial resistance molecular markers and in vitro MIC

The existence of antimicrobial resistance molecular markers (AMR-MMs) was evaluated from whole genome sequences through comparisons against sequences deposited in eight databases using Ariba software¹³. The results showed that the following four AMR-MMs were present in all isolates: cdeA and gyrA (associated with fluoroquinolone resistance), folP (which confers sulfonamide resistance) and rpoB (associated with rifamycin resistance). The next most frequent AMR-MM was EF-Tu (associated with elfamycin resistance), present in 96% (n = 51) of the genomes. The frequency of all other AMR-MMs was below 30% (n = 16), as in the case of ermB (conferring erythromycin resistance) and MLS (associated with macrolide, lincosamide and streptogramin resistance) (Fig. 4a). Moreover, less than 4% of the isolates contained markers associated with mobile genetic elements (MGEs): tetO, tet5, tetW, and tet6 (associated with tetracycline resistance) (Fig. 4a). All analyzed genomes had molecular markers that have been previously associated with the fluoroquinolone, sulfonamide and rifamycin antimicrobial classes (Fig. 4a). The STs with the highest number of AMR-MMs belong to the IH population (Supplementary Fig. S6), particularly those belonging to Clade-2, with a total of 28–34 markers identified per genome (Supplementary Table S3).

The MIC₅₀ for 10 antimicrobials was determined for C. difficile (metronidazole, vancomycin, tetracycline, erythromycin, rifampicin, ampicillin, penicillin, fusidic acid, clindamycin, and moxifloxacin), with the aim of identifying the real effect of AMR-MMs on phenotypic resistance. We found low sensitivity of the isolates to metronidazole, one of the most commonly used agents for the treatment of CDI, but adequate sensitivity to vancomycin and rifampicin, which are also recommended for its therapeutic management. Clindamycin, ampicillin, and penicillin showed limited ability to inhibit colony proliferation, as confirmed by both the MIC₅₀ (Fig. 4b) and MIC₉₀ (Supplementary Fig. S7). Tetracycline, erythromycin and vancomycin were the antimicrobial agents with the highest number of associations with AMR-MM presence. Conversely, a lack of association was evident for metronidazole, rifampicin, fusidic acid, moxifloxacin and clindamycin (Fig. 4c).
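
For orientation, MIC₅₀ and MIC₉₀ are commonly computed across a collection of isolates as the MIC at which 50% or 90% of the isolates are inhibited. The sketch below uses that population-level definition, which is an assumption here because the text does not spell out the calculation; the function name and the MIC values (in µg/mL) are hypothetical.

```python
def mic_percentile(mics, percent):
    """Return the MIC that inhibits at least `percent` % of the isolates.

    mics: one MIC value per isolate for a single antimicrobial.
    percent: integer percentile, e.g. 50 for MIC50 or 90 for MIC90.
    The value at rank ceil(n * percent / 100) of the sorted MICs is returned.
    """
    if not mics:
        raise ValueError("at least one MIC value is required")
    if not 0 < percent <= 100:
        raise ValueError("percent must be in the range 1-100")
    ordered = sorted(mics)
    # Integer ceiling division avoids floating-point rounding surprises.
    rank = (len(ordered) * percent + 99) // 100
    return ordered[rank - 1]


# Hypothetical MICs (µg/mL) for ten isolates against one antimicrobial.
toy_mics = [0.5, 1, 1, 2, 2, 4, 4, 8, 16, 32]
toy_mic50 = mic_percentile(toy_mics, 50)
toy_mic90 = mic_percentile(toy_mics, 90)
```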

Colombian isolates show increased sporulation capacity and number of viable spores

The presence of proteins involved in the sporulation/germination process was evaluated initially by comparing their predicted amino acid sequences (Fig. 5a). We found that the Colombian CDs produced at least 38 of the 47 proteins involved in sporulation processes reported in ClosIndb for the reference strain C. difficile 630 uid57679¹⁴. Most of these proteins shared ≥98.0% identity; the exception was the reference sporulation protein CD630_05720, which shared <50.0% identity with the proteins in five of the Clade-2 isolates. We also found a double copy of the stage III sporulation protein AG (CD630_11980) in most of the isolates, except eight members of Clade-1. Three proteins encoded by a limited number of genomes (8–13) were also found, namely, CD630_34990, CD630_26810, and CD630_20350.

The phenotypic evaluation showed that the sporulation efficiency was higher for the Clade-1 than for the Clade-2 isolates (Fig. 5b), except for Gcol_A34, Gcol_A35, Gcol_A54, and Gcol_A55, for which the efficiency was <60%. The results were more heterogeneous for isolates in Clade-2, where the sporulation efficiency ranged from 40% to 91%. No differences were found by population. In the case of multiple infection events, the sporulation efficiencies of all the isolates were >80%.

The spores were purified and used to evaluate the capacity of the isolates to generate new CFUs, as an indicator of germination efficiency (Fig. 5c). The dilution at which the isolates generated colonies in the range of 30–300 CFUs was then determined. The isolates generated CFUs at dilutions from 10⁻⁶ to 10⁻⁹, whereas the ATCC® control strains 700057™ and BAA-1870™ generated CFUs at dilutions of 10⁻⁵ and 10⁻⁶, respectively, which indicated a higher germination capacity of the Colombian C. difficile isolates compared with the control strains.
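
The link between the countable dilution and the viable count can be made explicit with the standard plate-count formula, CFU/mL = colonies / (dilution × plated volume). The sketch below is a hedged illustration: the 30–300 countable range comes from the text, but the plated volume (0.1 mL), the colony count, and the function names are assumptions not stated in the article.

```python
COUNTABLE_RANGE = (30, 300)  # countable colonies per plate, as stated in the text


def is_countable(colonies):
    """True if a plate falls inside the 30-300 colony countable range."""
    low, high = COUNTABLE_RANGE
    return low <= colonies <= high


def cfu_per_ml(colonies, dilution, plated_volume_ml):
    """Estimate CFU/mL of the undiluted suspension from one countable plate.

    dilution: dilution factor of the plated suspension, e.g. 1e-6.
    plated_volume_ml: volume spread on the plate (assumed, not from the article).
    """
    if not is_countable(colonies):
        raise ValueError("plate is outside the countable range")
    return colonies / (dilution * plated_volume_ml)


# Hypothetical plate: 150 colonies at the 10^-6 dilution, 0.1 mL plated.
toy_estimate = cfu_per_ml(150, 1e-6, 0.1)
```

Under this formula, a tenfold higher countable dilution with the same colony count corresponds to a tenfold higher estimate of viable spores, which is the reasoning behind comparing the dilutions of the isolates and the control strains.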

Discussion

Genomic epidemiology is one of the most useful strategies to detect, characterize, and monitor pathogens that have an impact on human health¹⁵. Although a large number of studies have elucidated the genomic epidemiology of C. difficile^16,17, only a limited number of studies have focused on Latin America. One such study of C. difficile in Costa Rica not only identified hypervirulent C. difficile strains at the local level, but also described profiles of toxin production that differed from those reported previously, and was the first study to reveal the high genomic diversity of C. difficile in this region of the world¹⁸. The absence of a routine C. difficile diagnosis strategy in Colombia led our group to investigate toxigenic profiles in patients with diarrhea. We detected circulating hypervirulent C. difficile strains and revealed the importance of the CO population in the epidemiology of C. difficile in Colombia. We also found the coexistence of C. difficile either with different STs or with the same ST but different toxin cassettes^9,19. Considering the evidence of the high diversity of circulating C. difficile in Colombia, we performed genomic and phenotypic characterization of the C. difficile strains circulating in Bogotá (Fig. 1).

During the first phase of analysis, the typing strategy traditionally accepted for C. difficile based on MLST¹² was applied to the complete genomes of the isolates, which showed that most of the circulating STs in Colombian C. difficile strains belong to Clade-1, with 9 STs (Fig. 1A,B). This agrees with the available data about the molecular epidemiology of this pathogen at a global level, which describes Clade-1 as the most heterogeneous clade because it contains STs with high frequency of detection in different regions of the world²⁰. The rest of the isolates belonged to Clade-2, a clade associated with hypervirulent strains that lead to a severe clinical picture and high recurrence events²¹, with only 2 STs, ST-41 and ST-1 (Fig. 1A,B). ST-8 had the highest frequency of detection among the isolates (Fig. 1B) and was the only ST present in the two populations evaluated (IH/CO, Fig. 1C). ST-41 was the most frequently found ST at the IH level (Fig. 1C) and it has been associated with severe inflammatory disease and disruption of the intestinal mucosa²². ST-1 was the most frequently found ST in the CO population and it was present only in this population (Fig. 1C). These results are of particular importance because ST-1 (Clade-2) has been associated with hypervirulent strains involved in outbreaks in North America and Europe, which is why it is recognized as an important public health problem^21,23,24. The MLST analysis also allowed the identification of three coexistence events of two STs simultaneously in the same patient (Fig. 1A), with coexistence event 3 involving both Clade-1 and Clade-2 STs. Two of these events were identified from CO samples (events 2 and 3) and ST-8 was the only ST that was present consistently in all the events, which could be due to its widely reported frequency²⁵. Although typing schemes have been applied to a small number of C. difficile isolates in Colombia, these studies have been based on ribotyping and had a purely descriptive approach²⁶. The comparison of the findings with studies worldwide reveals a similar profile in that ST-8, the most frequent ST in this study, has been reported in the United Kingdom²⁷, Australia²⁸, China²⁹ and the United States³⁰, usually associated with recurrent infections, in coinfection with a second ST, and showing substitutions associated with resistance to antibiotics (in gyrA and gyrB)^27,29.

Herein, we generated a tree based on a core genome (Fig. 2a). Isolate clustering with members of the two main C. difficile clades (Clade-1 and Clade-2), and the identification of C5 as a possible common ancestor, is consistent with previous reports²⁰. These results are aligned with recent reports in which a core genome MLST (cgMLST) scheme was proposed as a useful tool to describe the population structure of C. difficile at the genetic level. The high percentage of genes (71.4%) that were part of the accessory genome and the inconsistencies in the topology of the phylogenetic reconstruction of the concatenated sequences of the accessory genes (Fig. 2c), which included a change in the relationship between some STs, as well as the high number of accessory and unique genes (Fig. 2d–f) confirmed the dynamic character of the C. difficile genome.

Comparisons of the genome sequences of the established isolates with the information available in databases revealed the high frequency with which the isolates obtained at the IH and CO levels carry toxin coding genes (Supplementary Fig. S2). However, the unexpected findings in Clade-2 and the cytotoxic capacity detected in phenotypic tests revealed that this strategy had limitations in accurately describing the organization of circulating C. difficile isolates in Colombia. The detection of tcdB, which leads the pathogenic effect on target cells, was consistent³¹, which indicated its importance for C. difficile colonization³². The changes in the organization of PaLoc regions that we detected (Fig. 3) corresponded to a gain of tcdE, which encodes the holin-like protein, in Clade-1 genomes initially considered non-toxigenic (Supplementary Fig. S2), and the loss of a region of tcdA in Clade-2 genomes, which could affect its detection. These changes indicate that the highly diverse strains circulating in Latin America could escape the diagnostic strategies traditionally used for C. difficile. Our study has revealed atypical organizations in Clade-1 and Clade-2, which suggests the urgent need to develop novel diagnostic strategies for C. difficile in Latin America.

The significantly greater cytopathic effect shown by the TcdA+/TcdB+/CDT− TPT compared with TcdA+/TcdB+/CDT+ TPT for all the main toxins (Fig. 3i), could be explained either by greater efficiency of the action of the proteins encoded in PaLoc or by the effect of additional genes that have not yet been analyzed. In PaLoc, the two additional accessory genes to those traditionally reported (tcdR, tcdE, and tcdC) could be involved, although they are currently only recognized as coding hypothetical proteins. They could play a role during the secretion/action process of TcdA and/or TcdB; however, these two additional accessory genes need to be characterized in order to identify their potential role during the C. difficile infection process.

Four AMR-MMs were detected in all the evaluated genomes (Fig. 4a). Two of these confer resistance to fluoroquinolones (cdeA³³ and gyrA³⁴), while the other two have been associated with sulfonamide (folP³⁵) and rifamycin (rpoB³⁶) resistance. Three of these highly prevalent markers correspond to mutations in constitutive genes (gyrA, folP and rpoB), which confirms the ability of C. difficile to modify even constitutive proteins as part of the process of adaptation to hostile environments such as the human intestine, particularly when it has been exposed to antibiotic therapy³⁷. The existence of these four AMR-MMs in all the characterized genomes indicates that this type of modification could be becoming established as part of the core C. difficile genome, although it has been historically proposed that they may have been transferred through mobile genetic elements, mainly plasmids³⁸. Additionally, besides being the clade with the most genes coding for toxins (Fig. 3e), Clade-2 had the isolates with the highest number of AMR-MMs (Fig. 4a), and this clade also showed reduced susceptibility to a higher number of antimicrobials than Clade-1 isolates. Interestingly, the analyzed isolates obtained from the IH environment showed a higher number of AMR-MMs than those established from the CO population (Supplementary Fig. S6). These findings could therefore reflect C. difficile adaptation to the selective pressure imposed by exposure to antimicrobial agents³⁹; however, the history of antibiotic consumption of the individual patients who provided the samples is unknown, which is a limitation of this study.

The identification of at least 38 of the proteins involved in the different phases of sporulation by comparison with the C. difficile 630 reference strain revealed that, in most cases, they shared >98.0% identities (Fig. 5a)⁴⁰. Some specific exceptions were found, such as the absence of the gene encoding the integral membrane protein CD630_20350, which was found in only eight of the genomes. This protein has been reported as not strictly required for the sporulation or resistance of spores⁴⁰; therefore, it may be that the absence of genes occurs when the encoded protein is not essential for sporulation. Interestingly, we identified the stage III sporulation protein AG (CD630_11980), which had two copies in most of the genomes (45 out of 53). This protein, which is encoded in the spoIIIAABCDEFGH operon, under the control of the sigma G factor and expressed during phase 3 of sporulation, has been reported as strictly conserved among species in phylum Firmicutes that form spores (http://www.ebi.ac.uk/interpro/entry/IPR014195).

When analyzing the sporulation percentage determined in the phenotypic tests (Fig. 5b), we found that, although the results were heterogeneous for Clade-2 isolates, most of the Clade-1 isolates had sporulation percentages >60%; the exceptions were Gcol.A33, Gcol.A34, Gcol.A54, and Gcol.A55. The genomes of two of these isolates had only a single copy of CD630_11980, and these isolates had the lowest sporulation percentages (≤32%).

In general, the confirmation of the plasticity of the C. difficile genome, the presence of STs with more than one toxigenic profile, the identification of infection events involving more than one ST, and the variations found in cytotoxic capacity, sporulation, germination, and antibiotic resistance profiles represent factors at the genomic and phenotypic levels that contribute to the knowledge of the characteristics of C. difficile circulating in Colombia. This study has limitations related to the small sample size and reduced number of isolates, which limit the extrapolation of the data; future studies should therefore include a larger sample size to support the results obtained here. However, our results provide evidence of the microdiversity that usually defines C. difficile populations and support the hypothesis that this opportunistic pathogen undergoes continuous evolution⁴¹ that affects its adaptation during persistent infection⁴². Additionally, the differential presence of different groups of genes and their correlation with the phenotypic profiles described here could have a profound impact on C. difficile ecology, because horizontal gene transfer may favor the acquisition of virulence and the subsequent transition from an environmental lifestyle to a pathogenic one, as proposed in the hypothesis of virulence adaptive polymorphisms, which has been tested in Vibrio cholerae⁴³. To our knowledge, this is the first combined genomic and phenotypic study of C. difficile conducted in Colombia and in Latin America. Further studies in the region are needed to obtain a broad picture of the genomic epidemiology of CD.

Materials and Methods

Ethics approval and consent to participate

The Universidad del Rosario’s Research Ethics Committee approved the initial study, which aimed to detect CDI and establish isolates from fecal samples of patients with diarrhea in Bogotá, Colombia, through act No. 290 of July 27, 2015. All patients included in this study agreed to participate and signed informed consent forms. All methods were performed in accordance with the Declaration of Helsinki and the guidelines of the Colombian Ministry of Health and Social Protection, as approved by the certificate of the Universidad del Rosario’s Research Ethics Committee. In the context of this study, no clinical or other metadata about the patients were analyzed.

Clinical isolates

Fifty-three isolates were established from stool samples of patients with diarrhea who attended two healthcare centers in Bogotá, Colombia (Hospital Universitario Mayor – Méderi and Fundación Clínica Shaio). The methodology for sample collection, CDI detection and establishment of isolates was previously reported by our group⁹. Briefly, an aliquot of the stool sample collected from each patient was spread on ChromID C. difficile CDIF (bioMérieux) and incubated for 48 hours at 37 °C under anaerobic conditions, using the GasPak EZ Anaerobe Pouch system (Becton Dickinson)⁴⁴. The colonies with the macroscopic morphology described by the manufacturer were spread on Trypticase™ Soy Agar (TSA) with 5% Sheep Blood (Becton Dickinson), and subsequently incubated under the aforementioned conditions. The colonies were verified by routine microscopic inspection of Gram stains. The biomass of the verified colonies was expanded by dense plating on TSA medium and later recovered to establish each isolate. Multiple colonies were recovered for each sample because of the possible coexistence of C. difficile genotypes¹⁰.

The isolates were assigned to the community (‘CO’) population when the patients with diarrhea attended the emergency service of the participating healthcare centers and no more than 48 hours had elapsed since their admission, or to the intra-hospital (‘IH’) population when the patient had stayed three days or more in the different services of the participating healthcare centers (Supplementary Table S1), as previously described⁴⁵.

DNA extraction and whole genome sequencing (WGS)

DNA was extracted using an Ultraclean Blood Spin DNA Isolation kit (MoBio Laboratories, Carlsbad, CA, USA) following the manufacturer’s instructions. The WGS of the selected isolates was carried out by Novogene Bioinformatics Technology Co., Ltd. (Beijing, China) using the Illumina HiSeq X-TEN platform. The quality control, assembly and genome identification procedures are described in Supplementary Text S1.

Annotation and comparative genomics analysis

An automated annotation strategy was applied to the multi-FASTA files obtained from the assembly process. This strategy uses PROKKA v1.11⁴⁶, complemented by a refinement step based on comparison against genus-specific databases in RefSeq⁴⁷, and was applied to all the genomes included in the analyzed data set. First, circular visualization of the genomes was carried out in the CGview server⁴⁸; then pairwise comparisons were made to identify differences between the genomes using the BLAST-based tool included in the CGview server. The GFF files obtained from the annotation process were used to determine the pangenome using the Roary tool⁴⁹, with a minimum identity of 95% and the core genome defined as the genes present in 99% of the isolates. A presence/absence matrix of the genes that were part of the core or accessory genome was constructed as a basis to graphically represent the pangenome results using the Python script roary_plots.py⁵⁰.
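The core-genome criterion described above (a gene counts as core when it is present in at least 99% of the isolates) can be illustrated with a minimal Python sketch. The toy matrix, the gene names and the `core_genes` helper are hypothetical and are not part of Roary; the only figures taken from this study are the 2,502 core genes and the 8,735 total genes reported in the Abstract.

```python
from typing import Dict, List


def core_genes(presence: Dict[str, List[int]], threshold: float = 0.99) -> List[str]:
    """Return the genes present in at least `threshold` of the isolates."""
    core = []
    for gene, calls in presence.items():
        if sum(calls) / len(calls) >= threshold:
            core.append(gene)
    return core


# Hypothetical toy matrix: one entry per gene, one column per isolate (1 = present).
toy_matrix = {
    "geneA": [1, 1, 1, 1],
    "geneB": [1, 0, 1, 1],  # missing in one isolate, so accessory
    "geneC": [1, 1, 1, 1],
}
toy_core = core_genes(toy_matrix)

# Core fraction from the counts reported in the Abstract of this study.
reported_core_pct = round(100 * 2502 / 8735, 1)
print(toy_core, reported_core_pct)
```

With only four toy isolates, the 99% threshold is equivalent to requiring presence in every isolate; with larger collections it tolerates a small number of missing calls.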

Phylogenetic analysis

The concatenated sequences of the seven housekeeping genes that are part of the MLST scheme were extracted and aligned using the script align_seqs.py with the MUSCLE method⁵¹. For phylogenetic reconstructions, maximum likelihood trees were built from the alignments of both the MLST scheme and the core genome using FastTree Version 2.1.9 with double precision⁵². The robustness of the nodes was evaluated using the bootstrap method with 1,000 replicates⁵³. The phylogenetic trees were visualized in the web tool Interactive Tree Of Life V3 (http://itol.embl.de)⁵⁴.

The sequences of 15 high-quality reference genomes provided by the Wellcome Trust Sanger Institute (https://www.sanger.ac.uk/science/data/reference-genomes-clostridium-difficile) were included in the comparative analysis. These genomes belong to different C. difficile ribotypes (including those associated with the hypervirulent strains RT027 and RT078) and were considered representative of the clades currently accepted for intra-taxa classification of CD²⁰. Detailed information on the reference genomes is provided in Supplementary Table S4. The SNP-sites program was used to detect and extract SNPs from a multi-FASTA alignment in order to analyze the informative sites for the phylogenetic approach⁵⁵. A pairwise SNP distance matrix was generated from a FASTA alignment of the core genomes to compare the 68 individual isolates included in the pangenome analysis. In addition, a detailed analysis of the accessory genome was conducted by generating a phylogenetic reconstruction from the binary gene presence matrix and subsequently evaluating the total number of accessory and unique genes per isolate.
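As an illustration of what a pairwise SNP distance matrix contains, the following Python sketch counts differing positions between aligned sequences. It is not the tool used in this study, which the text does not name; the toy alignment, the isolate labels and the choice to ignore gaps and ambiguous bases are assumptions made for the example.

```python
from itertools import combinations
from typing import Dict, Tuple

VALID_BASES = set("ACGT")


def snp_distance(seq_a: str, seq_b: str) -> int:
    """Count aligned positions where both sequences carry different unambiguous bases."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    return sum(
        1
        for x, y in zip(seq_a.upper(), seq_b.upper())
        if x in VALID_BASES and y in VALID_BASES and x != y
    )


def pairwise_matrix(alignment: Dict[str, str]) -> Dict[Tuple[str, str], int]:
    """Return the SNP distance for every unordered pair of sequences."""
    return {
        (p, q): snp_distance(alignment[p], alignment[q])
        for p, q in combinations(sorted(alignment), 2)
    }


# Hypothetical toy alignment; '-' marks a gap and is skipped in the count.
toy_alignment = {
    "iso1": "ACGTACGT",
    "iso2": "ACGTACGA",
    "iso3": "ACCTAC-A",
}
distances = pairwise_matrix(toy_alignment)
for pair, dist in distances.items():
    print(pair, dist)
```

For n sequences the matrix holds n(n − 1)/2 unordered pairs, so the 68 isolates compared here correspond to 2,278 pairwise distances.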

Analysis of coding sequences for the main toxins

The presence of query coding sequences was evaluated using a pipeline composed of an initial step of mapping against reference sequences with the Burrows-Wheeler Aligner (BWA)⁵⁶, using the high-performance BWA-MEM algorithm for Illumina reads⁵⁷. Secondly, the conversion, sorting, and indexing of the reads were carried out with SAMtools⁵⁸. The pipeline was completed with a step that calculates quality statistics for the aligned sequences using flagstat and stats in SAMtools⁵⁸. The genomes were visualized in the Artemis tool⁵⁹.
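The mapping steps described above can be sketched as a sequence of command lines. The Python helper below only assembles the commands and does not execute them; the file names and the `mapping_commands` function are hypothetical, and the exact options used by the authors are not stated in the text, so the flags shown are a plausible minimal set rather than the published pipeline.

```python
from typing import List


def mapping_commands(reference: str, reads_1: str, reads_2: str, prefix: str) -> List[List[str]]:
    """Build the BWA and SAMtools command lines for one isolate (not executed here)."""
    sam = f"{prefix}.sam"
    bam = f"{prefix}.bam"
    sorted_bam = f"{prefix}.sorted.bam"
    return [
        ["bwa", "index", reference],                      # index the reference sequences
        ["bwa", "mem", reference, reads_1, reads_2],      # writes SAM to stdout; capture it into `sam`
        ["samtools", "view", "-b", "-o", bam, sam],       # conversion: SAM to BAM
        ["samtools", "sort", "-o", sorted_bam, bam],      # sorting by coordinate
        ["samtools", "index", sorted_bam],                # indexing
        ["samtools", "flagstat", sorted_bam],             # mapping summary statistics
        ["samtools", "stats", sorted_bam],                # detailed alignment statistics
    ]


# Hypothetical file names for a toxin-locus reference and one isolate's paired reads.
commands = mapping_commands("PaLoc_reference.fasta", "isolate_R1.fastq.gz", "isolate_R2.fastq.gz", "isolate")
for cmd in commands:
    print(" ".join(cmd))
```

Keeping each command as an argument list makes it straightforward to pass to `subprocess.run` later, with the output of `bwa mem` redirected to the SAM file.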

Identification of virulence factors from WGS

The identification of virulence factors in the complete genomes of the isolates focused on three main aspects: toxin-coding genes, other virulence factors, and antimicrobial resistance molecular markers (AMR-MM)⁸, using Ariba¹³ (Supplementary Fig. S1a). The genomes were compared against the entries in the eight databases available in this software (card⁶⁰, vfdb_core⁶¹, arganot⁶², megares⁶³, plasmidfinder⁶⁴, resfinder⁶⁵, srst2_arganot⁶² and virulencefinder⁶⁵). The results were represented graphically using the Plotly server⁶⁶. The presence of toxin-coding genes was confirmed by mapping against reference sequences of PaLoc and CdtLoc. CD-HIT software was used to cluster sequences highly similar to the proteins involved in the sporulation/germination processes⁶⁷. This analysis involved the definition of a set of reference proteins known to be involved in sporulation pathways in the C. difficile 630 uid57679 reference strain. The amino acid sequences of those reference proteins were exported from the ClosIndb database, a data repository for the analysis of Clostridium species¹⁴. An identity percentage ≥40.0% and a K-mer = 2 (defined as a subsequence of length K = 2, used to index the amino acid sequences) were the parameters considered for cluster definition.
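To make the K-mer parameter concrete, the sketch below lists the overlapping 2-mers of a protein sequence and counts how many two sequences share, which is the kind of word-level index a clustering tool can use before aligning. The peptide strings, the helper names and the command-line equivalent are assumptions: the study cites the CD-HIT Suite web server, and the `cd-hit` options shown (`-c` for the identity threshold, `-n` for the word length) are only how the same parameters would be expressed on the command line.

```python
from collections import Counter
from typing import List


def kmer_counts(protein: str, k: int = 2) -> Counter:
    """Count every overlapping subsequence of length k in an amino acid sequence."""
    return Counter(protein[i:i + k] for i in range(len(protein) - k + 1))


def shared_kmers(seq_a: str, seq_b: str, k: int = 2) -> int:
    """Number of k-mers common to both sequences, counting repeats."""
    return sum((kmer_counts(seq_a, k) & kmer_counts(seq_b, k)).values())


# Hypothetical short peptides differing at one position.
n_shared = shared_kmers("MKKLV", "MKKIV")

# Hypothetical command-line equivalent of the stated parameters (identity >= 40%, K = 2).
cd_hit_command: List[str] = [
    "cd-hit",
    "-i", "sporulation_proteins.faa",   # hypothetical input file
    "-o", "sporulation_clusters",       # hypothetical output prefix
    "-c", "0.40",
    "-n", "2",
]
print(n_shared, " ".join(cd_hit_command))
```

A word length of 2 is the setting CD-HIT recommends for identity thresholds between 0.4 and 0.5, which is consistent with the ≥40.0% cutoff stated above.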

Phenotypic characterization

The cryopreserved isolates were reactivated and used for: (i) cytotoxicity tests⁶⁸; (ii) determination of the minimum inhibitory concentration (MIC) of 10 antimicrobial agents (metronidazole, vancomycin, rifampicin, tetracycline, erythromycin, fusidic acid, moxifloxacin, clindamycin, ampicillin, and penicillin; Supplementary Table S5), reported as MIC₅₀ and MIC₉₀⁶⁹; and (iii) determination of sporulation efficiency and number of viable spores⁷⁰. A graphical description of the complete methodology used to phenotypically characterize the isolates is provided in Supplementary Fig. S1b. Chi-squared tests were performed to evaluate associations between the presence/absence of AMR-MMs and the categories established for the MIC₅₀ results.
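The two summary statistics named above can be sketched in a few lines of Python. MIC₅₀ and MIC₉₀ are taken here in their usual sense, the lowest concentration that inhibits at least 50% or 90% of the isolates; the MIC values and the 2 × 2 contingency table are invented for illustration, and the uncorrected Pearson statistic is an assumption, since the text does not say whether a continuity correction was applied.

```python
from typing import List, Sequence


def mic_percentile(mics: Sequence[float], percent: int) -> float:
    """Lowest MIC that inhibits at least `percent` % of the isolates."""
    ordered = sorted(mics)
    rank = (percent * len(ordered) + 99) // 100  # integer ceiling, 1-based rank
    return ordered[max(rank, 1) - 1]


def chi2_statistic(table: List[List[int]]) -> float:
    """Pearson chi-squared statistic for a contingency table (no continuity correction)."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / total
            stat += (observed - expected) ** 2 / expected
    return stat


# Hypothetical MICs (mg/L) for ten isolates against one antimicrobial.
toy_mics = [0.5, 1, 1, 2, 2, 4, 8, 8, 16, 32]
mic50 = mic_percentile(toy_mics, 50)
mic90 = mic_percentile(toy_mics, 90)

# Hypothetical table: rows = marker present/absent, columns = MIC above/below a cutoff.
toy_table = [[8, 2], [3, 7]]
chi2 = chi2_statistic(toy_table)
print(mic50, mic90, round(chi2, 3))
```

For a 2 × 2 table the statistic has one degree of freedom, so values above 3.841 correspond to p < 0.05.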

Data Availability

The set of genomes analyzed in this study was deposited at DDBJ/ENA/GenBank as part of the BioProject PRJNA551724.

References

Dapa, T. & Unnikrishnan, M. Biofilm formation by Clostridium difficile. Gut microbes 4, 397–402, https://doi.org/10.4161/gmic.25862 (2013).
Goudarzi, M., Seyedjavadi, S. S., Goudarzi, H., Mehdizadeh Aghdam, E. & Nazeri, S. Clostridium difficile Infection: Epidemiology, Pathogenesis, Risk Factors, and Therapeutic Options. Scientifica 2014, 916826, https://doi.org/10.1155/2014/916826 (2014).
Balassiano, I. T., Yates, E. A., Domingues, R. M. & Ferreira, E. O. Clostridium difficile: a problem of concern in developed countries and still a mystery in Latin America. Journal of medical microbiology 61, 169–179, https://doi.org/10.1099/jmm.0.037077-0 (2012).
Rineh, A., Kelso, M. J., Vatansever, F., Tegos, G. P. & Hamblin, M. R. Clostridium difficile infection: molecular pathogenesis and novel therapeutics. Expert review of anti-infective therapy 12, 131–150, https://doi.org/10.1586/14787210.2014.866515 (2014).
Carter, G. P., Rood, J. I. & Lyras, D. The role of toxin A and toxin B in the virulence of Clostridium difficile. Trends in microbiology 20, 21–29, https://doi.org/10.1016/j.tim.2011.11.003 (2012).
Lyras, D. et al. Toxin B is essential for virulence of Clostridium difficile. Nature 458, 1176–1179, https://doi.org/10.1038/nature07822 (2009).
Peng, Z. et al. Update on Antimicrobial Resistance in Clostridium difficile: Resistance Mechanisms and Antimicrobial Susceptibility Testing. Journal of clinical microbiology 55, 1998–2008, https://doi.org/10.1128/JCM.02250-16 (2017).
Awad, M. M., Johanesen, P. A., Carter, G. P., Rose, E. & Lyras, D. Clostridium difficile virulence factors: Insights into an anaerobic spore-forming pathogen. Gut microbes 5, 579–593, https://doi.org/10.4161/19490976.2014.969632 (2014).
Munoz, M. et al. New Insights into Clostridium difficile (CD) Infection in Latin America: Novel Description of Toxigenic Profiles of Diarrhea-Associated to CD in Bogota, Colombia. Frontiers in microbiology 9, 74, https://doi.org/10.3389/fmicb.2018.00074 (2018).
Tanner, H. E., Hardy, K. J. & Hawkey, P. M. Coexistence of multiple multilocus variable-number tandem-repeat analysis subtypes of Clostridium difficile PCR ribotype 027 strains within fecal specimens. Journal of clinical microbiology 48, 985–987, https://doi.org/10.1128/JCM.02012-09 (2010).
Figueras, M. J., Beaz-Hidalgo, R., Hossain, M. J. & Liles, M. R. Taxonomic affiliation of new genomes should be verified using average nucleotide identity and multilocus phylogenetic analysis. Genome announcements 2, https://doi.org/10.1128/genomeA.00927-14 (2014).
Griffiths, D. et al. Multilocus sequence typing of Clostridium difficile. Journal of clinical microbiology 48, 770–778, https://doi.org/10.1128/JCM.01796-09 (2010).
Hunt, M. et al. ARIBA: rapid antimicrobial resistance genotyping directly from sequencing reads. Microbial genomics 3, e000131, https://doi.org/10.1099/mgen.0.000131 (2017).
Polavarapu, R., Meetei, P. A., Midha, M., Bharne, D. & Vindal, V. ClosIndb: A resource for computationally derived information from clostridial genomes. Infection, genetics and evolution: journal of molecular epidemiology and evolutionary genetics in infectious diseases 33, 127–130, https://doi.org/10.1016/j.meegid.2015.04.020 (2015).
Bertelli, C. & Greub, G. Rapid bacterial genome sequencing: methods and applications in clinical microbiology. Clin Microbiol Infect 19, 803–813, https://doi.org/10.1111/1469-0691.12217 (2013).
Lekshmi, N., Joseph, I., Ramamurthy, T. & Thomas, S. Changing facades of Vibrio cholerae: An enigma in the epidemiology of cholera. The Indian journal of medical research 147, 133–141, https://doi.org/10.4103/ijmr.IJMR_280_17 (2018).
Martin, R. M. & Bachman, M. A. Colonization, Infection, and the Accessory Genome of Klebsiella pneumoniae. Frontiers in cellular and infection microbiology 8, 4, https://doi.org/10.3389/fcimb.2018.00004 (2018).
Ramirez-Vargas, G. et al. Novel Clade C-I Clostridium difficile strains escape diagnostic tests, differ in pathogenicity potential and carry toxins on extrachromosomal elements. Scientific reports 8, 13951, https://doi.org/10.1038/s41598-018-32390-6 (2018).
Munoz, M. et al. Community-acquired infection with hypervirulent Clostridium difficile isolates that carry different toxin and antibiotic resistance loci: a case report. Gut pathogens 9, 63, https://doi.org/10.1186/s13099-017-0212-y (2017).
Knight, D. R., Elliott, B., Chang, B. J., Perkins, T. T. & Riley, T. V. Diversity and Evolution in the Genome of Clostridium difficile. Clinical microbiology reviews 28, 721–741, https://doi.org/10.1128/CMR.00127-14 (2015).
Liao, F. et al. A retrospective study of community-acquired Clostridium difficile infection in southwest China. Sci Rep 8, 3992, https://doi.org/10.1038/s41598-018-21762-7 (2018).
Costa, C. L. et al. A MLST Clade 2 Clostridium difficile strain with a variant TcdB induces severe inflammatory and oxidative response associated with mucosal disruption. Anaerobe 40, 76–84, https://doi.org/10.1016/j.anaerobe.2016.06.005 (2016).
Khanna, S. & Pardi, D. S. Community-acquired Clostridium difficile infection: an emerging entity. Clin Infect Dis 55, 1741–1742, https://doi.org/10.1093/cid/cis722 (2012).
Valiente, E., Cairns, M. D. & Wren, B. W. The Clostridium difficile PCR ribotype 027 lineage: a pathogen on the move. Clin Microbiol Infect 20, 396–404, https://doi.org/10.1111/1469-0691.12619 (2014).
Michael, K. et al. Clostridium difficile Environmental Contamination within a Clinical Laundry Facility in the USA. FEMS Microbiol Lett, https://doi.org/10.1093/femsle/fnw236 (2016).
Salazar, C. L. et al. Subtyping of Clostridium difficile PCR ribotypes 591, 106 and 002, the dominant strain types circulating in Medellin, Colombia. PloS one 13, e0195694, https://doi.org/10.1371/journal.pone.0195694 (2018).
Stevenson, E. C., Major, G. A., Spiller, R. C., Kuehne, S. A. & Minton, N. P. Coinfection and Emergence of Rifamycin Resistance during a Recurrent Clostridium difficile Infection. J Clin Microbiol 54, 2689–2694, https://doi.org/10.1128/JCM.01025-16 (2016).
Collins, D. A., Putsathit, P., Elliott, B. & Riley, T. V. Laboratory-based surveillance of Clostridium difficile strains circulating in the Australian healthcare setting in 2012. Pathology 49, 309–313, https://doi.org/10.1016/j.pathol.2016.10.013 (2017).
Wang, B., Lv, Z., Zhang, P. & Su, J. Molecular epidemiology and antimicrobial susceptibility of human Clostridium difficile isolates from a single institution in Northern China. Medicine (Baltimore) 97, e11219, https://doi.org/10.1097/MD.0000000000011219 (2018).
Williamson, C. et al. A global to local genomics analysis of Clostridioides difficile ST1/RT027 identifies cryptic transmission events in a northern Arizona healthcare network. bioRxiv, https://doi.org/10.1101/544890 (2019).
Popoff, M. R. Clostridial pore-forming toxins: powerful virulence factors. Anaerobe 30, 220–238, https://doi.org/10.1016/j.anaerobe.2014.05.014 (2014).
Doosti, A. & Mokhtari-Farsani, A. Study of the frequency of Clostridium difficile tcdA, tcdB, cdtA and cdtB genes in feces of Calves in south west of Iran. Annals of clinical microbiology and antimicrobials 13, 21, https://doi.org/10.1186/1476-0711-13-21 (2014).
Dridi, L., Tankovic, J. & Petit, J. C. CdeA of Clostridium difficile, a new multidrug efflux transporter of the MATE family. Microbial drug resistance 10, 191–196, https://doi.org/10.1089/mdr.2004.10.191 (2004).
Drlica, K. & Zhao, X. DNA gyrase, topoisomerase IV, and the 4-quinolones. Microbiology and molecular biology reviews: MMBR 61, 377–392 (1997).
Nakata, N., Kai, M. & Makino, M. Mutation analysis of the Mycobacterium leprae folP1 gene and dapsone resistance. Antimicrobial agents and chemotherapy 55, 762–766, https://doi.org/10.1128/AAC.01212-10 (2011).
Ishikawa, J., Chiba, K., Kurita, H. & Satoh, H. Contribution of rpoB2 RNA polymerase beta subunit gene to rifampin resistance in Nocardia species. Antimicrobial agents and chemotherapy 50, 1342–1346, https://doi.org/10.1128/AAC.50.4.1342-1346.2006 (2006).
Shah, D. et al. Clostridium difficile infection: update on emerging antibiotic treatment options and antibiotic resistance. Expert review of anti-infective therapy 8, 555–564, https://doi.org/10.1586/eri.10.28 (2010).
Mullany, P., Allan, E. & Roberts, A. P. Mobile genetic elements in Clostridium difficile and their role in genome function. Research in microbiology 166, 361–367, https://doi.org/10.1016/j.resmic.2014.12.005 (2015).
Ardal, C. et al. International cooperation to improve access to and sustain effectiveness of antimicrobials. Lancet 387, 296–307, https://doi.org/10.1016/S0140-6736(15)00470-5 (2016).
Paredes-Sabja, D., Shen, A. & Sorg, J. A. Clostridium difficile spore biology: sporulation, germination, and spore structural proteins. Trends in microbiology 22, 406–416, https://doi.org/10.1016/j.tim.2014.04.003 (2014).
Stabler, R. A. et al. Macro and micro diversity of Clostridium difficile isolates from diverse sources and geographical locations. PloS one 7, e31559, https://doi.org/10.1371/journal.pone.0031559 (2012).
Dobrindt, U., Zdziarski, J., Salvador, E. & Hacker, J. Bacterial genome plasticity and its impact on adaptation during persistent infection. International journal of medical microbiology: IJMM 300, 363–366, https://doi.org/10.1016/j.ijmm.2010.04.010 (2010).
Shapiro, B. J., Levade, I., Kovacikova, G., Taylor, R. K. & Almagro-Moreno, S. Origins of pandemic Vibrio cholerae from environmental gene pools. Nature microbiology 2, 16240, https://doi.org/10.1038/nmicrobiol.2016.240 (2016).
Eckert, C., Burghoffer, B., Lalande, V. & Barbut, F. Evaluation of the chromogenic agar chromID C. difficile. Journal of clinical microbiology 51, 1002–1004, https://doi.org/10.1128/JCM.02601-12 (2013).
Cohen, S. H. et al. Clinical practice guidelines for Clostridium difficile infection in adults: 2010 update by the society for healthcare epidemiology of America (SHEA) and the infectious diseases society of America (IDSA). Infection control and hospital epidemiology 31, 431–455, https://doi.org/10.1086/651706 (2010).
Seemann, T. Prokka: rapid prokaryotic genome annotation. Bioinformatics 30, 2068–2069, https://doi.org/10.1093/bioinformatics/btu153 (2014).
Pruitt, K. D., Tatusova, T., Brown, G. R. & Maglott, D. R. NCBI Reference Sequences (RefSeq): current status, new features and genome annotation policy. Nucleic acids research 40, D130–135, https://doi.org/10.1093/nar/gkr1079 (2012).
Grant, J. R. & Stothard, P. The CGView Server: a comparative genomics tool for circular genomes. Nucleic acids research 36, W181–184, https://doi.org/10.1093/nar/gkn179 (2008).
Page, A. J. et al. Roary: rapid large-scale prokaryote pan genome analysis. Bioinformatics 31, 3691–3693, https://doi.org/10.1093/bioinformatics/btv421 (2015).
Galardini, M. roary_plots.py, an ipython script to visualize pangenome results. Pathogen Informatics, WSI (2017).
Edgar, R. C. MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC bioinformatics 5, 113, https://doi.org/10.1186/1471-2105-5-113 (2004).
Price, M. N., Dehal, P. S. & Arkin, A. P. FastTree: computing large minimum evolution trees with profiles instead of a distance matrix. Molecular biology and evolution 26, 1641–1650, https://doi.org/10.1093/molbev/msp077 (2009).
Wrobel, B. Statistical measures of uncertainty for branches in phylogenetic trees inferred from molecular sequences by using model-based methods. Journal of applied genetics 49, 49–67, https://doi.org/10.1007/BF03195249 (2008).
Letunic, I. & Bork, P. Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees. Nucleic acids research 44, W242–245, https://doi.org/10.1093/nar/gkw290 (2016).
Page, A. J. et al. SNP-sites: rapid efficient extraction of SNPs from multi-FASTA alignments. Microbial genomics 2, e000056, https://doi.org/10.1099/mgen.0.000056 (2016).
Jo, H. & Koh, G. Faster single-end alignment generation utilizing multi-thread for BWA. Bio-medical materials and engineering 26(Suppl 1), S1791–1796, https://doi.org/10.3233/BME-151480 (2015).
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760, https://doi.org/10.1093/bioinformatics/btp324 (2009).
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079, https://doi.org/10.1093/bioinformatics/btp352 (2009).
Carver, T., Harris, S. R., Berriman, M., Parkhill, J. & McQuillan, J. A. Artemis: an integrated platform for visualization and analysis of high-throughput sequence-based experimental data. Bioinformatics 28, 464–469, https://doi.org/10.1093/bioinformatics/btr703 (2012).
McArthur, A. G. et al. The comprehensive antibiotic resistance database. Antimicrobial agents and chemotherapy 57, 3348–3357, https://doi.org/10.1128/AAC.00419-13 (2013).
Chen, L., Zheng, D., Liu, B., Yang, J. & Jin, Q. VFDB 2016: hierarchical and refined dataset for big data analysis–10 years on. Nucleic acids research 44, D694–697, https://doi.org/10.1093/nar/gkv1239 (2016).
Gupta, S. K. et al. ARG-ANNOT, a new bioinformatic tool to discover antibiotic resistance genes in bacterial genomes. Antimicrobial agents and chemotherapy 58, 212–220, https://doi.org/10.1128/AAC.01310-13 (2014).
Lakin, S. M. et al. MEGARes: an antimicrobial resistance database for high throughput sequencing. Nucleic acids research 45, D574–D580, https://doi.org/10.1093/nar/gkw1009 (2017).
Carattoli, A. et al. In silico detection and typing of plasmids using PlasmidFinder and plasmid multilocus sequence typing. Antimicrobial agents and chemotherapy 58, 3895–3903, https://doi.org/10.1128/AAC.02412-14 (2014).
Kleinheinz, K. A., Joensen, K. G. & Larsen, M. V. Applying the ResFinder and VirulenceFinder web-services for easy identification of acquired antibiotic resistance and E. coli virulence genes in bacteriophage and prophage nucleotide sequences. Bacteriophage 4, e27943, https://doi.org/10.4161/bact.27943 (2014).
Plotly Technologies Inc. Collaborative data science. Plotly Technologies Inc., Montréal, QC, https://plot.ly (2015).
Huang, Y., Niu, B., Gao, Y., Fu, L. & Li, W. CD-HIT Suite: a web server for clustering and comparing biological sequences. Bioinformatics 26, 680–682, https://doi.org/10.1093/bioinformatics/btq003 (2010).
Reigadas, E. et al. Clinical significance of direct cytotoxicity and toxigenic culture in Clostridium difficile infection. Anaerobe 37, 38–42, https://doi.org/10.1016/j.anaerobe.2015.10.003 (2016).
Wiegand, I., Hilpert, K. & Hancock, R. E. Agar and broth dilution methods to determine the minimal inhibitory concentration (MIC) of antimicrobial substances. Nat Protoc 3, 163–175, https://doi.org/10.1038/nprot.2007.521 (2008).
Edwards, A. N. & McBride, S. M. Isolating and Purifying Clostridium difficile Spores. Methods in molecular biology 1476, 117–128, https://doi.org/10.1007/978-1-4939-6361-4_9 (2016).

Acknowledgements

We would like to thank Claudia Chica and Yamile Alfonso, from the clinical laboratories of the participating healthcare centers, for their support during the collection and storage of the fecal samples used to establish the clinical isolates. The authors would also like to acknowledge the support of the Wellcome Sanger Institute Pathogen Informatics Team. Thanks to Daniel Paredes Sabja from the Microbiota-Host Interactions and Clostridia Research Group, Universidad Andrés Bello, Santiago, Chile and Clara Lina Salazar, from the Departamento de Estudios Básicos Integrados, Universidad de Antioquia, Medellin, Colombia, for technical support in the phenotypic tests. MM dedicates this publication to Belsy Díaz and Luis Carlos Muñoz for their support during her life and scientific career, and for being in addition to excellent parents, her best friends. We thank Margaret Biswas, PhD, from Edanz Group (www.edanzediting.com/ac) for editing a draft of this manuscript. In loving memory of Jorge Arturo Ramírez Uribe. The PhD programme of MM and MC was funded by the Departamento Administrativo de Ciencia, Tecnología e Innovación (Colciencias) within the framework of the National Program for Promoting Research Training (sponsorship call 617). This work was funded by the DIRECCION DE INVESTIGACION E INNOVACION from Universidad del Rosario. The authors extend their gratitude to the Wellcome Trust for supporting the whole genome analyses developed in the context of the interaction between the participating research groups (grant number: 098051). The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Author information

Authors and Affiliations

Grupo de Investigaciones Microbiológicas–UR (GIMUR), Departamento de Biología, Facultad de Ciencias Naturales y Matemáticas, Universidad del Rosario, Bogotá, Colombia
Marina Muñoz, Daniel Restrepo-Montoya, Giovanny Herrera, Dora I. Ríos-Chaparro & Juan David Ramírez
Posgrado Interfacultades Doctorado en Biotecnología, Facultad de Ciencias, Universidad Nacional de Colombia, Bogotá, Colombia
Marina Muñoz
Genomics and Bioinformatics Department, North Dakota State University, Fargo, North Dakota, USA
Daniel Restrepo-Montoya
Host-Microbiota Interactions Laboratory, Wellcome Sanger Institute, Hinxton, UK
Nitin Kumar & Trevor D. Lawley
Microbial Genomics Laboratory, Institut Pasteur Montevideo, Montevideo, Uruguay
Gregorio Iraola
Center for Integrative Biology, Universidad Mayor, Santiago de Chile, Chile
Gregorio Iraola
Molecular Biology and Immunology Department, Fundación Instituto de Inmunología de Colombia (FIDIC), Bogotá, Colombia
Milena Camargo, Diana Díaz-Arévalo & Manuel A. Patarroyo
School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia
Milena Camargo & Manuel A. Patarroyo
Faculty of Animal Sciences, Universidad de Ciencias Aplicadas y Ambientales (UDCA), Bogotá, Colombia
Diana Díaz-Arévalo
Hygea group, Faculty of Health Sciences, Universidad de Boyacá, Tunja, Colombia
Diana Díaz-Arévalo
Centro de Investigaciones Odontológicas, Facultad de Odontología, Pontificia Universidad Javeriana, Bogotá, Colombia
Nelly S. Roa-Molina & Mayra A. Tellez
PhD Programme in Biomedical and Biological Sciences, Faculty of Natural Sciences and Mathematics/School of Medicine and Health Sciences, Universidad del Rosario, Bogotá, Colombia
Giovanny Herrera
Hospital Universitario Mayor – Méderi, Universidad del Rosario, Bogotá, Colombia
Claudia Birchenall, Darío Pinilla, Juan M. Pardo-Oviedo & Giovanni Rodríguez-Leguizamón
Fundación Clínica Shaio, Bogotá, Colombia
Diego F. Josa

Contributions

M.M. and J.D.R. designed the study and drafted the manuscript. M.M., D.R.M., N.K. and G.I. performed the bioinformatics analyses. M.M., M.C., D.D.A., N.S.R.M., M.A.T., G.H. and D.I.R.C. performed the phenotypic assays. C.B., D.P., J.M.P.O., G.R.L. and D.F.J. collected the samples and provided clinical information. T.D.L. and M.A.P. revised and edited the manuscript. All authors read and approved the final version of the manuscript.

Corresponding author

Correspondence to Juan David Ramírez.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Muñoz, M., Restrepo-Montoya, D., Kumar, N. et al. Integrated genomic epidemiology and phenotypic profiling of Clostridium difficile across intra-hospital and community populations in Colombia. Sci Rep 9, 11293 (2019). https://doi.org/10.1038/s41598-019-47688-2

Received: 27 March 2019
Accepted: 22 July 2019
Published: 05 August 2019
DOI: https://doi.org/10.1038/s41598-019-47688-2
Springer Nature Limited

This article is cited by

Clostridioides Difficile in Latin America: An Epidemiological Overview
- Claudia G Morales-Olvera
- Lorena Lanz-Zubiría
- Eva Juárez-Hernández
Current Microbiology (2023)
