Abstract
Escherichia coli is a pathogen commonly encountered in clinical laboratories, and is capable of causing a variety of diseases, both within the intestinal tract (intestinal pathogenic strains) and outside (extraintestinal pathogenic E. coli, or ExPEC). It is associated with urinary tract infections (UTIs), one of the most common infectious diseases in the world. This report represents the first comparative analysis of the draft genome sequences of 11 uropathogenic E. coli (UPEC) strains isolated from two tertiary hospitals located in Dhaka and Sylhet, Bangladesh, and is focused on comparing their genomic characteristics to each other and to other available UPEC strains. Multilocus sequence typing (MLST) confirmed the strains belong to ST59, ST131, ST219, ST361, ST410, ST448 and ST4204, with one of the isolates classified as a previously undocumented ST. De novo identification of the antibiotic resistance genes blaNDM-5, blaNDM-7, blaCTX-M-15 and blaOXA-1 was determined, and phenotypic-genotypic analysis of virulence revealed significant heterogeneity within UPEC phylogroups.
Similar content being viewed by others
Introduction
Urinary tract infections (UTIs) are the most common bacterial infections affecting approximately 11% of adult women each year globally, with approximately 60% of women experiencing UTI during their lifetime1, 2. Sporadic studies done on the prevalence of UTIs in Bangladesh and an investigation of 200 UTI patients, including men and women of various age groups, found females to be more susceptible to UTIs (80% positive) than males. In both genders, the prevalence rate was highest among those in the age group of 21–40 years (33%)3. The study also showed E. coli to be the predominant etiological agent, contributing to 57.38% of infections3.
Escherichia coli is an extremely diverse bacterial species which can be categorized into three major groups based on disease causing capability: commensal or nonpathogenic E. coli; intestinal pathogenic E. coli causing diarrhea; and extraintestinal pathogenic E. coli (ExPEC). The ExPEC term was described by Johnson et al. in 20004 and further subclassified as uropathogenic E. coli (UPEC), sepsis-associated E. coli (SEPEC), and neonatal meningitis-associated E. coli (MNEC)5. ExPECs are known to invade extraintestinal tissue and cause pathogenesis by harboring a variety of virulence factors, either present in the chromosome or carried in mobile genetic elements such as plasmids, thereby conferring greater diversity among ExPEC strains5,6.
Traditionally, E. coli phylogroups B2 and D have been understood to cause the majority of ExPEC infections, while phylogroups A and B1 were associated with comensal extraintestinal strains7. However, recent reports have revealed higher percentages of phylogroup A strains in UTI cases8. A strong association has also often been detected between a particular multilocus sequence type (MLST) with a pathology, such as the correlation of globally dominant E. coli ST131 and extraintestinal infections, especially in India9. Like ST131, many other successful clonal lineages of different sequence types (ST), including 410, 95 and 10 have disseminated globally due to their relatively higher virulence, fitness, and metabolic capabilities, along with acquisition of antibiotic resistance genes10,11,12,13.
Carbapenems are considered last-resort antibiotics, therefore resistance to this group of antibiotics is a greater health concern in treating infections caused by extended-spectrum-β-lactamase (ESBL)- or AmpC-producing bacteria14. New Delhi metallo-β-lactamase (NDM) is a relatively recent group of metallo-β-lactamase (MBL) that, over the last decade, has undergone rapid spread in the South-Asian continent15. While NDM producers have been found to be susceptible to a few antibiotics including colistin, several recent studies have reported that this treatment approach might not be sustainable and could become a very serious public health concern16,17. NDM genes are found both in plasmids and chromosomally integrated in various bacterial pathogens18,19. Reports of E. coli harboring NDM-1 and other ESBL genes such as CTX-M and OXA-48 have emerged from various parts of the world, including Japan, the Netherlands, South Korea, and Tanzania10,11,12,20,21,22,23. In addition, many such studies have detected the chromosomal integration of NDM genes in various ExPEC sequence types, such as ST38, ST410, ST131 and ST64824.
Association of different virulence factors, e.g., sat (secreted autotransporter toxin), iutA (aerobactin (siderophore) receptor), malX (pathogenicity island marker) and ompT (outer-membrane protease T), has also been reported, with specific sequence types such as ST38, ST131, ST405, and ST648 isolated25. In general, however, few genomic investigations have been done that could shed light on molecular mechanisms of pathogenesis and antibiotic resistance mechanisms and correlate those traits with the genotypes of local pathogens, especially in a developing country like Bangladesh.
To the best of our knowledge, there is no genomic information available on UPEC isolates circulating in Bangladesh. This study represents an initial effort to obtain genomic information on Bangladeshi UPEC isolates and to analyze genomic variations between Bangladeshi isolates and ExPECs from different parts of the world. Eleven strains representing different ExPEC phylogroups and antibiotic resistance were selected and their genomes determined using next-generation sequencing. Genotype–phenotype correlation analyses were also done on the isolates to determine virulence properties, e.g., biofilm formation, serum resistance, hemolysis, and antibiotic resistance.
Results
Antibiogram and phylogroup analysis
Presumptive identification using colony morphology revealed 47 of 74 (63.5%) bacterial isolates obtained from the Dhaka Central International Medical College and Hospital (DCIMCH) and 19 of 32 isolates (59.4%) from the Ibn Sina Hospital, Sylhet (ISH) were E. coli. All isolates from DCIMCH exhibited increased resistance to commonly used antimicrobials, including β-lactams (third- and fourth-generation cephalosporins), fluoroquinolones, and aminoglycosides (Table 1; Supplementary Table S1). Antibiotic sensitivity patterns differed between isolates collected from the two hospitals, with isolates from DCIMCH showing resistance to a larger number of antibiotics. For example, while most of the DCIMCH isolates were resistant to cefixime (85.1%), only 26.3% from ISH showed resistance to this antibiotic. DCIMCH isolates showed high frequency of resistance to second generation cephalosporin cefuroxime (83%), third generation cephalosporins ceftriaxone (80.9%) and ciprofloxacin (72.3%), and the monobactam, aztreonam (72.3%). ISH isolates, in contrast, showed resistance mainly to doxycycline (42.1%) and amoxicillin (36.8%). While ca. 17% (n = 8) of DCIMCH isolates conferred resistance to the carbapenem, imipenem, all of the ISH isolates were sensitive to carbapenems and all isolates included in this study were sensitive to colistin. Five isolates from DCIMCH were ESBL (Extended Spectrum β-lactamase) positive, but none from ISH were positive (Table S1).
Phylogroup determination based on PCR detection of chuA, yjaA and TspE4.C226, showed phylogroup B2 and phylogroup A to be most abundant, with B2 comprising 19 (40.4%) and A 14 (29.8%) of the 47 isolates (Table S2). A total of 11 strains (23.4%) were classified in phylogroup B1, while three (6.4%) were phylogroup D. Seven of 66 isolates harboured NDM-1 gene and all NDM positive strains were from DCIMCH. No association was observed between NDM and a particular phylogroup, with three strains from phylogroup A, two strains from B1, and one strain from B2 carrying the NDM gene.
Genomic features and strain characterization
Eleven isolates from the phylogroups were selected with the number of isolates from each phylogroup roughly proportional to prevalence of that phylogroup within the set of 47 isolates in this study. These were selected based on resistance patterns. Four isolates were selected from phylogroups A and B2, two from phylogroup B1 (NGE5 and NGCE100), and one from phylogroup D (NGE3). Combined length of contigs of the assembled genomes of each of the 11 strains ranged from ~ 4.3 to 5.4 Mbp, with N50 value (the minimum contig length required to cover 50% of the genome) ranging between 58,987 and 363,834 bp (Table 2). Size of the pangenome (i.e. total gene repertoire) was 16,797 genes and core genome 2,945 genes.
De novo analysis was used to confirm phylogroups of the assembled genomes and MLST analysis showed that, while strains belonging to a phylogroup were heterogenous in MLST types, there was direct correlation between serotype and MLST, with ST131 strains NGE7 and NGE16 both serotype O25:H4 (Table 3).
Phylogenetic and cluster dendrogram analysis
The 11 strains were compared to obtain the number of SNPs shared between any two strains. From the SNP matrix shown in Supplementary Table S3, isolates with the same ST shared a low SNP count, while isolates within the same phylogroup but different STs had high SNPs. For example, NGE7, NGE16, and reference strain (NA114) belong to ST131 and share a low SNP count of 411 bp, whereas ST219 strains (NGE9 and NGE6) and ST131 strains had high SNP count but were the same phylogroup.
Core alignment using parSNP aligned 189 of 402 UPEC strains available online (list of strains is given in Supplementary Table S4). A total of the 60,815 SNPs identified was extracted and linked to construct a midpoint rooted phylogenetic tree (Fig. 1), showing two major clades, Clade A and Clade B. Clade A branched into subclades, with strains from phylogroup A and B1 in one subclade and ST219 strain of phylogroup B2 in another. NGCE33 is an ESBL-containing, highly virulent strain of ST410 which, despite belonging to phylogroup A, clustered distantly from the rest of phylogroup A strains (NGE22, NGE4 and NGCE94) and closer to phylogroup B1 (NGE5, NGCE100 and SE11). Clade B also branched phylogroup D and B2 away from each other. It was observed that isolates obtained from urine and blood samples interleaved, without significant clustering of infection type. However, NGE5 (proposed as a new ST) in clade A, NGE7, and NGE16 (ST131) in clade B joined strains isolated from blood. Strains belonging to the same MLST were placed together in the phylogenetic tree.
Hierarchical clustering of the 11 UPEC strain sequences represents similarity of shared accessory genome content, yielding three major clusters, C1, C2, and C3, respectively (Fig. 2). Phylogroup D out grouped, forming a distinct cluster, C1, while a combination of phylogroups A, B1, and B2 joined the remaining two clusters (C2 and C3). C2 further divided into two distinguishable clusters comprising strains of ST131 family in one group and four multidrug resistant, highly virulent strains of various MLSTs in another. These four isolates included one ST448 strain, one ST361 strain carrying both blaNDM and ESBL genes, one “high-risk” clone of ST410 lineage27, and one resistant strain of the ST4204 family. The clustering pattern of C2 suggests sharing of accessory genes between highly virulent strains, irrespective of phylogroups and ST. C3 also separated into two clusters, comprising two less virulent phylogroup B2 strains of the ST219 family in one clade, and the moderately virulent ST4204 strain of phylogroup A (NGE4) and the phylogroup B1 strain of unknown ST (NGE5) in the other.
Accessory gene distribution responsible for this hierarchical clustering pattern is listed in Supplementary Table S5. Strains in cluster C2 share genes including OriC-binding nucleoid-associated protein (cnu), hemolysin expression-modulating protein (hha), suppressor of T4 td mutant (stpA) responsible for regulation of hemolysin (hly) gene expression and genes such as periplasmic inhibitor of g-type lysozyme (pliG) which provides lysozyme tolerance. Similarly, shared genomic content in cluster C3 involves genes RenD, ybcN and ybcK coding for uncharacterized prophage related proteins absent in C1 and C2 strains. Strains ST131 and ST219 belong to phylogroup B2 yet are located in two different clusters as ST131 strains contain 223 unique genes with significant enrichment in genes involved in biosynthetic and metabolic processes absent in ST219 strains, thus explaining the high numbers of SNPs between strains of the two STs.
De novo identification of antibiotic resistance markers
Genome annotation revealed that chromosomes of all strains sequenced had previously been reported to carry intrinsic antibiotic resistance genes, such as marA, gyrA, parC and parE28, as well as plasmid-mediated resistance genes belonging to AMR families, including β-lactamases, fluoroquinolones, aminoglycosides, macrolides, tetracyclines, trimethoprims, and sulfonamides. Genes associated with antibiotic resistance are shown in Fig. 3a.
Eleven isolates, irrespective of phylogroup and sequence type (ST), showed resistance to antibiotics according to presence of resistance genes. ESBL genotype blaCTX-M-15 is a predominant gene present in two NDM positive strains (NGCE100, NGCE94) and two ST131 strains (NGE16 and NGE7). NDM positive strains possessing blaNDM-5 and blaNDM-7 belong to phylogroup A and B1, and MLST groups ST361 and ST448, respectively. In addition, NGCE94 (ST361) contained β-lactamases blaTEM-1B and blaOXA-1, tetracycline-resistance tet(B), quinolone-resistance gene qepA, trimethoprim-resistance gene dfrA12 and chloramphenicol-resistance gene catA1. Variants of blaTEM and dfrA were also present in the highly resistant ST4204 strains (NGE4 and NGE22) belonging to the same phylogroup as ST361. Relatively less resistant ST219 strains (NGE6 and NGE9) of phylogroup B2 present a similar resistance pattern and contain resistance markers (tet(A) and qnrS1) common to phylogroup A rather than phylogroup B. Another member of phylogroup A (NGCE33) included highly resistant ST410 containing an array of β-lactamase genes including blaOXA-1B, blaCMY-2, blaDHA-1 and blaSHV-12. In addition, it contained a number of aminoglycosidase genes, fluoroquinolone-resistance gene qnrB4, macrolide gene mph(A) and trimethoprim-resistance genes drfA1 and drfA17. Genes strB and mph(A) were also shared by ST4204 isolates. Moreover, genes rmtB, aadA1, mph(A) and erythromycin-resistance gene erm(B) were harboured by NDM positive strains. Only a few resistance genes such as acc(6′)lb-cr, catB3 and blaOXA-1 were shared between the ST131 and ST410 strains.
Overall, the most resistant strain was NGCE33, based on genes coding for β-lactamase resistance, including the ESBL, blaCTX-M-15. This strain and NGCE100 were resistant to nitrofurantoin, a last resort nephrotoxic antibiotic that recently is more commonly used to treat carbapenem resistant UTIs.
Analysis of virulence profiles and genotype–phenotype correlation
UPEC pathogenesis encompasses a range of mechanisms including colonization of the urinary tract, protection against host defenses, and toxin production. Hemolysin production of 11 strains was tested, using blood agar and, while alpha hemolytic activity was observed only for ST131 strains NGE7 and NGE16, mild hemolysis was detected for NGE3, NGE4, and NGE22.
An important pathogenic determinant of UPEC is ability to form biofilm29. The biofilm formation assay results showed variation in biofilm formation both between and within the phylogroups (Fig. 4). Three strains (NGE22, NGCE94 and NGCE100) were classified as strong biofilm formers and four (NGE4, NGE7, NGE9 and NGE16) were moderate biofilm formers, based on specific biofilm formation value (SBF). Differences in biofilm forming ability were observed by sequence type, as exemplified by NGE7 and NGE16, both in the pandemic ST131 family, as well as ST219 strains (NGE6 and NGE9) and ST4204 strains (NGE4 and NGE22).
Bactericidal activity of serum complement acts as a major first-line defense against bacterial infection infiltrated into tissue30. In vitro serum resistance assay results showed all strains were susceptible to human serum bactericidal activity to varying degrees (Fig. 3). Except for NGE3, NGE5, and NGE6, they showed survival capacity equal to or greater than that of DH5α. While serum has a more pronounced bactericidal activity for strains belonging to phylogroup B1 and D, most strains of phylogroups A and B2 exhibited better survival in serum. Although little correlation was observed between degree of biofilm formation and serum resistance (r = 0.188) (Fig. 4), moderate/strong biofilm formers all showed greater survival in serum compared to weak biofilm formers.
Pan-genome analyses revealed virulomes of the 11 strains (Fig. 3b). In general, ST131 strains NGE7 and NGE16 carry an extensive repertoire of virulence genes. While the core genome encompasses gene families from different classes of virulence factors, not all are conserved, according to the SNP/bp ratio (Table 4).
Comparison with VFDB indicated adhesion factors belonging to the ecp (Escherichia coli common pilus), csg (Curli fibers) and fim (type 1 fimbriae) gene families are part of the core genetic pool, while afaB/C (adherence fibrillar adhesion) and sfa (S fimbrial adhesion) are absent. Variation was noted in presence of the foc (F1C fimbriae) and pap (pyelonephritis associated pili) family genes. SNP distributions show that while other members of the fim genes are moderately conserved, fimA displays high genetic variability with SNP/bp of 0.383. Among the conserved usp (universal stress protein) genes that are involved in bacterial adhesion, uspA also indicated greater variation with SNP/bp of 0.181. Among toxicity conferring genes, hemolysin toxin hlyA was detected in five strains, including the two ST131 strains. Other genes, such as hlyB, hlyC, hlyD, cnf-1 and sat, were present exclusively in only ST131 strains.
All 11 strains carried the well-characterized serum resistance gene ompA, while traT and bor were missing from their genomes. Only ST131 strains harbored iss (increased serum survival) another important gene in the serum resistome of E. coli. The genomes were also analyzed for presence of 56 genes recently characterized as belonging to the serum resistome of EC985 by transposon-directed insertion site sequencing (TraDIS)31. Most of these genes, including tol(A,B,Q) and rfaH were detected in the core genome. However, some essential genes were either completely absent (hyxA and hyxR) or not identical to the reference strain from the VFDB database (waaP and waaG).
Analysis of pathogenicity islands (PAIs)
CFT073, a well characterized pyelonephritogenic strain, harbours PAIs that suggest strong virulence when present in UPEC isolates32,33. Many UTI associated strains, as well as commensal E. coli, carry PAIs that were first identified in strain 53634. To determine genetic composition of PAIs, genomes of the 11 UPEC isolates were analyzed for presence of the CFT073 and 536 associated PAI genes as retrieved from the PAthogenisity Island DataBase (https://www.paidb.re.kr/) (Supplementary Table S6).
Analysis of PAI ICFT073 (Fig. 5a) showed that only two ST131 strains (NGE7 and NGE16) and NGE3 carried malX, a phosphotransferase system enzyme coding gene linked with occurrence of extraintestinal infections. The PAI ICFT073 gene dadX, however, was detected in all strains, while other genes, such as those coding for the motility regulating factors ycgR and emtA, a peptidogycan recycling enzyme ldcA (L, D-carboxypeptidase A) and cvrA (putative K+:H+ antiporter), were detected in all strains except NGE9. Apart from cad (BAC) which was present in all strains, most of the PAI IICFT073 associated genes (Fig. 5b) were found only in the ST131 isolates, NGE7 and NGE16. However, a phylogroup A strain NGCE94, was found to carry several genes of the pap (P fimbriae coding) operon indicating the possibility of acquisition of PAI IICFT073 from more virulent strains in the genitourinary microenvironment.
Analysis of Strain 536 associated genetic elements showed that PAI I536 (containing hemolysin genes hlyA, B, C, and D) as well as PAI II536 were exclusively found in NGE7 and NGE16. PAI III536 gene yciC (UPF0259 membrane protein) was only present in NGE3, NGE7 and NGE16. Furthermore, all strains apart from NGE3, NGE4 and NGE22 were enriched with genes from PAI V536. PAI IV536 genes on the contrary, had more diffuse distribution among the strains with only mtfA (mnemonic for Mlc titration factor A). That PAI IV536 is considered more stable than other PAIs of strain 53635 may explain these results.
Discussion
High-throughput sequencing technology development has led to a significant decrease in the cost of whole genome sequencing of bacterial pathogens so that sequencing is routine in developed countries. Infrastructure for next generation sequencing is now being developed in research centers located in developing countries including Bangladesh. The NSU Genome Research Institute (NGRI) at North South University, Bangladesh, has been established and aims to decipher whole genomes of bacterial pathogens of public health concern to Bangladesh. This initial study was carried out to determine genome characteristics of UPEC strains circulating in the country by sequencing 11 UPEC strains representing prevalent ExPEC phylogroups and antibiotic resistance profiles.
ST410 has been previously detected in Southeast Europe, Middle East and Greece28 and classified as a “high-risk clone” with essential regular monitoring due to its enhanced resistance mechanisms and moderate virulence36. The resistance and virulence patterns of ST410 strain NGCE33 validates this finding, and its very first identification in Bangladesh emphasizes the extent of its spread across continents. This study also reports an unknown MLST type i.e. NGE5, an observation that highlights the rapidly mutating nature of the UPEC genome and indicates the value of genomic characterization of local isolates.
The phylogenetic tree based on difference in SNPs (Fig. 1) showed phylogroups A and B1 branching separately from phylogroups B2 and D. This finding supports previous reports of phylogroup A and B1 belonging to sister lineages36,37. Phylogroup D and B2 can be concluded to have the same ancestral origin since most of the strains of these phylogroups are located in the same clade (Clade B) in SNP tree. A lack of clear phylogenomic separation of strains isolated from urine and blood coincides with results of previous related work28,38. The difference in SNP counts observed in this study further elucidates genomic relatedness since these sites account for the variation in nucleotide sequence. It was observed that ST219 and ST131 strains have an increased number of SNP difference despite belonging to the same phylogroup and thus cluster in separate clades in the phylogenetic tree. This may be due to difference in pathogenic capabilities and resistance potentials between STs, i.e., gain and loss of accessory genome in genomic diversity, with strains harbouring different fitness factors and MDR genotypes, irrespective of ST and phylogroup. Comparative genomic analysis of the sequenced UPEC strains showed that the phylogenetic analysis is congruous with serotype, sequence type, virulence and AMR pattern.
This is further supported by results of the genomic and phenotypic analyses (Fig. 4). The genetic architectures of strains included in this study are concordant with previous findings, with well known pandemic strains such as ST131 possessing many virulence genes and enriched PAIs9. However, as supported by the results of this study, although ST131 strains exhibit a notably dense virulome compared to other sequenced strains, a stark difference was not observed between the phenotypic virulence of ST131 and non ST131 strains (Fig. 4). Strains within the same phylogroup also displayed varied levels of virulence. While ST131 strains of phylogroup B2 (NGE7 and NGE16) displayed relatively strong phenotypic virulence, another strain NGE6, also classified under phylogroup B2 but of ST219 origin, displayed a much weaker pathogenic potential, such as weak biofilm formation.
Closer scrutiny of virulomes of the sequenced strains reveal certain virulence factors pertaining to resistance against serum bactericidal activity were either completely missing or present in degraded form within the bacterial genomes. These discrepancies explain that while most strains are capable of enduring serum bactericidal activity, they do not possess the robust serum resistome required to overcome complement action and proliferate in serum. The lack of any clear association between a particular gene and a given virulence phenotype suggests that likely there is a combinatorial effect of genes on pathogenic potential. E. coli possesses an open pan-genome by continuing gene acquisition, as found in other studies38,39. This characteristic may also explain phenotypic results as mentioned above and acquisition of yet-to-be characterized new genes may determine pathogenicity of strains. Several recent studies report an altered pathogenic potential of commensal E. coli40 and a similar observation can be made from this study. Emergence of highly virulent strains belonging to phylogroup A may be due to the open pan genome nature of E. coli, allowing it to acquire new virulence factors and resistance markers. SNP analyses of core virulence factors reveal that certain genes within a given gene family, such as fimA, are prominently more polymorphic compared with other members.
Several findings have indicated that septicemic/pyelonephritogenic strains carry certain virulent genes located in mobile genetic elements, called pathogenicity islands (PAI), usually absent in avirulent or less virulent strains41. Of all the strains sequenced in this study, ST131 strains NGE7 and NGE16 predictably possessed the most gene-dense CFT073 and 536 associated PAIs (Fig. 5). However, the presence of a large number of PAI IICFT073 mediated adhesion factors in the genome of NGCE94 indicates dissemination of virulence traits via horizontal transfer of PAIs from commonly virulent phylogroup B2 strains to less virulent phylogroup A strains. Transfer and evolution of genetic elements like PAIs contribute to fitness and pathogenic properties of UPEC.
Rapid emergence of multidrug resistant (MDR) strains of E. coli is a very serious concern, especially in a developing country like Bangladesh which experiences antibiotic misuse. Most resistance properties emerge via intra-species horizontal resistance gene transfer42. Clearly, however, excessive use of antibiotics in the global community creates evolutionary pressure towards enhanced resistance of UPEC28. Reported spread of the pan beta-lactam antibiotic resistance blaNDM family of genes is a major cause for concern, because of resistance conferred against penems, cephamycins, cephalosporins, and carbapenems, as well as horizontal transfer since the gene is located in a plasmid. This study reveals the presence of NDM gene varieties and a number of other ESBL genes in phylogroup A and B1 strains. In addition, the spread of virulence properties to strains like NGCE94, which is both NDM and ESBL positive, can have a profound effect on healthcare and spread of disease in Bangladesh.
In conclusion, this study presents useful insight into the genomes of Bangladeshi UPEC isolates, notably reporting for the first time an emerging pandemic clone ST410 in Bangladesh, contributing to the global distribution of this lineage. The study also demonstrates that strains belonging to phylogroup A exhibit virulence characteristics comparable to globally predominant known virulent ST131 (phylogroup B2) isolates, while other phylogroup B2 strains, such as ST219, display lower pathogenic potential. It also substantiates classification based on sequence type being an improved measure of genomic relatedness and pathogenicity. The risks posed by emerging pathogenic strains within different phylogroups need further assessment using comparative genomics and larger sample size.
Methods
Selection of isolates and antibiotic resistance profiling
A total of 106 bacterial isolates were cultured from urine of patients suffering from UTI and admitted to either the intensive care or emergency unit of two tertiary hospitals in Bangladesh from the period of June, 2017 and July, 2018. A total of 74 isolates were from inpatients admitted to Dhaka Central International Medical College and Hospital (DCIMCH), Dhaka and 32 isolates were collected from inpatients at Ibn Sina Hospital, Sylhet (ISH). E. coli colonies were presumptively identified by their colony morphology on MacConkey agar. Antibiotic susceptibility was carried out using the disc diffusion method and included 22 antibiotics for the 47 strains isolated from DCIMCH and 16 antibiotics for the 19 ISH strains (Table S1). Results were interpreted according to the 27th edition of Clinical and Laboratory Standards Institute (CLSI) guidelines. The isolates were transferred to North South University Genome Research Institute (NGRI), Dhaka, Bangladesh for further analysis.
PCR amplification and gel electrophoresis
The 66 E. coli isolates were inoculated into LB broth (HiMedia, India), grown at 37 °C overnight, and DNA was extracted using GeneJET Genomic DNA Purification Kit (Cat. No K0721) (ThermoFisher Scientific, USA) according to manufacturer’s protocol. Conventional singleplex PCR was carried out to detect the NDM gene43, and to classify strains into phylogroups using previously described protocols (Table S2)26. In brief, 12.5 µl of reaction volume was used containing 6.25 µl DreamTaq Green PCR Master Mix (ThermoFisher Scientific, USA), 1.0 µl 25 nmol of MgCl2, 20 pmol of forward and reverse primers and ca. 100 ng DNA. Amplification was carried out using GeneAtlas (Astec Co, Ltd), with the following assay conditions: denaturation at 94 °C for 5 min; 30 cycles of 30 s at 94 °C, 30 s at annealing temperature, 30 s at 72 °C, and final extension at 72 °C for 5 min. Agarose gel electrophoresis was used to visualize banding patterns of the strains.
Genome assembly and annotation
Library preparation and sequencing of the 11 selected strains were conducted at NGRI. Ca. 1 μg of high molecular weight E. coli genomic DNA was used to prepare Illumina libraries and employing Nextera DNA Library Preparation Kit (Cat. No. FC-121-1030) according to manufacturer’s guideline. De novo assembly of good quality paired-end Illumina reads (Q ≥ 30) was done by running genome assembly software SPAdes (v3.12)44 with filters to decrease the number of mismatches and short indels. Assembled contigs were annotated using PROKKA pipeline45 with contiglength < 500 bp filtered out. Possible genomic contaminations were assessed using the ContEst16S tool46. Pan and core genome size of the 11 isolates and reference genome NA11447 were identified using the GF (Gene Family) method of pan-genome analysis pipeline (PGAP) (v1.2.1)48. Further pan and core genome analyses were performed using Roary49. Hierarchical clustering based on presence and absence of accessory genes, was performed using PVclust: R package50, based on bootstrap resampling to generate p-values. The bootstrap value was set to n = 1,000, to cross-validate the clustering pattern. The functions of the accessory genes were analysed via STRING database (https://string-db.org/).
Evolutionary relationship and phylogenomic analysis
Single Nucleotide Polymorphism (SNP) matrix was generated using CSIphylogeny 1.4 (Conserved Signature Indels) (Table S3)51. The 11 sequenced UPEC isolates from this study, 386 publicly available published genomes12,38, and a few reference strains of diverse STs and phylogroups, including NA11447, CFT07352, IAI3953, SE1154 and BW295255 (Table S4), were aligned to generate a core alignment in order to derive whole genome SNP using Parsnp v1.2 from the Harvest suite56. SNP file was processed by SNPRelate: R package57 and phylogenomic tree was visualized using FigTree (https://tree.bio.ed.ac.uk/software/figtree/).
In silico analysis of UPEC genome sequences
Phylogroups were confirmed based on presence of marker genes arpA, chuA, yjaA and TspE4.C226 using local BLAST the scheme set by Clermont et al. in 201258. The ST of each annotated genome was extracted from the MLST 2.0 (Multi-locus sequence typing) database59, and serotypes were determined using SerotypeFinder 2.060. Similarly, antimicrobial resistance (AMR) genes and plasmids of each isolate was obtained from ResFinder 3.161 and PlasmidFinder 2.062 respectively. Furthermore, a virulence factor profile was generated by amalgamation of results obtained using BLASTp against the Virulence Factors of Bacterial Pathogens database (VFDB)63 that had been made available in 2016, and the tool VirulenceFinder 2.051. SNPs per gene were calculated using DnaSP v664. To study the genetic composition of possible PAIs, genomes of the 11 UPEC isolates were analysed for presence of CFT073 and 536 associated PAI genes retrieved from the PAthogenisity Island DataBase (https://www.paidb.re.kr/)65 and visualized using CIRCOS66.
Phenotypic virulence determination
Alpha and beta haemolytic reactions of the strains were demonstrated using blood agar (Oxoid, UK), prepared using sheep blood. A single isolated colony for each strain was streaked on a blood agar plate which was incubated overnight at 37 °C. Partially clear and completely clear zones around the colony were indicative of alpha and beta hemolytic activity respectively.
Biofilm formation assays were performed using previously described protocols with minor modifications13. Bacteria were grown overnight in M63 at 37 °C after which 2 µl aliquots were added to 198 µl fresh M63 medium in a sterile 96-well polystyrene microtitre plate with four replicate wells for each strain. M63 broth without inoculum served as negative control. The plates were incubated statically at 37 °C for 24 h and OD600 was measured both at the beginning of incubation (0 h) and end of incubation at 24 h (GloMax, Promega). The culture was then discarded and plates were gently washed twice with sterile saline and air dried. Ca. 250 µl 0.1% crystal violet was added to the wells and allowed to stain for 15 min. The plates were then washed thrice with distilled water and air dried. The stained bacterial cells were resolubilized in 200 µl of 33% glacial acetic acid and the plates were read at 560 nm to enumerate cells in the biofilm. Specific biofilm formation (SBF) was measured using the formula SBF = (AB – CW)/G, where AB is OD560 of stained cells, CW is OD560 of control wells, and G is bacterial cell growth calculated using the formula G = OD600nm(24 h) – OD600nm(0 h). The strains were classified as follows: SBF < 0.5 = weak biofilm former, 0.5 ≤ SBF < 1.0 = moderate biofilm former and SBF ≥ 1.0 = strong biofilm former67. The entire assay was performed at least twice for each strain.
Assay for serum resistance was performed using a slightly modified version of previously described protocols13. Ca. 5 µl from overnight cultures was added to 495 µl fresh LB broth (HiMedia, India) and inoculated statically for 2 h at 37 °C. The culture was then centrifuged at 5,000×g for 7 min and the pellet obtained was suspended in 500 µl of sterile saline. Ca. 20 µl aliquots from this mixture were transferred to 180 µl of normal human serum (NHS) in a sterile 96-well microtitre plate and incubated at 37 °C under static conditions for 3 h. Ca. 20 µl was removed from the culture at 0 h and after 3 h incubation and plated on LB plates after serial dilution. Bacteria were enumerated after the plates had incubated overnight at 37 °C. Resistance to serum was measured as percentage change in colony forming units (CFU) at the beginning and end of the incubation period. The entire experiment was run in duplicate.
Ethical statements
All isolates included in this study were collected for diagnostic purposes from two local tertiary hospitals where pathogens are isolated from clinical specimens as part of a routine diagnostic procedure and not for experimental purposes. All experiments and methods were carried out in accordance with relevant guidelines and regulations. All experimental protocols were approved by the North South University (NSU) Institutional Review Board (IRB) / Ethical Review Committee (ERC), protocol No. CTRG:NSU-RP-18-042. Clinical isolates used in this study were recovered for diagnostic purposes from local diagnostic centers or hospitals and were not experimental in nature. The clinical data were anonymized and unlinked and the requirement for informed consent was waived by the NSU IRB/ERC.
Data access
The 11 uropathogenic E. coli genome sequences and analysis from this study have been submitted to GenBank database, with accession numbers that include QEXN00000000 (NGE3), QFAZ00000000 (NGE4), RCIF00000000 (NGE5), RCIE00000000 (NGE6), QFRN00000000 (NGE7), QFRT00000000 (NGE9), QFTM00000000 (NGE16), QFXA00000000 (NGE22), RBWA00000000 (NGCE33), RAZR00000000 (NGCE94) and RAZQ00000000 (NGCE100).
References
Foxman, B. Epidemiology of urinary tract infections: Incidence, morbidity, and economic costs. Dis. Mon. 49, 53–70 (2003).
Foxman, B., Barlow, R., D’Arcy, H., Gillespie, B. & Sobel, J. D. Urinary tract infection: self-reported incidence and associated costs. Ann. Epidemiol. 10, 509–515 (2000).
Sanjee, S. A. et al. Prevalence and antibiogram of bacterial uropathogens of urinary tract infections from a tertiary care hospital of Bangladesh. J. Sci. Res. 9, 317 (2017).
Johnson, J. R. & Stell, A. L. Extended virulence genotypes of Escherichia coli strains from patients with urosepsis in relation to phylogeny and host compromise. J. Infect. Dis. 181, 261–272 (2000).
Horner, C. et al. Escherichia coli bacteraemia: 2 years of prospective regional surveillance (2010–12). J. Antimicrob. Chemother. 69, 59–66 (2014).
Wirth, T. et al. Sex and virulence in Escherichia coli: an evolutionary perspective. Mol. Microbiol. 60, 1136–1151 (2006).
Sarowska, J. et al. Virulence factors, prevalence and potential transmission of extraintestinal pathogenic Escherichia coli isolated from different sources: recent reports. Gut Pathog. 11, 1–16 (2019).
Khairy, R. M., Mohamed, E. S., Ghany, H. M. A. & Abdelrahim, S. S. Phylogenic classification and virulence genes profiles of uropathogenic E. coli and diarrhegenic E. coli strains isolated from community acquired infections. PLoS ONE https://doi.org/10.1371/journal.pone.0222441 (2019).
Shaik, S. et al. Comparative genomic analysis of globally dominant ST131 clone with other epidemiologically successful extraintestinal pathogenic Escherichia coli (ExPEC) lineages. MBio 8, e01596-e1617 (2017).
Lau, H. L. et al. Major uropathogenic Escherichia coli strain isolated in the Northwest of England identified by multilocus sequence typing. J. Clin. Microbiol. 46, 1076–1081 (2008).
Aibinu, I., Odugbemi, T., Koenig, W. & Ghebremedhin, B. Sequence type ST131 and ST10 complex (ST617) predominant among CTX-M-15-producing Escherichia coli isolates from Nigeria. Clin. Microbiol. Infect. 18, E49–E51 (2012).
Petty, N. K. et al. Global dissemination of a multidrug resistant Escherichia coli clone. Proc. Natl. Acad. Sci. 111, 5694–5699 (2014).
Nandanwar, N. et al. Extraintestinal pathogenic Escherichia coli (ExPEC) of human and avian origin belonging to sequence type complex 95 (STC95) portray indistinguishable virulence features. Int. J. Med. Microbiol. 304, 835–842 (2014).
Fair, R. J. & Tor, Y. Antibiotics and bacterial resistance in the 21st century. Perspect. Med. Chem. 6, 25–64 (2014).
Islam, M. A. et al. Environmental spread of New Delhi metallo-β-lactamase-1-producing multidrug-resistant bacteria in Dhaka, Bangladesh. Appl. Environ. Microbiol. 83, e00793-e817 (2017).
Peirano, G., Schreckenberger, P. C. & Pitout, J. D. D. Characteristics of NDM-1-producing Escherichia coli isolates that belong to the successful and virulent clone ST131. Antimicrob. Agents Chemother. 55, 2986–2988 (2011).
Tada, T. et al. NDM-8 metallo-βlactamase in a multidrug-resistant Escherichia coli strain isolated in Nepal. Antimicrob. Agents Chemother. 57, 2394–2396 (2013).
Poirel, L., Dortet, L., Bernabeu, S. & Nordmann, P. Genetic features of blaNDM-1-positive Enterobacteriaceae. Antimicrob. Agents Chemother. 55, 5403–5407 (2011).
Hu, H. et al. Novel plasmid and its variant harboring both a bla NDM-1 gene and type IV secretion system in clinical isolates of Acinetobacter lwoffii. Antimicrob. Agents Chemother. 56, 1698–1702 (2012).
Kumarasamy, K. K. et al. Emergence of a new antibiotic resistance mechanism in India, Pakistan, and the UK: a molecular, biological, and epidemiological study. Lancet Infect. Dis. 10, 597–602 (2010).
Poirel, L., Revathi, G., Bernabeu, S. & Nordmann, P. Detection of NDM-1-producing Klebsiella pneumoniae in Kenya. Antimicrob. Agents Chemother. 55, 934–936 (2011).
Dortet, L., Poirel, L. & Nordmann, P. Worldwide dissemination of the NDM-type carbapenemases in Gram-negative bacteria. Biomed. Res. Int. 2014, 249856 (2014).
Zou, D. et al. A novel New Delhi metallo-β-lactamase variant, NDM-14, isolated in a Chinese hospital possesses increased enzymatic activity against carbapenems. Antimicrob. Agents Chemother. 59, 2450–2453 (2015).
Falgenhauer, L. et al. Circulation of clonal populations of fluoroquinolone-resistant CTX-M-15-producing Escherichia coli ST410 in humans and animals in Germany. Int. J. Antimicrob. Agents 47, 457–465 (2016).
Pitout, J. D. D. Extraintestinal pathogenic Escherichia coli: a combination of virulence with antibiotic resistance. Front. Microbiol. https://doi.org/10.3389/fmicb.2012.00009 (2012).
Clermont, O., Bonacorsi, S. & Bingen, E. Rapid and simple determination of the Escherichia coli phylogenetic group. Appl. Environ. Microbiol. 66, 4555–4558 (2000).
Roer, L. et al. Escherichia coli sequence type 410 is causing new international high-risk clones. mSphere 3, e00337-18 (2018).
Abd El Ghany, M., Sharaf, H., Al-agamy, M. H., Shibl, A. & Hong, P. Genomic characterization of NDM-1 and 5, and OXA-181 carbapenemases in uropathogenic Escherichia coli isolates from Riyadh, Saudi Arabia. PLoS ONE 13, e0201613 (2018).
Kostakioti, M., Hadjifrangiskou, M. & Hultgren, S. J. Bacterial biofilms: development, dispersal, and therapeutic strategies in the dawn of the postantibiotic era. Cold Spring. Harb. Perspect. Med. 3, a010306 (2013).
Putrins, M. et al. Phenotypic heterogeneity enables uropathogenic Escherichia coli to evade killing by antibiotics and serum complement. Infect. Immun. 83, 1056–1067 (2015).
Phan, M. D. et al. The serum resistome of a globally disseminated multidrug resistant uropathogenic Escherichia coli clone. PLoS Genet. 9, e1003834 (2013).
Mobley, H. L. et al. Pyelonephritogenic Escherichia coli and killing of cultured human renal proximal tubular epithelial cells: role of hemolysin in some strains. Infect. Immun. 58, 1281–1289 (1990).
Kao, J. S., Stucker, D. M., Warren, J. W. & Mobley, H. L. T. Pathogenicity island sequences of pyelonephritogenic Escherichia coli CFT073 are associated with virulent uropathogenic strains. Infect. Immun. 65, 2812–2820 (1997).
Sabate, M., Moreno, T., Perez, T., Andreu, A. & Prats, G. Pathogenicity island markers in commensal and uropathogenic Escherichia coli. Clin. Microbiol. Infect. Dis. 12, 880–886 (2006).
Middendorf, B., Hochhut, B., Leipold, K., Dobrindt, U. & Blum-oehler, G. Instability of pathogenicity islands in uropathogenic Escherichia coli 536. J. Bacteriol. 186, 3086–3096 (2004).
Gordon, D. M., Clermont, O., Tolley, H. & Denamur, E. Assigning Escherichia coli strains to phylogenetic groups: multi-locus sequence typing versus the PCR triplex method. Environ. Microbiol. 10, 2484–2496 (2008).
Ciccozzi, M. et al. Phylogenetic analysis of multidrug-resistant Escherichia coli clones isolated from humans and poultry. New Microbiol. 36, 385–394 (2013).
Salipante, S. J. et al. Large-scale genomic sequencing of extraintestinal pathogenic Escherichia coli strains. Genome Res. 25, 119–128 (2015).
Rasko, D. A. et al. The pangenome structure of Escherichia coli: comparative genomic analysis of E. coli commensal and pathogenic isolates. J. Bacteriol. 190, 6881–6893 (2008).
Madoshi, B. P. et al. Characterisation of commensal Escherichia coli isolated from apparently healthy cattle and their attendants in Tanzania. PLoS ONE 11, e0168160 (2016).
Vejborg, R. M., Hancock, V., Schembri, M. A. & Klemm, P. Comparative genomics of Escherichia coli strains causing urinary tract infections. Appl. Environ. Microbiol. 77, 3268–3278 (2011).
Cordoni, G. et al. Comparative genomics of European avian pathogenic E. coli (APEC). BMC Genomics 17, 960 (2016).
Liang, Z. et al. Molecular basis of NDM-1, a new antibiotic resistance determinant. PLoS ONE 6, e23606 (2011).
Bankevich, A. et al. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J. Comput. Biol. 19, 455–477 (2012).
Seemann, T. Prokka: rapid prokaryotic genome annotation. Bioinformatics 30, 2068–2069 (2014).
Lee, I. et al. ContEst16S: an algorithm that identifies contaminated prokaryotic genomes using 16S RNA gene sequences. Int. J. Syst. Evol. Microbiol. 67, 2053–2057 (2017).
Avasthi, T. S. et al. Genome of multidrug-resistant uropathogenic Escherichia coli strain NA114 from India. J. Bacteriol. 193, 4272–4273 (2011).
Zhao, Y. et al. PGAP: pan-genomes analysis pipeline. Bioinformatics 28, 416–418 (2012).
Page, A. J. et al. Roary: rapid large-scale prokaryote pan genome analysis. Bioinformatics 31, 3691–3693 (2015).
Suzuki, R. & Shimodaira, H. Pvclust: an R package for assessing the uncertainty in hierarchical clustering. Bioinformatics 22, 1540–1542 (2006).
Joensen, K. G. et al. Real-time whole-genome sequencing for routine typing, surveillance, and outbreak detection of verotoxigenic Escherichia coli. J. Clin. Microbiol. 52, 1501–1510 (2014).
Welch, R. A. et al. Extensive mosaic structure revealed by the complete genome sequence of uropathogenic Escherichia coli. PNAS 99, 17020–17024 (2002).
Touchon, M. et al. Organised genome dynamics in the Escherichia coli species results in highly diverse adaptive paths. PLoS Genet. 5, e1000344 (2009).
Oshima, K. et al. Complete genome sequence and comparative analysis of the wild-type commensal Escherichia coli strain SE11 isolated from a healthy adult. DNA Res. 15, 375–386 (2008).
Ferenci, T. et al. Genomic sequencing reveals regulatory mutations and recombinational events in the widely used MC4100 lineage of Escherichia coli K-12. J. Bacteriol. 191, 4025–4029 (2009).
Treangen, T. J., Ondov, B. D., Koren, S. & Phillippy, A. M. The Harvest suite for rapid core-genome alignment and visualization of thousands of intraspecific microbial genomes. Genome Biol. 15, 524 (2014).
Zheng, X. et al. A high-performance computing toolset for relatedness and principal component analysis of SNP data. Bioinformatics 28, 3326–3328 (2012).
Clermont, O., Christenson, J. K., Denamur, E. & Gordon, D. M. The Clermont Escherichia coli phylo-typing method revisited: improvement of specificity and detection of new phylo-groups. Environ. Microbiol. Rep. 5, 58–65 (2013).
Larsen, M. V. et al. Multilocus sequence typing of total-genome-sequenced bacteria. J. Clin. Microbiol. 50, 1355–1361 (2012).
Joensen, K. G., Tetzschner, A. M. M., Iguchi, A., Aarestrup, F. M. & Scheutz, F. Rapid and easy in silico serotyping of Escherichia coli isolates by use of whole-genome sequencing data. J. Clin. Microbiol. 53, 2410–2426 (2015).
Zankari, E. et al. Identification of acquired antimicrobial resistance genes. J. Antimicrob. Chemother. 67, 2640–2644 (2012).
Carattoli, A. et al. In silico detection and typing of plasmids using plasmidfinder and plasmid multilocus sequence typing. Antimicrob. Agents Chemother. 58, 3895–3903 (2014).
Chen, L. et al. VFDB: a reference database for bacterial virulence factors. Nucleic Acids Res. 33, D325–D328 (2005).
Rozas, J. et al. DnaSP 6: DNA sequence polymorphism analysis of large data sets. Mol. Biol. Evol. 34, 3299–3302 (2017).
Yoon, S. H., Park, Y. K. & Kim, J. F. PAIDB v2.0: exploration and analysis of pathogenicity and resistance islands. Nucleic Acids Res. 43, D624–D630 (2015).
Krzywinski, M. et al. Circos: an information aesthetic for comparative genomics. Genome Res. 19, 1639–1645 (2009).
Martinez-Medina, M. et al. Biofilm formation as a novel phenotypic feature of adherent-invasive Escherichia coli (AIEC). BMC Microbiol. 9, 202 (2009).
Author information
Authors and Affiliations
Contributions
M.H.: conceptualization, supervision, computational analysis, writing, reviewing, editing; T.T.: computational and data analysis, writing, editing; A.R.: laboratory experiments, data analysis, writing, editing: Ar.H.: data collection, data analysis, writing; T.A.: computational analysis, A.M.I.M.: computational analysis, writing; A.S.: data collection; M.S.: laboratory experiments; F.S.: supervision, reviewing; A.I.: reviewing, editing; A.K.: supervision; M.A.: supervision; An.H.: supervision; G.U.A.: supervision; R.R.C.: supervision, writing, reviewing, and editing. All authors read and reviewed the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Hossain, M., Tabassum, T., Rahman, A. et al. Genotype–phenotype correlation of β-lactamase-producing uropathogenic Escherichia coli (UPEC) strains from Bangladesh. Sci Rep 10, 14549 (2020). https://doi.org/10.1038/s41598-020-71213-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-020-71213-5
- Springer Nature Limited
This article is cited by
-
Occurrence of Imipenem-Resistant Uropathogenic Escherichia coli in Pregnant Women: An Insight into Their Virulence Profile and Clonal Structure
Current Microbiology (2024)
-
Detection of Extended-spectrum β-lactamase-producing Escherichia coli isolates by isothermal amplification and association of their virulence genes and phylogroups with extraintestinal infection
Scientific Reports (2023)
-
Characterization of plasmids carrying blaCTX-M genes among extra-intestinal Escherichia coli clinical isolates in Ethiopia
Scientific Reports (2023)
-
Virulence characterization and clonal analysis of uropathogenic Escherichia coli metallo-beta-lactamase-producing isolates
Annals of Clinical Microbiology and Antimicrobials (2021)