Background

Iron is an important inorganic component of a cell. Iron is required as co-factor for various essential enzymes and proteins some of which are involved in electron transport (Cytochromes), redox reactions (oxidoreductases) and regulation of gene expression (fumarate-nitrate reduction regulatory protein, iron-binding protein) [1]. However a higher level of intracellular iron can catalyze formation of hydroxyl radicals and reactive oxygen species through Fenton's reaction which could be lethal to the cell [2]. Hence, a careful regulation of iron-requiring enzymes/proteins and iron uptake proteins/enzymes is required for the survival of bacteria.

Inorganic iron is also known to influence virulence in many pathogenic bacteria such as Corynebacterium diphtheriae, Escherichia coli, and Bordetella bronchiseptica [35]. The diphtheria toxin repressor DtxR is known as an iron-activated global transcription regulator that represses the transcription of various iron-dependent genes in C. diphtheriae [6, 7]. Eight DtxR-binding sites in upstream sequences of operons/genes named as tox, hmuO, irp1, irp2, irp3, irp4, irp5 and irp6 have been identified by DNA footprinting methods [6]. The product of tox gene is diphtheria toxin which catalyzes the NAD-dependent ADP ribosylation of eukaryotic aminoacyl-transferase-II, thereby causing inhibition of protein synthesis and subsequent death of the host. The hmuO gene, which encodes a haem oxygenase, oxidizes the haem to release free iron. The operons irp1 and irp6 encode the products with homology to ABC-type ferric-siderophore transport systems. The gene irp3 encodes a homologue of AraC-type transcriptional activators. The products of irp2, irp4 and irp5 do not show any homology to the other known proteins. In addition, C. diphtheriae with inactive DtxR has been shown to be sensitive to killing by exposure to high iron conditions or hydrogen peroxide than the wild type [8].

This work uses an in silico method to identify additional DtxR-binding sites and target genes to understand the role of DtxR in virulence and patho-physiology of C. diphtheriae.

Results

In silico identification of putative DtxR-binding sites

Experimentally characterized DtxR-binding motifs were collected from the literature (Table 1). These binding sites were used to identify additional putative DtxR-binding sites along with associated operons in C. diphtheriae NCTC13129 genome (see materials and methods). Table 2 shows the predicted DtxR-binding sites with score 3.7438 or more. We could identify five (tox, irp4, irp5, irp6 and hmuO) of the eight known DtxR-binding sites, in sequenced C. diphtheriae NCTC13129 genome. We could not find irp1 and irp2 motifs as the corresponding genes (irp1, irp2) are not present in the sequenced strain NCTC13129 [9]. The regulator binding sites of irp3, irp4 and irp6 genes in the strain NCTC13129 shows one base change from the binding sites reported in strain C7 [6]. Binding site of irp3 gene (TTAGGTGAGACGCACCCAT) although exists in strain NCTC13129, but not there in the predicted sites, because it is located within the coding region of irp3 ORF. The predicted ORF of irp3 in the sequenced strain NCTC13129 has different start position and is larger than what was previously reported in strain C7 [9, 10].

Table 1 Known DtxR-binding sites from C. diptheriae
Table 2 Predicted DtxR-binding sites in C. diphtheriae

In addition, we have identified binding sites in upstream sequences of eight genes recently reported to be regulated by DtxR [7]. However, our prediction differs from the previous report for five (secY, deoR, chtA, frgA, sidA) of the seven sites which were identified by BLAST search (Table 2). Our prediction agreed with the previous report that the genes such as recA (DIP1450) and ywjA (DIP1735) are not under a direct DtxR regulation as we could not detect any motif upstream to these gene with scores above the cutoff value [7].

Experimental validation of predicted binding sites

Since our approach to identify DtxR-regulated genes is purely computational in nature, we decided to test the validity of our predictions. A sample of predicted regulator binding motifs (Table 2) (upstream to ORFs: DIP2161, DIP0699, DIP0586, DIP2304, DIP2272) were experimentally verified by EMSA using IdeR, an orthologue of DtxR from M. tuberculosis. DtxR and IdeR are iron-dependent regulators. A pair wise sequence comparison of the two proteins shows a high (58%) overall sequence identity (similarity 72%) which increases further to 92% identity and 100% similarity in DNA recognition domain. In addition, the structural comparison of two regulators also shows a very similar 3D organization, suggesting that the IdeR regulator would be able to recognize the DtxR motif [11].

Synthetic double stranded oligonucleotides corresponding to DNA-binding sites were labeled with 32P and mixed with purified IdeR in presence of manganese ions and was assayed for the formation of DNA-protein complex using EMSA. Manganese was used as the divalent metal in the binding reactions on account of its redox stability compared with ferrous ion. Electrophoretic mobility of all five double stranded oligonucleotides tested was retarded by IdeR (Figure 1). However a synthetic motif (TTTTCATGACGTCTTCTAA) used as a negative control did not show any complex formation. These results indicate that the predicted DtxR-binding sites can indeed bind to DtxR.

Figure 1
figure 1

IdeR binds the predicted DtxR-binding DNA fragments. 30 pmoles of IdeR was added to 32P-labelled DNA probes in the presence of 200 μM Mn2+, and complexes were resolved on a 7% Tris-borate polyacrylamide gel containing 150 μM Mn2+. Lane 1: Control gel retardation using Radiolabeled DNA motif without DtxR-binding site. Lane 2: Radiolabeled DIP2161 motif without IdeR. Lane 3: Radiolabeled DIP2161 motif with IdeR. Lane 4: Radiolabeled DIP0699 motif with IdeR. Lane 5: Radiolabeled DIP0586 motif with IdeR. Lane 6: Radiolabeled DIP2304 motif with IdeR. Lane 7: Radiolabeled DIP2272 motif with IdeR.

Identification and annotation of DtxR-regulated genes C. diphtheriae genome

In addition to the binding site prediction, we have also identified co-regulated genes (operons) downstream to the predicted DtxR-binding site (Table 3). Function of the proteins encoded by the putative genes in Table 2 and Table 3 was predicted by RPS-BLAST search against conserved domain database [12].

Table 3 Predicted DtxR-regulated operons in C. diphtheriae

Discussion

Our analysis identified putative DtxR motifs upstream to various operons/genes which could be involved in siderophore biosynthesis, ABC-type transport systems, iron storage, oxidative stress defense and iron-sulfur cluster biosynthesis. In addition, we have also identified the motifs upstream of operons that could be involved in anchoring of host-interacting proteins to the cell wall and secretion of various virulence factors. Important functions of some of these DtxR-regulated genes and their role in C. diphtheriae physiology are discussed here.

Regulation of siderophore biosynthesis and ABC-type transport systems

Predicted member of the DtxR regulon, the gene DIP0586, codes for the IucA/IucC family of enzymes that catalyze discrete step in the biosynthesis of the aerobactin [13]. In addition to known DtxR-regulated siderophore transport genes (irp1, irp6), DtxR could also regulate other ABC-type transport systems similar to Manganese/Zinc, peptide/Nickel and multidrug subfamilies of ABC transporters. The peptide/nickel transport system (DIP2162-DIP2165) has been suggested to be recently acquired by pathogenic C. diphtheriae [9].

Regulation of iron storage and oxidative stress defense

We predict that DtxR could regulate divergently transcribed genes DIP2303 and DIP2304 whose products are similar to starvation inducible DNA-binding protein (Dps) and Formamidopyrimidine-DNA glycosylase (Fpg), respectively. Dps in Escherichia coli is induced in response to oxidative or nutritional stress and protects DNA from oxidative stress damage by nonspecific binding [14]. Dps also catalyzes oxidation of ferrous iron to ferric iron by hydrogen peroxide (2Fe2+ + H2O2 + 2H2O → 2Fe+3OOH(core) + 4H+) which in turn prevents hydroxyl radical formation by Fenton's reaction (Fe2+ + H2O2 → Fe+3 + HO- + HO.) and thereby prevents subsequent DNA damage [15]. The enzyme, formamidopyrimidine-DNA glycosylase is a primary participant in the repair of 8-oxoguanine, an abundant oxidative DNA lesion [16]. The gene DIP1510 which codes for the site-specific recombinase XerD could also be regulated by DtxR. The xerD gene in E. coli belongs to the oxidative stress regulon [17].

Regulation of proteins involved in iron-sulfur cluster biosynthesis and iron-sulfur cluster containing proteins

We predict that the operon DIP1288-DIP1296, which is similar to the suf operon of E. coli, could be regulated by DtxR. The suf operon in bacteria encodes the genes for Fe-S cluster assembly machinery [18]. In addition, genes encoding the iron-sulfur containing proteins such as succinate dehydrogenase (Sdh), cytochrome oxidase (CtaD) and Ribonucleotide reductase (NrdF1) in C. diphtheriae also show DtxR motif in their upstream sequences.

Regulation of sortases

We predict that DtxR could regulate the recently acquired pathogenic island DIP2271-DIP2272, encoding the sortase srtA and hypothetical protein, respectively [9]. Sortases are membrane-bound trans-peptidases that catalyze the anchoring of surface proteins to the cell wall peptidoglycan [9]. Such systems are often used by gram-positive pathogens to anchor host-interacting proteins to the bacterial surface [19].

Regulation of protein translation and translocation system

DtxR could regulate two operons that contain genes DIP0699 (secA) and DIP0540 (secY) that code for the protein translocation system. The sec Y-containing operon, which is similar to the streptomycine operon spc from B. subtilis and other bacteria, involves the genes required for protein translation and translocation [20]. The operon contains additional sialidase gene (DIP0543) in comparison to non pathogenic Corynebacterium species. Activity of sialidase has been linked to virulence in several other microbial pathogens and may enhance fimbriae mediated adhesion in Corynebacterium diphtheriae by unmasking receptors on mammalian cells [9].

The Sec system can both translocate proteins across the cytoplasmic membrane and insert integral membrane proteins into it. The former proteins but not the latter possess N-terminal, cleavable, targeting signal sequences that are required to direct the proteins to the Sec system. Some of the DtxR-regulated genes including diphtheria toxin (Table 4) show predicted signal sequences by SignalP 3.0 [21] and hence they may play an important role in host interaction and virulence of Corynebacterium diphtheriae [9].

Table 4 DtxR-regulated genes containing the potential signal sequence

Conclusions

The bioinformatics method used to predict the targets of DtxR in C. diphtheriae NCTC13129 genome is promising, as some of the predicted targets were experimentally verified. The approach identified novel DtxR-regulated genes, which could play an important role in physiology of C. diphtheriae NCTC13129. DtxR, generally known as a repressor of diphtheriae toxin and iron siderophore/transport genes, can also regulate other metal ion transport genes, iron storage, oxidative stress, DNA-repair, biosynthesis of iron-sulfur cluster, Fe-S-cluster containing proteins, and even protein sortase and translocation systems.

Methods

Source of genome sequence

The complete genome sequence of C. diphtheriae was downloaded from NCBI ftp site [22], and the DtxR-binding sites identified by experimental methods were collected from literature [6, 10, 2527].

Prediction of DtxR-binding sites

DtxR-binding site recognition profile was calculated by positional Shannon relative entropy method [23, 24]. The positional relative entropy Q i at position i in a binding site is defined as

where b refers to each of the possible base (A, T, G, C), f b,i is observed frequency of each base at position i and q b is the frequency of base b in the genome sequence. The contribution of each base to the positional Shannon's relative entropy is calculated by multiplying positional frequency of each base with positional relative entropy. The binding site profile thus generated was used to scan upstream sequences of all the genes of the Corynebacterium diphtheriae genome. The score of each site is calculated as the sum of the respective positional Shannon relative entropy of each of the four possible bases. A maximally scoring site is selected from the upstream sequence of each gene. The lowest score among the input binding sites is considered as cut-off score. The sites scoring higher than the cut-off value are reported as potential binding sites conforming to the consensus sequence.

Prediction of operons

Co-directionally transcribed genes, downstream to the predicted binding site were selected as potential co-regulated genes (operons) according to one of the following criteria (a) Co-directionally transcribed orthologous gene pairs, conserved in at least 4 genomes; (b) genes belong to the same cluster of orthologous gene function category and the intergenic distance is less than 200 base pairs; (c) the first three letters in gene names are identical (gene names for putative genes were assigned from COG database); (d) intergenic distance is less than 90 base pairs [24].

Functional assignment of genes

The function of predicted genes was inferred using the RPS-BLAST search against conserved domain database [12]. These genes were further classified according to their function.

Expression and purification of IdeR

The iron-dependent regulator IdeR from M. tuberculosis was expressed from a recombinant pRSET vector containing the IdeR gene fused to a six His affinity tag (P. Chakhiyar unpublished). The expressed protein was first purified using Ni-NTA Metal Chelate Affinity chromatography; later it was desalted and concentrated using Centricon Ultra filtration device. The concentration of the recombinant protein was estimated using Bradford method.

Electrophoretic mobility shift assay

Double-stranded oligonucleotides containing the predicted binding motif (19 bp long) were end labeled with T4 polynucleotide kinase and [γ32P]-ATP and were incubated with the recombinant purified IdeR protein in a binding reaction mixture. The binding reaction mixture (20-μl total volume) contain the DNA-binding buffer (20 mM Tris-HCl [pH 8.0], 2 mM DTT, 50 mM NaCl, 5 mM MgCl2, 50% glycerol, 5 μg of bovine serum albumin per ml), 10 μg of poly(dI-dC) per ml (for nonspecific binding) and 200 μM MnCl2. The reaction mixture was incubated at room temperature for 30 min. Approximately 2 μl of the tracking dye (50% sucrose, 0.6% bromophenol blue) was added to the reaction mixture at the end of incubation and was loaded onto 7% polyacrylamide gel containing 150 μM MnCl2 in 1 × Tris-borate-EDTA buffer. The gel was electrophoresed at 200 V for 2 hours. Subsequently the gel was dried and exposed to Fuji Storage Phosphor Image Plates for 16 hours. The image plates were subsequently scanned in Fuji Storage Phosphor Imaging workstation.