Background

E. coli growing in a mixture of sugars exhibits diauxic growth characteristics, whereby glucose is preferentially assimilated before other sugars. This is due to CRP-mediated catabolite repression and inducer exclusion related to phosphotransferase system enzyme activity. It is well established that cyclic AMP (cAMP) and its receptor protein (CRP) are involved in transcriptional activation of catabolic genes [1, 2], but the details of catabolite repression and inducer exclusion mechanisms and their relation to the levels of cAMP and CRP (also known as CAP) are not clear and have motivated many studies [3–7].

Inducer exclusion is a result of dephosphorylation of enzyme IIAGlc of PTS [8, 9] and catabolite repression is associated with altered levels of cAMP [10–12] and CRP [13]. Enzyme IIAGlc, when unphosphorylated, inhibits activity of other transport systems (non-PTS transporter) [14–16]. In its phosphorylated form, enzyme IIAGlc stimulates adenylate cyclase activity, resulting in higher intracellular levels of cAMP [17, 18] and the cAMP-CRP complex (global transcription activator).

Efforts to study or alleviate catabolite repression mediated by CRP have resulted in a series of CRP mutants isolated from strains lacking adenylate cyclase and having an apparent reduced dependence on cAMP for activating catabolic genes (called CRP*, CRP-in or CAPc) [1, 19, 20]. Genetically different crp* strains reported are also phenotypically different, showing different sensitivities to cyclic nucleotides and relieving catabolic repression of select genes examined to different extents [21]. For example, six different crp* mutants isolated after UV treatment and selection for a lactose+ phenotype in an adenylate cyclase-deficient E. coli strain showed a variety of utilization patterns for different sugars (lactose, maltose, arabinose, xylose, ribose, mannose, mannitol) as well as different levels of activation of the lac operon by cAMP or cGMP [22]. Similar examples have been reported by others [3, 21].

The ability to co-utilize sugars via relief of catabolite repression during microbial production of value-added chemicals has the potential to improve bioproduction process economics [23]. We previously engineered E. coli to produce xylitol from xylose while metabolizing glucose as a source of carbon and energy (xylose metabolism is disabled) [24, 25]. Expression of CRP* was an effective approach to promote expression of xylose transporters and enhance xylitol production in the presence of glucose. Although plasmid-based, CRP-independent expression of xylose transporters in wild-type crp strains also enhances xylose uptake and xylitol production in the presence of glucose [25], the favorable effects of CRP* expression were found to go beyond improving xylose transport and to include other beneficial phenotypes, such as improved xylitol titer in controlled batch fermentation, as well as reduced acetate production and higher yields on xylose reduced per mole of glucose consumed in resting cell transformations [26].

While CRP*s have been studied at the molecular level and the effects of expressing CRP* mutants on the expression of specific catabolic genes have been reported, the global transcriptional effects and regulatory consequences of CRP* expression are not known. Here, we report the results of comparisons between the transcriptome of E. coli W3110 (expressing wild-type CRP) and that of mutant strain PC05 (expressing CRP*) in the presence and absence of glucose through microarray analysis. Our results show that gene expression in PC05 is drastically different from that of W3110 in both the presence and absence of glucose, and that while expression of the CRP* allele used in this study has the general effect of suppressing transcriptional changes due to glucose, a significant response to glucose nonetheless remains. Results are analyzed in light of the observed differences between wild-type and CRP* strains during xylitol production. We identify many genes showing differential expression that are consistent with the observed elevated levels of glucose oxidation and NADPH-dependent xylose reduction for PC05 compared to W3110. A subsequent intracellular cofactor analysis reveals CRP*-correlated effects on cofactor levels that are consistent with the observed expression changes.

Results

The E. coli W3110-derivative CRP* strains used in our studies are derived from E. coli donor strain ET25 [8], which expresses a CRP* mutant with three amino acid substitutions (I112L, T127I, and A144T) identical to those found in an earlier characterized CRP* strain CA8404 [1]. Amino acid position 127 lies in the cAMP binding pocket, and T127I or T127L mutations occur frequently in CRP* alleles [21, 27], presumably serving to reduce the cAMP requirement to form an activating CRP complex [27]. Mutation A144T is also frequently found in different CRP* alleles [3, 21, 28] and can exhibit the CRP* phenotype to some extent even as the only mutation in the protein [29]. This position lies in the DNA binding domain of CRP and is suggested to improve affinity of the protein for CRP binding sites [29]. A fourth base substitution in the crp* sequence results in a T28K mutation, which is the result of native differences in the crp sequence between W3110 and the donor strain.

Genome-wide transcriptional effects of glucose and CRP*

Table 1 summarizes the genome-wide effects of CRP* expression under the conditions tested, while Table 2 lists the average signal values and expression ratios for specific genes mentioned in this paper. Supplementary Table S1 (see Additional file 1) contains signal values for the complete probe set data for the E. coli K-12 genome. Transcriptome analysis of strain W3110 reveals that 629 genes show significant changes in expression level in response to the presence of glucose (comparison between WT G and WT in Figure 1a). 375 of these genes are upregulated by glucose, as depicted in Figure 1a. The complete list of expression levels of the genes that are differentially expressed between WT G and WT is provided in Supplementary Table S2 (see Additional file 2). Catabolic genes, membrane-related components, and sugar transporters (especially non-glucose PTS related enzymes) represent a large portion of genes repressed by glucose. In a study of CRP-dependent gene expression, Gosset and coworkers reported transcriptome analysis of CRP-dependent genes in another E. coli K-12 strain BW25113 [30]. In Table 3 we compare their result to results from our study for common conditions tested (i.e., WT G/WT). While their study did not examine CRP*, this comparison provides an indication of the consistency of glucose-responsive gene expression among different but similar strains. Our comparison focuses on genes involved in central metabolism and shows that the genes which are subject to glucose repression in BW25113 (such as aceA (isocitrate lyase monomer), aldA (aldehyde dehydrogenase A), sdhA (succinate dehydrogenase) and sucA (oxoglutarate dehydrogenase)) are also downregulated in the presence of glucose for W3110. However, not all the genes which are upregulated in the presence of glucose in BW25113 are upregulated in W3110 under the same conditions (examples are aceE (pyruvate dehydrogenase E1 component), guaB (IMP dehydrogenase), rpsQ (30S ribosomal subunit protein S17)). 
This is likely due to differences between these two strains [26, 31–33] as well as differences in experimental methods.

Table 1 Summary of genome-wide effects of CRP* expression.
Table 2 Expression levels for the genes discussed in this paper.
Table 3 Comparison between expression levels (signal values) of the wild-type strain genes in response to the presence of glucose in two different studies.
Figure 1

Genome-wide transcriptional effects of glucose in strain W3110 expressing wild-type CRP, presented as expression ratios for individual genes showing significant differential expression in the presence and absence of glucose (WT G/WT). a) 629 genes show significant changes in expression level in response to the presence of glucose in strain W3110. b) The changes in expression levels of the same genes shown in (a) in CRP* strain PC05, in response to glucose. Gene names and expression levels are given in supplementary Table S2 (see Additional file 2).

Figure 1b depicts the changes in expression levels of the same genes shown in Figure 1a for CRP mutant strain PC05 in response to glucose. The average WT G/WT ratio for the genes that are upregulated in W3110 in the presence of glucose is 4.06, while the average CRP* G/CRP* ratio for the same genes is 1.07. For genes downregulated by glucose in W3110, the average WT G/WT and CRP* G/CRP* ratios are 0.32 and 0.81, respectively. These results show that genes whose expression is significantly altered by glucose in strain W3110 are generally not altered to the same extent in strain PC05 and that CRP* suppresses this effect of glucose.
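To compare up- and downregulation on a symmetric scale, the averaged ratios can be converted to log2 fold changes. The sketch below is illustrative only and is not part of the original analysis; it uses just the four average ratios quoted above, and the names avg_ratios and log2_fold are our own. Note that the log2 of an average ratio is not the same as the average of per-gene log2 ratios, so the values are a rough guide.

```python
import math

# Average expression ratios quoted in the text.
avg_ratios = {
    "up in W3110":   {"WT G/WT": 4.06, "CRP* G/CRP*": 1.07},
    "down in W3110": {"WT G/WT": 0.32, "CRP* G/CRP*": 0.81},
}

def log2_fold(ratio):
    """Convert an expression ratio to a log2 fold change."""
    return math.log2(ratio)

for group, ratios in avg_ratios.items():
    wt = log2_fold(ratios["WT G/WT"])
    crp = log2_fold(ratios["CRP* G/CRP*"])
    print(f"{group}: WT {wt:+.2f}, CRP* {crp:+.2f} (log2 units)")
```

In both groups the CRP* value lies much closer to zero than the wild-type value, which is the dampened glucose response described above.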

Figure 2 shows that fewer genes exhibit significant changes in expression level for strain PC05 (80 genes) compared to W3110 (629 genes) when grown in the presence versus absence of glucose. 29 of these genes are upregulated in the presence of glucose. This confirms the expected role of CRP* in the alleviation of glucose repression. Only 43 genes are common between those of Figure 1 and Figure 2. In contrast to W3110, the number of genes that are repressed in the presence of glucose in PC05 is greater than the number of genes that are upregulated. Only 3% of genes that are upregulated in W3110 in the presence of glucose are also upregulated in PC05 under the same condition, while 12% of glucose-repressed genes in W3110 are also repressed by glucose in PC05.

Figure 2

Genome-wide transcriptional effects of glucose in CRP* strain PC05, presented as expression ratios for individual genes showing significant differential expression in the presence and absence of glucose (CRP* G/CRP*). a) 80 genes show significant changes in expression level for strain PC05. Only 29 are upregulated. b) The changes in expression levels of the same genes shown in (a) for strain W3110, in response to glucose. Gene names and expression levels are given in supplementary Table S3 (see Additional file 3).

The complete list of expression levels of the genes that are differentially expressed in PC05 with versus without glucose is provided in Supplementary Table S3 (see Additional file 3). Specific examples of genes that are upregulated in PC05 in response to glucose include: the PTS gene ptsG, che genes (involved in regulation of chemotaxis), dhaM (associated with dihydroxyacetone kinase), edd (encoding phosphogluconate dehydratase of the Entner-Doudoroff pathway), gnt genes (gluconate transport and metabolism), genes involved in amino acid metabolism such as glt (glutamate synthase) and ser (serine biosynthesis) genes, and ymf genes of the lambdoid prophage element e14. Genes that are downregulated in PC05 in response to glucose include: argD (involved in lysine and arginine biosynthesis), glp genes (glycerol transport and metabolism), gntP (encoding a gluconate transporter), srlA (glucitol/sorbitol PTS system), thiCE genes (involved in thiamine biosynthesis), and treBC genes (trehalose transport and metabolism). Refer to Table 2 for expression levels and ratios.

418 genes in the E. coli genome are suggested to be regulated in part by CRP, as reported by the most current EcoCyc database [34]. While the modes of regulation of many of these genes are complicated and not well understood (often involving multiple transcription factor binding sites), the CRP-cAMP complex is assigned to be a transcriptional activator for approximately 321 genes (implying upregulation in the absence of glucose) and a repressor for approximately 46 genes (implying downregulation in the absence of glucose) (in some cases the role of CRP is dual or unclear). While 629 genes show significant changes in their expression levels in response to the presence of glucose in W3110 (Figure 1), only 19% of them (118 out of 629) have a CRP binding site close to their start codon (these genes are highlighted in green in Additional file 2 (Table S2)). This result is perhaps not unexpected when considering that most genes under CRP control are also regulated by other transcription factors [34]. Of these 118 genes, 25 are upregulated while 93 show reduced expression in the presence of glucose. 77 out of the 93 genes downregulated in glucose are described in EcoCyc as being activated by CRP-cAMP, showing good agreement with the expected inverse relationship between glucose presence and CRP-cAMP activity. Meanwhile only 23 of the 80 genes showing significant differential expression in the presence of glucose for strain PC05 are present among the list of 418 genes believed to be directly regulated by CRP (highlighted in green in Additional file 3 (Table S3); 7 are upregulated in glucose, and 13 out of the remaining 16 downregulated genes are reported to be activated by CRP-cAMP and therefore expected to show lower expression in the presence of glucose), again demonstrating significant alleviation of catabolite repression. Thus, the majority of expression changes resulting from the presence of glucose or the CRP* mutations are not directly related to altered regulation by CRP at CRP binding sites, but rather due to secondary effects resulting from a smaller number of direct, CRP-mediated expression differences.

To investigate which genes respond differently to glucose in PC05 compared to W3110, an interaction term ((CRP* G/CRP*)/(WT G/WT)) was examined with the same criteria as pair-wise comparisons. This comparison reports the difference of differences and reveals that 238 genes respond differently to the presence of glucose in W3110 compared to PC05, as illustrated in Figures 3a and 3b (listed in supplementary Table S4 (see Additional file 4)). As shown in Table 2, ppsA (encoding phosphoenolpyruvate synthase), glp genes (involved in glycerol transport and metabolism), mgl genes of the galactose ABC transporter, and TCA cycle genes fumA (fumarate hydratase class I) and sdhA, sdhB, sdhD (succinate dehydrogenase) all show significantly different responses to glucose in W3110 compared to PC05.
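The interaction term can be illustrated with a short calculation. The following is a minimal sketch, not the analysis pipeline used in this study: the function name, gene labels, and signal values are hypothetical, chosen only to show how a ratio of ratios separates a gene whose glucose response is largely lost in PC05 from a gene that responds to glucose identically in both strains. The significance criteria applied to the real data are not reproduced here.

```python
import math

def interaction_ratio(wt, wt_g, crp, crp_g):
    """Return (CRP* G/CRP*) / (WT G/WT) for one gene's mean signal values."""
    return (crp_g / crp) / (wt_g / wt)

# Hypothetical mean signals (arbitrary units), for illustration only.
genes = {
    # Repressed 4-fold by glucose in WT, nearly unresponsive in CRP*.
    "geneA": dict(wt=1000.0, wt_g=250.0, crp=1000.0, crp_g=900.0),
    # Repressed 4-fold by glucose in both strains (different baselines).
    "geneB": dict(wt=400.0, wt_g=100.0, crp=800.0, crp_g=200.0),
}

for name, signals in genes.items():
    r = interaction_ratio(**signals)
    print(f"{name}: interaction ratio = {r:.2f}, log2 = {math.log2(r):.2f}")
```

A ratio near 1 (log2 near 0) indicates that the two strains respond to glucose in the same way even if their baseline expression differs, whereas a ratio far from 1 flags a strain-specific glucose response.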

Figure 3

Genes that respond differently to the presence of glucose in W3110 compared to PC05, (CRP* G/CRP*)/(WT G/WT). a) Ratio of expression levels of these 238 genes in strain W3110 in the presence and absence of glucose (WT G/WT). b) Ratio of expression levels of the same genes shown in (a) in strain PC05, in the presence and absence of glucose (CRP* G/CRP*). Gene names and expression levels are given in supplementary Table S4 (see Additional file 4).

Pair-wise comparison between the CRP* G and WT G conditions identifies the changes in transcriptional levels of genes affected by CRP* in the presence of glucose. In this category, 349 genes show significant changes in their expression levels (Table 1), as depicted in Figure 4 and as listed in supplementary Table S5 (see Additional file 5). Many of the genes upregulated by CRP* (strain PC05) are involved in transport, catabolism, and amino acid metabolism (Table 2), including glp genes (involved in glycerol transport and metabolism), mgl genes (galactose ABC transporter), tdc genes (involved in serine and threonine metabolism), and tnaB (encoding a tryptophan transporter). Genes downregulated by CRP* in this comparison include gcd (encoding glucose dehydrogenase), glnP (glutamine ABC transporter), pro genes (involved in proline biosynthesis), and udp (uridine phosphorylase, involved in pyrimidine ribonucleoside metabolism). Additional differentially expressed genes of relevance to this study are described in the Discussion.

Figure 4

Significant genome-wide transcription effects of expressing CRP* (strain PC05) instead of wild-type CRP (W3110) in the presence of glucose, presented as individual gene expression ratios (CRP* G/WT G). 349 genes show significant changes in their expression levels between CRP* G and WT G conditions. Gene names and expression levels are given in supplementary Table S5 (see Additional file 5).

Pair-wise comparison of gene expression in PC05 and W3110 grown on LB (without glucose) reveals changes in gene transcription levels as a result of the CRP* mutation. This comparison shows that 553 genes are expressed differently (392 of them are upregulated) between these two strains in the absence of glucose. These results are summarized in Table 1, and the specific genes are listed in supplementary Table S6 (see Additional file 6). gapC (encoding glyceraldehyde 3-phosphate dehydrogenase), nrf genes (involved in anaerobic respiration), and tdc genes (involved in serine and threonine metabolism) are examples of genes upregulated by CRP*.

Finally, to examine the extent to which CRP* reduces the glucose effect, we performed a pair-wise test between two conditions: PC05 in the presence of glucose (CRP* G) and W3110 in the absence of glucose (WT). Our results show that the transcriptional levels of 481 genes are significantly different between these two conditions. These results are summarized in Table 1 and the specific genes and expression values are listed in supplementary Table S7 (see Additional file 7). Examples of genes most significantly upregulated by CRP* in glucose include the nrf genes, citF (encoding citrate lyase), gat genes (involved in hexitol transport and metabolism), edd (encoding phosphogluconate dehydratase), ldhA (lactate dehydrogenase), fru genes (fructose transport and metabolism), manZ (mannose PTS permease), ptsG, and malF (maltose ABC transporter).

Real-Time Reverse Transcription PCR

To confirm the microarray results, the transcript levels of ppsA and pntA, both of which showed significant changes in their transcriptional levels under the various conditions tested, were compared by real-time reverse transcription PCR. The transcript of rpsQ was also analyzed, as this gene was noted to respond quite differently to the presence of glucose with wild-type CRP in our study as compared to the study by Gosset (WT G/WT value of 0.51 compared to 2.7) [30]. Data are presented in supplementary Table S8b as fold-changes (signal ratios) for all conditions tested, and show a good agreement between microarray and real-time RT-PCR results (see Additional file 8).

Cofactor analysis

In order to better understand how CRP* expression may influence NADPH availability for xylose reduction, intracellular cofactor concentrations were quantified for wild-type and CRP* strains engineered to produce xylitol. To prevent xylose metabolism in these strains, xylB encoding xylulokinase was deleted [24]. The wild-type CRP strain (W3110ΔxylB) was transformed with plasmid pPCC207 for inducible co-expression of an NADPH-dependent xylose reductase (CbXR) and the E. coli ATP-dependent xylose transporter system (XylFGH) [25], while the CRP* strain (PC05ΔxylB) was transformed with plasmid pLOI3815 for CbXR expression [24]. The strains were then compared after growth on glucose (no xylitol production) versus growth on glucose plus xylose (resulting in xylose reduction).

Results from the intracellular cofactor concentration measurements are summarized in Table 4. Also listed are the xylitol production results for these strains in batch fermentations and resting cell cultures. Note that the CRP* strain produced considerably more xylitol and with a higher yield. In the absence of xylose, the NADPH concentration is significantly higher in the CRP* strain (0.8 versus 0.5 μmol (g cdw)⁻¹). Meanwhile, given the ability to reduce xylose to xylitol, the NADPH concentration falls to a much lower level in the CRP* strain (from 0.8 to 0.1 μmol (g cdw)⁻¹) compared to wild-type CRP (from 0.5 to 0.3 μmol (g cdw)⁻¹). While the NADPH/NADP+ ratios are nearly identical for both strains in the absence of xylose, the significant consumption of NADPH during xylose reduction coincides with a significantly larger drop in the NADPH/NADP+ ratio for CRP* (from 0.09 to 0.01) compared to wild-type CRP (from 0.10 to 0.05). Also note that the oxidized NADP+ concentrations are significantly elevated in the CRP* strain, while both NADH and NAD+ concentrations are much lower. The net effects are higher total NADP(H) (i.e. NADPH plus NADP+) concentrations and lower NAD(H) concentrations in the CRP* strains compared to wild-type. It is also noteworthy that the NADH/NAD+ ratio is significantly lower in the CRP* strain under both conditions tested.
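As a rough consistency check, the NADP+ concentration implied by each reported NADPH concentration and NADPH/NADP+ ratio can be back-calculated. The sketch below uses only the rounded values quoted in this paragraph, so the inferred numbers are approximate (especially where the ratio is given to one significant figure) and are not a substitute for the measured values in Table 4. The function and dictionary names are illustrative.

```python
def implied_nadp(nadph, ratio):
    """NADP+ implied by an NADPH concentration and an NADPH/NADP+ ratio."""
    return nadph / ratio

# (NADPH in umol per g cdw, NADPH/NADP+ ratio), rounded values from the text.
reported = {
    ("WT CRP", "glucose"):          (0.5, 0.10),
    ("WT CRP", "glucose + xylose"): (0.3, 0.05),
    ("CRP*", "glucose"):            (0.8, 0.09),
    ("CRP*", "glucose + xylose"):   (0.1, 0.01),
}

for (strain, condition), (nadph, ratio) in reported.items():
    nadp = implied_nadp(nadph, ratio)
    print(f"{strain:7s} {condition:17s} NADP+ ~ {nadp:4.1f}, "
          f"total NADP(H) ~ {nadph + nadp:4.1f}")
```

Under both conditions the implied NADP+ and total NADP(H) values come out higher for the CRP* strain than for wild-type CRP, in line with the elevated NADP+ and total NADP(H) concentrations noted above.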

Table 4 Culture performance and intracellular cofactor levels and ratios for strains engineered to produce xylitol.

Discussion

We previously used a CRP* strain to promote expression of xylose transporters in the presence of glucose to produce xylitol from a glucose+xylose mixture, with xylose metabolism disabled [24]. Plasmid-based, CRP-independent expression of xylose transporters in wild-type crp strains was an alternative strategy we explored to enhance xylose uptake and xylitol production in the presence of glucose [25]. However, the favorable effects of CRP* expression were found to go beyond improving xylose transport and to include other beneficial phenotypes, such as reduced acetate production and higher yields on xylose reduced per mole of glucose consumed (as shown in Table 4) [26].

Transcription changes associated with CRP* expression are extensive. While many of the genes known to be regulated by CRP show altered expression in the context of CRP*, the majority of differentially expressed genes are not known to be directly under CRP control. The complex network of genes showing altered regulation due to secondary effects of CRP* is not likely to be metabolically systematic, since the mutant CRP used in this study (i.e. CRP*) does not have an evolved physiological role, was isolated under very particular growth conditions, and does not simply serve as a constitutive, cAMP-independent regulator. Therefore the difficulty in identifying clear patterns of differential expression of metabolically related genes is perhaps not surprising (the mutations in CRP* may not uniformly alter the regulator's natural physiological role at different control sites). Rather than attempting to assign physiological meaning to the altered transcriptome, we instead identify gene expression changes that help to explain the beneficial effects of CRP* expression as they relate to xylitol production. Specifically, we focus on genes that may affect NADPH availability.

Improved cofactor availability for xylitol production

In E. coli, complete oxidation of glucose during aerobic growth requires that respiration and anabolic metabolism consume reducing equivalents as they are generated [35]. Elevated glucose flux beyond the capacity of respiration and growth results in incomplete oxidation and acid secretion. Heterologous, NADPH-dependent xylitol production can act as an added electron sink in strains producing xylitol from a mixture of glucose and xylose, and an increased ability to produce xylitol during aerobic growth is expected to increase glucose oxidation and tricarboxylic acid (TCA) cycle flux, provided reducing equivalents from NADH can be converted to NADPH. Alternately, increased expression of genes involved in glucose oxidation would allow for increased xylitol production, provided reducing equivalents can be delivered as NADPH. Our transcription analysis sheds light on the observed ability of CRP* strains expressing xylose reductase to produce more xylitol and secrete less acetate than similar wild-type CRP strains constitutively expressing a xylose transporter.

TCA cycle genes involved in reactions between succinate and oxaloacetate are upregulated in PC05 compared to W3110 in the presence of glucose, as listed in Table 2. These include sdhA, sdhB, sdhD (encoding succinate dehydrogenase), fumA (encoding fumarate hydratase class I), and mdh (encoding malate dehydrogenase). All of these genes are known to have CRP binding sites in their promoter regions, so increased expression in PC05 is likely due to direct regulatory effects of CRP*. A strain with a more active TCA cycle potentially increases glucose oxidation, produces more NADPH and produces less acetate [36, 37]. Upregulation of acs (acetyl-CoA synthetase) in CRP* G compared to WT G (3.63-fold) may promote acetate assimilation instead of accumulation. Also shown in Table 2, sthA encoding soluble pyridine nucleotide transhydrogenase is expressed at a lower level in CRP* G compared to WT G (CRP* G/WT G ratio of 0.6). SthA is believed to primarily oxidize NADPH to regenerate NADH, and an increase in NADPH demand corresponds to reduced sthA expression [38, 39]. Transcriptional control of sthA is not well understood, but the lack of an apparent CRP binding site suggests that reduced sthA expression may be a result of increased NADPH demand rather than a direct result of CRP*-mediated control. Consistent with this apparent elevated demand for NADPH is the fact that the genes encoding both subunits of the membrane-bound, proton-translocating pyridine nucleotide transhydrogenase (pntA and pntB) are upregulated in CRP* G compared to WT G (1.88 and 1.76-fold respectively). PntAB has been reported to produce 35–45% of the NADPH required for E. coli biosynthesis during aerobic growth [38]. Interestingly, the maeB gene encoding NADP-linked malic enzyme (decarboxylating malate to pyruvate) can serve as another source of NADPH regeneration in E. coli and also shows a higher level of transcription in PC05 compared to W3110 in the presence of glucose (CRP* G/WT G ratio of 2.6). Changes in transcriptional patterns of the above-mentioned genes may be a response to increased demands for NADPH in CRP* strains, perhaps as this relates to apparently increased anabolic demands (described below).

To ensure continued growth, E. coli balances intracellular concentrations and ratios of the reduced and oxidized cofactors through a complex interplay between catabolic metabolism, anabolic metabolism, redox-sensitive regulation (both genetic and allosteric) and transhydrogenase activities [35, 40–43]. While the mechanisms of maintaining redox balance are not well understood, the expression levels of enzymes and regulators involved in redox metabolism play a critical role. It is thus perhaps not surprising that altering the activity of a global regulator has a significant impact on cofactor concentrations and the range of attainable redox states (as demonstrated in Table 4). NADH regulates the activity of a number of enzymes involved in central metabolism and glucose oxidation (e.g. pyruvate dehydrogenase [44, 45], citrate synthase [46, 47] and α-ketoglutarate dehydrogenase [48]). CRP* expression results in reduced NADH levels, increased production of NADPH relative to NADH, and increased tolerance to a range of NADPH levels and NADPH/NADP+ ratios, all of which are likely to improve NADPH-dependent xylitol production during glucose metabolism.

Expression of "unnecessary" genes

Most of the genes that are differentially expressed between PC05 and W3110 in the presence of glucose (231 out of 349) are upregulated with CRP* (Figure 4), supporting the generally assumed behavior of CRP* in alleviating glucose-dependent catabolite repression. The otherwise unnecessary upregulation of these genes likely causes a significant increase in demand for carbon and energy, helping to explain the slower growth rate observed for PC05 compared to W3110 [24]. Notable genes that fall into this upregulated category include many involved in amino acid metabolism, such as tnaA (encoding tryptophanase), thrA (aspartokinase I and homoserine dehydrogenase I), glyA (serine hydroxymethyl transferase), tdcB (threonine dehydratase), thrC (threonine synthase), thrB (homoserine kinase), and asnA (asparagine synthetase A) (refer to Table 2). Upregulation of amino acid metabolism pathways may be in response to increased protein synthesis demands caused by upregulation of other genes.

Catabolite repression and inducer exclusion

A relationship between the phosphorylation state of enzyme IIAGlc and the intracellular "phosphoenolpyruvate (PEP)/pyruvate" ratio has been suggested [9]. Decreased levels of phosphorylated enzyme IIAGlc are usually accompanied by decreased PEP/pyruvate ratios. The crucial role of the unphosphorylated form of enzyme IIAGlc in catabolite repression and inducer exclusion is well documented [8, 9, 14–18]. As shown in Table 2, phosphoenolpyruvate synthase (encoded by ppsA) is expressed to a higher level in PC05 compared to W3110 in the presence of glucose (CRP* G/WT G ratio of 5.35). Upregulation of this enzyme, which mediates conversion of pyruvate to PEP, may increase the intracellular PEP/pyruvate ratio, resulting in an increase in the phosphorylated form of enzyme IIAGlc. This in turn may increase adenylate cyclase activity [17, 18] and further help to alleviate catabolite repression and inducer exclusion in PC05.

Conclusion

We have used microarray analysis to compare the transcriptomes of E. coli W3110 expressing wild-type CRP and mutant strain PC05 expressing CRP* in the presence and absence of glucose. Table 1 summarizes the genome-wide effects of CRP* expression under the conditions tested. Gene expression in the context of CRP* in the presence of glucose is very different from that of wild-type in the absence of glucose. Although fewer genes show expression sensitivity to glucose in PC05 compared to W3110, CRP* does not completely eliminate glucose effects. As expected, CRP* expression causes increased expression of genes involved in nutrient transport and catabolism (among many others). In addition, several genes showing significant differential expression in CRP* versus wild-type CRP help to explain the observed differences in cofactor levels and metabolic behavior of CRP* strains used in xylitol production.

Materials and methods

General

E. coli K-12 strain W3110 (ATCC 27325) and its derivatives were maintained on plates containing Luria-Bertani (LB) medium (10 g tryptone, 5 g yeast extract, 5 g NaCl, and 15 g agar per liter). Methods for construction of strains PC05 (W3110, crp*), PC07 (W3110ΔxylB), and PC09 (PC05ΔxylB) were described previously [24]. Briefly, the crp* gene and xylB deletion were introduced into W3110 via P1 phage transduction using lysates from strains ET25 (crp*::Tn10) [8] and PC06 (W3110, ΔxylB::FRT-aac-FRT) [24], followed by selection on tetracycline (for crp*) or apramycin (for ΔxylB) plates. Plasmid pLOI3815 is a medium-copy, pBR322-origin vector carrying a kanamycin resistance marker and the xylose reductase gene from Candida boidinii, which is located downstream of the tac promoter and upstream of a transcription termination sequence [24]. Xylose transporter genes xylFGH (ATP-dependent xylose transporter system) were cloned downstream of CbXR in pLOI3815 to make plasmid pPCC207.

Amino acid substitutions in the CRP* were confirmed by sequencing. The crp* phenotype was verified in two ways. First, several TetR transductants were grown in LB medium containing glucose (1%) and xylose (1%). Cells were harvested at mid logarithmic growth phase and washed twice in phosphate buffer containing kanamycin (50 μg/mL). After allowing time for residual sugars to be cleared, the cells were resuspended a final time in buffer containing xylose (1%), kanamycin, and 1% triphenyltetrazolium chloride (TTC). Reduction of TTC results in red color formation and indicates constitutive xylose utilization. The crp* phenotype was additionally confirmed using HPLC to verify simultaneous glucose and xylose consumption in batch cultures [24].

Growth conditions

Four different conditions were tested in this study: W3110 in LB medium (WT), W3110 in LB+glucose medium (WT G), PC05 in LB medium (CRP*), and PC05 in LB+glucose medium (CRP* G). All experiments were performed at least in triplicate and all data reported are the average of at least three experiments. Cell culture optical density was measured at 600 nm (OD600) using a SPECTRAMax PLUS384 spectrophotometer (Molecular Devices). Cells grown for harvesting were prepared as follows. Overnight pre-seed cultures were prepared by inoculating 3 ml of LB medium (in a 13 × 100 mm tube) with a few colonies from a fresh LB plate. The overnight cultures were used to inoculate, to an OD600 of 0.1, 50 ml LB medium seed cultures (with or without 0.4% glucose supplementation) in 250 ml shake-flasks. The seed cultures were grown at 37°C to an OD600 of ~2 and were then used to directly inoculate, to an OD600 of 0.02, 100 ml LB medium cultures (with or without 0.4% glucose supplementation) in 500 ml flasks. These cultures were grown at 37°C and 250 rpm to an OD600 of 0.5.
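The inoculation steps amount to a simple dilution calculation (C1·V1 = C2·V2). A minimal sketch follows, assuming that OD600 scales linearly with cell density over this range and that the inoculum volume counts toward the final culture volume. The function name is our own, and the overnight culture OD600 in the second call is a hypothetical value, since it is not stated in the text.

```python
def inoculum_volume_ml(source_od, target_od, final_volume_ml):
    """Volume of source culture needed so the final culture starts at target_od."""
    if source_od <= target_od:
        raise ValueError("source culture must be denser than the target")
    return target_od * final_volume_ml / source_od

# Seed culture at OD600 ~2 into a 100 ml culture started at OD600 0.02
# (values from the text).
print(inoculum_volume_ml(2.0, 0.02, 100.0))

# Overnight culture into a 50 ml seed culture started at OD600 0.1;
# the overnight OD600 of 4.0 is an assumed, illustrative value.
print(inoculum_volume_ml(4.0, 0.1, 50.0))
```

With the values stated in the text, roughly 1 ml of seed culture is needed per 100 ml of final culture.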

Cell harvesting and preparation of RNA

Cells from the 100 ml culture were harvested at an OD600 of 0.5 (early logarithmic growth phase) by immediately placing the culture on ice, transferring it to 50 ml Falcon tubes, and centrifuging at 4°C for 5 minutes before treating with lysozyme. The Promega PureYield™ RNA Midiprep System kit was used for RNA extraction. As a preliminary check, RNA yield and quality were determined by spectrophotometry according to the manufacturer's protocol, and the integrity of the purified RNA was determined by formaldehyde agarose gel electrophoresis.

Labeling, hybridization and scanning

Total RNA concentration and purity were determined using a NanoDrop spectrophotometer and total RNA integrity was examined using an Agilent Bioanalyzer. Total RNA of sufficient concentration, purity, and integrity was labeled and subsequently hybridized to Affymetrix GeneChip microarrays by the Penn State DNA Microarray Facility according to the manufacturer's instructions (Affymetrix Inc, Santa Clara, CA). Briefly, 10 μg of total RNA was converted to cDNA using random-primed reverse transcription. cDNA was purified by removing the RNA via hydrolysis with NaOH and then neutralizing the solution. Purified cDNA was fragmented and subsequently end-labeled with biotin. Fragmented, end-labeled cDNA was dissolved in hybridization cocktail and hybridized to Affymetrix GeneChip E. coli Genome 2.0 Arrays (approximately 10000 probe sets) for 16 hours at 45°C. The details of GeneChip E. coli Genome 2.0 Arrays are described by Affymetrix [49].

After hybridization, the hybridization cocktail was removed and the arrays were washed to remove unbound and non-specifically bound cDNA. Hybridization was detected by staining the arrays with streptavidin phycoerythrin. All washing and staining steps were performed using the Affymetrix GeneChip Fluidics Station 450 according to the manufacturer's instructions (Affymetrix Inc, Santa Clara, CA). Stained arrays were scanned using the Affymetrix GCS3000 7G scanner.

Microarray data analysis

A minimum of three data sets was generated for each of the four different conditions tested (based on the combination of the strains W3110 and PC05 in LB and LB+glucose media). Affymetrix Expression Console™ software (Version 1.1) was used for background adjustment, normalization and summarization of chip-level data in the form of feature intensity (CEL) files in order to generate probe set summarization (CHP) files, using the probe logarithmic intensity error (PLIER) method. Data from CHP files were then exported to a Microsoft Excel spreadsheet for further analysis. Signal values for 10208 probe sets from GeneChip E. coli Genome 2.0 Arrays were filtered to extract probe set data for only the E. coli K-12 strain. All calculations and analyses were performed on the 4070 genes remaining after filtration.

Signal values were log-transformed for the pair-wise comparisons. A linear model was fitted to each gene using the Bioconductor software package LIMMA [50, 51] in the R environment [52]. The linear model coefficients were used to calculate significant differences in expression levels for all pair-wise comparisons. The P-values were adjusted by the Benjamini-Hochberg method [53], and genes with an adjusted P-value of <0.05 were considered to have significantly different expression levels under the conditions compared. Data are reported as expression levels (signal values) or ratios of expression levels. Supplementary Table S1 (Additional file 1) contains signal values for the complete probe set data for the E. coli K-12 genome. The R code [52] (file name: sup_code.doc) can be found in the supplementary material (see Additional file 9). Gene annotations were converted from Affymetrix probe set IDs to Entrez gene IDs using NETAFFX on the Affymetrix website [49]. The online Database for Annotation, Visualization and Integrated Discovery (DAVID) [54, 55] and the Kyoto Encyclopedia of Genes and Genomes (KEGG) [56] were used for pathway visualization and gene ontology (GO) classification.
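To illustrate the multiple-testing step, the following minimal sketch implements the Benjamini-Hochberg adjustment in plain Python. It is not the authors' R code (which is provided in Additional file 9); the P-values shown are hypothetical and serve only to demonstrate the calculation and the 0.05 cutoff.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted P-values in the original order."""
    m = len(p_values)
    # Indices of the P-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P-value to the smallest so that adjusted values
    # never increase as the raw P-value decreases.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


# Hypothetical raw P-values for five genes (illustration only).
raw_p = [0.001, 0.008, 0.039, 0.041, 0.6]
adj_p = benjamini_hochberg(raw_p)
significant = [i for i, p in enumerate(adj_p) if p < 0.05]
print(adj_p, significant)
```

In this toy example the third and fourth genes have raw P-values below 0.05 but are no longer called significant after adjustment, which is the intended effect of controlling the false discovery rate across many genes.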

Real-time, Reverse Transcription PCR

Total RNA samples were isolated in the same way as for the microarray studies. ppsA (phosphoenolpyruvate synthase), pntA (membrane-bound proton-translocating pyridine nucleotide transhydrogenase), and rpsQ (30S ribosomal subunit protein S17) were selected for confirmation by real-time reverse transcription PCR, with rrsH (encoding 16S ribosomal RNA) as a control. Primer and probe sequences used for RT-PCR are listed in supplementary Table S8a (Additional file 8) and were designed by Deborah S. Grove of the Penn State Nucleic Acid Facility using Primer Express v2.0 (Applied Biosystems, Foster City, CA). Probes were synthesized by Biosearch (Novato, CA). The Applied Biosystems High Capacity cDNA Reverse Transcription Kit (part number 4368813) was used for reverse transcription according to the manufacturer's instructions for cDNA production. cDNA was amplified in an ABI 7300 real-time PCR instrument using TaqMan® Universal PCR Master Mix, No AmpErase® UNG (part number 4324018). Output was analyzed using the method described in [57].
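The text does not name the analysis method cited as [57]. Purely as an illustration, and not necessarily the authors' approach, the sketch below shows the widely used comparative Ct (2^-ΔΔCt) calculation for a target gene normalized to a reference gene such as rrsH. It assumes roughly 100% amplification efficiency for both amplicons; the Ct values and the function name are hypothetical.

```python
def relative_expression(ct_target_test, ct_ref_test, ct_target_calib, ct_ref_calib):
    """Fold change of a target gene in a test sample relative to a calibrator.

    Uses the comparative Ct (2^-ddCt) calculation, which assumes that target
    and reference amplicons both amplify with ~100% efficiency.
    """
    delta_ct_test = ct_target_test - ct_ref_test      # normalize test sample
    delta_ct_calib = ct_target_calib - ct_ref_calib   # normalize calibrator
    delta_delta_ct = delta_ct_test - delta_ct_calib
    return 2.0 ** (-delta_delta_ct)


# Hypothetical Ct values: target gene vs. 16S rRNA reference, CRP* vs. WT.
fold_change = relative_expression(
    ct_target_test=22.0, ct_ref_test=10.0,
    ct_target_calib=24.0, ct_ref_calib=10.0,
)
print(f"Fold change: {fold_change:.1f}")
```

With these made-up numbers the target crosses threshold two cycles earlier in the test sample after normalization, corresponding to a four-fold higher relative transcript level.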

Cofactor measurements

The cofactor analysis used in this study is based on the methods developed by Bernofsky and Swan [58] and modified by Gibon and Larher [59] and Walton and Stewart [60]. To investigate the effect of xylitol production on intracellular NADP(H) and NAD(H) levels, cofactor concentrations and ratios were measured and compared in strain PC05ΔxylB harboring pLOI3815 and strain W3110ΔxylB harboring pPCC207. Seed cultures were grown at 37°C to an OD600 of ~2 and then used directly to inoculate, to an OD600 of 0.02, 100 ml of LB medium supplemented with 100 mM glucose and 100 mM xylose (or 200 mM glucose for non-xylitol-producing conditions), 50 mM MOPS, kanamycin monosulfate (50 μg/ml) and isopropyl-β-D-thiogalactopyranoside (IPTG, 100 μM) in a 500 ml flask. These cultures were grown at 30°C and 250 rpm to an OD600 of 0.5. Cells were immediately chilled on ice and harvested by pelleting (4°C, 15 min, 3750 rpm) to achieve a final OD600 of 30 in 1 ml. To isolate the oxidized forms, the pellet was resuspended in 0.5 ml of 0.3 M HCl, 50 mM Tricine-NaOH (pH 8.0). To isolate the reduced forms, the pellet was resuspended in 0.5 ml of 0.3 M NaOH. All samples were then heated to 60°C for 7 minutes, followed by a neutralization step (0.5 ml of 0.3 M NaOH for the oxidized forms; 0.3 ml of 0.3 M HCl and 0.2 ml of 1.0 M Tricine-NaOH (pH 8.0) for the reduced forms). The neutralized solutions were then centrifuged (4°C, 60 min, 13000 rpm) and the supernatants were transferred to new microcentrifuge tubes.
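The harvesting and extraction volumes above can be checked with a short calculation. This is a minimal sketch, assuming that "a final OD600 of 30 in 1 ml" means the pellet contains the cells of 30 OD600·ml of culture and that OD600 scales linearly with cell density; the helper names are hypothetical.

```python
def culture_volume_ml(culture_od, target_od, target_volume_ml):
    """Culture volume whose pellet gives target_od when resuspended in target_volume_ml."""
    return target_od * target_volume_ml / culture_od


def millimoles(volume_ml, molarity):
    """Amount of solute in mmol (ml x mol/l = mmol)."""
    return volume_ml * molarity


# Culture at OD600 0.5, concentrated to the equivalent of OD600 30 in 1 ml.
harvest_ml = culture_volume_ml(culture_od=0.5, target_od=30.0, target_volume_ml=1.0)

# Oxidized forms: 0.5 ml of 0.3 M HCl extract, then 0.5 ml of 0.3 M NaOH.
oxidized_volume_ml = 0.5 + 0.5
oxidized_net_acid_mmol = millimoles(0.5, 0.3) - millimoles(0.5, 0.3)

# Reduced forms: 0.5 ml of 0.3 M NaOH extract, then 0.3 ml of 0.3 M HCl
# plus 0.2 ml of 1.0 M Tricine-NaOH (pH 8.0).
reduced_volume_ml = 0.5 + 0.3 + 0.2
reduced_excess_base_mmol = millimoles(0.5, 0.3) - millimoles(0.3, 0.3)
tricine_mmol = millimoles(0.2, 1.0)

print(harvest_ml, oxidized_volume_ml, reduced_volume_ml)
print(oxidized_net_acid_mmol, reduced_excess_base_mmol, tricine_mmol)
```

Under these assumptions about 60 ml of culture is pelleted per sample, and both extracts end at the same 1 ml final volume. In the reduced-form extract, the strong acid leaves about 0.06 mmol of NaOH unneutralized, which is presumably taken up by the 0.2 mmol of Tricine buffer added in the same step.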

Cofactor levels were measured in a 96-well microtiter plate. Either 40 μl of oxidized sample and 40 μl of 0.1 M NaCl, or 80 μl of reduced sample, was aliquoted to a single well. The 2X stock solution of the reaction mixture consisted of equal volumes of 1.0 M Tricine-NaOH (pH 8.0), 4.2 mM MTT, 40 mM EDTA, 1.67 mM PES, and substrate (either 5 M ethanol or 25 mM glucose-6-phosphate). After addition of the appropriate reaction mixture (ethanol for NAD(H), glucose-6-phosphate for NADP(H)), the plate was incubated at 37°C for 5 minutes. To start the reaction, either 10 units/ml alcohol dehydrogenase (from 100 units/ml stock) or 0.27 units/ml glucose-6-phosphate dehydrogenase (from 2.7 units/ml stock) was added. The formation of reduced MTT was monitored at 570 nm using a SpectraMax384 plate reader, with readings taken every 15 seconds for 10 minutes while the plate was incubated at 37°C. The cofactor concentration of each sample was determined by subtracting the background rate of the sample (reaction without enzyme) from the observed reaction rate and interpolating the result on a concentration curve run on the same plate.
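The interpolation described above can be sketched as follows. This is a minimal illustration, assuming a linear relationship between reaction rate and cofactor concentration over the range of the standards; the standard concentrations, rates, units, and function names are hypothetical and are not taken from the study.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def reaction_rate(times_s, absorbances):
    """Rate of MTT reduction as the slope of A570 versus time (per second)."""
    slope, _ = fit_line(times_s, absorbances)
    return slope


def cofactor_concentration(sample_rate, background_rate, curve_slope, curve_intercept):
    """Interpolate concentration from the background-corrected rate."""
    net_rate = sample_rate - background_rate
    return (net_rate - curve_intercept) / curve_slope


# Readings every 15 s for 10 min (41 time points); synthetic linear trace.
times = [15 * i for i in range(41)]
trace = [0.1 + 0.0002 * t for t in times]
example_rate = reaction_rate(times, trace)

# Hypothetical standard curve run on the same plate (concentration vs. rate).
standard_conc = [0.0, 1.0, 2.0, 4.0]
standard_rate = [0.000, 0.002, 0.004, 0.008]
slope, intercept = fit_line(standard_conc, standard_rate)

sample_conc = cofactor_concentration(
    sample_rate=0.0035, background_rate=0.0005,
    curve_slope=slope, curve_intercept=intercept,
)
print(example_rate, sample_conc)
```

Fitting the rate from all time points, rather than from two endpoints, makes the estimate less sensitive to noise in individual readings. If the standard curve is visibly non-linear at the upper end, the fit should be restricted to the linear range.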

Note

Average signal values start at column Q in Table S1 and at column D in Tables S2-S7. Expression ratios start at column H in Tables S2-S7.