Introduction

Under normal aerobic growth conditions, DNA is continuously attacked by reactive oxygen species, which can cause various types of DNA damage. Most of these DNA lesions are chemically indistinguishable from those caused by ionizing radiation1. However, the clustering of DNA damage is a distinctive feature of the effect of ionizing radiation on DNA2,3. Clustered DNA damage is defined as two or more DNA lesions within one to two helical DNA turns, and comprises distinct types of lesions, such as base damage, apurinic/apyrimidinic (AP) sites, and strand breaks. DNA damage clustering is thought to be highly relevant to the biological consequences of ionizing radiation2,3. Experiments have demonstrated that, after irradiation with low-density ionization, the yield of clustered DNA damage is at least four times greater than that of double-strand breaks (DSBs)4,5,6, although the overall yield decreases with increasing ionization density7. Based on extensive computer simulation studies, the complexity of clustered DNA damage, in terms of the number and types of damage, is predicted to increase with increasing ionizing radiation density8,9,10,11. Recently, atomic force microscopy was used to experimentally confirm that the complexity of clustered DNA damage depends on the ionization density of the radiation12,13.

Most data regarding the processing and biological consequences of clustered DNA damage have been obtained by studying synthetic DNA lesions located close to each other. Extensive in vitro studies have revealed that the repair of lesions within a cluster is often retarded, depending on the type, position, and number of lesions within a cluster (for review, see14,15,16). Detailed in vivo studies of the biological consequences of clustered DNA damage have shown that clustered DNA damage is often detrimental to cells (for review see16,17,18). The processing of bi-stranded clustered DNA damage, consisting of AP sites and single-strand breaks (SSB), likely leads to a DSB; therefore, the replication of DNA containing such clusters is strongly retarded. In contrast, DNA containing clusters of base damage generally replicates efficiently, accompanied by the enhanced mutagenic potential of the base damage.

Using a plasmid-based assay, we and others found that the increase in mutation frequency (MF) at the mutagenic base lesion in a bi-stranded clustered damage site likely results from the persistence of an SSB, which is generated as a repair intermediate of a lesion within a cluster. Taking evidence from in vitro experiments and previous in vivo studies in combination suggests that an SSB impairs the repair of the resulting mutagenic base lesion, such as 8-oxo-7,8-dihydroguanine (8-oxoG), and that the base lesion persists until replication19,20,21,22,23,24,25,26. Interestingly, the retardation of mutagenic lesion repair does not seem to be the only cause of the enhanced MF at the clustered DNA damage site. For instance, even in the absence of the main DNA glycosylase for 8-oxoG excision in Escherichia coli, Fpg, the MF of a bi-stranded cluster containing an 8-oxoG was markedly greater than that of an isolated 8-oxoG20,21,24,26. This is inconsistent with the expectation that the MF of the 8-oxoG should be similar in the absence of Fpg, whether within a cluster or not. Additionally, the MF of 5-hydroxyuracil (hU) within a bi-stranded clustered site was greater than 80%, significantly greater than the expected 50% limit27. This high MF was not observed if the opposite non-mutagenic strand replicated to a similar extent as the hU strand. Based on these results, it was postulated that the strand harboring the mutagenic lesion is preferentially replicated, leading to the concomitant loss of the opposite strand. To further gain insights into the template-strand preference, we developed an assay in which a mismatch serves as a strand-specific marker to determine the fraction of each strand used as the template around a bi-stranded clustered damage site28,29. The assay results indicated that (1) the 8-oxoG strand is the preferred template around the clustered DNA damage site to tolerate the persisting SSB on the opposite strand, (2) the size of the template-strand preference is in the order of several tens of base pairs, and (3) the size is determined by DNA polymerase I. If unrepaired, a nick or an SSB will be converted to a DSB by the arrival of a replication fork30, leading to chromosome fragmentation31,32. Although replication-dependent DSBs are repaired by a process involving RecA33,34, the central enzyme for strand transfer that mediates homologous recombination (HR), it is currently unknown whether other independent pathways are operative. We found that the template-strand preference near a bi-stranded clustered damage site was not affected either by RecA or by UvrA or UvrC, which function in recognizing or excising, respectively, DNA with bulky damage during nucleotide excision repair (NER)29.

DNA polymerase I (Pol I) participates in the maturation of the Okazaki fragment on the lagging strand during replication. During this process, the polymerase displaces the RNA primer, polymerizes DNA, and removes the displaced 5′-flap structure to enable ligation35. Additionally, Pol I functions in the DNA synthesis step of various DNA repair pathways, such as base excision repair (BER), NER, and ribonucleotide excision repair36,37. In BER, Pol I is involved in both short- and long-patch repair; it incorporates a mononucleotide at the damaged site38 and inserts multiple nucleotides 3′ downstream of the damaged site38,39. To fulfill these roles in vivo, Pol I is multifunctional and is known to possess (1) DNA polymerase activity, (2) strand-displacement activity, (3) 5′-flap endonuclease activity, and (4) 3′-to-5′ proofreading exonuclease activity. Each activity is located on a separate domain of the protein. The polymerase activity, strand-displacement activity, and 3′-to-5′ exonuclease activity are on the large (known as the Klenow) fragment, while the 5′-flap endonuclease activity is on the N-terminal side40,41. The essential residues related to these activities are known42,43,44,45,46,47, and Pol I-deficient strains have been constructed with loss of strand-displacement activity (S769A and F771A double mutants)37, 5′-flap endonuclease deficiency (Y77C)48,49, and loss of 3′-to-5′ exonuclease activity (D424A)50. To better understand the underlying mechanism(s) of the template-strand bias around a clustered DNA damage site, we aimed to explore which Pol I activity is responsible for this process. Here, using polA strains each lacking one of the Pol I activities, we present evidence that strand-displacement activity and 5′-flap endonuclease activity, which are required for the completion of long-patch repair in BER, affect the template-strand preference. Moreover, we found that the lack of 5′-flap endonuclease activity gives rise to a specific mode of strand bias around a bi-stranded clustered damage site.

Results

Experimental outline

The plasmids shown in Fig. 1A were constructed to determine which function of DNA polymerase I is responsible for determining the template-strand preference around a clustered DNA damage site. The plasmids were constructed as previously described28,29 and contained a bi-stranded clustered damage site composed of a uracil and an 8-oxo-7,8-dihydroguanine (uracil + 8-oxoG), as well as a six-nucleotide XhoI recognition sequence inserted as a strand-specific marker. A synthetic 8-oxoG lesion was inserted into the strand complementary to the uracil (Fig. 1A(a)), and plasmids with the uracil and 8-oxoG in exchanged positions were also constructed (Fig. 1A(b)). After introduction of the plasmids into E. coli, the template-strand usage was measured based on the fraction of XhoI-uncut plasmid progeny (Fig. 1B). The strand-specific XhoI sequence was on the leading strand template, and so the XhoI-uncut fraction represents the fraction where the lagging strand was used as the template. In E. coli, a six-nucleotide mismatch is refractory to mismatch repair29,51, and thus strand synthesis past the mismatch site either eliminates or retains the XhoI recognition sequence, depending on which strand is used as a template. Notably, a uracil in a clustered DNA damage site is rapidly converted into an AP site and subsequently processed to generate an SSB in E. coli cells16,52. Hereafter, the Pol I-deleted strain (CC104mutY∆polA), the 3′-to-5′ exonuclease-deficient strain (NS2001), the strand displacement-deficient strain (NS2002), and the 5′-flap endonuclease-deficient strain (NS2003) (Table 1) are referred to as ∆polA, polAexo, polA_SD, and polA107, respectively.

Figure 1
figure 1

Schematic representations of the plasmids used in the study and the experiment to determine the level of strand bias near a clustered DNA damage site. (A) Constructed plasmids. (a) A bi-stranded clustered damage site comprised of uracil and 8-oxo-7,8-dihydroguanine (8-oxoG) was created by placing 8-oxoG on the lagging strand template (blue line) and uracil on the leading strand template (red line). The plasmid pMW119f1(−)NAEPBNG was used for construction. The black triangle represents the mismatch, which was generated by inserting a 6-nucleotide recognition sequence of XhoI and was used as a strand-specific marker. The numbers show the separation (in nucleotides) from 8-oxoG within the bi-stranded cluster, with + and − indicating the 5′ and 3′ directions on the leading strand, respectively. (b) A bi-stranded clustered damage site (uracil + 8-oxoG) was created by placing 8-oxoG on the leading strand template (red line) and uracil on the lagging strand template (blue line). pMW119f1(+)NAEPBNG, which synthesizes the complementary single strand DNA of that synthesized with pMW119f1(−)NAEPBNG, was used for construction. Uracil is placed at the position of 8-oxoG in (a), and the mismatches are placed as in (a). The black arrow indicates the direction of the movement of the replication fork. (B) Outline of the experiment. The constructed plasmids were introduced into E. coli, and the propagated plasmids were extracted, followed by linearization with ScaI. The progeny plasmids propagated from the strand with the mismatch were cut via XhoI treatment. The level of strand preference was measured by determining the fraction of the intensity of the XhoI-uncut plasmids to the intensity of all fragments after gel electrophoresis (see the text for further details).

Table 1 Strains used in the study.

Template-strand preference near a bi-stranded clustered damage site

When the 8-oxoG within the cluster (uracil + 8-oxoG) was on the lagging strand template in the wild-type (WT) Pol I strain, the fraction using the lagging strand template (8-oxoG strand) was greater at both +21 and −23 (greatest at −23) than the respective fraction observed with undamaged template (Fig. 2A,B), suggesting that a strand bias is generated near a bi-stranded clustered damage site by repair and/or replication29. The use of the 8-oxoG strand as the template in uracil + 8-oxoG was greatly enhanced in the ∆polA strain, as shown in our previous study29. In polAexo, the level of strand preference was comparable to that of WT. The extent of the increase at both +21 and −23 with uracil + 8-oxoG in the polA_SD strain was markedly enhanced, and the level was only slightly less than that in the ∆polA strain. Interestingly, in polA107, the strand bias was completely different from WT. The extent of using the 8-oxoG strand as a template in uracil + 8-oxoG was markedly reduced compared with the undamaged construct at +21, but it was comparable to the undamaged construct at −23.

Figure 2
figure 2

The fraction of the lagging strand template on which 8-oxo-7,8-dihydroguanine (8-oxoG) was placed when plasmids with bi-stranded cluster comprised of uracil and 8-oxoG (uracil + 8-oxoG) were propagated in Pol I-deficient strains. The relative positions of each damage and mismatch are schematically shown at the top of the figure. (A) The fraction of the lagging strand template (8-oxoG strand) at position +21. (B) The fraction of the lagging strand template (8-oxoG strand) at position −23. Black bars: no damage, Green bars: uracil + 8-oxoG bi-stranded cluster. polA genotypes of the strains are shown along the horizontal axis. ∆polA: strain lacking total Pol I activity (CC104mutY∆polA), polAexo: strain lacking 3′ to 5′ exonuclease activity (NS2001), polA_SD: strain lacking strand displacement activity (NS2002), polA107: strain lacking 5′-flap endonuclease activity (NS2003). Each data represents the mean ± standard error of at least 3 independent experiments. Statistically significant differences between the fractions with and without a bi-stranded clustered damage site are indicated (**p < 0.01).

The strand preference of single lesions likely reflects the outcome of BER. Distinct levels of the use of lagging strand template were observed between single lesions (Fig. S1) and uracil + 8-oxoG (Fig. 2) at both +21 and −23 in the WT, implying that most, if not all, of the bi-stranded clustered damage site is specifically processed by an as yet unknown pathway29. The strand bias of single lesions in polAexo was quite similar to WT, while in ∆polA, polA_SD, and polA107, different modes of strand bias were observed compared with WT. Furthermore, the strand bias around a single uracil was slightly different from that around a single 8-oxoG in each strain. We observed that the levels of strand preference around a single uracil in both polA_SD and polA107 were less than ∆polA (Fig. 1S). This suggests that some of the single uracil bases are removed by short-patch repair, as only ∆polA lacks the ability to complete short-patch repair. Less frequent usage of the lagging strand template was observed with a single 8-oxoG at + 21 in ∆polA and polA107 than in WT, whereas the usage was comparable between polA_SD and WT (Fig. 1S). These results suggest that the impairment of 5′-flap endonuclease activity strongly compromises short-patch repair of 8-oxoG, and that a backup pathway for 5′-flap removal is present.

Mutation frequencies of bi-stranded clustered damage sites in polA mutants

As the extent of template-strand preference around a bi-stranded clustered damage site depends on the activity of Pol I (Fig. 2), we next assessed the MF of uracil + 8-oxoG in polA mutants. As a mutation in the recognition sequence of a restriction enzyme leads to the inability to cut the sequence by the enzyme, the MF was estimated by the fraction of uncut Alw26I recognition sequence within which uracil and 8-oxoG were placed (Table S1)28,29. The 8-oxoG was placed on the lagging strand template. The MFs of the ∆polA and polA_SD strains were significantly greater than WT (p < 0.01), while the MF of the polAexo strain was comparable to WT (Fig. 3). In polA107, the MF of uracil + 8-oxoG was quite low and comparable to that of the undamaged construct, supporting the finding from Fig. 2 that the processing mode of uracil + 8-oxoG in polA107 is quite different from that in WT or other polA mutants. We found that the level of MF in each strain was, to some extent, correlated with the level of the 8-oxoG strand fraction at + 21 (Fig. 2A).

Figure 3
figure 3

The MF of a bi-stranded clustered DNA damage site in strains lacking Pol I activity. 8-oxo-7,8-dihydroguanine (8-oxoG) was placed on the lagging strand template. Black bars: no damage, green bars: uracil + 8-oxoG bi-stranded cluster. polA genotypes of the strains are shown along the horizontal axis. ∆polA: strain lacking total Pol I activity (CC104mutY∆polA), polAexo: strain lacking 3′ to 5′ exonuclease activity (NS2001), polA_SD: strain lacking strand displacement activity (NS2002), polA107: strain lacking 5′-flap endonuclease activity (NS2003). Each data represents the mean ± standard error from at least 3 independent experiments. Statistically significant differences between the fractions with and without a bi-stranded clustered damage site are indicated (**p < 0.01).

Strand annealing by RecT or RecO is not involved in the process leading to strand preference around a bi-stranded clustered damage site

We found previously29 that the key protein in HR (RecA) and the proteins required for NER (UvrA and UvrC) do not take part in determining the template-strand preference around a clustered DNA damage site. Here, we examined mutants that are deficient in strand-annealing activity to determine whether this activity plays any role in strand preference. RecT and RecO are known to possess strand-annealing activity53,54 and to function in the RecE and RecF recombination pathways, respectively55. Although our experimental strain derived from CC10456 does not harbor a sbcA mutation activating the RecE recombination pathway57, we aimed to determine whether RecE recombination plays a role in the biased template-strand usage by any spontaneous activation of sbcA in our strains. Our results with the recET-disrupted strain revealed that the strand-annealing activity of RecT is unlikely to play a significant role in determining strand preference, as the template-strand usage fractions around the uracil + 8-oxoG cluster (+21 and −23) in each mutant did not significantly differ from WT (Fig. 4). Similarly, RecO was not strongly involved in determining strand bias around the uracil + 8-oxoG cluster (Fig. 4).

Figure 4
figure 4

The fraction of the lagging strand template on which 8-oxo-7,8-dihydroguanine (8-oxoG) was placed when plasmids were propagated in strains lacking RecET or RecO. (A) The fraction of the lagging strand template (8-oxoG strand) at +21. (B) The fraction of the lagging strand template (8-oxoG strand) at − 23. Black bars: no damage, Green bars: uracil + 8-oxoG bi-stranded cluster. Strains deficient in strand annealing activity are shown along the horizontal axis. recET: strain that lacks RecET activity (NS2004). recO: strain that lacks RecO activity (NS2005). Each data represents the mean ± standard error from at least 3 independent experiments. Statistically significant differences between the fractions with and without a bi-stranded clustered damage site are indicated (**p < 0.01).

Template-strand preference around clusters with reversed lesion positions relative to the direction of replication fork movement

Strand bias may emerge at or after replication, and thus, the different modes of replication for the leading and lagging strands may affect the outcome. Therefore, we next determined the strand preference when the location of the lesions was reversed relative to the direction of replication fork movement. In this reversed setup, the 8-oxoG within the cluster was on the leading strand template (Fig. 1A(b)). In the WT, the fraction of the lagging strand template used, representing the level of use of the uracil strand, was lower at either position than the respective fractions without any damage (+21 or −23, although much less at +21 with uracil + 8-oxoG) (Fig. 5). The fractions in the Pol I mutants, except for polA107, also showed less usage of the uracil strand. Comparing these results with the results in Fig. 2 indicates that the 8-oxoG strand is preferentially used as the template in both configurations of the lesions, and there is little strand specificity. Even in polA107, where the usage of the uracil strand at position −23 was increased in the presence of bi-stranded clustered damage (Fig. 5, p < 0.01), strand specificity was not observed. These results indicate that the template-strand preference around a bi-stranded clustered damage site is not strongly affected by the direction of replication fork movement.

Figure 5
figure 5

The fraction of the lagging strand template, on which uracil was placed, when plasmids with uracil + 8-oxoG bi-stranded cluster were propagated in Pol I-deficient strains. (A) The fraction of the lagging strand template (uracil strand) at +21. (B) The fraction of the lagging strand template (uracil strand) at −23. Black bars: no damage, Green bars: uracil + 8-oxoG bi-stranded cluster. polA genotypes of the strains are shown along the horizontal axis. ∆polA: strain lacking total Pol I activity (CC104mutY∆polA), polAexo: strain lacking 3′ to 5′ exonuclease activity (NS2001), polA_SD: strain lacking strand displacement activity (NS2002), polA107: strain lacking 5′-flap endonuclease activity (NS2003). Each data represents the mean ± standard error of at least 3 independent experiments. Statistically significant differences between the fractions with and without a bi-stranded clustered damage site are indicated (**p < 0.01).

With single lesions, a strand-specific mode of template preference was observed, to some extent. The strand preference was unaffected at the 5′ side of a single 8-oxoG (−23) when it was located on the lagging strand in each strain, except for ∆polA (Fig. S1), whereas when the 8-oxoG was on the leading strand, the 5′ side of the 8-oxoG (+21) gave rise to a slight strand bias in every strain (Fig. S2). This difference suggests that the repair of a single 8-oxoG is, to some extent, strand specific.

Template-strand usage in wild-type or polA107 clones

To better understand how this strand bias was caused, we next assessed the distribution of the template-strand usage level for each transformed clone (Table 2). Notably, the average template-strand usage in each clone was comparable to the overall template-strand usage analyzed in Figs. 2 and 5. In the WT, when the uracil + 8-oxoG contained the 8-oxoG on the lagging strand template, the distribution markedly changed from that without damage (Table 2, p < 0.01 vs. no damage at both the +21 and −23 sites): most clones used only one of the strands at +21 (79.6%) or −23 (87.8%). Similarly, when the 8-oxoG of the uracil + 8-oxoG was on the leading strand, most clones at +21 (69.4%) or −23 (78.4%) used only one of the strands as a template (Table 2, p < 0.01 vs. no damage at both the +21 and −23 sites). These results were expected, because the 8-oxoG strand is the preferred template in the WT (Figs. 2 and 5); thus, the majority of clones used only one strand as a template near a bi-stranded clustered damage site.

Table 2 Template strand usage in wild-type or polA107 colonies.

The characteristic strand bias in polA107 (Figs. 2 and 5) prompted us to investigate the distribution of template usage in polA107. One of the template strands was predominantly used at +21 (77.1%) or −23 (60.0%) when the 8-oxoG of the uracil + 8-oxoG was on the lagging or leading strand, respectively. The respective fractions were much lower in plasmids without damage (44.9% and 37.5%) (Table 2, p < 0.01). However, the presence of uracil + 8-oxoG did not alter the mode of distribution of the template-strand usage (Table 2) at −23 or +21 when the 8-oxoG was on the lagging or leading strand, respectively.

Discussion

In a previous study29, we showed that Pol I is required to define the level and size of the preferential use of a template strand harboring a mutagenic lesion in a cluster. Herein, we have gained further insights into the preferential use of a mutagenic template by studying how the Pol I activity, and whether the relative direction of the movement of the replication fork, affect the template-strand usage.

The preference for the 8-oxoG strand and the significance of long-patch repair in the Pol I-proficient strain

In the WT, the preferred usage of the 8-oxoG strand within the uracil + 8-oxoG is in line with the observed enhanced mutagenic potential of the mutagenic lesion in a bi-stranded clustered damage site (Fig. 3)16,17,18. We found that the 8-oxoG strand was used preferentially, irrespective of which strand (the leading or lagging strand template) the 8-oxoG was located on within the cluster (Figs. 2 and 5). These results are consistent with the interpretation that the uracil in the uracil + 8-oxoG is excised first, retarding the subsequent repair of the 8-oxoG, and that the template-strand preference in the WT is not affected by which strand is used as the template for replication, but by the relative location of the lesions (Figs. 2 and 5). The usage of the 8-oxoG strand independent of the direction of the replication fork is in agreement with our earlier findings that reversing the direction of (and thus inverting the damage-containing strand relative to) the movement of the replication fork did not affect the level of MF of a bi-stranded clustered damage site21,58. Further, the lack of strand specificity suggests that any RecA-independent recombination or template switching mechanism associated with crossovers, which are proposed to proceed via annealing of nascent strands across a blocked replication fork, are unlikely to be involved, as these events requires discontinuity on the lagging strand, and thus, exhibits lagging strand specificity59. In addition, no strand bias during the preferential use of the 8-oxoG strand suggests that the process is unlikely to be related to the process with increased fidelity of lagging strand replication, which is proposed to result from the increased dissociability of lagging strand DNA polymerase60.

During BER long-patch repair, Pol I synthesizes DNA by incorporating multiple nucleotides while displacing the downstream strand, and subsequently excises the 5′-flap generated38,39. The last step, SSB ligation, is most likely inhibited in the WT, because a study using mammalian cell extracts demonstrated that, with a bi-stranded cluster, long-patch repair is impeded at ligation, not at the strand-displacement or 5′-flap removal steps61. The observation that cells display a greater fraction of the 8-oxoG strand at the 3′ end of the opposing lesion (uracil) irrespective of the direction of the fork (Figs. 2 and 5) supports the notion that incomplete long-patch repair initiates an SSB tolerance pathway and leads to template-strand preference in the WT. Since the tract size for long-patch repair in E. coli is in the range of 9 to 27 nucleotides62, we consider it likely that Pol I displaces the majority of the strands 3′ of the uracil. In contrast to the significance of the strand-displacement and 5′-flap endonuclease activities, the 3′-to-5′ proofreading exonuclease activity of Pol I had a minor, if any, effect on strand usage (Figs. 2 and 5). This suggests that the 3′-to-5′ exonuclease activity does not affect the 5′ side of the uracil, and that this activity is not essential for the strand bias surrounding a bi-stranded clustered DNA damage site.

Template-strand preference in strains deficient in different Pol I activities

The preferential use of the 8-oxoG strand as the template in the presence of uracil + 8-oxoG was further promoted when the strand-displacement activity was impaired (Figs. 2 and 5). This enhanced use of the 8-oxoG strand is consistent with the elevated MF in the polA_SD strain (Fig. 3). The deficiency of this activity resulted in only a slight decrease in the level of strand bias compared with that seen with total loss of Pol I activity (∆polA)29, implying that a bi-stranded clustered damage site is not mainly processed by short-patch repair and that a similar, if not the same, unknown backup pathway operates in both polA_SD and ∆polA. With single damage sites (Figs. S1 and S2), we found that the strand bias was less pronounced in polA_SD than in ∆polA. We believe that some of the single lesions are repaired by short-patch repair in the polA_SD strain.

Interestingly, the lack of 5′-flap endonuclease activity in polA107 led to a completely different mode of strand usage around a bi-stranded clustered damage site from that seen in the WT, or in the ∆polA and polA_SD strains (Figs. 2 and 5). In polA107, the strand bias at the 3′ side of the U (−23 and +21 when 8-oxoG is on the lagging and leading strand templates, respectively) in a bi-stranded clustered damage site was quite similar, if not the same, to the strand bias observed for the undamaged construct at the respective sites (Figs. 2 and 5, Table 2). These results indicate that the uracil strand was most likely not displaced, which is consistent with an in vitro study demonstrating that Pol I deficient in 5′-flap endonuclease activity does not displace the downstream strand and inhibits ligation63. Interestingly, a certain fraction of the 3′ strand of a single uracil (not within a cluster) was displaced in polA107 (Figs. S1 and S2), indicating that the uracil processing pathway depends on the presence of a nearby lesion. In contrast to the strand bias at the 3′ side of the U, the majority of the 5′ side of the uracil (which is the 3′ side of the opposing 8-oxoG) in a bi-stranded clustered damage site used the uracil strand as a template, irrespective of on which strand (leading or lagging strand template) the 8-oxoG was placed. Together with the markedly reduced MF in polA107, we believe that 8-oxoG is excised at a bi-stranded clustered damage site and that repair synthesis occurs downstream (3′) of 8-oxoG in polA107. This strand bias at the 3′ side of 8-oxoG was also observed with a single 8-oxoG in polA107.

These findings in polA_SD and polA107 suggest that an SSB in a bi-stranded clustered damage site is tolerated even in the absence of fully functional Pol I, and that different pathways operate to tolerate the SSB, based on the residual activities of Pol I. Notably, we observed no influence of the direction of replication fork movement on these SSB toleration mechanisms, indicating that they are not strand specific.

The process leading to strand preference

The findings—(1) that a lesion opposed to an 8-oxoG within a bi-stranded cluster is converted to an SSB whose repair is strongly retarded16,17,18, and (2) that the strand preference around a bi-stranded clustered damage site is affected very differently depending on which activity of Pol I is disrupted (Figs. 2 and 5, Table 1)—led us to suggest that the template-strand preference is associated with SSB tolerance pathways in which Pol I plays a crucial role. These potential SSB tolerance pathways are all unaffected by the relative position of the lesions with regard to the direction of the movement of the replication fork (Figs. 2 and 5), and are unrelated to the activity of RecA29 and the strand-annealing activities of RecT and RecO (Fig. 4). We, therefore, suggest that the mechanisms likely do not involve recombination between sister chromatids, but operate before replication. Our study proposes a model in which at least three different pathways give rise to strand bias (Fig. 6). The uracil in the uracil + 8-oxoG is likely processed first, by uracil DNA glycosylase generating an AP site. The major enzyme that cleaves AP site from uracil is most likely exonuclease III in E. coli, as mutants in dUTPase (dut) that incorporate large amounts of uracil into DNA are inviable in the presence of additional mutations in exonuclease III64. With its 3′ phosphatase activity, exonuclease III cleavage of the 5′ side of the AP site leaves 3′OH and 5′-deoxyribose-phosphate (dRP) at the break65. In the Pol I-proficient strain (Fig. 6, Pathway A), SSB is tolerated by a mechanism that leads to the formation of a gap with a size of several tens of base pairs, which is subsequently filled in most probably by Pol I using the mutagenic strand as a template29. The potential 3′ to 5′ exonuclease(s) involved in generating the gap 5′ upstream of the SSB is currently unknown. The pathway in the Pol I-proficient strain (pathway A) may resemble the long-patch repair subpathway identified by Woodrick et al.66, in which gap filling occurs not only downstream (3′) of the SSB but also upstream (5′). In this context, gap filling gives rise to strand bias and thus elevates the MF, as the 8-oxoG:adenine pair is likely to be generated prior to replication. In strains that are deficient in strand displacement (∆polA and polA_SD) (Fig. 6, Pathway B), the 5′-flap is not generated, and the SSB likely persists, as the short patch repair of an SSB within a cluster is strongly compromised15,16,17. The persisting SSB is likely tolerated by an unknown pathway, which forms a gap by both 5′ and 3′ digestion from the SSB. As this digestion on both sides of the break generates a gap with a relatively large size of several hundred base pairs29, we assume that the 5′ and 3′ nucleases involved in pathway B are different from the nucleases in pathway A. One potential candidate for the 5′ nuclease is RecJ, as RecJ possesses a dRPase activity which cleaves the 5′-dRP at the break67. The difference in the MF between the Pol I-proficient and strand displacement-deficient strains (Fig. 3) can be explained by the involvement of distinct DNA polymerases with different frequencies of adenine insertion opposite the 8-oxoG. Interestingly, the strand bias in cells deficient in 5′-flap endonuclease activity indicates that the 5′-flap generated 3′ of the SSB derived from the uracil remains unexcised (Figs. 2 and 5, Table 2). We assume that the 5′-flap is eventually reannealed to the original template (Fig. 6, Pathway C). The strand that is synthesized downstream of the initial SSB is unwound and removed by 3′ nuclease(s), followed by gap filling and ligation. The use of the 5′ side of the SSB as a template (Figs. 2 and 5, Table 2), together with the low MF observed in polA107 (Fig. 3), suggests that the 8-oxoG is subsequently excised and that, at least, some of the 3′ side of the 8-oxoG is displaced with repair synthesis. The precise mechanisms underlying the excision or digestion of the single-stranded DNA in the 5′-flap endonuclease-deficient polA107 strain remain unknown. Taken together, our study suggests that (1) different SSB tolerance/repair mechanisms are provoked, depending on how the SSB is processed by WT or mutated Pol I, and (2) the different tolerance/repair mechanisms lead to a specific template-strand bias and determine the mutagenic potential of the bi-stranded clustered damage site.

Figure 6
figure 6

Possible mechanisms for the occurrence of strand bias around a bi-stranded cluster containing uracil and 8-oxo-7,8-dihydroguanine (8-oxoG). The uracil in the uracil + 8-oxoG cluster is excised first and generates an SSB. Pathway (A): When wild-type Pol I is present, a gap is generated from 5′-flap cleavage after strand displacement and from the digestion of 3′ single-stranded (ss) DNA. Filling in the gap leads to a relatively short patch size of several tens of nucleotides. Pathway (B): When strand displacement activity (or total Pol I activity) is absent, the persistent SSB generated at a bi-stranded clustered damaged site is processed by an unknown mechanism that requires a long patch size of several hundred nucleotides. Since the preferential use and the mutagenic potential of the 8-oxoG strand is more enhanced in the strand displacement-deficient strain, the polymerase(s) involved in pathways A and B could be different. Pathway (C): In the absence of 5′-flap endonuclease activity, the 5′-flap is not excised and reanneals to the original template. The resulting 3′ ssDNA is digested to generate a strand break. Before reannealing the 5′-flap, 8-oxoG could be excised and processed similarly to a single 8-oxoG. Subsequently, the strand breaks generated after reannealing of the 5′-flap and generated downstream of 8-oxoG are both sealed.

The findings of the present study further deepen our understanding of the essential role played by Pol I activity in determining the extent of the preferential use of the mutagenic strand as a template, and that different backup pathways likely rescue the persistent SSB depending on which activity of Pol I (strand-displacement activity or 5′-flap endonuclease activity) is compromised. Pol I has long been known to play a significant role in BER, but despite extensive studies on the function of Pol I, its importance in preventing detrimental consequences after irradiation remains unclear. As SSBs are regularly generated in the genome by direct scission or after the excision of diverse types of damage, further studies are needed to fully reveal the role of Pol I in the tolerance/repair of SSBs in cells.

Materials and methods

Strains

The strains used in the study are listed in Table 2. The CC104mutY::tet strain was provided by Prof. Q.M. Akiyama (Kyoto University). The MG1655 polA_SD strain harboring polA F769A/F771A::cat allele was a kind gift from Prof. R. Woodgate (National Institutes of Health), and the KA796 polAD424A::cat strain was obtained from Prof. I. J. Fijalkowska (Institute of Biochemistry and Biophysics). Strain KMBL1789 was obtained from the Coli Genetic Stock Center (CGSC). Strains with the CC104mutY::tet background that lacked one of the activities of Pol I were constructed through P1 transduction, which uses P1 phage for moving portions of the E. coli genome from one genetic variant to another, or by disrupting the gene via the method developed by Datzenko and Wanner68, which utilizes λ Red recombinase to integrate PCR amplified fragments into the E. coli genome (Table 2). For NS2003 construction, firstly, a DNA fragment containing the chloramphenicol acetyltransferase (cat) gene was PCR amplified from KA796 polAD424A::cat using the primers 5′-ATCAGGCGCACTAAGATTCG-3′ and 5′-GACTGGGGCGGCTAAAATA-3′. Then, the amplified cat gene was inserted at the 3′ side of the polA107 allele in the strain KMBL1789 using the method of Datzenko and Wanner to construct NS0001. The polA107::cat fragment in NS0001 was further transferred by P1 transduction to CC104mutY::tet to produce NS2003. The plasmids used for gene disruption, pKD46, pKD13, and pCP20, were from CGSC. The strains used to analyze strand bias were MutY deficient. As MutY is responsible for removing adenine residues misincorporated in 8-oxoG:adenine mispairs, and thus, avoiding G:C to T:A mutations, this deficiency enabled monitoring of the reparability of 8-oxoG due to its mutagenic potential21,24. Bacterial cultures were grown at 37 °C in Luria–Bertani (LB) medium supplemented with appropriate antibiotics.

Oligonucleotides and plasmids

The oligonucleotides, whose sequences are shown in Table S1, were purchased from Tsukuba Oligo Service (Tsukuba, Japan). U-1EP and 8G-2EP are identical to NAEP except for the presence of uracil in U-1EP and 8-oxoG in 8G-2EP. Both U-1EPX + 21 and 8G-2EPX + 21 contain the 6-nucleotide XhoI recognition sequence (5′-CTCGAG-3′) and are identical to NAEPX + 21 except for the presence of uracil in U-1EPX + 21 and 8-oxoG in 8G-2EPX + 21. Similarly, U-1EPX-23 and 8G-2EPX-23 both contain the XhoI sequence and are identical to NAEPX-23 except for the presence of uracil in U-1EPX-23 and 8-oxoG in 8G-2EPX-23. Ura and 8OXOG are identical to BNG except for uracil and 8-oxoG in Ura and 8OXOG, respectively. Uracil and 8-oxoG were located within the recognition sequence of Alw26I. NAEP and BNG are complementary sequences, and pMW119f1(−)NAEPBNG and pMW119f1(+)NAEPBNG were constructed by replacing the HindIII–NdeI fragment of pMW119 with the HindIII–NdeI fragment of pGEM3Zf(−)NAEPBNG and pGEM3Zf(+)NAEPBNG, respectively, as described previously29. pMW119f1(+)NAEPBNG is identical to pMW119f1(−)NAEPBNG except for the direction of the origin of the f1 phage. Thus, pMW119f1(+)NAEPBNG generates single-stranded DNA (ssDNA) complementary to that from pMW119f1(−)NAEPBNG upon the addition of the helper phage. The number (e.g., +21 or −23) in the oligonucleotide indicates the position of the mismatch in the plasmids pMW119f1(−)NAEPBNG (Fig. 1A(a)) and pMW119f1(+)NAEPBNG (Fig. 1A(b)).

Construction of plasmids with DNA damage and mismatch

Plasmids with DNA damage and mismatch were constructed according to methods described previously29. In brief, VCM13 helper phages were added to cultures of CJ236 harboring pMW119f1(−)NAEPBNG or pMW119f1(+)NAEPBNG grown in the presence of uridine (75 µg/mL), and phage particles were precipitated overnight in the presence of 5% polyethylene glycol (average MW of 6000) and 0.5 M NaCl at 4 °C. Uracil-containing circular ssDNA was extracted from the pelleted phage particles with lysis buffer (10 mM MOPS, pH 6.5, 1% Triton X-100, 500 mM guanidine-HCl) and a Qiagen HiSpeed Plasmid Midi kit (Qiagen, Tokyo, Japan). Fifty pmol of oligonucleotides (Table S1) were annealed with 20 pmol of Uracil-containing circular ssDNA. The first strand was synthesized from the annealed primer in the presence of 50 mM Tris–HCl, pH 7.5, 5 mM MgCl2, 1 mM DTT, 0.5 mM ATP, 0.185 mM dNTPs, 5% PEG (average MW of 8000), 0.2 units of pyrophosphatase (New England Biolabs, Tokyo, Japan), 50 units of native T7 DNA polymerase (New England Biolabs, Tokyo, Japan) and 15 units of T4 DNA ligase (Thermo Fisher Scientific, Tokyo, Japan) at 33 °C for 30 min, followed by heat inactivation of the enzymes at 75 °C for 20 min. After digestion of the uracil-containing template strand at 37 °C for 1 h with 3 units of UDG, 50 units of exonuclease I, and 50 units of exonuclease III (all from New England Biolabs, Tokyo, Japan), the resulting single stranded circular first strand was purified with a QIAquick Gel Extraction Kit (Qiagen, Tokyo, Japan). Twenty pmol of the 5′-phosphorylated primer was annealed to the first strand, and the second strand was synthesized. The 8-oxoG-containing primers were used for first strand synthesis, and the uracil-containing primers were used for second strand synthesis. The final product (circular dsDNA) was gel-purified with a QIAquick Gel Extraction Kit.

Transformation and analysis of the amplified plasmids

The constructed plasmids (6.6 ng) were electroporated into E. coli cells with a Bio-Rad E. coli pulser set at 1.8 kV, 200 Ω and 25 µF. Under our experimental conditions, each type of plasmid generated 2000–3000 transformants (Table 2S and 3S). Plasmids from the overnight culture of transformants were extracted with a QIAprep Spin Miniprep Kit (Qiagen, Tokyo, Japan). For the analysis of the plasmids from each clone, bacterial culture was grown from each colony, and the plasmids were subsequently extracted. To quantify the fraction of the strand used as a template, plasmids were first linearized with ScaI, and the linearized fragments were further digested with XhoI (Fig. 1B). To measure the MF, the propagated plasmids were digested with Alw26I. After digestion, the plasmids were electrophoresed in an agarose gel, and the relative intensities of the ethidium bromide-stained DNA fragments were measured with a gel imaging system (FluoroChem Imager, Proteinsimple, Tokyo, Japan). The fraction of the strand used as a template was estimated based on the intensity of the uncut fragment relative to all fragments (both cut and uncut fragments). Similarly, the fraction of the uncut Alw26I plasmid was quantified to estimate the MF. The transformation was repeated at least three times for each plasmid construct, and the statistical significance of the differences in the fractions of uncut fragments between the plasmids was determined with the two-tailed t-test. For the analysis of the distribution of template strand usage in each clone, the significant difference was verified by the χ-square test.