Determination of DraIII Fidelity Index (FI)
A quantitative definition of star activity is provided by the Fidelity Index (FI), which is the ratio of the highest quantity of a REase showing no star activity during digestion to the lowest quantity needed for complete digestion on canonical sites on a specific substrate DNA (Wei et al., 2008). A higher FI corresponds to a higher cleavage fidelity of the REase. DraIII has an FI of 2 on λ DNA under reaction conditions described in MATERIALS AND METHODS (Fig. 1A).
Determination of DraIII star sites
In addition to the expected bands resulting from cleavage of the 10 canonical CACNNNGTG sites, extra discrete bands were observed in the cleavage reactions of λ DNA (Fig. 1A). These bands were attributed to star activity. The defined size of the star bands from cleavage reactions of λ DNA suggests that the DraIII star activity is not random and has certain specificity. It is difficult to map the DraIII star sites in λ DNA because of its large size (48.5-kb) and the large number of possible star sites. A smaller 2.7-kb pUC19 DNA was used to determine DraIII star sites. Although it does not contain the DraIII canonical site, the pUC19 DNA was linearized by DraIII (data not shown), showing the presence of one or more DraIII star sites. Double (BamHI and XmnI) and triple (BamHI, XmnI and DraIII) digestion on pUC19 suggest that DraIII star activity generates two star bands through one cut (Fig. 1B). According to the approximate size of star bands, the CATGTTGTG site (nt 2033) which only differs from the canonical sequence by one base was hypothesized to be DraIII star site. Cleavage on predicted CATGTTGTG star site generated the 1.6-kb and 0.3-kb star bands (Fig. 1B, Lane 2). A larger 22.6-kb plasmid DNA pXba was used to verify the predicted star site. pXba contains three DraIII canonical sites (nt 5816, 16551 and 17404 respectively) and one predicted star site CATGTTGTG (nt 21910). It was found that DraIII generated the 6.5-kb and 4.5-kb star bands consistent with cleavage at the CATGTTGTG site (Fig. 1C, Lane 2).
Since DraIII’s canonical specificity has no preference for the middle three nucleotides (Grosskopf et al., 1985), the specificity of these nucleotides within the star site CATNNNGTG was investigated. There are 11 such sites in pXba and the sequences containing these sites were tested independently on oligonucleotide duplex DNAs carrying each of the sites (Table S1). Among the 11 CATNNNGTG sites (Table S1, DNA2–12), cleavage was observed where NNN = AAA, GTG, TAC, ATG and GTT (Fig. 1D). Apparently the star activity of DraIII has a sequence preference for the central three nucleotides, whereas the canonical cleavage activity does not (Grosskopf et al., 1985).
DraIII digests star site sequence in asymmetrical pattern
To locate the exact cleavage position within the CATGTTGTG star site, 39-bp duplex DNA which contains canonical or star site and a 5′ Cy5 fluorophore on each strand (Fig. 2A) was digested by DraIII and separated by 20% acrylamide urea PAGE (Fig. 2B). The single strand product was compared with synthesized 15, 16, 17, 25, 26 and 27-nt single strand DNA markers (Fig. 2A and 2B). The PAGE result confirmed that DraIII star site is CAT↑GTT↓GTG (↑ indicates nicking on the bottom strand; ↓ indicates nicking on the top strand) (Fig. 2B).
To decipher the DraIII star activity cleavage mode, 39-bp duplex star DNA was used for time course analysis. Cleavage products were collected at different time points and analyzed by denaturing urea polyacrylamide gel electrophoresis. The 16-nt single strand product (5′-Cy5-catgatccccCATGTT) represents nicking at the top canonical half site and the 26-nt single strand product (5′-Cy5-atggtagagacgcttttttgCACAAC) represents nicking at the bottom star half site (Fig. 2A, star DNA; Fig. 2C). The nicking on the two halves of the DraIII star sequence occurs at different rates: the canonical half site GTG was cleaved faster than the star half site CAT (Fig. 2D). No difference in the cleavage rate of the two half sites for the canonical DraIII site was observed (data not shown). We further tested whether DraIII star activity cleaves the star half site at an equal or slower rate if the star half site CAT is present in both half sites arranged in the pseudo-palindrome (CATGTTATG). DraIII showed no significant activity on this pseudo-palindromic site (Fig. 1D, DNA13). This shows that under star conditions DraIII cleaves the canonical half site faster than the star half site, and that it requires one canonical half site for cleavage to occur. Altering the flanking sequence (Table S1, DNA14) showed no influence on the cleavage of the CATGTTGTG star site (data not shown).
Overall protein structure and catalytic sites
The structure of DraIII was determined in the presence of an 11-bp canonical DNA duplex containing a phosphorothioate at both of the scissile phosphodiester bonds (Fig. 3B). A complete protein structure was derived from a protein-phosphorothioate DNA duplex in the presence of magnesium chloride, although the DNA was not observed in the structure (Fig. 3A and Table 1). DraIII exists as a homodimer and dimerization occurs in the C-terminus region where the ββα-metal HNH active site is located. Two zinc- and one magnesium-binding sites are found in each DraIII monomer (Fig. 3A).
Table 1 Data collection and refinement statistics The DraIII monomer contains a ββα-metal fold (β1-β2-α12) in the C-terminal domain where the HNH active site is located (Fig. 3C). Sequence and structure alignments reveal that amino acid residue N202 in DraIII, positioned to coordinate the divalent metal cofactor, corresponds to N113 in PacI, N165 in Hpy99I and Q175 in KpnI, among known HNH REases (Fig. 4). H183 in DraIII corresponds to H149 in Hpy99I and H149 in KpnI, the general base that activates a nucleophile for the hydrolysis of the phosphodiester bond (Fig. 4). To verify their role in catalysis, H183 and N202 were mutated to Ala. Mutants H183A and N202A were overexpressed and were found to be inactive at the cell lysate level (Table 2). Superposition of the DraIII closed structure to Hpy99I, PacI and T4 Endo VII shows the high structural conservation of the HNH active site (Fig. 4).
Table 2 Activity and FIs of DraIII variants A number of HNH endonucleases display a series of histidine and cysteine residues with a characteristic spacing that allows for coordination of a zinc ion in a cross-brace structure. Most HNH endonucleases (REases, homing endonucleases and non-specific endonucleases) contain a CXXC/CXXC zinc-binding domain in the vicinity of the HNH active site that ligates the latter to the rest of the protein. Hpy99I and PacI have an extra copy of the CXXC/CXXC domain near the N-terminus (Shen et al., 2010; Sokolowska et al., 2009). The N-terminal zinc binding site (consisting of C4, C22, H28 and C32) and the C-terminal zinc binding site (C157, C160, H198 and C201) of DraIII (Fig. 3C) deviate from the CXXC/CXXC configuration. Similar to other HNH REases and homing endonucleases, the two zinc ions are tetrahedrally coordinated by four cysteine/histidine residues in the DraIII structure (Fig. 3A).
The coordination of the zinc ion in the second zinc binding site has been shown to play an essential role in the catalytic activity of T4 Endonuclease VII (Giraud-Panis et al., 1995). The removal of the two zinc ions by denaturing the enzyme in urea has also been shown to adversely affect the folding, thermostability and cleavage activity of KpnI (Saravanan et al., 2007a). Alanine mutation of any one of the eight zinc coordinating cysteine or histidine residues rendered the DraIII enzyme inactive at the cell lysate level (Table 2).
Interactions between the N- and C-terminal domains
The DraIII structure is composed of two domains: the N-terminal domain (α1–9) and the C-terminal domain (α10–13, β1–2) (Figs. 3C and 5A). The two domains are connected by the turn between α9 and α10, opening up to the solvent at the end of α3 of the N-terminal domain and the turn connecting the two β strands of the ββα-metal fold in C-terminal domain (Fig. 5A, cartoon mode). More importantly, the domain interface opens up to the active site tunnel the substrate DNA would bind to (Fig. 5A, surface mode). Six potential hydrogen bonds are found between the two domains along the domain interface, which can be divided into three regions: the mouth, middle and hinge region (Fig. 5A). A hydrophobic patch is found at the hinge region (Fig. 5A).
To study the role of the potential hydrogen bonds that connect the two domains, a series of mutations were made to the hydrogen bond donors and acceptors along the domain interface. T181 was mutated to Ala to disrupt the two potential hydrogen bonds located at the middle region between T181 and D55 (Fig. 5A, middle region). The mutant T181A showed remarkable high fidelity—no star activity was observed at very high enzyme concentrations (FI ≥ 4000) (Fig. 5B). To verify the role of these hydrogen bonds in cleavage fidelity, mutations T181S and T181Y, which potentially retain the hydrogen bonds, were created. Both mutants T181Y and T181S had the same FI as the WT (Table 2). Mutating T181 further to residues that do not support hydrogen bond formation, namely, Gly, Cys, Val, Met, Leu, Asn or Gln, and D55 to Ala improved the cleavage fidelity to different extents (Table 2). Mutation of T181 to Tyr did not improve the fidelity when tested in cell lysate preparations. Unfortunately, the mutant protein T181Y could not be purified to high enough quantity for further characterization.
Mutation H189A, which potentially disrupts two potential hydrogen bonds in the middle region between H189, D55 and A56 also resulted in high fidelity (Fig. 5A and Table 2). The two potential hydrogen bonds between R187 and E107 are located at the mouth region (Fig. 5A). Mutating R187 to Ala also improved cleavage fidelity (FI ≥ 2000) (Table 2). Mutation of L129, L164, F169, or V179 in the hydrophobic patch at the hinge region to Ala also increased cleavage fidelity, albeit to a lesser extent (Table 2).
Differential scanning calorimetry (DSC) analysis of the DraIII high fidelity mutants
To evaluate the effect of the domain interface mutations on the DraIII protein, the melting temperatures (Tm) of purified DraIII WT, T181A, T181G, H189A and R187A proteins were measured by DSC. As shown in Table 3, T181A, T181G and H189A have Tm values of 56.7°C, 52.5°C and 56.2°C, respectively, compared to the WT value of 59.1°C. Mutation of T181 and H198 to Ala led to similar and lower Tm values supporting the notion that hydrogen bonds were disrupted by each of the mutations. The even lower Tm value of the mutant T181G is probably due to the thermodynamic penalty in the folding of the protein caused by the cavity created by the Gly mutation. The correlation between high cleavage fidelity and disruption of hydrogen bonds between the two domains leads us to propose that the high cleavage fidelity is the result of increased conformational flexibility of the mutant proteins. The Tm of another high fidelity mutant R187A (Tm = 60.7°C) is comparable to that of WT, suggesting that the mutation R187A improves cleavage fidelity through a different mechanism.
Table 3 The Tm values of DraIII variants by differential scanning calorimetry analysis