Abstract
Purine-containing nucleotide second messengers regulate diverse cellular activities. Cyclic di-pyrimidines mediate anti-phage functions in bacteria; however, the synthesis mechanism remains elusive. Here, we determine the high-resolution structures of cyclic di-pyrimidine-synthesizing cGAS/DncV-like nucleotidyltransferases (CD-NTases) in clade E (CdnE) in its apo, substrate-, and intermediate-bound states. A conserved (R/Q)xW motif controlling the pyrimidine specificity of donor nucleotide is identified. Mutation of Trp or Arg from the (R/Q)xW motif to Ala rewires its specificity to purine nucleotides, producing mixed purine-pyrimidine cyclic dinucleotides (CDNs). Preferential binding of uracil over cytosine bases explains the product specificity of cyclic di-pyrimidine-synthesizing CdnE to cyclic di-UMP (cUU). Based on the intermediate-bound structures, a synthetic pathway for cUU containing a unique 2’3’-phosphodiester linkage through intermediate pppU[3’−5’]pU is deduced. Our results provide a framework for pyrimidine selection and establish the importance of conserved residues at the C-terminal loop for the specificity determination of CD-NTases.
Similar content being viewed by others
Introduction
Cyclic dinucleotides (CDNs) are secondary messengers that regulate various cellular activities1. In bacteria, cyclic diguanylate (c-di-GMP, cGG) is synthesized by diguanylate cyclases with a GGDEF motif and binds to effectors, including PilZ-domain proteins and riboswitches that regulate important bacterial activities such as biofilm formation2. Cyclic diadenylate (c-di-AMP, cAA) is synthesized by proteins containing the diadenylate cyclase domain, and participates in cellular processes by binding to effector proteins and riboswitches3. In mammals, 2’,3’-cyclic GMP-AMP (2’,3’-cGAMP) is synthesized by cGAS, a cyclic GMP-AMP synthase activated by double-strand DNA4. The binding of 2’,3’-cGAMP to stimulator of interferon genes (STING) effector proteins can induce innate immune responses in mammalian cells5. STING proteins also bind to cGG, cAA and 3’,3’-cyclic GMP-AMP (cGA) and may have a bacterial origin6,7. The cGAS homolog VcDncV (dinucleotide cyclase in Vibrio cholerae) synthesizes 3’,3’-cGAMP, which binds to and activates VcCapV (CDN-activated phospholipase in V. cholerae), which degrades the bacterial cell membrane8,9. The DncV-CapV pair constitutes an example of a cyclic oligonucleotide-based anti-phage signaling system (CBASS), which protects the bacterial population from viral destruction through an abortive infection mechanism by inducing cell death10,11.
Recently, scores of cGAS/DncV-like nucleotidyltransferases (CD-NTases) have been widely identified in bacterial genomes. CD-NTases can be classified into eight clades from A to H based on sequence identity, which usually corresponds to their ecological niches12. CD-NTases produce a variety of CDNs and cyclic trinucleotides (CTNs), including 3’,3’-cyclic UMP-AMP (cUA) synthesized by RmCdnE from Rhodothermus marinus, 3’,3’-cyclic di-UMP (cUU) synthesized by LpCdnE from Legionella pneumophila, 3’,3’-cyclic UMP-GMP (cUG) synthesized by BdCdnG from Bradyrhizobium diazoefficiens, 3’,2’-cGAMP synthesized by AsCdnG from Asticcacaulis sp., 3’,3’,3’-cyclic AMP-AMP-AMP (cAAA) synthesized by EcCdnC from Escherichia coli and 3’,3’,3’-cyclic AMP-AMP-GMP (cAAG) synthesized by EcCdnD from Enterobacter cloacae11,12,13,14. The diversity of CDNs and CTNs is a consequence of the arms race between phage-bacteria and bacteria-host under evolutionary pressure, which could strengthen the anti-phage ability of bacteria15 or facilitate evasion from innate immune detection by mammalian host cells12.
CD-NTases share an N-terminal core domain with templated and non-templated polymerases of the nucleotidyltransferase (NTase) superfamily, but the C-terminal helical domain shows a higher degree of variation12,16. Three conserved acidic residues in the central β-sheet of the NTase domain coordinate with two metal ions (A and B) that are essential for catalysis16. The reaction proceeds by an attack of the acceptor nucleotide, whose O3’ (or O2’) atom is bound to metal A, at the α-phosphate of the donor nucleotide, whose triphosphate associates with metal B (Fig. 1A)17. The cage-like architecture of CD-NTase may allow retention and re-orientation of the intermediate molecule for the next step of catalysis13,16. Crystal structures of VcDncV showed that the reaction intermediate had the original acceptor nucleotide bound to the donor site, and vice versa, while the second reaction took place, turning out the CDN product (Fig. 1B)18. Instead of base pairing, substrate specificity is determined by interactions with amino acid side chains surrounding the enzyme’s active site16. Govande et al.19 proposed that two key residues, lid residue 1 (lid 1) above the acceptor-nucleotide binding site and lid residue 2 (lid 2) above the donor pocket, were highly correlated with the product specificity of CD-NTases (Fig. 1C). For example, a conserved asparagine residue (lid 1) of CD-NTase in clade E (CdnE) has been shown to form two hydrogen bonds with the uracil base of the acceptor UTP in the structure of RmCdnE12. Asparagine-to-serine reprogramming results in the pyrimidine-purine specificity alteration of RmCdnEN166S, which produces mainly cAA instead of cUA12. However, the currently available structures of CD-NTase show that lid 2 only makes non-specific interactions with the base or ribose of donor nucleotides, indicating that the proposed lid 2 is insufficient to explain the substrate specificity.
In contrast to the well-known biological roles of cyclic di-purines, including cGG, cGA, and cAA, the functional significance of cyclic di-pyrimidines has only recently begun to be appreciated. In this work, we aimed to determine the structural and functional character of cyclic di-pyrimidine-synthesizing CdnE, including LpCdnE, ClCdnE from Cecembia lonarensis and EfCdnE from Enterococcus faecalis, which are encoded in type I CBASS operons with a transmembrane (TM) effector, named Cap1520 (Fig. 2A). All three CdnEs contain Asn as lid 1 but ClCdnE and EfCdnE harbors His as lid 2, which differs from the Asn as lid 2 of LpCdnE19. Structural characterization of these three CdnEs confirms the uracil-recognition role of the lid1 (Asn) for acceptor nucleotide selection. Moreover, a conserved (R/Q)xW motif at the C-terminal loop of CdnEs is proposed as lid residue 3 (lid 3), dictating donor nucleotide specificity to pyrimidine. According to the substrate-bound and intermediate-bound CdnE structures, a synthetic pathway for cyclic di-UMP is deduced. Finally, the rules controlling donor nucleotide specificity of CD-NTases are further demonstrated. This study aids in our understanding of the biochemical mechanisms underlying cyclic di-pyrimidine biosynthesis and provides rules for substrate specificity determination of CD-NTases.
Results
EfCdnE synthesizes products cUU and cUC
To understand the molecular basis for synthesizing cyclic di-pyrimidines, the substrate specificity of EfCdnE was first accessed by ITC. The binding of AMPcPP and GMPcPP to EfCdnE was too weak to be detected (Supplementary Fig. 1A, B), indicating that EfCdnE prefers pyrimidine nucleotides over purine nucleotides. The binding profile of UMPnPP to EfCdnE indicates that one EfCdnE protein molecule binds to two UMPnPP molecules, one in the donor pocket and the other in the acceptor pocket, with two different dissociation constants (KD) of 1.47 × 10−8 M and 9.8 × 10−7 M, respectively (Supplementary Fig. 1C). To investigate whether EfCdnE could distinguish UTP from CTP, titration with CMPcPP was conducted. One EfCdnE protein molecule also binds to two CMPcPP molecules, and the KD values of 1.87 × 10−8 M and 1.27 × 10−6 M, respectively, are slightly larger than those of UMPnPP (Supplementary Fig. 1D). CMPcPP did not bind to EfCdnE, which was pre-incubated with UMPnPP (Supplementary Fig. 1E), suggesting that the binding of UMPnPP to EfCdnE could exclude the binding of CMPcPP.
To understand the products synthesized by EfCdnE, liquid chromatography with tandem mass spectroscopy (LC-MS/MS) experiments were performed. After overnight incubation of purified EfCdnE with UTP and Mg2+, the elution peaks corresponding to product cUU and intermediate pppUpU were detected (LC-MS dataset, Fig. 2B and Supplementary Fig. 2). Different fragments derived from pppUpU, corresponding to UMP, UDP, UTP, pppUp, pUpU, and ppUpU, were also detected (LC-MS dataset, Fig. 2C). By including CTP in the reaction mixture with EfCdnE after overnight incubation, only the pppCpC reaction intermediate was detected (LC-MS dataset, Fig. 2D). Using CTP and UTP as substrates, EfCdnE generated intermediates with different base compositions, pppCpC, pppUpU, and pppUpC/pppCpU and products cUU and cUC (LC-MS dataset, Fig. 2E–G). Because the molecular masses of pppUpC, pppCpU, and the fragments derived from them are identical (LC-MS dataset, Supplementary Fig. 3), we could not determine which intermediate was generated by EfCdnE. Altogether, the above results indicate that EfCdnE catalyzes the synthesis of cUU products through pppUpU intermediate and cUC products through pppUpC or pppCpU as an intermediate.
Overall structure of cyclic di-pyrimidine-synthesizing CdnE
The native structure of ClCdnE was refined to 2.2-Å resolution (Supplementary Table 1). Like other CD-NTases, the ClCdnE protein folds into two distinct domains (Fig. 3A). The NTase domain contains a central β-sheet composed of five strands: βA–βE. The longest strand, βE in the middle, extends into a flanking β-sheet with strand βB’ on one side and βF and βG on the other. There are two helices, α2, and α3, in this domain. The helical domain consisted of six major helices, including the first helix, α1. CD-NTases are classified as type-I and type-II according to their requirement for an activation mechanism16. In type-I enzymes such as cGAS, the NTase and helical domains are connected by the “spine” helix, which is broken into two upon activation4. The separation of helices α1 and α2, which correspond to the “spine” in cGAS by a short loop in ClCdnE, may account for its constitutive NTase activity as a type II enzyme.
The native structure of EfCdnE was determined to be a 1.75-Å resolution (Supplementary Table 2). When compared with ClCdnE, helix α2 of EfCdnE was shorter by one turn at the N-terminus (Supplementary Fig. 4). The overall root-mean-square deviation (RMSD) is 1.38 Å between 280 matched pairs of Cα atoms. Other significant deviations were observed in the βB’-α3 loop, βD-βE loop, βE-βF loop, and α7-α8 loop. The native structure of LpCdnE was refined to 2.46-Å resolution (Supplementary Table 3). It differs from ClCdnE in similar regions, with 1.32 Å RMSD for 276 Cα pairs, but the α1-α2 loop is missing (Supplementary Fig. 4). Generally, regions with larger deviations were correlated with higher temperature factors. Despite the differences in local regions, the dispositions of the NTase and helical domains were similar in the native structures of ClCdnE, EfCdnE, and LpCdnE, respectively.
Despite larger than 25% sequence identity (Fig. 3B, C), the structures of EmCdnE and RmCdnE showed remarkable differences compared with ClCdnE (Supplementary Fig. 5). Strand βB’ and the entire connection to helix α3 are missing in EmCdnE, whereas the βB’-α3 loop in RmCdnE includes a short two-turn helix. The βE–βF loop conformations also varied greatly (Supplementary Fig. 5). The RMSD of the ClCdnE, EfCdnE, and LpCdnE structures from EmCdnE and RmCdnE range between 1.50 Å and 1.83 Å for 244 to 271 Cα pairs (Supplementary Table 4).
UTP substrate binding modes of CdnE
By including substrate UTP or UMPnPP and cofactor MgCl2 in the crystallization solutions, we obtained the complex crystals of ClCdnE-UTP, LpCdnE-UTP, and LpCdnE-UMPnPP and refined them to 2.6-Å, 1.95-Å and 2.2-Å resolution, respectively (Supplementary Fig. 6 and Supplementary Tables 1 and 3). The RMSD between the 297 pairs of protein Cα atoms in the native and complex structures of ClCdnE was 0.325 Å, and that between the 306 Cα pairs in the native and complex EfCdnE crystals was 0.253 Å (Supplementary Figure 7). The four protein models, two from LpCdnE-UTP and two from LpCdnE-UMPnPP complex show RMSD of 0.56 Å to 0.82 Å from the native LpCdnE for 281 to 286 matched Cα pairs (Supplementary Table 5). The largest deviation is seen in the βB’-α3 loop, probably because of its flexible nature and outward location (Supplementary Fig. 7). The little structural differences from native proteins suggest that substrate binding does not induce large domain movement in all three CD-NTases studied here.
In the ClCdnE-UTP complex structure, the triphosphate part of the donor UTP is bound to the enzyme via a Mg ion, termed Mg-B, as observed for ATP in other CD-NTases13. The Mg-B ion, with an octahedral configuration, was coordinated to all three phosphate groups of the UTP, two water molecules, and the side-chain of Asp67 (Fig. 4A). The β-phosphate group is hydrogen-bonded to the backbone of Ser53 and the side-chain of Arg197. The side-chain of Arg61 lies adjacent to γ-phosphate, but no direct bonding is observed. The ribose moiety of the donor UTP is not involved in direct bonding to the protein. However, its pyrimidine base forms stacking interactions with the side-chain of Phe217 on one side and the base of the acceptor UTP on the other. A hydrogen bond between the α-phosphate of the donor UTP and ribose O3’ of the acceptor was also present. The triphosphate moiety of the acceptor UTP forms two salt bridges on the side-chains of Lys123 and Lys276, respectively (Fig. 4B). The weaker interactions may correlate with its higher flexibility, as implied by the presence of electron density for the alternative disposition of the γ-phosphate group. In contrast, the base of the acceptor UTP is engaged in three well-defined hydrogen bonds with the protein side chains: one with Gln51 and two with Asn169. The proximity of the Phe159 side-chain suggested a T-stacking interaction with the acceptor base. In addition to the absence of an acceptor-bound Mg ion, the arrangement of UTP molecules is consistent with the canonical binding mode for the NTase superfamily17.
In the LpCdnE-UTP complex crystal, similar binding modes of the donor and acceptor nucleotides to the active site were observed for both protein molecules. In addition to the Mg-B ion, the α-phosphate of the donor UTP also forms a coordinate bond with a Mg ion, Mg-A, which further shares the Asp67 side-chain and a water molecule with Mg-B in octahedral-complex formation. The other ligands are the Asp139 side-chain, the ribose O3’ of the acceptor nucleotide, and a water molecule. The γ-phosphate group of the donor formed four hydrogen bonds with Ser53, Asn59, Lys201, and Ser220 of the protein in LpCdnE (Fig. 4C, E), showing remarkably stronger interactions than those in ClCdnE and EfCdnE. However, the stacking between the Tyr221 side-chain and the donor base may be weaker. The interaction was also weaker between the non-aromatic Ile163 side-chain and the acceptor base, which forms similar hydrogen bonds to the side-chains of Gln51 and Asn173, as seen in ClCdnE and EfCdnE (Fig. 4D, F). Besides the bond of O3’ to Mg-A, the acceptor ribose has O2’ hydrogen bonded to the Asp67 side-chain. LpCdnE binds more strongly to the acceptor triphosphate than ClCdnE. Although there is some disorder in the β- and γ-phosphate groups, the triphosphate is with hydrogen bond to the backbone N of Asp124, Lys125, and Ala126 in the βC–βD loop, and salt bridged to the Lys125 side-chain. In one of the conformations, β-phosphate forms a hydrogen bond with ribose O3’ (Fig. 4F).
In the LpCdnE-UMPnPP complex crystal, the donor nucleotide makes nearly identical interactions with the enzyme as in the UTP complex (Fig. 4G, I), except for an additional bond between a β-phosphate and the Ser64 side-chain (Fig. 4G). The base and ribose parts of the acceptor UMPnPP were also engaged in very similar interactions with the protein as those of the UTP complex (Fig. 4H, J). However, its triphosphate-analog part does not interact extensively with the backbone atoms in the βC-βD loop. Only a salt bridge was observed between α-phosphate and Lys125 side-chains. One of the LpCdnE-UMPnPP complexes contained a third Mg ion, Mg-C (Fig. 4G). It is coordinated by the side chains of Asp65, Asp128, and Asp139, two water molecules, and the γ-phosphate of the acceptor UMPnPP, which forms a salt bridge with the Arg29 side-chain (Fig. 4H). Although Asp65 and Asp139 are conserved, Arg29 and Asp128 are not. Mg-C appears accidental but may also be related to the catalytic pathway. After the first step of the reaction, the resulting pyrophosphate dissociates from the donor nucleotide as a Mg-complex, and the acceptor triphosphate moiety, now part of the intermediate, may occupy the donor-binding site more readily along with the bound Mg-C than without a pre-bound Mg ion. In DNA/RNA polymerases, the 3′ end of the primer substrate is tightly bound to the associated metal A, and the metal B containing the donor nucleotide shows weaker binding21. In contrast, CD-NTases appear to bind more strongly to metal B. In the complex structures of ClCdnE and EfCdnE described above, as well as in the three previously solved complex structures of EmCdnE (PDB 6E0N and 6E0O, respectively)12 and RmCdnE (PDB 6E0L)12, only Mg-B, but not Mg-A, was observed. However, both Mg-A and Mg-B were observed in the complex structure of LpCdnE. Analyses of the Mg-binding modes show that the distances from the respective ligand atoms are 2.2-2.5 Å for Mg-A, 2.0-2.2 Å for Mg-B, and 2.1-2.3 Å for Mg-C (Supplementary Table 6). The longer distances to Mg-A are consistent with its weaker binding, but this might indicate the use of other metal ions, such as manganese.
Conserved motif dictates donor pyrimidine selection of CdnE
The substrate-bound ClCdnE and LpCdnE presented here showed that recognition of acceptor UTP or UMPnPP is dictated by conserved Asn169 and Asn173 (lid 1), respectively, consistent with a previous report12. In contrast, the side-chain of Asn178 (lid 2) of LpCdnE forms a hydrogen bond only with the ribose 2’-OH of the donor UTP/UMPnPP, and His174 (lid 2) of ClCdnE makes no contact with donor UTP (Supplementary Figure 8), suggesting that lid 2 may plays a role in nucleotide specificity during catalysis or through some other mechanism. Instead, Trp277 and Trp276 form extensive hydrophobic contacts with the base moiety of donor UTP/UMPnPP in the substrate-bound ClCdnE and LpCdnE, respectively (Fig. 5A–C). Moreover, superimposition of the ClCdnE complex structure with those of RmCdnE and EmCdnE revealed that the bound AMPcPP and GTP could not be well accommodated in the donor site of ClCdnE, but would cause significant clashes with the side-chains of Glu263 and Arg275, which come together to form a salt bridge (Fig. 5E, F). Similarly, the side chains of Glu267 and Arg279 in EfCdnE are engaged in salt-bridging interactions and would have substantial clashes with the donor base if it were a purine nucleotide. This observation explains why we could not detect the binding of ATP and GTP analogs to EfCdnE in the ITC experiments. In LpCdnE, the Glu262 and Gln274 side chains also come together to form a hydrogen bond and restrict the donor-binding site for pyrimidine nucleotides. The corresponding Cys252 and Glu265 in RmCdnE or Glu259 and Ser271 in EmCdnE were well separated (Fig. 5G, H), and thus a purine base can be accommodated. Sequence analysis of cyclic di-pyrimidine-synthesizing CdnE proteins further revealed the conservation of Arg/Gln and Trp residues, together named the (R/Q)xW motif (Fig. 5D), which restricts the donor pocket and forms direct hydrophobic contacts with the uracil base of donor nucleotides, as mentioned above.
To validate the importance of this motif, we changed Arg279 and Trp281 of EfCdnE to a smaller alanine residue and assessed the substrate specificity and product synthesis of both wild-type and mutant EfCdnE by ITC and LC-MS/MS. The Arg-to-Ala reprogramming of EfCdnE restored its binding to both ATP and GTP with affinities near micromolar dissociation constants (Fig. 6A, B and Supplementary Figure 9A), in contrast to the lack of binding of purine nucleotides by wild-type EfCdnE (Supplementary Fig. 1A, B). Similarly, EfCdnEW281A binds AMPcPP with a KD value of 3.5 × 10−5 M and GMPcPP with a KD value of 6.5 × 10−5 M (Fig. 6C and Supplementary Fig. 9B), indicating that mutating either Arg or Trp of the (R/Q)xW motif is sufficient to rewire the specificity of the donor pocket of EfCdnE from pyrimidine to purine. LC-MS/MS analysis further demonstrated that incubation of EfCdnER279A or EfCdnEW281A with UTP/ATP and UTP/GTP mixtures yielded products corresponding to cUA and cUG, respectively (LC-MS dataset, Fig. 6D, E and Supplementary Fig. 9C–F). In conclusion, changing one of the two residues in the (R/Q)xW motif switches the product synthesis of EfCdnE from cyclic di-pyrimidine to purine-pyrimidine mixed species of CDNs, validating the critical role of the (R/Q)xW motif in pyrimidine selection. Based on sequence conservation and structural similarity, it is proposed that the cyclic di-pyrimidine-synthesizing CdnE proteins, including ClCdnE, EfCdnE, and LpCdnE, form smaller donor pockets that act as molecular gauges to recognize and retain the binding of pyrimidine nucleotides but weaken or exclude the binding to purine nucleotides (Fig. 6F). In combination with the conserved Asn residue in the acceptor pocket, the substrate specificity for double pyrimidine was achieved (Fig. 6F).
EfCdnE synthesizes 2'3’-cUU as major product
We showed that EfCdnE could use UTP as a substrate to produce cUU, using pppUpU as an intermediate (Fig. 2A). The EfCdnE-pppU[3’−5’]pU complex structure presented here shows that the triphosphate moiety of the reaction intermediate is coordinated to the Mg-B ion like that of the donor nucleotide in ClCdnE. However, there was only one hydrogen bond between the β-phosphate and the Ser53 side-chain (Fig. 7A). The side-chain of Lys202 is more than 4.5 Å from the triphosphate, while Arg61 is farther away. The uracil base in the donor binding site was flipped over, and the ribose was turned around compared with the donor structure in the ClCdnE-UTP complex (Fig. 7B). Neither part directly interacts with the protein, except for stacking the base with the Phe222 side-chain. The acceptor-base forms hydrogen bonds with the Gln51 and Asn174 side-chains in a similar manner as their equivalents in ClCdnE and is packed against the aromatic side-chain of Tyr164 (Fig. 7C). The acceptor ribose forms a hydrogen bond with the Asp67 side-chain via its 2’-OH group and an O3’ hydrogen bond to the donor α-phosphate. The phosphate group linking the donor and acceptor did not interact directly with the protein, despite the presence of a nearby Lys123 side-chain.
Including UTP and MgCl2 yields another complex crystal of EfCdnE, in which an intermediate pppU[2’−5’]p binds to the ligand-binding pocket. The complex structure was refined to a 2.0-Å resolution (Supplementary Table 2). The overall structure of EfCdnE-pppU[2’−5’]p is nearly identical to EfCdnE-pppU[3’−5’]pU, with an RMSD of 0.159 between 280 matched Cα pairs. The intermediate pppU[2’−5’]p, devoid of the acceptor base, binds to the donor pocket with an additional phosphate linked to the ribose 2’-O. The triphosphate part of the intermediate pppU[2’−5’]p is similarly coordinated to the Mg-B ion as that of pppU[3’−5’]pU (Fig. 7A, D). The β-phosphate forms two hydrogen bonds with the main- chain and side-chain of Ser53 (Fig. 7D). The side-chain of Ser221 is near γ-phosphate, forming a weak hydrogen bond (3.8 Å) (Fig. 7D). The side-chain of Phe222 forms a stacking interaction with the donor uracil base, similar to ClCdnE and LpCdnE (Figs. 4 and 7D). The ribose 2’-O-linked phosphate forms two hydrogen bonds with the side-chain of Lys123 (Fig. 7D). The side-chain of conserved R279 directly hydrogen bonded to the 2’-O of the uracil base, supporting its importance in pyrimidine selection in the donor pocket (Fig. 7D). LC-MS/MS detected the formation of pppUpU as an intermediate by EfCdnE upon UTP was used as a substrate (Fig. 2A) but could not distinguish the formation of 2’−5’ or 3’−5’ phosphodiester linkages. Here, we identified two kinds of intermediates, pppU[3’−5’]pU and pppU[2’−5’]p, in the crystal structures of EfCdnE by incubating the proteins with UTP and MgCl2, suggesting that the first step of the nucleotidyl-transfer reaction proceeds by attacking α-phosphate by either 2’-OH or 3’-OH of the acceptor UTP. However, superimposition of EfCdnE-pppU[3’−5’]pU with EfCdnE-pppU[2’−5’]p demonstrated that the terminal uracil base of pppU[2’−5’]pU could not fit into the acceptor pocket. It was probably too flexible to be modeled in the structure, suggesting that the re-orientation and re-binding of pppU[2’−5’]pU into the active site is difficult and energetically unfavored (Fig. 7E). Furthermore, our determined substrate-bound ClCdnE and LpCdnE showed that the 3’-OH is much closer to the α-phosphate than the 2’-OH of the acceptor UTP/UMPnPP (Supplementary Fig. 10A–C), suggesting the formation of pppU[3’−5’]pU as an intermediate. Moreover, in EfCdnE-pppU[3’−5’]pU complex structure, the 2’-OH seems to be positioned ready for attacking (3.5 Å) instead of 3’-OH (5.1 Å, Supplementary Figure 10D), which will lead to the production of 2'3’-cUU. Based on the above structural observations, we proposed a 2'3’-cUU/3'3’-cUU synthetic pathway involving EfCdnE (Fig. 7F). The first nucleotidyl-transfer reaction is catalyzed by the attack of the α-phosphate of the donor nucleotide by the acceptor ribose 3’-OH (Fig. 7F). Our determined EfCdnE complex with two different intermediates supports the synthesis of pppU[3’−5’]pU rather than pppU[2’−5’]pU; the latter could not fit well into the active site of EfCdnE for the subsequent reaction. Cyclization of the intermediate seems to favor the formation of a 2’−5’ linkage over the 3’−5’ linkage, judging by the EfCdnE-pppU[3’−5’]pU complex structure. Therefore, 2'3’-cUU instead of 3'3’-cUU is assumed to be the major product synthesized by EfCdnE (Fig. 7F).
Discussion
Altogether, the structural and functional studies of EfCdnE presented here prove that it is an authentic cyclic di-pyrimidine-synthesizing CD-NTase capable of simultaneously binding to two molecules of pyrimidine nucleoside triphosphate and catalyzing their cyclization to produce cyclic di-pyrimidine nucleotides. Based on its extensive structural similarity, ClCdnE probably functions similarly. Previous studies have demonstrated that the conserved asparagine residue (e.g., Asn166 in RmCdnE) in the active-site lid is important for uracil base recognition of the acceptor nucleotide-binding pocket of CD-NTases12,19. Here, the structures of three cyclic di-pyrimidine-synthesizing CD-NTases, ClCdnE, EfCdnE, and LpCdnE, confirmed this key finding. In structural terms, by flipping the side-chain amide group of this conserved asparagine residue, a cytosine base can also fit into the pocket with proper hydrogen bond formation (Supplementary Figure 11), explaining why both UTP and CTP can bind to these enzymes.
The similar binding affinity of UMPnPP and CMPcPP to EfCdnE suggests that cyclic di-pyrimidine- synthesizing CdnE distinguish the smaller six-membered pyrimidine bases from the larger nine-membered purine bases instead of recognizing the subtle structural differences in the nucleobases. If any of the uracil bases in the complex crystal structures of ClCdnE, EfCdnE, and LpCdnE is replaced by cytosine, the base can still fit well in the active site. With a flipping of the latter amide group, the side-chains of Gln51 and Asn174 in EfCdnE can form proper hydrogen bonds with the O2, N3, and N4 atoms of the cytosine base, and so can the equivalents in ClCdnE and LpCdnE. Although both acceptor and donor sites can bind to CTP, the preferred substrate is UTP. The cyclization reaction of CTP could not be completed, turning out only the intermediate pppCpC but no cCC product (Fig. 2C). The reason might lie in the binding mode of the donor moiety upon pppCpC rearrangement in the active site. To assume the pppU[3’−5’]pU-like binding mode, the donor base should be flipped over, and the ribose rotated to the other side. With uracil as the base, the pyrimidine ring remains virtually identical (Fig. 7B). However, similar flipping over of the cytosine base would result in the positional exchange of the 2-carbonyl and 4-amino groups, possibly causing some less favored interactions with the enzyme. Subtle changes in the substrate and intermediate binding modes may lead to significant differences in the catalyzed reactions.
Up to date, most of the identified bacterial CDNs contain 3'3’-phosphodiester linkage. One of the exceptions is 3’,2’-cGAMP synthesized by CD-NTase in clade G from Asticcacaulis sp.14, which could activate the downstream Cap5 effector for its specific defense against phage infections. Another case is the 2'3'3’-cyclic AMP-AMP-AMP produced by CD-NTase in clade D from Acinetobacter baumannii, which activates Cap4 effectors for double-stranded DNA degradation, resulting in cell deaths of infected bacteria22. The unique 3'2’ or 2'3’-phosphodiester linkage of bacterial second messenger may provide additional protection against phosphodiesterase degradation from invading phages15. In this study, we provide structural insights into the mechanism underlying synthesizing 2'3’-cUU by EfCdnE, which could be beneficial to the design and engineering of cyclic di-pyrimidine-synthesizing enzymes. However, whether specific isomer of cUU activates the downstream Cap15 effector remains unclear and need to be characterize in the future study.
Inspired by the key finding that the conserved residues at the C-terminal loop above the donor pocket, named lid 3, dictate pyrimidine nucleotide selection, we identified conserved lid 3 at the equivalent position of CD-NTases in clades A, B, C, E, and G, which directly controls donor nucleotide specificity (Fig. 8A). VcDncV belongs to CD-NTase in clade A01 (CdnA01), which produces 3'3’-cGA as the major product and 3'3’-cGG and 3'3’-cAA as the minor products23. Based on current available structures, lid 2 of VcDncV (Val264) was found to make van der Waals interaction with the donor nucleotide but does not form base-specific interactions (Fig. 8B). Instead, Asp348 of VcDncV forms bidentate hydrogen bonds with the guanine base (Fig. 8B), which is the major determinant for the high-affinity binding of the GTP substrate18. Sequence analysis showed that lid 3 (Asp348) of VcDncV was highly conserved in CdnA01 (Fig. 8B). Similarly, BfCdnB from Bacteroides fragilis also contains a highly conserved Asp238 as lid 3 of CD-NTase in clade B04 (CdnB04), which prefers GTP over ATP and synthesizes mainly 3'3’-cGA, similar to VcDncV (Fig. 8C). Furthermore, two highly conserved Asn residues among CD-NTases in clade C01 (CdnC01) were identified as lid 3 (Fig. 8D). One of the conserved Asn residues (Asn270) forms a hydrogen bond with the N1 atom of the adenine base of donor ATP in E. coli CdnC (Fig. 8D). Finally, the conserved (D/E)xxxD motif of CD-NTases in clade G10 (CdnG10) was identified as lid 3 (Fig. 8E). The structure of BdCdnG complexed with GTP (PDB 7LJN) demonstrated that the former Glu residue (Glu305) hydrogen bonded to the 2-NH2 group of the guanine base of donor GTP, while the latter Asp residue coordinated with the guanine N1 atom of donor GTP (Fig. 8E). In summary, the identified conserved lid 3 directly interacts with the donor nucleotide and controls its specificity, unlike the previously proposed lid 2. Our results could help facilitate future studies on CD-NTases and are described below. First, the Asp residue prefers the binding of guanine bases owing to the formation of bidentate hydrogen bonds rather than one hydrogen bond with adenine bases. Second, the conserved Asn residue does not distinguish ATP from GTP; however, the cellular concentration of ATP is much higher than that of GTP, leading to the selective advantage of ATP by Asn residues. Third, combining two adjacent Asp/Glu residues enhances GTP binding.
Little is known about pyrimidine-containing signaling molecules’ synthesis and recognition mechanisms in biological systems. For example, cyclic UMP (cUMP) and cyclic CMP (cCMP) are implicated in the regulation of apoptosis, necrosis, immune modulation, and embryonic development in mammalian cells24 and the induction of the reactive oxygen species (ROS) response in plants25. However, the exact synthases and receptors of cUMP and cCMP in eukaryotes remain elusive. cUMP and cCMP are found to mediate the anti-phage defense function in bacteria through Pycsar (pyrimidine cyclase system for antiphage resistance)26. Despite the absence of the receptor-cUMP/cCMP complex structure, the apo structure of cUMP synthase BcPycC from Burkholderia cepacia showed that BcPycC has a bulky side-chain of Tyr and Arg residues extending into the substrate-binding pocket, which sterically occludes the binding of larger purine nucleotides26. Thus, cyclic mono-pyrimidine and cyclic di-pyrimidine synthases in bacteria utilize similar molecular mechanisms for pyrimidine nucleotide selection.
Methods
Protein expression and purification
Full-length gene fragments encoding ClCdnE from Cecembia lonarensis (GenBank ID: EKB47661.1), EfCdnE from Enterococcus faecalis (GenBank ID: EOK48090.1) and LpCdnE from Legionella pneumophila (NCBI accession: WP_042646516.1) with Escherichia coli codon optimization were chemically synthesized and were each sub-cloned into pET21 vector by Blossom Biotechnologies, Inc. (Supplementary Table 7) to generate C-terminal His6-tagged proteins. The constructed plasmids were transformed separately into E. coli BL21 (DE3), which was grown overnight in Luria Bertani (LB) broth at 37 °C. Overexpression of the target protein was induced by 0.5 mM IPTG when the OD600 reached 0.6-0.8 and the bacteria were further incubated for 20 hours at 16 °C. The cells were harvested by centrifugation at 4 °C and 4500 x g for 30 min. The pellets were re-suspended in a lysis buffer containing 50 mM Tris-HCl pH 8.0, 500 mM NaCl, 10% glycerol, 1 mM tris(2-carboxyethyl)phosphine (TCEP), 1 mM phenylmethylsulfonyl fluoride (PMSF), and 5 mM imidazole. The cells were then lysed by sonication on ice. After centrifugation at 28000 x g, 4 °C, for 30 min, the clarified supernatant was loaded onto a 5 ml HisTrap FF column (Cytiva), followed by wash and elution with the lysis buffer that contained 10-200 mM imidazole gradient using ÄKTAprime plus (Cytiva). Based on subsequent analyses by 12% SDS-PAGE, fractions containing the target protein were pooled and further purified by gel-filtration using HiLoad 16/600 Superdex 200 pg column (Cytiva) equilibrated with a buffer T of 50 mM Tris-HCl pH 8.0, 200 mM NaCl, 5% glycerol and 1 mM TCEP.
Crystallization and X-ray data collection
The sitting-drop vapor-diffusion method was employed in crystallizing the proteins and their nucleotide complexes using commercial kits from Hampton Research, Molecular Dimensions, and Emerald Biostructures. Each drop was set up by mixing 1 μl protein sample at a concentration of 12 mg/ml with 1 μl reservoir solution and equilibrated against 200 μl of the reservoir at 4 °C or 20 °C. The crystals of ligand-free form of ClCdnE were grown by using a reservoir solution of 0.1 M MgCl2, 0.1 M 2’,2’,2’-nitrilotriacetic acid (ADA) pH 6.5, 12% w/v polyethylene glycol (PEG) 6000. Complex crystals of ClCdnE with UTP were obtained by including 5 mM UTP and 10 mM MgCl2 in the protein solution. Ligand-free EfCdnE was crystallized by using a reservoir solution containing 0.2 M sodium tartrate pH 7.3, 20% w/v PEG 3350. Both the complex crystals of EfCdnE with the intermediate pppU[3’−5’]pU and pppU[2’−5’]p, were obtained by incubating the protein with 5 mM UTP and 10 mM MgCl2 prior to setting up the drops.
Because LpCdnE was prone to precipitate, crystals were not obtained until the concentration of protein sample was reduced to 2.4 mg/ml. Substrate-free LpCdnE was crystallized by using a reservoir solution of 0.2 M MgCl2, 0.1 M Tris pH 8.0, 5% w/v PEG 8000. The protein was co-crystallized with UTP by including 5 mM UTP and 10 mM MgCl2, using a reservoir solution of 0.2 M ammonium acetate, 0.1 M sodium citrate pH 6.0, 9% w/v PEG 4000. The complex crystals of LpCdnE with the non-hydrolysable UTP analogue UMPnPP, in which the O atom connecting the α- and β-phosphate is replaced by an imido (NH) group, were obtained by including 5 mM UMPnPP and 10 mM MgCl2 in the protein solution and using 0.2 M lithium sulfate, 0.1 M Tris pH 8.5, 30% w/v PEG 3350 as the reservoir.
Before flash vitrification in liquid nitrogen, each protein crystal was washed briefly in the reservoir solution that contained 15–25% glycerol as a cryoprotectant. All of the X-ray diffraction data in this study were collected at the National Synchrotron Radiation Research Center (NSRRC), Taiwan, and processed by using HKL2000_v722. The cubic ClCdnE crystals diffracted X-rays to 2.2-2.6 Å resolution. They have space group I23 or I213 and a single protein molecule in an asymmetric unit. The monoclinic crystals of EfCdnE diffracted X-rays to 1.6–1.75 Å. They have space group P21 and an asymmetric unit also contains one protein molecule. The native LpCdnE crystal diffracted X-rays to 2.5 Å, with trigonal space group P31 or P32 and one protein molecule in an asymmetric unit. The UTP and UMPnPP complex crystals of LpCdnE with space groups P21 and P21212 diffracted X-rays to 1.95 and 2.2 Å resolution. Both have two protein molecules in an asymmetric unit. Some data-collection statistics from the ClCdnE, EfCdnE and LpCdnE crystals can be found in Supplementary Table 1–3.
Structure determination and refinement
The protein sequence of ClCdnE shows 36% identity to EmCdnE. Consequently, the EmCdnE model from PDB 6E0M was employed in solving the native ClCdnE structure by molecular replacement (MR), using the program CNS27. The space group was confirmed as I23. Subsequent auto-building by PHENIX 1.1928 and iterative manual rebuilding and adjustment of the model using Coot 0.9.629 yielded a refined model of native ClCdnE, containing a continuous polypeptide chain from residues 2 to 298, the last three (L-E-H) corresponding to the His-tag, and 295 water molecules. The R and Rfree values were further improved by using Translation-Libration-Screw (TLS) tensors and the final values were 0.157 and 0.215, respectively, at a 2.2-Å resolution. Almost all (99.3%) residues were in the favored regions of the Ramachandran plot, and there were no outliers (Supplementary Table 1). The ClCdnE-UTP complex crystal is isomorphous to the native. The Fo-Fc map showed two bound nucleotides in the active site, which were included in the model along with an Mg ion. The model was subsequently adjusted and refined by using COOT and PHENIX. Some statistics of the refined ClCdnE structures can be found in Supplementary Table 1.
The native crystal structure of EfCdnE was solved by MR, using CNS, in which the refined ClCdnE structure served as the search model. The three-dimensional structure was iteratively refined and rebuilt using PHENIX and Coot and the TLS tensors were also applied. The refined model gave R and Rfree values of 0.168 and 0.198, respectively (Supplementary Table 2). The entire protein chain, including six histidine residues at the C-terminus, was visible. However, the connecting region between helices α1 and α2 has weak electron density. The EfCdnE-pppU[3’−5’]pU and EfCdnE-pppU[2’−5’]p complex crystals are also isomorphous to the native, and the bound intermediates, visible in the initial Fo-Fc and 2Fo-Fc maps, were placed into the active site. Some statistics of the EfCdnE models after further refinement by using Coot and PHENIX can be found in Supplementary Table 2.
The native crystal structure of LpCdnE was solved also by MR, but using MOLREP 11.7.0330 and a search model from SWISS-MODEL31 with ClCdnE as the template. The space group was found to be P31, and the adjustment and refinement of the model were performed using Coot and REFMAC version 5.8.026732. The final R and Rfree values were 0.191 and 0.223, respectively (Supplementary Table 3). The two complex structures of LpCdnE were also solved by using MOLREP, with the native structure as the search model. Two protein molecules were successfully located in each unit cell. The bound ligands were placed into each active site and adjusted manually by using Coot. The complex structures were then refined by REFMAC, in which non-crystallographic symmetry (NCS) restraints and TLS tensors were used. Further refinements were carried out by using Coot and PHENIX. Some statistics of the refined LpCdnE models can be found in Supplementary Table 3. All the figures containing three-dimensional protein structures were depicted using PyMOL 2.3.533.
Affinity measurement
EfCdnE was chosen as the sample to evaluate the affinity of this class of enzyme to different nucleotides by isothermal titration calorimetry (ITC), using nano-ITC (TA Instruments). EfCdnE was dialyzed overnight against an assay buffer containing 20 mM Tris pH 8.0, 150 mM NaCl. The nucleotides to be tested (UTP, CTP, ATP and GTP, and their analogs) were each dissolved in the same buffer. Titration was carried out by 20 sequential injections of the nucleotide at a concentration of 250 μM at 3-min intervals to 50 μM EfCdnE, with a stirring speed of 300 rpm. The heat of dilution resulting from injecting the nucleotide into the assay buffer was subtracted. The final data were analyzed and curve-fitted with sequential two-site mode by using NanoAnalyze v3.12.5 (TA Instruments).
LC-MS/MS analysis
To analyze the intermediate and cyclic dinucleotide products synthesized by CdnE, reaction mixtures containing 1 mM Mg2+, 0.5 mM indicated nucleotide substrates and 1 μM wild-type or mutant CdnE were prepared and overnight incubated. Reactions were heat inactivated at 90 °C for 10 mins, followed by centrifugation at 13000 x g, 4 °C for 30 min. After removal of the pellet, 5 μl of the sample solution was directly injected into the LC-MS system for analysis.
The LC-MS/MS experiments were performed as previously described34 with some modifications. A UHPLC system (Ultimate 3000; Dionex, Germany) equipped with a XBridge BEH amide column (2.5um 2.1 × 150 mm; Waters) was coupled to a hybrid Q-TOF mass spectrometer (maXis impact, Bruker Daltonics, Bremen, Germany) with an orthogonal electrospray ionization (ESI) source. LC-MS data acquisition system was controlled by HyStar software (version 3.2, Bruker Daltonics). Mobile phase A was H2O that contained 10 mM ammonium bicarbonate. Mobile phase B was pure acetonitrile. The flow rate was 0.25 mL/min and an elution gradient was applied, starting with 90% B for 0.5 min, then down to 30% B in the next 6.5 min, followed by holding at 30% B for 1 min. It was then returned to the starting conditions by re-equilibration of the column for 3.2 min with 90% B prior to the next injection. The total run time was 11.2 min. The ESI source was operated in the negative mode (−3000 V). The endplate offset was 500 V. The nebulizer gas flow was 1 bar and drying gas flow was 8 L/min. The drying temperature was set at 200 °C. Funnel 1 radiofrequency (RF) and Funnel 2 RF were both 400 Vpp. The hexapole RF was 200 Vpp and the low mass cutoff of quadrupole was 100 m/z. To acquire MS/MS spectra of target ions, parallel reaction monitoring (PRM) was applied to acquired fragmented ions with the detection range of m/z 50 ~ 1000 at 2 Hz. The PRM collision energy was ramped from 32-48 eV for cUU, cUC, cCC, pppUpU, pppCpC and pppCpU/pppUpC. The precursor ion/fragment ion (observed retention time) of 611.04/305.01, 609.07/304.03 and 610.06/305.01 were extracted for the LC-MS peaks of cUU (RT: 6.4 min), cCC (RT: 6.1 min) and cUC (RT: 6.2 min), respectively. The isolation window for targets was 5 Da. The compound identification by the LC-MS/MS approach may be affected by the MS/MS spectra consisting of fragmented-ions from the co-eluting peaks with m/z difference smaller than the isolation window. All LC-MS raw data were processed by DataAnalysis (version 4.4, Bruker Daltonics) to obtain LC-MS spectra.
Bioinformatic analysis
Multiple sequence alignment was performed using Clustal Omega35. Phylogenetic tree shown in Fig. 3C was generated using W-IQ-TREE36. To generate sequence logo shown in Figs. 5D and 8B–E, 100 CdnE protein sequences, 288 CdnA01 protein sequences, 31 CdnB04 protein sequences, 265 CdnC01 protein sequences, and 115 CdnG10 protein sequences were aligned using Clustal Omega, which then subjected to WebLogo (version 2.8.2)37.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
Data are available within the article and supplementary information. The coordinates and structure factors of ClCdnE, EfCdnE, LpCdnE, ClCdnE-UTP complex, EfCdnE-pppU[3’−5’]pU complex, EfCdnE-pppU[2’−5’]p complex, LpCdnE-UTP complex and LpCdnE-UMPnPP complex generated in this study have been deposited in the Protein Data Bank under accession codes 7X4A, 7X4C, 7X4F, 7X4G, 7X4P, 8HYK, 7X4Q and 7X4T. The protein structures used for analysis in this study are available in the Protein Data Bank under accession codes 4XJ3, 6E0K, 6E0L, 6E0M, 6E0N, 6E0O, 6P80, 7LJN, and 7LJO. The LC-MS dataset supporting the conclusions of this article is available from MassIVE (https://massive.ucsd.edu) under accession code MSV000092570.
References
Yoon, S. H. & Waters, C. M. The ever-expanding world of bacterial cyclic oligonucleotide second messengers. Curr. Opin. Microbiol. 60, 96–103 (2021).
Aline Dias da, P., Nathalia Marins de, A., Gabriel Guarany de, A., Robson Francisco de, S. & Cristiane Rodrigues, G. The world of cyclic dinucleotides in bacterial behavior. Molecules 25, 2462 (2020).
He, J., Yin, W., Galperin, M. Y. & Chou, S. H. Cyclic di-AMP, a second messenger of primary importance: tertiary structures and binding mechanisms. Nucleic Acids Res. 48, 2807–2829 (2020).
Zhou, W. et al. Structure of the human cGAS-DNA complex reveals enhanced control of immune surveillance. Cell 174, 300–311.e311 (2018).
Hopfner, K. P. & Hornung, V. Molecular mechanisms and cellular functions of cGAS-STING signalling. Nat. Rev. Mol. Cell Biol. 21, 501–521 (2020).
Morehouse, B. R. et al. STING cyclic dinucleotide sensing originated in bacteria. Nature 586, 429–433 (2020).
Ko, T. P. et al. Crystal structure and functional implication of bacterial STING. Nat. Commun. 13, 26 (2022).
Kranzusch, P. J. et al. Structure-guided reprogramming of human cGAS dinucleotide linkage specificity. Cell 158, 1011–1021 (2014).
Severin, G. B. et al. Direct activation of a phospholipase by cyclic GMP-AMP in El Tor Vibrio cholerae. Proc. Natl. Acad. Sci. USA 115, E6048–e6055 (2018).
Cohen, D. et al. Cyclic GMP-AMP signalling protects bacteria against viral infection. Nature 574, 691–695 (2019).
Ye, Q. et al. HORMA domain proteins and a Trip13-like ATPase regulate bacterial cGAS-like enzymes to mediate bacteriophage iImmunity. Mol. Cell 77, 709–722.e707 (2020).
Whiteley, A. T. et al. Bacterial cGAS-like enzymes synthesize diverse nucleotide signals. Nature 567, 194–199 (2019).
Ko, T. P. et al. Crystal structure and functional implication of a bacterial cyclic AMP-AMP-GMP synthetase. Nucleic Acids Res. 49, 4725–4737 (2021).
Fatma, S., Chakravarti, A., Zeng, X. & Huang, R. H. Molecular mechanisms of the CdnG-Cap5 antiphage defense system employing 3’,2’-cGAMP as the second messenger. Nat. Commun. 12, 6381 (2021).
Hobbs, S. J. et al. Phage anti-CBASS and anti-Pycsar nucleases subvert bacterial immunity. Nature 605, 522–526 (2022).
Kranzusch, P. J. cGAS and CD-NTase enzymes: structure, mechanism, and evolution. Curr. Opin. Struct. Biol. 59, 178–187 (2019).
Steitz, T. A. A mechanism for all polymerases. Nature 391, 231–232 (1998).
Kato, K., Ishii, R., Hirano, S., Ishitani, R. & Nureki, O. Structural basis for the catalytic mechanism of DncV, bacterial homolog of cyclic GMP-AMP synthase. Structure 23, 843–850 (2015).
Govande, A. A., Duncan-Lowey, B., Eaglesham, J. B., Whiteley, A. T. & Kranzusch, P. J. Molecular basis of CD-NTase nucleotide selection in CBASS anti-phage defense. Cell Rep. 35, 109206 (2021).
Duncan-Lowey, B., McNamara-Bordewick, N. K., Tal, N., Sorek, R. & Kranzusch, P. J. Effector-mediated membrane disruption controls cell death in CBASS antiphage defense. Mol. Cell 81, 5039–5051.e5035 (2021).
Belogurov, G. A. & Artsimovitch, I. The mechanisms of substrate selection, catalysis, and translocation by the elongating RNA polymerase. J. Mol. Biol. 431, 3975–4006 (2019).
Lowey, B. et al. CBASS immunity uses CARF-related effectors to sense 3’-5’- and 2’-5’-linked cyclic oligonucleotide signals and protect bacteria from phage infection. Cell 182, 38–49.e17 (2020).
Davies, B. W., Bogard, R. W., Young, T. S. & Mekalanos, J. J. Coordinated regulation of accessory genetic elements produces cyclic di-nucleotides for V. cholerae virulence. Cell 149, 358–370 (2012).
Seifert, R. cCMP and cUMP: emerging second messengers. Trends Biochem. Sci. 40, 8–15 (2015).
Marondedze, C., Wong, A., Thomas, L., Irving, H. & Gehring, C. Cyclic nucleotide monophosphates in plants and plant signaling. Handb. Exp. Pharm. 238, 87–103 (2017).
Tal, N. et al. Cyclic CMP and cyclic UMP mediate bacterial immunity against phages. Cell 184, 5728–5739.e5716 (2021).
Brünger, A. T. et al. Crystallography & NMR system: A new software suite for macromolecular structure determination. Acta Crystallogr. D. Biol. Crystallogr. 54, 905–921 (1998).
Liebschner, D. et al. Macromolecular structure determination using X-rays, neutrons and electrons: recent developments in Phenix. Acta Crystallogr. D. Struct. Biol. 75, 861–877 (2019).
Casañal, A., Lohkamp, B. & Emsley, P. Current developments in Coot for macromolecular model building of Electron Cryo-microscopy and Crystallographic Data. Protein Sci. 29, 1069–1078 (2020).
Vagin, A. & Teplyakov, A. Molecular replacement with MOLREP. Acta Crystallogr. D. Biol. Crystallogr. 66, 22–25 (2010).
Waterhouse, A. et al. SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res. 46, W296–w303 (2018).
Murshudov, G. N. et al. REFMAC5 for the refinement of macromolecular crystal structures. Acta Crystallogr. D. Biol. Crystallogr. 67, 355–367 (2011).
Schrodinger, LLC. The PyMOL Molecular Graphics System, Version 2.3.3 (2019).
Chen, C. J., Liao, W. L., Chang, C. T., Lin, Y. N. & Tsai, F. J. Identification of urinary metabolite biomarkers of type 2 diabetes nephropathy using an untargeted metabolomic approach. J. Proteome Res. 17, 3997–4007 (2018).
Sievers, F. et al. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol. Syst. Biol. 7, 539 (2011).
Trifinopoulos, J., Nguyen, L. T., von Haeseler, A. & Minh, B. Q. W-IQ-TREE: a fast online phylogenetic tool for maximum likelihood analysis. Nucleic Acids Res. 44, W232–235 (2016).
Crooks, G. E., Hon, G., Chandonia, J. M. & Brenner, S. E. WebLogo: a sequence logo generator. Genome Res. 14, 1188–1190 (2004).
Acknowledgements
The authors thank National Synchrotron Radiation Research Center for beam time allocations and data collection supports. We also thank the Academia Sinica Protein Clinic, funded by Academia Sinica Core Facility and Innovative Instrument Project [AS-CFII-111-206] for the X-ray crystallographic service. This work was supported by the Ministry of Science and Technology of Taiwan [109-2311-B241-001 and 111-2311-B-039-001-MY3] (to Y.C.).
Author information
Authors and Affiliations
Contributions
C.S.Y. and T.P.K. designed and carried out the experiments and acquired and analyzed the data. C.J.C., M.H.H. and Y.C.W. carried out the experiments. Y.C. interpreted the data, wrote the manuscript, and supervised the entire project.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks Philip Kranzusch and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Yang, CS., Ko, TP., Chen, CJ. et al. Crystal structure and functional implications of cyclic di-pyrimidine-synthesizing cGAS/DncV-like nucleotidyltransferases. Nat Commun 14, 5078 (2023). https://doi.org/10.1038/s41467-023-40787-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-023-40787-9
- Springer Nature Limited