Crystal structures of N-terminal WRKY transcription factors and DNA complexes

Xu, Yong-ping; Xu, Hua; Wang, Bo; Su, Xiao-Dong

doi:10.1007/s13238-019-00670-0

Crystal structures of N-terminal WRKY transcription factors and DNA complexes

Letter
Open access
Published: 16 November 2019

Volume 11, pages 208–213, (2020)
Cite this article

Download PDF

You have full access to this open access article

Protein & Cell

Crystal structures of N-terminal WRKY transcription factors and DNA complexes

Download PDF

Yong-ping Xu¹,
Hua Xu¹,
Bo Wang¹ &
…
Xiao-Dong Su ORCID: orcid.org/0000-0001-6948-2317¹

3028 Accesses
24 Citations
1 Altmetric
Explore all metrics

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Dear Editor,

Plant-specific WRKY transcription factors (TFs) are among the largest families of TFs in higher plants; they are also found in the unicellular eukaryote Giardia lamblia and the slime mold Dictyostelium discoideum (Ulker and Somssich, 2004), but not in animals. WRKY TFs participate in diverse developmental and physiological processes in plants, such as disease resistance, abiotic stress responses, senescence, seed and trichome development, as well as additional developmental and hormone-controlled processes (Agarwal et al., 2011).

There are 75 WRKY family members identified so far in Arabidopsis and more than 100 in rice (UniProt:http://www.uniprot.org/). The WRKY TFs are named after their approximately 60 conserved amino acids of DNA binding domains (DBDs) called the WRKY domains required for W-box (5′-TTGAC-C/T-3′) DNA recognition (Eulgem et al., 2000). The WRKY domain contains a highly conserved WRKYGQK motif (forming a β strand) near the N-terminus and a zinc-finger motif at the C-terminus, featuring an atypical C2H2 (CX_4–5CX_22–23HX₁H) or C2HC (CX₇CX₂₃HXC) type (Eulgem et al., 2000). The zinc-finger structure is indispensable for the DNA-binding of WRKY TFs. Any substitutions of the conserved cysteine or histidine residue eliminate the protein-DNA interaction (Duan et al., 2007). Based on the number of WRKY domains and the zinc-finger structure, WRKY TFs are classified into groups I to III and each group is further divided into subgroups (Brand et al., 2013). Group I WRKY genes are defined by the presence of two WRKY domains, whereas groups II and III contain only a single WRKY domain (Eulgem et al., 2000). We named the WRKY domains as WRKY-N and WRKY-C respectively for the group I two-domain WRKY TFs. As shown by previous experiments, the specific binding to W-box was thought to be mediated mainly by the C-terminal WRKY domain, whereas the N-terminal WRKY domain showed weaker (Brand et al., 2013) or even no binding to W-box (Ishiguro and Nakamura, 1994; de Pater et al., 1996; Eulgem et al., 1999). However, recent one-hybrid studies on yeast demonstrated that the two WRKY domains of AtWRKY1 (Arabidopsis WRKY1 protein) were both essential for its transcriptional activities (Qiao et al., 2016). We have previously determined the crystal structure of the apo AtWRKY1-C (Arabidopsis WRKY1-C) that comprises a five-stranded β-sheet (Duan et al., 2007). The solution structures of the apo AtWRKY4-C (Arabidopsis WRKY4-C), and its complex with a W-box DNA were solved by NMR (Yamasaki et al., 2005; Yamasaki et al., 2012). Recently, a complex crystal structure of dimeric rice WRKY45-DBD, a group III (C2HC zinc finger domain) WRKY TF bound to two W-box DNA was reported (Cheng et al., 2019). To date, there is no structural information for any N-terminal WRKY domains.

In this study, we present the crystal structures of AtWRKY1-N (residues 108–169) at 3.0 Å, AtWRKY2-N (residues 270-332) at 2.4 Å and AtWRKY33-N (residues 185–242) at 3.0 Å in complex with a same 15-bp W-box double stranded DNA (dsDNA) (Tables S1 and S2). The three complex structures look very similar to each other and all comprise four β strands named β2 to β5, forming a β sheet (partially consistent with the other reported WRKY structures), and the sheet inserted almost vertically into the major groove of the W-box DNA (Fig. 1A–D). The notable changes of these three DNA complex structures are the orientation of the loop linking β4-β5 compared with other structures of AtWRKY1-C (PDB code: 2AYD), AtWRKY4-C (PDB code: 2LEX), OsWRKY45 (PDB code: 6IR8) and AtWRKY52 (PDB code: 5W3X) (Fig. 1E). The most striking differences are that the distances (measured between main chain CO and NH groups) between β3 and β4 of these three AtWRKY-N proteins are closer than those of other WRKY domains (mean distance 3.0 Å for AtWRKY-N proteins vs. 5–6 Å for the others); Fig. 1F and 1G showed two sets of comparisons between group I WRKY proteins. As mentioned above, the OsWRKY45-DBD can form a homodimer by swapping the β4-β5 strands (Cheng et al., 2019). From above comparison, we could suggest that the domain swapping of OsWRKY45-DBD is relevant to the flexibility around β3 and β4 strands.

The residues directly interacting with the dsDNA are conserved in all three complex structures (Fig. 2A), and they interact with DNA in the same manner, either specific base recognition (Fig. S1) or nonspecific interaction to the main chain phosphate groups. We will take the structure of AtWRKY1-N to describe the details below. Besides the previous reported recognition of the W-box sequence by AtWRKY4-C mainly via apolar contacts with the methyl groups of the TT bases (Yamasaki et al., 2012) (Fig. 2C and 2D), our crystal structures showed extensive H-bond interactions between AtWRKY1-N and dsDNA with recognition of the Crick strand G6’, G7’ and C9’ (using prime “ base’ ” to denote the 5′ -> 3′ reading from the Crick strand) (Fig. 2B and 2E). In our structure, most of distances between amino acids (AtWRKY1-N) and bases are closer (>0.2 Å) than those of AtWRKY4-C (Fig. 2B–E). Accordingly, the AtWRKY1-N contributes to DNA binding with higher affinity of K_D ~0.1 µmol/L (Fig. 2G), whereas the K_D for the AtWRKY1-C is 1.3 µmol/L (Fig. S2B) measured by ITC (isothermal titration calorimetry) assay, a 13-fold increase in the DNA binding affinity.

The DNA recognition was accomplished by seven base-specific interactions (Watson strand T5, T6, T7 and Crick strand G6’, G7’, T8’, C9’) and nonspecific interactions with the phosphate backbone (Fig. 2F). To provide a unified description for all WRKY domains with different numbering, we renumbered the highly conserved WRKYGQK sequence as W₁R₂K₃Y₄G₅Q₆K_7. The base-recognition of Watson strand are mainly by hydrophobic interactions, including: (a) R₂ and K₃ to T5, (b) K₃ and Y₄ to T6, (c) K₃, G₅ and Q₆ to T7 (Fig. 2F). In addition, the side chain of R₂ and K₃ contact the phosphate backbone of C4, T5 and T6 through nonspecific salt bridges and hydrogen bonds (Fig. 2F). As for the Crick strand, there are more interactions involved in DNA recognition. The hydrophobic interactions include: (a) G₅, Y₄, K₇ and Y133 to T8’, (b) Y₄ and G₅ to C9’. There also exist H-bonds, including: (a) the amino-group of K₇ forms hydrogen bonds with the O6 and N7 atoms of G6’ and G7’, (b) the carboxyl oxygen of Y₄ and the N4 atom of C9’ (Fig. 2F). Meanwhile, the H-bonds and electrostatic interactions between protein and DNA phosphate groups strengthen the binding preference to the sequence -G6’G7’T8’C9’- of the Crick strand. The H-bonds include: (a) the guanidyl of R131 and the hydroxyl of Y133 to the G7’, (b) the hydroxyl of Y₄ and the side chain of K144 to T8’, (c) the guanidyl of R135 to C9’. Electrostatic interactions are found between Arg or Lys and DNA phosphate group, including R131 to G7’, K144 to T8’ and R135 to C9’ (Fig. 2F).

We also studied the contacts described above by base substitution and evaluated their binding affinities by ITC measurements (Fig. 2G). The binding affinity of the substitutions of G7’C9’ to A7’A9’ is much weaker than in the original sequence. The K_D between AtWRKY1-N and G7’C9’ to A7’A9’ is 7.6 µmol/L, whereas the K_D between AtWRKY1-N and the original W-box is 0.1 µmol/L, a 76-fold decrease (Fig. 2G and Table S3), showing the importance of these two bases for the specific recognition. In addition, the substitution of T8’ to A8’ significantly decreased the affinity by 12-fold (Fig. 2G and Table S3). However, the substitution of G6’ to T6’ only decreased the binding affinity by 2.5-fold (Fig. S2A and Table S3). Subsequently, we mutated the T6 and T7 of the Watson strand to C6 and G7, which only decreased the affinity by 4.5-fold (Fig. 2G and Table S3). Therefore, TT bases of the Watson strand are not as important to AtWRKY1-N as to AtWRKY4-C (Yamasaki et al., 2012) and OsWRKY45-DBD (Cheng et al., 2019). These results demonstrate that the DNA-protein interaction of AtWRKY1N is mainly concentrated on the Crick strand particularly around sequence of “-G7’T8’C9’-”. The substitution of the bases outside the “-G7’T8’C9’-” only impairs the binding slightly (Table S3). A previous work also revealed that the DBDs of AtWRKY11 and AtWRKY50 bind to an invariant ‘GAC’ core consensus (reading from the Watson strand) (Brand et al., 2013), consistent with our results.

Next, we investigated the residues of AtWRKY1-N involved in DNA binding by site-directed mutagenesis and electrophoretic mobility shift assay (EMSA). The mutant of R117A (R₂) or K118A (K₃), interacting with TT bases of the Watson strand, could still bind to DNA (Fig. S3A and S3B), whereas the mutation of K416A (K₃) in AtWRKY4-C eliminated its DNA binding activity (Yamasaki et al., 2012). The mutants Q121A (Q₆), K122A (K₇), Y133A, R135A and K144A appeared to not bind to DNA without an apparent shift band (Fig. S3D, S3E and S3G–I), noticeably the mutant K122A (K₇), with Y119 (Y₄), Q121 (Q₆), K122 (K₇), Y133, R135 and K144 directly in contact with the sequence G7’, T8’, C9’ and T7 (Fig. 2F). The mutant Y119A still bound to DNA (Fig. S3C) because Y119 (Y₄) forms a hydrogen bond with base C9’ via the main chain oxygen atom (Fig. 2E). These results are consistent with the complex structures observed above and ITC results.

The classical model of a transcription factor searching for its specific site presumes that positively charged DBD binds first to dsDNA somewhere non-specifically and then slides on the DNA in one dimension to find the specific site (Berg et al., 1981). In our case, the residues involved in non-specific contacts surrounding the phosphate groups appear to enable the protein to locate closer to the DNA major groove non-specifically, and the K₇ contributes to searching for the optimal specific binding site. However, we could not obtain the dynamic process from the static picture of our crystal structures. The residue K₇ is absolutely conserved for all WRKY proteins. We thus propose that K₇ is the key amino acid for all WRKY domains to search for and bind to dsDNA specifically. In our three complex structures, the K₇ interacts with G6’ and G7’ with different but similar distances (Fig. S4A–C). To understand the role of K₇ in different WRKY domains, we mutated it to Ala, Gln and Arg. Only the mutant K284R of AtWRKY2-N could form a slight band with DNA in one of the WRKY domains while the other mutants completely eliminated the DNA binding ability (Fig. S4D–F).

All together, we have shown that the N-terminal group I WRKY domains bind to W-box DNA as well (if not better) as the C-terminal WRKY domains, with quite different binding mode (more extensive interaction to the Crick strand and to the ‘GAC’ core sequence). Furthermore, the EMSA and ITC results show that AtWRKY1^101−339 (residues 101–339, comprising both WRKY domains) can bind to two W-box DNA at the same time (Fig. 2H–I). The K_D between AtWRKY1^101−339 and W-box DNA is 0.5 µmol/L with two DNA binding sites (Fig. 2I). The mechanism of two binding-sites on group I WRKY proteins immediately suggests that group I WRKY TFs can interact and recruit more DNA partners than previous knowledge of a single domain of WRKY TF binding to one W-box DNA (Fig. S5).

WRKY TFs bind to DNA specific sites in the promoters of target genes to regulate their expression. However, all WRKY TFs bind to the same W-box sequence, raising the question of how specificity is achieved and differentiated between different promoters and WRKY TFs. The differences in their binding site preferences were suggested to partly depend on flanking sequences outside the TTGACY-core motif (Ciolkowski et al., 2008). Our study also emphasized that N-terminal WRKY domain interacting with W-box is more concentrated on a conserved ‘-G’T’C’-’ consensus on the Crick strand (Figs. 2F, 2G and S2A), indicating some diversity in the binding sequences since there should be many more binding sites with the three bases ‘-G’T’C’-’ (or ‘GAC’ reading from the Watson strand) consensus. A WRKY gene from Tamarix hispida, ThWRKY4, could bind to two other motifs: a W-box like sequence (GTCTA) and the RAV1A element (CAACA) (Xu et al., 2017). The former consists of the invariant ‘GTC’ motif while another is a novel sequence. These studies suggest and in agreement with our results that the WRKY TFs not only recognize the conventional W-box (TTGACC), but also could bind to other DNA sequences.

References

Agarwal P, Reddy MP, Chikara J (2011) WRKY: its structure, evolutionary relationship, DNA-binding selectivity, role in stress tolerance and development of plants. Mol Biol Rep 38:3883–3896
Article CAS Google Scholar
Berg OG, Winter RB, von Hippel PH (1981) Diffusion-driven mechanisms of protein translocation on nucleic acids. 1. Models and theory. Biochemistry 20:6929–6948
Article CAS Google Scholar
Brand LH, Fischer NM, Harter K, Kohlbacher O, Wanke D (2013) Elucidating the evolutionary conserved DNA-binding specificities of WRKY transcription factors by molecular dynamics and in vitro binding assays. Nucleic Acids Res 41:9764–9778
Article CAS Google Scholar
Cheng X, Zhao Y, Jiang Q, Yang J, Zhao W, Taylor IA, Peng YL, Wang D, Liu J (2019) Structural basis of dimerization and dual W-box DNA recognition by rice WRKY domain. Nucleic Acids Res 47:4308–4318
Article CAS Google Scholar
Ciolkowski I, Wanke D, Birkenbihl RP, Somssich IE (2008) Studies on DNA-binding selectivity of WRKY transcription factors lend structural clues into WRKY-domain function. Plant Mol Biol 68:81–92
Article CAS Google Scholar
de Pater S, Greco V, Pham K, Memelink J, Kijne J (1996) Characterization of a zinc-dependent transcriptional activator from Arabidopsis. Nucleic Acids Res 24:4624–4631
Article Google Scholar
Duan MR, Nan J, Liang YH, Mao P, Lu L, Li L, Wei C, Lai L, Li Y, Su XD (2007) DNA binding mechanism revealed by high resolution crystal structure of Arabidopsis thaliana WRKY1 protein. Nucleic Acids Res 35:1145–1154
Article CAS Google Scholar
Eulgem T, Rushton PJ, Robatzek S, Somssich IE (2000) The WRKY superfamily of plant transcription factors. Trends Plant Sci 5:199–206
Article CAS Google Scholar
Eulgem T, Rushton PJ, Schmelzer E, Hahlbrock K, Somssich IE (1999) Early nuclear events in plant defence signalling: rapid gene activation by WRKY transcription factors. EMBO J 18:4689–4699
Article CAS Google Scholar
Ishiguro S, Nakamura K (1994) Characterization of a Cdna-encoding a novel DNA-binding protein, Spf1, that recognizes Sp8 sequences in the 5’ upstream regions of genes-coding for sporamin and beta-amylase from sweet-potato. Mol Gen Genet 244:563–571
Article CAS Google Scholar
Qiao Z, Li CL, Zhang W (2016) WRKY1 regulates stomatal movement in drought-stressed Arabidopsis thaliana. Plant Mol Biol 91:53–65
Article CAS Google Scholar
Ulker B, Somssich IE (2004) WRKY transcription factors: from DNA binding towards biological function. Curr Opin Plant Biol 7:491–498
Article Google Scholar
Xu H, Shi X, Wang Z, Gao C, Wang C, Wang Y (2017) Transcription factor ThWRKY4 binds to a novel WLS motif and a RAV1A element in addition to the W-box to regulate gene expression. Plant Sci 261:38–49
Article CAS Google Scholar
Yamasaki K, Kigawa T, Inoue M, Tateno M, Yamasaki T, Yabuki T, Aoki M, Seki E, Matsuda T, Tomo Y et al (2005) Solution structure of an Arabidopsis WRKY DNA binding domain. Plant Cell 17:944–956
Article CAS Google Scholar
Yamasaki K, Kigawa T, Watanabe S, Inoue M, Yamasaki T, Seki M, Shinozaki K, Yokoyama S (2012) Structural basis for sequence-specific DNA recognition by an Arabidopsis WRKY transcription factor. J Biol Chem 287:7683–7691
Article CAS Google Scholar

Download references

Footnotes

This work was supported by grants 31670740 and 31270803 from NSFC (the National Science Foundation of China). We thank the Shanghai Synchrotron Radiation Facility (SSRF) for providing us with opportunities to test the crystals and to collect datasets on the BL19U beamline. We thank the KEK Photon Factory and staff members for their assistance in data collection. We thank the National Center for Protein Sciences at Peking University (Beijing) for providing experimental equipment.

X.-D.S. conceived the project. Y.X. and H.X performed gene construction, protein expression and purification, and crystal screening and optimization. Y.X. and B.W. performed the collection of X-ray diffraction data and structure determination. Y.X. performed the EMSA assays, ITC assays and structure analysis. X.-D.S. and Y.X. wrote the manuscript.

Yong-ping Xu, Hua Xu, Bo Wang and Xiao-Dong Su declare that they have no conflict of interest.

This article does not contain any studies with human or animal subjects performed by any of the authors.

Author information

Authors and Affiliations

State Key Laboratory of Protein and Plant Gene Research, and Biomedical Pioneering Innovation Center (BIOPIC), Peking University, Beijing, 100871, China
Yong-ping Xu, Hua Xu, Bo Wang & Xiao-Dong Su

Authors

Yong-ping Xu
View author publications
You can also search for this author in PubMed Google Scholar
Hua Xu
View author publications
You can also search for this author in PubMed Google Scholar
Bo Wang
View author publications
You can also search for this author in PubMed Google Scholar
Xiao-Dong Su
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xiao-Dong Su.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (PDF 839 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Xu, Yp., Xu, H., Wang, B. et al. Crystal structures of N-terminal WRKY transcription factors and DNA complexes. Protein Cell 11, 208–213 (2020). https://doi.org/10.1007/s13238-019-00670-0

Download citation

Published: 16 November 2019
Issue Date: March 2020
DOI: https://doi.org/10.1007/s13238-019-00670-0

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Crystal structures of N-terminal WRKY transcription factors and DNA complexes

References

Footnotes

Author information

Authors and Affiliations

Corresponding author

Electronic supplementary material

Supplementary material 1 (PDF 839 kb)

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation