Targeted transcriptional repression using a chimeric TALE-SRDX repressor protein
Transcriptional activator-like effectors (TALEs) are proteins secreted by Xanthomonas bacteria when they infect plants. TALEs contain a modular DNA binding domain that can be easily engineered to bind any sequence of interest, and have been used to provide user-selected DNA-binding modules to generate chimeric nucleases and transcriptional activators in mammalian cells and plants. Here we report the use of TALEs to generate chimeric sequence-specific transcriptional repressors. The dHax3 TALE was used as a scaffold to provide a DNA-binding module fused to the EAR-repression domain (SRDX) to generate a chimeric repressor that targets the RD29A promoter. The dHax3.SRDX protein efficiently repressed the transcription of the RD29A::LUC transgene and endogenous RD29A gene in Arabidopsis. Genome wide expression profiling showed that the chimeric repressor also inhibited the expression of several other genes that contain the designer TALE-target sequence in their promoters. Our data suggest that TALEs can be used to generate chimeric repressors to specifically repress the transcription of genes of interest in plants. This sequence-specific transcriptional repression by direct on promoter effector technology is a powerful tool for functional genomics studies and biotechnological applications.
KeywordsChimeric repressorsTALE-based repressorsTargeted repressionEAR-repression domain
Plants are sessile organisms and rely on a wide array of molecular mechanisms to control and adjust their adaptive responses to developmental and environmental cues. Understanding these responses requires that we understand the functioning of plant genes. Although such understanding has become increasingly possible with rapid advances in DNA sequencing technologies, the challenge is to turn information about gene sequence into knowledge about gene function. The analysis of the completed plant genome sequences revealed that transcriptional regulation plays a very pronounced role in plants (Riechmann et al. 2000). The Arabidopsis genome is predicted to contain 25,498 genes, and more than 2000 of these are transcription factors. Consequently, the percentage of transcription factor genes relative to the whole gene content is higher in Arabidopsis (6–10%) than in organisms with genomes of similar size including Drosophila melanogaster (4.7%) and Caenorhabditis elegans (3.6%) (Riechmann et al. 2000). Plants use many different transcription factors to modulate gene expression and achieve the correct spaciotemporal control of gene activation and repression. Researchers use factors that activate or repress gene expression to understand gene function and phenotypic effects (Krogan and Long 2009). Transcriptional repression effects occur at several levels and involve active and passive repressors. Active repressors exhibit an intrinsic repression activity because they have defined repression domains that can interfere with the formation of the transcription pre-initiation complex and basal transcription (Krogan and Long 2009; Mahfouz 2010). Passive repressors, in contrast, interfere with transcriptional activators and prevent their binding to the DNA to cause transcription repression (Kazan 2006; Krogan and Long 2009).
Chimeric repressors, which have been used to study gene function in animal and plant systems, have a DNA-binding domain or a transcription factor fused to a variety of repression domains (Beerli et al. 2000; de Haan et al. 2000; Ohta et al. 2001). Transcription repressors that contain the EAR-repression domain (ERF-associated amphiphilic repression) were the first to be identified in plants (Kagale and Rozwadowski 2011; Ohta et al. 2001). The exact mechanism of function of the transcriptional repressors is not known but may involve interaction with co-repressors (Szemenyei et al. 2008). Researchers have fused the EAR motif to a number of transcription factors to generate dominant chimeric repressors (Hiratsu et al. 2003; Ikeda and Ohme-Takagi 2009; Matsui et al. 2008; Mito et al. 2010; Tsutsui et al. 2009). The application of this technology, however, is limited to the pre-existing transcription factors. In the chimeric repressor silencing technology or “CRES-T” system, the transcription factors were used as a DNA-binding module to repress the transcription factor’s target genes but because many of the transcription factor targets are not known, it is difficult or impossible to achieve repression to specific genes. This repression technology would be much more powerful if it could be applied to a user-selected and defined single locus or multiple loci.
A chimeric transcription repressor or activator, in principle, is composed of a DNA-binding domain, an activation or repression domain, and a nuclear-localization signal (Guan et al. 2002; Yaghmai and Cutting 2002). Exploiting this fundamental principle to generate chimeric repressors or activators with genome-wide applicability requires a DNA-binding domain that can be modified and adapted to selectively and specifically bind to any DNA sequence in the genome. This kind of DNA-binding domain exists in phytopathogenic bacteria in the genus Xanthomonas. Phytopathogenic Xanthomonas spp. use the type III secretion system to inject transcription activator-like effectors (TALEs) into plant cells (Boch and Bonas 2010). These TALEs, which act as virulence factors, translocate to the nucleus and function as transcription factors by binding to their DNA targets in the promoter regions and reprograming the gene expression of the host (Kay et al. 2007; Romer et al. 2007). The repeats of the DNA-binding domains (DBD) dictate the specificity of TALEs to their DNA target (Boch et al. 2009; Moscou and Bogdanove 2009). The repeats are nearly identical, and each repeat is composed of 34 or 35 amino acids and has a repeat variable diresidue (RVD) at positions 12 and 13. There is a one-to-one correspondence between each RVD and a single nucleotide in the DNA target, and hence the identity and the sequence of the RVDs determine the specificity of each TALE to its target (Boch et al. 2009; Moscou and Bogdanove 2009).
The Brassicaceae pathogen X. campestris pv. armoraciae strain 5 produces Hax3, which is a member of the AvrBs3 family of TALEs (Kay et al. 2005). Hax3 TALE protein has all the structural features of TALEs including a DNA-binding domain composed of 11.5 repeat units, a nuclear-localization signal, and an acidic activation domain. The chief advantage of TALEs for the design of transcriptional repressors or activators is their versatility; they could be used, in principle, to target any functional domain to any gene locus by fusing the functional domain to an appropriately modified TALE DBD (Christian et al. 2010; Li et al. 2011a; Mahfouz et al. 2011; Morbitzer et al. 2010; Weber et al. 2011). Thus, TALEs could be used to take advantage of the entire repertoire of the transcription-regulatory domains to achieve the desired level and type of regulation for any gene of interest. Hax3 recognizes a 12-bp DNA-binding element and has been used to generate a de novo engineered hybrid nuclease capable of generating double-strand breaks in vitro and in vivo in plants (Mahfouz et al. 2011).
Here, we report the generation of a chimeric transcriptional repressor that targets an element in the RD29A promoter. Our data demonstrate that the dHax3.SRDX chimeric repressor selectively suppresses the expression of the RD29A::LUC transgene and the RD29A endogenous gene as well as several other genes with the target element in their promoters in Arabidopsis. This gene repression technology will be useful for functional genomics studies in crops and should have significant biotechnological applications.
Design and generation of the dHax3.SRDX TALE-based transcriptional repressor to target an RD29A promoter element
Hax3.SRDX chimeric repressor protein binds to its target sequence in vitro
To examine the ability of the bacterially expressed dHax3.SRDX chimeric repressor to bind to its DNA target, we performed electrophoretic mobility shift assays (EMSA) using the bacterially expressed dHax3.SRDX and a biotin-labeled double-stranded oligo-nucleotide containing the 12-bp DNA target sequence preceded by the T nucleotide. The assays showed that the dHax3.SRDX was capable of binding to its target DNA sequence and that the binding was competitively reduced by the addition of the same but unlabeled double-stranded oligo-nucleotides. The results suggest that the fusion of the SRDX domain did not affect the ability of the dHax3.SRDX chimeric repressor protein to bind to its target site (Fig. 2b).
Expression of dHax3.SRDX repressor results in repression of the RD29A promoter in vivo
To test whether the dHax3.SRDX causes transcriptional repression of the luciferase transgene driven by the RD29A promoter and the endogenous RD29A gene in their chromosomal contexts, we generated stable transgenic lines of Arabidopsis ectopically expressing Hax3.SRDX under the cauliflower mosaic virus 35S promoter in a background containing the RD29A::LUC transgene (ecotype C24, referred to as the WT). Several transgenic lines over-expressing 35S::dHAX3.SRDX were selected on MS nutrient medium supplemented with hygromycin at 30 mg/L. Homozygous T3 plants expressing 35S::dHAX3.SRDX were used to analyze the transcriptional repression of the RD29A promoter in both the luciferase transgene and the endogenous RD29A gene.
Because the RD29A promoter is also induced by salt, we tested the ability of the dHax3.SRDX chimeric repressor to repress the RD29A promoter after salt treatment (300 mM NaCl, 3 h). Our data show that the dHax3.SRDX chimeric repressor can significantly repress the RD29A promoter after salt treatment (Fig. 3c).
Since the repression might be mediated merely by the binding of the dHax3 effector, to its promoter, and not by SRDX repression domain of dHax3.SRDX repressor, we tested this possibility by functional analysis in N. benthamiana transient assays. We inserted the dHax3.SRDX effector-binding element in the Bs3 minimal promoter to generate ∆dHax3Bs3::uidA and co-delivered this construct or its WT version (Bs3::uidA) with 35S::dHax3 or 35S::dHax3.SRDX, respectively (Morbitzer et al. 2011). Our data clearly indicate that the dHax3 activates the ∆dHax3Bs3::uidA expression (Fig. 4b, panel 2). dHax3.SRDX, however, strongly repressed the expression of the ∆dHax3Bs3::uidA (Fig. 4b, panel 7).
Genome-wide effect of the dHax3.SRDX chimeric repressor on gene expression
Several reports have shown that TALEs can be used as DNA-binding modules with high specificity and precision (Li et al. 2011a; Morbitzer et al. 2010; Zhang et al. 2011). TALEs with a higher number of repeats and longer DNA targets might be more specific than those with shorter repeats and DNA targets (Boch et al. 2009). To test the specificity of the dHax3.SRDX chimeric repressor, we used the Patmatch algorithm (http://is.gd/g31n9) to scan the Arabidopsis genome for sequences that contain matching boxes specifically in the promoter regions. We identified in the −500 bp upstream sequence only one perfect match that corresponds to the RD29A promoter (AT5G52310). The code of the RVDs DNA-binding specificities, however, is somewhat degenerate. For example, the NS RVD could bind to A, C, G, or T nucleotide, with higher affinity to A and reduced affinity to C, G, and T nucleotides (Boch et al. 2009; Miller et al. 2011). We scanned the Arabidopsis genome for potential dHax3.SRDX binding to DNA boxes that include TCCCTTTATCTCT, TCCCTTTCTCTCT, TCCCTTTGTCTCT, or TCCCTTTTTCTCT in the −500 bp upstream promoter region. We found one perfect match, in the RD29A gene promoter that includes the TCCCTTTATCTCT box, and 20 other possible binding hits on either strand, including C, G, or T nucleotide corresponding to the 7th repeat in the tandem repeat region of the DNA-binding module. Moreover, we identified 70 additional possible targets when we extended the search to −3,000 bp upstream promoter regions and 82 binding hits in the genes including introns and UTR regions.
In the present study, we demonstrate that the TALE-based dHax3.SRDX chimeric protein designed to target the RD29A promoter can repress the RD29A::LUC transgene and RD29A endogenous gene in a chromosomal context and thus can be used as an effective artificial repressor. The repeats of the dHax3 DNA-binding domain were modified so that it can bind to a DNA box in the RD29A promoter. The dHax3 was fused in frame to the SRDX domain to generate the dHax3.SRDX. Our data suggest that the fusion of the SRDX domain does not affect the conformation of the dHax3 since the fusion protein was capable of binding to its target sequence in vitro (Fig. 2b). dHax3.SRDX, driven by the CaMV 35S promoter, was ectopically expressed in Arabidopsis containing the RD29A::LUC transgene. The RD29A promoter can be activated by a variety of environmental stresses including salt, drought, cold, and also the phytohormone ABA (Chinnusamy et al. 2002; Hua et al. 2006; Ishitani et al. 1997; Kasuga et al. 2004). To test the ability of the dHax3.SRDX to repress its target genes, we used luciferase imaging to analyze LUC expression in several Arabidopsis lines expressing dHax3.SRDX. ArabidopsisdHax3.SRDX lines grown on MS media and treated with 100 μm ABA for 3 h showed a significant suppression of the luciferase signal relative to the wild type control, indicating the repression of the RD29A::LUC transgene by the dHax3.SRDX chimeric repressor protein (Fig. 3a). The dHax3.SRDX also suppressed the expression of RD29A::LUC transgene in response to cold or salt treatments (Fig. 3b, c). Moreover, our data demonstrate the ability of the dHax3.SRDX to suppress the expression of the RD29A::LUC transgene after a combined ABA and cold treatment, indicating its ability to simultaneously inhibit the activities of multiple transcriptional activators, which integrate signals from different signaling pathways (Fig. 4).
Our data indicate that the dHax3.SRDX chimeric repressor protein suppressed the transcription of a RD29A::LUC transgene and RD29A endogenous gene in a chromatin context. These data demonstrate the versatility of the repression mechanism and the ability of the TALE-based repressors to suppress the expression of more than one gene simultaneously. It should be noted that the transient and stable expression of TALE protein alone did not suppress the expression of RD29A promoter indicating that the repression is due to the presence of the SRDX repression domain (Fig. 4b) (Mahfouz et al. 2011). Moreover, the use of the strong 35S promoter might not be necessary to obtain gene repression effects. A lower ratio of repressor to activator was able to cause efficient suppression of the reporter gene in Arabidopsis transient assays (Fujimoto et al. 2000). Our data demonstrate a significant reduction in the expression of the endogenous RD29A gene, although the repression of RD29A was generally less than that of the RD29A::LUC transgene as evidenced by real-time RT-PCR (Fig. 5a, b). The ability of the TALE repressor to bind to its target might be affected by the chromatin status or the distance of the TALE binding motif from the transcriptional start site may influence its repression activity.
Because the dHax3.SRDX chimeric repressor contains the NS-type RVD at the 7th position, we tested the ability of the repressor to repress genes that contain binding boxes where the 7th nucleotide in the DNA target is A, C, G, or T. We identified 21 potential hits in the −500 bp upstream promoter region and 70 more gene hits in the −3,000 bp upstream region. It should be mentioned that these are not off-targets since they have the TALE-SRDX DNA binding box according to the TALE DNA binding code. Off-targets are defined as the genes whose level of expression is modulated due to the expression of TALE-SRDX and may contain a non-perfect match of the DNA binding box. Our microarray data indicate that the expression of 5 gene hits in the −500 bp regions is repressed in all tested lines. The repression levels vary among the targets perhaps because of the chromosomal position effect or the distance from the transcription start site. Moreover, two gene hits (AT2G41230 and AT2G07280) containing the dHax3.SRDX DNA-binding box on the opposite strand in −3,000 bp upstream region, show significantly higher levels of repression. These two genes contain multiple repeats with similar sequence to the DNA-binding box TCCCTTT[ACGT]TCTCT within −5 kb upstream of the transcription start site. In fact, there are eight repeats of 50-bp motifs with a DNA sequence similar to that of the DNA binding box (TCCCTTGTAAATCATTCT). The high level of repression of these two genes could be due to their local chromatin environments, the presence of the DNA-binding box on the opposite strand, the distance of the DNA-binding box relative to the transcriptional start site, or the presence of multiple copies of a motif similar to the DNA-binding box. The presence of multiple sequences similar to the DNA-binding box might help recruit a large number of the chimeric repressor to result in strong transcriptional repression. It appears that the transcriptional repression occurred regardless of the dHax3.SRDX binding orientation because the presence of the EBE on the forward or reverse strand relative to the gene resulted in repression activities. This is consistent with proposed models of SRDX-mediated repression where SRDX might function by disrupting the formation of the transcription complex or recruiting repressors and co-repressors. These data indicate that the chromatic status may affect the binding of the dHax3.SRDX or the function of the SRDX domain. Moreover, many details are unknown concerning the nature of TALE binding, including the chromatin environments and optimal distance from the transcription start site. These details are of paramount importance for the future design of effective TALE-based activators or repressors.
Because de novo TALEs (dTALEs) with higher repeat units might be expected to possess higher target specificity (Morbitzer et al. 2011), dHax3.SRDX chimeric repressors should be designed with at least 18 repeat units if a single gene is to be targeted. However, designing dTALEs with variable repeat lengths and RVDs with degenerate nucleotide binding would be useful for analyzing the nature and details of the binding and activity of dTALE activators and repressors. Several parameters for TALE target designs were suggested based on the study of positional biases, neighbor effects, and the binding of RVDs to nucleotides in the overall target (Cermak et al. 2011). According to these suggestions, there is strong bias against T at position 1, A at position 2, and G at position 3 and the next to the last position, and a moderate bias for T at the last position (Cermak et al. 2011). These and other suggestions may be useful for designing effective TALE DNA-binding modules. Moreover, several TALEs repeats assembly protocols were recently reported, and these should facilitate the efficient design and generation of TALEs for user-selected targets (Cermak et al. 2011; Li et al. 2011b; Morbitzer et al. 2011; Zhang et al. 2011).
Our present study suggests that a single Hax3.SRDX protein can be designed to silence a single gene or a group of genes sharing DNA-binding elements in the promoter regions, making it possible to functionally characterize single genes, multiple genes or gene families. It remains to be determined whether the TALE-based repression technology could be a useful alternative to RNA interference (RNAi), a widely used technology that suffers from a variety of disadvantages and pitfalls. The demonstration of the effectiveness of the TALE-based repression technology to different promoters, gene families and chromosomal loci compared to RNAi is required to draw a conclusion. The primary disadvantage of RNAi is the off-target effects, i.e., sequence complementarities between an siRNA and the mRNA involving fewer than 10 nucleotides can lead to the reduction of expression (Birmingham et al. 2006). dTALE repressors, in contrast, can be designed to bind specifically to one genomic target. RNAi-based silencing exhibits variable effects depending on the target gene, the targeted region of the transcript, and even between different lines carrying the same RNAi construct (Wang et al. 2005). Moreover, RNAi phenotypes can be lost over several generations. Because RNAi suffers from problems related to efficacy, specificity and stability, silencing by TALE chimeric repressors might represent an attractive alternative with enormous potential for basic and applied research. For example, silencing by TALE chimeric repressors could be used for the genetic manipulation of commercially important crop plants (Mahfouz and Li 2011). This gene repression technology may also be applied to crop plants to identify genes responsible for traits of interest including biotic and abiotic stress resistance, nitrogen use efficiency, high yield, and control of fertility.
Materials and methods
dHax3.SRDX design and vector construction
pET32a.dHax3.SRDX was generated by performing LR recombination reactions between the dHax3.SRDX entry clone and the gateway-compatible pET32a expression vector according to manufacturer’s protocol (Invitrogen). The expression clone was transformed into Escherichia coli BL21, and the protein expression was induced at 25°C for 5 h with 1 mM isopropyl β-d-1thiogalactopyranoside (IPTG). The TRX.dHax3.SRDX protein was purified using Qiagen Ni–NTA agarose resin according to the manufacturer’s instructions. LR reactions were performed between the pENTR/dHax3.SRDX entry clone and the pMDC32 gateway compatible binary vector to generate the final construct for in planta analysis. The pMDC32/dHax3.SRDX clone was confirmed by sequencing and transformed into Agrobacterium tumefaciens GV3101 and then to Arabidopsis plants by the floral dipping method (Zhang et al. 2006).
Electrophoretic mobility shift assay (EMSA)
EMSA was performed using the Lightshift Chemiluminescent kit (Pierce) and following the manufacturer’s instructions. For dHax3.SRDX DNA-binding studies, the TRX.dHax3.SRDX protein was purified as described above, and the protein concentration was measured with a Bradford protein assay kit (Bio-Rad). Complementary pairs of 5′ biotin-labeled or non-labeled oligonucleotides were annealed and used as probes for the binding studies. EMSA binding-reaction buffer contained 12 mM Tris–HCL, 60 mM KCl, 1 mM DTT, 2.5% glycerol, 5 mM MgCl2, 50 ng/μL poly(dI.dC), 0.05% Nonidet P-40, 0.2 mM EDTA, 35 fmol biotin-labeled DNA, 10 pmol unlabeled DNA, and 30–480 fmol TRX.His fusion proteins. The EMSA binding reactions were kept on ice for 10 min before the biotin-labeled probe was added. Gel electrophoresis was performed using 8% Tris/Borate/EDTA (TBE) native ready-made gels from Invitrogen. Blotting was performed on a positively charged nylon membrane, and the membrane was cross-linked using CL-1000 UV cross-linker for 30 s.
Agrobacterium and Arabidopsis transformation
Arabidopsis thaliana (ecotype C24) expressing the chimeric RD29A::LUC reporter gene (referred to as the wild type or WT in this study) was used for the in planta analysis of the dHax3.SRDX repressor. pMDC32.dHax3.SRDX was transformed into Agrobacterium strain GV3101 by the freeze-and-thaw method. A transformed single colony of Agrobacterium GV3101 was used to transform Arabidopsis C24 RD29A::LUC plants as described previously (Zhang et al. 2006). Transformants were selected on MS agar plates supplemented with 30 mg/L of hygromycin B. T3 plants were used for the transcriptional repression analysis.
Assay of RD29A::LUC expression
WT (C24 RD29A::LUC) and dHax3.SRDX over-expression lines were grown on MS agar media for 1 week. Seedlings were sprayed with 100 μm ABA for 3 h or kept at 4°C for 24 h (cold treatment) or incubated on filter papers soaked with 300 mM NaCl (salt treatment) (Chinnusamy et al. 2002). Plates with treated plants were sprayed with Luciferin at room temperature and kept in the dark for 5 min before they were placed in the dark chamber of the CCD camera (PIXIS 2048). Image quantification was carried out using the Winview software supplied by the manufacturer.
RNA isolation and quantitative real-time PCR
The expression levels for the LUC transgene and RD29A endogenous gene were analyzed in the treated and untreated WT and dHax3.SRDX lines. Total RNA was purified using the RNeasy kit (Qiagen SA). For the cDNA synthesis, 1 μg of total RNA was used for the reverse transcription using reverse trancriptase (Life Technologies, Applied Bioscience).
All cDNAs were measured by real-time PCR. Quantitative PCR was performed in an ABI prism 7,900 using a SYBR Green. The β-actin gene served as the internal control for normalization of data. All experiments are done in at least duplicate.
The RNeasy plant mini kit (Qiagen) was used to extract total RNAs from 2-week-old WT and dHax3.SRDX over-expression lines that were grown on MS agar plates and treated with 100 μM ABA. The total RNA samples were pretreated with RNase-free DNase I and cleaned using the Plant Total RNA isolation kit (Qiagen). Probe preparation, hybridization to the Arabidopsis ATH1 Gene expression arrays (Agilent Technologies), and subsequent processing steps were performed according to the manufacturer’s procedures. Raw signals were extracted from the scanned images; the background was subtracted and the data were normalized using the Feature Extraction Software (10.7.1.1; Agilent technologies). These signals were then log 2 transformed and subjected to percentile shift-based normalization and median-based baseline transformation using Genespring GX (Agilent Technologies). WT (C24 RD29A::LUC) plant values were used as the baseline to calculate the intensity ratio/fold changes of the dHax3.SRDX plant lines. False discovery rates (FDRs) were calculated by significance analysis of microarrays algorithm, and genes with fold change of at least 2 and a FDR-corrected P value lower than 0.05 were identified.
We thank Thomas Lahaye for providing the Bs3::uidA binary construct. We also thank Jose M. Pardo and Ray Bressan for helpful discussions and members of the genome-engineering group at KAUST for their feedback and technical assistance.
This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.