Introduction

With the advent of post-genome era, the whole genome sequences of many plant species have now become available (http://en.wikipedia.org/wiki/List_of_sequenced_plant_genomes; Feuillet et al. 2011; Bevan and Uauy 2013). Next and urgent tasks will be the functional verification of a large number of genes to accelerate plant biology study and crop genetic improvements. Disruption of a gene function using reverse genetics is a very important and relatively simple way to analyze gene functions and achieve crop trait improvements (Page and Grossniklaus 2002; Kumar et al. 2012). Recently, a plethora of creative ways has been devised to disrupt gene functions and screen mutations that are useful for gene functional analyses or for crop trait improvements (Yin and Zhang 2010). Of these various tools, RNA interference (RNAi) (Fire et al. 1998) has been employed as a universal platform for gene silencing in various organisms including plants (Waterhouse and Helliwell 2003; Shafran et al. 2008; Bellés 2010). Reverse genetics through RNAi is unique in its ability to directly correlate the target genes with their functions by loss-of-function. Verification of a gene function is difficult with conventional approaches such as biochemical assays and use of mutants. In particular, gene functional redundancy is an acute problem for functional verification using single gene mutation lines. In addition to plant biology studies, RNAi plays important and unique roles in crop genetic improvements (Auer and Frederick 2009; Baykal and Zhang 2010). RNA interference may achieve more significant crop improvements than conventional breeding, as illustrated by various important crop species with different improved traits (Liu et al. 2002; Ogita et al. 2003; Gilissen et al. 2005; Le et al. 2006; Bonfim et al. 2007; Shi et al. 2007; Dodo et al. 2008; Eady et al. 2008; Flores et al. 2008; Shimizu et al. 2009). Many of these studies show that the RNAi-mediated silencing is inheritable, alleviating long-standing concerns about the RNAi inheritability.

Classification, biogenesis, and functions of small regulatory RNAs: siRNA, miRNA, and tasiRNA

RNA interference is mediated by small RNAs—non-protein coding which can be categorized into two major classes: small interfering RNA (siRNA) and microRNA (miRNA) (Brodersen and Voinnet 2006; Jones-Rhoades et al. 2006; Vaucheret 2006; Siomi and Siomi 2009). In plants, siRNAs originate from invading viruses, transgene inverted repeats, centromeres, transposons, and other repetitive sequences. The long double-stranded RNA (dsRNA) from a transgene construct is a common entry transcript in siRNA pathway. There are three key steps in the siRNA biogenesis in plants. First, dsRNAs are diced into short (21–24 bp) siRNAs by RNAse-III Dicer-like 3 or 4. The RNA-dependent RNA polymerase (RDR) is required to generate the long double-stranded RNAs if the dsRNAs are derived from single-stranded RNA. Second, the siRNA duplex is unwound and one strand is preferentially loaded to RNA-induced silencing complex (RISC). Third, the RISC finds its transcript targets, and the loaded single-stranded siRNA (guide strand) directs the sequence-specific cleavage or translational repression of the target transcript through the Argonaute protein (AGO) in the RISC. The siRNA is thought to be an RNA-based immune mechanism to fight off nucleic acid invaders or to maintain genome stability. In contrast, plant miRNAs are endogenous and originated from genome regions, normally intergenic regions or introns, as primary precursors (pri-miRNAs) transcribed by RNA polymerase II (Chen 2009). These precursors contain stem-loop structures, which harbor 21-nucleotide (nt) miRNA in 5′ or 3′ half of the stem. RNAse-III Dicer-like 1 (DCL1) then generates 21-nt miRNA/miRNA* duplex. Subsequently, the guide strand miRNA is preferentially loaded onto the AGO in the RISC while the miRNA* (passenger strand) is destroyed. The miRNA then directs the AGO in the RISC to cleave or translationally repress mRNA targets. The roles of miRNAs are to regulate plant developments and various physiological responses. Regardless of siRNA or miRNA/miRNA* duplex, they are all first 3′-end methylated by HUA ENHANCER 1 (HEN1) and the preferential loading of the guide strand is determined by thermodynamic asymmetry along the siRNA or miRNA/miRNA* duplex. The characteristic thermodynamic unstable 5′ end of the siRNA or miRNA strand allows this molecule to be loaded onto the RISC. Recently, plant siRNA- or miRNA-mediated translational repression of mRNA targets was found to be broadly applied (Brodersen et al. 2008). Intriguingly and importantly, the sequence complementarity between plant siRNA and miRNA, and its mRNA targets do not determine cleavage or translational repression of mRNA targets. This is in sharp contrast to animal models in which a complete or near-complete sequence complementarity causes mRNA target cleavage, while a low degree of complementarity causes translational inhibition of mRNA target. The mechanism that dictates cleavage or translational inhibition of mRNA targets in plants is still poorly understood.

One subclass of small regulatory RNAs is the trans-acting small interfering RNA (tasiRNA), which silences its mRNA target in trans (Chapman and Carrington 2007). The tasiRNA transcripts are generated from non-protein coding locus of plant genome region, i.e., trans-acting siRNA locus (TAS). To date four TAS loci have been found in plants: TAS1, TAS2, TAS3, and TAS4 (Peragine et al. 2004; Vazquez et al. 2004; Allen et al. 2005; Gasciolli et al. 2005; Xie et al. 2005; Yoshikawa et al. 2005; Rajagopalan et al. 2006). Of these, TAS1, TAS2 and TAS4 are found only in Arabidopsis thaliana: TAS1 and TAS2 each contains miR173 complementary site whereas TAS4 carries miR828-binding site (close to the 5′-end). TAS3 is universal in the plant kingdom and contains two miR390 complementary sites, one close to 5′ end, and the other close to 3′ end. These TAS miRNA-binding sites function as trigger sequences for endogenous miR173, miR390 or miR828 to trigger the degradation of the TAS transcript, producing phased 21-nt siRNAs. In TAS1, TAS2, and TAS4, the cleavage sites for generating phased siRNAs are located downstream of miRNA complementary sites, whereas in TAS3 the phased siRNAs are generated from the upstream sequence of the 3′-end miR390 complementary site. The Arabidopsis TAS1 has three family members (TAS1a, TAS1b, and TAS1c). Thus far no additional family members have been found in TAS2, TAS3, or TAS4.

Figure 1 illustrates small regulatory RNA pathways including dsRNA, miRNA, and tasiRNA pathways.

Fig. 1
figure 1

Small regulatory RNA pathways. a Double-stranded (ds) RNA pathway. A long dsRNA with perfect complementary repeats from a transgene construct is diced, by DCL4, to 21-nt siRNA, which is then methylated by HEN1. One of the 21-nt siRNA strands is uploaded to AGO1 in RISC and guides sequence-specific degradation of target RNA. Note: 24-nt siRNA-mediated methylation pathway is not shown. b miRNA pathway. Pri-miRNA transcript of stem-loop structure from a genomic locus is produced through RNA Pol II. Mature miRNA is then generated from pri-mRNA by concerted action of DCL1 and HYL1 and further methylated by HEN1. The mature and methylated miRNA is then exported out of the nucleus, mediated by HASTY, and uploaded to AGO1 in RISC to guide sequence-specific degradation or translational repression of target transcript, or DNA methylation. c tasiRNA pathway. Pri-tasiRNA transcript is first transcribed from genomic TAS locus, cleaved by miRNA through AGO1, double-stranded through cooperative activity of SGS3 and RDR6, diced to 21-nt siRNA, and methylated by HEN1. The 21-nt siRNA then guides sequence-specific degradation of target transcript. AtasiRNA enables to silence several different target mRNAs simultaneously. Note: nucleotide bonds of different colors indicate different sequences of either siRNAs, miRNAs, or target mRNAs

Artificial tasiRNA: an emerging tool for plant biology studies and crop genetic improvements

The silencing induced by the dsRNA of a transgene construct has several significant limitations. In a dsRNA construct, each inverted repeat (IR) that forms dsRNA normally needs to be 300–800 bp long for effective RNAi (Watson et al. 2005). Such a dsRNA can produce many different siRNA species and a much higher chance of causing “off-target” effect (silencing of unintended gene) (Jackson et al. 2003) or “transitive silencing” (spreading of silencing to regions outside the inducer sequence) (Baulcombe 2007). Both can cause epigenetic modifications of cells in untended tissues due to the siRNA mobility, even if a transgene expression is controlled in a tissue-specific manner (Dunoyer et al. 2010; Martienssen 2010; Molnar et al. 2010). It is also difficult for dsRNA to silence a specific gene family member because of the high degree of sequence similarities. The dsRNA is also challenging for stacking multiple transgene cassettes in a complex metabolic engineering study or crop trait stacking. Such a study not only faces technical challenges in cloning multiple transgene cassettes, but is also restricted by promoter choices to avoid promoter homology-dependent gene silencing (Meyer and Saedler 1996). In addition, because the effect of long inverted repeat is difficult to predict and control owing to the generation of multiple siRNAs species with varying 3′- and 5′-ends, it is difficult to control the degrees of silencing of target genes through dsRNA.

Recently, artificial miRNA (amiRNA) and tasiRNA (atasiRNA) technologies have been developed (Baykal and Zhang 2010; Baulcombe et al. 2006; Schwab et al. 2006; de la Luz et al. 2008; Ossowski et al. 2008). Both of these silencing technologies possess intrinsic advantages over dsRNA-induced silencing approach and overcome limitations including non-specificity of dsRNA. Artificial miRNA and atasiRNA utilize miRNA (19–24nt) and siRNA (21nt) which are complementary to their target sequences and are, therefore, highly sequence-specific and can minimize off-target effect and silence genes sharing a high degree of homology. These short RNAs can be engineered to carry 3′-end mismatches that minimize the transitive silencing due to avoiding RDR activity (Baulcombe 2007; Voinnet 2008). Desirable and varying degrees of silencing of target mRNAs could be achieved by creating different numbers of mismatches (e.g., 1–3). By enabling highly transcript-specific knockdown and thus more conclusive results, or by avoiding lethality without generating a relatively large number of RNAi lines, these advanced features will make functional tests of target genes available to plant scientists. Finally, these two new technologies, i.e., amiRNA and atasiRNA, make it possible to use a single promoter to drive a single transgene cassette (avoiding use of multiple expression cassettes) to carry different amiRNA or atasiRNA species to simultaneously silence multiple genes of either same or different pathways for complex metabolic engineering. This is difficult to achieve using dsRNA construct.

However, the design of amiRNA remains complicated because the pre-miRNA secondary structure in amiRNA expression cassette is highly sensitive to a change in the 19- to 24-nt miRNA sequence and thus it is hard to predict the effect of designed miRNA on the pre-miRNA structure and silencing efficiency. Also, cloning of multiple pre-miRNAs into the amiRNA cassette in a construct could be technically challenging and the resulting long-stranded cassette may cause undesirable impacts on amiRNA processing and functions. However, atasiRNA technology could overcome the limitations of the amiRNA approach (Baulcombe et al. 2006; de la Luz et al. 2008). A single copy of atasiRNA carried by a TAS locus will silence the mRNA target (de la Luz et al. 2008). And a single TAS locus can be engineered to contain either multiple atasiRNAs that target different sites of the same gene to enhance silencing efficiency or carry multiple atasiRNAs (each specific to a target mRNA) to simultaneously silence multiple genes in a complex metabolic engineering feat (Baulcombe et al. 2006; de la Luz et al. 2008). Using single TAS locus has a significant advantage over amiRNA technology—as single TAS locus avoids technical challenges in cloning and minimizes possible undesirable impact of long-stranded multiple pre-miRNAs. Moreover, a TAS locus (e.g., TAS1) can be co-introduced and coexpressed with miRNA trigger (e.g., miR173) into any plant species for either constitutive or inducible silencing of target genes (Baulcombe et al. 2006). From the regulatory point of view, using a minimal number of transgene cassettes via atasiRNA strategy in a genetically modified (GM) crop will ease regulatory work and save cost. Figure 2 depicts principles of atasiRNA approach.

Fig. 2
figure 2

atasiRNA approach using TAS1a as an example. Transgene expression cassette is composed of two expression units for overexpressing miR173 and TAS1a, respectively. Each unit has its own promoter and terminator. The miR173 cassette generates miR173 precursor which is then processed to produce mature miR173 that binds to miR173 trigger sequence (miR173ts) at 5′ region of the TAS1a transcript. Such a binding triggers TAS1a cleavage in phase, producing phased atasiRNAs. The 21-nt atasiRNA in each phasing position (bracket) can be designed as either siRNA or miRNA. For cassette construction, single atasiRNA can be built into the TAS1a locus using overlapping PCR while TAS1a carrying multiple atasiRNAs can be synthesized. Each atasiRNA can bind to an endogenous transcript, causing either transcript cleavage or translational repression of the target transcript. The scheme shows transcript cleavage only. Note: Pro and Ter: promoter and terminator, respectively. Different colors in promoters and terminators indicate different sequences whereas the numbers mean different phasing positions in TAS1a or target transcripts of different sequences. Black dotted lines suggest degradation of more target transcripts. A similar design can be applied to employing TAS1b, TAS1c and TAS2 for atasiRNA platform

Very recently, microRNA-induced gene silencing (MIGS) approach has been developed (Felippes et al. 2012). This new silencing platform is essentially derived from the tasiRNA pathway using miRNA173 as a trigger to generate phased siRNA (Fig. 3). However, each sequence used to silence target mRNA is fused with miRNA173 trigger at 5′ end and can be over hundreds of nucleotides long. Such a long silencing sequence will further ensure efficiency. Nevertheless, MIGS retains the limitation of the dsRNA platform, that is, the high chance of causing off-target effect and transitivity.

Fig. 3
figure 3

microRNA-induced gene silencing platform. A transgene cassette includes a single promoter and terminator driving multiple expression units. Each unit carries a miR173 trigger sequence (miR173ts) fused with a partial cDNA at downstream. Each cDNA silences a specific target transcript. These multiple units generate a long transcript including miR173ts and complementary transcript to each cDNA. Binding of mature miR173 to the miR173ts triggers production of phased 21-nt siRNAs. The siRNAs derived from each partial cDNA will cleave plant endogenous complementary transcripts. Note: Pro and Ter: promoter and terminator, respectively. Different colors indicate different nucleotide sequences

Success in atasiRNA necessitates accurate design of mRNA target sites

Because of its intrinsic advantages, atasiRNA is an emerging tool for highly specific and effective gene silencing in plant biology studies and crop improvements. However, it is critical to be able to design the 21-nt atasiRNA accurately to effectively silence mRNA targets. Because a 21-nt atasiRNA binds to a very short stretch of target sequence, the most effective site of a target mRNA needs to be identified. Such a 21-nt atasiRNA is in contrast with the dsRNA from transgene IR which generates multiple siRNAs and thus binds to many different sites across the target mRNA. Although multiple atasiRNAs can be deployed within a TAS locus to bind different sites of an mRNA target to warrant a high degree of silencing, using a single atasiRNA remains more desirable because it provides more specificity and reproducibility of the silencing results (Tilesi et al. 2009). Importantly, a high success rate of atasiRNA prediction will minimize the number of atasiRNA target sites to be screened through in vivo tests in a model plant or in vitro transient assays to silence mRNA targets. Such “pilot” screens are currently necessary before the tasiRNA-expressing cassette can be actually introduced into a crop genome to warrant successful RNAi. Therefore, a precise atasiRNA design to minimize these pilot tests will make RNAi-mediated silencing study and crop trait improvements both cost-effective and time-saving.

Design of effective siRNAs for plant mRNA targets can be substantially improved by employing new approaches

Recently, significant progress has been made in the design and validation of siRNAs or miRNAs for mRNA targets. Of various design programs, WMD3 (http://wmd3.weigelworld.org) has been widely adapted to aid amiRNA designs to silence plant mRNA targets. In principle, this program can be also used to predict atasiRNAs to be carried by TAS locus to silence mRNA targets. However, there are significant limitations of this design program. It generates a long list of amiRNA candidates for each target gene and score candidates by sequence complementarity and hybridization energy with little consideration of mRNA structure and bounded-regulatory proteins, leading to unknown in vivo efficacy (Li et al. 2013).

Very recently, a new breakthrough has been made in validating effective miRNAs by employing epitope-tagged protein-based amiRNA screens (Li et al. 2013). In this improved approach, a DNA sequence encoding target mRNAs and its adjacent epitope-tagged protein are constitutively or inducibly coexpressed in a transient expression system such as protoplasts with amiRNA candidates targeting single or multiple genes. Because the design principles of amiRNA and atasiRNA candidates are very similar, this new approach can be readily employed to validate atasiRNA (unpublished results), substantially improving the effectiveness of the atasiRNA design and enhancing plant biology study and crop improvements.

atasiRNA remains a unique and effective tool as other technologies are emerging

Over the past 12 months, another new tool has emerged and is being further developed for precise genome editing, namely clustered regularly interspaced short palindromic repeats (CRISPR/Cas) (Li et al. 2014). This editing platform is from bacterial Type II CRISPR locus and employs non-coding RNA to recognize target genome sequence. One of the key advantages of this new platform is its ability for multiplex genome editing, which other precise genome editing approaches including zinc finger nuclease and transcription activator-like effector nuclease do not possess. This multiplex feature is in some way similar to atasiRNA but the former edits target genome DNA rather than transcript. However, a key advantage of atasiRNA is its ability to minimize off-target effect through the employment of 21-nt miRNA species within TAS locus (unpublished). In addition, different levels of target transcript knockdown can be achieved through atasiRNA which avoids lethality. These two features are significantly advantageous over the above approaches for plant biology study and field application.

Conclusion

We are entering a new era where new tools are being developed at an accelerated pace. These new tools enhance the analysis and utilization of enormous genomic information coming from post-genome research, particularly, the completion of whole genome sequences in many plant species. Of various tools, atasiRNA plays a uniquely important role in plant biology study and crop genetic improvements.