Introduction

RNA is a key molecule in life, and the understanding of its structure/function relationships is crucial in molecular biology and therefore has important implications in terms of human health [1]. However, we are as yet unable to accurately predict either the folding pathway or the active tertiary structure of RNA molecules from their primary sequences. Clearly, these two fundamental questions need to be addressed. A long-term goal is to be able to interpret an RNA sequence in terms of its functionally folded three-dimensional form. RNA molecules possess a hierarchical structure: the primary sequence determines the formation of the secondary structure elements, and the secondary structure in turn determines the tertiary folding [24]. RNA molecules fold sequentially from 5′ to 3′ using recurring stable submotifs [5, 6], and the folding intermediates tend to become increasingly stable as the tertiary interaction network progresses [79]. RNA can fold into multiple structures; however, only a single structure is usually functional. In order to fold correctly, the RNA must avoid the problem of folding into alternative, non-functional structures and kinetic traps [1013].

Ribozymes are interesting molecules to study when addressing these RNA questions because modifications in their structures are reflected in their catalytic properties. More precisely, small ribozymes are suitable for this task since considerable progress has been made in the determination of their structures [14, 15], and a versatile toolbox is available for their study. Here we present a review of the recent progress in the study of the tertiary structures, the folding pathways and the modulation of the small ribozymes derived from natural, self-cleaving, RNA motifs.

Small self-cleaving ribozymes

This group of natural ribozymes includes five RNA species that have been derived from self-cleaving sequences ranging from ~40 to 200 nucleotides in length and possessing various secondary structures. One subgroup includes three self-catalytic RNA motifs that are part of species belonging to the brotherhood of small, circular, self-replicating RNAs and are essential components of the rolling circle replication mechanism of these infectious RNAs. According to this mechanism, the circular monomer strands are replicated into linear multimeric strands of complementary polarity that then are self-cleaved, on the basis of the RNA motifs, into monomers that, following circularization, participate in the next round of replication [16]. More specifically, the hammerhead and hairpin RNA motifs were obtained from viroids and viroid-like satellite RNAs of plant origin [1722], while the HDV self-cleaving RNA motif was obtained from the hepatitis delta virus (HDV) that infects humans [2325]. The hammerhead motif has also been detected in eukaryote satellite DNA transcripts obtained from newt, schistosome and cricket [2628], while the HDV motif has also been detected in the human genome [29].

The members of the second subgroup of natural self-cleaving RNA motifs are found within the bodies of larger transcripts. For example, the VS self-cleaving motif was identified by accident in the Varkud satellite RNA, an abundant transcript from the circular mitochondrial Varkud satellite DNA found in numerous Neurospora species and in other simple eukaryotes [30]. In common with the above ribozymes, this self-catalytic motif, in vivo, processes a multimeric RNA into monomers [31]. Another example is the glmS self-cleaving motif that was identified, by computer analysis of the 5′ untranslated region (5′UTR) of the glucosamine-6-phosphate synthase (glmS) mRNA of certain gram-positive bacteria, to be a putative riboswitch [32]. In fact, upon the binding of glucosamine-6-phosphate (GlcN6P), the glmS motif has been shown to undergo a site-specific self-cleavage reaction making it the first, and as yet the only, natural allosteric ribozyme [33]. This cleavage dramatically decreases the half-life of glmS mRNA, and thus this ribozyme acts as a regulatory element controlling the glucosamine-6-phosphate synthase level [33].

All of these self-cleaving RNA motifs catalyze the cleavage of the RNA phosphodiester backbone through a transesterification reaction involving the attack of the vicinal 2′-hydroxyl group (2′-OH) on the scissile phosphate, yielding a 2′-3′-cyclic phosphate and a 5′-hydroxyl termini as products [34]. The mechanism involved is a bimolecular nucleophilic substitution (SN2) that is general acid–base catalyzed and requires in-line geometry between the 2′ oxygen, the phosphate and the 5′ oxygen. The catalytic strategies and active site organization at the atomic level for each of these ribozymes have been extensively characterized and do not fall under the scope of this work since these aspects have been recently been reviewed [1, 15]. Finally, the presence of divalent metal cations is important in RNA folding in order to obtain well-defined structures, as well as in the catalysis, by functioning either as a general acid or as a general base when coordinated with water. These self-cleaving RNA motifs are metalloenzymes under physiological conditions, but are also active in the absence of divalent metals under certain conditions such as in the presence of high monovalent metal ion concentrations or at low pH levels [35, 36].

HDV ribozyme

The self-cleaving RNA motif derived from HDV varies from 85 to 95 nucleotides in length depending on the sequence variant and the polarity. It has been possible to separate this self-catalytic sequence into two molecules, thereby creating a trans-acting system where one molecule, the ribozyme, possesses the catalytic properties required to cleave multiple copies of the other molecule, the substrate. This separation can be achieved in several ways, the most frequent being the removal of the junction between stems I and II (Fig. 1a) [25]. The development of trans-acting ribozymes, for HDV as well as all other self-catalytic RNA motifs, has permitted experiments leading to the identification of the important structural features of these motifs [24, 25, 37]. For example, direct mutagenesis experiments led to the identification of the crucial nucleotides in the catalytic core, as well as identifying essential base-paired positions [for examples see Refs. 37, 38]. It should be noted, however, that an unbiased in vitro selection revealed that, with the exception of the catalytic nucleotide (position 76), none of the bases were absolutely required for cleavage [39]. This work clearly revealed that the ribozyme supports more variability than was originally thought to be the case based on the isolation of natural sequences. In addition, it also revealed that higher cleavage levels were observed for sequences closely related to the natural ones, suggesting that the combination of all of the structural features is important for optimal cleavage activity.

Fig. 1
figure 1

Structure and folding pathway of HDV ribozyme. The ribozyme and substrate are denoted by Rz and S, respectively. The cleavage site is indicated by an arrow. a Secondary structure of a shortened trans-acting version of the wild-type antigenomic HDV ribozyme according to the notation of Leontis and Westhof [145]. The harmonized nomenclature using roman numbering for the stems is indicated as well as the previous one in parenthesis. The dotted line represents the junction that has been removed in order to generate a trans-acting version. b Crystal structure of the genomic version of HDV ribozyme [46]. The colors of the various domains are the same in both panels A and B. c Proposed folding pathway of HDV ribozyme according to Reymond et al. [8]. The substrate is shown in green and the products in red. Important nucleotides are indicated in the various intermediates

According to the experimentally well-supported pseudoknot model, the HDV ribozyme is composed of one stem (I or P1), one pseudoknot (II or P2), two stem-loops (stems III or P3 and IV or P4) and three single-stranded junctions (I/II, I/IV and IV/II). The structure depicted in Fig. 1a is derived from the HDV of antigenomic polarity. Previous nomenclature of the various regions and the harmonized one using roman numbers for the stems are indicated. It is composed of a 57-nucleotide ribozyme and an 11-nucleotide substrate. By using this design, the stem I is composed of one Wobble base pair followed by six Watson–Crick base pairs formed between the ribozyme and the substrate and becomes the recognition domain. Both the junction I/IV and the loop III are single-stranded in the initial folding steps, but are eventually involved in the formation of a second pseudoknot (I.I) [37, 40, 41]. This structure requires the presence of divalent cations such as calcium or magnesium in order to be formed [42]. Both X-ray diffraction analysis and nuclear magnetic resonance studies have provided high-resolution tertiary structures of the genomic HDV ribozyme [40, 43, 44]. Overall, these approaches have shown that the global shape of the ribozyme is dominated by two coaxial helices formed by the stacking of the stems I–I.I–IV and the stems II–III (Fig. 1b). The stems II and IV have a structural role and are located above and below the catalytic core, which is formed by a network of interactions located at the interface between the two helical stacks. This network relies heavily on the nucleotides’ identities, as well as on several of the 2′-OH groups of the ribose moieties [38, 45].

The crystal structure of the genomic HDV ribozyme also revealed that this catalytic RNA adopts a highly ordered structure [40, 43, 46], in agreement with the previously reported unusual properties of this motif. For example, the self-catalytic motif retains activity at temperatures as high as 80°C and in solutions containing up to 5 M urea [47, 48]. Moreover, it was shown that for some trans-acting HDV ribozyme variants the cleavage level was near 100%, suggesting that HDV ribozyme forms a homogeneous population of structures and is not prone to alternative structure formation [8]. This important characteristic has been fully exploited in order to decipher the folding pathway of the HDV ribozyme, by far the most complex folding pathway elucidated to date for a small ribozyme. Extensive work has been performed using a number of approaches, including in vitro selection and photo-induced cross linking, in order to identify a specific set of mutants able to halt the folding pathway at each stable intermediate [39, 49]. Briefly, after the recognition of the substrate by the ribozyme leading to the formation of the stem I, five subsequent conformational steps are required in order to form the catalytically active structure (Fig. 1c). The first conformational transition is the docking of the stem I within the catalytic core. A specific interaction between the substrate in the middle of the stem I, which is slightly unfolded at the position +4 from the cleavage site, and both the C22 and the U23 residues of the loop III are most likely responsible for this docking [50]. This conclusion is supported by the fact that replacing a weak base pair, such as UA, in the middle of the stem I by a stronger base pair, such as GC, is detrimental to the ribozyme’s activity [51]. The substitution of an A for a U in position 23 blocks stem I docking to the catalytic core [50]. The second conformational transition is the formation of the pseudoknot I.I, which includes two GC base pairs [37, 41]. Any mutation of the pseudoknot I.I reducing either its stability, or the coaxial stacking, is detrimental to the cleavage activity [52]. The third conformational step is the formation of the A-minor motif between the two consecutive adenosines of the junction IV/II and the minor groove of the stem III, more specifically with the two GC base pairs of the latter [40]. This is a key interaction for the positioning of the junction IV/II, which is otherwise quite flexible. Moreover, this conformational transition initiates the positioning of the catalytic cytosine (C76), which is also located within the junction IV/II. The replacement of the adenosines in positions 78 and 79 by uridines prevents any further progression along the folding pathway [8, 40]. The fourth and fifth conformational steps are the formation of the trefoil motif and a base pair switch at the bottom of the stem II, respectively [40, 53, 54]. These two steps most likely occur simultaneously, although an order has been suggested based on physico-chemical analyses [8]. The trefoil turn is a particular motif identified by X-ray crystallography involving the catalytically active cytosine and its two flanking nucleotides [40]. This motif is induced by the formation of the A-minor motif and is proposed to contribute relaxing the phosphodiester backbone of the junction IV/II, thereby positioning the cytosine deep inside the catalytic centre [54]. Altering the trefoil turn motif by deleting the uridine in position 77 results in a ribozyme completely deprived of catalytic activity [53]. The other conformation transition, the base pair switch, takes place at the bottom of the stem II. It consists of switching the C19–G81 base pair of the stem II to a new C19–G80 base pair and also bulging out the G81 residue [53]. The replacement of the guanosine in position 80 by a cytosine prevents the adoption of the ribozyme–substrate transition complex. This base pair switch is also induced by the formation of the A-minor motif. Upon completion of the base pair switch, the geometry at the catalytic site is correct, and the chemical reaction occurs. The nucleotide involved in this chemical step is the catalytically active cytosine C76, which plays the role of a general base [55, 56]. Any mutation of this cytosine results in the complete loss of cleavage activity. When the transesterification reaction occurs, it is simultaneously accompanied by the products’ release. However, under certain conditions, it has been shown that the 3′-product might remain associated with the ribozyme, thereby causing product inhibition.

In summary, each of these conformational steps is separated by the formation of specific interactions. Several mutants have been designed in order to block the folding pathway at stable intermediates by disrupting these interactions. These mutants have been used in an isothermal titration calorimetry study that yielded the complete thermodynamic characterization of the HDV folding pathway [8]. This folding pathway is enthalpy driven, and the formation of additional interactions, such as the stacking in the coaxial helices, is responsible for the stability of this complex structure. The formation of the pseudoknot I.I has been shown to be the limiting step in the molecular mechanism of the HDV ribozyme [8, 46]. At the beginning of the folding pathway, the loop III and junction I/IV are located relatively far from each other and have a significant amount of freedom. The formation of the pseudoknot I.I is required in order to bring together these two single-stranded regions, thereby trapping the substrate within the catalytic core [49]. This requires an important entropic loss, and the driving force of this step is most likely the formation of two GC base pairs and the coaxial stacking of the helices I–I.I–IV.

Three different strategies have been used to modify activity of the HDV ribozyme. The first one is based on the fact that, due to its self-cleaving origin, the HDV ribozyme suffers from a lack of substrate specificity when used in trans as a molecular tool. Its substrate specificity depends on the formation of stem I that contains only seven base pairs (Fig. 1a), while a total of 15–16 base pairs has been estimated to be required in order to ensure the targeting of a unique RNA species from the human transcriptome [57]. This is the reason why the past interest in this ribozyme was only moderate. In order to overcome this hurdle a module named the SOFA (Specific On/ofF Adaptor) was engineered for the HDV ribozyme (Fig. 2a) [58]. The SOFA adaptor switches the cleavage activity from “off” to “on” state solely in the presence of its cognate substrate. This modulation acts at two different levels. The substrate binding site forms a short duplex with an inserted sequence element (the blocker); this increases the energetic barrier for non-specific base-pairing interactions with the substrate binding site, thus reducing the potential for off-target cleavages. A second inserted sequence element (the biosensor) extends base-pairing with the substrate to favor binding of the genuine substrate, and formation of this duplex concomitantly results in disruption of the short duplex involving the blocker sequence. Analysis of the cleavage activity using a large collection of substrates and SOFA-ribozyme mutants provided evidence as to the roles of each domain and gave hints for design optimization [59]. The proof-of-concept of this adaptor was demonstrated both in vitro and in vivo using several SOFA-ribozymes that cleaved various RNA transcripts [58, 60, 61]. Several alternative versions of the SOFA module have been reported for the HDV ribozyme, and it has been adapted to various ribozymes including both the hammerhead and the hairpin ribozymes [62].

Fig. 2
figure 2

Various modulated versions of the HDV ribozyme. The inactive and active states are indicated by “off” and “on”, respectively. The substrate is shown in green, and the portion of the ribozyme responsible for the modulation in blue. a The SOFA module [58]. BL and BS indicate blocker and biosensor, respectively. b The G-quartzyme [64]. c The theophylline aptamer attached by a communication module [65]

Two other ways by which the HDV ribozyme’s activity can be modified involve the use of allosteric modules adapted to the HDV ribozyme. The first involves using an aptamer known to fold into an unstable hairpin structure in the absence of potassium and into a G-quadruplex structure upon the addition of potassium [63]. A rationally designed HDV ribozyme in which most of the stem II was replaced by this aptamer lost its cleavage activity in the absence of potassium (Fig. 2b) [64]. Further structural characterization led to the proposal that the 5′-strand of the aptamer was interacting with loop III, inactivating the HDV ribozyme. The addition of potassium promotes the formation of the G-quadruplex structure involving both strands of the aptamer, releasing the catalytic core of the ribozyme and stabilizing it by the formation of the stem II. The resulting RNA species represents a new class of ribozyme that exhibits a monovalent cation-dependent activity. The second method involves replacing most of the stem IV by a randomized region followed by a theophylline aptamer previously developed by in vitro selection (Fig. 2c) [65]. A negative selection was performed so as to remove all the sequences that cleave in the absence of theophylline. A positive selection was then performed in order to enrich the pool of sequences that are active in the presence of theophylline, thereby selecting a theophylline-dependent allosteric HDV ribozyme.

Hammerhead ribozyme

The hammerhead ribozyme is undoubtedly the most studied catalytic RNA so far. It was the first to be discovered [18, 66] and to have its crystal structure determined [67, 68]. It is famous for the dilemma surrounding the poor correlation that exists between the biochemical data and the crystal structures of minimal versions [6971]. The hammerhead ribozyme adopts a “Y-shape” consisting of three helical stems of variable sequence and length named stems I–III (Fig. 3a). In the center of the three-way junction of the Y, 11 highly conserved nucleotides form the catalytic core that includes the cleavage site located 3′ of an important single-stranded cytosine residue [17]. A recent X-ray crystal structure of a larger version derived from Schistosoma mansoni underlines the importance of two peripheral elements, specifically a hairpin loop located at the end of stem II and a bulge located in the longer stem I (Fig. 3b) [72]. Interaction of these two motifs in a tertiary manner increases the cleavage rate by at least 50-fold [73]. Most, if not all, natural hammerhead self-cleaving motifs appear to contain similar interactions between loops or bulges located in stems I and II [74, 75]. Moreover, this new crystal structure reconciled older chemical data and provided a basis to hypothesize how the transition state might be achieved [76]. In this transition state an interaction network implicating key nucleotides (G8, A9, G12, U16.1 and C17) positions the 2′-hydroxyl nucleophile, the scissile phosphate and the 5′-oxygen leaving group at an angle very close to 180° [72].

Fig. 3
figure 3

Structure and folding pathway of the hammerhead ribozyme. The ribozyme and substrate are denoted by Rz and S, respectively. The cleavage site is indicated by an arrow. a Secondary structure of a trans-acting version of the hammerhead ribozyme [72] according to the notation of Leontis and Westhof [145]. The corresponding minimal version is obtained by removing the shaded region at the top. The tertiary interactions between the loop of stem II and the bulge of stem I are illustrated by the thin curved lines. b Crystal structure of the Schistosome version of hammerhead ribozyme [72]. The colors of the various domains are the same in panels A and B. c Hammerhead ribozyme’s folding pathway. The substrate is shown in green, and the products in red

Trans-acting hammerhead ribozymes have been extensively used to develop gene-inactivation systems either for functional genomic applications or for therapeutic aims [77, 78]. Several trans-acting versions have been produced by cleaving the RNA strand at various locations within the loops, with the most common involving strand breaks in loops I and III. In such trans-acting versions, the initial step of the folding pathway is the formation of the ribozyme–substrate complex via assembly of both stems I and III. Then the tertiary interaction network is established, which enables formation of the transition state for cleavage (Fig. 3c). Depending on the length and the base composition of the stems formed between the substrate and the ribozyme, product release inhibition has been observed.

The hammerhead ribozyme is also by far the most studied RNA molecule in terms of the modulation of the catalytic activity, either directly or by the addition of motifs leading to the production of allosteric versions. Several compounds were reported to either positively or negatively modulate the hammerhead cleavage activity [7981]. For example, in the case of antibiotics, usually inhibitory effects were observed. This is nicely exemplified by the effect of neomycin on hammerhead activity [79]. In this case, it has been proposed that neomycin interacts directly with the ribozyme–substrate complex, stabilizing the ground state over the transition state and thereby inhibiting cleavage. Other antibiotics, such as chlorotetracycline, either stimulate or inhibit depending on the reaction conditions [80]. However, in general the mechanisms by which antibiotics affect the cleavage activity remain elusive [79]. Another interesting mechanism has been reported in a study of metal ion inhibition of the hammerhead cleavage. For example, terbium ions tightly bind to the ribozyme and subsequently displace an adjacent magnesium cation important for the catalytic activity [82]. In the case of cobalt hexammine, a study revealed that it binds to the hammerhead ribozyme and provokes a conformational change that produces an inactive structure [83]. Finally, several proteins and peptides with RNA chaperone activities have been shown to positively modulate hammerhead ribozyme activity [8489]. For example, the p7 nucleocapsid protein of HIV has been demonstrated to resolve misfolded ribozyme–substrate complexes [84]. Other proteins, such as hnRNP A1, enhance the binding rate of the substrate and significantly accelerate product release, thereby increasing ribozyme turnover [85].

Since Watson–Crick base pairs are predictable, several strategies directed towards controlling hammerhead activity were based on the use of either additional sequences or complementary oligonucleotides. For example, in the case of modulation through additional sequences, the SOFA module developed for the HDV ribozyme was adapted to the hammerhead ribozyme [62] (Fig. 4a). The blocker was inserted at the 5′ end of stem I and interacts with the 3′ arm of stem III, thereby blocking the ribozyme in an “off” state. The biosensor complementary to the 5′ region of the substrate was added to the 3′ end of stem III. Substrate annealing to the biosensor then promotes the displacement of the blocker, yielding a ribozyme in an “on” state. In the case of modulation through complementary oligonucleotides, facilitator oligonucleotides that interact with regions of the substrate outside of the binding site of the ribozyme were shown to modify the ribozyme’s kinetic parameters. Positive effects are seen when the facilitator binds the substrate 3′ of the ribozyme, while negative ones are observed when it binds 5′ of the ribozyme [90]. Alternatively, other strategies involving trans-acting oligonucleotides that interact with the ribozyme were designed in order to modulate the cleavage activity (Fig. 4b–d) [9194].

Fig. 4
figure 4

Various modulation strategies for the hammerhead ribozyme based on complementary sequences. The inactive and active states are indicated by “off” and “on”, respectively. The substrate is shown in green and the portion of the ribozyme responsible for the modulation in blue. The trans-acting oligonucleotides are represented by the dotted lines. a The SOFA module [62]. BL and BS indicate blocker and biosensor, respectively. b Three-strand ribozyme strategy [91]. c Blocker release strategy [93]. d Stem II stabilization strategy [92]

Over the years, rational designs and in vitro selection strategies have been used for the development of a wide repertoire of allosteric hammerhead ribozymes (also named aptazymes). The ribozyme activity is switched from inactive to active, or vice versa, by the binding of an effector to an adjacent aptamer domain (Table 1, upper portion) [95106]. The general strategy involves using aptamers previously reported to be specific for a compound and then inserting them in one of the structural stems of the hammerhead ribozyme. Insertion in either stem I or III generally regulates the substrate binding step, while insertion in stem II usually either impairs the formation or decreases the stability of this helical region critical for the catalytic activity. Aptamers can be specific to various small components and proteins, including adenosine-5′-triphosphate (ATP), theophylline, flavin mononucleotide (FMN) and the HIV Tat protein, to name only a few examples, and have been reported to regulate hammerhead catalytic activity [95100, 104]. Interestingly, two allosteric ribozymes using the insertion of the same ATP aptamer within stem II have been demonstrated to regulate the catalysis in different ways [95, 96]. The only difference between the two was in the linker domain located between the aptamer domain and the catalytic core. In one construct this communication module is composed of 4 base pairs that form a proper stem II in the absence of ATP, leading to negative modulation [95, 96]. In the second construct, the presence of a linker composed of 14 bases including only 3 base pairs prevents the formation of stem II and yields an inactive ribozyme in the absence of ATP, leading to positive modulation [95]. Clearly, this illustrates the importance of the communication module in the allosteric behavior of a ribozyme. Moreover, cooperativity between the binding of two different effectors has also been reported [100]. In this case, FMN and theophylline aptamers have been juxtaposed within stem II and sequential binding of both effectors is required in order to activate the ribozyme.

Table 1 Allosteric hammerhead ribozymes

Several allosteric hammerhead ribozymes have been isolated by in vitro selection (see Table 1, lower portion) [107114]. Generally, in terms of allosteric ribozymes, the initial rounds are performed using a negative selection process that removes the sequences that do not require the presence of the effector for cleavage to occur, followed by subsequent rounds using a positive selection in the presence of the effector. For most of the reported allosteric hammerhead ribozymes, the region including the randomized positions was introduced in stem II. In vitro selection yielded allosteric domains that exhibit a very high level of specificity for their effectors, even discriminating between closely related compounds such as cAMP, cCMP and cGMP [107, 108]. Alternatively, the reselection of an allosteric hammerhead ribozyme with an improved discrimination power for a given effector after mutagenesis of the original aptamer domain has also been reported [110]. This type of strategy has been used to isolate ribozymes whose activities are modulated by photosensitive compounds [105, 111], opening the way to in vivo light-regulated gene expression through the activity of photochemically modulated hammerhead ribozymes.

Finally, as is the case with the rational design of allosteric ribozymes, the domain linking the aptamer and the catalytic core is very important, in terms of the modulation, for in vitro selected allosteric ribozymes. In vitro selection of a communication module connected to an FMN-aptamer has shown that in some cases the effector causes a positive modulation, while for other sequence variants it causes a negative regulation [113]. In a similar study, several theophylline-dependent allosteric hammerhead ribozymes harboring various sequences as communication sequences have been isolated, illustrating the flexibility of this domain [110]. Furthermore, a communication module is not restricted to one specific aptamer or particular ribozyme. The same activating communication module selected for the FMN aptamer was used in a construction with both an ATP aptamer and a theophylline aptamer, causing a positive modulation of the activity in the presence of the ligand. In the same way, a communication module selected with the HDV ribozyme was also effective when transferred into a hammerhead ribozyme [65]. Clearly, the communication module is a key feature of allostreric ribozymes, permitting the transfer of properties from the aptamer domain to the catalytic core. The molecular mechanism leading to the modulation of in vitro selected hammerhead ribozymes remains largely unknown, mainly because the efforts to elucidate the structures of both the inactive and active conformation were limited. In many cases, the stabilization of stem II has been proposed to be required for the adoption of a catalytically active structure.

Hairpin ribozyme

In its naturally occurring form, the hairpin ribozyme adopts a four-way junction secondary structure [115, 116]. Both two-way and four-way junction trans-acting ribozymes have been derived and extensively studied (Fig. 5a) [117119]. In the minimal two-way junction version, the 49-nucleotide-long ribozyme is composed of four helices separated into two domains (A and B). Domain A comprises stems I and II (or H1 and H2) and an internal loop of eight nucleotides, while domain B comprises stems III and IV (or H3 and H4) and an internal loop of 16 nucleotides. These two domains dock into a non-coaxial orientation, and the interactions between the two internal loops form the catalytic core of the ribozyme (Fig. 5b) [120]. This catalytic core can be formed with a variety of domain-connecting setups; in fact, it can even be formed if the two domains are separated into two distinct RNA molecules [121, 122]. The cleavage site and almost all of the conserved nucleotides are located in these two internal loops [123]. The presence of a guanosine 3′ of the cleavage site is absolutely required as it is involved in stacking tertiary interactions within the internal loop of domain B, being a key feature of the hairpin catalytic core [124]. In contrast, a variety of mutations can be found in the base-paired regions as long as they support helix formation, the only requirement being the presence of a guanosine as the first nucleotide in the ribozyme strand of stem II [125].

Fig. 5
figure 5

Structure and folding pathway of the hairpin ribozyme. The ribozyme and substrate are denoted by Rz and S, respectively. The cleavage site is indicated by an arrow. a Secondary structure of a four-way junction trans-acting version of the hairpin ribozyme [116] according to the notation of Leontis and Westhof [145]. The corresponding minimal two-way version is obtained by removing the shaded region at the top. The tertiary interactions between the internal loops A and B are illustrated by the thin curved lines. The harmonized nomenclature using roman numbers for the stems is indicated as well as the previous one in parenthesis. b Crystal structure of the four-way junction version [116]). The colors of the various domains are the same in panels A and B. c Hairpin ribozyme’s folding pathway. The substrate is shown in green and the products in red

Folding of the hairpin ribozyme is a two-step folding pathway: after the binding of the substrate that results in the formation of domain A, the docking of the two internal loops caused by tertiary interactions forms the catalytic site (Fig. 5c). Once it is formed, the chemical step can then occur using in-line geometry. Interestingly, the ligation reaction is around tenfold more efficient than the cleavage reaction [126]. The hypothesis to explain this observation is that the docking of the two internal loops confers sufficient rigidity to the catalytic core that the optimal geometry is maintained and the highly reactive 2′–3′-cyclic phosphate can open and react with the 5′-hydroxyl [126].

At least four different ways of modulating the activity of the hairpin ribozyme have been succesfully used. First, in vitro selected adenine-dependent hairpin ribozymes have been developed; those require the metabolite adenine as a structural component to be able to form their catalytic cores (Fig. 6a) [127]. Second, the replacement of the short stem IV with a communicator module has permitted the introduction of a redox active aptamer into the ribozyme resulting in allosteric regulation of the ribozyme’s activity (Fig. 6b) [128]. Third, a SOFA version was developed in which a short blocker sequence forming a stem II equivalent was attached to the 3′end of the ribozyme, and a biosensor recognition sequence that is able to bind the substrate was added to the 5′end (Fig. 6c) [62]. As with the hammerhead ribozyme, the binding of the substrate to the biosensor allows the substrate to unfold the blocker and bind in its place, thereby increasing the specificity of the reaction. Finally, a modulation based on the presence of a trans-acting oligonucleotide has been achieved by introducing a short stem loop between stems II and III. When an oligonucleotide composed of either DNA or RNA complementary to this new stem loop is bound, domains A and B can no longer interact together and the catalytic activity is disrupted (Fig. 6d) [129, 130].

Fig. 6
figure 6

Various modulations of the activity of the hairpin ribozyme. The inactive and active states are indicated by “off” and “on”, respectively. The substrate is shown in green, and the portion of the ribozyme responsible for the modulation in blue. a The adenosine-dependant hairpin ribozyme [127]. b The FMN aptamer attached by a communication module [128]. c The SOFA module [62]. BL and BS indicate blocker and biosensor, respectively. d The oligonucleotide modulation [129]. The trans-acting oligonucleotide is represented by the dotted line

VS ribozyme

With a length of around 140 nucleotides in its trans-acting version, the VS ribozyme is the largest known small self-cleaving ribozyme. Its crystal structure has not yet been solved, although a recent small-angle X-ray scattering study gave a low-resolution structure, which was used to fit the ribozyme and to confirm its predicted overall shape [131]. The secondary structure of this ribozyme is formed by 5 helices, numbered from II to VI, that form two three-way junctions sharing stem III (Fig. 7a). The substrate is composed of stem I and, in contrast to other ribozymes, the VS ribozyme is able to rely exclusively on tertiary interactions in order to recognize its substrate [132]. The substrate appears to be docked between stems II and VI, in a cleft formed by the three-way junction involving stems II, III and VI, where it makes close interactions with the internal loop of stem VI (Fig. 7b). An adenosine (A756) in this highly conserved internal loop and a guanosine (G638) in the internal loop of the substrate have been shown to be important for catalysis and are probably directly implicated in the chemical reaction, leading to the hypothesis that these internal loops are in fact the ribozyme’s active site [133]. The cleavage occurs in the internal loop of stem I through a general acid–base catalyzed transesterification mechanism. This cleavage has been shown to be a consequence of a one nucleotide shift in the stable secondary structure of stem I [134], a secondary structure rearrangement that is stabilized by the formation of a kissing loop pseudoknot interaction. This pseudoknot is composed of three base pairs formed between the loops of stems I and V, and is required for the cleavage by VS ribozyme [135]. Hydroxyl radical footprinting showed that a solvent-inaccessible core is rapidly formed in this region [136]; the absence of water is supported by a previous kinetic study that reported pH variation has only a small effect on this catalytic RNA [137]. The folding of the VS ribozyme is ion-induced, and the presence of divalent ions is required, while the presence of monovalent ions further enhances the reaction rate [137].

Fig. 7
figure 7

Structure of the VS ribozyme. The ribozyme and substrate are denoted by Rz and S, respectively. The cleavage site is indicated by an arrow. a Secondary structure of a trans-acting version of the VS ribozyme according to the notation of Leontis and Westhof [145]. A shortened version can be obtained by removing the non-essential shaded region at the bottom. The tertiary interactions forming the kissing loop between loops I and V are illustrated by the thin curved lines. The dotted line represents the junction that has been removed in order to generate a trans-acting version. b Structure of the VS ribozyme obtained by SAXS [131]. The colors of the various domains are the same in panels A and B

While screening a collection of antibiotics looking for inhibitors of the VS ribozyme, a class of simple cyclic peptides was found to enhance ribozyme cleavage [138]. The presence of tuberactinomycin antibiotics and, especially, viomycin, in a trans-cleavage experiment results in a more extensive reaction and a decrease in the magnesium requirements for VS ribozyme cleavage. These antibiotics are able to interact with RNA and, in the case of the VS ribozyme, to restore the activity of some mutants, although the exact mechanism is not known.

Finally, a longer version of the natural VS ribozyme has a non-essential stem (called stem VII) that increases the stabilization between the 3′ end of the ribozyme and the 5′ end of its substrate (Fig. 7a). This version is particularly interesting in ligation assays since it permits substrate binding through a secondary structure [139].

GlmS ribozyme

The trans-acting version of the glmS ribozyme is around 120 nucleotides long and is composed of eight small stems, two of which are pseudoknots (Fig. 8a). The available crystal structures have shown that coaxial stacking of RNA helices defines the global tertiary structure of the glmS ribozyme, while the catalytic core is formed by an internal loop involved in the two pseudoknots (Fig. 8b) [140, 141]. The double pseudoknot catalytic core forms a rigid binding pocket that is able to recognize and use a variety of small hydroxylamines, such as ethanolamine or tris-hydroxymehylaminomethane, although only binding of glucosamine results in efficient cleavage [142]. The glmS ribozyme does not appear to undergo important conformational changes upon the binding of its cofactor glucosamine-6-phosphate (GlcN6P). When the cofactor GlcN6P is bound in the catalytic site through the recognition of the sugar moiety and of the phosphate [143], 80% of the solvent-accessible surface of the metabolite is buried, and the primary amine is well positioned for catalysis, either directly as a general acid, or as a general base through water molecules [140]. Metal ions are required for the formation of the precleavage structure, but do not appear to play any catalytic roles in glmS ribozyme activity [143].

Fig. 8
figure 8

Structure of the glmS ribozyme. The ribozyme and substrate are denoted by Rz and S, respectively. The cleavage site is indicated by an arrow. a Secondary structure of a trans-acting version of the glmS ribozyme according to the notation of Leontis and Westhof [145]. The harmonized nomenclature using roman numbers for the stems is indicated as well as the previous one in parentheses. b Crystal structure of the glmS ribozyme from Thermoanaerobacter tengcongensis [140]. The colors of the various domains are the same in panels A and B

Even if glmS ribozyme activity does not rely on conformational changes, its unique metabolite dependence implies an extremely interesting and as yet poorly understood folding pathway. The binding site for the GlcN6P metabolite is formed through a complex network of interactions that provides sufficient rigidity to the pocket so that it stays open, with the functional groups ready to accommodate the cofactor, quite a different scenario from the induced fit folding usually seen upon substrate binding with other RNA enzymes. Most of the research to date has focused on understanding the cleavage reaction, but very little is known about the folding steps leading to the cofactor-free structure. This ribozyme has only recently been discovered, and already some progress has been made toward the elucidation of its folding pathway. In addition, both the structural and functional versatility of this ribozyme has been examined by in vitro selection [144].

Concluding remarks

Studies on the natural small ribozymes have made key contributions to our understanding of the folding of RNA molecules, and have helped to identify structural features important for the formation of catalytically active structures. Although the accurate prediction of the secondary and tertiary structures of long RNA species is not yet possible, the current knowledge obtained from these small ribozymes has considerably improved our view of the folding pathways of RNA molecules in general in addition to providing opportunities for the development of a substantial number of methodologies for their study. Moreover, the discovery of other small, natural, self-catalytic ribozymes and riboswitches will widen the repertoire of this class of catalytic ribozyme and will most likely increase our ability to correctly predict their folding. The characterization of the interactions involved, combined with the various components modulating ribozyme activity, has paved the way for many developments in the area of the nanotechnology. Progress in comprehending how the various RNA motifs interplay together in order to control activity should lead to a better understanding of complex biological mechanisms such as the regulation of translation involving several RNA motifs that form the 5′ untranslated region of mRNA. From this point of view, RNA molecules can be considered as combinations of building blocks. The designing of complex ribozymes with several motifs modulating either together, or sequentially, the catalytic activity is expected to propel further progress in this direction.