Abstract
The discovery of introns over four decades ago revealed a new vision of genes and their interrupted arrangement. Throughout the years, it has appeared that introns play essential roles in the regulation of gene expression. Unique processing of excised introns through the formation of lariats suggests a widespread role for these molecules in the structure and function of cells. In addition to rapid destruction, these lariats may linger on in the nucleus or may even be exported to the cytoplasm, where they remain stable circular RNAs (circRNAs). Alternative splicing (AS) is a source of diversity in mature transcripts harboring retained introns (RI-mRNAs). Such RNAs may contain one or more entire retained intron(s) (RIs), but they may also have intron fragments resulting from sequential excision of smaller subfragments via recursive splicing (RS), which is characteristic of long introns. There are many potential fates of RI-mRNAs, including their downregulation via nuclear and cytoplasmic surveillance systems and the generation of new protein isoforms with potentially different functions. Various reports have linked the presence of such unprocessed transcripts in mammals to important roles in normal development and in disease-related conditions. In certain human neurological-neuromuscular disorders, including myotonic dystrophy type 2 (DM2), frontotemporal dementia/amyotrophic lateral sclerosis (FTD/ALS) and Duchenne muscular dystrophy (DMD), peculiar processing of long introns has been identified and is associated with their pathogenic effects. In this review, we discuss different mechanisms involved in the processing of introns during AS and the functions of these large sections of the genome in our biology.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Introns, as well as their splicing and retention were discovered in 1977 as a result of an interesting observation that the mRNA used to code for proteins was almost always shorter than the DNA from which it had been transcribed (Berget and Sharp 1977; Chow et al. 1977). The original publications described the discovery in adenovirus mRNAs, but it soon became clear that this mechanism was not solely a viral phenomenon since introns were also found in cellular genes from eukaryotic cells (Breathnach et al. 1977; Jeffreys and Flavell 1977; Mandel et al. 1978). Initially, genes that contained introns were called interrupted and considered to be noncoding; however, this notion has changed over time. The completed sequencing projects of the human genome and other organisms including D. melanogaster and C. elegans have confirmed that all eukaryotes have introns (Rogozin et al. 2012). However, different species harbor dramatically different density and length of introns, ranging from a few bps to hundreds kbps. Genes in higher eukaryotes such as mammals have a greater number of introns than those of lower eukaryotes such as yeast, Drosophila, and C. elegans (Nixon et al. 2002; Wu et al. 2013). The differences may partly be explained by the variability in modes of intron removal between these organisms. Several studies revealed that first introns (i.e., the 5ʹ most first introns of genes) are typically the longest and most conserved (Gaffney and Keightley 2004; Bieberstein et al. 2012). Conservation of the first intron is probably related to the presence of regulatory elements and a specific pattern of chromatin organization. In the human genome, exceptionally long introns are found in genes with a wide variety of cellular and developmental functions, including genes with important roles in some diseases, e.g., dystrophin in DMD and cystic fibrosis transmembrane conductance regulator (CFTR) in cystic fibrosis (CF) (Sterrantino et al. 2021). The number of introns in transcribed human RNAs ranges from zero in histones to over 75 introns in dystrophin, in which only 0.5% of the gene is comprised of exons. Similarly, CFTR is approximately 98% introns (Bonadia et al. 2014). Such vast stretches of the genome must have an important biological function, given the amount of energy expended by the cell in the synthesis and transcription of these seemingly useless pieces of DNA. The fact that all higher metazoan species have introns, and that the higher the organism is, the higher the proportion of introns, indicates their important biological role.
Removal of introns through editing of premature RNA (pre-mRNA) requires the presence of splice donor (5′ss) and splice acceptor (3′ss) sites, in addition to the branch point (BP) site located near the 3′ss. This process involves the formation of a lariat intermediate controlled by specific dinucleotide sequences that border the exon/intron junction, i.e., conserved GU and AG, respectively, at the 5′- and 3′-ends of each intron (Fig. 1A). In human genes, the BP site is a short motif comprised mostly of adenosine, and the sequence is located between 18 and 44 nt upstream of the 3’ss; however, some BPs can be more distant and are found up to 400 nt upstream of the 3’ end (Gooding et al. 2006; Gao et al. 2008; Mercer et al. 2015). Editing of the pre-mRNAs to release introns and produce a functional unit is confined within the nucleus. This process is mediated by the spliceosome, a large ribonucleoprotein (snRNP) complex composed of small nuclear RNAs (snRNAs) and a few dozen proteins. The first step involves cleavage of the precursor RNA (pre-RNA) at the 5′ss of the intron followed by the joining of the 5′ end to an adenosine unit embedded within a BP sequence via a 2ʹ,5ʹ-phosphodiester bond, and eventually, the intron undergoing processing only remains attached to the downstream exon. Its release in the form of the lariat results from the ligation of the 5′ and 3′ splice sites of exposed exons and the ligation process is facilitated by multiprotein Exon Junction Complex (EJC) that is deposited ~ 20–24 nucleotides upstream of exon-exon junctions during mRNA splicing (Joseph and Lai 2021). Released lariat RNA is subsequently debranched by the RNA lariat debranching (DBR1) enzyme. This leads to its linearization and rapid degradation by the intron turnover pathway (Fig. 1A). A variety of exceptions to this canonical pathway have been described, and experimental evidence has shown that some lariat RNAs may escape the debranching process. These escapees either persist longer in the nucleus or travel to the cytoplasm, where they remain as stable circular RNAs (Talhouarne and Gall 2014, 2018).
Transcriptome diversity in eukaryotes may occur through alternative splicing (AS) of pre-RNAs, through the use of alternative promoters and polyA signals and, to some extent, through nucleotide editing. Intron retention (IR), which was originally described in viruses is one type of alternative splicing event (ASE) and is characterized by the inclusion of one or more introns in mature mRNA (RI-mRNA). This class of ASE is distinct from other types in that the mature transcript harboring unspliced introns still contains a potentially spliceable unit (Sznajder et al. 2018; Monteuuis et al. 2019; Broseus and Ritchie 2020). Although IR events have been found to be an integral part of global alternative splicing in mammals, the mechanism and factors underlying IR remain poorly understood (Wong et al. 2013, 2016, 2017; Braunschweig et al. 2014; Pimentel et al. 2016; Edwards et al. 2016). It was proposed that IR controls mammalian gene expression via a global, bidirectional cross-talk mechanism, in which inaccurate transcription results in low levels of expression due to impaired recruitment of core splicing factors (Wong et al. 2017; Zhang et al. 2018). This in turn results in localized pausing of RNA Pol II, further intron retention, and ultimately turnover of the RI-mRNAs (Zhang et al. 2018). Intron retention regulates RNA stability and protein isoform production in various ways. When in the main open reading frame (ORF), IR may either maintain the frame to allow translation of IR transcripts, or it may interrupt the ORF by introducing a premature termination codon (PTC). Due to the frameshift, these translation-incompetent mRNAs are eventually turned over via the cytoplasmic surveillance machinery of nonsense-mediated decay (NMD) (Lewis et al. 2003; Weischenfeldt et al. 2012; Ge and Porse 2014). However, some RI-mRNAs with PTCs are translated into new protein isoforms. This phenomenon was first discovered in viruses which developed two pathways for nuclear export of mRNAs with retained introns that allow them to escape either the nuclear or cytoplasmic degradation machineries and subsequently permit such incompletely spliced mRNAs to be translated (Hadzopoulou-Cladaras et al. 1989; Grüter et al. 1998; Hammarskjöld et al. 1989). These mechanisms rely on the interactions of cis-acting elements in RI-mRNAs with nuclear export proteins. Although they were originally thought to be unique to viruses, it was subsequently shown that the mammalian nuclear export factor 1 (NXF1) gene itself contains a constitutive cis-acting transport element in its alternatively spliced mRNA with a retained intron (Li et al. 2006). Direct interaction of Nxf1 with this element enables export and translation of a short alternative Nxf1 protein (Li et al. 2016a, b). Additionally, IR may be a source of mRNA isoforms referred to as detained introns (DI), representing either nuclear end-products or stable intermediates, which are temporarily stored awaiting signals for splicing completion and nuclear export (Ninomiya et al. 2011; Boutz et al. 2015; Mauger et al. 2016). Nuclear retention of introns has also been implicated in the pathogenesis of human hereditary disorders, including myotonic dystrophy type 2 (DM2, CNBP locus with CCTGexp), Fuchs endothelial corneal dystrophy (FECD, TCF4 locus with CTGexp), and amyotrophic lateral sclerosis/ frontotemporal dementia (C9-ALS/FTD, C9orf72 locus with GGGGCCexp) (Sznajder et al. 2018). Peculiar processing of introns harboring these GC-rich microsatellites has been linked with their propensity to adopt secondary RNA structures which may limit recruitment of splicing machinery factors and eventually may perturb proper removal of the introns from their host transcripts (Mirkin 2007; Zhang and Ashizawa 2017).
During the canonical splicing of pre-RNA, an intron is removed as a single unit in a two-step reaction. However, some large introns (> 10 k nt) pose challenges during processing and are cut out by splicing of distinct segments in a process known as recursive splicing (RS) (Joseph et al. 2018). This multistep intraintronic splicing was first discovered in Drosophila and was later also observed in genes of the human transcriptome (Hatton et al. 1998; Burnette et al. 2005). Despite the fact that approximately half of human protein-coding genes have introns over 24 k nt long and contain motifs similar to Drosophila RS-sites, only a handful of recursively spliced introns was initially identified in humans, mostly in genes involved in brain development, despite the greater abundance of long introns in vertebrate genomes (Duff et al. 2015; Sibley et al. 2015). However, recent study has revealed that most introns of human genes are removed from pre-mRNAs in smaller pieces rather than spliced as whole units in one step reaction (Wan et al. 2021). These results have led to a model of stochastic splice site selection based on which unannotated splicing sites (internal RS sites) within introns are used frequently but randomly by the spliceosomes to make many cuts instead of a single cut to progressively remove an intron. This process results in the generation of transient splicing intermediates that are pools of final mRNAs. Little is known about the RS machinery involved in the processing of RS introns; however, it was proposed that it is kinetically coupled with RNA polymerase II (RNA Pol II) elongation. In mammalian cells, most RS events occur posttranscriptionally, and RS introns are removed by both recursive and canonical splicing. The choice of which splicing mechanism is used seems to be cell-type specific (Zhang et al. 2018).
In this review, we present various pathways of intron processing during alternative splicing of mammalian cells and tissues. We discuss the functions and fate of mature mRNAs harboring unspliced introns in physiological and pathological programs of global alternative splicing that mediate gene expression. Finally, we discuss the most recent data on stable circular intronic RNA in various vertebrates, for which biogenesis, biological functions and turnover remain to be investigated.
Aberrant processing of introns during global alternative splicing
The formation of fully functional mRNA involves the processing of a precursor transcript through three main modifications: 5′-capping, 3′-polyadenylation and splicing. The diversity of the transcriptome in eukaryotes results from alternative splicing of pre-RNAs. High-throughput transcriptome analyses have shown that AS affects approximately 95% of multiexonic genes in humans (Nilsen and Graveley 2010; Barbosa-Morais et al. 2012; Merkin et al. 2012). During AS, splice sites in primary transcripts are differentially utilized, making it possible for a single gene to produce multiple mRNA and protein isoforms. Through this mechanism of regulating gene expression, the information stored in the genes can be processed in a variety of ways (Brett et al. 2002; Roy et al. 2013). Alternative splicing is regulated by the complex interplay between cis- and trans-acting factors that serve to promote or repress the assembly of splicing complexes, referred to as spliceosomes. Tight control of AS has paramount importance, as evidenced by links between abnormal alternative splicing and human disorders, and over the past years, ASEs have been recognized as promising biomarkers for clinical diagnosis and targets for therapeutic intervention (Cooper et al. 2009; Dvinge and Bradley 2015; Jung et al. 2015). ASEs can be classified into different splicing patterns that include alternative cassette exons (i.e., exon inclusion or exclusion), mutually exclusive exons, retained introns and exitrons, alternative 5′ and 3′ splice sites, alternative promoters, and alternative poly-A sites (Fig. 1B). Exon inclusion and exclusion have long been recognized as the most frequent types of AS in animals and implicated in the control of diverse aspects of normal and disease biology. However, the development of RNA-Seq over the past decade has highlighted the importance of recognizing intron retention (Kalsotra and Cooper 2011; Braunschweig et al. 2014; Wong et al. 2016; Pimentel et al. 2016). Although IR events have been found to be an integral part of many mammalian physiological and pathological programs of the global alternative splicing-mediated regulation of gene expression, this type of ASE remains much better characterized in plants, fungi, and unicellular eukaryotes. In these organisms, IR is the major mechanism for the regulation of gene expression (Ner-Gaon et al. 2004; Syed et al. 2012).
Incompletely spliced transcripts may harbor one or more the entire introns but may also contain intron fragments resulting from piecewise processing of large introns of > 10 k nt via recursive splicing. Initially, such RI-mRNAs were considered to be nonproductive based on the RNA isoform being predestined for degradation rather than translated. However, in many cases, these products do not lack functionality and play important regulatory roles (Middleton et al. 2017; Jacob and Smith 2017; Schmitz et al. 2017). The fate of IR-harboring transcripts depends upon a number of factors, including the location of the IR event within the transcript. IR in the 3′-UTR can introduce cis-elements, affecting mRNA stability or translational efficiency; IR in the 5′-UTR can insert an upstream ORF (uORF) or other structural features that can activate or repress translational initiation efficiency (Tahmasebi et al. 2016). More commonly, intron retention that interrupts the main ORF may lead to the introduction of PTC. The detection and ultimate degradation of such transcripts is maintained through the cytoplasmic surveillance machinery, which downregulates gene expression via NMD (Lewis et al. 2003; McGlincy and Smith 2008; Weischenfeldt et al. 2012; Ge and Porse 2014; Lareau and Brenner 2015). Importantly, some RI-mRNAs with PTCs are translated into new protein isoforms (for more details, please see the section Intron Retention in Viruses). However, IR in the main ORF can also maintain the reading frame, allowing translation of IR transcripts to generate alternative protein isoforms with novel functions. IR may also generate mRNA isoforms referred to as detained introns representing the nuclear end-products and nuclear stable intermediates (Fig. 2). In general, the first group is allocated to exosome-mediated RNA turnover of the host transcripts, whereas the second group of RNA containing a potentially spliceable intron is temporarily stored, awaiting signals for splicing completion and is eventually exported. Transcripts with such introns are considered a reservoir for rapid induction of expression without the necessity for transcription initiation (Boutz et al. 2015; Mauger et al. 2016). In addition to the above IR events, an unusual subfamily of introns has been discovered inside annotated protein-coding exons of plant and human genomes (Marquez et al. 2012, 2015). Based on their exonic and intronic nature, they were termed exitrons (exonic introns, EIs). These EIs have all canonical core splicing signals (5′ and 3′ splice sites and branch points), but they do not contain stop codons. Although EIs were detected as a subgroup of retained introns, they are clearly distinguishable. Human and Arabidopsis exitrons share some features, i.e., (i) weaker splice sites and higher GC content in comparison to RIs; (ii) size that is closer to exons than to RIs; and (iii) EI-containing exons are considerably longer than other exons (Marquez et al. 2015). EI removal or retention through alternative splicing results in transcripts with different fates. Since exitrons are protein-coding sequences, they have the potential to increase proteome complexity and add to phenotypic diversity via AS. In plant and nonplant species, EI-containing isoforms are exported to the cytoplasm, associate with ribosomes and are translated in contrast to IR transcripts that are often retained in the nucleus and are not translated (Yap et al. 2012; Shalgi et al. 2014).
A number of research groups have used various approaches for monitoring the extent to which RNA isoforms of AS associate with ribosomes and whether the isoforms are differentially translated (Sterne-Weiler et al. 2013; Shalgi et al. 2014). These studies found that IR events were detectable in cytoplasmic fractions and engaged with ribosomes to variable extents. High-resolution fractionation of polysomes into different size classes revealed that IR events were mostly present in a cluster of poorly translated transcripts, and only a small number of IR-containing transcripts were enriched in larger polysomes (Floor and Doudna 2016). As an alternative to polysome profiling, Weatheritt et al. combined ribosome footprinting with RNA-Seq to assess ribosomal engagement by mRNA isoforms (Weatheritt et al. 2016). This study revealed that IR was underrepresented on ribosomes compared to whole-cell RNA. In addition, IR events were enriched among the lowest expressed RNAs, consistent with ribosome engagement as a precursor to NMD. Thus, there are various ways in which intron retention regulates RNA stability and protein isoform production, as well as a rapid induction of expression via posttranscriptional splicing of nuclear-detained introns.
Intron retention in viruses
Intron retention was initially recognized in the analysis of viral interactions with host cells, and a mechanism for export and expression of mRNA with retained introns was first studied in human immunodeficiency virus (HIV) (Hadzopoulou-Cladaras et al. 1989; Hammarskjöld et al. 1989). In the late 1980's it was observed that HIV and other retroviruses use one of two mechanisms to overcome cellular restrictions to export and translation of their unspliced and incompletely spliced mRNAs. In complex retroviruses, such as HIV, virus-encoded regulatory proteins (Rev for HIV) bind to a specific cis-acting element in the incompletely spliced viral mRNA (RRE for HIV) and contact a host cell karyopherin protein (CRM1), also known as Exportin1 or Xpo1. Subsequently, this complex shuttles to the cytoplasm, where it delivers the RNA cargo (Hadzopoulou-Cladaras et al. 1989; Malim et al. 1989; Hammarskjöld et al. 1989; Fornerod et al. 1997). Thus, Rev works in trans on the RRE, and if this element is mutated or deleted or if Rev is not present, viral mRNAs with retained introns are not exported from the nucleus of the host cell. The specific domain in Rev that recruits Crm1 was identified to have a “nuclear export signal” (NES) (Fischer et al. 1995). Similar to many cellular proteins with the NES, Rev also has a nuclear localization signal (NLS) and shuttles between the nucleus and the cytoplasm even when it is not bound to RRE-RNA (Meyer and Malim 1994; Kalland et al. 1994). On the other hand, simpler retroviruses also contain a cis-acting sequence in their mRNAs, a constitutive RNA transport element (CTE) (Bray et al. 1994; Rekosh and Hammarskjold 2018). This element functions constitutively in host cells without a viral protein and interacts directly with the NXF1 to enable nuclear exit and subsequent translation of RI-mRNAs (Grüter et al. 1998). Although these mechanisms were originally thought to be unique to viruses, it was subsequently shown that the mammalian NXF1 gene itself contains a CTE in an alternatively spliced mRNA with a retained intron (Li et al. 2006). Direct interaction of Nxf1 with the CTE enables export and translation of a short alternative Nxf1 protein. It is still unclear how many mammalian genes contain elements that can function as CTEs. However, there are now numerous examples of RI-mRNAs that are efficiently exported from the nucleus and used to express novel proteins (Li et al. 2006; Yap et al. 2012; Pimentel et al. 2016).
Cis- and trans-acting elements regulate intron retention
The widely known factors of splice signal recognition that are integral to AS include epigenetic changes (e.g., DNA methylation), RNA Pol II processing, and splicing factor recruitment (Shukla et al. 2011). Their relevance to control IR has been a matter of investigation and has resulted in the recognition of both cis- and trans-acting elements as contributors to the mechanisms underlying IR-mediated regulation of gene expression (Braunschweig et al. 2014; Wong et al. 2017). Analysis of poly(A)+ RNA-Seq data from diverse mammalian and murine tissues and cell types revealed some distinct features that discriminate the retained introns from general introns undergoing constitutive splicing. They include elevated GC content, increased demethylated CpG, reduced length and weaker 5′ and 3′ splice sites (Fig. 3). Additionally, IR was found to be dependent on the location of the intron within the gene and was enriched in UTRs, but depleted in protein-coding regions. These features were characteristic of transcripts either from nuclear or cytoplasmic mammalian poly(A)+ RNA. In the search for cis-regulatory elements and trans-acting factors, Braunschweig and colleagues analyzed ENCODE ChIP-Seq data from human (K562 hematopoietic) and mouse (CH12 B lymphoma) cell lines focusing on transcription and chromatin components. They detected significant enrichment of RNA Pol II and some of the chromatin regulators, e.g., chromodomain-helicase-DNA-binding protein 2 (CHD2), over retained introns compared to constitutive, and specific chromatin modifications, i.e. acetylation of histone H3 (H3K27ac). Of all the factors analyzed, the strongest enrichment was found for RNA Pol II phosphorylated at serine 2 (Pol II-Ser2P) associated with transcription elongation, indicating that RIs are sites of pausing for Pol II, whose levels appear to be tightly coupled with increased IR events (Braunschweig et al. 2014). The relationship between these two factors was confirmed after treatment of murine embryonic stem cells (ESCs) with 5,6-dichloro-1-beta-D-ribofuranosylbenzimidazole (DRB), a compound that prevents polymerase elongation by blocking phosphorylation of its C-terminal domain. Over 70% of IR events with an increased percent of IR (PIR) were detected in DRB-treated cells compared to mock-treated cells used as a control (Braunschweig et al. 2014).
DNA methylation occurs predominantly at CpG dinucleotides in mammals, and sharp transitions in its patterns are known to mark splice junctions. This suggests that these signals may be crucial for the recognition of exons and introns (Laurent et al. 2010; Jones 2012). Changes in DNA methylation levels have well-established roles in the regulation of splicing including exon inclusion and skipping (Shukla et al. 2011; Maunakea et al. 2013). To determine whether this cis-acting element is associated with IR, Wong and colleagues performed whole-genome bisulfite sequencing (WGBS) on genomic DNA from mouse promyelocytes and granulocytes. They measured levels of DNA methylation near the 3’ and 5’ splice junctions and within the body of all introns in these cells. Significantly reduced methylation levels were observed in retained introns than in constitutive introns, particularly near the junctions of RI; however, a weaker reduction was also detected within the body of the unspliced introns (Wong et al. 2017). Furthermore, they determined the extent of IR events that are marked by reduced DNA methylation by analyzing the methylome of other normal and cancer cell types using publicly available RNA-Seq data from humans and mice, including human ESC lines (H1 and H9), a human lung fibroblast cell line (IMR90), a human neuron progenitor cells, a human colon cancer cell line (HCT116) and mouse primary and reprogrammed fibroblasts. Similar to what was observed in murine cells, the outcomes of this study revealed that reduced DNA methylation levels consistently marked retained introns near splice junctions and within introns, indicating conservation of this phenomenon in human and mouse biology.
RNA Pol II stalling and reduced DNA methylation levels observed near splice junctions of RIs suggest that methylation regulates retention by altering RNA Pol II progression and splice site recognition. Alternatively, RNA Pol II stalling near splice junctions of retained introns could occur as a consequence of inefficient recruitment of trans-acting spliceosomal components (Fong and Zhou 2001; Lin et al. 2008). The latter scenario is possible since retained introns have weaker splice sites than constitutive introns and thus might not be properly recognized as being sensitive to the levels of splicing factor concentrations (Sakabe and de Souza 2007; Wong et al. 2013; Braunschweig et al. 2014). This correlation was investigated by Braunschweig and colleagues who analyzed poly(A)+ RNA-Seq data generated from HeLa cells following knockdown of spliceosomal components (Saltzman et al. 2008; Braunschweig et al. 2014). The authors found that (i) reduction of snRNPs resulted in an increase of PIR for retained versus constitutive introns; (ii) introns that displayed the largest increases in PIR were associated with significantly higher levels of RNA Pol II occupancy, and (iii) the levels of transcripts containing introns that show increased retention upon snRNPs depletion are significantly reduced compared to the levels of transcripts with introns that do not show an increased PIR. Based on these result, it was proposed that depletion of snRNPs contributes to increased PIR, further supporting the conclusion that IR controls mammalian gene expression via a global, bidirectional cross-talk mechanism (Braunschweig et al. 2014). In this process, inaccurate transcription and low levels of expression resulting from impaired recruitment of core splicing factors, are linked with localized pausing of RNA Pol II, retention of introns and ultimately turnover of transcripts in mammalian cells and tissues.
An interesting model of splicing control centered on the competition of pre-mRNAs for limiting the splicing apparatus was proposed by the Ares lab (Munding et al. 2013). Based on this model, the trans-competition control (TCC) model originally described in yeast, splicing regulation is subjected to the composition of a pool of endogenous competing RNAs. By altering the competitive status of a target pre-mRNA through modulation of the levels of other RNAs that compete for limiting splicing machinery availability, splicing regulation can be achieved. There are thousands of competing introns within a cell, each with its own affinity for the spliceosome; as the concentration of any one of them changes, the splicing efficiency of all the others then must change as well. Munding and coauthors demonstrated that competition between introns from expressed vegetative or meiotic genes controls the expression of genes required for sporulation in yeast. Essentially, introns from sporulation genes display weaker splice sites and are outcompeted for limited splicing factors by the much more abundant introns from ribosomal protein genes. When yeast cells are stressed, the expression of ribosomal protein genes decreases, and transcripts from meiotic genes can be spliced. The TCC model also applies to mammalian systems in which induction of gene expression programs can result in large changes in the composition of the transcript pool, altering competition for the splicing machinery (Berg et al. 2012). Under these conditions, the competitive advantage of alternative exons for the splicing machinery may be decreased, resulting in a shift of mRNA isoforms.
Intron retention in normal biology
Alternative splicing facilitates biodiversity through the generation of various transcriptome and proteome isoforms and has been linked with developmental regulation (Wang et al. 2015). A number of studies on transcriptome-wide analyses have demonstrated that different IR events control gene expression programs during both normal development and disease-related states. For example, increased IR incidences were observed in various cells of the hematopoietic lineage, such as transformation of erythroblasts, megakaryocytes, granulocytes and CD4 T-cells (Wong et al. 2013; Pimentel et al. 2016; Edwards et al. 2016; Ni et al. 2016), during differentiation of ESCs into neural progenitors, reprogramming of mouse embryonic fibroblasts to induced pluripotent stem cells (IPSCs) (Braunschweig et al. 2014; Hussein et al. 2014; Boutz et al. 2015), and during normal development of smooth muscle cells (Llorian et al. 2016). Comparative analysis of these studies has revealed the common and unique features of IR-mediated regulation including its tissue- and cell-type specificities. In normally regulated programs involved in the developmental process, a common tendency for increased intron retention was found in more differentiated or postmitotic cellular states. However, a high percentage of IR events was not necessary for the induction of terminal differentiation. More plastic cell types whose functional maturation involves transition to proliferative states exhibit decreased IR incidence as shown during dedifferentiation of smooth muscle cells (Wong et al. 2013; Cho et al. 2014; Braunschweig et al. 2014; Pimentel et al. 2016; Edwards et al. 2016; Llorian et al. 2016). Some differentiation programs involve more complex patterns of IR for example, the final stages of erythropoiesis include a progressive increase in IR events before achieving a terminal stage that exhibited lower IR than the precursor cells. This dynamic pattern is further complicated by the presence of subgroups of IR events that exhibit differential regulation between the different stages of erythroblast differentiation (Pimentel et al. 2016; Edwards et al. 2016).
To investigate the incidence of IR, as well as its regulation and biological roles, Braunschweig et al. performed a comprehensive analysis of over forty diverse human and mouse cell and tissue types using high-coverage poly(A)+ RNA-Seq data (Braunschweig et al. 2014). This study revealed that the frequency of IR in mammals is far more widespread than previously detected using lower coverage RNA-Seq data (Wang et al. 2008) and affects most multiexonic genes. IR was detected to a variable extent between different cell and tissue types, and in general, a higher proportion of retained introns was found in neural and immune cell types, whereas IR was detected to be less frequent in ESCs and muscle cells. Increased IR in neuronal and immune cells was proposed to facilitate a rapid response to external stimuli, within a time frame shorter than that required for de novo transcription and protein synthesis (Ni et al. 2016; Mauger et al. 2016). Thus, IR may represent a tissue-specific process that serves to restrict the translation of proteins only to cells where they are required while concurrently maintaining transcription from the locus in other tissues. To investigate the physiological roles of IR during the cell maturation program, Braunschweig and colleagues analyzed alternative retained introns using RNA-Seq data from a time series of differentiation of cortical glutamatergic neurons from murine ESCs (Hubbard et al. 2013; Braunschweig et al. 2014). Strikingly, the vast majority (~ 90%) of detected differentially retained introns between embryonic stem cells and mature neurons revealed a progressive increase in retention during differentiation. Consistent with this observation and a global regulatory role for IR in the suppression of gene expression, transcripts with increased IR in mature neurons were expressed at significantly lower steady-state levels than transcripts with increased IR in ESCs. These results suggested that IR accompanies the process of functional tuning of the transcriptomes during murine neuronal differentiation. This is done by reducing the expression of transcripts that are not essential or are less required for the physiology of cells in which they were detected. Elimination of these transcripts was achieved through cytoplasmic NMD (Wong et al. 2013). Such a function of IR in transcriptome tuning remains in agreement with a number of other reports, including maturation of cells of the hematopoietic lineage (Yap et al. 2012; Boutz et al. 2015; Pimentel et al. 2016; Edwards et al. 2016; Llorian et al. 2016; Mauger et al. 2016).
Eukaryotic cells responses to stress-related programs are associated with IR (Shalgi et al. 2014; Boutz et al. 2015; Memon et al. 2016; Pimentel et al. 2016; Edwards et al. 2016; Llorian et al. 2016). Shalgi and colleagues examined genome-wide splicing regulation during heat shock in mouse fibroblasts. They observed widespread retention of introns in transcripts not directly involved in the stress response. In contrast, genes required for the immediate response to heat shock, such as protein folding were unaffected. Newly synthesized RNA from these genes appeared to be mostly cotranscriptionally spliced, whereas IR transcripts were posttranscriptionally spliced. Interestingly, transcripts with retained introns were not exported to the cytosol, but were stably maintained in the nucleus. They potentially serve as a pool of precursors that can be readily spliced and activated for recovery of normal gene expression post stress (Shalgi et al. 2014). Similar control of the level of IR transcripts which is independent of the cytoplasmic NMD pathway and relies on the nuclear RNA surveillance machinery, was shown during neurogenesis (Yap et al. 2012).
Intron retention in human diseases
IR has also been associated with complex disorders thereby providing disease-specific diagnostic biomarkers. A literature survey identified publications describing the association of IR with neuromuscular-neurodegenerative diseases including Alzheimer’s disease (AD) (Xu et al. 2008), ALS/FTD, DM2 and FECD and DMD (Xiao et al. 2008; Sznajder et al. 2018). IR is also widespread among a range of cancers including myelodysplastic syndromes (MDS) and chronic lymphocytic leukemia (CLL) and is described as a mechanism of tumor suppressor inactivation (Dvinge and Bradley 2015; Jung et al. 2015; Memon et al. 2016; Jeromin and Bowser 2017). In recent years, therapeutic strategies aimed at splicing regulation have been developed and are currently being tested in clinical trials for a range of diseases including muscular dystrophy and motor neuron diseases (Scotti and Swanson 2016; Zakharova 2021). It is therefore increasingly important to understand the relationship between IR and diseases. Below, we briefly review some of the studies that linked IR with human disorders.
A work by Sznajder and colleagues identified specific features of retained introns that contribute to their misprocessing (Sznajder et al. 2018). The authors took advantage of the presence of elongated microsatellites in introns of the genes linked with some of the human hereditary diseases. These included DM2 (CCTGexp, repeat expansion), ALS/FTD (GGGGCCexp), FECD (CTGexp), spinocerebellar ataxias type 10 (ATTCTexp), type 37 (ATTTCexp), and Friedreich’s ataxia (GAAexp) (Wojciechowska and Krzyzosiak 2011). Based on their results, the stable RNA secondary structure adopted by the expanded and highly polymorphic repeats in mutant transcripts has an effect on host intron splicing (Mirkin 2007; Zhang and Ashizawa 2017). In particular, highly structured GC-rich expansions (CTG, CCTG and GGGGCC), disrupted splicing of their host introns when studied in various human tissues and cultured cells from patients. Such misprocessing of mutant introns was not detected for A/AU-rich microsatellites of weak RNA structures (Sznajder et al. 2018). Interestingly, in FECD, DM2 and ALS/FTD disorders, aberrant processing of mutation-harboring intron RNAs occurs along with dysregulation of developmental programs of alternative splicing (Ranum and Day 2004; Cooper et al. 2009). This suggests that GC-rich intronic microsatellite expansions, by altering RNA structure and/or splicing factor accessibility, impair spliceosome recruitment. This in turn could cause misprocessing of host introns in affected genes and trigger pathogenesis. IR has also been implicated in AD. As shown by Xu and coworkers, neuronal expression of apolipoprotein E4 (APOE4) isoform associated with AD pathogenesis, is controlled by the presence or absence of intron 3 in APOE4 (Xu et al. 2008). In the primary neuron transfection experiment the authors found that expression of the APOE4 isoform was significantly higher when intron 3 was included in their cDNA construct, while lower when it was excluded. Since the APOE4 isoform has been found to increase Alzheimer’s risk (Sienski et al. 2021), this result suggests an association between IR and AD. In ALS transgenic mice degeneration of motor neurons has been linked to overexpression of the peripherin (PRPH) and associated with IR events. Xiao and coworkers have identified a novel splicing variant of the PRPH retaining introns 3 and 4 (Xiao et al. 2008). The IR-containing mRNA of peripherin was found to be expressed at a low stoichiometric level in ALS mouse motor neurons. When its expression is upregulated, it leads to the aggregation of peripherin, suggesting that the abnormal splicing retaining introns 3 and 4 produces a splice isoform that in ALS is prone to aggregation.
Several studies have identified an association between IR and cancers. Zhang and coauthors used whole transcriptome sequencing data from five lung adenocarcinoma tissues and matched normal tissues to detect IR (Zhang et al. 2014). A large number of IR events were found in both tissue types, i.e., 2,340 and 1,422 genes contained only tumor-specific and normal tissue-specific retention events, respectively. Subsequent functional analysis indicated that genes with tumor-specific retention include known lung cancer driver genes, e.g., EGFR, ROS1, and RUNX1, and are enriched in pathways that are important in carcinogenesis. IR in these genes causes frameshift, which generally invokes NMD and reduces the expression of mRNAs. These over-expressed or highly mutable driver genes may have a protective effect in patients. In another study by Jung et al. the authors demonstrated that IR is a mechanism leading to the inactivation of tumor suppressor genes (Jung et al. 2015). By analyzing the RNA-seq and exome data from approximately 2000 cancer patients, they determined that at least 163 of the 900 splice-disrupted somatic exon single-nucleotide variants caused IR in an allele-specific manner and were enriched in tumor suppressor genes. Another study of the association between IR and cancers was conducted by Dvinge and Bradley who analyzed the genome-wide RNA splicing patterns of 805 matched tumor and control samples from 16 cancers (Dvinge and Bradley 2015). They found that in cancers, abnormal RNA splicing occurs in the form of IR. They also used the transcriptomic data generated by the Cancer Genome Atlas project to identify large-scale differences in RNA splicing between the cancer and control samples. In all the cancers except breast cancer, IR events were upregulated in comparison to controls. This finding suggests that IR is a common factor associated with tumorigenesis. Finally, through genome-wide quantitative analysis and unsupervised clustering analysis, Dvinge and Bradley confirmed that although some retained introns are shared by the majority of cancer types, most are either present at a low frequency in multiple cancers or unique to primary cancers. Clustering results showed that cancers originating from similar tissues, such as the colon and rectum, have similar patterns of IR.
Recursive splicing of long introns
Conventional splicing of a pre-RNA involves the removal of introns as single units in a two-step catalytic reaction. However, processing some large introns > 10 k nt becomes challenging as they are excised from their maturing transcripts by splicing of multiple consecutive subfragments in a process known as recursive splicing. The RS process was first identified in the 73 k nt long intron of the Ultrabithorax (Ubx) gene of Drosophila (Hatton et al. 1998), and was subsequently experimentally validated in three other genes of the fruit fly i.e., Kuzbanian, Outspread, and Frizzled (Burnette et al. 2005; Conklin et al. 2005). In Drosophila, RS occurs via intronic cryptic splice sites, called recursive splice sites (RS-sites) or ratchet points (RPs). Each RS-site contains an AGGU sequence motif comprising a pair of juxtaposed 3′ and 5′ splice sites (Fig. 4). This is similar to the signals normally found at the end and the beginning of an intron, which divides a long intron into two or more segments to be sequentially spliced out. The RS-site initially functions as a 3′ splice site, paired with the upstream 5′ splice site to remove the upstream intronic segment. The reconstituted 5′ splice site then interacts with the downstream 3′ splice site, allowing an RS-site to function sequentially as an acceptor and then a donor for splicing of the intronic segment(s). As the process continues from one RS-site to the next, small loops of lariat RNA are released from the intron eventually joining two distant exons. Unlike canonical splicing, RS events leave no blueprint of the final mRNA, and the only direct evidence of RS is the splicing intermediates. However, the unstable nature of these intermediates and difficulty in capturing them has led to them being overlooked for a long time. Nevertheless, recent advances in high-throughput RNA-Seq have substantially increased the ability to detect RS events. Earlier RS analysis by Duff and coworkers using poly(A) + -selected RNA-seq data (~ 10 billion RNA-seq reads) which derive predominantly from mature transcripts identified 130 recursively spliced introns in flies (Duff et al. 2015). The number of RS-sites per intron ranged from one to six. For example, in the luna gene containing a 108 kb intron, five RPs were found, and the intron was removed in a six-step RS. Although, the RS-sites were enriched in large introns over 24 k nt, not all large introns contained these sites and intronic segments removed by RS ranged from ~ 2 k nt to over 60 k nt (Duff et al. 2015). Using this catalog of recursive sites, Duff and coworkers confirmed that recursive splicing is a conserved mechanism to excise constitutive introns, requires canonical splicing machinery, and only occurs in the longest 3% of Drosophila introns. More comprehensive analysis of RS in Drosophila was later reported from the Burge laboratory (Pai et al. 2018). Using new computational approaches and nascent RNA sequencing data highly enriched for splicing intermediates, Pai and coworkers discovered greater numbers of RS sites (2–4 times more), using less than 1/20th as many reads as used by Duff et al. (2015), supporting the potential of nascent RNA analysis for identification of recursive splice sites. Their analysis determined that very many long introns (> 40 k nt in length) have recursive sites, with the majority (~ 70%) of such introns containing at least one recursive site, and the number of these sites increased roughly linearly with intron length. Overall, they detected 539 candidate recursive sites in 379 fly introns and 98 introns contained multiple recursive sites, with up to seven sites observed in a single intron e.g., intron 1 of the tenascin major (Ten-m) gene contains five recursive sites, two of which were previously unknown. Recursively spliced introns were enriched in first introns, which are longer than non-first introns, relative to subsequent introns in fly genes (Pai et al. 2018). Altogether, these results suggests that recursive splicing is the prevalent mechanism by which very large fly introns are excised, however; its function and components remain elusive, and it remains unknown why some long introns are recursively spliced while others are not.
Despite the fact that approximately half of human protein-coding genes have introns over 24 k nt long and contain motifs similar to Drosophila RS-sites, only a handful of recursively spliced introns was initially identified in humans, mostly in genes involved in brain development, despite the greater abundance of long introns in vertebrate genomes (Duff et al. 2015; Sibley et al. 2015). Duff et al. analyzed total RNA-Seq datasets from twenty human tissues and identified five RS-sites in four genes, HS6ST3, CADM2, ROBO2, and PDE4D (Duff et al. 2015). In an independent study by Sibley et al. global RNA analysis of total RNA-Seq from human postmortem brains identified highly conserved RS-sites in four other genes i.e., ANK3, CADM1, OPCML, and NCAM1. Introns harboring these RS-sites were among the longest across vertebrates (Sibley et al. 2015). Interestingly, both studies demonstrated that RS events in humans utilize different splicing mechanism compared to those described in Drosophila. While no nucleotide RS exons are found in the fruit fly, in vertebrates, the RS process involves exon-like events, such as initial exon definition, and requires an RS-exon downstream of the AGGU RS-site (Fig. 4) (Sibley et al. 2015; Joseph et al. 2018; Pai et al. 2018; Zhang et al. 2018). A comprehensive analysis of the genomic features of human RS-sites was performed by Zhang and colleagues, who found that (i) RS-sites are enriched only in long introns, and RS is a mechanism that aids in the removal of such introns; (ii) RS-sites match the AGGU motif, i.e., the 3′ splice site consensus (AG) immediately followed by the 5′ splice site consensus (GU); (iii) the 3′ splice sites of RS-sites need to be sufficiently strong to be recognized by the spliceosome at the first step of recursive splicing and most RS exons have much lower percent-spliced-in (PSI) values than cassette exons and, in most cases, are spliced out of the final transcripts; (iv) the low efficiency of RS-exon inclusion is correlated with differences in GC% content compared to constitutive exons, and while the later tend to have higher GC% content than their long flanking introns, the former exons exhibit the same GC% content; and (v) some isoforms of RS exons may contain PTC, their inclusion suggests that the latter exons have higher GC stability, which is another factor contributing to the low abundance of RS exons (Zhang et al. 2018) (Fig. 4). Most recent study of RS by Wan and coworkers, who established a high-throughput system for labeling and imaging intronic RNA, and directly measured RNA Pol II and the spliceosome working in concert on endogenous human genes, revealed that most introns are removed from pre-mRNAs in smaller pieces rather than spliced as whole units in one step reaction (Wan et al. 2021). Their results have led to a model of stochastic splice site selection based on which unannotated splicing sites (internal RS sites) within introns are used frequently but randomly by the spliceosomes to make many cuts, instead of a single cut, to progressively remove an intron. This process results in the generation of transient splicing intermediates that are source of a final mRNA. Thus, this study by Wan et al., introduced a stochastic view for splicing that has parallels to the prevailing stochastic view of transcription.
Little is known about the RS machinery that regulates the processing of transcription units with long introns in human genes. Because RS is one of the splicing categories, similar to what was described in canonical splicing, it was proposed that RS is cotranscriptionally regulated and thus kinetically coupled with transcriptional elongation by RNA Pol II (Fong and Zhou 2001; de la Mata et al. 2010; Dujardin et al. 2013; Bentley 2014; Saldi et al. 2016). The Pol II elongation rate has been shown to exert an impact on the efficiency of recursive splicing of the fruit fly Ubx gene, correlation that has also been investigated in humans (de la Mata et al. 2003; Zhang et al. 2018). The most comprehensive analysis of the process underlining RS-site processing in the human transcriptome was conducted by Zhang and colleagues, who analyzed RNA-Seq data of time-course 4sUDRB (4-thiouridine tagging of nascent RNAs upon synchronization of Pol II elongation with DRB inhibition) from various human cell lines (PA1ovarian carcinoma cells, H9 embryonic stem cells, and forebrain (FB) neuronal progenitor cells differentiated from H9 cells) (Zhang et al. 2018). The authors determine that the RS introns were removed by both recursive splicing and canonical splicing. They detected intronic reads across the predicted branch point in a lariat configuration (full-length or recursive introns), and reads corresponded to canonical splicing spanning the entire length of the intron, including the RS site. These results suggest that recursive splicing in humans, unlike in flies, is not mutually exclusive with canonical splicing and supplements it rather than replacing it. The choice of which splicing mechanism was used appeared to be cell-type specific indicating that RS is part of the cell’s arsenal for long intron removal. Additionally, Zhang and coauthors defined that most RS events rather than cotranscriptionally underwent posttranscriptional recursive splicing. Although, the onset of most RS splicing events lags behind the completion of transcription of RS introns, they proceed in a timely fashion thereafter (Zhang et al. 2018). This suggests that RS genes exhibit high RNA Pol II elongation rates. They have some genomic and epigenetic features known to be positively correlated with fast Pol II elongation rates in humans. These genes are longer and contain significantly prolonged first introns compared to non-RS genes, in addition to their higher densities of histone marks e.g., H3K79me2 and H4K20me1 (Veloso et al. 2014).
Lariat-derived stable circular introns
Intronic sequences are spliced out of the primary RNA transcript as branched circular RNAs (lariats). They represent a large fraction of unstable molecules, as most of them are destroyed within minutes in the nucleus via the DBR1 enzyme. The enzyme hydrolyzes the 2′–5′ phosphodiester bond (branch point) in lariats to give rise to linear molecules promoting their rapid turnover (Ooi et al. 2001). Many exceptions to this canonical pathway have been described, and if the excised intron is not debranched, it can linger on in the nucleus or can also be exported to the cytoplasm, where it remains a stable circular molecule (Zhang et al. 2013; Talhouarne and Gall 2018). The presence of circular RNAs (circRNAs) was discovered in 2012 (Salzman et al. 2012) and since then three categories of molecules have been identified in mammalian cells and tissues as a result of linear pre-mRNA splicing which involves a head-to-tail (back-splicing) linking process in which a 5ʹ splice donor is joined to an upstream 3ʹ splice acceptor (e.g., the end of exon 2 is joined to the beginning of exon 1) (Czubak et al. 2019). These three categories include (i) exonic circRNAs (formed by the circularization of one or more exons through a back-splicing process), (ii) exonic-intronic circRNAs (formed by back spliced exons containing retained intron(s) being a product of alternative splicing), and (iii) intronic circRNAs obtained by pathways other than back splicing (Zhang et al. 2013). The processing of these noncanonical lariat-derived stable circular introns (sciRNAs) was found to be dependent on a consensus RNA motif containing a 7-nt GU-rich element near the 5′ splice site and an 11-nt C-rich motif close to the BP site. Additionally, these sciRNAs originate from introns that use cytosine rather than adenosine at the branch point, and they retain the 2′–5′ bond after 3ʹ-end trimming of the lariat. Since the debranching enzyme DBR1 is known to be relatively inefficient in linearizing C-branched lariats, it is speculated that a delay in the linearization of such introns may contribute to the formation of a stable complex that is more resistant to debranching. Moreover, these molecules escape debranching, partly, through export to the cytoplasm where there is no DBR1 enzyme.
SciRNAs from a variety of genes have been found in the cytoplasm and nucleus of Xenopus oocytes, in embryos of Drosophila, and in cells and tissues from humans, mice, chickens and fish (Gardner et al. 2012; Talhouarne and Gall 2014, 2018). Molecular analysis of these molecules showed that they were resistant to an α-amanitin inhibitor and were retained in the exposed cells several hours posttreatment, while other conventionally processed introns were degraded. Importantly, these sciRNAs from human, mouse, and chicken cells were usually derived from a single short (~ 100–500 nt) intron per gene (Fig. 5). The biological significance of these circular introns remains obscure; however, it is possible that their cytoplasmic fraction plays a role in the regulation of functional levels of RNA-binding proteins (RBPs) in normal cells and tissues, as proposed for dicer, TDP-43, and snRNPs (Armakola et al. 2012; Li et al. 2016b; Han et al. 2017). However, these molecules may also participate in some disease programs contributing to abnormal depletion and functional insufficiency of splicing factors and other proteins important in the pathogenesis of some human genetic disorders, such as myotonic dystrophy type 1 (DM1) and DM2 (Wojciechowska and Krzyzosiak 2011). Interestingly, the upregulation of various circular RNAs has recently been detected in tissues and cells from DM1 patients and in mouse model of the disease (Czubak et al. 2019; Voellenkle et al. 2019). Their biological role and prospective pathogenic effect are being investigated (Wojciechowska M, unpublished data).
Concluding remarks
Only approximately 1.5% of DNA contains protein-coding sequences. The other over 98% consists of nonprotein coding regions (Kapranov et al. 2007; Guttman et al. 2009; ENCODE Project Consortium 2012). Evidence is rapidly accumulating that at least some of the noncoding genome is integral to the function of cells, particularly for the control of gene expression. Intronic sequences that account for over 20% of the human genome provide a source of noncoding RNAs, and their processing and biological functions have been of particular interest in recent years (Zhang et al. 2013). It appears that introns are the key source of information for understanding our biology. Their removal from the host transcript is tightly associated with pre-mRNA splicing, and aberrations in this process may result in them retaining in mature mRNA, which may have broad biological consequences. Increasing numbers of identified RI-mRNAs are now recognized as a result of the higher sensitivity of RNA deep sequencing. The levels of RI-mRNAs have been linked with gene regulation during normal development and differentiation, but they have also been recognized as a source of disorders associated with human diseases. Of particular interest is a group of excised introns that escape canonical processing, and their presence in the form of lariats without a tail has been detected in various organisms. Their biological function, interactions with other cellular molecules, and eventual turnover remain elusive, and more research is needed to evaluate whether these byproducts of splicing are indeed unwanted passengers with no biological purpose or whether they represent the key link for the proper physiological regulation of cellular processes.
Availability of data and material
Not applicable.
Code availability
Not applicable.
References
Armakola M, Higgins MJ, Figley MD et al (2012) Inhibition of RNA lariat debranching enzyme suppresses TDP-43 toxicity in ALS disease models. Nat Genet 44:1302–1309. https://doi.org/10.1038/ng.2434
Barbosa-Morais NL, Irimia M, Pan Q et al (2012) The evolutionary landscape of alternative splicing in vertebrate species. Science 338:1587–1593. https://doi.org/10.1126/science.1230612
Bentley DL (2014) Coupling mRNA processing with transcription in time and space. Nat Rev Genet 15:163–175. https://doi.org/10.1038/nrg3662
Berg MG, Singh LN, Younis I et al (2012) U1 snRNP determines mRNA length and regulates isoform expression. Cell 150:53–64. https://doi.org/10.1016/j.cell.2012.05.029
Berget SM, Sharp PA (1977) A spliced sequence at the 5’-terminus of adenovirus late mRNA. Brookhaven Symp Biol 74(8):3171–3175. https://doi.org/10.1073/pnas.74.8.3171
Bieberstein NI, Carrillo Oesterreich F, Straube K, Neugebauer KM (2012) First exon length controls active chromatin signatures and transcription. Cell Rep 2:62–68. https://doi.org/10.1016/j.celrep.2012.05.019
Bonadia LC, de Lima Marson FA, Ribeiro JD et al (2014) CFTR genotype and clinical outcomes of adult patients carried as cystic fibrosis disease. Gene 540:183–190. https://doi.org/10.1016/j.gene.2014.02.040
Boutz PL, Bhutkar A, Sharp PA (2015) Detained introns are a novel, widespread class of post-transcriptionally spliced introns. Genes Dev 29:63–80. https://doi.org/10.1101/gad.247361.114
Braunschweig U, Barbosa-Morais NL, Pan Q et al (2014) Widespread intron retention in mammals functionally tunes transcriptomes. Genome Res 24:1774–1786. https://doi.org/10.1101/gr.177790.114
Bray M, Prasad S, Dubay JW et al (1994) A small element from the Mason-Pfizer monkey virus genome makes human immunodeficiency virus type 1 expression and replication Rev-independent. Proc Natl Acad Sci U S A 91:1256–1260. https://doi.org/10.1073/pnas.91.4.1256
Breathnach R, Mandel JL, Chambon P (1977) Ovalbumin gene is split in chicken DNA. Nature 270:314–319. https://doi.org/10.1038/270314a0
Brett D, Pospisil H, Valcárcel J et al (2002) Alternative splicing and genome complexity. Nat Genet 30:29–30. https://doi.org/10.1038/ng803
Broseus L, Ritchie W (2020) Challenges in detecting and quantifying intron retention from next generation sequencing data. Comput Struct Biotechnol J 18:501–508. https://doi.org/10.1016/j.csbj.2020.02.010
Burnette JM, Miyamoto-Sato E, Schaub MA et al (2005) Subdivision of large introns in Drosophila by recursive splicing at nonexonic elements. Genetics 170:661–674. https://doi.org/10.1534/genetics.104.039701
Cho V, Mei Y, Sanny A et al (2014) The RNA-binding protein hnRNPLL induces a T cell alternative splicing program delineated by differential intron retention in polyadenylated RNA. Genome Biol 15:R26. https://doi.org/10.1186/gb-2014-15-1-r26
Chow LT, Roberts JM, Lewis JB, Broker TR (1977) A map of cytoplasmic RNA transcripts from lytic adenovirus type 2, determined by electron microscopy of RNA:DNA hybrids. Cell 11:819–836. https://doi.org/10.1016/0092-8674(77)90294-x
Conklin JF, Goldman A, Lopez AJ (2005) Stabilization and analysis of intron lariats in vivo. Methods San Diego Calif 37:368–375. https://doi.org/10.1016/j.ymeth.2005.08.002
Cooper TA, Wan L, Dreyfuss G (2009) RNA and disease. Cell 136:777–793. https://doi.org/10.1016/j.cell.2009.02.011
Czubak K, Taylor K, Piasecka A et al (2019) Global increase in circular RNA levels in myotonic dystrophy. Front Genet 10:649. https://doi.org/10.3389/fgene.2019.00649
de la Mata M, Alonso CR, Kadener S et al (2003) A slow RNA polymerase II affects alternative splicing in vivo. Mol Cell 12:525–532. https://doi.org/10.1016/j.molcel.2003.08.001
de la Mata M, Lafaille C, Kornblihtt AR (2010) First come, first served revisited: factors affecting the same alternative splicing event have different effects on the relative rates of intron removal. RNA NYN 16:904–912. https://doi.org/10.1261/rna.1993510
Duff MO, Olson S, Wei X et al (2015) Genome-wide identification of zero nucleotide recursive splicing in Drosophila. Nature 521:376–379. https://doi.org/10.1038/nature14475
Dujardin G, Lafaille C, Petrillo E et al (2013) Transcriptional elongation and alternative splicing. Biochim Biophys Acta 1829:134–140. https://doi.org/10.1016/j.bbagrm.2012.08.005
Dvinge H, Bradley RK (2015) Widespread intron retention diversifies most cancer transcriptomes. Genome Med 7:45. https://doi.org/10.1186/s13073-015-0168-9
Edwards CR, Ritchie W, Wong JJ-L et al (2016) A dynamic intron retention program in the mammalian megakaryocyte and erythrocyte lineages. Blood 127:e24–e34. https://doi.org/10.1182/blood-2016-01-692764
ENCODE Project Consortium (2012) An integrated encyclopedia of DNA elements in the human genome. Nature 489:57–74. https://doi.org/10.1038/nature11247
Fischer U, Huber J, Boelens WC et al (1995) The HIV-1 Rev activation domain is a nuclear export signal that accesses an export pathway used by specific cellular RNAs. Cell 82:475–483. https://doi.org/10.1016/0092-8674(95)90436-0
Floor SN, Doudna JA (2016) Tunable protein synthesis by transcript isoforms in human cells. Elife 5:e10921. https://doi.org/10.7554/eLife.10921
Fong YW, Zhou Q (2001) Stimulatory effect of splicing factors on transcriptional elongation. Nature 414:929–933. https://doi.org/10.1038/414929a
Fornerod M, Ohno M, Yoshida M, Mattaj IW (1997) CRM1 is an export receptor for leucine-rich nuclear export signals. Cell 90:1051–1060. https://doi.org/10.1016/s0092-8674(00)80371-2
Gaffney DJ, Keightley PD (2004) Unexpected conserved non-coding DNA blocks in mammals. Trends Genet TIG 20:332–337. https://doi.org/10.1016/j.tig.2004.06.011
Gao K, Masuda A, Matsuura T, Ohno K (2008) Human branch point consensus sequence is yUnAy. Nucleic Acids Res 36:2257–2267. https://doi.org/10.1093/nar/gkn073
Gardner EJ, Nizami ZF, Talbot CC, Gall JG (2012) Stable intronic sequence RNA (sisRNA), a new class of noncoding RNA from the oocyte nucleus of Xenopus tropicalis. Genes Dev 26:2550–2559. https://doi.org/10.1101/gad.202184.112
Ge Y, Porse BT (2014) The functional consequences of intron retention: alternative splicing coupled to NMD as a regulator of gene expression. BioEssays News Rev Mol Cell Dev Biol 36:236–243. https://doi.org/10.1002/bies.201300156
Gooding C, Clark F, Wollerton MC et al (2006) A class of human exons with predicted distant branch points revealed by analysis of AG dinucleotide exclusion zones. Genome Biol 7:R1. https://doi.org/10.1186/gb-2006-7-1-r1
Grüter P, Tabernero C, von Kobbe C et al (1998) TAP, the human homolog of Mex67p, mediates CTE-dependent RNA export from the nucleus. Mol Cell 1:649–659. https://doi.org/10.1016/s1097-2765(00)80065-9
Guttman M, Amit I, Garber M et al (2009) Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals. Nature 458:223–227. https://doi.org/10.1038/nature07672
Hadzopoulou-Cladaras M, Felber BK, Cladaras C et al (1989) The rev (trs/art) protein of human immunodeficiency virus type 1 affects viral mRNA and protein expression via a cis-acting sequence in the env region. J Virol 63:1265–1274. https://doi.org/10.1128/JVI.63.3.1265-1274.1989
Hammarskjöld ML, Heimer J, Hammarskjöld B et al (1989) Regulation of human immunodeficiency virus env expression by the rev gene product. J Virol 63:1959–1966. https://doi.org/10.1128/JVI.63.5.1959-1966.1989
Han B, Park HK, Ching T et al (2017) Human DBR1 modulates the recycling of snRNPs to affect alternative RNA splicing and contributes to the suppression of cancer development. Oncogene 36:5382–5391. https://doi.org/10.1038/onc.2017.150
Hatton AR, Subramaniam V, Lopez AJ (1998) Generation of alternative ultrabithorax isoforms and stepwise removal of a large intron by resplicing at exon-exon junctions. Mol Cell 2:787–796. https://doi.org/10.1016/s1097-2765(00)80293-2
Hubbard KS, Gut IM, Lyman ME, McNutt PM (2013) Longitudinal RNA sequencing of the deep transcriptome during neurogenesis of cortical glutamatergic neurons from murine ESCs. F1000Research 2:35. https://doi.org/10.12688/f1000research.2-35.v1
Hussein SMI, Puri MC, Tonge PD et al (2014) Genome-wide characterization of the routes to pluripotency. Nature 516:198–206. https://doi.org/10.1038/nature14046
Jacob AG, Smith CWJ (2017) Intron retention as a component of regulated gene expression programs. Hum Genet 136:1043–1057. https://doi.org/10.1007/s00439-017-1791-x
Jeffreys AJ, Flavell RA (1977) The rabbit beta-globin gene contains a large large insert in the coding sequence. Cell 12:1097–1108. https://doi.org/10.1016/0092-8674(77)90172-6
Jeromin A, Bowser R (2017) Biomarkers in neurodegenerative diseases. Adv Neurobiol 15:491–528. https://doi.org/10.1007/978-3-319-57193-5_20
Jones PA (2012) Functions of DNA methylation: islands, start sites, gene bodies and beyond. Nat Rev Genet 13:484–492. https://doi.org/10.1038/nrg3230
Joseph B, Lai EC (2021) The exon junction complex and intron removal prevent re-splicing of mRNA. PLoS Genet 17:e1009563. https://doi.org/10.1371/journal.pgen.1009563
Joseph B, Kondo S, Lai EC (2018) Short cryptic exons mediate recursive splicing in Drosophila. Nat Struct Mol Biol 25:365–371. https://doi.org/10.1038/s41594-018-0052-6
Jung H, Lee D, Lee J et al (2015) Intron retention is a widespread mechanism of tumor-suppressor inactivation. Nat Genet 47:1242–1248. https://doi.org/10.1038/ng.3414
Kalland KH, Szilvay AM, Brokstad KA et al (1994) The human immunodeficiency virus type 1 Rev protein shuttles between the cytoplasm and nuclear compartments. Mol Cell Biol 14:7436–7444. https://doi.org/10.1128/mcb.14.11.7436-7444.1994
Kalsotra A, Cooper TA (2011) Functional consequences of developmentally regulated alternative splicing. Nat Rev Genet 12:715–729. https://doi.org/10.1038/nrg3052
Kapranov P, Cheng J, Dike S et al (2007) RNA maps reveal new RNA classes and a possible function for pervasive transcription. Science 316:1484–1488. https://doi.org/10.1126/science.1138341
Lareau LF, Brenner SE (2015) Regulation of splicing factors by alternative splicing and NMD is conserved between kingdoms yet evolutionarily flexible. Mol Biol Evol 32:1072–1079. https://doi.org/10.1093/molbev/msv002
Laurent L, Wong E, Li G et al (2010) Dynamic changes in the human methylome during differentiation. Genome Res 20:320–331. https://doi.org/10.1101/gr.101907.109
Lewis BP, Green RE, Brenner SE (2003) Evidence for the widespread coupling of alternative splicing and nonsense-mediated mRNA decay in humans. Proc Natl Acad Sci U S A 100:189–192. https://doi.org/10.1073/pnas.0136770100
Li Y, Bor Y-C, Misawa Y et al (2006) An intron with a constitutive transport element is retained in a Tap messenger RNA. Nature 443:234–237. https://doi.org/10.1038/nature05107
Li Y, Bor Y-C, Fitzgerald MP et al (2016a) An NXF1 mRNA with a retained intron is expressed in hippocampal and neocortical neurons and is translated into a protein that functions as an Nxf1 cofactor. Mol Biol Cell 27:3903–3912. https://doi.org/10.1091/mbc.E16-07-0515
Li Z, Wang S, Cheng J et al (2016b) Intron lariat RNA inhibits microRNA biogenesis by sequestering the dicing complex in arabidopsis. PLoS Genet 12:e1006422. https://doi.org/10.1371/journal.pgen.1006422
Lin S, Coutinho-Mansfield G, Wang D et al (2008) The splicing factor SC35 has an active role in transcriptional elongation. Nat Struct Mol Biol 15:819–826. https://doi.org/10.1038/nsmb.1461
Llorian M, Gooding C, Bellora N et al (2016) The alternative splicing program of differentiated smooth muscle cells involves concerted non-productive splicing of post-transcriptional regulators. Nucleic Acids Res 44:8933–8950. https://doi.org/10.1093/nar/gkw560
Malim MH, Hauber J, Le SY et al (1989) The HIV-1 rev trans-activator acts through a structured target sequence to activate nuclear export of unspliced viral mRNA. Nature 338:254–257. https://doi.org/10.1038/338254a0
Mandel JL, Breathnach R, Gerlinger P et al (1978) Organization of coding and intervening sequences in the chicken ovalbumin split gene. Cell 14:641–653. https://doi.org/10.1016/0092-8674(78)90248-9
Marquez Y, Brown JWS, Simpson C et al (2012) Transcriptome survey reveals increased complexity of the alternative splicing landscape in Arabidopsis. Genome Res 22:1184–1195. https://doi.org/10.1101/gr.134106.111
Marquez Y, Höpfler M, Ayatollahi Z et al (2015) Unmasking alternative splicing inside protein-coding exons defines exitrons and their role in proteome plasticity. Genome Res 25:995–1007. https://doi.org/10.1101/gr.186585.114
Mauger O, Lemoine F, Scheiffele P (2016) Targeted intron retention and excision for rapid gene regulation in response to neuronal activity. Neuron 92:1266–1278. https://doi.org/10.1016/j.neuron.2016.11.032
Maunakea AK, Chepelev I, Cui K, Zhao K (2013) Intragenic DNA methylation modulates alternative splicing by recruiting MeCP2 to promote exon recognition. Cell Res 23:1256–1269. https://doi.org/10.1038/cr.2013.110
McGlincy NJ, Smith CWJ (2008) Alternative splicing resulting in nonsense-mediated mRNA decay: what is the meaning of nonsense? Trends Biochem Sci 33:385–393. https://doi.org/10.1016/j.tibs.2008.06.001
Memon D, Dawson K, Smowton CS et al (2016) Hypoxia-driven splicing into noncoding isoforms regulates the DNA damage response. NPJ Genomic Med 1:16020. https://doi.org/10.1038/npjgenmed.2016.20
Mercer TR, Clark MB, Andersen SB et al (2015) Genome-wide discovery of human splicing branchpoints. Genome Res 25:290–303. https://doi.org/10.1101/gr.182899.114
Merkin J, Russell C, Chen P, Burge CB (2012) Evolutionary dynamics of gene and isoform regulation in Mammalian tissues. Science 338:1593–1599. https://doi.org/10.1126/science.1228186
Meyer BE, Malim MH (1994) The HIV-1 Rev trans-activator shuttles between the nucleus and the cytoplasm. Genes Dev 8:1538–1547. https://doi.org/10.1101/gad.8.13.1538
Middleton R, Gao D, Thomas A et al (2017) IRFinder: assessing the impact of intron retention on mammalian gene expression. Genome Biol 18:51. https://doi.org/10.1186/s13059-017-1184-4
Mirkin SM (2007) Expandable DNA repeats and human disease. Nature 447:932–940. https://doi.org/10.1038/nature05977
Monteuuis G, Wong JJL, Bailey CG et al (2019) The changing paradigm of intron retention: regulation, ramifications and recipes. Nucleic Acids Res 47:11497–11513. https://doi.org/10.1093/nar/gkz1068
Munding EM, Shiue L, Katzman S et al (2013) Competition between pre-mRNAs for the splicing machinery drives global regulation of splicing. Mol Cell 51:338–348. https://doi.org/10.1016/j.molcel.2013.06.012
Ner-Gaon H, Halachmi R, Savaldi-Goldstein S et al (2004) Intron retention is a major phenomenon in alternative splicing in Arabidopsis. Plant J Cell Mol Biol 39:877–885. https://doi.org/10.1111/j.1365-313X.2004.02172.x
Ni T, Yang W, Han M et al (2016) Global intron retention mediated gene regulation during CD4+ T cell activation. Nucleic Acids Res 44:6817–6829. https://doi.org/10.1093/nar/gkw591
Nilsen TW, Graveley BR (2010) Expansion of the eukaryotic proteome by alternative splicing. Nature 463:457–463. https://doi.org/10.1038/nature08909
Ninomiya K, Kataoka N, Hagiwara M (2011) Stress-responsive maturation of Clk1/4 pre-mRNAs promotes phosphorylation of SR splicing factor. J Cell Biol 195:27–40. https://doi.org/10.1083/jcb.201107093
Nixon JEJ, Wang A, Morrison HG et al (2002) A spliceosomal intron in Giardia lamblia. Proc Natl Acad Sci USA 99:3701–3705. https://doi.org/10.1073/pnas.042700299
Ooi SL, Dann C, Nam K et al (2001) RNA lariat debranching enzyme. Methods Enzymol 342:233–248. https://doi.org/10.1016/s0076-6879(01)42548-1
Pai AA, Paggi JM, Yan P et al (2018) Numerous recursive sites contribute to accuracy of splicing in long introns in flies. PLoS Genet 14:e1007588. https://doi.org/10.1371/journal.pgen.1007588
Pimentel H, Parra M, Gee SL et al (2016) A dynamic intron retention program enriched in RNA processing genes regulates gene expression during terminal erythropoiesis. Nucleic Acids Res 44:838–851. https://doi.org/10.1093/nar/gkv1168
Ranum LPW, Day JW (2004) Pathogenic RNA repeats: an expanding role in genetic disease. Trends Genet TIG 20:506–512. https://doi.org/10.1016/j.tig.2004.08.004
Rekosh D, Hammarskjold M-L (2018) Intron retention in viruses and cellular genes: detention, border controls and passports. Wiley Interdiscip Rev RNA 9:e1470. https://doi.org/10.1002/wrna.1470
Rogozin IB, Carmel L, Csuros M, Koonin EV (2012) Origin and evolution of spliceosomal introns. Biol Direct 7:11. https://doi.org/10.1186/1745-6150-7-11
Roy B, Haupt LM, Griffiths LR (2013) Review: alternative splicing (AS) of genes as an approach for generating protein complexity. Curr Genomics 14:182–194. https://doi.org/10.2174/1389202911314030004
Sakabe NJ, de Souza SJ (2007) Sequence features responsible for intron retention in human. BMC Genomics 8:59. https://doi.org/10.1186/1471-2164-8-59
Saldi T, Cortazar MA, Sheridan RM, Bentley DL (2016) Coupling of RNA polymerase II transcription elongation with pre-mRNA splicing. J Mol Biol 428:2623–2635. https://doi.org/10.1016/j.jmb.2016.04.017
Saltzman AL, Kim YK, Pan Q et al (2008) Regulation of multiple core spliceosomal proteins by alternative splicing-coupled nonsense-mediated mRNA decay. Mol Cell Biol 28:4320–4330. https://doi.org/10.1128/MCB.00361-08
Salzman J, Gawad C, Wang PL et al (2012) Circular RNAs are the predominant transcript isoform from hundreds of human genes in diverse cell types. PLoS One 7:e30733. https://doi.org/10.1371/journal.pone.0030733
Schmitz U, Pinello N, Jia F et al (2017) Intron retention enhances gene regulatory complexity in vertebrates. Genome Biol 18:216. https://doi.org/10.1186/s13059-017-1339-3
Scotti MM, Swanson MS (2016) RNA mis-splicing in disease. Nat Rev Genet 17:19–32. https://doi.org/10.1038/nrg.2015.3
Shalgi R, Hurt JA, Lindquist S, Burge CB (2014) Widespread inhibition of posttranscriptional splicing shapes the cellular transcriptome following heat shock. Cell Rep 7:1362–1370. https://doi.org/10.1016/j.celrep.2014.04.044
Shukla S, Kavak E, Gregory M et al (2011) CTCF-promoted RNA polymerase II pausing links DNA methylation to splicing. Nature 479:74–79. https://doi.org/10.1038/nature10442
Sibley CR, Emmett W, Blazquez L et al (2015) Recursive splicing in long vertebrate genes. Nature 521:371–375. https://doi.org/10.1038/nature14466
Sienski G, Narayan P, Bonner JM et al (2021) APOE4 disrupts intracellular lipid homeostasis in human iPSC-derived glia. Sci Transl Med 13:eaaz4564. https://doi.org/10.1126/scitranslmed.aaz4564
Sterne-Weiler T, Martinez-Nunez RT, Howard JM et al (2013) Frac-seq reveals isoform-specific recruitment to polyribosomes. Genome Res 23:1615–1623. https://doi.org/10.1101/gr.148585.112
Sterrantino M, Fuso A, Pierandrei S et al (2021) Quantitative evaluation of CFTR pre-mRNA splicing dependent on the (TG)mTn poly-variant tract. Diagn Basel Switz 11:168. https://doi.org/10.3390/diagnostics11020168
Syed NH, Kalyna M, Marquez Y et al (2012) Alternative splicing in plants–coming of age. Trends Plant Sci 17:616–623. https://doi.org/10.1016/j.tplants.2012.06.001
Sznajder ŁJ, Thomas JD, Carrell EM et al (2018) Intron retention induced by microsatellite expansions as a disease biomarker. Proc Natl Acad Sci USA 115:4234–4239. https://doi.org/10.1073/pnas.1716617115
Tahmasebi S, Jafarnejad SM, Tam IS et al (2016) Control of embryonic stem cell self-renewal and differentiation via coordinated alternative splicing and translation of YY2. Proc Natl Acad Sci USA 113:12360–12367. https://doi.org/10.1073/pnas.1615540113
Talhouarne GJS, Gall JG (2014) Lariat intronic RNAs in the cytoplasm of Xenopus tropicalis oocytes. RNA NYN 20:1476–1487. https://doi.org/10.1261/rna.045781.114
Talhouarne GJS, Gall JG (2018) Lariat intronic RNAs in the cytoplasm of vertebrate cells. Proc Natl Acad Sci USA 115:E7970–E7977. https://doi.org/10.1073/pnas.1808816115
Veloso A, Kirkconnell KS, Magnuson B et al (2014) Rate of elongation by RNA polymerase II is associated with specific gene features and epigenetic modifications. Genome Res 24:896–905. https://doi.org/10.1101/gr.171405.113
Voellenkle C, Perfetti A, Carrara M et al (2019) Dysregulation of circular RNAs in myotonic dystrophy type 1. Int J Mol Sci 20:E1938. https://doi.org/10.3390/ijms20081938
Wan Y, Anastasakis DG, Rodriguez J et al (2021) Dynamic imaging of nascent RNA reveals general principles of transcription dynamics and stochastic splice site selection. Cell 184:2878-2895.e20. https://doi.org/10.1016/j.cell.2021.04.012
Wang ET, Sandberg R, Luo S et al (2008) Alternative isoform regulation in human tissue transcriptomes. Nature 456:470–476. https://doi.org/10.1038/nature07509
Wang ET, Ward AJ, Cherone JM et al (2015) Antagonistic regulation of mRNA expression and splicing by CELF and MBNL proteins. Genome Res 25:858–871. https://doi.org/10.1101/gr.184390.114
Weatheritt RJ, Sterne-Weiler T, Blencowe BJ (2016) The ribosome-engaged landscape of alternative splicing. Nat Struct Mol Biol 23:1117–1123. https://doi.org/10.1038/nsmb.3317
Weischenfeldt J, Waage J, Tian G et al (2012) Mammalian tissues defective in nonsense-mediated mRNA decay display highly aberrant splicing patterns. Genome Biol 13:R35. https://doi.org/10.1186/gb-2012-13-5-r35
Wojciechowska M, Krzyzosiak WJ (2011) Cellular toxicity of expanded RNA repeats: focus on RNA foci. Hum Mol Genet 20:3811–3821. https://doi.org/10.1093/hmg/ddr299
Wong JJ-L, Ritchie W, Ebner OA et al (2013) Orchestrated intron retention regulates normal granulocyte differentiation. Cell 154:583–595. https://doi.org/10.1016/j.cell.2013.06.052
Wong JJ-L, Au AYM, Ritchie W, Rasko JEJ (2016) Intron retention in mRNA: No longer nonsense: known and putative roles of intron retention in normal and disease biology. BioEssays News Rev Mol Cell Dev Biol 38:41–49. https://doi.org/10.1002/bies.201500117
Wong JJ-L, Gao D, Nguyen TV et al (2017) Intron retention is regulated by altered MeCP2-mediated splicing factor recruitment. Nat Commun 8:15134. https://doi.org/10.1038/ncomms15134
Wu J, Xiao J, Wang L et al (2013) Systematic analysis of intron size and abundance parameters in diverse lineages. Sci China Life Sci 56:968–974. https://doi.org/10.1007/s11427-013-4540-y
Xiao S, Tjostheim S, Sanelli T et al (2008) An aggregate-inducing peripherin isoform generated through intron retention is upregulated in amyotrophic lateral sclerosis and associated with disease pathology. J Neurosci off J Soc Neurosci 28:1833–1840. https://doi.org/10.1523/JNEUROSCI.3222-07.2008
Xu Q, Walker D, Bernardo A et al (2008) Intron-3 retention/splicing controls neuronal expression of apolipoprotein E in the CNS. J Neurosci off J Soc Neurosci 28:1452–1459. https://doi.org/10.1523/JNEUROSCI.3253-07.2008
Yap K, Lim ZQ, Khandelia P et al (2012) Coordinated regulation of neuronal mRNA steady-state levels through developmentally controlled intron retention. Genes Dev 26:1209–1223. https://doi.org/10.1101/gad.188037.112
Zakharova M (2021) Modern approaches in gene therapy of motor neuron diseases. Med Res Rev 41:2634–2655. https://doi.org/10.1002/med.21705
Zhang N, Ashizawa T (2017) RNA toxicity and foci formation in microsatellite expansion diseases. Curr Opin Genet Dev 44:17–29. https://doi.org/10.1016/j.gde.2017.01.005
Zhang Y, Zhang X-O, Chen T et al (2013) Circular intronic long noncoding RNAs. Mol Cell 51:792–806. https://doi.org/10.1016/j.molcel.2013.08.017
Zhang Q, Li H, Jin H et al (2014) The global landscape of intron retentions in lung adenocarcinoma. BMC Med Genomics 7:15. https://doi.org/10.1186/1755-8794-7-15
Zhang X-O, Fu Y, Mou H et al (2018) The temporal landscape of recursive splicing during Pol II transcription elongation in human cells. PLoS Genet 14:e1007579. https://doi.org/10.1371/journal.pgen.1007579
Acknowledgements
This work was funded by grants from the Polish National Science Centre (2019/33/B/NZ5/02473 and 2020/39/B/NZ3/01811 to M.W). We would like to thank Kaja Jaskot for drafting panels of Fig. 1.
Funding
This work was funded by grants from the Polish National Science Centre (2019/33/B/NZ5/02473 to M.W, and 2020/39/B/NZ3/01811 to M.W).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no conflict of interest.
Ethics approval
Not applicable.
Consent to participate
Not applicable.
Consent for publication
Not applicable.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Kumari, A., Sedehizadeh, S., Brook, J.D. et al. Differential fates of introns in gene expression due to global alternative splicing. Hum Genet 141, 31–47 (2022). https://doi.org/10.1007/s00439-021-02409-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00439-021-02409-6