Abstract
Alternatively spliced introns are the ones that are usually spliced but can be occasionally retained in a transcript isoform. They are the most frequently used alternative splice form in plants (~50% of alternative splicing events). Chlamydomonas reinhardtii, a unicellular alga, is a good model to understand alternative splicing (AS) in plants from an evolutionary perspective as it diverged from land plants a billion years ago. Using over 7 million cDNA sequences from both pyrosequencing and Sanger sequencing, we found that a much higher percentage of genes (~20% of multi-exon genes) undergo AS than previously reported (3–5%). We found a full component of SR and SR-like proteins possibly involved in AS. The most prevalent type of AS event (40%) was retention of introns, most of which were supported by multiple cDNA evidence (72%) while only 20% of them have coding capacity. By comparing retained and constitutive introns, we identified sequence features potentially responsible for the retention of introns, in the framework of an “intron definition” model for splicing. We find that retained introns tend to have a weaker 5′ splice site, more Gs in their poly-pyrimidine tract and a lesser conservation of nucleotide ‘C’ at position −3 of the 3′ splice site. In addition, the sequence motifs found in the potential branch-point region differed between retained and constitutive introns. Furthermore, the enrichment of G-triplets and C-triplets among the first and last 50 nt of the introns significantly differ between constitutive and retained introns. These could serve as intronic splicing enhancers. All the alternative splice forms can be accessed at http://bioinfolab.miamioh.edu/cgi-bin/PASA_r20140417/cgi-bin/status_report.cgi?db=Chre_AS.
Similar content being viewed by others
Abbreviations
- 5′ss:
-
5′ Splice site
- 3′ss:
-
3′ Splice site
- CDS:
-
Coding sequence
- ESE:
-
Exonic splicing enhancer
- ESS:
-
Exonic splicing silencer
- GFF3:
-
General feature format
- ISE:
-
Intronic splicing enhancer
- ISS:
-
Intronic splicing silencer
- nt:
-
Nucleotide
- PSSM:
-
Position specific scoring matrix
- NMD:
-
Non-mediated decay
- AS:
-
Alternative splicing
- PASA:
-
Program to assemble spliced alignment
References
Barbazuk W, Fu Y, McGinnis K (2008) Genome-wide analyses of alternative splicing in plants: opportunities and challenges. Genome Res 18:1381–1392. doi:10.1101/gr.053678.106
Barbosa-Morais NL, Carmo-Fonseca M, Aparício S (2006) Systematic genome-wide annotation of spliceosomal proteins reveals differential gene family expansion. Genome Res 16:66–77. doi:10.1101/gr.3936206
Barta A, Kalyna M, Lorković ZJ (2008) Plant SR proteins and their functions. In: Reddy ASN, Golovkin M (eds) Nuclear pre-mRNA processing in plants. Springer, Berlin, pp 83–102
Berget SM (1995) Exon recognition in vertebrate splicing. J Biol Chem 270:2411–2414
Blaby IK, Blaby-Haas CE, Tourasse N, Hom EFY, Lopez D, Aksoy M, Grossman A, Umen J, Dutcher S, Porter M, King S, Witman GB, Stanke M, Harris EH, Goodstein D, Grimwood J, Schmutz J, Vallon O, Merchant SS, Prochnik S (2014) The Chlamydomonas genome project: a decade on. Trends Plant Sci. doi:10.1016/j.tplants.2014.05.008
Black D (2003) Mechanisms of alternative pre-messenger RNA splicing. Annu Rev Biochem 72:291–336. doi:10.1146/annurev.biochem.72.121801.161720
Campbell M, Haas B, Hamilton J, Mount S, Buell C (2006) Comprehensive analysis of alternative splicing in rice and comparative analyses with Arabidopsis. BMC Genomics 7:327. doi:10.1186/1471-2164-7-327
Carvalho RF, Feijão CV, Duque P (2013) On the physiological significance of alternative splicing events in higher plants. Protoplasma 250:639–650. doi:10.1007/s00709-012-0448-9
Colak D, Ji S-J, Porse BT, Jaffrey SR (2013) Regulation of axon guidance by compartmentalized nonsense-mediated mRNA decay. Cell 153:1252–1265. doi:10.1016/j.cell.2013.04.056
Crooks G, Hon G, Chandonia J, Brenner S (2004) WebLogo: a sequence logo generator. Genome Res 14:1188–1190. doi:10.1101/gr.849004
De Conti L, Baralle M, Buratti E (2013) Exon and intron definition in pre-mRNA splicing. WIREs RNA 4:49–60. doi:10.1002/wrna.1140
Dirksen WP, Sun Q, Rottman FM (1995) Multiple splicing signals control alternative intron retention of bovine growth hormone pre-mRNA. J Biol Chem 270:5346–5352. doi:10.1074/jbc.270.10.5346
Falciatore A, Merendino L, Barneche F, Ceol M, Meskauskiene R, Apel K, Rochaix J (2005) The FLP proteins act as regulators of chlorophyll synthesis in response to light and plastid signals in Chlamydomonas. Genes Dev 19:176–187
Ferris P, Olson BJSC, Hoff PLD, Douglass S, Casero D, Prochnik S, Geng S, Rai R, Grimwood J, Schmutz J, Nishii I, Hamaji T, Nozaki H, Pellegrini M, Umen JG (2010) Evolution of an expanded sex-determining locus in Volvox. Science 328:351–354. doi:10.1126/science.1186222
Filichkin SA, Priest HD, Givan SA, Shen R, Bryant DW, Fox SE, Wong W-K, Mockler TC (2010) Genome-wide mapping of alternative splicing in Arabidopsis thaliana. Genome Res 20:45–58. doi:10.1101/gr.093302.109
Fox-Walsh KL, Dou Y, Lam BJ, Hung S, Baldi PF, Hertel KJ (2005) The architecture of pre-mRNAs affects mechanisms of splice-site pairing. PNAS 102:16176–16181. doi:10.1073/pnas.0508489102
Fukuzawa H, Miura K, Ishizaki K, Kucho K, Saito T, Kohinata T, Ohyama K (2001) Ccm1, a regulatory gene controlling the induction of a carbon-concentrating mechanism in Chlamydomonas reinhardtii by sensing CO2 availability. PNAS 98:5347–5352. doi:10.1073/pnas.081593498
Ge Y, Porse BT (2014) The functional consequences of intron retention: alternative splicing coupled to NMD as a regulator of gene expression. Bioessays 36:236–243. doi:10.1002/bies.201300156
Goren A, Ram O, Amit M, Keren H, Lev-Maor G, Vig I, Pupko T, Ast G (2006) Comparative analysis identifies exonic splicing regulatory sequences—the complex definition of enhancers and silencers. Mol Cell 22:769–781. doi:10.1016/j.molcel.2006.05.008
Guo S, Zheng Y, Joung J-G, Liu S, Zhang Z, Crasta OR, Sobral BW, Xu Y, Huang S, Fei Z (2010) Transcriptome sequencing and comparative analysis of cucumber flowers with different sex types. BMC Genomics 11:384. doi:10.1186/1471-2164-11-384
Haas BJ, Delcher AL, Mount SM, Wortman JR, Jr RKS, Hannick LI, Maiti R, Ronning CM, Rusch DB, Town CD, Salzberg SL, White O (2003) Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic Acids Res 31:5654–5666. doi:10.1093/nar/gkg770
Kalyna M, Lopato S, Voronin V, Barta A (2006) Evolutionary conservation and regulation of particular alternative splicing events in plant SR proteins. Nucleic Acids Res 34:4395–4405. doi:10.1093/nar/gkl570
Kent W (2002) BLAT—the BLAST-like alignment tool. Genome Res 12:656–664
Keren H, Lev-Maor G, Ast G (2010) Alternative splicing and evolution: diversification, exon definition and function. Nat Rev Genet 11:345–355. doi:10.1038/nrg2776
Khodor YL, Menet JS, Tolan M, Rosbash M (2012) Cotranscriptional splicing efficiency differs dramatically between Drosophila and mouse. RNA 18:2174–2186. doi:10.1261/rna.034090.112
Kis M, Jakab G, Pollak T, Branlant C, Solymosy F (1993) Nucleotide sequence of U1 RNA from a green alga, Chlamydomonas reinhardtii. Nucleic Acids Res 21:2255
Labadorf A, Link A, Rogers M, Thomas J, Reddy A, Ben-Hur A (2010) Genome-wide analysis of alternative splicing in Chlamydomonas reinhardtii. BMC Genomics 11:114. doi:10.1186/1471-2164-11-114
Lawrence M, Huber W, Pagès H, Aboyoun P, Carlson M, Gentleman R, Morgan MT, Carey VJ (2013) Software for computing and annotating genomic ranges. PLoS Comput Biol 9:e1003118. doi:10.1371/journal.pcbi.1003118
Lejeune F, Cavaloc Y, Stevenin J (2001) Alternative splicing of intron 3 of the serine/arginine-rich protein 9G8 gene. Identification of flanking exonic splicing enhancers and involvement of 9G8 as a trans-acting factor. J Biol Chem 276:7850–7858. doi:10.1074/jbc.M009510200
Li JB, Lin S, Jia H, Wu H, Roe BA, Kulp D, Stormo GD, Dutcher SK (2003) Analysis of Chlamydomonas reinhardtii genome structure using large-scale sequencing of regions on linkage groups I and III. J Eukaryot Microbiol 50:145–155
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R (2009) The sequence alignment/map format and SAMtools. Bioinformatics 25:2078–2079. doi:10.1093/bioinformatics/btp352
Liang C, Wang G, Liu L, Ji G, Liu Y, Chen J, Webb JS, Reese G, Dean JFD (2007) WebTraceMiner: a web service for processing and mining EST sequence trace files. Nucleic Acids Res 35:W137–W142. doi:10.1093/nar/gkm299
Lim LP, Burge CB (2001) A computational analysis of sequence features involved in recognition of short introns. Proc Natl Acad Sci 98:11193–11198. doi:10.1073/pnas.201407298
Loke JC, Stahlberg EA, Strenski DG, Haas BJ, Wood PC, Li QQ (2005) Compilation of mRNA polyadenylation signals in Arabidopsis revealed a new signal element and potential secondary structures. Plant Physiol 138:1457–1468. doi:10.1104/pp.105.060541
Lu T, Lu G, Fan D, Zhu C, Li W, Zhao Q, Feng Q, Zhao Y, Guo Y, Li W, Huang X, Han B (2010) Function annotation of the rice transcriptome at single-nucleotide resolution by RNA-sEq. Genome Res 20:1238–1249. doi:10.1101/gr.106120.110
Macknight R, Duroux M, Laurie R, Dijkwel P, Simpson G, Dean C (2002) Functional significance of the alternative transcript processing of the Arabidopsis floral promoter FCA. Plant Cell 14:877–888. doi:10.1105/tpc.010456
Mao R, Raj Kumar PK, Guo C, Zhang Y, Liang C (2014) Comparative analyses between retained introns and constitutively spliced introns in Arabidopsis thaliana using random forest and support vector machine. PLoS ONE 9:e104049. doi:10.1371/journal.pone.0104049
Marquez Y, Brown J, Simpson C, Barta A, Kalyna M (2012) Transcriptome survey reveals increased complexity of the alternative splicing landscape in Arabidopsis. Genome Res 22:1184–1279. doi:10.1101/gr.134106.111
Matlin AJ, Clark F, Smith CWJ (2005) Understanding alternative splicing: towards a cellular code. Nat Rev Mol Cell Biol 6:386–398. doi:10.1038/nrm1645
McCullough AJ, Berget SM (1997) G triplets located throughout a class of small vertebrate introns enforce intron borders and regulate splice site selection. Mol Cell Biol 17:4562–4571
McCullough AJ, Berget SM (2000) An intronic splicing enhancer binds U1 snRNPs to enhance splicing and select 5′ splice sites. Mol Cell Biol 20:9225–9235
Merchant S, Prochnik S, Vallon O, Harris E, Karpowicz S, Witman G, Terry A, Salamov A, Fritz-Laylin L, Marechal-Drouard L et al (2007) The Chlamydomonas genome reveals the evolution of key animal and plant functions. Science 318:245. doi:10.1126/science.1143609
Michael IP, Kurlender L, Memari N, Yousef GM, Du D, Grass L, Stephan C, Jung K, Diamandis EP (2005) Intron retention: a common splicing event within the human kallikrein gene family. Clin Chem 51:506–515. doi:10.1373/clinchem.2004.042341
Moseley JL, Page MD, Alder NP, Eriksson M, Quinn J, Soto F, Theg SM, Hippler M, Merchant S (2002) Reciprocal expression of two candidate di-iron enzymes affecting photosystem I and light-harvesting complex accumulation. Plant Cell 14:673–688. doi:10.1105/tpc.010420
Nussinov R (1989) Conserved signals around the 5′ splice sites in eukaryotic nuclear precursor mRNAs: G-runs are frequent in the introns and C in the exons near both 5′ and 3′ splice sites. J Biomol Struct Dyn 6:985–1000. doi:10.1080/07391102.1989.10506526
Pages H, Gentleman R, Aboyoun P, DebRoy S (2008) Biostrings: string objects representing biological sequences, and matching algorithms. R Package Version 2:160
Pan Q, Shai O, Lee LJ, Frey BJ, Blencowe BJ (2008) Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing. Nat Genet 40:1413–1415. doi:10.1038/ng.259
Quesada V, Macknight R, Dean C, Simpson GG (2003) Autoregulation of FCA pre-mRNA processing controls Arabidopsis flowering time. EMBO J 22:3142–3152. doi:10.1093/emboj/cdg305
Reddy A (2001) Nuclear pre-mRNA splicing in plants. Crit Rev Plant Sci 20:523–571
Reddy A (2007) Alternative splicing of pre-messenger RNAs in plants in the genomic era. Annu Rev Plant Biol 58:267–361. doi:10.1146/annurev.arplant.58.032806.103754
Reddy ASN, Rogers MF, Richardson DN, Hamilton M, Ben-Hur A (2012) Deciphering the plant splicing code: experimental and computational approaches for predicting alternative splicing and splicing regulatory elements. Front Plant Sci. doi:10.3389/fpls.2012.00018
Robinson JT, Thorvaldsdóttir H, Winckler W, Guttman M, Lander ES, Getz G, Mesirov JP (2011) Integrative genomics viewer. Nat Biotechnol 29:24–26. doi:10.1038/nbt.1754
Romano M, Marcucci R, Baralle FE (2001) Splicing of constitutive upstream introns is essential for the recognition of intra-exonic suboptimal splice sites in the thrombopoietin gene. Nucleic Acids Res 29:886–894. doi:10.1093/nar/29.4.886
Romiguier J, Ranwez V, Douzery EJP, Galtier N (2010) Contrasting GC-content dynamics across 33 mammalian genomes: relationship with life-history traits and chromosome sizes. Genome Res 20:1001–1009. doi:10.1101/gr.104372.109
Sakabe NJ, Souza SJ de (2007) Sequence features responsible for intron retention in human. BMC Genomics 8:59. doi:10.1186/1471-2164-8-59
Sakharkar MK, Perumal BS, Sakharkar KR, Kangueane P (2005) An analysis on gene architecture in human and mouse genomes. In Silico Biol 5:347–365
Sammeth M, Foissac S, Guigó R (2008) A general definition and nomenclature for alternative splicing events. PLoS Comput Biol 4:e1000147. doi:10.1371/journal.pcbi.1000147
Schroda M, Vallon O, Whitelegge JP, Beck CF, Wollman FA (2001) The chloroplastic GrpE homolog of chlamydomonas two isoforms generated by differential splicing. Plant Cell 13:2823–2839. doi:10.1105/tpc.010202
Seo PJ, Kim MJ, Ryu J-Y, Jeong E-Y, Park C-M (2011) Two splice variants of the IDD14 transcription factor competitively form nonfunctional heterodimers which may regulate starch metabolism. Nat Commun 2:303. doi:10.1038/ncomms1303
Shen Y, Ji G, Haas BJ, Wu X, Zheng J, Reese GJ, Li QQ (2008) Genome level analysis of rice mRNA 3′-end processing signals and alternative polyadenylation. Nucleic Acids Res 36:3150–3161. doi:10.1093/nar/gkn158
Simpson CG, Thow G, Clark GP, Jennings SN, Watters JA, Brown JW (2002) Mutational analysis of a plant branchpoint and polypyrimidine tract required for constitutive splicing of a mini-exon. RNA 8:47–56. doi:10.1017/S1355838202015546
Šmarda P, Bureš P, Šmerda J, Horová L (2012) Measurements of genomic GC content in plant genomes with flow cytometry: a test for reliability. New Phytol 193:513–521. doi:10.1111/j.1469-8137.2011.03942.x
Stamm S, Zhu J, Nakai K, Stoilov P, Stoss O, Zhang MQ (2000) An alternative-exon database and its statistical analysis. DNA Cell Biol 19:739–756. doi:10.1089/104454900750058107
Sterner DA, Berget SM (1993) In vivo recognition of a vertebrate mini-exon as an exon-intron-exon unit. Mol Cell Biol 13:2677–2687. doi:10.1128/MCB.13.5.2677
Syed NH, Kalyna M, Marquez Y, Barta A, Brown JWS (2012) Alternative splicing in plants—coming of age. Trends Plant Sci 17:616–623. doi:10.1016/j.tplants.2012.06.001
Talerico M, Berget SM (1994) Intron definition in splicing of small Drosophila introns. Mol Cell Biol 14:3434–3445. doi:10.1128/MCB.14.5.3434
Tian B, Pan Z, Lee JY (2007) Widespread mRNA polyadenylation events in introns indicate dynamic interplay between polyadenylation and splicing. Genome Res 17:156–165. doi:10.1101/gr.5532707
Wang B, Brendel V (2006) Genomewide comparative analysis of alternative splicing in plants. PNAS 103:7175–7180. doi:10.1073/pnas.0602039103
Wong JJ-L, Ritchie W, Ebner OA, Selbach M, Wong JWH, Huang Y, Gao D, Pinello N, Gonzalez M, Baidya K, Thoeng A, Khoo T-L, Bailey CG, Holst J, Rasko JEJ (2013) Orchestrated intron retention regulates normal granulocyte differentiation. Cell 154:583–595. doi:10.1016/j.cell.2013.06.052
Wu TD, Watanabe CK (2005) GMAP: a genomic mapping and alignment program for mRNA and EST sequences. Bioinformatics 21:1859–1875. doi:10.1093/bioinformatics/bti310
Zhang MQ (1998) Statistical features of human exons and their flanking regions. Hum Mol Genet 7:919–932. doi:10.1093/hmg/7.5.919
Zhang G, Guo G, Hu X, Zhang Y, Li Q, Li R, Zhuang R, Lu Z, He Z, Fang X, Chen L, Tian W, Tao Y, Kristiansen K, Zhang X, Li S, Yang H, Wang J, Wang J (2010) Deep RNA sequencing at single base-pair resolution reveals high complexity of the rice transcriptome. Genome Res 20:646–654. doi:10.1101/gr.100677.109
Zhao Z, Wu X, Kumar PKR, Dong M, Ji G, Li QQ, Liang C (2014) Bioinformatics analysis of alternative polyadenylation in green alga Chlamydomonas reinhardtii using transcriptome sequences from three different sequencing platforms. G3 (Bethesda) 4:871–883. doi:10.1534/g3.114.010249
Zheng CL, Fu X-D, Gribskov M (2005) Characteristics and regulatory elements defining constitutive splicing and different modes of alternative splicing in human and mouse. RNA 11:1777–1787. doi:10.1261/rna.2660805
Acknowledgements
Funding for this project is provided by a grant award from the Ohio Plant Biotechnology Consortium to CL. This work was partially supported by the National Institutes of Health (1R15GM94732-1 A1 to CL) and Centre National de la Recherche Scientifique (ANR-11-LABX-0011-DYNAMO to OV). The authors thank Mario Stanke, Lin Liu and Trey Moler for their participation in this project and Marina Cavaioulo for giving access to her mapping of Illumina reads.
Authors’ contributions
CL managed and coordinated the project. PKRK and CL conceived the study. PKRK carried out implementation and drafted the manuscript. OV conducted SR protein analysis and individual intron inspection and helped devise validation tests. All authors participated in manuscript writing.
Author information
Authors and Affiliations
Corresponding authors
Electronic supplementary material
Below is the link to the electronic supplementary material.
11103_2017_605_MOESM1_ESM.tif
Supplemental Fig. 1 Example of alternative splicing event supported only by Augustus annotation. a. Splicing graph of the gene structure as viewed in PASA website [http://bioinfolab.miamioh.edu/cgi-bin/PASA_r20140417/cgi-bin/assembly_alt_splice_info.cgi?db=Chre_AS&cdna_acc=asmbl_94&SHOW_ALL=1&SHOW_ALIGNMENTS]. b and c show alternative splicing forms “alternative acceptor” and “retained intron” as inferred from comparing two isoforms called asmbl_94 and asmbl_95. Splice sites are shown in light blue, exons in black, coding regions as predicted by PASA in red, and spliced out introns by the thin connecting lines. Highlighted purple box denote the change between isoforms. Both isoforms asmbl_94 and asmbl_95 are supported by only Augustus annotations “Cre01.g003100.t1.3” and “Cre01.g003100.t2.1” respectively. An intron retained between the coding exons is denoted as CDS retained intron (TIF 458 KB)
11103_2017_605_MOESM2_ESM.xlsx
Supplemental Table 1 Chlamydomonas SR proteins. SR protein family assignments were based on (Barbosa-Morais et al. 2006; Barta et al. 2008). RRM: RNA recognition motif; RS: arginine/serine-rich region; ZnK: Zinc knuckle domain; SWAP/Surp: suppressor-of-white-apricot domain; PWI: Pro-Trp-Ile domain. Parentheses within a “Known Domains” field entry indicate a weak domain feature. Au9 gene models, annotations, and corresponding browser links showing PASA transcript models are provided in the right column. (XLSX 13 KB)
11103_2017_605_MOESM3_ESM.xlsx
Supplemental Table 2 Retained introns and their validation scores. Retained intron found in the PASA model with their genomic coordinates, validation scores and results of individual inspection where available. (XLSX 112 KB)
Rights and permissions
About this article
Cite this article
Raj-Kumar, PK., Vallon, O. & Liang, C. In silico analysis of the sequence features responsible for alternatively spliced introns in the model green alga Chlamydomonas reinhardtii . Plant Mol Biol 94, 253–265 (2017). https://doi.org/10.1007/s11103-017-0605-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11103-017-0605-9