Genome-wide transcriptome analysis shows extensive alternative RNA splicing in the zoonotic parasite Schistosoma japonicum

Piao, Xianyu; Hou, Nan; Cai, Pengfei; Liu, Shuai; Wu, Chuang; Chen, Qijun

doi:10.1186/1471-2164-15-715

Genome-wide transcriptome analysis shows extensive alternative RNA splicing in the zoonotic parasite Schistosoma japonicum

Research article
Open access
Published: 26 August 2014

Volume 15, article number 715, (2014)
Cite this article

Download PDF

You have full access to this open access article

BMC Genomics Aims and scope Submit manuscript

Genome-wide transcriptome analysis shows extensive alternative RNA splicing in the zoonotic parasite Schistosoma japonicum

Download PDF

Xianyu Piao¹,
Nan Hou¹,
Pengfei Cai¹,
Shuai Liu¹,
Chuang Wu¹ &
…
Qijun Chen^1,2

3493 Accesses
14 Citations
2 Altmetric
Explore all metrics

Abstract

Background

Schistosoma japonicum is a pathogen of the phylum Platyhelminthes that causes zoonotic schistosomiasis in China and Southeast Asian countries where a lack of efficient measures has hampered disease control. The development of tools for diagnosis of acute and chronic infection and for novel antiparasite reagents relies on understanding the biological mechanisms that the parasite exploits.

Results

In this study, the polyadenylated transcripts from the male and female S. japonicum were sequenced using a high-throughput RNA-seq technique. Bioinformatic and experimental analyses focused on post-transcriptional RNA processing, which revealed extensive alternative splicing events in the adult stage of the parasite. The numbers of protein-coding sequences identified in the transcriptomes of the female and male S. japonicum were 15,939 and 19,501 respectively, which is more than predicted from the annotated genome sequence. Further, we identified four types of post-transcriptional processing, or alternative splicing, in both female and male worms of S. japonicum: exon skipping, intron retention, and alternative donor and acceptor sites. Unlike mammalian organisms, in S. japonicum, the alternative donor and acceptor sites were more common than the other two types of post-transcriptional processing. In total, respectively 13,438 and 16,507 alternative splicing events were predicted in the transcriptomes of female and male S. japonicum.

Conclusions

By using RNA-seq technology, we obtained the global transcriptomes of male and female S. japonicum. These results further provide a comprehensive view of the global transcriptome of S. japonicum. The findings of a substantial level of alternative splicing events dynamically occurring in S. japonicum parasitization of mammalian hosts suggest complicated transcriptional and post-transcriptional regulation mechanisms employed by the parasite. These data should not only significantly improve the re-annotation of the genome sequences but also should provide new information about the biology of the parasite.

The Schistosoma mansoni genome encodes thousands of long non-coding RNAs predicted to be functional at different parasite life-cycle stages

Article Open access 05 September 2017

De novo transcriptomic analysis of the female and male adults of the blood fluke Schistosoma turkestanicum

Article Open access 11 March 2016

Step-by-Step Bioinformatics Analysis of Schistosoma mansoni Long Non-coding RNA Sequences

Background

Human schistosomiasis, which is second only to malaria in terms of morbidity and mortality, is a chronic debilitating disease caused by infections of Schistosoma species that vary depending on the endemic region of the parasites [1]. Three principal Schistosoma species can infect humans and cause severe diseases: Schistosoma japonicum, Schistosoma mansoni, and Schistosoma haematobium. S. japonicum is the causative agent of zoonotic schistosomiasis, affecting millions of people in several East and Southeast Asian countries. Despite the availability of a highly effective chemotherapeutic drug (Praziquantel), the high re-infection rates in humans and animals plus the requirement of frequent administration of the agent still limits the overall success of chemotherapy and disease control efforts. Novel targets for drug and vaccine development remain to be defined for optimal treatment and disease prevention; however, the lack of knowledge about this parasite’s biology remains a hurdle. Schistosoma parasites can persist in a mammalian host for decades in the presence of the host immune system, and current knowledge about the mechanism of parasitization is still fragmented. What is known is that the successful host-evasion mechanisms of the parasite involve the inert tegument that covers the surface in most developmental stages, the recruitment of host components to the surface, and the expression of various antigens and immune-regulating factors [2–5].

Schistosoma parasites have a complicated developmental and biological cycle. They are among the few platyhelminth parasites to adopt a dioecious lifestyle and possess heteromorphic sex chromosomes. The genome of S. japonicum contains eight pairs of chromosomes comprising seven pairs of autosomes and one pair of sexual chromosomes, with an estimated 397 Mb containing primarily 13,469 protein-coding sequences [6, 7] that account for 4% of the genome. The decoding and availability of the genome sequences of the three most pathogenic parasites, S. mansoni, S. japonicum, and S. haematobium, has proved pivotal for the systematic dissection of the parasite biology [7–10].

Deep transcriptome sequencing (also called RNA-Seq) with next-generation sequencing technologies has provided unprecedented opportunities to investigate the genome-wide transcriptional property of many species [11–14]. This technique allows for the survey of the entire transcriptome in a very high-throughput and quantitative manner, making it possible to identify exons and introns, map their boundaries and the 5′ and 3′ ends of genes, and understand the complexity of genome organization and activity comprehensively. A majority of eukaryotic protein-coding genes contain intron sequences that must be removed by splicing after transcription from the DNA templates. However, some pre-mRNAs can be processed alternatively by the splicing out or retention of the transcript regions of exons or introns. This alternative splicing allows individual genes to produce two or more variant mRNA templates, which in many cases encode functionally distinct proteins. Alternative splicing is an integrated process in regulation of gene transcription and expression and results in structural and functional diversity of molecules [15]. Because of the powerful readout of RNA-seq, which can generate many sequence reads that span exon–exon junctions, RNA transcripts generated from different splicing events can be identified [12, 13, 16]. So far, five basic modes of alternative splicing are generally recognized: exon skipping (ES), mutually exclusive exons, alternative donor sites (ADS), alternative acceptor sites (AAS), and intron retention (IR) [17, 18]. ES, also called exon cassette, indicates that an exon is spliced out from the primary transcript and occurs most commonly in mammalian cells [19]. In the event of mutually exclusive exons, only one of the two exons is retained in mRNAs after splicing. An ADS results when an alternative 5′ splice junction (called the donor site) is used, leading to a change in the 3′ boundary of the upstream exon. An AAS arises when an alternative 3′ splice junction (acceptor site) is used, leading to a change in the 5′ boundary of the downstream exon (Figure 1). IR occurs when a sequence is spliced out as an intron or simply retained and is distinguished from ES because introns do not flank the retained sequence. The retained transcript of the intron region in most cases encodes amino acids in-frame with the neighboring exons [19]. Recent results have suggested that schistosomes create multiple protein variants by splicing micro-exon gene transcripts, which might be involved in immune evasion mechanisms [20]; however, the general feature of alternative splicing in the parasites remains understudied. Here, we investigated the alternative splicing of transcripts in both male and female S. japonicum after deep RNA sequencing. We found that the gene transcripts were diversely processed and that four types of RNA splicing were identifiable after transcription of the genome.

Methods

Parasites and RNA purification

Schistosoma japonicum–infected Oncomelania hupensis were purchased from Jiangxi Institute of Parasitic Disease, Nanchang, China. Cercariae were freshly shed from the infected snails. One New Zealand white female rabbit was percutaneously infected with ~1,500 S. japonicum cercariae, as described previously [21]. Mature adult parasites were isolated at 6 weeks post-infection from the rabbit by flushing the blood vessels with phosphate-buffered saline, as described previously [5, 22–24]. Male and female parasites were manually separated with the aid of a light microscope. Total RNA from the parasites was purified with Trizol reagent (Invitrogen, CA, USA), and contaminating genomic DNA was removed using the RNase-Free DNase Set (Qiagen, Germany). RNA quantification and quality were examined with a Nanodrop ND-1000 spectrophotometer (Nanodrop Technologies, Wilmington, DE, USA) and standard agarose gel electrophoresis. All RNA samples were stored at -80°C until use.

Library preparation and sequencing

Polyadenylated RNA samples from adult male and female S. japonicum parasites were isolated from total RNA using oligo-(dT) conjugated magnetic beads (Dynabeads®, Invitrogen, CA, USA). The mRNA was interrupted into short fragments by adding the fragmentation buffer provided by the manufacturer (Illumina RNA-seq kit, part no. 1004898). With these short fragments as templates, random hexamer primers were used to synthesize the first-strand cDNA. The second-strand cDNA was synthesized using buffer, dNTPs, RNase H, and DNA polymerase I, respectively. Short fragments were purified following instructions accompanying the kit (QiaQuick PCR Purification Kit, Qiagen, Germany), and double-stranded cDNAs were end-repaired according to manufacturer-recommended protocols, followed by connection with Illumina adapters (Illumina RNA-seq kit, part no. 1004898). The fragments were first amplified by PCR. Purified cDNA fragments were pooled and indexed and loaded onto one lane of an Illumina GA IIX flow cell. A total of 75 pair-end sequencing cycles were carried out. Cluster formation, primer hybridization, and pair-end sequencing were performed according to the provided protocols [25].

Sequence analysis

Low-quality reads (more than half of the bases had a quality value less than 5), reads in which unknown bases represented more than 10%, and adapter sequences were removed from the reads, and the clean reads were mapped onto the S. japonicum genome of SGST, (http://lifecenter.sgst.cn/schistosoma/en/schdownload.do) by TopHat (version v2.0.4; default parameters were used) [26], then assembled with Cufflinks (version v2.0.2) [27] to construct unique transcript sequences using the parameter: -g –b –u –o (-g/–GTF-guide: use reference transcript annotation to guide assembly; –b/–frag-bias-correct: use bias correction-reference fasta required; –u/–multi-read-correct: use ‘rescue method’ for multi-reads; –o/–output-dir: write all output files to this directory). The Cufflinks assembler is freely available at http://cufflinks.cbcb.umd.edu/. Cuffcompare [27] was used to compare the assembled transcripts of each library to the referenced annotated genes and build a non-redundant transcript dataset among the libraries. Then, Cuffdiff was used to find significant changes in gene expression level [27]. We used FDR to correct P values and obtained Q values; for Q value ≤5%, we considered the genes to be differentially expressed (Additional file 1). Several Perl scripts were written to summarize the splicing forms in each library. The following algorithms were used to detect alternative splicing events. First, junction sites, which give information about boundaries and combinations of different exons in a transcript, were detected by TopHat (with all default parameters). Then, all junction sites of the same gene were used to distinguish the type of alternative splicing event [26] (Additional file 2: Figure S1 and Figure 1).

Functional annotation and classification

Transcripts were first compared using the Kyoto Encyclopedia of Genes and Genomes database (KEGG, release 58) [16] with BLASTX [28] at E values ≤ 1e-10. A Perl script was used to retrieve KO (KEGG Ontology) information from the Blast result and establish pathway associations between UniGene and the database.

InterPro [29] domains were annotated by InterProScan [30] (Release 27.0), and functional assignments were mapped onto Gene Ontology (GO) [31]. WEGO [32] was employed to do GO classification and draw the GO tree. The significance analysis of functional pathways was performed using IDEG6 [33].

To identify pseudogenes in the S. japonicum genome, we used PseudoPipe [34]. The assembled transcripts that fell into or included the predicted position of pseudogenes were designated as pseudogenes. WEGO was used for the GO classification.

Non-coding RNA annotation

Rfam [35] (Release 10.1) databases were used to annotate the non-coding transcripts. The assembled novel transcripts were compared to Rfam by Blast at E values ≤ 1e-10.

Verification of alternative splicing transcripts by RT-PCR and sequencing

Genomic DNA of S. japonicum (adult male and female worms) was purified with the DNeasy Blood & Tissue Kit (Qiagen, Germany) according to the manufacturer’s instructions. Total RNA was prepared using TRIzol reagent (Invitrogen), as previously described [21], and contaminating genomic DNA was removed with the RNase-Free DNase Set (Ambion). PCR was conducted in triplicate, and each reaction involved 35 amplification cycles on an Applied Biosystems 9700 PCR system (Applied Biosystems, Foster City, CA, USA). The 20 μl reaction system contained 50 ng of total RNA (50 ng RNA was used for the first-strand synthesis step) or 80 ng DNA, 0.5 μM of each primer, and 10 μl of Premix Ex Taq (version 2.0, TaKaRa). The reaction conditions were as follows: 94°C for 3 min; 35 cycles of 94°C, 30 s; 55°C, 30 s; and 72°C, 90 s; and then 10 min at 72°C. An 8 μl aliquot of each PCR sample was then subjected to electrophoresis in a 1.5% agarose gel. The RT-PCR primer sequences are listed in Additional file 3: Table S1.

Results and discussion

Identification of a large number of novel transcripts from un-annotated genome loci following deep sequencing of the S. japonicumtranscriptome

In this study, we determined the transcriptomes of male and female adult worms of S japonicum by high-throughput RNA-seq with poly-A–purified RNA samples. A total of 8,112,913 and 8,260,474 paired reads were obtained, with a total length of 1,216,936,950 and 1,239,071,100 bp from female and male worms, respectively (Table 1, Additional file 4). The number of predicted genes of female and male worms of S. japonicum was 15,939 and 19,501, respectively, which was more than that predicted based on the genome sequence [6, 7]. Of the 15,939 genes predicted in the female parasite, a total of 10,087 were known and 5,852 were novel, while of the 19,501 predicted genes in the male parasite, a total of 10,469 were known and 9,032 were novel. The number of predicted transcripts in the two libraries of female and male parasites was 21,009 and 25,706, respectively, with 14,301 known and 6,708 novel transcripts in females and 18,931 known and 6,775 novel transcripts in males. The finding of so many novel transcripts should assist with the upgrade or reassembling of the genome sequence of S. japonicum [2]. However, the novel transcripts may also be generated by alternative post-transcriptional RNA processing or alternative splicing [36]. Indeed, we found 3,905 multi-transcription loci in female and 4,677 in male parasites, with about 1.32 transcripts per locus in both sexes (Table 1). Thus, the general transcription activity of both sexes was diverse but in a similar manner, confirming earlier studies [21, 36, 37]. However, the sequence data generated from this study was much more than that of earlier studies [37]. This was likely due to the approach applied in this study which is technically advanced than the digital gene expression technique. All sequence data have been deposited in the database (http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?) with an accession number of GSE58564.The sequence reads can be classified into four types: exons, introns, intergenic, and spliced. The proportions of transcripts in female parasites from exons, introns, and intergenic loci were 56%, 7%, and 24%, respectively, and 13% of the transcripts were generated by alternative splicing (Figure 2). Similarly, in male parasites, the percentages of transcripts from exons, introns, and intergenic loci were 57%, 7%, and 22% respectively; 14% of the transcripts were generated by alternative splicing (Figure 2).

Table 1 Summary data of the transcriptome analysis

Full size table

Alternative splicing in S. japonicum

Four types of alternative splicing in both female and male worms of S. japonicum (Table 2 and Additional file 5) were identified, including ES, IR, ADS, and AAS. Of the alternative splicing events, AAS and ADS were more common than the other two types, suggesting that the gene regulation mechanism of the Schistosoma parasite is diverged from that of the mammalian taxa, in which ES has been more commonly observed [19]. In female S. japonicum, a total of 13,438 alternative splicing events were bioinformatically predicted while in male worms, a total of 16,507 were predicted (Table 2). The percentage of different alternative splicing events was similar between the two sexes (Table 2); however, the genes undergoing alternative splicing were not necessarily the same between them (Additional file 5).

Table 2 Statistics for alternative splicing events

Full size table

To confirm experimentally the prediction of the alternative splicing events in the bioinformatic analysis, eight transcripts in which alternatively spliced fragments were more than 100 bp were chosen randomly. Transcripts generated by ES skipping of five genes and transcripts generated by IR of three genes were validated by PCR and RT-PCR. The five genes with ES activity included one that encodes the protein C14orf166 homolog; a novel gene; S. japonicum Zinc finger CCCH domain-containing protein 5; S. japonicum cell division cycle and apoptosis regulator protein 1; and S. japonicum protein phosphatase 1 regulatory subunit SDS22. The three genes with IR were respectively beta-amyloid binding protein (Sjc_0025470), S. japonicum IPR001478 PDZ/DHR/GLGF domain-containing protein, and deoxyribodipyrimidine photolyase. The amplicons of all RT-PCR reactions were cloned and sequenced and were correlated with the predicated alternative splicing events (Figures 3 and 4), suggesting that the bioinformatic prediction based on the primary sequencing data was reliable.

Functional category of alternatively spliced genes in S. japonicum

After mapping of the RNA-Seq reads to the S. japonicum reference genome, transcripts were assembled and their relative abundances were calculated. Cuffdiff was used to find significant changes in gene level expression in the two libraries [27]. Genes subject to alternative splicing and showing significant differences in expression in the two libraries are listed in Table 3 and Additional file 6: Figure S2. Genes related to the function of genetic information processing were found to be more biased to the female parasite while genes related to the environmental information processing were more active in the male parasites (Additional file 7). These data reflect the biology of the two sexes of the parasite. The female parasites, which are kept in the cavity of the male parasites, are more active in the reproduction process while the male parasites are principally responsible in the host–parasite interaction.

Table 3 GO classification statistics of alternatively processed genes that were differentially expressed in female and male S. japonicum

Full size table

Identification of novel transcripts from intergenic regions and previously determined pseudogenes

One of the advantages of transcriptomic analysis is that it allows identification of novel transcripts that may not be predicted based on genomic sequences. The novel sequences can thus provide a powerful tool for re-annotation of the genome of S. japonicum, which has been poorly assembled (7). Of the novel polyadenylated sequences, two classes of transcripts have been identified: one that does not map either to regions of the genome corresponding to annotated genes or to the untranslated regions, and another that maps to previously annotated pseudogenes [7]. We identified 9,286 novel transcripts that completely matched the previously annotated intergenic regions of the genome. The length of these transcripts was from 74 to 166,115 bp, with an average length of 1,965 bp (Additional file 8). It has been reported that the S. mansoni genome contains many small open reading frames (8). Our results indicated that the small transcripts derived from both intergenic and "pseudogenes" in S. japonicum may encode important functions, as reported for the human genome [38]. Further, 31% (2,851 sequences) of these transcripts had at least one complete open reading frame that could be translated into proteins; the other 69% were not annotated (Additional file 8). Thus, of the 9,286 novel transcripts, at least 2,851 genomic sequences corresponding to the transcripts can be re-annotated as protein-coding genes. A total of 239 transcripts were mapped to the non-coding RNA database of the Rfam [39–43], which is also frequently used as a source of high-quality alignments for training and benchmarking RNA sequence analysis software tools. These transcripts were found as either microRNAs or ribosomal or other non-coding RNAs (Figure 5, Additional files 9 and 10). They were likely the contaminated sequences which were not completely depleted during mRNA purification process.

Furthermore, among the 9,286 novel transcripts, we detected 1,392 that were derived from pseudogenes; of these, 690 were derived from annotated pseudogenes, and the rest were from unannotated pseudogenes (Additional files 9 and 10). Pseudogenes can be transcribed from either direction, which contributed to the templates for generation of small endogenous interfering RNAs in S. japonicum, which is more common in transposable elements [21–23]. The identification of these pseudogene-derived transcripts suggested that all sequences are polyadenylated and that there is no discrimination between coding and non-coding transcripts in the RNA polyadenylation process in Schistosoma parasites. On the other hand, complete reading frames were indeed identified in a number of pseudogene-derived transcripts that encoded proteins with known functions (Figure 6). Thus, these "pseudogene" genes can be re-annotated as protein-coding genes.

Conclusions

In summary, by using RNA-seq technology, we obtained the global transcriptomes of male and female S. japonicum. Approximately 80% of the total reference genes (http://lifecenter.sgst.cn/schistosoma/en/schdownload.do) were expressed in the adult stage of the parasite, representing the majority of the transcriptomes. These results further provide a comprehensive view of the global transcriptome of S. japonicum. The findings of a substantial level of alternative splicing events dynamically occurring in the parasitization in the mammalian hosts of the S. japonicum suggest complicated transcriptional and post-transcriptional regulatory mechanisms employed by the parasite. The data should not only significantly improve the re-annotation of the genome sequences but also should provide new information about the biology of the parasite.

References

King CH, Dickman K, Tisch DJ: Reassessment of the cost of chronic helmintic infection: a meta-analysis of disability-related outcomes in endemic schistosomiasis. Lancet. 2005, 365 (9470): 1561-1569. 10.1016/S0140-6736(05)66457-4.
Article PubMed Google Scholar
Skelly PJ, Alan Wilson R: Making sense of the schistosome surface. Adv Parasitol. 2006, 63: 185-284.
Article PubMed Google Scholar
McIntosh RS, Jones FM, Dunne DW, McKerrow JH, Pleass RJ: Characterization of immunoglobulin binding by schistosomes. Parasite Immunol. 2006, 28 (9): 407-419. 10.1111/j.1365-3024.2006.00829.x.
Article CAS PubMed Google Scholar
Wu C, Cai P, Chang Q, Hao L, Peng S, Sun X, Lu H, Yin J, Jiang N, Chen Q: Mapping the binding between the tetraspanin molecule (Sjc23) of Schistosoma japonicum and human non-immune IgG. PLoS One. 2011, 6 (4): e19112-10.1371/journal.pone.0019112.
Article CAS PubMed Central PubMed Google Scholar
Cai P, Bu L, Wang J, Wang Z, Zhong X, Wang H: Molecular characterization of Schistosoma japonicum tegument protein tetraspanin-2: sequence variation and possible implications for immune evasion. Biochem Biophys Res Commun. 2008, 372 (1): 197-202. 10.1016/j.bbrc.2008.05.042.
Article CAS PubMed Google Scholar
Hirai H, Taguchi T, Saitoh Y, Kawanaka M, Sugiyama H, Habe S, Okamoto M, Hirata M, Shimada M, Tiu WU, Lai K, Upatham ES, Agatsuma T: Chromosomal differentiation of the Schistosoma japonicum complex. Int J Parasitol. 2000, 30 (4): 441-452. 10.1016/S0020-7519(99)00186-1.
Article CAS PubMed Google Scholar
Zhou Y, Zheng H, Chen Y, Zhang L, Wang K, Guo J, Huang Z, Zhang B, Huang W, Jin K, Dou T, Hasegawa M, Wang L, Zhang Y, Zhou J, Tao L, Cao Z, Li Y, Vinar T, Brejova B, Brown D, Li M, Miller DJ, Blair D, Zhong Y, Chen Z, Liu F, Hu W, Wang ZQ, Zhang QH, et al: The Schistosoma japonicum genome reveals features of host-parasite interplay. Nature. 2009, 460 (7253): 345-351. 10.1038/nature08140.
Article CAS PubMed Central Google Scholar
Berriman M, Haas BJ, LoVerde PT, Wilson RA, Dillon GP, Cerqueira GC, Mashiyama ST, Al-Lazikani B, Andrade LF, Ashton PD, Aslett MA, Bartholomeu DC, Blandin G, Caffrey CR, Coghlan A, Coulson R, Day TA, Delcher A, DeMarco R, Djikeng A, Eyre T, Gamble JA, Ghedin E, Gu Y, Hertz-Fowler C, Hirai H, Hirai Y, Houston R, Ivens A, Johnston DA, et al: The genome of the blood fluke Schistosoma mansoni. Nature. 2009, 460 (7253): 352-358. 10.1038/nature08160.
Article CAS PubMed Central PubMed Google Scholar
Young ND, Jex AR, Li B, Liu S, Yang L, Xiong Z, Li Y, Cantacessi C, Hall RS, Xu X, Chen F, Wu X, Zerlotini A, Oliveira G, Hofmann A, Zhang G, Fang X, Kang Y, Campbell BE, Loukas A, Ranganathan S, Rollinson D, Rinaldi G, Brindley PJ, Yang H, Wang J, Gasser RB: Whole-genome sequence of Schistosoma haematobium. Nat Genet. 2012, 44 (2): 221-225. 10.1038/ng.1065.
Article CAS PubMed Google Scholar
Webster JP, Oliviera G, Rollinson D, Gower CM: Schistosome genomes: a wealth of information. Trends Parasitol. 2010, 26 (3): 103-106. 10.1016/j.pt.2009.12.006.
Article CAS PubMed Google Scholar
Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B: Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods. 2008, 5 (7): 621-628. 10.1038/nmeth.1226.
Article CAS PubMed Google Scholar
Wang Z, Gerstein M, Snyder M: RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet. 2009, 10 (1): 57-63. 10.1038/nrg2484.
Article CAS PubMed Central PubMed Google Scholar
Filichkin SA, Priest HD, Givan SA, Shen R, Bryant DW, Fox SE, Wong WK, Mockler TC: Genome-wide mapping of alternative splicing in Arabidopsis thaliana. Genome Res. 2010, 20 (1): 45-58. 10.1101/gr.093302.109.
Article CAS PubMed Central PubMed Google Scholar
Harr B, Turner LM: Genome-wide analysis of alternative splicing evolution among Mus subspecies. Mol Ecol. 2010, 19 (Suppl 1): 228-239.
Article CAS PubMed Google Scholar
Kelemen O, Convertini P, Zhang Z, Wen Y, Shen M, Falaleeva M, Stamm S: Function of alternative splicing. Gene. 2012, 514 (1): 1-30.
Article PubMed Google Scholar
Kanehisa M, Goto S, Hattori M, Aoki-Kinoshita KF, Itoh M, Kawashima S, Katayama T, Araki M, Hirakawa M: From genomics to chemical genomics: new developments in KEGG. Nucleic Acids Res. 2006, 34 (Database issue): D354-D357.
Article CAS PubMed Central PubMed Google Scholar
Black DL: Mechanisms of alternative pre-messenger RNA splicing. Annu Rev Biochem. 2003, 72: 291-336. 10.1146/annurev.biochem.72.121801.161720.
Article CAS PubMed Google Scholar
Matlin AJ, Clark F, Smith CW: Understanding alternative splicing: towards a cellular code. Nat Rev Mol Cell Biol. 2005, 6 (5): 386-398. 10.1038/nrm1645.
Article CAS PubMed Google Scholar
Sammeth M, Foissac S, Guigo R: A general definition and nomenclature for alternative splicing events. PLoS Comput Biol. 2008, 4 (8): e1000147-10.1371/journal.pcbi.1000147.
Article PubMed Central PubMed Google Scholar
DeMarco R, Mathieson W, Manuel SJ, Dillon GP, Curwen RS, Ashton PD, Ivens AC, Berriman M, Verjovski-Almeida S, Wilson RA: Protein variation in blood-dwelling schistosome worms generated by differential splicing of micro-exon gene transcripts. Genome Res. 2010, 20 (8): 1112-1121. 10.1101/gr.100099.109.
Article CAS PubMed Central PubMed Google Scholar
Piao X, Cai P, Liu S, Hou N, Hao L, Yang F, Wang H, Wang J, Jin Q, Chen Q: Global expression analysis revealed novel gender-specific gene expression features in the blood fluke parasite Schistosoma japonicum. PLoS One. 2011, 6 (4): e18267-10.1371/journal.pone.0018267.
Article CAS PubMed Central PubMed Google Scholar
Hao L, Cai P, Jiang N, Wang H, Chen Q: Identification and characterization of microRNAs and endogenous siRNAs in Schistosoma japonicum. BMC Genomics. 2010, 11: 55-10.1186/1471-2164-11-55.
Article PubMed Central PubMed Google Scholar
Cai P, Hou N, Piao X, Liu S, Liu H, Yang F, Wang J, Jin Q, Wang H, Chen Q: Profiles of small non-coding RNAs in Schistosoma japonicum during development. PLoS Negl Trop Dis. 2011, 5 (8): e1256-10.1371/journal.pntd.0001256.
Article CAS PubMed Central PubMed Google Scholar
Cai P, Mu Y, Piao X, Hou N, Liu S, Gao Y, Wang H, Chen Q: Discovery and confirmation of ligand binding specificities of the Schistosoma japonicum polarity protein scribble. PLoS Negl Trop Dis. 2014, 8 (5): e2837-10.1371/journal.pntd.0002837.
Article PubMed Central PubMed Google Scholar
Xiong J, Lu X, Zhou Z, Chang Y, Yuan D, Tian M, Wang L, Fu C, Orias E, Miao W: Transcriptome analysis of the model protozoan, Tetrahymena thermophila, using Deep RNA sequencing. PLoS One. 2012, 7 (2): e30630-10.1371/journal.pone.0030630.
Article CAS PubMed Central PubMed Google Scholar
Trapnell C, Pachter L, Salzberg SL: TopHat: discovering splice junctions with RNA-Seq. Bioinformatics. 2009, 25 (9): 1105-1111. 10.1093/bioinformatics/btp120.
Article CAS PubMed Central PubMed Google Scholar
Trapnell C, Williams BA, Pertea G, Mortazavi A, Kwan G, van Baren MJ, Salzberg SL, Wold BJ, Pachter L: Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat Biotechnol. 2010, 28 (5): 511-515. 10.1038/nbt.1621.
Article CAS PubMed Central PubMed Google Scholar
Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25 (17): 3389-3402. 10.1093/nar/25.17.3389.
Article CAS PubMed Central PubMed Google Scholar
Mulder NJ, Apweiler R, Attwood TK, Bairoch A, Barrell D, Bateman A, Binns D, Biswas M, Bradley P, Bork P, Bucher P, Copley RR, Courcelle E, Das U, Durbin R, Falquet L, Fleischmann W, Griffiths-Jones S, Haft D, Harte N, Hulo N, Kahn D, Kanapin A, Krestyaninova M, Lopez R, Letunic I, Lonsdale D, Silventoinen V, Orchard SE, Pagni M, et al: The InterPro Database, 2003 brings increased coverage and new features. Nucleic Acids Res. 2003, 31 (1): 315-318. 10.1093/nar/gkg046.
Article CAS PubMed Central PubMed Google Scholar
Zdobnov EM, Apweiler R: InterProScan–an integration platform for the signature-recognition methods in InterPro. Bioinformatics. 2001, 17 (9): 847-848. 10.1093/bioinformatics/17.9.847.
Article CAS PubMed Google Scholar
Harris MA, Clark J, Ireland A, Lomax J, Ashburner M, Foulger R, Eilbeck K, Lewis S, Marshall B, Mungall C, Richter J, Rubin GM, Blake JA, Bult C, Dolan M, Drabkin H, Eppig JT, Hill DP, Ni L, Ringwald M, Balakrishnan R, Cherry JM, Christie KR, Costanzo MC, Dwight SS, Engel S, Fisk DG, Hirschman JE, Hong EL, Nash RS, et al: The Gene Ontology (GO) database and informatics resource. Nucleic Acids Res. 2004, 32 (Database issue): D258-D261.
CAS PubMed Google Scholar
Ye J, Fang L, Zheng H, Zhang Y, Chen J, Zhang Z, Wang J, Li S, Li R, Bolund L: WEGO: a web tool for plotting GO annotations. Nucleic Acids Res. 2006, 34 (Web Server issue): W293-W297.
Article CAS PubMed Central PubMed Google Scholar
Romualdi C, Bortoluzzi S, D’Alessi F, Danieli GA: IDEG6: a web tool for detection of differentially expressed genes in multiple tag sampling experiments. Physiol Genomics. 2003, 12 (2): 159-162.
Article CAS PubMed Google Scholar
Zhang Z, Carriero N, Zheng D, Karro J, Harrison PM, Gerstein M: PseudoPipe: an automated pseudogene identification pipeline. Bioinformatics. 2006, 22 (12): 1437-1439. 10.1093/bioinformatics/btl116.
Article CAS PubMed Google Scholar
Gardner PP, Daub J, Tate J, Moore BL, Osuch IH, Griffiths-Jones S, Finn RD, Nawrocki EP, Kolbe DL, Eddy SR, Bateman A: Rfam: Wikipedia, clans and the "decimal" release. Nucleic Acids Res. 2011, 39 (Database issue): D141-D145.
Article CAS PubMed Central PubMed Google Scholar
Protasio AV, Tsai IJ, Babbage A, Nichol S, Hunt M, Aslett MA, De Silva N, Velarde GS, Anderson TJ, Clark RC, Davidson C, Dillon GP, Holroyd NE, LoVerde PT, Lloyd C, McQuillan J, Oliveira G, Otto TD, Parker-Manuel SJ, Quail MA, Wilson RA, Zerlotini A, Dunne DW, Berriman M: A systematically improved high quality genome and transcriptome of the human blood fluke Schistosoma mansoni. PLoS Negl Trop Dis. 2012, 6 (1): e1455-10.1371/journal.pntd.0001455.
Article CAS PubMed Central PubMed Google Scholar
Sun J, Wang SW, Li C, Hu W, Ren YJ, Wang JQ: Transcriptome profilings of female Schistosoma japonicum reveal significant differential expression of genes after pairing. Parasitol Res. 2014, 113 (3): 881-892. 10.1007/s00436-013-3719-2.
Article PubMed Google Scholar
Magny EG, Pueyo JI, Pearl FM, Cespedes MA, Niven JE, Bishop SA, Couso JP: Conserved regulation of cardiac calcium uptake by peptides encoded in small open reading frames. Science. 2013, 341 (6150): 1116-1120. 10.1126/science.1238802.
Article CAS PubMed Google Scholar
Holmes I: A probabilistic model for the evolution of RNA structure. BMC Bioinformatics. 2004, 5: 166-10.1186/1471-2105-5-166.
Article PubMed Central PubMed Google Scholar
Do CB, Woods DA, Batzoglou S: CONTRAfold: RNA secondary structure prediction without physics-based models. Bioinformatics. 2006, 22 (14): e90-e98. 10.1093/bioinformatics/btl246.
Article CAS PubMed Google Scholar
Yao Z, Weinberg Z, Ruzzo WL: CMfinder–a covariance model based RNA motif finding algorithm. Bioinformatics. 2006, 22 (4): 445-452. 10.1093/bioinformatics/btk008.
Article CAS PubMed Google Scholar
Sun Y, Buhler J: Designing secondary structure profiles for fast ncRNA identification. Comput Syst Bioinformatics Conf. 2008, 7: 145-156.
Article PubMed Google Scholar
Yusuf D, Marz M, Stadler PF, Hofacker IL: Bcheck: a wrapper tool for detecting RNase P RNA genes. BMC Genomics. 2010, 11: 432-10.1186/1471-2164-11-432.
Article PubMed Central PubMed Google Scholar

Download references

Acknowledgments

We appreciate very much the bioinformatic support of Dr. Haibo Sun at MininGene Biotechnology and the efforts of technicians at Shenzhen BGI for Solexa sequencing. We also thank the Schistosoma japonicum Genome Sequencing and Functional Analysis Consortium for making the S. japonicum genome available on line publicly.

Funding

This study was supported by the National Natural Science Foundation of China (#81270026), the intramural grant from Institute of Pathogen Biology, CAMS (2012IPB207), the National S & T Major Program (Grant No. 2012ZX10004-220) and the Program for Changjiang Scholars and Innovative Research Team in University(IRT13007).

Author information

Authors and Affiliations

MOH Key Laboratory of Systems Biology of Pathogens, Institute of Pathogen Biology, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, People’s Republic of China
Xianyu Piao, Nan Hou, Pengfei Cai, Shuai Liu, Chuang Wu & Qijun Chen
Key Laboratory of Zoonosis, Ministry of Education, Institute of Zoonosis, Jilin University, Changchun, People’s Republic of China
Qijun Chen

Authors

Xianyu Piao
View author publications
You can also search for this author in PubMed Google Scholar
Nan Hou
View author publications
You can also search for this author in PubMed Google Scholar
Pengfei Cai
View author publications
You can also search for this author in PubMed Google Scholar
Shuai Liu
View author publications
You can also search for this author in PubMed Google Scholar
Chuang Wu
View author publications
You can also search for this author in PubMed Google Scholar
Qijun Chen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Qijun Chen.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

XP, NH and QC conceived and designed the experiments. XP, NH, PC, SL, and CW performed the experiments. XP, NH and QC analyzed the data. XP and QC wrote the manuscript. All authors read and approved the final manuscript.

Xianyu Piao, Nan Hou contributed equally to this work.

Electronic supplementary material

Additional file 1: Differentially expressed genes between females and males.(XLS 605 KB)

Additional file 2: Figure S1: Junction sites. (TIFF 826 KB)

Additional file 3: Table S1: Primers and sequences for verification of the alternative splicing events. (DOC 36 KB)

Additional file 4: Total transcripts identified in female and male parsites.(XLS 9 MB)

Additional file 5: Genes with alternative splicing in female and male parsites.(XLS 700 KB)

12864_2014_6414_MOESM6_ESM.tiff

Additional file 6: Figure S2: Go category of the genes that were alternatively spliced and also differentially transcribed. (TIFF 144 KB)

12864_2014_6414_MOESM7_ESM.xlsx

Additional file 7: Genes related to the function of genetic information processing and environmental information.(XLSX 38 KB)

Additional file 8: Novel transcripts identified.(XLS 6 MB)

Additional file 9: Noncoding transcripts identified in the female S. japonicum .(XLS 62 KB)

Additional file 10: Noncoding transcripts identified in the male S. japonicum.(XLS 64 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Rights and permissions

This article is published under an open access license. Please check the 'Copyright Information' section either on this page or in the PDF for details of this license and what re-use is permitted. If your intended use exceeds what is permitted by the license or if you are unable to locate the licence and re-use information, please contact the Rights and Permissions team.

About this article

Cite this article

Piao, X., Hou, N., Cai, P. et al. Genome-wide transcriptome analysis shows extensive alternative RNA splicing in the zoonotic parasite Schistosoma japonicum. BMC Genomics 15, 715 (2014). https://doi.org/10.1186/1471-2164-15-715

Download citation

Received: 22 July 2014
Accepted: 19 August 2014
Published: 26 August 2014
DOI: https://doi.org/10.1186/1471-2164-15-715

Genome-wide transcriptome analysis shows extensive alternative RNA splicing in the zoonotic parasite Schistosoma japonicum

Abstract

Background

Results

Conclusions

Similar content being viewed by others

Background

Methods

Parasites and RNA purification

Library preparation and sequencing

Sequence analysis

Functional annotation and classification

Non-coding RNA annotation

Verification of alternative splicing transcripts by RT-PCR and sequencing

Results and discussion

Identification of a large number of novel transcripts from un-annotated genome loci following deep sequencing of the S. japonicumtranscriptome

Alternative splicing in S. japonicum

Functional category of alternatively spliced genes in S. japonicum

Identification of novel transcripts from intergenic regions and previously determined pseudogenes

Conclusions

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ contributions

Electronic supplementary material

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation