Abstract
RNA-seq has proven to be a powerful technique for transcriptome profiling based on next-generation sequencing (NGS) technologies. However, due to the limited read length of NGS data, it is extremely challenging to accurately map RNA-seq reads to splice junctions, which is critically important for the analysis of alternative splicing and isoform construction. Several tools have been developed to find splice junctions by RNA-seq de novo, without the aid of gene annotations [1-3]. However, the sensitivity and specificity of these tools need to be improved. In this paper, we describe a novel method, called TrueSight, that combines information from (i) RNA-seq read mapping quality and (ii) coding potential from the reference genome sequences into a unified model that utilizes semi-supervised learning to precisely identify splice junctions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Trapnell, C., Pachter, L., Salzberg, S.L.: TopHat: discovering splice junctions with RNA-Seq. Bioinformatics 25(9), 1105–1111 (2009)
Wang, K., et al.: MapSplice: accurate mapping of RNA-seq reads for splice junction discovery. Nucleic Acids Res. 38(18), e178 (2010)
Au, K.F., et al.: Detection of splice junctions from paired-end RNA-seq data by SpliceMap. Nucleic Acids Res. 38(14), 4570–4578 (2010)
Celeux, G., Govaert, G.: A classification EM algorithm for clustering and two stochastic versions. Computational Statistics & Data Analysis 14(3), 315–332 (1992)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Li, Y., Li, HM., Burns, P., Borodovsky, M., Robinson, G.E., Ma, J. (2012). TrueSight: Self-training Algorithm for Splice Junction Detection Using RNA-seq. In: Chor, B. (eds) Research in Computational Molecular Biology. RECOMB 2012. Lecture Notes in Computer Science(), vol 7262. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29627-7_14
Download citation
DOI: https://doi.org/10.1007/978-3-642-29627-7_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-29626-0
Online ISBN: 978-3-642-29627-7
eBook Packages: Computer ScienceComputer Science (R0)