Estimation of Alternative Splicing isoform Frequencies from RNA-Seq Data
- Cite this paper as:
- Nicolae M., Mangul S., Măndoiu I., Zelikovsky A. (2010) Estimation of Alternative Splicing isoform Frequencies from RNA-Seq Data. In: Moulton V., Singh M. (eds) Algorithms in Bioinformatics. WABI 2010. Lecture Notes in Computer Science, vol 6293. Springer, Berlin, Heidelberg
In this paper we present a novel expectation-maximization algorithm for inference of alternative splicing isoform frequencies from high-throughput transcriptome sequencing (RNA-Seq) data. Our algorithm exploits disambiguation information provided by the distribution of insert sizes generated during sequencing library preparation, and takes advantage of base quality scores, strand and read pairing information if available. Empirical experiments on synthetic datasets show that the algorithm significantly outperforms existing methods of isoform and gene expression level estimation from RNA-Seq data. The Java implementation of IsoEM is available at http://dna.engr.uconn.edu/software/IsoEM/.
Unable to display preview. Download preview PDF.