Estimation of Alternative Splicing Isoform Frequencies from RNA-Seq Data

  • Marius Nicolae
  • Serghei Mangul
  • Ion Măndoiu
  • Alex Zelikovsky
Conference paper

DOI: 10.1007/978-3-642-15294-8_17

Part of the Lecture Notes in Computer Science book series (LNCS, volume 6293)
Cite this paper as:
Nicolae M., Mangul S., Măndoiu I., Zelikovsky A. (2010) Estimation of Alternative Splicing Isoform Frequencies from RNA-Seq Data. In: Moulton V., Singh M. (eds) Algorithms in Bioinformatics. WABI 2010. Lecture Notes in Computer Science, vol 6293. Springer, Berlin, Heidelberg

Abstract

In this paper we present a novel expectation-maximization algorithm for inference of alternative splicing isoform frequencies from high-throughput transcriptome sequencing (RNA-Seq) data. Our algorithm exploits disambiguation information provided by the distribution of insert sizes generated during sequencing library preparation, and takes advantage of base quality scores, strand and read pairing information if available. Empirical experiments on synthetic datasets show that the algorithm significantly outperforms existing methods of isoform and gene expression level estimation from RNA-Seq data. The Java implementation of IsoEM is available at http://dna.engr.uconn.edu/software/IsoEM/.
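
To illustrate the general idea of expectation-maximization for this problem, the following is a minimal Python sketch. It is not the IsoEM implementation. It assumes each read comes with a precomputed, hypothetical compatibility weight for every isoform it could have originated from (in practice such weights might reflect insert sizes, base qualities, and strand or pairing information). The function name em_isoform_frequencies and the input layout are assumptions made for illustration only. The sketch estimates the fraction of reads attributable to each isoform and omits the length normalization that a full method would need.

```python
from collections import defaultdict


def em_isoform_frequencies(read_weights, isoforms, n_iter=200, tol=1e-9):
    """Estimate per-isoform read fractions with a plain EM loop.

    read_weights: list of dicts, one per read, mapping an isoform id to a
                  positive compatibility weight (hypothetical input format).
    isoforms:     list of all isoform ids.
    """
    # Start from a uniform guess.
    freq = {iso: 1.0 / len(isoforms) for iso in isoforms}

    for _ in range(n_iter):
        # E-step: split each read among its compatible isoforms in
        # proportion to (current frequency * compatibility weight).
        counts = defaultdict(float)
        for weights in read_weights:
            total = sum(freq[iso] * w for iso, w in weights.items())
            if total == 0.0:
                continue  # read is incompatible with every isoform
            for iso, w in weights.items():
                counts[iso] += freq[iso] * w / total

        assigned = sum(counts.values())
        if assigned == 0.0:
            break  # nothing to learn from

        # M-step: new frequencies are the normalized expected read counts.
        new_freq = {iso: counts[iso] / assigned for iso in isoforms}

        # Stop once the estimates have stabilized.
        delta = max(abs(new_freq[iso] - freq[iso]) for iso in isoforms)
        freq = new_freq
        if delta < tol:
            break

    return freq
```

For example, given three reads that match only isoform A, one read that matches only isoform B, and two reads that match both equally well, the ambiguous reads are repeatedly reassigned in proportion to the current estimates until the frequencies stop changing.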

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Marius Nicolae (1)
  • Serghei Mangul (2)
  • Ion Măndoiu (1)
  • Alex Zelikovsky (2)
  1. Computer Science & Engineering Department, University of Connecticut, Storrs
  2. Computer Science Department, Georgia State University, University Plaza, Georgia
