Science China Life Sciences

, Volume 56, Issue 2, pp 143–155

Comparative study of de novo assembly and genome-guided assembly strategies for transcriptome reconstruction based on RNA-Seq

Open AccessResearch Paper Special Topic

DOI: 10.1007/s11427-013-4442-z

Cite this article as:
Lu, B., Zeng, Z. & Shi, T. Sci. China Life Sci. (2013) 56: 143. doi:10.1007/s11427-013-4442-z

Abstract

Transcriptome reconstruction is an important application of RNA-Seq, providing critical information for further analysis of transcriptome. Although RNA-Seq offers the potential to identify the whole picture of transcriptome, it still presents special challenges. To handle these difficulties and reconstruct transcriptome as completely as possible, current computational approaches mainly employ two strategies: de novo assembly and genome-guided assembly. In order to find the similarities and differences between them, we firstly chose five representative assemblers belonging to the two classes respectively, and then investigated and compared their algorithm features in theory and real performances in practice. We found that all the methods can be reduced to graph reduction problems, yet they have different conceptual and practical implementations, thus each assembly method has its specific advantages and disadvantages, performing worse than others in certain aspects while outperforming others in anther aspects at the same time. Finally we merged assemblies of the five assemblers and obtained a much better assembly. Additionally we evaluated an assembler using genome-guided de novo assembly approach, and achieved good performance. Based on these results, we suggest that to obtain a comprehensive set of recovered transcripts, it is better to use a combination of de novo assembly and genome-guided assembly.

Keywords

transcriptome reconstruction RNA-Seq de novo assembly genome-guided assembly 
Download to read the full article text

Supplementary material

11427_2013_4442_MOESM1_ESM.pdf (726 kb)
Supplementary material, approximately 726 KB.

Copyright information

© The Author(s) 2013

Authors and Affiliations

  1. 1.Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, Institute of Biomedical Sciences and School of Life SciencesEast China Normal UniversityShanghaiChina
  2. 2.Software Engineering Institute, School of Software EngineeringEast China Normal UniversityShanghaiChina