DTAST: A Novel Radical Framework for de Novo Transcriptome Assembly Based on Suffix Trees
In this article, we develop a novel radical framework for de novo transcriptome assembly based on suffix trees, called DTAST. DTAST extends contigs by reads that have the longest overlaps with the contigs’ terminuses. These reads can be found in linear time of the length of the reads through a well-designed suffix tree structure. Besides, DTAST proposes two strategies to extract transcript-representing paths: a depth-first enumeration strategy and a hybrid strategy based on length and coverage. Experimental results showed that DTAST performs more competitive than the other compared state-of-the-art de novo assemblers. The software with choice for either strategy is available at https://github.com/Jane110111107/DTAST.
- 1.Trapnell, C., Williams, B.A., Pertea, G., Mortazavi, A., Kwan, G., van Baren, M.J., Salzberg, S.L., Wold, B.J., Pachter, L.: Transcript assembly and abundance estimation from RNA-seq reveals thousands of new transcripts and switching among isoforms. Nat. Biotechnol. 28(5), 511 (2010)CrossRefGoogle Scholar