Comparing Bowtie and BWA to Align Short Reads from a RNA-Seq Experiment

Medina-Medina, N.; Broka, A.; Lacey, S.; Lin, H.; Klings, E. S.; Baldwin, C. T.; Steinberg, M. H.; Sebastiani, P.

doi:10.1007/978-3-642-28839-5_23

Comparing Bowtie and BWA to Align Short Reads from a RNA-Seq Experiment

N. Medina-Medina⁵,
A. Broka⁶,
S. Lacey⁷,
H. Lin⁸,
E. S. Klings⁸,
C. T. Baldwin⁸,
M. H. Steinberg⁸ &
…
P. Sebastiani⁷

Conference paper

1344 Accesses
1 Citations

Part of the book series: Advances in Intelligent and Soft Computing ((AINSC,volume 154))

Abstract

High-throughput sequencing technologies are a significant innovation that can contribute to important advances in genetic research. In recent years, many algorithms have been developed to align the large number of short nucleotide sequences generated by these technologies. Choosing within the available alignment algorithms is difficult; to assist this decision we evaluate several algorithms for the mapping of RNA-Seq data. The comparison was completed in two phases. An initial phase narrowed down the comparison to the three algorithms implemented in the tools: ELAND, Bowtie and BWA. A second phase compared the tools in terms of runtime, alignment coverage and process control.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Craig Venter, J., et al.: The sequence of the human genome. Science 291(5507), 1304–1351 (2001); doi:10.1126/science.1058040
Google Scholar
Sinsheimer, R.L.: Sequencing the human genome: summary report of the Santa Fe workshop. Genomics 5(4), 954–956 (1989)
Article Google Scholar
US Department of Health and Human Services and Department of Energy: Understanding our genetic inheritance. The U.S. human genome project: the first five years. US Dept. of Health and Human Services, Washington, DC (1990)
Google Scholar
Strauss, E.C., Kobori, J.A., Siu, G., Hood, L.E.: Specific-primer-directed DNA sequencing. Anal. Biochem. 154(1), 353–360 (1986)
Article Google Scholar
Yang, G., Ho, M.-H., Hubbell, E.: High-throughput microarray-based genotyping. In: IEEE Computational Systems Bioinformatics Conference, pp. 586–587 (2004)
Google Scholar
Hall, N.: Advanced sequencing technologies and their wider impact in microbiology. The Journal of Experimental Biology 210(9), 1518–1525 (2007); doi:10.1242/jeb.001370
Google Scholar
Pop, M., Salzberg, S., Shumway, M.: Genome sequence assembly: algorithms and issues. IEEE Computer 35, 47–54 (2002)
Article Google Scholar
Mount, D.M.: Bioinformatics: sequence and genome analysis. Cold Spring Harbor Laboratory Press, Cold Spring Harbor,(2004); ISBN: 0-87969-608-7
Google Scholar
Needleman, S.B., Wunsch, C.D.: A general method applicable to the search for similarities in the amino acid sequence of two proteins. Journal of Molecular Biology 48(3), 443–453 (1970)
Article Google Scholar
Smith, T.F., Waterman, M.S.: Identification of common molecular subsequences. Journal of Molecular Biology 147(1), 195–197 (1981)
Article Google Scholar
Drummond, A.J., Ashton, B., Buxton, S., Cheung, M., Cooper, A., Heled, J., Kearse, M., Moir, R., Stones-Havas, S., Sturrock, S., Thierer, T., Wilson, A.: Geneious v5.1 (2010), http://www.geneious.com
CLC Main Workbench: A comprehensive workbench for advanced DNA, RNA, and protein analyses, http://www.clcbio.com
Langmead, B., Trapnell, C., Pop, M., Salzberg, S.L.: Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biology 10(3), R25 (2009)
Google Scholar
Li, H., Durbin, R.: Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25(19), 1754–1760 (2009)
Article Google Scholar
Giardine, B., Riemer, C., Hardison, R.C., Burhans, R., Elnitski, L., Shah, P., Zhang, Y., Blankenberg, D., Albert, I., Miller, W., et al.: Galaxy: a platform for interactive large-scale genome analysis. Genome Research 15(10), 1451–1455 (2005)
Article Google Scholar
Illumina: Illumina sequencing, http://www.illumina.com
Nelson, M.: Data compression with the Burrows-Wheeler transform. Dr. Dobb’s Journal of Software Tools 21(9), 46–50 (1996)
Google Scholar
Li, H., Handsaker, B., Wysoker, A., Fennell, T., Ruan, J., Homer, N., Marth, G., Abecasis, G., Durbin, R.: 1000 genome project data processing subgroup. The sequence alignment/map format and SAMtools. Bioinformatics 25(16), 2078–2079 (2009)
Article Google Scholar
Li, H., Durbin, R.: Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25(14), 1754–1760 (2009)
Article Google Scholar
Hoffmann, S., Otto, C., Kurtz, S., Sharma, C.M., Khaitovich, P., Vogel, J., Stadler, P.F., Hackermuller, J.: Fast mapping of short sequences with mismatches, insertions and deletions using index structures. PLoS Computational Biology 5(9), R1000502 (2009)
Google Scholar
Ruffalo, M., Laframboise, T., Koyutürk, M.: Comparative analysis of algorithms for next-generation sequencing read alignment. Bioinformatics 27(20), 2790–2796 (2011)
Article Google Scholar
Heng, L., Nils, H.: A survey of sequence alignment algorithms for next-generation sequencing. Briefings in Bioinformatics 11(5), 473–483 (2010)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department L.S.I, Technical School of Computer and Telecommunications Engineering, University of Granada, Granada, Spain
N. Medina-Medina
Boston University LinGA Computing Resource, Boston, USA
A. Broka
Department of Biostatistics, Boston University School of Public Health, Boston, USA
S. Lacey & P. Sebastiani
Department of Medicine, Boston University School of Medicine, Boston, USA
H. Lin, E. S. Klings, C. T. Baldwin & M. H. Steinberg

Authors

N. Medina-Medina
View author publications
You can also search for this author in PubMed Google Scholar
A. Broka
View author publications
You can also search for this author in PubMed Google Scholar
S. Lacey
View author publications
You can also search for this author in PubMed Google Scholar
H. Lin
View author publications
You can also search for this author in PubMed Google Scholar
E. S. Klings
View author publications
You can also search for this author in PubMed Google Scholar
C. T. Baldwin
View author publications
You can also search for this author in PubMed Google Scholar
M. H. Steinberg
View author publications
You can also search for this author in PubMed Google Scholar
P. Sebastiani
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to N. Medina-Medina .

Editor information

Editors and Affiliations

, Dep. Informática / CCTC, Universidade do Minho, Largo do Paco, Braga, 4710 - 057, Portugal
Miguel P. Rocha
European Bioinformatics Institute, Wellcome Trust Genome Campus, EMBL Outstation - Hinxton, Hinxton, CB10 1SD, United Kingdom
Nicholas Luscombe
, Department of Informatics, University of Vigo, Edificio Politécnico. Campus Universitario As Lagoas s/n, Ourense, 32004, Spain
Florentino Fdez-Riverola
Faculty of Science, Department of Computing Science, University of Salamanca, Plaza de la Merced S/N, Salamanca, 37008, Spain
Juan M. Corchado Rodríguez

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Medina-Medina, N. et al. (2012). Comparing Bowtie and BWA to Align Short Reads from a RNA-Seq Experiment. In: Rocha, M., Luscombe, N., Fdez-Riverola, F., Rodríguez, J. (eds) 6th International Conference on Practical Applications of Computational Biology & Bioinformatics. Advances in Intelligent and Soft Computing, vol 154. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28839-5_23

Download citation

DOI: https://doi.org/10.1007/978-3-642-28839-5_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28838-8
Online ISBN: 978-3-642-28839-5
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics