Big Data Technologies for DNA Sequencing
DNA sequencing is a modern technique for the precise determination of the order of nucleotides within a DNA molecule. Using this technique a huge amount of raw data is generated in life sciences.
Genome analyses play an important role in different applications in the life sciences ranging from animal breeding to personalized medicine. The technological advancements in DNA sequencing lead to vast amounts of genome data being produced and processed on a daily basis. This chapter provides an overview of the big data challenges in the area of DNA sequencing and discusses several data management solutions.
Next-generation sequencing (NGS) technologies make it possible for life scientists to produce huge amounts of DNA sequence data in a short period of time (Stephens et al. 2015). Using these technologies, in recent years thousands of genomes and short DNA sequence reads for humans, plants, animals, and microbes have been collected...
- Becker H (2011) Pflanzenzüchtung. UTB basics. UTB GmbHGoogle Scholar
- Carlson R (2003) The pace and proliferation of biological technologies. Biosecur Bioterror Biodefense Strategy Pract Sci 1(3):203–214Google Scholar
- Fiannaca A, La Rosa M, La Paglia L, Messina A, Urso A (2016) Biographdb: a new graphdb collecting heterogeneous data for bioinformatics analysis. In: Proceedings of BIOTECHNOGoogle Scholar
- Martínez H, Barrachina S, Castillo M, Tárraga J, Medina I, Dopazo J, Quintana-Ortí ES (2015) Scalable RNA sequencing onclusters of multicore processors. Trustcom/BigDataSE/ISPA 3:190–195Google Scholar
- Mushtaq H, Liu F, Costa C, Liu G, Hofstee P, Al-Ars Z (2017) Sparkga: a spark framework for cost effective, fast and accurate dna analysis at scale. In: Proceedings of the 8th ACM international conference on bioinformatics, computational biology, and health informatics. ACM, pp 148–157Google Scholar
- Pedersen E, Bongo LA (2016) Big biological data management. In: Pop F, Kolodziej J, Martino BD (eds) Resource management for big data platforms. Computer communications and networks. Springer, Heidelberg, pp 265–277Google Scholar