Journal of Applied Genetics, Volume 57, Issue 1, pp 71–79

Review of alignment and SNP calling algorithms for next-generation sequencing data

Animal Genetics • Review

DOI: 10.1007/s13353-015-0292-7

Cite this article as:
Mielczarek, M. & Szyda, J. J Appl Genetics (2016) 57: 71. doi:10.1007/s13353-015-0292-7

Abstract

The application of massively parallel sequencing technology has become one of the most important issues in the life sciences. It was therefore crucial to develop bioinformatics tools for next-generation sequencing (NGS) data processing. Currently, two of the most significant tasks are alignment to a reference genome and detection of single nucleotide polymorphisms (SNPs). In many types of genomic analyses, large numbers of reads need to be mapped to the reference genome, so selection of the aligner is an essential step in NGS pipelines. Two main approaches, based on suffix tries and on hash tables, have been introduced for this purpose. Suffix array-based aligners are memory-efficient and faster than hash-based aligners, but they are less accurate. In contrast, hash table algorithms tend to be slower but more sensitive. SNP and genotype callers can likewise be divided into two main approaches: heuristic and probabilistic methods. A variety of software has been developed over the past several years. In this paper, we briefly review the current development of NGS data processing algorithms and present the available software.
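
The two index families contrasted in the abstract can be illustrated with a toy exact-match lookup. The following is a minimal sketch only, not the method of any aligner reviewed in the paper: the function names are hypothetical, the suffix array is built naively, and no mismatches or gaps are handled, whereas production aligners use compressed indices and tolerate sequencing errors.

```python
from __future__ import annotations

from bisect import bisect_left
from collections import defaultdict


def build_kmer_index(reference: str, k: int) -> dict[str, list[int]]:
    """Hash-table index: map every k-mer to the 0-based positions where it starts."""
    index = defaultdict(list)
    for i in range(len(reference) - k + 1):
        index[reference[i:i + k]].append(i)
    return dict(index)


def build_suffix_array(reference: str) -> list[int]:
    """Suffix array: start positions of all suffixes in lexicographic order.

    Naive construction by sorting the suffixes directly; suitable for a toy
    example only, not for genome-sized references.
    """
    return sorted(range(len(reference)), key=lambda i: reference[i:])


def find_with_suffix_array(reference: str, sa: list[int], read: str) -> list[int]:
    """Binary-search the suffix array for suffixes that start with `read`."""
    m = len(read)
    # Truncated suffixes stay in sorted order, so bisect can be applied.
    # They are materialised here for clarity, not efficiency.
    prefixes = [reference[i:i + m] for i in sa]
    pos = bisect_left(prefixes, read)
    hits = []
    while pos < len(prefixes) and prefixes[pos] == read:
        hits.append(sa[pos])
        pos += 1
    return sorted(hits)


if __name__ == "__main__":
    ref = "ACGTACGTTACG"
    print(build_kmer_index(ref, 3).get("ACG", []))
    print(find_with_suffix_array(ref, build_suffix_array(ref), "ACG"))
```

Both lookups return the same start positions for an exact match. The hash table stores a separate entry for every distinct k-mer and fixes the seed length in advance, while the suffix array stores one integer per reference position and supports queries of any length.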

Keywords

Alignment, Genotype calling, NGS, SNP calling, Review, Software

Copyright information

© Institute of Plant Genetics, Polish Academy of Sciences, Poznan 2015

Authors and Affiliations

  1. Biostatistics Group, Department of Genetics, Wroclaw University of Environmental and Life Sciences, Wroclaw, Poland
