Statistics in Biosciences

, Volume 5, Issue 1, pp 3–25

Single Nucleotide Polymorphism (SNP) Detection and Genotype Calling from Massively Parallel Sequencing (MPS) Data


DOI: 10.1007/s12561-012-9067-4

Cite this article as:
Li, Y., Chen, W., Liu, E.Y. et al. Stat Biosci (2013) 5: 3. doi:10.1007/s12561-012-9067-4


Massively parallel sequencing (MPS), since its debut in 2005, has transformed the field of genomic studies. These new sequencing technologies have resulted in the successful identification of causal variants for several rare Mendelian disorders. They have also begun to deliver on their promise to explain some of the missing heritability from genome-wide association studies (GWAS) of complex traits. We anticipate a rapidly growing number of MPS-based studies for a diverse range of applications in the near future. One crucial and nearly inevitable step is to detect SNPs and call genotypes at the detected polymorphic sites from the sequencing data. Here, we review statistical methods that have been proposed in the past five years for this purpose. In addition, we discuss emerging issues and future directions related to SNP detection and genotype calling from MPS data.


Massively parallel sequencingNext-generation sequencingSNP detectionGenotype callingLinkage disequilibrium (LD)

Copyright information

© International Chinese Statistical Association 2012

Authors and Affiliations

  1. 1.Department of GeneticsUniversity of North CarolinaChapel HillUSA
  2. 2.Department of BiostatisticsUniversity of North CarolinaChapel HillUSA
  3. 3.Division of Pediatric Pulmonary Medicine, Allergy and Immunology, Department of Pediatrics, Children’s Hospital of Pittsburgh of UPMCUniversity of Pittsburgh School of MedicinePittsburghUSA
  4. 4.Department of Computer ScienceUniversity of North CarolinaChapel HillUSA