Abstract
Analysis of large numbers of single-nucleotide polymorphisms (SNPs) can increase individual discrimination power, and, particularly, it can supply important evidence for kinship or ethnic identification. We identified 300 Korean-specific SNPs from 306 Korean whole-exome sequencing (WES) data. Functionally significant SNPs (variants in splicing site, missense, nonsense, and exonic indels) were filtered out from the variant pool, and SNPs with minor allele frequencies (MAFs) of <0.3 in the 1000 Genomes (1000G) database but >0.3 in the Korean population were selected. Genotypes obtained from WES were confirmed by the Sanger sequencing method. The identified markers were evenly distributed throughout the autosomal chromosomes. All the SNPs were in the Hardy-Weinberg equilibrium with a mean MAF of 0.415 (0.161 in 1000G). The mean heterozygosities were 0.476 (observed) and 0.470 (experimental). The combined power of discrimination was very high. Korean MAFs in most SNPs were similar to those for the Chinese and Japanese populations, but were significantly higher than those for several other ethnic populations. These selected SNPs will be used to develop forensic markers and are expected to be widely used for additional individual identification, ethnic discrimination, and linkage analysis for kinship tests.
References
Gill P (2002) Role of short tandem repeat DNA in forensic casework in the UK—past, present, and future perspectives. Biotechniques 32:366–372
Butler JM, Shen Y, McCord BR (2003) The development of reduced size STR amplicons as tools for analysis of degraded DNA. J Forensic Sci 48:1054–1064
Pereira R, Phillips C, Alves C, Amorim A, Carracedo A, Gusmão L (2009) A new multiplex for human identification using insertion/deletion polymorphisms. Electrophoresis 30:3682–3690
Sanchez JJ, Phillips C, Børsting C, Balogh K, Bogus M, Fondevila M, Harrison CD, Musgrave-Brown E, Salas A, Syndercombe-Court D, Schneider PM, Carracedo A, Morling N (2006) A multiplex assay with 52 single nucleotide polymorphisms for human identification. Electrophoresis 27:1713–1724
Gill P (2001) An assessment of the utility of single nucleotide polymorphisms (SNPs) for forensic purposes. Int J Legal Med 114:204–210
Amorim A, Pereira L (2005) Pros and cons in the use of SNPs in forensic kinship investigation: a comparative analysis with STRs. Forensic Sci Int 150:17–21
Goodwin W, Linacre A, Hadi S (2007) An introduction to forensic genetics. Wiley, New York
Albert TJ, Molla MN, Muzny DM, Nazareth L, Wheeler D, Song X, Richmond TA, Middle CM, Rodesch MJ, Packard CJ, Weinstock GM, Gibbs RA (2007) Direct selection of human genomic loci by microarray hybridization. Nat Methods 4:903–905
Fordyce SL, Avila-Arcos MC, Rockenbauer E, Borsting C, Frank-Hansen R, Petersen FT, Willerslev E, Hansen AJ, Morling N, Gilbert MT (2011) High-throughput sequencing of core STR loci for forensic genetic investigations using the Roche Genome Sequencer FLX platform. Biotechniques 51:127–133
Nielsen R, Paul JS, Albrechtsen A, Song YS (2011) Genotype and SNP calling from next-generation sequencing data. Nat Rev Genet 12:443–451
Van Neste C, Van Nieuwerburgh F, Van Hoofstat D, Deforce D (2012) Forensic STR analysis using massive parallel sequencing. Forensic Sci Int Genet 6:810–818
Hancock-Hanser BL, Frey A, Leslie MS, Dutton PH, Archer FI, Morin PA (2013) Targeted multiplex next-generation sequencing: advances in techniques of mitochondrial and nuclear DNA sequencing for population genomics. Mol Ecol Resour 13:254–268
Berglund E, Kiialainen A, Syvanen AC (2011) Next-generation sequencing technologies and applications for human genetic history and forensics. Investig Genet 2:23
Seo SB, King JL, Warshauer DH, Davis CP, Ge J, Budowle B (2013) Single nucleotide polymorphism typing with massively parallel sequencing for human identification. Int. J. Legal Med 27:1079–1086
Kumar S, Banks TW, Cloutier S (2012) SNP discovery through next-generation sequencing and its applications. Int J Plant Genom 2012:831460
Pakstis AJ, Haigh E, Cherni L, ElGaaied AB, Barton A, Evsanaa B, Togtokh A, Brissenden J, Roscoe J, Bulbul O, Filoglu G, Gurkan C, Meiklejohn KA, Robertson JM, Li CX, Wei YL, Li H, Soundararajan U, Rajeevan H, Kidd JR, Kidd KK (2015) 52 additional reference population samples for the 55 AISNP panel. Forensic Sci Int Genet 19:269–271
Ng SB, Turner EH, Robertson PD, Flygare SD, Bigham AW, Lee C, Shaffer T, Wong M, Bhattacharjee A, Eichler EE, Bamshad M, Nickerson DA, Shendure J (2009) Targeted capture and massively parallel sequencing of 12 human exomes. Nature 461:272–276
Li R, Li Y, Fang X, Yang H, Wang J, Kristiansen K, Wang J (2009) SNP detection for massively parallel whole-genome resequencing. Genome Res 19:1124–1132
Choi M, Scholl UI, Ji W, Liu T, Tikhonova IR, Zumbo P, Nayir A, Bakkaloğlu B, Ozen S, Sanjad S, Nelson-Williams C, Farhi A, Mane S, Lifton RP (2009) Genetic diagnosis by whole exome capture and massively parallel DNA sequencing. Proc Natl Acad Sci U S A 106:19096–19101
Choi BO, Koo SK, Park MH, Rhee H, Yang SJ, Choi KG, Jung SH, Kim HS, Hyun YS, Nakhro H, Lee HJ, Woo HM, Chung KW (2012) Exome sequencing is an efficient tool for genetic screening of Charcot-Marie-Tooth disease. Hum Mutat 33:1610–1615
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R (2009) The Sequence Alignment/Map format and SAMtools. Bioinformatics 25:2078–2079
The 1000 Genomes Project Consortium, Abecasis GR, Altshuler D, Auton A, Brooks LD, Durbin RM, Gibbs RA, Hurles ME, McVean GA (2010) A map of human genome variation from population-scale sequencing. Nature 467:1061–1073
Excoffier L, Lischer H (2010) Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Mol Ecol Res 10:564–567
Acknowledgments
This study was supported by a grant from the National Forensic Service (NFS), Republic of Korea.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
The experiments comply with the current laws of our country.
All samples were collected with informed consent according to the protocol approved by the Institutional Review Board for Kongju National University.
Conflict of interest
The authors declare that they have no competing interests.
Additional information
Sung Min Kim and Seong Yeon Yoo contributed equally to this work.
Rights and permissions
About this article
Cite this article
Kim, S.M., Yoo, S.Y., Nam, S.H. et al. Identification of Korean-specific SNP markers from whole-exome sequencing data. Int J Legal Med 130, 669–677 (2016). https://doi.org/10.1007/s00414-015-1313-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00414-015-1313-0