Abstract
Three approaches applicable to the analysis of forensic ancestry-informative marker data—STRUCTURE, principal component analysis, and the Snipper Bayesian classification system—are reviewed. Detailed step-by-step guidance is provided for adjusting parameter settings in STRUCTURE with particular regard to their effect when differentiating populations. Several enhancements to the Snipper online forensic classification portal are described, highlighting the added functionality they bring to particular aspects of ancestry-informative SNP analysis in a forensic context.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Jobling M, Hollox E, Hurles M et al (2014) Human evolutionary genetics: origins, peoples & disease, 2nd edn. Garland Science - Taylor & Francis Group, New York
Pritchard JK, Stephens M, Donnelly P (2000) Inference of population structure using multilocus genotype data. Genetics 155(2):945–959
Phillips C, Salas A, Sánchez JJ et al (2007) Inferring ancestral origin using a single multiplex assay of ancestry-informative marker SNPs. Forensic Sci Int Genet 1(3–4):273–280
Fondevila M, Phillips C, Santos C et al (2013) Revision of the SNPforID 34-plex forensic ancestry test: assay enhancements, standard reference sample genotypes and extended population studies. Forensic Sci Int Genet 7(1):63–74
Pereira R, Phillips C, Pinto N et al (2012) Straightforward inference of ancestry and admixture proportions through ancestry-informative insertion deletion multiplexing. PLoS One 7(1):e29684
Jolliffe I (2002) Principal component analysis. Springer, New York
R Development Core Team (2011) R: a language and environment for statistical computing. http://www.r-project.org
Phillips C (2009) SNP databases. In: Komar AA (ed) Single nucleotide polymorphisms, vol 578, Methods in molecular biology. Humana, New York, pp 43–71
Amigo J, Salas A, Phillips C et al (2008) SPSmart: adapting population based SNP genotype databases for fast and comprehensive web access. BMC Bioinformatics 9:428
Amigo J, Phillips C, Lareu MV et al (2008) The SNPforID browser: an online tool for query and display of frequency data from the SNPforID project. Int J Legal Med 122(5):435–440
Amigo J, Salas A, Phillips C (2011) ENGINES: exploring single nucleotide variation in entire human genomes. BMC Bioinformatics 12:105
Amigo J, Phillips C, Salas A et al (2009) pop.STR—an online population frequency browser for established and new forensic STRs. Forensic Sci Int Genet Suppl Ser 2(1):361–362
Rosenberg NA, Pritchard JK, Weber JL et al (2002) Genetic structure of human populations. Science 298(5602):2381–2385
Rosenberg NA, Li LM, Ward R et al (2003) Informativeness of genetic markers for inference of ancestry. Am J Hum Genet 73(6):1402–1422
Kalinowski ST (2011) The computer program STRUCTURE does not reliably identify the main genetic clusters within species: simulations and implications for human population structure. Heredity 106(4):625–632
Evanno G, Regnaut S, Goudet J (2005) Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol Ecol 14(8):2611–2620
Earl DA, vonHoldt BM (2012) STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv Genet Resour 4(2):359–361
Falush D, Stephens M, Pritchard JK (2003) Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. Genetics 164(4):1567–1587
Hubisz MJ, Falush D, Stephens M et al (2009) Inferring weak population structure with the assistance of sample group information. Mol Ecol Resour 9(5):1322–1332
Porras-Hurtado L, Ruiz Y, Santos C et al (2013) An overview of STRUCTURE: applications, parameter settings and supporting software. Front Genet 4:98
Jakobsson M, Rosenberg NA (2007) CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics 23(14):1801–1806
Rosenberg NA (2004) DISTRUCT: a program for the graphical display of population structure. Mol Ecol Notes 4(1):137–138
Gonzalez JR, Armengol L, Sole X et al (2007) SNPassoc: an R package to perform whole genome association studies. Bioinformatics 23(5):644–645
Phillips C, Prieto L, Fondevila M et al (2009) Ancestry analysis in the 11-M Madrid bomb attack investigation. PLoS One 4(8):e6583
Cann HM, de Toma C, Cazes L et al (2002) A human genome diversity cell line panel. Science 296(5566):261–262
Rosenberg NA (2006) Standardized subsets of the HGDP-CEPH human genome diversity cell line panel, accounting for atypical and duplicated samples and pairs of close relatives. Ann Hum Genet 70:841–847
Li JZ, Absher DM, Tang H et al (2008) Worldwide human relationships inferred from genome-wide patterns of variation. Science 319(5866):1100–1104
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer Science+Business Media New York
About this protocol
Cite this protocol
Santos, C., Phillips, C., Gomez-Tato, A., Alvarez-Dios, J., Carracedo, Á., Lareu, M.V. (2016). Inference of Ancestry in Forensic Analysis II: Analysis of Genetic Data. In: Goodwin, W. (eds) Forensic DNA Typing Protocols. Methods in Molecular Biology, vol 1420. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-3597-0_19
Download citation
DOI: https://doi.org/10.1007/978-1-4939-3597-0_19
Published:
Publisher Name: Humana Press, New York, NY
Print ISBN: 978-1-4939-3595-6
Online ISBN: 978-1-4939-3597-0
eBook Packages: Springer Protocols