PopAffiliator: online calculator for individual affiliation to a major population group based on 17 autosomal short tandem repeat genotype profile

Pereira, Luísa; Alshamali, Farida; Andreassen, Rune; Ballard, Ruth; Chantratita, Wasun; Cho, Nam Soo; Coudray, Clotilde; Dugoujon, Jean-Michel; Espinoza, Marta; González-Andrade, Fabricio; Hadi, Sibte; Immel, Uta-Dorothee; Marian, Catalin; Gonzalez-Martin, Antonio; Mertens, Gerhard; Parson, Walther; Perone, Carlos; Prieto, Lourdes; Takeshita, Haruo; Rangel Villalobos, Héctor; Zeng, Zhaoshu; Zhivotovsky, Lev; Camacho, Rui; Fonseca, Nuno A.

doi:10.1007/s00414-010-0472-2

PopAffiliator: online calculator for individual affiliation to a major population group based on 17 autosomal short tandem repeat genotype profile

Original Article
Published: 16 June 2010

Volume 125, pages 629–636, (2011)
Cite this article

International Journal of Legal Medicine Aims and scope Submit manuscript

Luísa Pereira^1,2,
Farida Alshamali³,
Rune Andreassen⁴,
Ruth Ballard⁵,
Wasun Chantratita⁶,
Nam Soo Cho⁷,
Clotilde Coudray⁸,
Jean-Michel Dugoujon⁸,
Marta Espinoza⁹,
Fabricio González-Andrade¹⁰,
Sibte Hadi¹¹,
Uta-Dorothee Immel¹²,
Catalin Marian¹³,
Antonio Gonzalez-Martin¹⁴,
Gerhard Mertens¹⁵,
Walther Parson¹⁶,
Carlos Perone¹⁷,
Lourdes Prieto¹⁸,
Haruo Takeshita¹⁹,
Héctor Rangel Villalobos²⁰,
Zhaoshu Zeng²¹,
Lev Zhivotovsky²²,
Rui Camacho^23,24 &
…
Nuno A. Fonseca²⁵

789 Accesses
33 Citations
15 Altmetric
1 Mention
Explore all metrics

Abstract

Because of their sensitivity and high level of discrimination, short tandem repeat (STR) maker systems are currently the method of choice in routine forensic casework and data banking, usually in multiplexes up to 15–17 loci. Constraints related to sample amount and quality, frequently encountered in forensic casework, will not allow to change this picture in the near future, notwithstanding the technological developments. In this study, we present a free online calculator named PopAffiliator (http://cracs.fc.up.pt/popaffiliator) for individual population affiliation in the three main population groups, Eurasian, East Asian and sub-Saharan African, based on genotype profiles for the common set of STRs used in forensics. This calculator performs affiliation based on a model constructed using machine learning techniques. The model was constructed using a data set of approximately fifteen thousand individuals collected for this work. The accuracy of individual population affiliation is approximately 86%, showing that the common set of STRs routinely used in forensics provide a considerable amount of information for population assignment, in addition to being excellent for individual identification.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A genetic Study of the Ghanaian Population Using 15 Autosomal STR Loci

Article 04 March 2023

Abban Edward Kofi, David Adjem Agyemang, … Hisham Atan Edinur

Population genetics and forensic utility of 23 autosomal PowerPlex Fusion 6C STR loci in the Kuwaiti population

Article Open access 21 January 2021

Mahdi Haidar, Fatimah A. Abbas, … Penelope R. Haddrill

An evaluation of the SureID 23comp Human Identification Kit for kinship testing

Article Open access 14 November 2019

Hussain M. Alsafiah, Ali A. Aljanabi, … William Goodwin

References

Alves C, Amorim A, Gusmão L, Pereira L (2001) VWA STR genotyping: further inconsistencies between Perkin-Elmer and Promega kits. Int J Leg Med 115:97–99
Article CAS Google Scholar
Pamplona JP, Freitas F, Pereira L (2008) A worldwide database of autosomal markers used by the forensic community. Forensic Sci Int: Genetics Supplement Series 1:656–657
Article Google Scholar
Salas A, Bandelt HJ, Macaulay V, Richards MB (2007) Phylogeographic investigations: the role of trees in forensic genetics. Forensic Sci Int 168:1–13
Article PubMed CAS Google Scholar
Jobling MA (2001) Y-chromosomal SNP haplotype diversity in forensic analysis. Forensic Sci Int 118:158–162
Article PubMed CAS Google Scholar
Phillips C, Salas A, Sánchez JJ, Fondevila M, Gómez-Tato A, Alvarez-Dios J, Calaza M, de Cal MC, Ballard D, Lareu MV, Carracedo A, SNPforID Consortium (2007) Inferring ancestral origin using a single multiplex assay of ancestry-informative marker SNPs. Forensic Sci Int Genet 1:273–280
Article PubMed CAS Google Scholar
Phillips C, Prieto L, Fondevila M, Salas A, Gómez-Tato A, Alvarez-Dios J, Alonso A, Blanco-Verea A, Brión M, Montesino M, Carracedo A, Lareu MV (2009) Ancestry analysis in the 11-M Madrid bomb attack investigation. PLoS ONE 4:e6583
Article PubMed Google Scholar
Sanchez JJ, Børsting C, Balogh K, Berger B, Bogus M, Butler JM, Carracedo A, Court DS, Dixon LA, Filipović B, Fondevila M, Gill P, Harrison CD, Hohoff C, Huel R, Ludes B, Parson W, Parsons TJ, Petkovski E, Phillips C, Schmitter H, Schneider PM, Vallone PM, Morling N (2008) Forensic typing of autosomal SNPs with a 29 SNP-multiplex-results of a collaborative EDNAP exercise. Forensic Sci Int Genet 2:176–183
Article PubMed CAS Google Scholar
Allocco DJ, Song Q, Gibbons GH, Ramoni MF, Kohane IS (2007) Geography and genography: prediction of continental origin using randomly selected single nucleotide polymorphisms. BMC Genomics 8:68
Article PubMed Google Scholar
Amorim A, Pereira L (2005) Pros and cons in the use of SNPs in forensic kinship investigation: a comparative analysis with STRs. Forensic Sci Int 150:17–21
Article PubMed CAS Google Scholar
Rosenberg NA, Pritchard JK, Weber JL, Cann HM, Kidd KK, Zhivotovsky LA, Feldman MW (2002) Genetic structure of human populations. Science 298:2381–2385
Article PubMed CAS Google Scholar
Rosenberg NA, Mahajan S, Ramachandran S, Zhao C, Pritchard JK, Feldman MW (2005) Clines, clusters, and the effect of study design on the inference of human population structure. PLoS Genet 1:e70
Article PubMed Google Scholar
Bamshad MJ, Wooding S, Watkins WS, Ostler CT, Batzer MA, Jorde LB (2003) Human population genetic structure and inference of group membership. Am J Hum Genet 72:578–589
Article PubMed CAS Google Scholar
Evett IW, Pinchin R, Buffery C (1992) An investigation of the feasibility of inferring ethnic origin from DNA profiles. JFSS 32:301–306
CAS Google Scholar
Meyer E, Wiegand P, Brinkmann B (1995) Phenotype differences of STRs in 7 human populations. Int J Leg Med 107:314–322
Article CAS Google Scholar
Lowe AL, Urquhart A, Foreman LA, Evett IW (2001) Inferring ethnic origin by means of an STR profile. Forensic Sci Int 119:17–22
Article PubMed CAS Google Scholar
Fosella X, Marroni F, Manzoni S, Verzeletti A, De Ferrari F, Cerri N, Presciuttini S (2004) Assigning individuals to ethnic groups based on 13 STR loci. Int Congr Ser 1261:59–61
Article CAS Google Scholar
Graydon M, Cholette F, Ng LK (2009) Inferring ethnicity using 15 autosomal STR loci -comparisons among populations of similar and distinctly different physical traits. Forensic Sci Int Genet 3:251–254
Article PubMed CAS Google Scholar
Klintschar M, Füredi S, Egyed B, Reichenpfader B, Kleiber M (2003) Estimating the ethnic origin (EEO) of individuals using short tandem repeat loci of forensic relevance. Int Congr Ser 1239:53–56
Article CAS Google Scholar
Fridman C, dos Santos PC, Kohler P, Garcia CF, Lopez LF, Massad E, Gattás GJ (2008) Brazilian population profile of 15 STR markers. Forensic Sci Int Genet 2:e1–e4
Article PubMed Google Scholar
Brisighelli F, Capelli C, Boschi I, Garagnani P, Lareu MV, Pascali VL, Carracedo A (2009) Allele frequencies of fifteen STRs in a representative sample of the Italian population. Forensic Sci Int Genet 3:e29–e30
Article PubMed CAS Google Scholar
Herrera-Paz EF, García LF, Aragon-Nieto I, Paredes M (2008) Allele frequencies distributions for 13 autosomal STR loci in 3 Black Carib (Garifuna) populations of the Honduran Caribbean coasts. Forensic Sci Int Genet 3:e5–e10
Article PubMed CAS Google Scholar
Jacewicz R, Jedrzejczyk M, Ludwikowska M, Berent J (2008) Population database on 15 autosomal STR loci in 1000 unrelated individuals from the Lodz region of Poland. Forensic Sci Int Genet 2:e41–e43
Article PubMed Google Scholar
Juárez-Cedillo T, Zuñiga J, Acuña-Alonzo V, Pérez-Hernández N, Rodríguez-Pérez JM, Barquera R, Gallardo GJ, Sánchez-Arenas R, García-Peña Mdel C, Granados J, Vargas-Alarcón G (2008) Genetic admixture and diversity estimations in the Mexican Mestizo population from Mexico City using 15 STR polymorphic markers. Forensic Sci Int Genet 2:e37–e39
Article PubMed Google Scholar
Kraaijenbrink T, Zuniga S, Su B, Shi H, Xiao CJ, Tang WR, de Knijff P (2008) Allele frequency distribution of 21 forensic autosomal STRs in 7 populations from Yunnan, China. Forensic Sci Int Genet 3:e11–e12
Article PubMed CAS Google Scholar
Omran GA, Rutty GN, Jobling MA (2009) Genetic variation of 15 autosomal STR loci in Upper (Southern) Egyptians. Forensic Sci Int Genet 3:e39–e44
Article PubMed CAS Google Scholar
Piatek J, Jacewicz R, Ossowski A, Parafiniuk M, Berent J (2008) Population genetics of 15 autosomal STR loci in the population of Pomorze Zachodnie (NW Poland). Forensic Sci Int Genet 2:e41–e43
Article PubMed CAS Google Scholar
Sánchez-Diz P, Menounos PG, Carracedo A, Skitsa I (2008) 16 STR data of a Greek population. Forensic Sci Int Genet 2:e71–e72
Article PubMed Google Scholar
Sánchez-Diz P, Acosta MA, Fonseca D, Fernández M, Gómez Y, Jay M, Alape J, Lareu MV, Carracedo A, Restrepo CM (2009) Population data on 15 autosomal STRs in a sample from Colombia. Forensic Sci Int Genet 3:e81–e82
Article PubMed Google Scholar
Nie S, Yao J, Yan H, Yang Y, Gu T, Tang W, Li W, Wang B, Xiao C (2008) Genetic data of 15 STR loci in Chinese Yunnan Han population. Forensic Sci Int Genet 3:e1–e3
Article PubMed CAS Google Scholar
Soták M, Petrejcíková E, Bernasovská J, Bernasovský I, Sovicová A, Boronová I, Svicková P, Bôziková A, Gabriková D (2008) Genetic variation analysis of 15 autosomal STR loci in Eastern Slovak Caucasian and Romany (Gypsy) population. Forensic Sci Int Genet 3:e21–e25
Article PubMed Google Scholar
Rubi-Castellanos R, Anaya-Palafox M, Mena-Rojas E, Bautista-España D, Muñoz-Valle JF, Rangel-Villalobos H (2009) Genetic data of 15 autosomal STRs (Identifiler kit) of three Mexican Mestizo population samples from the States of Jalisco (West), Puebla (Center), and Yucatan (Southeast). Forensic Sci Int Genet 3:e71–e76
Article PubMed CAS Google Scholar
Simms TM, Garcia C, Mirabal S, McCartney Q, Herrera RJ (2008) The genetic legacy of the transatlantic slave trade in the island of New Providence. Forensic Sci Int Genet 2:310–317
Article PubMed CAS Google Scholar
Budowle B, Moretti TR (1999) Genotype profiles for six population groups at the 13 CODIS Short Tandem Repeat core loci and other PCRB Based loci. Forensic Sci Commun 1
Zhivotovsky LA, Veremeichyk VM, Kuzub NN, Atramentova LA, Udina IG, Kartel NA, Tsybovsky IS (2009) A reference data base on STR allele frequencies in the Belarus population developed from paternity cases. Forensic Sci Int Genet 3:e107–e109
Article PubMed CAS Google Scholar
Zhivotovsky LA, Malyarchuk BA, Derenko MV, Wozniak M, Grzybowski T (2009) Developing STR databases on structured populations: the native South Siberian population versus the Russian population. Forensic Sci Int Genet 3:e111–e116
Article PubMed CAS Google Scholar
Zhivotovsky LA, Akhmetova VL, Fedorova SA, Zhirkova VV, Khusnutdinova EK (2009) An STR database on the Volga-Ural population. Forensic Sci Int Genet 3:e133–e136
Article PubMed CAS Google Scholar
Li C, Li L, Zhao Z, Lin Y, Que T, Liu Y, Xue J (2009) Genetic polymorphism of 17 STR loci for forensic use in Chinese population from Shanghai in East China. Forensic Sci Int Genet 3:e117–e118
Article PubMed CAS Google Scholar
Andreassen R, Pereira L, Dupuy BM, Mevaag B (2009) Icelandic population data for the STR loci in the AMPFlSTR®SGM Plus™ system and the PowerPlex® Y-system. Forensic Sci Int Genet (in press)
Tillmar AO, Bäckström G, Montelius K (2009) Genetic variation of 15 autosomal STR loci in a Somali population. Forensic Sci Int Genet 4:e19–e20
Article PubMed CAS Google Scholar
Lopes V, Serra A, Gamero J, Sampaio L, Balsa F, Oliveira C, Batista L, Corte-Real F, Vieira DN, Vide MC, Anjos MJ, Carvalho M (2009) Allelic frequency distribution of 17 STRs from Identifiler and PowerPlex-16 in Central Portugal area and the Azores archipelago. Forensic Sci Int Genet 4:e1–e7
Article PubMed CAS Google Scholar
Witten IH, Frank E (2005) Data Mining: practical machine learning tools and techniques, 2nd Edition, Morgan Kaufmann
Fonseca NA, Camacho R, Pereira L (submitted) On the prediction of an individual affiliation to a major population group based on information from a small set of autosomal STRs—a machine learning approach
Muro T, Fujihara J, Imamura S, Nakamura H, Yasuda T, Takeshita H (2008) Allele frequencies for 15 STR loci in Ovambo population using AmpFlSTR Identifiler Kit. Leg Med (Tokyo) 10:157–159
CAS Google Scholar

Download references

Acknowledgments

IPATIMUP is an Associate Laboratory of the Portuguese Ministry of Science, Technology and Higher Education and is partially supported by FCT, the Portuguese Foundation for Science and Technology. CRACS-INESC Porto is supported by Programa Operacional Ciência, Tecnologia e Inovação (POCTI) e Quadro Comunitário de Apoio III. NJ and DH were supported by grant 196-1962766-2751. LZh received grants from the Russian Academy of Science for Mol & Cell Biol and FSM.

Author information

Authors and Affiliations

Instituto de Patologia e Imunologia Molecular da Universidade do Porto (IPATIMUP), R. Dr. Roberto Frias s/n, 4200-465, Porto, Portugal
Luísa Pereira
Faculdade de Medicina da Universidade do Porto, Porto, Portugal
Luísa Pereira
General Department of Forensic Sciences & Criminology, Dubai Police GHQ, Dubai, UAE
Farida Alshamali
Faculty of Health Sciences, Oslo University College, Oslo, Norway
Rune Andreassen
Department of Biological Sciences, California State University, Sacramento, CA, USA
Ruth Ballard
Department of Pathology, Faculty of Medicine, Ramathibodi Hospital, Mahidol University, Bangkok, Thailand
Wasun Chantratita
Department of Forensic Medicine, Central District Office, National Institute of Scientific Investigation, Daejeon, Republic of Korea
Nam Soo Cho
Laboratoire d’Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), CNRS and University Toulouse III Paul Sabatier, Toulouse, France
Clotilde Coudray & Jean-Michel Dugoujon
Departamento de Ciencias Forenses, Organismo de Investigación Judicial, Poder Judicial, Unidad de Genética Forense, San José, Costa Rica
Marta Espinoza
Department of Medicine, Metropolitan Hospital, Quito, Ecuador
Fabricio González-Andrade
School of Forensic & Investigative Sciences, University of Central Lancashire, Preston, UK
Sibte Hadi
Institute of Legal Medicine, Martin-Luther-University Halle, Halle, Germany
Uta-Dorothee Immel
Carcinogenesis, Biomarkers and Epidemiology Program, Lombardi Comprehensive Cancer Center, Georgetown University Medical Center, Washington, DC, USA
Catalin Marian
Department Zoology and Physical Anthropology, Faculty of Biology, University Complutense of Madrid, Madrid, Spain
Antonio Gonzalez-Martin
Forensic DNA Laboratory, Antwerp University Hospital, Edegem, Belgium
Gerhard Mertens
Institute of Legal Medicine, Innsbruck Medical University, Innsbruck, Austria
Walther Parson
Núcleo de Ações e Pesquisa em Apoio Diagnóstico, Faculdade de Medicina, Universidade Federal de Minas Gerais (NUPAD/FM-UFMG), Belo Horizonte, Minas Gerais, Brazil
Carlos Perone
DNA Laboratory, Comisaría general de Policía Científica, University Institute of Research Police Sciences (IUICP), Madrid, Spain
Lourdes Prieto
Department of Legal Medicine, Shimane University School of Medicine, Izumo, Shimane, Japan
Haruo Takeshita
Instituto de Investigación en Genética Molecular, Centro Universitario de la Cienega (CUCI-UdeG), Universidad de Guadalajara, Ocotlán, Jalisco, México
Héctor Rangel Villalobos
Department of Legal Medicine, School of Basic Medical Sciences, Zhengzhou University, Zhengzhou, Henan, China
Zhaoshu Zeng
Institute of General Genetics, The Russian Academy of Sciences, Moscow, Russia
Lev Zhivotovsky
Laboratory of Artificial Intelligence and Decision Support (LIAAD-INESC), Porto, Portugal
Rui Camacho
DEI, Faculdade de Engenharia da Universidade do Porto, Porto, Portugal
Rui Camacho
CRACS-INESC Porto LA, Porto, Portugal
Nuno A. Fonseca

Authors

Luísa Pereira
View author publications
You can also search for this author in PubMed Google Scholar
Farida Alshamali
View author publications
You can also search for this author in PubMed Google Scholar
Rune Andreassen
View author publications
You can also search for this author in PubMed Google Scholar
Ruth Ballard
View author publications
You can also search for this author in PubMed Google Scholar
Wasun Chantratita
View author publications
You can also search for this author in PubMed Google Scholar
Nam Soo Cho
View author publications
You can also search for this author in PubMed Google Scholar
Clotilde Coudray
View author publications
You can also search for this author in PubMed Google Scholar
Jean-Michel Dugoujon
View author publications
You can also search for this author in PubMed Google Scholar
Marta Espinoza
View author publications
You can also search for this author in PubMed Google Scholar
Fabricio González-Andrade
View author publications
You can also search for this author in PubMed Google Scholar
Sibte Hadi
View author publications
You can also search for this author in PubMed Google Scholar
Uta-Dorothee Immel
View author publications
You can also search for this author in PubMed Google Scholar
Catalin Marian
View author publications
You can also search for this author in PubMed Google Scholar
Antonio Gonzalez-Martin
View author publications
You can also search for this author in PubMed Google Scholar
Gerhard Mertens
View author publications
You can also search for this author in PubMed Google Scholar
Walther Parson
View author publications
You can also search for this author in PubMed Google Scholar
Carlos Perone
View author publications
You can also search for this author in PubMed Google Scholar
Lourdes Prieto
View author publications
You can also search for this author in PubMed Google Scholar
Haruo Takeshita
View author publications
You can also search for this author in PubMed Google Scholar
Héctor Rangel Villalobos
View author publications
You can also search for this author in PubMed Google Scholar
Zhaoshu Zeng
View author publications
You can also search for this author in PubMed Google Scholar
Lev Zhivotovsky
View author publications
You can also search for this author in PubMed Google Scholar
Rui Camacho
View author publications
You can also search for this author in PubMed Google Scholar
Nuno A. Fonseca
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Luísa Pereira.

Additional information

LP delineated the project and collected the database. RC and NAF performed the machine learning analyses, interpreted results and constructed the online calculator. The remaining authors contributed genotype profiles for the database and collaborated in the improvement of the manuscript and of the online tool’s output.

Luísa Pereira and Nuno A. Fonseca contributed equally to this work.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Pereira, L., Alshamali, F., Andreassen, R. et al. PopAffiliator: online calculator for individual affiliation to a major population group based on 17 autosomal short tandem repeat genotype profile. Int J Legal Med 125, 629–636 (2011). https://doi.org/10.1007/s00414-010-0472-2

Download citation

Received: 13 January 2010
Accepted: 17 May 2010
Published: 16 June 2010
Issue Date: September 2011
DOI: https://doi.org/10.1007/s00414-010-0472-2

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

PopAffiliator: online calculator for individual affiliation to a major population group based on 17 autosomal short tandem repeat genotype profile

Abstract

Access this article

Similar content being viewed by others

A genetic Study of the Ghanaian Population Using 15 Autosomal STR Loci

Population genetics and forensic utility of 23 autosomal PowerPlex Fusion 6C STR loci in the Kuwaiti population

An evaluation of the SureID 23comp Human Identification Kit for kinship testing

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

PopAffiliator: online calculator for individual affiliation to a major population group based on 17 autosomal short tandem repeat genotype profile

Abstract

Access this article

Similar content being viewed by others

A genetic Study of the Ghanaian Population Using 15 Autosomal STR Loci

Population genetics and forensic utility of 23 autosomal PowerPlex Fusion 6C STR loci in the Kuwaiti population

An evaluation of the SureID 23comp Human Identification Kit for kinship testing

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation