Abstract
The Y chromosome is one of the most informative genetic systems for exploring the evolutionary history of human populations. Global analyses of Y-chromosomal short tandem repeat (Y-STR) data can reveal worldwide population structure and history. However, previous Y-STR studies tended to focus on small geographical ranges or included only limited sample sizes. In this study, we investigated population structure and demographic history using data on 17 Y-STRs from 979 males in 44 worldwide populations. The largest genetic distances were observed between pairs of African and non-African populations. American populations, which had the lowest genetic diversities, also showed large genetic distances and coancestry coefficients with other populations, whereas Eurasian populations displayed close genetic affinities. African populations tended to have the oldest times to the most recent common ancestor (TMRCAs), the largest effective population sizes, and the earliest expansion times, whereas the American, Siberian, Melanesian, and isolated Atayal populations had the most recent TMRCAs and expansion times and the smallest effective population sizes. This clear geographic pattern is consistent with the serial founder model for the origin of populations outside Africa. The Y-STR dataset presented here provides the most detailed view of worldwide population structure and human male demographic history, and will benefit future forensic applications and population genetic studies.
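For readers unfamiliar with the diversity summaries mentioned above, the following is a minimal sketch of two of them: Nei's unbiased gene diversity at a single locus and the average number of pairwise differences between haplotypes. It is an illustration under stated assumptions, not the authors' pipeline. The function names and the toy haplotypes are hypothetical, and differences are counted per locus rather than per repeat step.

```python
from collections import Counter
from itertools import combinations


def gene_diversity(alleles):
    """Nei's unbiased gene diversity at one locus: n/(n-1) * (1 - sum(p_i^2))."""
    n = len(alleles)
    if n < 2:
        raise ValueError("need at least two sampled chromosomes")
    counts = Counter(alleles)
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1.0 - sum_sq)


def mean_pairwise_differences(haplotypes):
    """Average number of loci at which two haplotypes differ, over all pairs."""
    pairs = list(combinations(haplotypes, 2))
    if not pairs:
        raise ValueError("need at least two haplotypes")
    total = sum(sum(a != b for a, b in zip(h1, h2)) for h1, h2 in pairs)
    return total / len(pairs)


# Hypothetical toy data: 4 males typed at 3 Y-STR loci (repeat counts).
toy = [(14, 13, 29), (14, 13, 30), (15, 13, 29), (14, 12, 29)]

first_locus = [h[0] for h in toy]
print(gene_diversity(first_locus))
print(mean_pairwise_differences(toy))
```

A real analysis would use all 17 loci and the full sample of each population. Distance measures that weight differences by the number of repeat steps are a common alternative for STR data.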
Acknowledgments
This work was supported by the National Excellent Youth Science Foundation of China (No. 31222030), the National Natural Science Foundation of China (No. 91131002), the Shanghai Rising-Star Program (No. 12QA1400300), MOE University Doctoral Research Supervisor’s Funds (20120071110021), and MOE Scientific Research Project (113022A).
Additional information
Communicated by S. Hohmann.
H. Xu and C.-C. Wang contributed equally to this work.
Electronic supplementary material
438_2014_903_MOESM1_ESM.docx
Supporting doc1. Average gene diversity over loci, allele size range and number of alleles at different loci, and expected heterozygosity. (DOCX 255 kb)
438_2014_903_MOESM2_ESM.docx
Supporting doc2. Plots of the matrix of pairwise FST, Slatkin's linearized FST, the average number of pairwise differences, and the matrix of coancestry coefficients at the continental level. (DOCX 806 kb)
438_2014_903_MOESM5_ESM.xls
Table S3. Pairwise genetic distance values and corresponding p-values between all populations at both the continental and population level. (XLS 125 kb)
438_2014_903_MOESM8_ESM.xls
Table S6. TMRCA, expansion time, effective population size, and population growth rate for 44 worldwide populations estimated in Batwing. (XLS 151 kb)
438_2014_903_MOESM9_ESM.xls
Table S7. Y chromosome haplogroup prediction and demographic history inference of 25 Y chromosome predicted haplogroups. (XLS 222 kb)
Cite this article
Xu, H., Wang, CC., Shrestha, R. et al. Inferring population structure and demographic history using Y-STR data from worldwide populations. Mol Genet Genomics 290, 141–150 (2015). https://doi.org/10.1007/s00438-014-0903-8
DOI: https://doi.org/10.1007/s00438-014-0903-8