Abstract
Southwest China was the crossroad for the initial settler people of East Asia, which shows the highest diversity in languages and genetics. This region played a significant role in the formation of the genetic makeup of the proto-Hmong–Mien-speaking people and in the north-to-south human expansion during the Neolithic-to-historic transformation. Their genetic history covering migration events and the admixture processes still needs to be further explored. Therefore, in the current study, we have generated genome-wide data from three genomic aspects covering autosomal, mitochondrial and Y-chromosomal regions in 260 Hmong–Mien, Tibeto-Burman, and Sinitic people from 29 different southwestern Chinese groups, and further analyzed them with 2676 published modern and ancient Eurasian genomes. Here, we have noticed a new southwestern East Asian genetic cline composed of the Hmong–Mien-specific ancestry enriched in modern Hmong and Pathen. This newly identified southern inland East Asian lineage contributed to a great extent of the gene pool in the modern southern East Asians. We also have observed genetic substructure among Hmong–Mien-speaking populations. The southern Hmong–Mien-speaking people showed more genetic affinity with modern Tai–Kadai/Austroasiatic people, while the northern Hmong–Mien speakers expressed a closer genetic connection with the Neolithic-to-modern northern East Asians. Moreover, southwestern Sinitic populations had a strong genomic affinity with the adjacent Hmong–Mien-speaking populations and the lowlander Tibeto-Burman-speaking populations, which suggested the large-scale genetic admixture occurred between them. Allele-sharing-based qpAdm/qpGraph results further confirmed that all included southwestern Chinese populations could be modeled as a mixed result of the major ancestry component from the northern millet farmers in the Yellow River basin and the minor ancestry component from the southern rice farmers in the Yangtze River basin. Usually, this newly identified Hmong–Mien-associated southern East Asian ancestry could improve our understanding of the full-scale genetic landscape of the evolutionary and admixture history of southwestern East Asians. Further ancient genomic studies from southeastern China are required to shed deeper light on our established phylogeny context.
Similar content being viewed by others
Data availability
We Followed the informed consent of this project and the regulations of the Human Genetic Resources Administration of China (HGRAC), the generated raw data can be shared via personal communication with the corresponding authors with the following conditions, (i) the data can be only used for studying population history, (ii) the data will not be used for commercial purposes, (iii) the data will not be used to identify the sample donors, (iv) the data will not be used for studying natural/cultural selections, medical or other related studies.
References
Alexander DH, Novembre J, Lange K (2009) Fast model-based estimation of ancestry in unrelated individuals. Genome Res 19:1655–1664
Association WM (2001) World Medical Association Declaration of Helsinki. Ethical principles for medical research involving human subjects. Bull World Health Organiz 79:373
Browning BL, Browning SR (2013) Improving the accuracy and efficiency of identity-by-descent detection in population data. Genetics 194:459–471
Chen P, Wu J, Luo L, Gao H, Wang M, Zou X, Li Y, Chen G, Luo H, Yu L, Han Y, Jia F, He G (2019) Population genetic analysis of modern and ancient DNA variations yields new insights into the formation, genetic structure, and phylogenetic relationship of Northern Han Chinese. Front Genet 10:1045
Chi Z, Hung H-C (2008) The neolithic of Southern China—origin, development, and dispersal. Asian Perspect 47:299–329
Chi Z, Hung H-c (2015) The emergence of agriculture in southern China. Antiquity 84:11–25
Dong G, Du L, Wei W (2020) The impact of early trans-Eurasian exchange on animal utilization in northern China during 5000–2500 BP. The Holocene 31:294–301
Fu Q, Li H, Moorjani P, Jay F, Slepchenko SM, Bondarev AA, Johnson PL, Aximu-Petri A, Prufer K, de Filippo C, Meyer M, Zwyns N, Salazar-Garcia DC, Kuzmin YV, Keates SG, Kosintsev PA, Razhev DI, Richards MP, Peristov NV, Lachmann M, Douka K, Higham TF, Slatkin M, Hublin JJ, Reich D, Kelso J, Viola TB, Paabo S (2014) Genome sequence of a 45,000-year-old modern human from western Siberia. Nature 514:445–449
Gakuhari T, Nakagome S, Rasmussen S, Allentoft ME, Sato T, Korneliussen T, Chuinneagain BN, Matsumae H, Koganebuchi K, Schmidt R, Mizushima S, Kondo O, Shigehara N, Yoneda M, Kimura R, Ishida H, Masuyama T, Yamada Y, Tajima A, Shibata H, Toyoda A, Tsurumoto T, Wakebe T, Shitara H, Hanihara T, Willerslev E, Sikora M, Oota H (2020) Ancient Jomon genome sequence analysis sheds light on migration patterns of early East Asian populations. Commun Biol 3:437
Green RE, Krause J, Briggs AW, Maricic T, Stenzel U, Kircher M, Patterson N, Li H, Zhai W, Fritz MH, Hansen NF, Durand EY, Malaspinas AS, Jensen JD, Marques-Bonet T, Alkan C, Prufer K, Meyer M, Burbano HA, Good JM, Schultz R, Aximu-Petri A, Butthof A, Hober B, Hoffner B, Siegemund M, Weihmann A, Nusbaum C, Lander ES, Russ C, Novod N, Affourtit J, Egholm M, Verna C, Rudan P, Brajkovic D, Kucan Z, Gusic I, Doronichev VB, Golovanova LV, Lalueza-Fox C, de la Rasilla M, Fortea J, Rosas A, Schmitz RW, Johnson PLF, Eichler EE, Falush D, Birney E, Mullikin JC, Slatkin M, Nielsen R, Kelso J, Lachmann M, Reich D, Paabo S (2010) A draft sequence of the Neandertal genome. Science 328:710–722
Handel Z (2012) Hmong-Mien Language History. Diachronica 29:385–398
He G, Wang Z, Guo J, Wang M, Zou X, Tang R, Liu J, Zhang H, Li Y, Hu R, Wei LH, Chen G, Wang CC, Hou Y (2020) Inferring the population history of Tai-Kadai-speaking people and southernmost Han Chinese on Hainan Island by genome-wide array genotyping. Eur J Hum Genet 28:1111–1123
He GL, Li YX, Wang MG, Zou X, Yeh HY, Yang XM, Wang Z, Tang RK, Zhu SM, Guo JX, Luo T, Zhao J, Sun J, Xia ZY, Fan HL, Hu R, Wei LH, Chen G, Hou YP, Wang CC (2021a) Fine-scale genetic structure of Tujia and central Han Chinese revealing massive genetic admixture under language borrowing. J System Evol 59(1):1–20
He G, Wang M, Zou X, Chen P, Wang Z, Liu Y, Yao H, Wei L-H, Tang R, WANG C-C, Yeh H-Y, (2021b) Peopling history of the Tibetan Plateau and multiple waves of admixture of Tibetans inferred from both ancient and modern genome-wide data. Front Genet. https://doi.org/10.3389/fgene.2021.725243
He GL, Wang MG, Li YX, Zou X, Yeh HY, Tang RK, Yang XM, Wang Z, Guo JX, Luo T, Zhao J, Sun J, Hu R, Wei LH, Chen G, Hou YP, Wang CC (2021c) Fine-scale north-to-south genetic admixture profile in Shaanxi Han Chinese revealed by genome-wide demographic history reconstruction. J System Evol. https://doi.org/10.1111/jse.12715
Hellenthal G, Busby GBJ, Band G, Wilson JF, Capelli C, Falush D, Myers S (2014) A genetic atlas of human admixture history. Science 343:747–751
Jeong C, Ozga AT, Witonsky DB, Malmstrom H, Edlund H, Hofman CA, Hagan RW, Jakobsson M, Lewis CM, Aldenderfer MS, Di Rienzo A, Warinner C (2016) Long-term genetic stability and a high-altitude East Asian origin for the peoples of the high valleys of the Himalayan arc. Proc Natl Acad Sci U S A 113:7485–7490
Kumar S, Stecher G, Tamura K (2016) MEGA7: Molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol 33:1870–1874
Kutanan W, Kampuansai J, Srikummool M, Brunelli A, Ghirotto S, Arias L, Macholdt E, Hubner A, Schroder R, Stoneking M (2019) Contrasting paternal and maternal genetic histories of thai and lao populations. Mol Biol Evol 36:1490–1506
Kutanan W, Shoocongdej R, Srikummool M, Hubner A, Suttipai T, Srithawong S, Kampuansai J, Stoneking M (2020) Cultural variation impacts paternal and maternal genetic lineages of the Hmong-Mien and Sino-Tibetan groups from Thailand. Eur J Hum Genet 28:1563–1579
Lawson DJ, Hellenthal G, Myers S, Falush D (2012) Inference of population structure using dense haplotype data. PLoS Genet 8:e1002453
Li Y-C, Tian J-Y, Liu F-W, Yang B-Y, Gu K-S-Y, Rahman ZU, Yang L-Q, Chen F-H, Dong G-H, Kong Q-P (2019) Neolithic millet farmers contributed to the permanent settlement of the Tibetan Plateau by adopting barley agriculture. Natl Sci Rev 6:1005–1013
Lipson M, Cheronet O, Mallick S, Rohland N, Oxenham M, Pietrusewsky M, Pryce TO, Willis A, Matsumura H, Buckley H, Domett K, Nguyen GH, Trinh HH, Kyaw AA, Win TT, Pradier B, Broomandkhoshbacht N, Candilio F, Changmai P, Fernandes D, Ferry M, Gamarra B, Harney E, Kampuansai J, Kutanan W, Michel M, Novak M, Oppenheimer J, Sirak K, Stewardson K, Zhang Z, Flegontov P, Pinhasi R, Reich D (2018) Ancient genomes document multiple waves of migration in Southeast Asian prehistory. Science 361:92–95
Liu D, Duong NT, Ton ND, Van Phong N, Pakendorf B, Van Hai N, Stoneking M (2020) Extensive ethnolinguistic diversity in vietnam reflects multiple sources of genetic diversity. Mol Biol Evol 37:2503–2519
Liu S, Huang S, Chen F, Zhao L, Yuan Y, Francis SS, Fang L, Li Z, Lin L, Liu R, Zhang Y, Xu H, Li S, Zhou Y, Davies RW, Liu Q, Walters RG, Lin K, Ju J, Korneliussen T, Yang MA, Fu Q, Wang J, Zhou L, Krogh A, Zhang H, Wang W, Chen Z, Cai Z, Yin Y, Yang H, Mao M, Shendure J, Wang J, Albrechtsen A, Jin X, Nielsen R, Xu X (2018) Genomic analyses from non-invasive prenatal testing reveal genetic associations, patterns of viral infections, and chinese population history. Cell 175:347–359 e314
Liu Y, Wang M, Chen P, Wang Z, Liu J, Yao L, Wang F, Tang R, Zou X, He G (2021) Combined low-/high-density modern and ancient genome-wide data document genomic admixture history of high-altitude East Asians. Front Genet 12:582357
Loh PR, Lipson M, Patterson N, Moorjani P, Pickrell JK, Reich D, Berger B (2013) Inferring admixture histories of human populations using linkage disequilibrium. Genetics 193:1233–1254
Ma X, Yang W, Gao Y, Pan Y, Lu Y, Chen H, Lu D, Xu S (2021) Genetic origins and sex-biased admixture of the Huis. Mol Biol Evol. https://doi.org/10.1093/molbev/msab158
McColl H, Racimo F, Vinner L, Demeter F, Gakuhari T, Moreno-Mayar JV, van Driem G, Gram Wilken U, Seguin-Orlando A, de la Fuente CC, Wasef S, Shoocongdej R, Souksavatdy V, Sayavongkhamdy T, Saidin MM, Allentoft ME, Sato T, Malaspinas AS, Aghakhanian FA, Korneliussen T, Prohaska A, Margaryan A, de Barros DP, Kaewsutthi S, Lertrit P, Nguyen TMH, Hung HC, Minh Tran T, Nghia Truong H, Nguyen GH, Shahidan S, Wiradnyana K, Matsumae H, Shigehara N, Yoneda M, Ishida H, Masuyama T, Yamada Y, Tajima A, Shibata H, Toyoda A, Hanihara T, Nakagome S, Deviese T, Bacon AM, Duringer P, Ponche JL, Shackelford L, Patole-Edoumba E, Nguyen AT, Bellina-Pryce B, Galipaud JC, Kinaston R, Buckley H, Pottier C, Rasmussen S, Higham T, Foley RA, Lahr MM, Orlando L, Sikora M, Phipps ME, Oota H, Higham C, Lambert DM, Willerslev E (2018) The prehistoric peopling of Southeast Asia. Science 361:88–92
Mengge W, Guanglin H, Yongdong S, Shouyu W, Xing Z, Jing L, Zheng W, Hou Y (2020) Massively parallel sequencing of mitogenome sequences reveals the forensic features and maternal diversity of tai-kadai-speaking hlai islanders. Forensic Sci Int Genet 47:102303
Ning C, Li T, Wang K, Zhang F, Li T, Wu X, Gao S, Zhang Q, Zhang H, Hudson MJ, Dong G, Wu S, Fang Y, Liu C, Feng C, Li W, Han T, Li R, Wei J, Zhu Y, Zhou Y, Wang CC, Fan S, Xiong Z, Sun Z, Ye M, Sun L, Wu X, Liang F, Cao Y, Wei X, Zhu H, Zhou H, Krause J, Robbeets M, Jeong C, Cui Y (2020) Ancient genomes from northern China suggest links between subsistence changes and human migration. Nat Commun 11:2700
Patterson N, Moorjani P, Luo Y, Mallick S, Rohland N, Zhan Y, Genschoreck T, Webster T, Reich D (2012) Ancient admixture in human history. Genetics 192:1065–1093
Pickrell JK, Pritchard JK (2012) Inference of population splits and mixtures from genome-wide allele frequency data. PLoS Genet 8:e1002967
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, de Bakker PI, Daly MJ, Sham PC (2007) PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 81:559–575
Raghavan M, Skoglund P, Graf KE, Metspalu M, Albrechtsen A, Moltke I, Rasmussen S, Stafford TW Jr, Orlando L, Metspalu E, Karmin M, Tambets K, Rootsi S, Magi R, Campos PF, Balanovska E, Balanovsky O, Khusnutdinova E, Litvinov S, Osipova LP, Fedorova SA, Voevoda MI, DeGiorgio M, Sicheritz-Ponten T, Brunak S, Demeshchenko S, Kivisild T, Villems R, Nielsen R, Jakobsson M, Willerslev E (2014) Upper Palaeolithic Siberian genome reveals dual ancestry of Native Americans. Nature 505:87–91
Raghavan M, Steinrucken M, Harris K, Schiffels S, Rasmussen S, DeGiorgio M, Albrechtsen A, Valdiosera C, Avila-Arcos MC, Malaspinas AS, Eriksson A, Moltke I, Metspalu M, Homburger JR, Wall J, Cornejo OE, Moreno-Mayar JV, Korneliussen TS, Pierre T, Rasmussen M, Campos PF, de Barros DP, Allentoft ME, Lindo J, Metspalu E, Rodriguez-Varela R, Mansilla J, Henrickson C, Seguin-Orlando A, Malmstrom H, Stafford T Jr, Shringarpure SS, Moreno-Estrada A, Karmin M, Tambets K, Bergstrom A, Xue Y, Warmuth V, Friend AD, Singarayer J, Valdes P, Balloux F, Leboreiro I, Vera JL, Rangel-Villalobos H, Pettener D, Luiselli D, Davis LG, Heyer E, Zollikofer CPE, Ponce de Leon MS, Smith CI, Grimes V, Pike KA, Deal M, Fuller BT, Arriaza B, Standen V, Luz MF, Ricaut F, Guidon N, Osipova L, Voevoda MI, Posukh OL, Balanovsky O, Lavryashina M, Bogunov Y, Khusnutdinova E, Gubina M, Balanovska E, Fedorova S, Litvinov S, Malyarchuk B, Derenko M, Mosher MJ, Archer D, Cybulski J, Petzelt B, Mitchell J, Worl R, Norman PJ, Parham P, Kemp BM, Kivisild T, Tyler-Smith C, Sandhu MS, Crawford M, Villems R, Smith DG, Waters MR, Goebel T, Johnson JR, Malhi RS, Jakobsson M, Meltzer DJ, Manica A, Durbin R, Bustamante CD, Song YS, Nielsen R, Willerslev E (2015) Population Genetics. Genomic evidence for the Pleistocene and recent population history of Native Americans. Science 349:aab3884
Reich D, Green RE, Kircher M, Krause J, Patterson N, Durand EY, Viola B, Briggs AW, Stenzel U, Johnson PL, Maricic T, Good JM, Marques-Bonet T, Alkan C, Fu Q, Mallick S, Li H, Meyer M, Eichler EE, Stoneking M, Richards M, Talamo S, Shunkov MV, Derevianko AP, Hublin JJ, Kelso J, Slatkin M, Paabo S (2010) Genetic history of an archaic hominin group from Denisova Cave in Siberia. Nature 468:1053–1060
Sikora M, Pitulko VV, Sousa VC, Allentoft ME, Vinner L, Rasmussen S, Margaryan A, de Barros DP, de la Fuente C, Renaud G, Yang MA, Fu Q, Dupanloup I, Giampoudakis K, Nogues-Bravo D, Rahbek C, Kroonen G, Peyrot M, McColl H, Vasilyev SV, Veselovskaya E, Gerasimova M, Pavlova EY, Chasnyk VG, Nikolskiy PA, Gromov AV, Khartanovich VI, Moiseyev V, Grebenyuk PS, Fedorchenko AY, Lebedintsev AI, Slobodin SB, Malyarchuk BA, Martiniano R, Meldgaard M, Arppe L, Palo JU, Sundell T, Mannermaa K, Putkonen M, Alexandersen V, Primeau C, Baimukhanov N, Malhi RS, Sjogren KG, Kristiansen K, Wessman A, Sajantila A, Lahr MM, Durbin R, Nielsen R, Meltzer DJ, Excoffier L, Willerslev E (2019) The population history of northeastern Siberia since the Pleistocene. Nature 570:182–188
Stevens CJ, Fuller DQ (2017) The spread of agriculture in eastern Asia. Language Dynam Change 7:152–186
Sun J, Zhao J, Cheng H-Z, Yang X, He G, Guo J, Li Y, Hu R, Chen G, Wei L-H, Wang C-C (2020) Genetic affinity and substructure of Hmong-Mien speaking populations inferred from genome-wide array genotyping of three unrecognized ethnic groups Dongjia, Xijia and Gejia. Mol Genet Genom
Wang CC, Yeh HY, Popov AN, Zhang HQ, Matsumura H, Sirak K, Cheronet O, Kovalev A, Rohland N, Kim AM, Mallick S, Bernardos R, Tumen D, Zhao J, Liu YC, Liu JY, Mah M, Wang K, Zhang Z, Adamski N, Broomandkhoshbacht N, Callan K, Candilio F, Carlson KSD, Culleton BJ, Eccles L, Freilich S, Keating D, Lawson AM, Mandl K, Michel M, Oppenheimer J, Ozdogan KT, Stewardson K, Wen S, Yan S, Zalzala F, Chuang R, Huang CJ, Looh H, Shiung CC, Nikitin YG, Tabarev AV, Tishkin AA, Lin S, Sun ZY, Wu XM, Yang TL, Hu X, Chen L, Du H, Bayarsaikhan J, Mijiddorj E, Erdenebaatar D, Iderkhangai TO, Myagmar E, Kanzawa-Kiriyama H, Nishino M, Shinoda KI, Shubina OA, Guo J, Cai W, Deng Q, Kang L, Li D, Li D, Lin R, Nini SR, Wang LX, Wei L, Xie G, Yao H, Zhang M, He G, Yang X, Hu R, Robbeets M, Schiffels S, Kennett DJ, Jin L, Li H, Krause J, Pinhasi R, Reich D (2021) Genomic insights into the formation of human populations in East Asia. Nature 591:413–419
Wang M, Wang Z, He G, Wang S, Zou X, Liu J, Wang F, Ye Z, Hou Y (2020) Whole mitochondrial genome analysis of highland Tibetan ethnicity using massively parallel sequencing. Forens Sci Int Genet 44:102197
Wen B, Li H, Gao S, Mao X, Gao Y, Li F, Zhang F, He Y, Dong Y, Zhang Y, Huang W, Jin J, Xiao C, Lu D, Chakraborty R, Su B, Deka R, Jin L (2004a) Genetic structure of hmong-mien speaking populations in East Asia as revealed by mtDNA lineages. Mol Biol Evol 22:725–734
Wen B, Li H, Lu D, Song X, Zhang F, He Y, Li F, Gao Y, Mao X, Zhang L, Qian J, Tan J, Jin J, Huang W, Deka R, Su B, Chakraborty R, Jin L (2004b) Genetic evidence supports demic diffusion of Han culture. Nature 431:302–305
Xia Z-Y, Yan S, Wang C-C, Zheng H-X, Zhang F, Liu Y-C, Yu G, Yu B-X, Shu L-L, Jin L (2019) Inland-coastal bifurcation of southern East Asians revealed by Hmong-Mien genomic history. bioRxiv. https://doi.org/10.1101/730903
Yang MA, Fan X, Sun B, Chen C, Lang J, Ko YC, Tsang CH, Chiu H, Wang T, Bao Q, Wu X, Hajdinjak M, Ko AM, Ding M, Cao P, Yang R, Liu F, Nickel B, Dai Q, Feng X, Zhang L, Sun C, Ning C, Zeng W, Zhao Y, Zhang M, Gao X, Cui Y, Reich D, Stoneking M, Fu Q (2020) Ancient DNA indicates human population shifts and admixture in northern and southern China. Science 369:282–288
Yang MA, Gao X, Theunert C, Tong H, Aximu-Petri A, Nickel B, Slatkin M, Meyer M, Paabo S, Kelso J, Fu Q (2017) 40,000-Year-old individual from Asia provides insight into early population structure in Eurasia. Curr Biol 27:3202–3208 e3209
Yao H, Wang M, Zou X, Li Y, Yang X, Li A, Yeh HY, Wang P, Wang Z, Bai J, Guo J, Chen J, Ding X, Zhang Y, Lin B, Wang CC, He G (2021) New insights into the fine-scale history of western-eastern admixture of the northwestern Chinese population in the Hexi Corridor via genome-wide genetic legacy. Mol Genet Genom 296:631–651
Zhang M, Fu Q (2020) Human evolutionary history in Eastern Eurasia using insights from ancient DNA. Curr Opin Genet Dev 62:78–84
Acknowledgements
GHL was supported by the China Postdoctoral Science Foundation (2021M691879). RKT was supported by the Science and Technology Project of Yubei District (X4487).
Author information
Authors and Affiliations
Contributions
GH, RT and XZ designed this study. GH, YW, XZ and MW wrote the manuscript. YW and XZ conducted the experiment. YW, XZ, MW, DY, LY, YZ, FC and GH analyzed the results. RT, XZ, MW and GH revised the manuscript. All authors reviewed the manuscript.
Corresponding authors
Ethics declarations
Conflict of interest
The author declares no conflict of interest.
Ethics statement
We followed the recommendations of the Helsinki Declaration of 2000, the regulations of the Human Genetic Resources Administration of China (HGRAC) and requirements of the genetic survey of Human populations published by the Medical Ethics Committee in Chongqing Medical University along with the Medical Ethics Committee of Xiamen University (XDYX201909).
Additional information
Communicated by Shuhua Xu.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Wang, Y., Zou, X., Wang, M. et al. The genomic history of southwestern Chinese populations demonstrated massive population migration and admixture among proto-Hmong–Mien speakers and incoming migrants. Mol Genet Genomics 297, 241–262 (2022). https://doi.org/10.1007/s00438-021-01837-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00438-021-01837-3