Abstract
Deletion/insertion polymorphism (DIP) is one of the more promising genetic markers in the field of forensic genetics for personal identification and biogeographic ancestry inference. In this research, we used an in-house developed ancestry-informative marker-DIP system, including 56 autosomal diallelic DIPs, three Y-chromosomal DIPs, and an Amelogenin gene, to analyze the genetic polymorphism and ancestral composition of the Chinese Korean group, as well as to explore its genetic relationships with the 26 reference populations. The results showed that this novel panel exhibited high genetic polymorphism in the studied Korean group and could be effectively applied for forensic individual identification in the Korean group. In addition, the results of multiple population genetic analyses indicated that the ancestral component of the Korean group was dominated by northern East Asia. Moreover, the Korean group was more closely related to the East Asian populations, especially to the Japanese population in Tokyo. This study enriched the genetic data of the Korean ethnic group in China and provided information on the ancestry of the Korean group from the perspective of population genetics.
Similar content being viewed by others
References
Alexander DH, Novembre J, Lange K (2009) Fast model-based estimation of ancestry in unrelated individuals. Genome Res 19:1655–1664
Alladio E, Poggiali B, Cosenza G et al (2022) Multivariate statistical approach and machine learning for the evaluation of biogeographical ancestry inference in the forensic field. Sci Rep 12:8974
Auton A, Abecasis GR, Altshuler DM et al (2015) A global reference for human genetic variation. Nature 526:68–74
Balloux F, Lugon-Moulin N (2002) The estimation of population differentiation with microsatellite markers. Mol Ecol 11:155–165
Cabana GS, Lewis CM Jr, Tito RY et al (2014) Population genetic structure of traditional populations in the Peruvian Central Andes and implications for South American population history. Hum Biol 86:147–165
Chen C, Chen H, Zhang Y et al (2020) TBtools: an integrative toolkit developed for interactive analyses of big biological data. Mol Plant 13:1194–1202
Chen L, Zhou Z, Zhang Y et al (2022) EASplex: a panel of 308 AISNPs for East Asian ancestry inference using next generation sequencing. Forensic Sci Int Genet 60:102739
De La Vega FM, Bryc K, Degehnardt JD et al (2010) Genome sequencing and analysis of admixed genomes of African and Mexican ancestry: implications for personal ancestry reconstruction and multi-ethnic medical genomics. Genome Biol 11:O4
Fang Y, Liu Y, Xu H et al (2023) Performance evaluation of an in-house panel containing 59 autosomal InDels for forensic identification in Chinese Hui and Mongolian groups. Genomics 115:110552
Gouy A, Zieger M (2017) STRAF-A convenient online tool for STR data evaluation in forensic genetics. Forensic Sci Int Genet 30:148–151
Gravel S, Zakharia F, Moreno-Estrada A et al (2013) Reconstructing native American migrations from whole-genome and whole-exome data. PLoS Genet 9:e1004023
Han Y, Li L, Liu X et al (2016) Genetic analysis of 17 Y-STR loci in Han and Korean populations from Jilin Province, Northeast China. Forensic Sci Int Genet 22:8–10
Harihara S, Saitou N, Hirai M et al (1988) Mitochondrial DNA polymorphism among five Asian populations. Am J Hum Genet 43:134–143
Hunter JD (2007) Matplotlib: a 2D graphics environment. Comput Sci Eng 9:90–95
Jin X, Wei Y, Lan Q et al (2019) A set of novel SNP loci for differentiating continental populations and three Chinese populations. PeerJ 7:e6508
Jin X, Cui W, Chen C et al (2020a) Biogeographic origin prediction of three continental populations through 42 ancestry informative SNPs. Electrophoresis 41:235–245
Jin X, Guo Y, Chen C et al (2020) Ancestry prediction comparisons of different AISNPs for five continental populations and population structure dissection of the Xinjiang Hui group via a self-developed panel. Genes (Basel) 11:505
Hosmer DWH, Lemeshow S, Sturdivant RX (2013) Applied logistic regression, 3rd edn. Wiley
Kim W, Shin DJ, Harihara S et al (2000) Y chromosomal DNA variation in east Asian populations and its potential for inferring the peopling of Korea. J Hum Genet 45:76–83
Lan Q, Li S, Cai M et al (2023) A self-developed AIM-InDel panel designed for degraded DNA analysis: forensic application characterization and genetic landscape investigation in the Han Chinese population. Genomics 115:110620
LaRue BL, Ge J, King JL et al (2012) A validation study of the Qiagen investigator DIPplex(R) kit; an INDEL-based assay for human identification. Int J Legal Med 126:533–540
Li C, Pakstis AJ, Jiang L et al (2016) A panel of 74 AISNPs: improved ancestry inference within Eastern Asia. Forensic Sci Int Genet 23:101–110
Lin Z, Ohshima T, Gao S et al (2000) Genetic variation and relationships at five STR loci in five distinct ethnic groups in China. Forensic Sci Int 112:179–189
Nei M, Tajima F, Tateno Y (1983) Accuracy of estimated phylogenetic trees from molecular data. II. Gene frequency data. J Mol Evol 19:153–170
Pennisi E (2010) Genomics. 1000 Genomes project gives new map of genetic diversity. Science 330:574–575
Phillips C (2015) Forensic genetic analysis of bio-geographical ancestry. Forensic Sci Int Genet 18:49–65
Phillips C, Salas A, Sanchez JJ et al (2007) Inferring ancestral origin using a single multiplex assay of ancestry-informative marker SNPs. Forensic Sci Int Genet 1:273–280
Phillips C, Fernandez-Formoso L, Gelabert-Besada M et al (2013) Development of a novel forensic STR multiplex for ancestry analysis and extended identity testing. Electrophoresis 34:1151–1162
Pilli E, Morelli S, Poggiali B et al (2023) Biogeographical ancestry, variable selection, and PLS-DA method: a new panel to assess ancestry in forensic samples via MPS technology. Forensic Sci Int Genet 62:102806
Qu S, Zhu J, Wang Y et al (2019) Establishing a second-tier panel of 18 ancestry informative markers to improve ancestry distinctions among Asian populations. Forensic Sci Int Genet 41:159–167
Rohart F, Gautier B, Singh A et al (2017) mixOmics: an R package for omics feature selection and multiple data integration. PLoS Comput Biol 13:e1005752
Rolf B, Horst B, Eigel A et al (1998) Microsatellite profiles reveal an unexpected genetic relationship between Asian populations. Hum Genet 102:647–652
Rosenberg NA, Li LM, Ward R et al (2003) Informativeness of genetic markers for inference of ancestry. Am J Hum Genet 73:1402–1422
Rousset F (2008) genepop’007: a complete re-implementation of the genepop software for Windows and Linux. Mol Ecol Resour 8:103–106
Santos NP, Ribeiro-Rodrigues EM, Ribeiro-Dos-Santos AK et al (2010) Assessing individual interethnic admixture and population substructure using a 48-insertion-deletion (INSEL) ancestry-informative marker (AIM) panel. Hum Mutat 31:184–190
Santos C, Phillips C, Fondevila M et al (2016) Pacifiplex: an ancestry-informative SNP panel centred on Australia and the Pacific region. Forensic Sci Int Genet 20:71–80
Seong KM, Park JH, Hyun YS et al (2014) Population genetics of insertion-deletion polymorphisms in South Koreans using investigator DIPplex kit. Forensic Sci Int Genet 8:80–83
Tamura K, Stecher G, Kumar S (2021) MEGA11: molecular evolutionary genetics analysis version 11. Mol Biol Evol 38:3022–3027
Wang Y, Lu D, Chung YJ et al (2018) Genetic structure, divergence and admixture of Han Chinese, Japanese and Korean Populations. Hereditas 155:19
Wang Y, Li S, Dang Z et al (2019) Genetic diversity and haplotype structure of 27 Y-STR loci in a Yanbian Korean population from Jilin Province, Northeast China. Leg Med (tokyo) 36:110–112
Wei Y, Wei L, Zhao L et al (2016) A single-tube 27-plex SNP assay for estimating individual ancestry and admixture from three continents. Int J Legal Med 130:27–37
Wilkinson L (2011) ggplot2: elegant graphics for data analysis by H. WICKHAM. Biometrics 67:678–679
Xuan J, Adnan A, Khan RA et al (2019) Population genetics of 19 Y-STR loci in Yanbian Korean samples from China. Ann Hum Genet 83:134–140
Xuan J, Adnan A, Zafar AA et al (2020) Genetic structure and forensic characteristics of the Korean population revealed by GoldenEye 20A. Ann Hum Biol 47:560–563
Zhang Y, Cui H, Cui Y et al (2006) Genetic profile of three short tandem repeat loci CSF1PO, TPOX, and TH01 in a Chinese Korean population. J Forensic Sci 51:1199
Zhang X, Shen C, Jin X et al (2021) Developmental validations of a self-developed 39 AIM-InDel panel and its forensic efficiency evaluations in the Shaanxi Han population. Int J Legal Med 135:1359–1367
Zhao C, Yang J, Xu H et al (2022) Genetic diversity analysis of forty-three insertion/deletion loci for forensic individual identification in Han Chinese from Beijing based on a novel panel. J Zhejiang Univ Sci B 23:241–248
Zhou Y, Jin X, Wu B et al (2021) Development and performance evaluation of a novel ancestry informative DIP panel for continental origin inference. Front Genet 12:801275
Acknowledgements
We thank all participants for their contributions to this study.
Funding
The National Key R&D Program of China (2022YFC3302004, 2022YFC3302004-1).
Author information
Authors and Affiliations
Contributions
This study was designed by BZ. MC performed the experiment, analyzed data, visualized corresponding diagrams, and wrote the manuscript. JY collected samples and assisted in data collation and analysis, drafting the manuscript. SL, XZ, WX, JS, XY revised the manuscript to provide the necessary recommendations for research. All authors contributed to data analysis, editing and revision of the manuscript.
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare no conflict of interest.
Ethical Approval
Our research was strictly adhered to the principles of human ethics research and was in accordance with the approvals of the Ethics Committees of Southern Medical University and Xi’an Jiaotong University (No. 2019-1039).
Informed Consent
All participants in this study signed the written informed consents before providing samples.
Additional information
Handling editor: Alexander Platt.
Supplementary Information
Below is the link to the electronic supplementary material.
239_2023_10143_MOESM1_ESM.tiff
The cross-validation error of each K value estimated by the ADMIXTURE software for the East Asian populations and the Korean group based on 56 AIM-DIPs. Supplementary file1 (TIFF 185 KB)
239_2023_10143_MOESM2_ESM.tiff
The cross-validation error of each K value estimated by the ADMIXTURE software for the Korean group and 26 reference populations based on 56 AIM-DIPs. Supplementary file2 (TIFF 186 KB)
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Cai, M., Li, S., Zhang, X. et al. Ancestral Information Analysis of Chinese Korean Ethnic Group via a Novel Multiplex DIP System. J Mol Evol 91, 922–934 (2023). https://doi.org/10.1007/s00239-023-10143-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00239-023-10143-y