Abstract
The genetic basis of selection for geographic adaptation and how it has contributed to population structure are unknown in tossa jute (Corchorus olitorius), an important bast fibre crop. We performed restriction site-associated DNA (RAD) sequencing-based (1115 RAD-SNPs) population genomic analyses to investigate genetic differentiation and population structure within a collection of 221 fibre-type lines from across nine geographic regions of the world. Indian populations, with relatively higher overall diversity, were significantly differentiated (based on FST and PCA) from the African and the other Asian populations. There is strong evidence that African C. olitorius was first introduced in peninsular India that could perhaps be its secondary centre of origin. However, multiple later introductions have occurred in central, eastern and northern India. Based on four assignment tests with different statistical bases, we infer that two ancestral subpopulations (African and Indian) structure the C. olitorius populations, but not in accordance with their geographic origins and patterns of diversity. Our results advocate recent migration of C. olitorius through introduction and germplasm exchange across geographical boundaries. We argue that high intraspecific genetic admixture could be associated with increased genetic variance within Indian populations. Employing both subpopulation (FST/GST-outlier) and individual-based (PCAdapt) tests, we detected putative RAD-SNP loci under selection and demonstrated that bast fibre production was an artificial, while abiotic and biotic stresses were natural selection pressures in C. olitorius adaptation. By reinferring the population structure without outlier loci, we propose ad interim that C. olitorius was possibly domesticated as a fibre crop in the Indian subcontinent.
Similar content being viewed by others
References
Akter J, Islam MS, Sajib AA, Ashraf N, Haque S, Khan H (2008) Microsatellite markers for determining genetic identities and genetic diversity among jute cultivars. Austral J Crop Sci 1:97–107
Andrews KR, Good JM, Miller MR, Luikart G, Hohenlohe PA (2016) Harnessing the power of RADseq for ecological and evolutionary genomics. Nat Rev Genet 17:81–92
Antao T, Lopes A, Lopes R, Beja-Pereira A, Luikart G (2008) LOSITAN: a workbench to detect molecular adaptation based on a F st-outlier method. BMC Bioinform 9:323
Banerjee S, Das M, Mir RR, Kundu A, Topdar N, Sarkar D et al (2012) Assessment of genetic diversity and population structure in a selected germplasm collection of 292 jute genotypes by microsatellite (SSR) markers. Mol Plant Breed 3:11–25
Basu A, Ghosh M, Meyer R, Powell W, Basak SL, Sen SK (2004) Analysis of genetic diversity in cultivated jute determined by means of SSR markers and AFLP profiling. Crop Sci 44:678–685
Basu T, Satya P, Sarkar D, Kar CS, Mitra J, Karmakar PG (2016) Organelle genetic diversity in a global collection of jute (Corchorus capsularis and C. olitorius, Malvaceae). S Afr J Bot 103:54–60
Benor S, Blattner FR, Demissew S, Hammer K (2010) Collection and ethnobotanical investigation of Corchorus species in Ethiopia: potential leafy vegetables for dry regions. Genet Resour Crop Evol 57:293–306
Benor S, Demissew S, Hammer K, Blattner FR (2012) Genetic diversity and relationships in Corchorus olitorius (Malvaceae s. l.) inferred from molecular and morphological data. Genet Resour Crop Evol 59:1125–1146
Bienert MD, Delannoy M, Navarre C, Boutry M (2012) NtSCP1 from tobacco is an extracellular serine carboxypeptidase III that has an impact on cell elongation. Plant Physiol 158:1220–1229
Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y, Buckler ES (2007) TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics 23:2633–2635
Campana MG, Hunt HV, Jones H, White J (2011) CorrSieve: software for summarizing and evaluating Structure output. Mol Ecol Resour 11:349–352
Carvajal-Rodríguez A (2017) HacDivSel: two new methods (haplotype-based and outlier-based) for the detection of divergent selection in pairs of populations. PLoS One 12:e0175944
Chakraborty A, Sarkar D, Satya P, Karmakar PG, Singh NK (2015) Pathways associated with lignin biosynthesis in lignomaniac jute fibres. Mol Genet Genom 290:1523–1542
Choudhary SB, Sharma HK, Anil Kumar A, Maruthi RT, Karmakar PG (2017) The genus Corchorus L. (Malvaceae) in India: species distribution and ethnobotany. Genet Resour Crop Evol 64:1675–1686
Conesa A, Götz S, García-Gómez JM, Terol J, Talón M, Robles M (2005) Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics 21:3674–3676
Daniels MJ, Chaumont F, Mirkov TE, Chrispeels MJ (1996) Characterization of a new vacuolar membrane aquaporin sensitive to mercury at a unique site. Plant Cell 8:587–599
Doebley JF, Gaut BS, Smith BD (2006) The molecular genetics of crop domestication. Cell 127:1309–1321
Earl DA, vonHoldt BM (2012) STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv Genet Resour 4:359–361
Edmonds JM (1990) Herbarium survey of African Corchorus L. species. International Board for Plant Genetic Resources, Rome
Evanno G, Regnaut S, Goudet J (2005) Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol Ecol 14:2611–2620
Foll M, Gaggiotti O (2008) A genome-scan method to identify selected loci appropriate for both dominant and codominant markers: a Bayesian perspective. Genetics 180:977–993
Foll M, Fischer MC, Heckel G, Excoffier L (2010) Estimating population structure from AFLP amplification intensity. Mol Ecol 19:4638–4647
François O, Martins H, Caye K, Schoville SD (2016) Controlling false discoveries in genome scans for selection. Mol Ecol 25:454–469
Freeland JR, Kirk H, Petersen S (2011) Molecular ecology, 2 edn. Wiley-Blackwell, Chichester
Frichot E, Mathieu F, Trouillon T, Bouchard G, François O (2014) Fast and efficient estimation of individual ancestry coefficients. Genetics 196:973–983
Fuller DQ, Murphy C (2018) The origins and early dispersal of horsegram (Macrotyloma uniflorum), a major crop of ancient India. Genet Resour Crop Evol 65:285–305
Glaubitz JC, Casstevens TM, Lu F, Harriman J, Elshire RJ, Sun Q et al (2014) TASSEL-GBS: a high capacity genotyping by sequencing analysis pipeline. PLoS One 9:e90346
Gould KS, Dudle DA, Neufeld HS (2010) Why some stems are red: cauline anthocyanins shield photosystem II against high light stress. J Exp Bot 61:2707–2717
Greenbaum G, Templeton AR, Zarmi Y, Bar-David S (2014) Allelic richness following population founding events- a stochastic modeling framework incorporating gene flow and genetic drift. PLoS One 9:e115203
Hammer Ø, Harper DAT, Ryan PD (2001) Past: paleontological statistics software package for education and data analysis. Palaeontol Electron 4:1–9
Hardy OJ, Vekemans X (2002) SPAGeDi: a versatile computer program to analyse spatial genetic structure at the individual or population levels. Mol Ecol Notes 2:618–620
Hou Y, Nowak MD, MirrÈ V, Bjor CS, Brochmann C, Popp M (2015) Thousands of RAD-seq loci fully resolve the phylogeny of the highly disjunct Arctic-Alpine genus Diapensia (Diapensiaceae). PLoS One 10:e0140175
Huang Y-F, Poland JA, Wight CP, Jackson EW, Tinker NA (2014) Using genotyping-by-sequencing (GBS) for genomic discovery in cultivated oat. PLoS One 9:e102448
Islam M, Saito JA, Emdad E, Ahmed B, Islam M, Halim A et al (2017) Comparative genomics of two jute species and insight into fibre biogenesis. Nat Plants 3:16223
Jakobsson M, Rosenberg NA (2007) CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics 23:1801–1806
Jeffreys H (1961) Theory of probability. Oxford University Press, Oxford
Jombart T (2008) adegenet: a R package for the multivariate analysis of genetic markers. Bioinformatics 24:1403–1405
Kar CS, Kundu A, Sarkar D, Sinha MK, Mahapatra BS (2009) Genetic diversity in jute (Corchorus spp.) and its utilization: a review. Indian J Agric Sci 79:575–586
Kar CS, Satya P, Mitra J, Sarkar D, Sinha MK, Kundu A et al (2010) Varietal development of jute and allied fibres in India. Indian Farm 60:5–9
Kundu BC (1951) Origin of jute. Indian J Genet Plant Breed 11:95–99
Kundu A, Topdar N, Sarkar D, Sinha MK, Ghosh A, Banerjee S et al (2013) Origins of white (Corchorus capsularis L.) and dark (C. olitorius L.) jute: a reevaluation based on nuclear and chloroplast microsatellites. J Plant Biochem Biotechnol 22:372–381
Kundu A, Chakraborty A, Mandal NA, Das D, Karmakar PG, Singh NK et al (2015) A restriction-site-associated DNA (RAD) linkage map, comparative genomics and identification of QTL for histological fibre content coincident with those for retted bast fibre yield and its major components in jute (Corchorus olitorius L., Malvaceae s. l.). Mol Breed 35:19
Lewontin RC, Krakauer J (1973) Distribution of gene frequency as a test of theory of selective neutrality of polymorphisms. Genetics 74:175–195
Li H, Durbin R (2009) Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25:1754–1760
Liu K, Muse SV (2005) PowerMarker: an integrated analysis environment for genetic marker analysis. Bioinformatics 21:2128–2129
Liu T, Tang S, Zhu S, Tang Q, Zheng X (2014) Transcriptome comparison reveals the patterns of selection in domesticated and wild ramie (Boehmeria nivea L. Gaud). Plant Mol Biol 86:85–92
Lu F, Lipka AE, Glaubitz J, Elshire R, Cherney JH, Casler MD et al (2013) Switchgrass genomic diversity, ploidy, and evolution: novel insights from a network-based SNP discovery protocol. PLoS Genet 9:e1003215
Luu K, Bazin E, Blum MGB (2017) pcadapt: an R package to perform genome scans for selection based on principal component analysis. Mol Ecol Resour 17:67–77
Mahapatra AK, Saha A, Gupta D (2006) Catalogue on evaluation of tossa jute germplasm (Corchorus olitorius L.). Central Research Institute for Jute and Allied Fibres, Kolkata
Marroni F, Pinosio S, Zaina G, Fogolari F, Felice N, Cattonaro F et al (2011) Nucleotide diversity and linkage disequilibrium in Populus nigra cinnamyl alcohol dehydrogenase (CAD4) gene. Tree Genet Genomes 7:1011–1023
Meirmans PG (2012) AMOVA-based clustering of population genetic data. J Hered 103:744–750
Meirmans PG, Hedrick PW (2011) Assessing population structure: F ST and related measures. Mol Ecol Resour 11:5–18
Meirmans PG, Van Tienderen PH (2004) GENOTYPE and GENODIVE: two programs for the analysis of genetic diversity of asexual organisms. Mol Ecol Notes 4:792–794
Moore AD, Held A, Terrapon N, Weiner J, Bornberg-Bauer E (2014) DoMosaics: software for domain arrangement visualization and domain-centric analysis of proteins. Bioinformatics 30:282–283
Mu C, Chen N, Li X, Jia P, Wang Z, Liu H (2010) F-box protein Arabidillo-1 promotes lateral root development by depressing the functioning of GA3 in Arabidopsis. J Plant Biol 53:374–380
Mugford ST, Qi X, Bakht S, Hill L, Wegel E, Hughes RK et al (2009) A serine carboxypeptidase-like acyltransferase is required for synthesis of antimicrobial compounds and disease resistance in oats. Plant Cell 21:2473–2484
Narum SR, Hess JE (2011) Comparison of F ST outlier tests for SNP loci under selection. Mol Ecol Resour 11:184–194
Narum SR, Buerkle AC, Davey JW, Miller MR, Hohenlohe PA (2013) Genotyping-by-sequencing in ecological and conservation genomics. Mol Ecol 22:2841–2847
Nei M, Tajima F, Tateno Y (1983) Accuracy of estimated phylogenetic trees from molecular data. II. Gene frequency data. J Mol Evol 19:153–170
Paetkau D, Calvert W, Stirling I, Strobeck C (1995) Microsatellite analysis of population structure in Canadian polar bears. Mol Ecol 4:347–354
Pan Y, Wang X, Sun G, Li F, Gong X (2016) Application of RAD sequencing for evaluating the genetic diversity of domesticated Panax notoginseng (Araliaceae). PLoS One 11:e0166419
Pegadaraju V, Nipper R, Hulke B, Qi L, Schultz Q (2013) De novo sequencing of sunflower genome for SNP discovery using RAD (Restriction site Associated DNA) approach. BMC Genom 14:1–9
Pérez-Figueroa A, García-Pereira MJ, Saura M, Rolán-Alvarez E, Caballero A (2010) Comparing three different methods to detect selective loci using dominant markers. J Evol Biol 23:2267–2276
Pfeifer B, Wittelsbürger U, Ramos-Onsins SE, Lercher MJ (2014) PopGenome: an efficient Swiss army knife for population genomic analyses in R. Mol Biol Evol 31:1929–1936
Pritchard JK, Stephens M, Donnelly P (2000) Inference of population structure using multilocus genotype data. Genetics 155:945–959
R Core Team (2017) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna
Remington DL, Thornsberry JM, Matsuoka Y, Wilson LM, Whitt SR, Doebley J et al (2001) Structure of linkage disequilibrium and phenotypic associations in the maize genome. Proc Natl Acad Sci USA 98:11479–11484
Rius M, Darling JA (2014) How important is intraspecific genetic admixture to the success of colonising populations? Trends Ecol Evol 29:233–242
Rosenberg NA (2004) DISTRUCT: a program for the graphical display of population structure. Mol Ecol Notes 4:137–138
Rousset F (2008) GENEPOP’007: a complete re-implementation of the GENEPOP software for Windows and Linux. Mol Ecol Resour 8:103–106
Roy A, Bandyopadhyay A, Mahapatra AK, Ghosh SK, Singh NK, Bansal KC et al (2006) Evaluation of genetic diversity in jute (Corchorus species) using STMS, ISSR and RAPD markers. Plant Breed 125:292–297
Sarkar D, Mahato AK, Satya P, Kundu A, Singh S, Jayaswal PK et al (2017a) The draft genome of Corchorus olitorius cv. JRO-524 (Navin). Genom Data 12:151–154
Sarkar D, Menor C, Singh NK (2017b) Does RNA-seq-based Eukaryotic GeneFinding of Blast2GO require repeat-masking the whole genome shotgun (WGS) sequence? A case study in jute (Corchorus olitorius L., Malvaceae s. l.). BioBam Bioinformatics S. L. https://www.blast2go.com/support/blog/22-blast2goblog/178-is-repeat-masking-necessary-in-blast2go. Accessed 24 Jul 2017
Satya P, Banerjee R, Biswas C, Karan M, Ghosh S, Ali N (2014a) Genetic analysis of population structure using peroxidase gene and phenylalanine ammonia-lyase gene-based DNA markers: a case study in jute (Corchorus spp.). Crop Sci 54:1609–1620
Satya P, Karan M, Chakraborty K, Biswas C, Karmakar PG (2014b) Comparative analysis of diversification and population structure of kenaf (Hibiscus cannabinus L.) and roselle (H. sabdariffa L.) using SSR and RGA (resistance gene analogue) markers. Plant Syst Evol 300:1209–1218
Shivaraj SM, Deshmukh RK, Rai R, Bélanger R, Agrawal PK, Dash PK (2017) Genome-wide identification, characterization, and expression profile of aquaporin gene family in flax (Linum usitatissimum). Sci Rep 7:46137
Soto-Cerda BJ, Cloutier S (2013) Outlier loci and selection signatures of simple sequence repeats (SSRs) in flax (Linum usitatissimum L.). Plant Mol Biol Rep 31:978–990
Supek F, Bošnjak M, Škunca N, Šmuc T (2011) REVIGO summarizes and visualizes long lists of gene ontology terms. PLoS One 6:e21800
Szpiech ZA, Jakobsson M, Rosenberg NA (2008) ADZE: a rarefaction approach for counting alleles private to combinations of populations. Bioinformatics 24:2498–2504
Takezaki N, Nei M, Tamura K (2014) POPTREEW: web version of POPTREE for constructing population trees from allele frequency data and computing some other quantities. Mol Biol Evol 31:1622–1624
Valdisser PAMR, Pappas GJ, Menezes IPP, Müller BSF, Pereira WJ, Narciso MG et al (2016) SNP discovery in common bean by restriction-associated DNA (RAD) sequencing for genetic diversity and population structure analysis. Mol Genet Genom 291:1277–1291
Verhoeven KJF, Macel M, Wolfe LM, Biere A (2011) Population admixture, biological invasions and the balance between local adaptation and inbreeding depression. Proc R Soc Lond B Biol Sci 278:2–8
Vigouroux Y, Glaubitz JC, Matsuoka Y, Goodman MM, Sánchez GJ, Doebley J (2008) Population structure and genetic diversity of New World maize races assessed by DNA microsatellites. Am J Bot 95:1240–1253
Winters CA (2010) Y-chromosome evidence of an African origin of Dravidian agriculture. Int J Genet Mol Biol 2:30–33
Xu P, Xu S, Wu X, Tao Y, Wang B, Wang S et al (2014) Population genomic analyses from low-coverage RAD-Seq data: a case study on the non-model cucurbit bottle gourd. Plant J 77:430–442
Yang Z, Yan A, Lu R, Dai Z, Tang Q, Cheng C et al (2017) De novo transcriptome sequencing of two cultivated jute species under salinity stress. PLoS One 12:e0185863
Ye J, Fang L, Zheng H, Zhang Y, Chen J, Zhang Z et al (2006) WEGO: a web tool for plotting GO annotations. Nucleic Acids Res 34:W293–W297
Yu J, Pressoir G, Briggs WH, Vroh Bi I, Yamasaki M, Doebley JF et al (2006) A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat Genet 38:203–208
Zhang L, Yuan M, Tao A, Xu J, Lin L, Fang P et al (2015) Genetic structure and relationship analysis of an association population in jute (Corchorus spp.) evaluated by SSR markers. PLoS One 10:e0128195
Acknowledgements
This work was funded by National Agricultural Science Fund (NASF), Indian Council of Agricultural Research (ICAR), New Delhi (Grant ID: GB-2018) and ICAR-Network Project on Transgenics in Crops (Grant ID: ICAR-NPTC-3070). We thank NxGenBio Life Sciences, New Delhi for assistance in RADseq library preparation, Illumina HiSeq™ 2000 sequencing and raw data processing. We also thank Dr. Subhojit Datta for helpful feedback on tossa jute aquaporins. The manuscript was reviewed and approved by the institute. Comments and suggestions on the manuscript from the Editor and two anonymous reviewers are gratefully acknowledged.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Ethical approval
All laboratory experiments complied with appropriate ethical standards, according to existing rules and regulations of the Indian Council of Agricultural Research (ICAR), Government of India. The article does not pertain to any laboratory experiments involving human participants or animals.
Additional information
Communicated by Dacheng Tian.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The raw Illumina RADseq reads have been deposited in the NCBI Sequence Read Archive (SRA) under the project SRP064554 vide BioProject PRJNA207496 and BioSamples SAMN03097738 to SAMN03097962, with 225 SRX accessions listed in Online Resource 2. RADseq genotype data and other summary statistics are available as a Figshare (https://figshare.com/s/05b13e169a8f5ae8e634) entry doi: https://doi.org/10.6084/m9.figshare.6339518.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Sarkar, D., Kundu, A., Das, D. et al. Resolving population structure and genetic differentiation associated with RAD-SNP loci under selection in tossa jute (Corchorus olitorius L.). Mol Genet Genomics 294, 479–492 (2019). https://doi.org/10.1007/s00438-018-1526-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00438-018-1526-2