Introduction

Apricot belongs to the Rosaceae family, Prunus genus, which consists of eight species. Apricots, cultivated worldwide, belong to the Prunus armeniaca L. species. World apricot production is approximately 3.57 million tons. Regarding production, Türkiye ranks first with 800 thousand tons, followed by Uzbekistan and Iran (FAO 2021).

Research revealed that apricot culture began in Asia approximately 5000 years ago. It has been stated that two regions in Asia, the Fergana Valley (border of Uzbekistan, Tajikistan, and Kyrgyzstan) and China, are possible origin regions for domesticated apricots (Vavilov 1951; Decroocq et al. 2016). Further research divided domesticated and wild apricots into two main ecogeographic groups based on fruit size: the first group includes Central Asia, which includes parts of China, Afghanistan, Pakistan, and northern India, and the second is the Dzhungar-Zailij group, which includes Kyrgyzstan, Kazakhstan, and China's Xinjiang province (Kostina 1969). Central Asia, including China, is the center of origin of the apricot species, including a high genetic diversity (Bourguiba et al. 2020). In addition, two more ecogeographic groups, Iranian-Caucasian and European, have been defined for cultivated apricots (Kostina 1969; Decroocq et al. 2016). The Central Asian group is the oldest group with the richest variation. Self-incompatibility is observed in most cultivars in this group; the fruits are small to medium. Likewise, the Iranian-Caucasian group is generally self-incompatible, but compared to the Central Asian group, the fruits are more significant, and the chilling requirements are lower. The European group is younger and less diverse. This group probably originates from some Asian apricots. Also, European cultivars, which are mostly self-compatible, have lower chilling requirements and have only a short maturation period. (Maghuly et al. 2005).

Central Asia has rich resources regarding apricot genetic diversity, including cultivated and wild materials. The apricot genotypes here contain significant variations in high sugar content, fruit shape, fruit color, fruit flesh, late flowering and late ripening, drought, cold, and salt tolerance. International researchers had limited access to evaluate these genetic resources until the collapse of the Soviet Union. Wide-scale use of these resources will allow for broader adaptation, expansion of production regions, and efficient and more sustainable production (Zaurov et al. 2013). On the other hand, it has been reported that Central Asian apricots require higher temperatures for flowering in spring, complete flower development more slowly, and, as a result, have a more extended flowering period. Likewise, it has been emphasized that flower formation stops in Central Asian apricots below 8 °C; on the contrary, the process continues in European apricots under these conditions, and as a result, European apricots are more affected by the cold (Lomakin 1977). Based on these points, it is seen that apricots of Central Asian origin have a more comprehensive adaptation.

Although different marker systems are used to determine plant diversity, DNA-based markers not affected by environmental conditions provide more successful and safe results. Among these, simple sequence repeats (SSR) are considered a preferred marker as they are codominant, have high reproducibility, and have greater genome abundance (Nadeem et al. 2018). This marker system is widely used for genetic diversity and fingerprint research in the apricot species (Hormaza et al. 2002; Zhebentyayeva et al. 2003; Hayashi et al. 2008; Pedryc et al. 2009; Yılmaz et al. 2012; Bakır et al. 2015: Gurcan et al. 2015; Corrado et al. 2021; Ehliz et al. 2021; Sheikh et al. 2021; Wang et al. 2023).

Kyrgyzstan, located within the homeland region of apricots, has a significant richness of cultivated and wild apricots. This study was conducted to determine the genetic diversity and population structure among apricot genotypes containing wild and cultivated forms collected from different regions of Kyrgyzstan.

Materials and methods

Ninety-one apricot accessions collected from the Chuy, Issyk-Kul, Osh, and Batken regions of Kyrgyzstan were used as material. Chuy and Issyk-Kul are located in northern Kyrgyzstan, while Batken and Osh are located in southern Kyrgyzstan. As described in Table 1, the materials consist of wild apricots, landraces, and cultivars and represent the apricot diversity of that region. The cultivars whose names are mentioned in Table 1 are widely grown in the Central Asian region (Zaurov et al. 2013).There is an elevated level of observable variation among the fruit morphologies of the apricots used in the study (Fig. 1).

Table 1 List of apricot accessions studied and their type and region collected
Fig. 1
figure 1

Fruit differences in some apricot accessions used in the study (Accession number of apricots described in Table 1; 16: Cultivar; 25: Wild type; 36: Wild type; 91: Landrace)

DNA isolation from young leaves of apricot accessions was performed using the CTAB method, according to Doyle and Doyle (1990). DNA concentrations were determined with the help of a semi-automatic microplate reader (PowerWave HT, BIO-TEK Instruments, Inc., Winooski, VT, USA), and DNA concentrations of 10 ng/µl were prepared for PCR research.

SSR analysis

SSR primers specified by Gürcan et al. (2015) were used for SSR analyses. All accessions were studied with 15 SSR primers, which successfully resulted in the tests performed with these primers (Table 2). PCR components and PCR cycles were made as reported by the same researchers and described below.

Table 2 Fifteen simple sequence repeat (SSR) primers used in apricot materials and their allele number, expected heterozygosity (He), observed heterozygosity (Ho), and polymorphism information content (PIC)

The polymerase chain reaction (PCR) was performed according to the method described by Schuelke (2000). PCR reactions were performed including fluorescently labeled M13 primer with 6-FAM, NED, PET, or VIC. The PCR reaction mix consisted of 2 µL of 10X PCR buffer, 0.6 µL of 50 mM MgCl2, 2 µL of 10 mM dNTP, 0.15 µL of 10 µM of a sequence-specific forward primer with M13(-21) tail at its 5´ end, 0.15 µL of 10 µM of a sequence-specific reverse primer, and 0.20 µL of 10 µM the universal fluorescent-labeled M13(-21) primer, 0.2 µL 5U/µL Taq DNA polymerase, and 3 µL of 25 ng/µL sample DNA. The final reaction volume was brought up to 20 µL with distilled water. The thermal cycler was programmed for denaturation at 94 °C for 5 min followed by 35 cycles of 94 °C for 40 s for denaturation, 60 °C for 30 s for primer annealing, 72 °C for 40 s for extension, and a final 5-min extension step at 72 °C. Four multiplex groups were assigned to primer pairs.

Allele lengths were determined by running PCR results on the ABI 3500 capillary electrophoresis system (Applied Biosystems, Foster City, CA, USA) at Erciyes University Genome and Stem Cell Center.

PowerMarker V3.025 software was run to determine the expected allele number (n) (He) and observed heterozygosity (Ho) and polymorphism information content (PIC). The neighbor-joining (NJ) method made the dendrogram using a genetic similarity matrix in MEGA6 (Tamura et al. 2007) and PowerMarker programs.

Results and discussion

Genetic diversity

Ninety-one apricot genotypes used were studied with 15 SSR primers. The allele numbers expected (He) and observed heterozygosity (Ho) and PIC (Polymorphism Information Content) values of these primers are given in Table 2. Accordingly, 107 alleles were obtained in all genotypes with 15 SSR primers. The highest allele was found in the SSR 3 primer (11), and the lowest was found in the SSR 1 and SSR4 primers (4). Expected heterozygote values in the primers are between 0.70 (SSR 4) and 0.88 (SSR 6). The observed heterozygosity values ranged between 0.58 (SSR 14) and 0.95 (SSR 3). On the other hand, PIC values were found between 0.65 (SSR 4) and 0.86 (SSR 6). Estimates of He was higher than that of Ho for ten loci (Table 2) showing that null alleles are high for the loci. The high frequency of null alleles indicates amplification failure for one allele in heterozygotes, which results in a loss of data and a decrease in the apparent heterozygosity (Dakin and Avise 2004). In some previous research on apricots, the number of alleles per marker was generally higher. Bourguiba et al. (2012), in their study of apricots from the regions around the Mediterranean basin, found the number of alleles per marker to be between 5 and 18. Gürcan et al. (2015) determined the number of alleles between 5 and 20 in apricot cultivars from five countries. Decroocq et al. (2016), in their study with 230 wild and 142 cultivated apricot materials, determined the number of alleles per marker as 6–22 and the observed heterozygosity values as 0.57–0.81. The relatively higher number of alleles obtained in these investigations, probably was because the cultivars used belong to different regions of the world.

On the other hand, Corrado et al. (2021), in their study of a narrower apricot population in Italy, determined the number of alleles to be between 2 and 10, similar to the results in our study.

According to the dendrogram obtained with the 15 SSR primers used in this study, it is seen that there are significant variations among genotypes (Fig. 2). Almost all of the genotypes are genetically separated from each other. On the other hand, apricot genotypes taken from northern (Chuy and Issyk-Kul, red color in the dendrogram) and southern Kyrgyzstan (Batken and Osh, blue color in the dendrogram) regions were grouped geographically. Genotypes are divided into five basic groups in the dendrogram. With some exceptions, genotypes taken from the southern regions of Kyrgyzstan were included in groups 1, 4, and 5, and genotypes taken from the northern regions were included in groups 2 and 3. In general, genotypes in groups 1, 4, and 5 consist of dried apricots, and those in groups 2 and 3 consist of table apricots. Decroocq et al. (2016), in their study with SSR markers, reported that wild apricots taken from the Issyk-Kul region of Kyrgyzstan are in the same group as apricots taken from the Kazakhstan region and that they belong to the Dzhungar-Zailig ecogeographic group. Researchers emphasized that the apricots taken from Sarı Chelek, located in the south of Kyrgyzstan and close to the Ferghana region, are in a completely different group from the others and belong to the Central Asian ecogeographic group.

Fig. 2
figure 2

Grouping of the 91 apricot accessions based on simple sequence repeat (SSR) data (Red color: Northern Kyrgyzstan -Chuy and Issyk-Kul-; The blue color refers to apricots originating from southern Kyrgyzstan–Batken and Osh-)

There was a total of seven genotypes in group 1 in the dendrogram. In this group, only genotype 1 belongs to the northern region (Chuy), while the remaining six belong to the southern Batken (73, 75, 77) and Osh (91, 92, 94) regions. There are 18 genotypes in group number two, all belonging to the northern regions. Genotype 56 was taken from the Chuy region, and the remaining 17 were taken from the Issyk-Kul region.

Group number three is the most populous group in the dendrogram and has 33 genotypes. Again, all of the genotypes in this group belonging to Issyk-Kul region, except genotype 57 (Chuy). In this group, genotypes 13 and 15 and 21 and 22 were the most similar dual genotypes. This large group was divided into two subgroups, with 23 and 10 apricot genotypes in the subgroups, respectively. All apricot genotypes in the fourth group in the dendrogram belong to the Batken and Osh regions south of Kyrgyzstan. The genotypes within the group were separated, and there were 16 genotypes. The two 'Supkhani' cultivars (79, 81) included in the study are located very close to each other in this group.

The last group, the fifth group, consists of 12 genotypes, of which genotypes 5, 6, and 7 belong to the northern Kyrgyzstan (Issyk-Kul) region. These three genotypes are grouped in a small subgroup, separate from the others. The remaining nine genotypes were taken from the southern regions of Kyrgyzstan. ‘Kandak’ (83 and 87) and ‘Mirsandjali’ cultivars (85, 86) in this group were reported to have the same AFLP profiles in a previous study (Zhebentyayeva et al. 2008).

Population structure

The population structure of apricot accessions was determined using the STRUCTURE 3.2 program defined by Evanno et al. (2005). LnP(D) value reached its maximum at K = 3, and this method's most appropriate population number was evaluated as 3 (Fig. 3). Based on SSR markers, the substructuring Bayesian analysis identified three subpopulations in our data set (Fig. 4). Accessions with membership coefficiency of 0.80 (Q value) or more to a subpopulation are considered pure (Fukunaga et al. 2005). In contrast, individuals with less membership coefficients are considered admixed by at least two distinct genotypes. In this study, 38 individuals have 0.80 or more membership coefficients; therefore, they are likely pure and non-admixed. The remaining 53 individuals had varying membership coefficients and were likely to be admixed by at least two subpopulations.

Fig. 3
figure 3

A Graphical representation of mean likelihood L (K) and variance per K values on dataset containing 91 apricot accessions

Fig. 4
figure 4

Graph of grouping of Kyrgyzstan 91 apricot accessions obtained with Structure version 3.2 software indicating membership coefficiencies and level of admixture to each subpopulation

When the bar plot obtained by STRUCTURE analysis is examined, it is seen that there have been mixtures and gene transfers between apricot genotypes over many years in the Central Asian region in general (Fig. 4). The STRUCTURE analysis results were compatible with the groupings obtained in the NJ dendrogram in Fig. 2. Consistent with the results in the dendrogram, a grouping appropriate to the geographical distribution of genotypes was obtained in the STRUCTURE analysis. Bourguiba et al. (2017) found that geographical origin and propagation mode are essential for structuring genetic diversity in North African apricot species. Also, Bourguiba et al. (2020) identified strong relationships between memberships of accessions within the clusters and their geographical regions for global apricot accessions. Similarly, Sheikh et al. (2021) determined a relationship between the genetic structure and the geographical origins of the materials in their study on Himalayan apricots.

There were 21 genotypes in the red group in the bar plot, and the Q value was above 0.80 in 8 of them (Fig. 4). Genotypes 13, 15, 21, and 22 in this group showed an almost homogeneous structure. In addition, genotypes such as 19, 42, 46, 47, 16, 29, and 24 are included in the red group, but they also contain infrastructure in the other two groups. On the other hand, numbers 30, 32, and 43 showed almost half the red and blue group resolution. The red group in the bar plot consists of genotypes in group 3 in the NJ dendrogram (Fig. 2). This group generally consists of genotypes belonging to the northern Kyrgyzstan region.

In the bar plot, the green-colored group consists of 29 genotypes, and the Q value of 12 (91, 57, 88, 83, 85, 56, 67, 64, 62, 61, 86, 60) is above 0.80. All genotypes in this group belong to the Batken and Osh regions in southern Kyrgyzstan. The feature of these genotypes is that they are mostly drying. This reveals genetic differences between northern (Issyk-Kul) and southern (Batken, Osh) Kyrgyzstan apricot populations. Similarly, Bourguiba et al. (2020) noted the genetic structure within Central Asia's wild Prunus armeniaca populations, with a highly differentiated Kyrgyz population in the South.

The accessions in this group are distributed in groups 4 and 5 in the dendrogram. In the dendrogram, 2 of the three samples of the 'Mirsandjali' cultivar (85, 86) were placed in group 5, and one (65) was placed in group 4. However, in the bar plot, three samples (60, 80, and 81) were in the green group. Similarly, two “Kandak” genotypes (83, 87) in group 5 in the dendrogram are in the green group (78, 82) in the bar plot. It is seen that the Q values of such genotypes in group 5 in the dendrogram belong to the blue and red groups at a rate of approximately 20–40% in the green group on the barplot. In group 4 in the dendrogram, the Q value of the blue and red groups is much lower than that of the green group. This shows that the genetic background is less complex in these genotypes.

In the bar plot, the blue-colored group contained the most accessions. There are 40 genotypes in this group, and 18 of them have a Q value greater than 0.80. The genotypes in this group are generally distributed in groups 1, 2, 3, and 5 in the NJ dendrogram. The entire group numbered two, and a subgroup of groups 1, 3, and 5 formed the green-colored group in the bar plot. These genotypes mainly include genotypes belonging to the Issyk-Kul region. However, genotypes 64, 73, 74, 75, 76, 78, and 82 belong to the Batken region in southern Kyrgyzstan. On the other hand, it is seen that 20–50% of the 15 genotypes in this group have a green and red group background. This shows that there are interactions between genotypes in the process and that seed propagation occurs occasionally.

Conclusion

The results of our study show that Kyrgyzstan has a significant level of apricot diversity. This study is the first comprehensive genetic diversity study on apricots in this country. The presence of wild, local, and standard cultivars has enabled the formation of a more complex genetic structure with the interactions between them over time. However, the unique genetic diversity in the Central Asian region is threatened with extinction due to unauthorized human economic activity, overgrazing, and the almost complete absence of natural seed regeneration caused by the lack of protective and forest restoration measures (Shalpykov and Dolotbakov 2011). In order to preserve this diversity, it is necessary to establish ex situ core collections and in situ conservation strategies. The results of our study also provide information for establishing such collections. In addition, these results could be used for parent selection and other events for new research.