Introduction

The genus Gossypium (Malvaceae family) exhibits great phenotypic variation and includes approximately 50 species (Campbell et al. 2009). Because of its high economic significance, the genus has gained much attention from taxonomists, evolutionary biologists, and agricultural scientists (Wendel et al. 2009). Some Gossypium (G.) species are known for morphological resemblance, making it difficult to identify them, especially if fruits are not present (Stanton et al. 1994). The widely cultivated Cotton belongs to the two species Gossypium barbadense and Gossypium hirsutum L. domesticated in South America. The new cultivars of the two species are introgressed to each other (Wendel et al. 2009). Cotton is the most widely used natural fiber in the world. However, the cultivars grown for commercial purposes are nearly the species G. hirsutum, derived from only a few cultivars and hence have a limited genetic base (Moiana et al. 2015).

On the other hand, G. barbadense is the second most cultivated species, known for the best fiber quality (Liu et al. 2015). Cotton is a fiber and oil crop with a good fiber texture and high-quality oil (Aslam et al. 2020). Cotton is a major source of income and employment in many nations, with millions of people employed in crop production, processing, and distribution (Chaudhry et al. 2003). Egyptian and Pima cotton are grown for extra-long, strong, and fine fiber (Hussein et al. 2007).

In Egypt, Cotton provides a source of income for millions of people who work in the textile sector, either directly or indirectly. Egyptian cotton breeders are concentrating their efforts on improving yield for long-staple Cotton and attempting to develop new, improved varieties with desirable traits for both farmers and breeders are continuous (Yehia and El-Hashash 2019). Breeders improved the upper half mean length, uniformity index, and strength of these Cotton, determined by the longest staple length in the long-staple category. Furthermore, the strength level is comparable to or near that of extra-long-staple Cotton allowing long-staple Cotton to compete in spinning performance and yarn quality with extra-long-staple Cotton (Abdelbary et al. 2021).

Plant morphological traits are continuously used to assess the genetic variability between and within crop plant populations (Begna 2021). The Genetic diversity of Cotton is important for sustainable development for breeding new genotypes, and it is also critical to select parents for plant breeding programs (Bertan et al. 2007). The first step in producing germplasm and crop cultivars is to characterize genetic diversity and the degree of connection between and within genetic resources, which are regarded as sources for new crop varieties (Govindaraj et al. 2015; Han et al. 2022) and essential for the crop improvement success (Rana et al. 2007). Genetic diversity data is vital when attempting to improve crops and develop new varieties (Bakhsh et al. 2019; Pereira et al. 2015; Swarup et al. 2021).

The use of molecular markers has become an important additional requirement for understanding the genetic basis in addition to sets of morphological traits (Selvi et al. 2013). Molecular markers have been used to measure the genetic diversity and relationships between species and their wild relatives in Cotton (Ditta et al. 2018; Hoffmann et al. 2018; Saif et al. 2017; Sethi et al. 2015; Tidke et al. 2014). Polygenic morphological traits are influenced by the environment and are mostly quantitatively inherited (Hassan 2018; Lukonge et al. 2007). Inter Simple Sequence Repeat (ISSR) has been applied in many genetic diversity studies. ISSR is a simple and informative genetic marker system in Cotton for revealing inter- and intraspecific variation (Abdellatif et al. 2012; Farahani et al. 2018; Kahodariya et al. 2015; Liu and Wendel 2001). It uses the primers complementary to a single SSR and anchored at either the 5′ or 3′ end with a one- to three-base extension. The ISSR markers are robust, reliable, quick, efficient, and reproducible, with greater discriminative ability than the other techniques (Abdellatif et al. 2012; Dongre et al. 2007; Preetha and Raveendren 2008; Rana et al. 2007).

ISSR markers have been used for differentiating cotton genotypes. For example, the cotton genotypes (G. barbadense L.) were clustered into two major clusters using a UPGMA cluster analysis based on ISSR polymorphism, according to Hoffmann et al. (2018). Also, they concluded that the G. barbadense germplasm had a narrow genetic diversity, and the genetic relationship was attributed to their similar ancestors. The present study aims to analyze the genetic diversity among Cotton (G. barbadense) varieties using Inter-Simple Sequence Repeat markers (ISSR), evaluate the variation in morphological traits to differentiate five varieties of Egyptian Cotton and estimate the genetic distance between them.

Material and methods

Plant material

Five Egyptian Giza varieties of Gossypium barbadense L. were involved in this study. The varieties' names, abbreviations, pedigrees, and the year of release of these genotypes are illustrated in Table 1. Seeds were kindly provided by the Cotton Research Institute, El-Marashda Research Station, Agricultural Research Center-Egypt. Plants of the 2nd generation are grown under newly reclaimed lands in El-Marashda city, Qena governorate, Egypt. The second author prepares a herbarium collection. The specimens are dried and kept in Botany & Microbiology Department—South Valley University Herbarium (QNA- proposed acronym). The nomenclature and synonyms of G. barbadense L. are reviewed according to www.tropicos.org; after referring to (Fryxell 1969), a specimen sheet of it is revised from a collection of the National History Museum- London, Appendix 1.

Table 1 Varieties names, Abbreviations, pedigrees, and the year of release of the studied cotton genotypes

Morphological characters

This study concerns some vegetative and reproductive, quantitative, and qualitative morphological characters. They are also investigated based on the crop yield of these genotypes and previously prepared herbarium specimens. The quantitative characters of the stem and leaves are the stem height (cm), no. of vegetative branches, no. of fruiting branches, the position of the 1st fruiting branch in relation to the node order, petiole length, leaf length, leaf wide (cm), no. of leaf lobes, lobes length and lobe width (cm). The qualitative characteristics of the stem are its hairiness, color, black spots amount, outline shape, petiole color, leaf bract shape, flower color, internal petaloid spots' color, staminal tube length, stigma height in relation to anthers, and the anthers color. Three replicates of specimens for each genotype are examined—fifteen readings for each specimen record observations and reading of the characters.

ISSR-PCR reactions

Eleven ISSR primers were used to detect polymorphism. The amplification reaction was carried out in 25 μl reaction volume containing 12.5 μl Master Mix (Sigma), 2.5 μl primer (10pcmol), 3 μl template DNA (10 ng), and 7 μl dH2O, according to (Adhikari et al. 2015).

Thermocycling profile PCR

PCR amplification was performed in a Perkin-Elmer/GeneAmp® PCR System 9700 (P.E. Applied Biosystems) programmed to fulfill 40 cycles after an initial denaturation cycle for 5 min at 94ºC. Each cycle was composed of a denaturation step lasting 1 min at 94 °C, an annealing step lasting 1 min at 45 °C, and an elongation step lasting 1.5 min at 72 °C.In the final cycle, the primer extension segment was extended to 7 min at 72ºC.

Detection of the PCR products

The amplification products were resolved by electrophoresis in a 1.5% agarose gel containing ethidium bromide (0.5ug/ml) in a 1X TBE buffer at 95 V. PCR products were visualized on U.V. light and photographed using a Gel Documentation System (BIO-RAD 2000).

Data analysis

The morphological measurements represent the means with standard error mean (SEM), and an Analysis of Variance (one-way ANOVA) of all the morphological measurements was performed using the XLSTAT software (Addinsoft 2021) following Steel and Torrie's (1997) method. The mean comparison of the treatments was investigated using the L.S.D test at the level of significance (p < 0.05). Correlation coefficient (Pearson) values between morphological traits were obtained using the SPSS program version-20 (Dunn 2013). The environmental, phenotypic, and genotypic coefficients of variations and their variance were estimated according to Singh and Chaudhary (1985), and the heritability (broad sense) was determined based on the genetic mean according to Allard (1999). The genetic Advance was calculated per the formula by Johnson et al. (1955) using the variability package of R statistical software in RStudio version 1.4.1717 (Popat 2020).

For ISSR analysis, only clear and unambiguous bands were visually scored as either present (1) or absent (0) for all samples, and the final data sets included both polymorphic and monomorphic bands. Then, a binary statistic matrix was constructed. The PAST 3.22 software (Hammer et al. 2001) was used to construct cluster trees (dendrogram) according to the Euclidean distance coefficient using the unweighted pair group method with arithmetic averages (UPGMA).

The potential of the ISSR markers in the estimation of genetic variability was assessed by measuring the Heterozygocity index (H); Polymorphic Information Content (PIC); Effective multiplex ratio (E); Arithmetic mean of H (H.av); Marker Index (MI); Discriminating power (D); Resolving power (R) according to (Amiryousefi et al. 2018).

Results

Morphological characters

The detailed data in Table 2 show valuable variation in the morphological traits among the five varieties of G. barbadense. The plant stem was mostly green or greenish-red, rarely green to red herbs; stems outline circular, polygonal, or circular to polygonal, glabrous, with few to dense black spots on stems (dense in G 89). The stem height ranged from 82 cm to 103.2 cm in G 89 and G 95, respectively. Giza 90 genotype showed the lowest measurements in most quantitative traits, including No. of branches bearing fruits (16.33* ± 0.56), the position of 1st branch bearing fruit in relation to nodes (6.60* ± 0.51), Petiole L (6.15* ± 0.15 cm), Leaf width (6.49 ± 0.24 cm), and Lobe length (3.77 ± 0.40 cm). Giza 86 genotype exhibited the highest values in some morphological characters such as No. of branches bearing fruits (19.53* ± 0.67), the position of 1st branch bearing fruit in relation to nodes (9.07* ± 0.28), and Petiole L (7.61* ± 0.24), while the Giza 80 genotype showed the highest values in the Leaf length (7.69* ± 0.25), lobe length (4.58 ± 0.25), and lobe width (3.15 ± 0.17). Leaves were bracteate, bract linear-lanceolate; petiolate, green, greenish-red or green to greenish-red; flowers were yellow; petals' basal internal spots deep purple in G 86 and G 89 varieties and ranges between length or deep purple in other varieties; staminal tube in all genotypes is short; anthers color shades are orange/yellowish orange or purplish yellow, brilliant yellow tending to orange or orange to purplish orange and yellowish-orange to brilliant or purplish yellow; stigma commonly higher than the anthers' height.

Table 2 Some vegetative and reproductive morphological characters of five varieties of G. barbadense L

Key for G. barbadense L. varieties based on morphological characters

1.a

Stem outline polygonal, with few black spots…

G 80

b

Stem outline not polygonal, with rare/few to dense black spots…

2

2.a

The average no. of vegetative branches is 1.4–1.5…

3

b

The average no. of vegetative branches is > 2.0…

G 90

3.a

Lobe/leaf L. ratio = 0.5…

4

b

Lobe/leaf L. ratio more than 0.5…

G 89

4.a

Anthers orange or purplish yellow…

G 95

b

Anthers yellowish orange or purplish orange…

G 86

Table 3 shows the Pearson correlation coefficient between each pair of morphological traits based on cotton varieties. As shown in Table 3, there was a highly positive significant correlation between leaf length and leaf width (0.529**), between stem height and No. of branches bearing fruits (0.477**), and between leaf width and lobe width (0.443**). In the meantime, the position of the 1st branch bearing fruit in relation to nodes had a significant positive correlation with each of No. of branches bearing fruits (0.278*), Leaf width (0.250*), and Lobe width (0.293*). Also, the petiole L significantly and positively correlated with Leaf length (0.280*) and Leaf width (0.255*). Other negative significant correlations were found between No. of vegetative branches, and each of No. of branches bearing fruits ( − 0.261*) and Petiole L ( − 0.246*) and between Stem height and Lobe length ( − 0.238*). Also, a significant negative correlation was found between No. of leaf lobes and the Lobe width ( − 0.264*).

Table 3 Pearson correlation coefficient between each pair of morphological traits based on the examined G. barbadense varieties

Sum Square, Mean Square, F-value, and probability value from Analysis of Variance for investigated traits in the examined G. barbadense varieties are presented in Table 4. The results revealed that Stem height, the position of 1st branch bearing fruit in relation to nodes, and Leaf length were highly significant differences as probability value Pr (> F) was ≤ 0.001. Meanwhile, No. of branches bearing fruits and Petiole L were highly significant differences among varieties (P value ≤ 0.01), while the other traits showed non-significant differences among the studied varieties.

Table 4 Sum Square, Mean Square, F-value, and probability value from Analysis of Variance in the examined G. barbadense

The data for the Analysis of Variance in the examined G. barbadense varieties are shown in Table 5. The data showed that the environmental variance, genotypic variance, phenotypic variance, heritability (in the broad sense), and genetic advance values for the stem height trait were all the highest. The number of vegetative branches was the trait with the highest values for the environmental, genotypic, and phenotypic coefficient of variation. As for heritability, the data indicated that heritability was the highest value for stem height, followed by the position of 1st branch bearing fruit in relation to nodes, leaf length, the number of branches bearing fruits, and Petiole length.

Table 5 Estimates of variance components and genetic parameters for various quantitative traits in the examined G. barbadense varieties

Phenotypic and genotypic correlations

Phenotypic and Genotypic Correlation in the quantitative characters of the examined G. barbadense genotypes were calculated among all characters (Table 6) after excluding the traits with negative genotypic variance in Table 5. The data revealed that the phenotypic correlation ranged from 0.0017 to 0.5137, while the genotypic correlation ranged from 0.5947 to 2.4007. it was clear that the genotypic correlation was higher in magnitude than the phenotypic correlation. It is noteworthy that stem height has a highly significant positive correlation with the number of branches bearing fruits at phenotypic and genotypic levels. Also, the number of branches bearing fruits had a significant positive correlation with the position of 1st branch bearing fruit in relation to nodes at genotypic and phenotypic levels. The leaf length and width had a highly significant positive correlation only at a phenotypic level which corresponds to the Pearson correlation in Table 3. There was a leaf length significantly negative correlation with the number of branches bearing fruits (p < 0.01) and Leaf width (p < 0.05) only at genotypic correlation. The petiole length trait had a significant positive correlation with the position of 1st branch bearing fruit in relation to nodes at the genotypic level and with the leaf length and Leaf width traits at the phenotypic level.

Table 6 Phenotypic (above diagonal) and Genotypic (below diagonal) Correlations of the quantitative characters for the examined G. barbadense

ISSR molecular marker analysis

As shown in Table 7 and Fig. 1, the 11 used primers produced 134 bands for the five cotton genotypes; 42 bands were polymorphic, 85 monomorphic, and 7 bands were unique. The primer ISSR-06 produced the highest number of bands (18), and primer ISSR-03 and ISSR-11 had the lowest number (9). The number of polymorphic bands varied between primers; the bands produced by the primers ISSR-06 showed the highest number of polymorphic bands (12 bands), while primers ISSR-01, ISSR-08, and ISSR-11 showed one polymorphic band (the lowest number of bands). The number of polymorphic bands and percentage of polymorphism in the ISSR profile of the 11 primers are given in Table 2. The percentages of polymorphism varied and reached 67% in ISSR-06.

Table 7 Primers codes, number of polymorphic, monomorphic, unique bands, and polymorphism percentage calculated by the analysis of ISSR fingerprinting in the examined G. barbadense varieties
Fig. 1
figure 1

ISSR profile produced by 11 primers in the examined G. barbadense varieties

Table 8 illustrates the marker parameters, Heterozygosity index (H); Polymorphic Information Content (PIC); Effective multiplex ratio (E); Arithmetic mean of H (H.av); Marker Index (MI); Discriminating power (D), and Resolving power (R). The mean of PIC values is analyzed for all loci of each primer. A high PIC value of 0.278 (ISSR-01) and a low PIC value of 0.193 (ISSR-07) were obtained with an average PIC per primer of 0.239. The effective multiplex ratio (E) depends on the fraction of polymorphic fragments. Our study observed the highest effective multiplex ratio, 13.6, with the primer ISSR-06. The lowest effective multiplex ratio (E), 7.4, was observed with the primer ISSR-03, with an average E of 18.6 per primer. The MI (marker index) for each ISSR primer was calculated to determine the usefulness of the system of markers used. The highest MI was observed with three primers, ISSR-07, ISSR-05, and ISSR-06 (0.0585, 0.0563, and 0.0558, respectively), and the lowest in the primer ISSR-01 (0.0064), with an average MI of 0.042 per primer was obtained. The resolving power (R) parameter indicates the discriminatory potential of the primers chosen. With an average R of 5.2 per primer, the primers with the greatest and lowest R-values were ISSR-06 and ISSR-01 (8 and 0.40, respectively). The expected heterozygosity HE of the diversity index DI is another name for the heterozygosity index (H). It is defined as the probability that an individual is heterozygous for the locus in the population. The highest value of the H index was observed with three primers, ISSR-07, ISSR-05, and ISSR-06 (0.413, 0.375, and 0.369, respectively), and the lowest in the ISSR-01 (0.033), with an average H of 0.259 per primer. The highest value of Discriminating power (D) was observed with the ISSR-07 primer (0.501) and the lowest with the primer ISSR-01 (0.033), with an average D of 0.293 per primer.

Table 8 Marker parameters calculated for each ISSR primer used with the G. barbadense varieties

The UPGMA cluster analysis was constructed to illustrate the genetic behavior among the examined G. barbadense varieties based on morphological traits, ISSR polymorphism, and the combination of them (Fig. 2A–C, respectively). Based on morphological traits and the combination between morphological traits and ISSR fingerprinting, the G90 was separated from the other four varieties that were grouped into two clusters; one cluster comprising G86 and G89 and the other cluster containing G80 and G95. The UPGMA tree illustrating the genetic diversity based on ISSR polymorphism separated the five varieties into two major clusters; the first one included three varieties, G90, G95, and G89, while the second cluster included G80 and G86.

Fig. 2
figure 2

UPGMA cluster analysis of five G. barbadense varieties based on morphological traits (A), ISSR fingerprinting (B), and a combination of morphological & ISSR fingerprinting (C)

Discussion

The morphological description of Egyptian cotton genotypes indicated narrow differences in the qualitative traits. This is consistent with the results of many authors who used the morphological variation in cotton traits to divers among different genotypes of the plant, indicating their genetic diversity (Amer et al. 2016; El-Seidy et al. 2017; Hassan 2018; Hoffmann et al. 2018; Khan and Hassan 2011; Sen et al. 2022). (Hoffmann et al. 2018) found a significant variability for morphological characters in Gossypium accesses, underlining the possibility of using it to supplement breeding programs. Many positive and negative significant Pearson correlations were found between cotton genotypes in the study of Yehia and El-Hashash (2021). Many morphological characteristics of Cotton exhibited Pearson correlations between them, especially those about yield; this correlation positively or negatively was linked to allele accumulation and the plant's genetic potential (Hinze et al. 2011). The high Sum Square and Mean Square for some traits and the significance of the p value indicate the existence of considerable genetic divergence among the studied genotypes (Abd El-Moghny et al. 2015). This information could benefit cotton breeders looking to achieve a certain level of diversity in specific morphological traits. It could aid in the identification and engineering of crosses. Our results revealed that the heritability values were high for some quantitative traits, which is consistent with (Khan and Hassan 2011), which found high heritability values between different cotton genotypes and concluded that the desirable genotypes could be maintained through simple selection in segregating generation. These results are in agreement with many studies for other plants found that a lesser magnitude of phenotypic correlation coefficients than the genotypic correlation coefficients (Baye et al. 2020; Dabi et al. 2016; Oladosu et al. 2018; Sen et al. 2022); this indicated that many morphological traits have underlying genetic relationships, and the environment less influences the phenotypic expression of these traits.

Analysis of diversity using PCR-based markers is an efficient and speedy approach to identifying genotype relationships and/or differences among genotypes (Rana et al. 2005; Schulman 2007; Zhao et al. 2015). In many studies on Gossypium genotypes and other plants, when compared to the different molecular markers utilized, ISSR markers produced the largest percentage of polymorphic bands (Abdellatif et al. 2012; Jedrzejczyk and Rewers 2020; Liu et al. 2006); it's believed that this is because ISSR is a dominant marker that measures the distance between two microsatellites. Variable factors like primer structure, template quantity, and the genome's lower proportion of annealing sites influence the number of bands amplified by different primers (Muralidharan and Wakeland 1993). In a similar study on the G. barbadense genotypes, (Saif et al. 2017) found that the lowest number of polymorphic fragments and percentage of polymorphism were reported for ISSR markers; in comparison, in our study, the percentage of polymorphism was given high values (ISSR's effectiveness). The PIC values were smaller than those obtained by (Abdellatif et al. 2012), whereas the PIC values of their study primers ranged from 0.75 to 0.94. The UPGMA of ISSR and morphological data combination indicated that varieties with the same ancestors were grouped together.

In conclusion, the cultivars employed in this study can be used as parents to expand the genetic basis of cotton germplasm in Egypt and for the development of new high-yielding types to enhance cotton production and quality.