Isolation and characterization of microsatellite markers and analysis of genetic variability in Curculigo latifolia Dryand
- First Online:
- Cite this article as:
- Babaei, N., Abdullah, N.A.P., Saleh, G. et al. Mol Biol Rep (2012) 39: 9869. doi:10.1007/s11033-012-1853-z
Curculin, a sweet protein found in Curculigo latifolia fruit has great potential for the pharmaceutical industry. This protein interestingly has been found to have both sweet taste and taste-modifying capacities comparable with other natural sweeteners. According to our knowledge this is the first reported case on the isolation of microsatellite loci in this genus. Hence, the current development of microsatellite markers for C. latifolia will facilitate future population genetic studies and breeding programs for this valuable plant. In this study 11 microsatellite markers were developed using 3′ and 5′ ISSR markers. The primers were tested on 27 accessions from all states of Peninsular Malaysia. The number of alleles per locus ranged from three to seven, with allele size ranging from 141 to 306 bp. The observed and expected heterozygosity ranged between 0.00–0.65 and 0.38–0.79, respectively. The polymorphic information content ranged from 0.35 to 0.74 and the Shannon’s information index ranged from 0.82 to 1.57. These developed polymorphic microsatellites were used for constructing a dendrogram by unweighted pair group method with arithmetic mean cluster analysis using the Dice’s similarity coefficient. Accessions association according to their geographical origin was observed. Based on characteristics of isolated microsatellites for C. latifolia accessions all genotype can be distinguished using these 11 microsatellite markers. These polymorphic markers could also be applied to studies on uniformity determination and somaclonal variation of tissue culture plantlets, varieties identification, genetic diversity, analysis of phylogenetic relationship, genetic linkage maps and quantitative trait loci in C. latifolia.
KeywordsSSR markers5′ and 3′ anchored ISSRGenetic variationLembaPolymorphism
Lemba (Curculigo latifolia Dryand) a monocotyledonous perennial herb belongs to the Hypoxidaceae family. The genus Curculigo comprising of about 20 species is distributed in the tropical regions of Asia and Africa . C. latifolia is widely spread in primary and secondary forests throughout Malaysia. Species confounded in Borneo are C. racemose and C. orchioides. Curculin, which is extracted from fruits of C. latifolia has been found to have a sweet taste with sweetness-modifying characteristics of natural sweeteners and has been shown to be a good low-calorie sweetener [2, 3]. It has been proven that curculin is up to 9,000 times sweeter than sucrose  and has antidiabetic properties . Therefore, this plant has great potential for the pharmaceutical and food industries. Plants are currently being brought into cultivation. However, prior to cultivation, it is prudent to look at the fundamental knowledge of the population structure of this plant. The beneficial characteristics of the species can be further enhanced through plant breeding, but characterization of the available species in Malaysia is loose and considered necessary before any breeding work can commence. Determination of genetic diversity and population structure are prerequisites of breeding programs and a first step in the development and evaluation of plant genotypes.
Microsatellites or simple sequence repeats (SSRs) are tandemly repeated motifs of 1–6 nucleotides found in all prokaryotic and eukaryotic genomes . Since microsatellites are co-dominants inherited, highly abundant, polymorphic, multiallelic, and reproducible with transferability characteristics, they become one of the most desirable markers for use in genetic studies . Interestingly, SSR has been the marker of choice for assessment of genetic variability in many plant species such as commercial peach varieties , sugar beet , barley  and chickpea , analysis of phylogenetic relationship , marker assisted selection , construction of genetic linkage maps , and quantitative trait loci (QTL) . Microsatellites may be identified by screening DNA databases, but for this genus no sequence information has been reported and no microsatellite markers were isolated and developed. It was therefore essential to develop microsatellites for C. latifolia. The knowledge generated could also be used in related species.
ISSR-PCR is an alternative strategy that has been devised to reduce the time invested in microsatellites isolation and to significantly increase yield  without the need for enrichment and/or hybridization screening . Besides, the ISSR-PCR technique targets only those regions of the genome that are rich in microsatellite motifs [18, 19]. In plants, the construction of microsatellite markers with both 3′ and 5′ anchored ISSR-PCR strategy has been proven to be effective in producing polymorphic loci for different species such as, Canada thistle , wheat [21, 22], oil palm , Japanese persimmon , and turnip .
This study was performed with the objectives of to develop polymorphic microsatellite markers for C. latifolia using 5′ and 3′ anchored ISSR primers; and to determine the suitability of developed microsatellite markers for genetic variation study using constructing a dendrogram and demonstrating relationships among C. latifolia accessions.
Materials and methods
Genomic DNA was extracted from the young leaf of 27 accessions from Peninsular Malaysia using GENE √ ALL™ Plant SV Mini Kit (from General Biosystem, Seoul, Korea) following manufacturer’s instructions with the DNA concentration adjusted to 70 ng/μl.
Microsatellite markers development
In this study an accession from Ringlet was used for PCR amplification using ISSR markers. Both 3′ and 5′ anchored ISSR primers were used for microsatellite markers development. The 3′ anchored ISSRs were UBC815 and UBC835 with sequences of (CT)8G and (AG)8YC, respectively. Three 5′ anchored ISSRs used were RAM1, BP8 and BP10 with sequences of YHY(CCA)5, KKYHYHYHY(GTT)5 and KKDRDRD(TC)10 respectively, where Y = C/T, H = A/T/C, K = G/T, D = G/A/T and R = A/G.
PCR was carried out in a total volume of 25 μl including deionized water, 1 × PCR buffer plus MgSO4, 200 μM dNTP mix, 0.6 μM primer, 0.75 Pfu DNA polymerase and 70 ng/μl DNA. The thermal cycler with the touchdown thermal cycling protocol starting with three minutes of denaturing at 94°C was followed by the remaining thermal cycling protocol where temperature was set to 94°C for 40 s. The annealing step was started at 10°C above optimum annealing temperature for 30 s and then reduced by 1°C per cycle until optimum annealing temperature followed by 60 s extension time at 72°C. The program was followed by the remaining thermal cycling protocol where temperature was set to 95°C for 40 s, then at the primer’s optimum annealing temperature for 50 s, and extension at 72°C for 60 s, for a total of 30 cycles with a final ten minutes extension at 72°C. Amplified products were resolved via 2 % agarose gel, stained by ethidium bromide and visualized by UV-light.
Fragments ranging from 250 to 1,300 were purified using Gene JET™ PCR Purification Kit (Fermentas) and ligated into pCR®II-Blunt-TOPO® vector (Zero Blunt® TOPO® PCR Cloning Kit, Invitrogen®) following manufacturer’s instructions and then transformed into Escherichia coli DH5α component cells. Transformed clones were grown overnight in selective media (LB-Amp). Ten randomly selected recombinant clones were grown overnight in LB broth and plasmids DNA were extracted using PureLink™ Quick Plasmid Miniprep Kit (Invitrogen®). Extracted plasmid DNA of recombinant clones was sequenced. Microsatellite motifs were screened using Microsatellite Repeat Finder—Online Bioinformatic Tools and primers were designed using PRIMER3 Version 0.4.0 (http://frodo.wi.mit.edu/primer3/). The major parameters for primer design were set as follows: primer length from 20 to 25 nucleotides, PCR products size from 140 to 310 bp, annealing temperatures at 55.5–61.5°C and GC content of between 40 and 60 %.
The binary data attained from scoring of microsatellite markers were analyzed using NTSYS-pc 2.1 in order to reveal the genetic variability and associations among C. latifolia accessions. The coefficients of genetic similarity were computed using Dice’s similarity coefficient. This similarity matrix was used to create a dendrogram using the unweighted paired group method using arithmetic average (UPGMA). Principal component analysis (PCA) was also carried out to explore associations among accessions .
Gap statistics was computed to estimate the number of clusters in the dendrogram . The gap statistics were analyzed using R software version 2.15.0 using matrix data.
Sequence analysis of cloned fragments
Characterization of the microsatellite loci in 27 C. latifolia accessions
Gene bank accession no.
Primer sequence (5′–3′)
Allele size range (bp)
F:CCA ACT ATC CTT TCC CGA CA
R:TGG GTA GGG GTC CTC TCT CT
F:CTC TCT CTC TGT GCC CCA AG
R:CGC ACC ATA CGT TTG TTT GA
F:GAG AGC CAC GAG TAA AGA GTC A
R:AAG GCT TAC ACT AAT GAT TTG CTT
F:CCG GTT GAG GAT ACA AAT GG
R:GGA CCA GCT GAG CAT TGA TT
F:CCG GTT GAG GAT ACA AAT GG
R:AAG CGG GAG AGG CAT TTA TT
F:GAG AGA GAG AGA GCC CAGCA
R:TTG GCC ATG AAA TTT TGT CC
A total of 235 microsatellite regions were identified from the sequencing results consisting of different microsatellite core units. Among all microsatellite motifs found, the dinucleotide core was the most frequent with 121 motifs, followed by mononucleotides and trinucleotides with 64 and 47 motifs, respectively. In contrast, only two and one tetranucleotide and pentanucleotide motifs respectively were found, and no hexanucleotide microsatellite motif was found among all sequences. Among the selected sequenced clones, 86.1 % (31 out of 36 unique sequences) contained internally located microsatellite motifs in addition to those at the ends with variable flanking regions on both sides of the motifs. Based on Weber’s (1990) classification rules , among the 36 unique sequences, 99 perfect microsatellites without interruption, five compound repeat sequences with adjacent tandem microsatellites of a different sequence, and 10 imperfect/interrupted compound microsatellites with one or more interruptions in the run of repeats were found centrally located at the sequences.
Microsatellite polymorphism within C. latifolia accessions
Genotyping and variability analysis
Gaps calculated for estimating number of clusters in the dendrogram
As indicated earlier the genus Curculigo, although widely distributed in Malaysia, Indonesia and Brunei, has not been characterized for genomic sequences and microsatellite development. Therefore, these developed microsatellite markers for C. latifolia could facilitate future population genetic studies and breeding programs for this plant and related species.
Microsatellite markers are characterized by a high degree of variability making them powerful tools for population genetic analyses . The conventional protocols used for the isolation of microsatellites are cost, time and labor intensive and the efficiency of microsatellite isolation is low . The primer extension strategy has been proven to be useful for the isolation of dinucleotide repeat microsatellites. Although enrichment method is more desirable than traditional, it is still time consuming because of many several steps and needs more investigation to gain tri- or tetra nucleotide containing microsatellites . The isolation of microsatellites from plants is technically more demanding as their frequency in plants is relatively low comparing to animal genomes . To overcome these limitations ISSR-PCR technique as targets only those regions of the genome those are rich in microsatellite motifs is a desirable strategy. This technique was highly successful, as over 86 % of unique clones obtained, contained internally located microsatellite motifs in addition to those at the ends.
In this study using ISSR-PCR technique, 11 polymorphic microsatellites have been developed and screened in 27 accessions of C. latifola. All the variability parameters calculated for the microsatellites described in the study indicated that microsatellites will become a useful tool for genetic variation studies, genotype identification and similarity analysis in C. latifolia. The average of 5.1 observed alleles and 3.12 of effective alleles per locus was detected in this study. The difference between average number of observed alleles and effective number of alleles was due to the uneven frequency of each allele . PIC provides an estimate of discriminatory power of a marker to differentiate genotypes based on both the number of alleles expressed and their relative frequencies . The average of PIC value was 0.59 which indicate an isolation of highly polymorphic microsatellites. This was consistent with findings on potential applications of ISSR-PCR technique in developing high polymorphic microsatellites in Sphagnum capillifolium with low level of genetic variation . Overall genetic variability for the accessions studied, represented by Shannon’s indexes, was particularly high with the average of 1.13. The high value of Shannon’s information index represents the effectiveness of microsatellite loci to reveal the variation. The results indicated that all loci deviated significantly from HWE. Possible explanations for deviations of loci from HWE are heterozygote deficiency in loci , population size, and propagation through rhizomes.
The values obtained with Dice’s coefficient indicated that the extent of genetic variability among accessions varies, but that in most cases, genetic similarity is higher among accessions from one location or neighboring states. The highest Dice’s similarity coefficient (0.81) was found between accessions 7 and 8, indicating that they had almost the same genetic constituents based on the 11 microsatellite primers used. The lowest similarity coefficient (0.00) was found between accessions 1 and 5 which indicates that they were relatively remote in relationship.
The high level of genetic polymorphism was clearly evident from the dendrogram. C. latifolia is reported to be a cross-pollinating species . The relatively high level of polymorphism could be due to cross pollination in this species, however low level of variation among accessions from confined population of a small size could be referred to propagation through rhizome in the species. The low genetic diversity among accessions of C. latifolia taken from one location also reported as a result of vegetative propagation through rhizome . The rhizome propagation theoretically has a similar effect in population genetic structure as strict selfing .
The two-dimensional graph of accessions differentiation was revealed by PCA. This type of graphical illustration enables the assessment of the population structure and geometric distances among all of the accessions in the study . The distribution of the accessions in the two-dimensional graph based on the first two principle components was similar to that obtained from cluster analysis, where all accessions collected from one location in Jelebu were distinctly separated from other accessions.
In conclusion, 11 polymorphic microsatellite markers in C. latifolia were developed by the 5′ and 3′ anchored PCR technique. All the loci showed considerable variation in the population of this plant collected from within Malaysia. These results indicated that microsatellite primers tested could clearly distinguish the different sets of genotypes. The characteristics of these loci provide useful information for further studies on population genetics, assessment of genetic stability and somaclonal variation, construction of genetic linkage maps and mapping of economically quantitative trait loci, estimation of genetic diversity and divergence in C.latifolia and related plants. The use of these microsatellite markers will also facilitate the management and exploration of genetic resources of Hypoxidaceae in the lower Asparagales and assist in their genetic improvement to some extent.
The authors would like to acknowledge the Ministry of Agriculture, Malaysia for funding this project under the e-Science Fund (05-01-04-SF1051).
This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.