Findings

Hevea brasiliensis (Willd. ex Adr. de Juss.) Muell. -Arg., native to the Amazon rainforest, is a diploid (2n = 36, x = 9), perennial, monoecious and cross-pollinated tree species. It belongs to the genus Hevea and the botanical family Euphorbiaceae.

The genus Hevea comprises 11 inter-crossable species [1, 2] (H. benthamiana Muell.- Arg., H. brasiliensis H. carmagoana Pires, H. camporum Ducke, H. guianensis Aubl, H. microphylla Ule, H. nitida Mart. ex-Muel.-Arg., H. pauciflora (Spruce ex-Benth.) Muell.-Arg., H. rigidifolia (Spruce ex-Benth.) Muell.-Arg., H. spruceana (Benth.) Muell.-Arg. and H. paludosa Ule), which have evolved in the Amazon rainforest over 100,000 years [3].

Of all the species in the genus Hevea H. brasiliensis is the most economically important, because it is the major source of natural rubber worldwide. Natural rubber is important mainly in the tire industry but also in many other sectors because it is flexible, resistant, impermeable to liquids and abrasion resistant [4]. These singular properties make natural rubber both complementary and competitive to synthetic rubber and furthermore superior to it in varied applications. As is the case for many other perennial trees, rubber tree breeding is time-consuming and expensive. An average duration of 20–25 years of field experiments in large areas is generally required to obtain a new cultivar with reasonably low risks [5].

Molecular markers, such as amplified fragment length polymorphisms (AFLPs), restriction fragment length polymorphisms (RFLPs), random amplified polymorphic DNA (RAPD), simple sequence repeats (SSRs or microsatellites) and single nucleotide polymorphisms (SNPs), have successively become increasingly important in plant breeding. These markers are efficient tools for the assessment of genetic diversity, the identification of quantitative trait loci (QTLs) and/or gene mapping, variety protection and marker-assisted selection (MAS) [6].

Of these markers, microsatellite markers are considered the most suitable for genetic studies, because they combine co-dominance and high polymorphism with abundance, locus specificity and uniform dispersion in plant genomes. Moreover, microsatellite markers can discriminate closely related individuals [7]. In addition, microsatellite analysis is inexpensive, high reproducible and highly transferable across related species.

Following biochemical markers such as isozymes [8], molecular markers have been developed and used since the middle of 1990’s for diversity studies [9, 10], genetic mapping [11] and the identification of genetic loci implicated in the expression of agronomic traits in H. brasiliensis[1114]. However, most of the markers used have been isozyme, RAPD, RFLP or AFLP markers.

To date, few studies using genomic microsatellites [10, 15, 16] or microsatellites from expressed sequence tags (EST-SSRs) have been published [6]. Accordingly, we present the development of genomic microsatellites using dinucleotide- and trinucleotide- enriched libraries, the characterization of these microsatellite markers in multiple accessions of H. brasiliensis and test their transferability in six other Hevea species.

Results

Microsatellite-enriched library analysis

Di- and trinucleotide-enriched libraries were constructed, and the clones from each library were sequenced (576 and 288 clones, respectively).

A total of 291 (50.5%) clones from the dinucleotide-enriched library contained microsatellite sequences. Since dinucleotide probes were used for the library enrichment, these motifs were the most abundant comprising 324 (90.2%) SSRs followed by 20 tetranucleotide (5.6%), 12 trinucleotide (3.3%) and 3 pentanucleotide (0.8%) (Table 1).

Table 1 Number of sequences and SSRs of each type in each library

In the trinucleotide-enriched library, 62 (21.5%) clones contained microsatellites sequences. Trinucleotides were the most frequent motif with 45 (72.5%) SSRs followed by 13 dinucleotide (20.9%), 9 tetranucleotide (14.5%) and 3 pentanucleotide (4.8%) SSRs (Table 1).

For some sequences, both the dinucleotide- and trinucleotide-enriched libraries had more than one microsatellite, which explains the greater number of SSRs found compared to the number of sequences analyzed.

A total of 130 and 32 primer pairs were designed based on the di- and trinucleotide-enriched libraries, respectively. To remove possible redundancies with published SSRs, each sequence containing SSR was compared against GenBank database using BLASTN. Nine sequences, only from the dinucleotide library, were identical to previously published sequences, which already had primer pairs. These sequences were removed from this study.

Dinucleotide motifs have been reported as the most abundant type of microsatellite in plant genomes [17]. Recent studies relating to expressed sequence tags revealed that trinucleotide motifs are the most abundant motifs in ESTs in many plants, such as sugarcane [18], barley [19], grapes [20], rice [21], wheat [22] and citrus [23], whereas in other plants dinucleotide motifs are the most abundant in ESTs, such as kiwi [24], coffee [25] and apricot and peach [26] In rubber tree, Feng and co-workers analyzed all of the ESTs in NCBI database in search for microsatellites. They found that dinucleotide motifs were the most abundant and were three times more abundant than trinucleotides [6]. The low efficiency of the trinucleotide enrichment in our study could result from the low frequency of trinucleotide motifs in the rubber tree genome.

Polymorphism analysis and cross-species transferability

In total, 153 specific primer pairs were designed. A total of 119 primer pairs produced good amplification products (Additional file 1: Table S1) and 90 of these products were polymorphic among a set of 9 rubber clones indicated in Table 2.

Table 2 Genotypes of H. brasiliensis and six species of the genus Hevea used for characterization and transferability

The observed and expected heterozygosities ranged from 0.1111 to 1 and 0.1111 to 0.9150, respectively, and the PIC values ranged between 0.0994 and 0.8496. The mean number of alleles was 4.46 (2–9 alleles).

Of the 90 polymorphic markers we chose 46 SSRs for characterization among a set of 36 genotypes of H. brasiliensis (Table 2). The PIC values of these markers ranged from 0.097 to 0.8339, and the observed and expected heterozygosities ranged from 0.0909 to 0.9167 and 0.1387 to 0.8629, respectively. The mean number of alleles was 6.4 (2–17 alleles) (Table 3).

Table 3 Characterization of the 46 polymorphic SSR markers

Six other species from the genus Hevea (H. guianensis, H. rigidifolia, H. nitida, H. pauciflora, H. benthamiana and H. camargoana) being two different genotypes of H. pauciflora, were used to evaluate the transferability of the markers (Table 2). All loci were tested under the same PCR conditions used for H. brasiliensis.

Of the 46 loci tested, 40 (87%) were amplified for H. guianensis and H. pauciflora- (112CNSG), 39 (85%) were amplified for H. camargoana, H. nitida and H. pauciflora-(116CNSG) and 38 (82%) were amplified for H. benthamiana (Table 4).

Table 4 Cross-amplification of the 46 polymorphic SSRs markers among the other Hevea species

For most of the SSR loci, which cross-species amplification were tested, the number of alleles found in the H. brasiliensis genotypes in conjunction with the tested species (from 2 to 21, with a mean number of 9.5) was larger when the same loci were analyzed in H. brasiliensis alone (Table 3) which means that other species revealed some novel alleles.

Three primers pairs, HB54, HB62 and HB69, did not produce amplification products for the six other species tested, whereas 27 loci were amplified for all species (Table 4).

Saha and co-workers first observed SSR cross-amplification in H. benthamiana and H. spruceana[27]. Together with the cross-fertility potential, this high SSR transferability supports the consideration of the Hevea genus as a species complex with moderate differentiation among the species. These aspects appear to be favorable for genetic introgressions using other Hevea species to the rubber breeding population, which is mainly based on H. brasiliensis.

Locus duplication

Members of the Euphorbiaceae family have a basic number of chromosomes between 6 and 11; thus, any species with more than this number of chromosomes could be amphidiploid in origin [28, 29]. Similar to cassava, plants in the genus Hevea have 36 chromosomes and behave as diploid. In these species, it has been assumed that n = 18 and the basic number of chromosomes is x = 9 [30, 31].

In different Hevea species, chromosomes mainly formed bivalents and tetravalents are rarely produced as a result of pairing between non-homologous chromosomes during prophase I and metaphase I of meiosis [31, 32].

Moreover, cytogenetic studies revealed two distinct loci on two different chromosomes, bearing the same18S-5.8 S-25 S rDNA sequence which may have arisen by the hybridization of two unknown diploid species (n = 9), thus suggesting a possible allotetraploid origin, however no potencial diploid ancestor has been described to date [29].

Locus duplication, as revealed by molecular markers, had been reported in H. brasiliensis 11] and H. guianensis H. rigidifolia and H. pauciflora[15].

In H. brasiliensis, we observed locus duplication for the markers HB45 and HB109 for clones RRIM 701 and Fx4098, respectively. Cases of locus duplication were observed in all Hevea species tested, with the exception of H. rigidifolia. We also observed locus duplication of the HB36, HB68, HB100 and HB105 markers in H. nitida; HB68 and HB105 markers in H. benthamiana; HB101 and HB105 in H. guianensis. The marker HB105 exhibited one duplicated locus for H. pauciflora (112 CNSG), H. pauciflora (116 CNSG) and H. camargoana.

Although we have not been mapped the loci we cited as duplicated in the wild species, we observed more than 2 alleles (excluding the stutter bands) in the same base pair range compared with the other genotypes used for characterization (Figure 1). The presence of allele duplication, as determined by polyacrylamide gel electrophoresis suggests that the loci are duplicated for the species involved.

Figure 1
figure 1

A 6% denaturing silver-stained polyacrilamyde gel for the characterization of the HB101 marker. PCR product for the HB101 marker in the 36 genotypes of H. brasiliensis and the 6 other species of Hevea which more than 2 alleles can be observed for H. guianensis. (1) H. brasiliensis; (2) H. guianensis; (3) H. rigidifolia; (4) H. nitida; (5 ) H. pauciflora - (112 CNSG); (6) H. pauciflora - (116 CNSG); (7) H. benthamiana; and (8) H. camargoana.

Although locus duplication had been described in H. brasiliensis, H. guianinsis, H. rigidifolia and H. pauciflora, this is the first report for H. benthamiana, H. camargoana and H. nitida.

Conclusion

Herein, we report the development of new SSR markers for H. brasiliensis, representing a powerful resource for genetic diversity studies and genetic breeding techniques, such as molecular genetic mapping, QTLs identification and MAS.

Due to observed high percentage of cross-amplification and the absence of reproductive barriers between the species within this genus, these markers may be important tools to monitor the genetic variability from other Hevea species into the current breeding programs.In addition, these SSR markers can be helpful for the identification of important agronomic characteristics in different Hevea species.

Methods

Plant material and DNA extraction

We used 36 genotypes of H. brasiliensis to characterize the SSRs (Table 2). These genotypes were kindly provided by the Agronomic Institute of Campinas (IAC) and Michelin Plantation (Brasil). We also used 6 other species of the genus Hevea (H. guianensis H. rigidifolia H. nitida H. pauciflora H. benthamiana and H. camargoana) that were kindly provided by the Brazilian Agricultural Research Corporation (EMBRAPA - Amazônia Ocidental) in Manaus, AM to test the transferability of the SSRs (Table 2). The genomic DNA samples were extracted from lyophilized leaf tissues using a modified CTAB method [33] and their quality and quantity were assessed using 1% agarose gel electrophoresis.

Construction of microsatellite-enriched libraries and sequence analysis

The microsatellite-enriched libraries for H. brasiliensis were constructed using the RRIM 600 clone according to the methodology described by Billote and co-workers [34]. The DNA samples were digested with AFAI and enriched using streptavidin magnetic-coated beads (Streptavidin MagneSphere Paramagnetic Particles, Promega, Madison, WI) and (CT)8 and (GT)8 biotinylated microsatellite probes for the dinucleotide-enriched library or (ATC)8 and (CCT)8 for the trinucleotide-enriched library.

Selected DNA fragments were amplified by PCR and then cloned into the pGEM-T vector (Promega, Madison, WI). Competent XL1-blue Escherichia coli cells were transformed with the recombinant plasmids and cultivated on agar medium containing ampicillin and 100 μg/ml of X-galactosidase. The clones containing the insert were sequenced using the Big Dye Terminator v3.1 Cycle Sequencing Kit (Applied Biosystems, Foster City, CA) and an automated ABI 377 sequencer (Applied Biosystems, Foster City, CA).

All the sequences obtained were aligned, edited and eliminated if redundant using SeqMan (DNASTAR, Madison, WI). The sequences were also evaluated using Microsat software (A. M. Risterucci, CIRAD, personal communication), which removes the adapters and verifies the presence of restriction sites.

The microsatellites were identified using a specific research tool, SSRIT – (Simple Sequence Repeat Identification Tool) [35]. As a criterion for the SSR selection, the sequences that showed at least five dinucleotide repeats; four trinucleotide repeats; and three tetra-, penta- and hexanucleotide repeats were selected. Primers complementary to the sequences flanking the microsatellites were designed using Primer Select Program (DNAStar, Madison, WI) and Primer 3 [36]. To eliminate possible redundancies with the published SSRs, each sequence containing microsatellite was compared against the GenBank database using BLASTN. The redundant sequences which already had primer pairs published were eliminated.

PCR amplification

The microsatellite fragments were PCR amplified in a 15 μl reaction containing 25 ng template DNA, 0.5 μM each primer, 100 μM each dNTP, 1.5 mM MgCl2, 20 mM Tris–HCl, 50 mM KCl and 0.5 U Taq DNA Polymerase. The PCR amplifications were performed with the following four programs: (1) initial denaturation at 94°C for 4 min, 30 amplification cycles (1 min at 94°C, 45 s at the specific annealing temperature and 1 min at 72°C), and a final extension at 72°C for 10 min; (2) TD-1, initial denaturation for 3 min at 94°C, 10 amplification cycles with a 0.5°C decrease in annealing temperature per cycle starting at 55°C or 60°C (94°C for 1 min, 60°C or 55°C for 30 s, and 72°C for 1 min 15 s); followed by 20 cycles with annealing at 50°C (94°C for 1 min, 50°C for 30 s and 72°C for 30 s) and a final elongation step at 72°C for 10 min; (3) TD-2, previously described by Le Guen and co-workers [10]; and (4) TD-3, initial denaturation at 94°C for 2 min, 2× 10 cycles with a 1°C decrease in annealing temperature per cycle from 65°C to 55°C (94°C for 1 min, 65°C for 1 min and 72°C for 1 min), followed by 18 cycles at 55°C (94°C for 1 min, 55°C for 1 min and 72°C for 2 min), and a final elongation step at 72°C for 5 min.

The amplification products were resolved by electrophoresis through 3% agarose gels prior to vertical electrophoresis using 6% denaturing polyacrylamide gels and were subsequently silver stained [37]. The product sizes were determined by comparison with a 10 bp DNA ladder (Invitrogen, Carlsbad, CA).

Analysis

The allelic polymorphic information content of each SSR was calculated using the formula P I C = 1 i = 1 n p i 2 i = 1 n j = i + 1 n 2 p i 2 p j 2 , where n is the number of alleles of the marker among the set of genotypes used for characterizing the SSR polymorphism, and pi and pj are the frequencies of alleles i and j. The observed and expected heterozygosities were calculated using the TFPGA program [38].