Introduction

The genus Ternstroemia Mutis ex L.f. contains between 110 and 160 species mainly distributed across the tropical and subtropical regions worldwide [1, 2]. In Mexico, some Ternstroemia species are considered either diagnostic or associated with tropical montane cloud forests (TMCF) [3], while other species across the world are widespread in both tropical and temperate forests. According to climate-change scenarios, the TMCF will face severe threats regarding physiological adaptation and survival [4]. Among the species of Ternstroemia, some taxa such as T. dentisepala B.M. Barthol. and T. huasteca B.M. Barthol. are considered a priority for conservation since they are endemic and geographically rare [5]. Ternstroemia huasteca is considered “vulnerable” in the IUCN Red List [6]. Moreover, Mexican species such as T. chalichophila, T. dentisepala, and T. impressa belong to the Ternstroemia lineata species complex, a taxonomic group with unresolved relationships, putatively obscured by ongoing interspecific processes [7].

In Mexico, most genetic diversity research on forest species has been focused on timber species, such as the Pinaceae and the genus Quercus. In contrast, the knowledge of genetic patterns in other tree species is relatively scarce [8]. Nevertheless, some studies on Mexican TMCF species such as Abies [9], Chiranthodendron [10], Liquidambar [11], and Podocarpus [12] have identified historical gene drift and populational isolation during the Pleistocene interglacial periods [13].

The biogeographic history of Ternstroemia remains poorly known because there are no comprehensible explanations of its amphipacific distribution. However, efforts like the study of Rose et al. [14] included few Pentaphylacaceae species. Moreover, there are no populational studies of any species of the genus nor specific genetic markers, while during recent years, new species are increasingly being described, frequently endemic and/or threatened [15,16,17,18]. In Mexico, several species of Ternstroemia are considered an ethnopharmacological resource, named as “té de tila”. The dry fruits are used to treat anxiety and insomnia; also, they are known for their analgesic, anti-inflammatory, and anticonvulsant properties [19]. Although, most of the effects result from the neurotoxicity induced by terpenoids and may be considered a health risk [20]. This issue is a health concern because infusions are sold as mixtures with other species (by local sellers and street markets) or in sachets (finely ground in supermarkets).

Currently, there are no published assessments about the impact of the exploitation of Ternstroemia fruits on its genetic diversity and demography. However, similar systems (trees under some exploitation) such as Aquilaria [21], Cedrus [22], and Dipterocarpus [23] have been genetically evaluated using microsatellites or simple sequence repeats (SSRs). This technique offers advantages such as codominance, high polymorphism, individual resolution, technical simplicity, and the ease of applying it to degraded DNA, such as herbarium vouchers [24].

Considering the pharmacological importance and ubiquity of Ternstroemia within the Mexican TMCF, it is urgent to develop specific genetic markers for the genus. Therefore, this manuscript describes the development of a set of 38 nuclear microsatellites. Specifically, we seek to test its utility in (1) estimating genetic diversity in Ternstroemia lineata, and (2) its transferability to six other key species of Ternstroemia and other Ericales.

Materials and methods

Tissue collection and DNA isolation

We collected single, young leaves in silica gel from 20 individuals of Ternstroemia lineata from a mixed temperate forest in southern Morelia (Central-Western Mexico), between the localities of San Miguel del Monte and Ichaqueo. The individuals were georeferenced and sampled at least 250 m apart (Fig. 1). Leaves from species other than T. lineata were either opportunistically collected and dried in silica gel or retrieved from herbarium vouchers (such as Freziera sp. and the South American T. asymmetrica and T. subserrata). These two South American species may give insights about the markers’ transferability success in non-Mexican species. The Ericales sample included some representative families across the order: Ebenaceae (Diospyros xolocotzii), Foquieriaceae (Foquieria splendens), Primulaceae (Rapanea sp.), and Symplocaceae (Symplocos citrea); this selection is intended to provide a rough estimation of cross-amplification success across the order Ericales. We grounded the leaf tissue of all the studied species in individual 2 mL microtubes in a Retsch Mill (MM400). DNA was isolated using the CTAB protocol [25].

Fig. 1
figure 1

Location of the Ternstroemia lineata population sampled for microsatellite evaluation. Each pink dot represents individual coordinates

Microsatellite search using high throughput sequencing

In order to find the repetitive regions, we outsourced AllGenetics & Biology SL (La Coruña, Spain). One individual was randomly chosen as the source sample and used to construct the genomic DNA library. The library was prepared using the Nextera XT DNA enrichment kit (Illumina) with the microsatellite motifs: AC, AG, AT, ACG, and ATCT and following the manufacturer’s instructions. The library was then sequenced in the Illumina MiSeq platform (PE300), producing 9,921,869 paired-end reads. The quality of the raw sequencing data was checked using FastQC [26]. Finally, reads were processed in Geneious 10.2.3 and using in-house developed scripts. Primer design was implemented in Primer3 [27, 28] in Geneious 10.2.3 (Biomatters, LTD).

Validation and cross-amplification

First, we filtered the database to select only perfect and uninterrupted motifs, and then we randomly selected 38 candidate primer pairs. These 38 loci contained only dinucleotide repeats; by far, the most common (35 loci or 92.1%) motif was (AG)n. Two markers, Tli161 and Tli198, presented the motif (AC)n. The motif (AT)n was present only in the locus Tli175. PCRs were performed using the Platinum Master Mix (Thermo-Fisher, USA) following the manufacturer’s instructions for reaction assembly and program. The annealing temperature was 60 °C since all the primers were designed around this value. Once the 38 markers were validated in the source T. lineata subsp. lineata sample, they were tested in six American Ternstroemia spp., Cleyera Thunb., and Freziera Willd. (Pentaphylacaceae), and other Ericales such as Diospyros L. (Ebenaceae), Fouquieria Kunth. (Fouquieriaceae), Rapanea Aubl. (Primulaceae), and Symplocos Jacq. (Symplocaceae). Out of the 38 candidate markers, we chose 23 “well-working” primer pairs that showed consistent amplification in a 2% agarose gel electrophoresis (high transferability and amplification success across the genus Ternstroemia, few secondary bands and/or bands consistent with the size obtained in the source sample validation). Since the remaining 15 markers show variable amplification success, we decided not to use them in genotyping. However, we report the primer sequences and the expected size. The 23 “well-working” markers subset was fluorescently labeled in the 5′ endings using the Dye Set 33 (Applied Biosystems, USA) and then used to evaluate the parameters of the source sample population. Fragment analysis was performed in Psomagen Inc. (Maryland, USA), and genotyping was achieved using the Microsatellite plugin (v. 1.4.7) of Geneious Prime 2022 (Dotmatics, NZ). Allele scoring was performed manually using as guidelines the supplementary material of Selkoe and Toonen [24]. Genetic parameters such as expected and observed heterozygosity (He, Ho), allelic richness (Na), and deviation from Hardy-Weinberg equilibrium were calculated in GenAlEx 6.503 [29]. PIC was calculated using PIC_CALC [30]. We tested linkage disequilibrium among all the 23 chosen loci using the association index (\(\bar{r}_{d}\)) of Agapow and Burt [31] implemented in the R package poppr [32]. We checked the presence of null alleles using the R package PopGenReport [33]. Finally, we performed an interpolation of the individual heterozygosity using Empirical Bayesian kriging [34] in ArcGIS 10.3 (ESRI, USA), to explore the marker set’s potential in fine-scale genetic approaches. This approach automatically calculated the semivariogram from 1000 simulations using a standard circular neighborhood search (10–15 neighbors data points, radius = 0.013).

Results and discussion

All the evaluated markers presented polymorphism. The number of alleles (Na) ranged from 3 to 14 (mean = 7.174). The observed heterozygosity ranged from 0 to 0.889 (mean = 0.539), and the expected heterozygosity (He) ranged from 0.202 to 0.908 (mean = 0.676). Polymorphism Information Content ranged from 0.19 to 0.90 (mean = 0.643), with only four loci under PIC = 0.5 (Table 2). We found significant evidence of null alleles in five loci, whereas four loci exhibited Hardy-Weinberg disequilibrium (at P > 0.05) (Table 2). The index of association (\(\bar{r}_{d}\)) was 0.0301 (P = 0.618), indicating no evidence of linkage disequilibrium among the markers. All amplicons, including the microsatellite regions, were deposited in NCBI Genbank (accessions in Table 1). For the candidate loci, transferability is relatively high among Ternstroemia spp. (81.6–100%) and the two Pentaphylacaceae (89.5%for Cleyera theaeoides (Sw.) Choisy and 81.6%for Freziera sp.) (Table 3). Across the Ericales, the amplification success (Table 4) was very low in Diospyros xolocotzii Madrigal & Rzed. (7.9% to low in Symplocos sp. (26.3%) but consistent with the expected amplification success among families of the same order [35]. Since we selected the 23 evaluated markers for their consistent amplification, the minimum amplification success of this subset for Pentaphylacaceae was 91.3%. The interpolated map of genetic diversity (measured as individual heterozygosity) showed a clear declining trend from South-West (Fig. 2). Therefore, these markers are a suitable set for fine-scale population genetic research in this family. They can also clarify the taxonomic limits among troublesome groups such as the Ternstroemia lineata species complex (e.g., T. chalicophila Loes., T. dentisepala, and T. impressa Lundell tested in this study). In this species complex, efforts using traditional phylogenetic markers have been insufficient. They are also useful for assessing the genetic diversity in threatened species, as is the case of T. huasteca.

Table 1 The 38 candidate loci and the selected 23 (in boldtype) for populational evaluation
Table 2 Genetic parameters of the 23 evaluated loci
Table 3 Transferability across Pentaphylacaceae, including Ternstroemia spp
Table 4 Transferability across some Ericales
Fig. 2
figure 2

Empirical Bayesian kriging of individual heterozygosity. Satellite image from Google Satellite