Introduction

The Tropical Andes biodiversity hotspot is home to at least 20,000 endemic plant species (Myers et al. 2000). In Ecuador, this hotspot is composed primarily of the Eastern Cordillera Real montane forests ecoregion which contains up to 8000 plant species (Richter et al. 2009) and is dominated by evergreen broad-leaved forest communities, including cloud forests at higher elevations. Southern Ecuador, particularly the provinces of Loja and Zamora-Chinchipe, has especially high species richness (Brehm et al. 2008).

Patterns of plant diversity are determined by multiple interrelated factors (Kreft and Jetz 2007). In southern Ecuador, the high vascular plant diversity can be explained by its special and complicated topography with high mountains, valleys, and inter-Andean region (Richter 2003). The specific air circulation patterns differ between the western and eastern mountain ranges, conferring specific and heterogeneous precipitation patterns to each side and between them, as well. The existence of multiple dry and wet areas has resulted in many different ecosystems suitable for multiple species (Richter and Moreira-Muñoz 2005). In addition, southern Ecuador is located within the Andean Depression, the lowest part of the Andean mountain range where the northern and central Andes merge (Richter et al. 2009). This low elevation might have allowed contact of coastal and Amazonian species, thereby further increasing the region’s biodiversity (Beck and Richter 2008).

Furthermore, there are other evolutionary forces such as ecological-geographical barriers that could explain the diversity of this region (Hughes and Eastwood 2006; Prieto-Torres et al. 2018). The convergence of the eastern and western cordilleras of the Andes and the repeated Pleistocene glacial and interglacial periods in the Ecuadorian Andes would have created several physical and climatic barriers to migration, including mountain ranges and low valleys in the Loja and Zamora-Chinchipe provinces that likely contributed to the current species richness of the region (Jørgensen et al. 1995; Richter et al. 2009). Such impediments to gene flow raise the possibility that populations may have existed in isolation long enough for significant genetic divergence to have occurred. In addition, fluctuations in population size and repeated bottlenecks associated with ongoing Pleistocene climatic instability may be important factors that have affected the genetic diversity of populations inhabiting this region (Prieto-Torres et al. 2018).

Habitat conditions can also affect the genetic flow between populations (Coates et al. 2003). Deforestation and habitat fragmentation have been the main pressures that are affecting the diversity of tropical forests (Hansen et al. 2013; Lambin et al. 2003; Macedo et al. 2012; Tapia-Armijos et al. 2015). Also, some species have suffered from selective logging that intensifies the impacts on demographic structure of trees (Broadbent et al. 2008; Inza et al. 2012). Beginning in the late 20th century, the clearing of nearly half of the native forests in southern Ecuador for pastureland and plantation silviculture (Tapia-Armijos et al. 2015) further reduced and fragmented the remaining forests, many of which are on private, unprotected land. The combination of these disturbances in the landscape can reduce the forest into small remnants and can lead to reduction in connectivity of populations with concomitant decrease in gene flow, thus increasing genetic differentiation and reducing genetic variability (Aguilar et al. 2008; Cascante et al. 2002; Lowe et al. 2005).

Cinchona is a genus of the Rubiaceae family known for its medicinal importance as a source of quinine and other alkaloids that are present in different quantities in its 23 species. Cinchona is widespread throughout the central and northern Andes with a noticeable center of diversity just north of the Huancabamba Deflection in southernmost Ecuador. Most species of the Cinchona genus seem to have fairly restricted distributions, and although they have a tendency to hybridize, their ranges do not typically overlap (Andersson 1998).

Cinchona officinalis was the first species of the genus to be described in detail (Andersson 1998). The species was named by Linnaeus in 1753 based on a specimen collected in Cajanuma (near Loja, Ecuador) in 1737 by La Condamine (Garmendia 2017). The distribution of C. officinalis is limited only to a small area in the Andean cloud forest. In Ecuador, C. officinalis L. is found in the southern provinces of Loja, El Oro, Cañar and Azuay at 1700–3100 m a.s.l. (Andersson 1998). Brako and Zarucchi (1993) also reported the occurrence of C. officinalis in the northern Andes of Peru. The growth form of the species can be shrubby, but small trees up to 6 m tall are more common. Cinchona officinalis has distylous pinkish or purplish tubular flowers. (Andersson 1998) that promote outcrossing by hummingbirds (Tropicos 2018). Its small wind-dispersed winged seeds are enclosed in capsules (Andersson 1998).

During the conquest of the Americas in the 16th and 17th centuries, Spanish missionaries learned of Cinchona bark’s medicinal properties from indigenous tribes and discovered its effectiveness in the treatment of malaria (Jaramillo-Arango 1947). Therefore, C. officinalis was probably an important source of quinine and other alkaloids in the early years of Cinchona harvesting but became less so as its small native populations near the early Spanish settlement of Loja were exhausted by the mid-18th century (Gray 1738). Early bark harvesting methods were typically fatal to the trees (Arias 2017). Other, more abundant, Cinchona species were exploited over the next 200 years (Andersson 1998; Martin and Gandara 1945). Once Cinchona plantations were established in Asia in the 19th century, the pressure on native Andean populations eased, but a second round of harvesting occurred during World War II when the Allied quinine source was cut off from the Dutch East Indies and tens of millions of pounds of Cinchona bark were exported to the United States (Hodge 1948; Cuvi 2011). The oldest map of Cinchona distribution in Loja is a hand-painted map deposited in the Archivo General de Indias, in which bark collectors and Cinchona trees are depicted (Crawford 2016). According to this map, Cinchona trees were widely distributed in the mountains of the Loja province. However, the past overharvesting and the high rates of deforestation in southern Ecuador in recent decades (Tapia-Armijos et al. 2015) have threatened C. officinalis.

Determining the genetic composition of remaining populations of C. officinalis can help plan effective conservation and reforestation measures for this important species. The main objectives of this research were (i) to characterize the genetic diversity of the remnant C. officinalis populations in Loja province of southern Ecuador by nuclear and chloroplast DNA analysis, (ii) to test our hypothesis that barriers to migration and gene flow in southern Ecuador will have resulted in divergent populations of C. officinalis in the more geographically isolated parts of its distribution, and (iii) to identify priority populations for conservation.

Materials and methods

Population descriptions, sample collection, and DNA isolation

Six populations of C. officinalis were sampled from forest fragments in southern Ecuador (Fig. 1), all within Loja province. The majority of samples were collected from trees that probably survived bark harvesting and deforestation due to their growth on steep slopes. Most of the individuals in the populations of El Madrigal, Vilcabamba, Yangana, and Gonzanamá are scattered around the edges of pastures and likely subsist because of continued local medicinal use (as inferred from our occasional observations of limited bark removal). The most conserved populations were found in the private nature reserves of Angashcola and El Cristal. The four populations located along the western slope of the Eastern Cordillera Real (i.e., the western border of Podocarpus National Park) were at elevations of 2141–2271 m a.s.l. whereas the Gonzanamá and Angashcola populations were at 2643–2877 m a.s.l.

Fig. 1
figure 1

Locations of sampled C. officinalis populations. PNP, Podocarpus National Park; PNY, Yacuri National Park

Samples were collected under a general permit (Contrato Marco de Acceso a Recursos Genéticos, número MAE-DNB-CM-2015-0016) issued by the Ecuadorian Ministry of the Environment. Permission to collect was granted by landowners. Individual leaves from 25 to 31 trees in each population were desiccated in plastic bags containing 10 g silica gel (Merck, Germany) and stored at room temperature. Total DNA was isolated from 10 to 30 mg of dry leaf tissue using the PureLink Genomic Plant DNA Purification Kit (Invitrogen, USA) or by the CTAB method of Doyle and Doyle (1987). DNA was quantified on a NanoDrop 2000 spectrophotometer (Thermo Fisher Scientific, USA).

Nuclear microsatellite marker development and screening

Genomic DNA was isolated from a tree grown at the San Francisco Scientific Station (33 km east of Loja) using seed obtained from a tree near San Pedro de Vilcabamba. DNA was extracted from 20 mg of dry leaf material using the protocol of Curto et al. (2013). This sample was then used for marker discovery using the Illumina MiSeq system (Illumina, USA) and primer design following the approaches described in Deck et al. (2016). This consisted of a three-primer assay using the multiplexing short tandem repeat polymorphisms with tailed primers (MSTP) method of Oetting et al. (1995) as performed by Curto et al. (2013). Fluorescently tagged M13-based primers (VIC-5′-TAATACGACTCACTATAGGG-3′, PET-5′-GATAACAATTTCACACAGG-3′, NED-5′-TTTCCCAGTCACGACGTTG-3′ or 6FAM-5′-TGTAAAACGACGGCCAGT-3′) were included along with the locus-specific reverse primers (unmodified) and forward primers (modified with the appropriate M13 primer sequences as 5′ tails as shown in Table 1).

Table 1 Cinchona officinalis microsatellite loci descriptions

PCR was conducted on 10-μL reactions containing 5 μL of Multiplex PCR Master Mix (Qiagen, USA), 0.4 μL of M13-tailed forward primer (1 pmol/μL), 0.4 μL of the reverse and fluorescent primers (10 pmol/μL), and 3.8 μL of water. The reaction was promoted by the following temperature profiles: 95 °C for 15 min; 11 cycles of denaturation at 95 °C for 30 s, a touchdown pattern for annealing from 60 to 55 °C, decreasing 0.5 °C per cycle for 60 s, extension at 72 °C for 30 s, followed by 19 cycles of denaturation at 95 °C for 30 s, annealing at 55 °C for 30 s and extension at 72 °C for 30 s. Cycling was finished with a final extension step of 60 °C for 30 min. All successful PCR reactions were sent for fragment analysis to the Sequencing Service at the Ludwig Maximilian University Munich. Obtained chromatograms were analyzed in Geneious using the Microsatellite plugin (Kearse et al. 2012).

Microsatellite PCR and genotyping

Two multiplex PCR reactions were performed for each sample using the three-primer method described above. Loci Coff8, Coff10, and Coff20 PCR products were amplified and labeled with VIC, PET, and 6FAM, respectively, in a 10-μL multiplex reaction containing 1× ABI Master Mix (Applied Biosystems, USA), 5–20 ng genomic DNA and the following primers: VIC-M13 (0.4 μM), PET-M13 (0.1 μM), 6FAM-M13 (0.1 μM), tailed Coff8-F (0.04 μM), tailed Coff10-F (0.01 μM), tailed Coff20-F (0.01 μM), Coff8-R (0.4 μM), Coff10-R (0.1 μM), and Coff20-R (0.1 μM). Loci Coff13 and Coff15 PCR products were amplified and labeled with NED and 6-FAM, respectively, in a 10-μL multiplex reaction containing 1× ABI Master Mix, 5–20 ng genomic DNA and the following primers: NED-M13 (0.1 μM), 6FAM-M13 (0.1 μM), tailed Coff13-F (0.01 μM), tailed Coff15-F (0.01 μM), Coff13-R (0.1 μM), and Coff15-R (0.1 μM). Both multiplex reactions were run using a touchdown PCR procedure: initial denaturation at 95 °C (15 min); 10 cycles of 95 °C (30 s), 62 °C (60 s), and 72 °C (30 s) with a 1 °C reduction in annealing temperature per cycle; 20 cycles of 95 °C (30 s), 52 °C (30 s), and 72 °C (30 s); final extension at 72 °C (30 min). For some samples with lower DNA concentrations, 22 cycles were performed at the 52 °C annealing temperature instead of 20.

PCR products (1 μL) were mixed with 0.15 μL of GeneScan LIZ 600 size standards (Thermo Fisher Scientific) and 10 μL of Hi-Di Formamide (Thermo Fisher Scientific). Fragments were separated on a 3500 Genetic Analyzer (Applied Biosystems) according to manufacturer’s directions and scored in GeneMapper version 4.1 software (Applied Biosystems) using size bins for each locus. To avoid genotyping error when calling alleles, only peaks with amplitudes greater than 500 relative fluorescent units (RFU) were scored. Samples with peaks below this threshold were reamplified until reliable consensus genotypes were obtained.

Statistical analysis

Scored microsatellite genotypes were formatted in CONVERT 1.31 (Glaubitz 2004) to facilitate conversion of the data into other software formats. GENALEX 6.503 (Peakall and Smouse 2006, 2012) was used to calculate allele frequencies, observed number of alleles (A), number of effective alleles (Ae), private alleles (PA), observed heterozygosity (Ho), unbiased expected heterozygosity (He), overall FST (Wright 1951, 1965), FST (Meirmans 2006; Meirmans and Hedrick 2011), GST (Nei 1973), GST (Hedrick 2005), Jost’s DEST (Jost 2008; Meirmans and Hedrick 2011), pairwise comparisons of population differentiation by FST and DEST, probability of identity [P(ID) and P(ID)sib] (Waits et al. 2001), and number of migrants per generation (Nm) based on overall FST. The number of migrants per generation (Nm) was also calculated in GENEPOP on the Web based on the private allele method of Barton and Slatkin (1986). Exact Hardy-Weinberg global tests of heterozygote deficiency and inbreeding coefficients (FIS) were calculated in GENEPOP on the Web (Raymond and Rousset 1995; Rousset 2008) and FSTAT 2.9.3.2 (Goudet 2001), respectively. The program FreeNA (Chapuis and Estoup 2007) was used to calculate null allele frequencies. Simple Mantel tests were performed using the Nei genetic pairwise distance matrix in GENALEX and the FST pairwise genetic differentiation matrix in R (R Development Core Team 2004). The package hierfstat (Goudet and Jombart 2015) was used to compute the pairwise FST matrix and the vegan package (Oksanen et al. 2019) to compute the Mantel test. In both cases, 9999 random permutations were performed.

BOTTLENECK v1.2.02 (Piry et al. 1999) was used to detect signs of recent bottlenecks in the studied populations based on the heterozygosity excess method (Cornuet and Luikart 1996; Luikart and Cornuet 1998). Since the Wilcoxon test gives most reliable results when less than 20 polymorphic loci are studied (Piry et al. 1999), the one-tailed Wilcoxon signed rank test was used with 10,000 iterations to test for significant excess of heterozygosity. Three mutation models were used: the infinite allele model (IAM), the stepwise mutation model (SMM), and the two-phase model (TPM). The IAM (Kimura and Crow 1964) and SMM (Ohta and Kimura 1973) are the extreme mutation models (Chakraborty and Jin 1992) while the TPM (Di Rienzo et al. 1994) is an intermediate model that more closely reflects microsatellite mutation. Since bottlenecks also cause a reduction in low frequency (< 0.1) alleles, the qualitative graphical method of Luikart et al. (1998), available in the BOTTLENECK software, was used to detect any mode-shift in allele frequency distributions to more intermediate frequencies. Finally, the M-ratio approach developed by Garza and Williamson (2001), which is based on the ratio of the number of microsatellite alleles to the range in allele size, was also used to detect recent bottlenecks.

Bootstrapped UPGMA dendrograms were constructed in POPTREE2 (Takezaki et al. 2010) to visualize Nei’s DA distance (Nei et al. 1983; Takezaki and Nei 2008) with and without null allele frequency corrections calculated by FreeNA (Chapuis and Estoup 2007). Population structure was analyzed using the Bayesian model-based clustering program STRUCTURE 2.3.4 (Falush et al. 2003; Hubisz et al. 2009; Pritchard et al. 2000) with the admixture ancestry model. Both allele frequency models (independent and correlated) were run with length of burnin period and MCMC repetitions each set at 10,000. Running five repetitions for each potential number of clusters (K = 3–7), the independent allele model gave highly reproducible estimates of posterior probabilities of the data [LnP(D)]. However, in the correlated allele model, there was significantly greater variability at some K values, requiring an increase in burnin and MCMC repetition to 20,000. The plateauing nature of the correlated allele K vs. LnP(D) graph made inference of the optimal K value difficult, but a ΔK transformation of the data (Evanno et al. 2005) provided an estimate that was consistent with the independent allele model. Results from STRUCTURE were graphically displayed using DISTRUCT (Rosenberg 2004) and were essentially identical with either model.

Median-joining networks for chloroplast rps16 intron sequence haplotypes were constructed in NETWORK 5.0 (Bandelt et al. 1999) using an epsilon value of 10 and weights of 10 for substitutions and 5 for indels. Manual alignments of a hypervariable TnAn region were made in order to minimize the number of polymorphic sites.

Chloroplast rps16 intron PCR and sequencing

The rps16 intron was initially amplified using the forward (rpS16F: 5′-AAACGATGTGGTARAAAGCAAC-3′) and reverse (rpS16R: 5′-AACATCWATTGCAASGATTCGATA-3′) primers of Shaw et al. (2005). However, sequencing of a mononucleotide repeat (Gn) near the 3′ end of the intron gave variable results depending on which primer was used for sequencing. This problem was alleviated by designing a new reverse primer (rpS16R2: 5′-TCGGATCATAAAAACCCACT-3′) in exon 2 of rps16 located 56-bp downstream of the original rpS16R priming site. The sequence of rpS16R2 was based on a Coffea arabica GenBank accession (EF044213.1).

The 20-μL PCR reactions consisted of 1× GoTaq Buffer (Promega, USA), 1.5 mM magnesium chloride (included in the buffer), 0.2 mM each deoxyribonucleotide, 0.25 μg/mL bovine serum albumin, 0.5 μM each of primers rpS16F and rpS16R2, 1 U GoTaq DNA Polymerase (Promega), and 20–50 ng template DNA. Amplifications were carried out at 94 °C (3 min) followed by 35 cycles of 94 °C (30 s), 50 °C (45 s), and 72 °C (90 s). Final elongation was at 72 °C (10 min). PCR products of approximately 940 bp were purified with the Wizard SV Gel and PCR Clean-up System (Promega) or the Clean and Concentrator-5 Kit (Zymo, USA). Sequencing was performed by Macrogen (South Korea) or the University of Maine DNA Sequencing Facility (USA) using both PCR primers. Sequences were aligned using ClustalX2 (Larkin et al. 2007).

Results

Microsatellite characteristics and discrimination power

The Illumina MiSeq run produced 273,042 paired reads from which 271,118 passed the quality control and overlapped. Among those 20,957 contained microsatellites with flanking regions larger than 40 bp. A set of 26 primer pairs were designed from which 16 amplified in single reactions in the sample used for marker development. However, after applying them to the remaining samples, only five markers (two dinucleotide and three trinucleotide repeats) provided reliable and polymorphic PCR products, but these proved to be sufficient for our study. Repeat motifs, primers and allele summaries are shown in Table 1. Coff13 and Coff15 were the most polymorphic (19 and 16 total observed alleles, respectively) while Coff20 was the least polymorphic (7 total alleles). Although the original sequenced alleles from Coff13 and Coff15 contained perfect trinucleotide repeats, the difference in sizes of observed alleles did not strictly follow a three base pair pattern.

The five-locus microsatellite analysis was successful in identifying unique genotypes in 97.5% of individuals tested (Table 2). Two consecutively sampled trees in the Vilcabamba population had identical genotypes, as did two trees in Yangana. For all populations, the mean standard probability of identity [P(ID)] was 0.00014 which is within the “reasonably low” range of 0.01–0.0001 indicated by Waits et al. (2001). The mean probability of identity among siblings [P(ID)sib], i.e., the upper limit of probability that two individuals in a population will have the same multi-locus genotype, averaged 0.023. Due to the high outcrossing rates of distylous species, such as C. officinalis, it is likely that the standard P(ID) values are more accurate estimations and that the discrimination power of the microsatellite panel was adequate for the study.

Table 2 Probability of identity and number of unique genotypes for the six C. officinalis populations using five microsatellite loci

Genetic diversity

The average number of alleles per locus ranged from 5.2 to 7.6 in the six populations (Table 3). In the four populations having the least barriers to migration/pollination between them (El Madrigal, El Cristal, Vilcabamba and Yangana), the total number of private alleles was low (ranging from 1 to 3) whereas Gonzanamá and Angashcola, being more isolated, had 8 and 6 total private alleles, respectively. Observed heterozygosities (Ho) averaged over the five microsatellite loci ranged from 0.580 to 0.680 while overall expected heterozygosities (He) ranged from 0.616 to 0.717 (lowest in Yangana, highest in Angashcola). The number of effective alleles (Ae) was highest in Angashcola (3.8 vs. 3.0–3.2 for the other populations), consistent with its greater expected heterozygosity and average number of alleles.

Table 3 Genetic diversity indicesa for six C. officinalis populations at five microsatellite loci

Three populations (El Madrigal, Gonzanamá, and Angashcola) showed significant deviation from Hardy-Weinberg equilibrium (heterozygote deficiency) over all loci, but only Gonzanamá and Angashcola had relatively high positive inbreeding coefficients (global FIS values > 0.10). Locus Coff15 showed the highest estimated null allele frequency across all populations (average of 0.064), especially in the Gonzanamá and Angashcola populations (0.090 and 0.162, respectively). Angashcola also had a relatively high estimated null allele frequency (0.123) at the Coff13 locus. The appearance of null alleles would be more likely in these two populations which are the most isolated from the source of genomic DNA (near Vilcabamba) that was used for microsatellite screening (Chapuis and Estoup 2007). Therefore, Hardy-Weinberg disequilibrium in Angashcola, and possibly Gonzanamá, could be attributed to the presence of null alleles rather than to factors such as inbreeding or departure from random mating.

The number of migrants per generation (Nm) across all loci was estimated to be 1.2 (FST method) or 1.4 (private allele method), indicating that there is adequate gene flow between populations to counter the negative effect of genetic drift on heterozygosity (Nason and Hamrick 1997). Bottleneck analysis of the data yielded no convincing evidence of recent bottlenecks (Supplementary Table S1). Of the three mutation models, the only significant result for heterozygosity excess was for the Vilcabamba population using the IAM which is the least conservative model and more prone to type I errors (Cornuet and Luikart 1996). All M ratios were above 1.7, well above the critical value of 0.68 (Garza and Williamson 2001). Similarly, there were no significant mode shifts in the allele frequency distribution histograms that would be indicative of low-frequency allele loss caused by bottlenecks.

Population structure

Wright’s FST (Wright 1951, 1965) is a traditional measure of subpopulation genetic differentiation, but its current utility has been questioned because its maximum value, i.e., complete differentiation, can be severely reduced from 1 when using highly polymorphic markers such as microsatellites (Neigel 2002; Jost 2008; Meirmans and Hedrick 2011). The overall FST calculated from our microsatellite data is 0.180, and the related GST (Nei 1973) value is similar (0.166). Standardization of these values to a scale of 0–1 by dividing by their maximum values yields the more insightful measures of FST (Meirmans 2006) and GST (Hedrick 2005). For the C. officinalis populations, FST = 0.561 and G’ST = 0.542, values which imply a higher level of subpopulation differentiation. Jost’s D, which is claimed to be a much more reliable standardized measure of differentiation (Jost 2008), was slightly lower (DEST = 0.451) although still an indication of relatively high subpopulation differentiation. The above values were derived from data without null allele correction, but overall FST decreased by only 1.9% when the null allele correction was applied using FreeNA.

Pairwise comparisons of population differentiation were calculated for both FST and Jost’s DEST (Table 4). Pairwise FST values ranged from 0.064 to 0.168 while Jost’s DEST ranged from 0.220 to 0.763. The only relative difference was that the El Madrigal population was slightly less differentiated from Angashcola (0.130) than Gonzanamá (0.141) in the FST comparison but slightly more differentiated from Angashcola (0.619) than Gonzanamá (0.587) in the Jost’s DEST comparison. The latter comparison is more geographically consistent. In both comparisons, the El Madrigal population is more genetically similar to the Vilcabamba population than to its neighboring El Cristal population. The Jost’s DEST comparison, in particular, shows a high degree of differentiation (0.619–0.763) between the four populations on the western slope of the Eastern Cordillera Real and the Angashcola population. As with overall FST, correction for null allele frequencies decreases the average pairwise FST only slightly (2.0%).

Table 4 Pairwise population FST (below diagonal) and Jost’s DEST (above diagonal) comparisons.

Because of the possibility of null alleles, distance trees were constructed from microsatellite data with and without null allele frequency corrections (Fig. 2). In both cases, the group containing the populations located on the western slope of the Eastern Cordillera Real (El Madrigal, El Cristal, Vilcabamba, and Yangana) was well supported with high bootstrap values although the positions of El Cristal and Yangana were reversed and not well supported. The sister group of El Madrigal and Vilcabamba appeared in both dendrograms. The positions of Gonzanamá and Angashcola were unaffected by the null allele corrections. Using data without null allele correction (Fig. 2a) gave a slightly more geographically consistent dendrogram.

Fig. 2
figure 2

UPGMA dendrograms of C. officinalis microsatellite data. a Without null allele correction; b with null allele correction. Bootstrap percentages (1000 replications) are at nodes

The program STRUCTURE was used to assess the microsatellite data by a Bayesian clustering approach (Fig. 3). Although six nominal populations were sampled, the inferred cluster size (K) was determined to be five by either the independent or correlated alleles model with the admixture patterns essentially the same between models. The majority of individuals in the El Cristal, Yangana, Gonzanamá, and Angashcola populations had ancestry within their unique clusters, but the fifth cluster was composed mainly of individuals from El Madrigal and Vilcabamba with significant admixture from the Yangana cluster and, to a lesser extent, El Cristal. Very little admixture from the Gonzanamá cluster appeared in El Madrigal, El Cristal and Vilcabamba, and essentially no admixture from the Angashcola cluster was found in any population except for a few individuals in Gonzanamá. The least admixture was seen in the more isolated Angashcola population. Although geographically located between the El Madrigal and Vilcabamba populations, El Cristal shows a different genetic background. The STRUCTURE results are consistent with the distance tree analysis (Fig. 2).

Fig. 3
figure 3

Bayesian cluster analysis of C. officinalis microsatellite data from STRUCTURE (K = 5)

Using the Mantel test, correlation between the Nei genetic distance and geographic distances of population pairs was not significant (r = 0.045, p = 0.181). With the FST genetic distance, the r value was slightly higher but only weakly significant (r = 0.102, p = 0.049).

Chloroplast haplotype variation and distribution

Six haplotypes were detected at the chloroplast rps16 intron (Table 5). The four polymorphic sites included one base substitution and three repetitive regions. Haplotype A, the presumptive ancestral sequence, was by far the most abundant with an overall frequency of 0.877. Haplotypes D and E, with frequencies of 0.006, were found in only one individual each.

Table 5 Chloroplast haplotype sequence variation in the rps16 intron based on a 781 bp alignment

Haplotype distributions are shown in Fig. 4. El Madrigal had the most haplotype diversity with five sequence variants (A–E). However, the populations of El Cristal, Vilcabamba, Yangana, and Gonzanamá were fixed at haplotype A. The latter three populations occur in the most degraded habitat and are the smallest (less than 50 individuals), suggesting that fixation may have occurred in this haploid genome through genetic drift. Angashcola was the only population containing haplotype F and the only one where haplotype A was the least abundant. The latter observation is consistent with the previous analyses which showed a greater genetic divergence of Angashcola relative to the other populations.

Fig. 4
figure 4

Chloroplast rps16 intron haplotype frequency distributions of C. officinalis populations. PNP, Podocarpus National Park; PNY, Yacuri National Park

The construction of a unique median-joining network of haplotypes was not possible owing to the hypervariability of the second polymorphic site (see Table 5) which allowed a number of possible alignments and networks (not shown). However, all networks placed haplotype A as the ancestral sequence. Depending on the alignment used, haplotype F (unique to Angashcola) was either directly derived from haplotype A, indirectly derived from haplotype A via haplotype B, or it shared a common (undetected) ancestral haplotype with haplotype B.

Discussion

Habitat fragmentation can have potentially serious genetic consequences, including reduction in genetic diversity and gene flow, higher rates of inbreeding and greater inter-population divergence due to genetic drift (Lowe et al. 2005). Some tropical forest plants are particularly susceptible to the effects of habitat fragmentation and degradation owing to their reliance on animal pollinators, complex self-incompatibility mechanisms, and high rates of outcrossing (Hamrick and Murawski 1990). In a meta-analysis of more than 100 plant studies, Aguilar et al. (2008) generally found significant negative effects of habitat fragmentation on expected heterozygosity, allele number, percent polymorphic loci, and outcrossing rates whereas increased inbreeding was less common. In addition, the analysis showed that loss of heterozygosity was greater when more time had passed since fragmentation.

Since there are no known remaining extended forests of C. officinalis, it is impossible to determine with certainty whether the moderately high heterozygosity of the six fragmented populations (average He = 0.653) in our study reflects that of the original forest, or if there has been an actual reduction in heterozygosity over nearly 400 years of harvesting this species. However, compared to Angashcola, the largest and least impacted population in this study, the other five populations had expected heterozygosities that were 11% lower on average. Mean allelic richness and number of effective alleles for these five populations were 22% and 18% lower than Angashcola, respectively. Assuming that the Angashcola population analysis provides a minimum estimation of the original forest’s genetic diversity, these data support the conclusions of Vranckx et al. (2012) that fragmentation may only have a slight negative impact on heterozygosity, but allelic richness is lost at a higher rate. Also, for microsatellites, changes in allelic diversity are considered to be more informative than heterozygosity for assessing genetic erosion (Kang et al. 2008; Spencer et al. 2000).

Loss of genetic diversity through genetic drift cannot be completely discounted in C. officinalis populations, but, according to Young et al. (1996), our estimation of gene flow from microsatellite data (Nm = 1.2–1.4) appears to be high enough to counter genetic erosion in the nuclear genome caused by drift. Gene flow via long-distance pollination and seed dispersal are mechanisms by which fragmented forest populations can maintain genetic diversity (Kramer et al. 2008), and even isolated trees can act as “stepping stones” between fragmented populations (Lander et al. 2010; Lowe et al. 2005). Seed dispersal is an unlikely agent of gene flow between fragmented C. officinalis populations because the winged seeds of Cinchona species are probably incapable of dispersal over such long distances. According to Vittoz and Engler (2007), winged seeds in forest species can reach a dispersion distance of only 40–150 m. Due to their purple-pink coloration and tubular shape, the flowers of C. officinalis are mainly pollinated by hummingbirds (Altshuler 2003; Sazima et al. 1996; Tropicos 2018) which are capable of foraging over long distances (Murcia 1996) although they typically forage within a narrow home territory (Renner 1998). Some hummingbirds can persist in a matrix of forest fragments and cattle pasture (Stouffer and Bierregaard Jr 1995), but other studies have shown that open areas severely reduce their visitations to fragments (Lindberg and Olesen 2001; Magrach et al. 2012). Whether the large blocks of open pasture separating fragments of C. officinalis are impediments to hummingbird pollination is likely species-dependent and remains unstudied.

Two populations in this study, Gonzanamá and Angashcola, had relatively high FIS values (0.105 and 0.114, respectively). We are hesitant to conclude that inbreeding is the cause. Instead, heterozygote deficiency is more likely explained by the presence of null alleles, especially in locus Coff15 which deviated significantly from Hardy-Weinberg equilibrium in four of the six populations. In fact, of all the populations, inbreeding would be least expected in Angashcola since it has the highest number of individuals. There would, however, be a higher probability of null alleles in Angashcola because of its greater divergence from the population used to construct the microsatellite library (Chapuis and Estoup 2007). Although the presence of null alleles can inflate FST values and genetic distances in significantly differentiated populations (Chapuis and Estoup 2007), the difference between corrected and uncorrected overall FST or average pairwise FST values in our study was marginal (≤ 2%).

Many studies have shown that after forest fragmentation, first generation progeny (seedlings and saplings) have reduced genetic diversity and sometimes show signs of reduced fitness (Cascante et al. 2002; Fernández-M and Sork 2007; Lowe et al. 2005; Nason and Hamrick 1997). Therefore, Lowe et al. (2015) have suggested that future studies be focused on the progeny of forest fragments. Such reductions in fitness could be due to environmental changes in the modified habitats as well as genetic factors (Cascante et al. 2002). Limited studies have not shown evidence of reduced fitness of C. officinalis seeds under controlled germination conditions: 70% germination with light exposure (Romero-Saritama and Munt 2017), 87% with hydrogen peroxide and light exposure (Armijos-González and Pérez-Ruiz 2016), and 71–83% in sand and peat (Guáman et al. 2017). However, according to Acosta (1947), germination of older seeds is greatly reduced. Further research is needed to determine whether post-fragmentation progeny of C. officinalis are indeed suffering from genetic erosion and reduced fitness.

The overall genetic differentiation of the C. officinalis populations is relatively high as measured by FST, GST, and Jost’s DEST, although the higher value of DEST is likely more dependable since GST (and FST) typically underestimates differentiation when heterozygosity is high and Nm > 1 (Leng and Zhang 2011). In a simulated study using microsatellite data under the finite island model, Leng and Zhang (2011) suggested that one explanation for higher values (> 0.15) of GST and D is small effective population size (< 100). Pairwise population comparisons, especially with Jost’s DEST, showed moderate genetic differentiation among the four populations located on the western slope of the Eastern Cordillera Real, but greater differentiation was seen between those populations and the other two (Gonzanamá and Angashcola) where mountainous terrain up to 3000 m or more would preclude easy gene flow. Angashcola, in particular, was highly differentiated from the Eastern Cordillera Real populations owing to its remote location. This finding supports our original hypothesis that the complex geological history of southern Ecuador would encourage divergence of some populations.

Distance-based tree analysis yielded similar results. The only geographically inconsistent placement in the UPGMA dendrogram was the El Cristal population which did not group between its neighboring populations (El Madrigal and Vilcabamba). Upon further investigation, we discovered that this population had not been cut over the past 100 years (G. Samaniego, personal communication). Review of the microsatellite data revealed that El Cristal had approximately twice the percentage of rare alleles (frequency < 0.05) compared to the average of the neighboring populations which is consistent with less historical cutting. Bayesian cluster analysis gave similar results in that it separated El Cristal from the El Madrigal-Vilcabamba cluster although there was some admixture. El Cristal is located in the region of Cajanuma, an area in which a royal decree in 1782 prohibited harvesting due to overexploitation of the tree (Arias 2017). However, there is no subsequent documentation regarding unique management approaches in Cajanuma that might have influenced the species’ genetic composition. Regardless of the reasons for this slight phylogeographic discrepancy, the Eastern Cordillera Real populations appear to share a recent common ancestry and were probably part of a continuous population prior to deforestation.

Since haploid genomes have smaller effective population sizes than diploid genomes (Weising et al. 2005), chloroplast loci are more sensitive to genetic drift than nuclear markers (Provan et al. 2001). Haplotype A of the C. officinalis chloroplast rps16 intron, the most common and widespread variant, is presumably the ancestral form based on its abundance and centrality in median-joining networks. Although there was no evidence of genetic bottlenecks from the microsatellite data, fixation of haplotype A by genetic drift occurred in four out of six of the C. officinalis populations. Despite the four rare haplotypes (B–E) found at El Madrigal, haplotype A was still the most abundant in that population. However, haplotype F was the most frequent in, and unique to, Angashcola. As with the microsatellite data, the chloroplast analysis underscores the divergent nature of the Angashcola population relative to the others.

Both orogeny and glacial cycles undoubtedly played a role in the phylogeographic history of C. officinalis. According to Gentry (1982), much of the diversity of the Neotropics can be attributed to rapid species diversification caused by recent Andean orogeny (< 10 MY). The complex mountain terrain of southern Ecuador combined with climatic fluctuations during glacial-interglacial periods created barriers to migration that would have led to isolation and species divergence (Beck and Richter 2008; Jørgensen et al. 1995). In fact, this general area of southern Ecuador is thought to be the center of diversity for the genus Cinchona (Andersson 1998).

The greater genetic differentiation of the Angashcola population of C. officinalis implies that there was an ancient split of an ancestral population, although the timing of its divergence (i.e., Pleistocene or late Miocene) remains unknown. The Gonzanamá population has genetic characteristics that are intermediate between Angashcola and the Eastern Cordillera Real populations and may provide a clue as to the general location of the split. In addition to the mountain barrier (> 3000 m) separating the Eastern Cordillera Real from Angashcola, the large valley east of Gonzanamá has also been an effective interglacial barrier to migration of C. officinalis. A northwest-southeast migration route along the mountain slopes between Gonzanamá and Yangana is possible. The more related populations on the western slope of the Eastern Cordillera Real exist in a relatively narrow elevational range of roughly 2100–2300 m. Much of the more accessible land below that elevation has been cleared for pasture while growth above that range is limited by an abrupt climatic transition which favors páramo. Both the Gonzanamá and Angashcola populations, however, have adapted to higher elevations (approximately 2600–2800 m) in inter-Andean mountain ranges further west. This upward migration is likely due to warmer, less windy conditions away from the Amazonian fronts that cross the eastern cordillera. Mantel tests using Nei’s distance and FST did not show strong evidence of isolation by distance as the r values were low and only one of the tests (FST) was weakly significant. In a review of the Mantel test, Guillot and Rousset (2013) referenced a number of studies that expressed concern about the test’s type I error rate, so we have little confidence in the marginal significance (p = 0.049) of the test using FST data.

In addition to the historic harvesting of C. officinalis for its medicinal value, its habitat has also been severely degraded by deforestation and conversion to pasture or pine-eucalyptus plantations (Tapia-Armijos et al. 2015). However, despite the obvious decrease in population size and continuing threats to the remaining habitat, the conservation status of this species has yet to be assessed by the IUCN (2018).

In our study, all six fragmented populations of C. officinalis in Loja province were found to have moderate levels of genetic diversity. However, the more remote population of Angashcola has the highest genetic diversity, is genetically unique as measured by nuclear microsatellite and chloroplast analysis, and should therefore be given highest conservation priority. Fortunately, of all the populations studied, Angashcola has the largest number of individuals, the most intact habitat, and is managed as a private reserve. The El Madrigal and El Cristal C. officinalis populations are also located in private reserves and therefore will have the opportunity to recover. However, the populations of Gonzanamá, Yangana, and Vilcabamba are very small and located on highly degraded agricultural land. Without intervention, these populations are in danger of extirpation.

Remaining fragments of C. officinalis in southern Ecuador and northern Peru should be identified and studied in order to better understand the conservation needs of this species. Isolated individuals scattered at the peripheries of pastures or at higher elevations need to be evaluated for their value in maintaining gene flow between populations. In fact, in recent expeditions (Arias 2017), individuals or small groups of trees have been found on the western slope of the Eastern Cordillera Real between some of the populations assessed in our study. Characterization of plant-pollinator interactions should be performed in order to develop strategies (Menz et al. 2011) to reconnect C. officinalis fragments with native pollinators. Additionally, studies must be designed to detect and understand any reductions of natural regeneration or fitness. Finally, efforts to educate the local population and landowners on the importance of conserving the few remaining populations of this threatened species should continue.

Data archiving statement

The microsatellite dataset from the current study is available from the corresponding author upon request. GenBank accession numbers for nuclear microsatellite loci sequences and chloroplast haplotype sequences are provided in Tables 1 and 4, respectively.