Introduction

Plant communities in tropical and subtropical regions are commonly exposed to extreme weather and climatic conditions including high-temperature fluctuations, variable precipitation patterns, and intense radiation. Along with recent climate changes, anthropogenic activities, such as overexploitation, habitat destruction and fragmentation, and introduction of new species, have accelerated ecosystem alterations (Goudie and Viles 2013) resulting in often massive ecological disturbances and fragile biological conditions with unpredictable consequences. Becklin et al. (2016) reported that alteration of precipitation regimes and temperature may disturb the regular patterns of plant morphology, physiology, and life history, which can lead to low plant viability and germination success of seeds (Anderson 2016). Furthermore, disturbances may cause shifts in demography which in turn cause alterations in genetic settings (genetic diversity and differentiation, genetic bottlenecks, gene flow patterns) worldwide (Kassa et al. 2017; MacLaren 2016; Tardieu 2013; Wesche et al. 2005). Responses to changing and harsher environmental conditions can result in a shift in the distribution of mountain plant communities (Grabherr et al. 1994; Lenoir et al. 2008) and a switch from sexual to asexual reproduction (Braatne et al. 1996), respectively, among others.

Such adverse conditions and their effects are also assumed for the open forest community of high altitudinal regions in the Hajar Mountains (the highest area of the Sultanate of Oman with a maximum altitude of 2976 m a.s.l.) dominated by Olea europaea subsp. cuspidata (Wall. & G.Don) Cif. and Juniperus seravschanica Kom. (Zeravschan or Persian juniper). Studies by Al-Kalbani and Price (2015; data range 1979–2012) and Al Sarmi and Washington (2011; data range 1980–2008) indicated that in one decade, mean winter and annual temperatures have risen by 0.85 °C and 0.27 °C, respectively, while the annual precipitation dropped by 67 mm and 9 mm, respectively. Therefore, in addition to recent and accelerated anthropogenic disturbances such as urban encroachments (housing area expansion, road constructions) and touristic activities (campfires), this mountain range is considered highly vulnerable (Al Charaabi and Al-Yahyai 2013; Al-Kalbani et al. 2014; Patzelt 2015a, b; Victor 2008). Consequently, these changes are likely to affect the stands of Olea europaea subsp. cuspidata, nonetheless clear-cut causes are not yet understood.

Olea europaea subsp. cuspidata is one of six sub-species distributed in Asia, Africa, the Mediterranean region, and Macaronesia with less oil yield and smaller fruits than the economically important and widely cultivated ancient cultigen O. europaea subsp. europaea L. (European olive) in the Mediterranean Basin (Díaz et al. 2006; Rallo et al. 2000). Olea europaea subsp. cuspidata is the most widespread subspecies distributed naturally from the southern to north-eastern part of Africa and from the Arabian Peninsula to China and is restricted to regions with adequate precipitation (± 600 mm) including higher elevation areas. It was introduced into Australia, Hawaii, New Zealand, and Norfolk Island (Green 2002). This diploid (2n = 46) and monoecious tree species is mostly outcrossing and wind pollinated, though insect pollination has been also observed (Canale and Loni 2010; Kassa et al. 2017). Seeds are typically dispersed by small mammals and birds (Abiyu et al. 2015; Cuneo and Leishman 2006). So far, clonal reproduction for this subspecies has not yet been proven, though related subspecies demonstrate clonal growth (Baali-Cherif and Besnard 2005). The tree plays an important role as forage and shade tree for agro-pastoral activities, and fruits are eaten by local people (Ghazanfar 2018).

This study was set up to test the hypotheses that (1) remoteness from human activity positively affects stand structure and viability of O. europaea subsp. cuspidata, (2) clonal growth of the species may exist in drier (sloped) conditions promoted by lack of water that suppresses successful germination of seeds, and (3) genetic structure shows bottleneck effects and signs of inbreeding.

Knowledge about this subspecies is insufficient, and the overall goal of this study therefore was to evaluate the vulnerability of the subspecies based on genetic data that can be potentially used to develop conservation strategies for this wild valuable genetic resource.

Materials and methods

Study sites and plant sampling

The field study was conducted in Al Jabal Al Akhdar (hereafter named as Akhdar) and Jabal Shams (herein named Shams) area in the Hajar Mountains range of 2110–2507 m a.s.l. (Fig. 1) in October 2016 and approved by the Ministry of Environmental and Climate Affairs (MECA, permit number 15/2016) of Sultanate of Oman. Three plateaus and their adjacent slopes (~ smallest sampling units with distances of 0.5–1.0 km between them) were selected per each of the two mountains ((3 plateaus + 3 slopes) × 2 mountains = 12 sites in total). The field design furthermore included six nested locations (3 locations (plateaus + slopes) × 2 mountains = 6 locations) (Table 1). In addition, the two mountains were analysed as two separate populations. All slope sites selected were north-exposed to reduce bias and to avoid exposure effects on plant growth and stand density. Per site, one reference point was assigned randomly. Around this predefined reference point, a group of the closest 30–32 individuals was recorded (Fig. 2a, b). Finally, 366 individual trees were sampled. For each individual, spatial coordinates were recorded with a hand-held GPS receiver (Vista HCx eTrex, accuracy ± 2 m, Garmin® Ltd., Southampton, Ireland). The following individual dendrological parameters were recorded: height and crown diameter (both in m), number of alive and dead trunks, diameter at breast height (DBH), and diameter at base (both in cm) for individuals smaller than 1.3 m. Tree condition index was used as proxy for tree vitality and estimated based on foliage cover, partitioned into four classes (100%, 75%, 50% and 25%, Fisher and Gardner 1995), knowing that this parameter is rather subjective. In addition, the presence of human disturbance (pruning and cutting) was recorded. Five healthy leaves from each individual were collected and air dried. Out of the 366 individuals, a subset of 188 individuals (15–16 individuals nearest to the reference point of each of the 12 sites) were chosen for further genetic analyses.

Fig. 1
figure 1

Maps of the study region and its sampling sites (white dots) in the Hajar Mountains range, Sultanate of Oman

Table 1 Number of samples and their different geographic grouping for analyses of 188 Olea europaea subsp. cuspidata trees in the Hajar Mountains range, Sultanate of Oman
Fig. 2
figure 2

Examples of a plateau (a) and slope (b) site displayed as top and intersectional views each of Olea europaea subsp. cuspidata trees in the Hajar Mountains range, Sultanate of Oman

DNA extraction and microsatellite marker analysis

Total genomic DNA was extracted from 188 individuals using one air-dried leaf per tree and the DNeasy™ 96 Plant Kit (Qiagen GmbH, Hilden, Germany) according to the manufacturers’ protocol.

We selected the most suitable and reproducible SSR markers from the published data for our study. Eleven nSSRs [ssrOeUA-DCA1, ssrOeUA-DCA3, ssrOeUA-DCA4, ssrOeUA-DCA5, ssrOeUA-DCA7, ssrOeUA-DCA9, ssrOeUA-DCA18 (Sefc et al. 2000), PA(ATT)2 (Saumitou-Laprade et al. 2000), EMO2 (De la Rosa et al. 2002), GAPU71 (Carriero et al. 2002), and UDO99-011 (Cipriani et al. 2002)] and 15 EST-SSRs [OeUP-02, OeUP-06, OeUP-09, OeUP-10, OeUP-11, OeUP-16, OeUP-18, OeUP-23, OeUP-29, OeUP-33, OeUP-38, OeUP-41, OeUP-42, OeUP-45, and OeUP-46 (Arbeiter et al. 2017)] were originally selected for testing in O. europaea subsp. cuspidata. All these markers were tested for amplification and polymorphism in four samples per each of all four different geographic sites. Four markers (ssrOeUA-DCA1, GAPU71, OeUP-06, and EMO2) producing multi-locus patterns and weak amplification products were additionally tested in 16 samples from the first two sites. Among the 26 markers tested, 17 generated single locus polymorphic products, five were monomorphic (ssrOeUA-DCA5, OeUP-09, OeUP-11, OeUP-18, and OeUP-29), and the remaining four produced multi-locus patterns (OeUP-23, OeUP-38, and OeUP-46) or no amplification (ssrOeUA-DCA7).

Finally, 12 polymorphic microsatellite markers consisting of six nSSRs and six EST-SSRs were used for genetic analysis considering their consistent amplification and reliable unambiguous genotyping results (Table 2). The samples were amplified using three PCR amplification multiplexes: (1) PCRI-UDO99-011, ssrOeUA-DCA9, and OeUP-16 labelled by HEX together with ssrOeUA-DCA4 labelled by 6-FAM; (2) PCRII-ssrOeUA-DCA18 and GAPU71 labelled by HEX together with OeUP-02 and OeUP-10 labelled by 6-FAM; 3) PCRIII-OeUP-41 labelled by HEX together with ssrOeUA-DCA3, OeUP-42, and OeUP-45 labelled by 6-FAM. The following touchdown PCR program was used on a Biometra TProfessional Thermocycler (Analytik Jena, Germany): an initial denaturation for 15 min at 95 °C, 12 touchdown cycles each with 1 min at 94 °C, 1 min at 60 °C (decreased by 1 °C per cycle), and 1 min at 72 °C followed by 24 regular cycles each with 1 min at 94 °C, 1 min at 48 °C, and 1 min at 72 °C followed by a 20 min final extension at 72 °C. For cost-efficient PCRs, a tailed primer approach was used (Kubisiak et al. 2013; Schuelke 2000). The PCR was carried out in a 14 µl PCR reaction mix containing 1.5 µl MgCl2 (25 mM), 1 µl dNTPs (2.5 mM each dNTP), 0.2 µl Taq DNA Polymerase (5 U/µl), 1.5 µl 10X reaction buffer, 0.2 µl (5 picomol/µl) forward primer with a M13-specific sequence (5′-CACGACGTTGTAAACGAC-3′) added to the 5′-end of the primer (Kubisiak et al. 2013; Schuelke 2000) and 0.5 µl (5 picomol/µl) of the PIG-tailed (5′-GTTTCTT-3′, Brownstein et al. 1996) reverse primer (Sigma-Aldrich, St. Louis, MO, USA) (Table 2), 1.0 µl of (5 picomol/µl) dye labelled (6-FAM/HEX) M13 primer, 7.1 µl H2O, and 1.0 µl DNA (~ 0.6 ng/µl). An automatic capillary sequencer ABI 3130xl Genetic Analyzer (Applied Biosystems, Foster City, CA, USA) was used to separate the PCR products. GeneMapper v. 4.0 (Applied Biosystems, Foster City, CA, USA) was used to record the alleles after careful visual inspection.

Table 2 Microsatellite markers genotyped in 188 Olea europaea subsp. cuspidata trees in the Hajar Mountains range, Sultanate of Oman

Data analysis

GenAlEx v6.5 (Peakall and Smouse 2012) was used to calculate percentage of polymorphic loci (PPL), the mean number of all (Na), effective (Ne) and private alleles (NPA), and observed (Ho) and expected (He) heterozygosity (Nei 1973), including the standard error of the mean each. For comparative analysis, the samples were grouped based either on their location (six groups, each including plateau and adjacent slope), plateau versus slope site (two groups), or Akhdar vs. Shams (two groups). Genetic parameters were calculated for all geographic groupings. To check for generation-dependent effects on genetic parameters, age class (assuming increasing DBH as proxy for increasing tree age) quartiles (≤ 7.2 cm, ≤ 10.7 cm, ≤ 16.1 cm and > 16.1 cm; n = 160) were created from trees higher than 1.3 m. The GenAlEx software was also used to check for the probability of identity (probability that two unrelated individuals will have the same genotype) and to detect identical multi-locus genotypes (match function). The MICRO-CHECKER software (van Oosterhout et al. 2004) was used to test for the presence of null alleles and to estimate their frequency. To check for usability of markers, Polymorphic Information Content (PIC) was calculated as PIC = 1 − ∑ (Pi)2, where Pi is the proportion of samples carrying the ith allele of a particular locus. GenAlEx was applied to check for deviations from Hardy–Weinberg equilibrium (HWE). The Genepop 4.3 software (Rousset 2008) was used to calculate FIS values (Weir and Cockerham 1984) for all markers. Significance values were corrected for multiple testing using the false discovery rate (FDR) method in the p.adjust function with R 3.4.3 software (R Core Team 2017). The BOTTLENECK v. 1.2.02 software (Piry et al. 1999) was used to test whether observed heterozygosity and allele frequency distribution were affected by bottleneck events in the recent past. Non-excess of heterozygosity assumes L-shaped allele frequency distributions, which are expected under mutation-drift equilibrium and lack of bottlenecks. The program was run for the two-phased model (TPM) of mutation choosing a 0.7 proportion of the stepwise mutation model (SMM) in the TPM and a 0.3 variance of the geometric distribution for TPM with 10,000 iterations. The Wilcoxon rank test was applied to test for significance. AMOVA, fixation indices (FST, FIS and FIT), the standardized FST (Meirmans 2006), Jost’s estimate of differentiation Dest (Jost 2008), pairwise genetic differentiation with 999 permutations, and Nei’s genetic distance between groups were calculated using GenAlEx.

A Principal Coordinate Analysis (PCoA) based on Nei´s unbiased genetic distance (Nei 1978) was also performed with GenAlEx.

The STRUCTURE v. 2.3.4 software (Pritchard et al. 2010) was used to examine the entire sample for population structure by inferring the number of potential subpopulations or genetic clusters (K) in a sample based on the Bayesian model-based clustering method and considering correlated allele frequencies for an admixture model. The admixture model potentially improves the clustering of closely related species or populations (Lind and Gailing 2013). The program was set to 10 independent runs for each value of potential clusters (K). Each K from 1 to 10 included a burn in period of 50,000 MCMC followed by 106 iterations. One of the 12 SSR markers, OeUP-16, likely contained null alleles at high frequency based on the MICRO-CHECKER analysis (see results below) and was hence excluded from STRUCTURE analysis since the presence of null alleles might adversely affect the efficiency of assigning individuals correctly in STRUCTURE (Carlsson 2008).

The STRUCTURE HARVESTER 0.6.94 software (Earl and VonHoldt 2012) was used to extract the most likely number of clusters (K) based on the rate of change in the log probability of data between successive K values as suggested by Evanno et al. (2005). We used the CLUMPAK software (Kopelman et al. 2015) to generate the summary and the graphical presentation of STRUCTURE results.

To check for large-scale spatial genetic patterns, isolation-by-distance (Mantel test with 999 permutations) was assessed with GenAlEx. Fine-scale spatial genetic structure (FSGS) was analysed for each site by spatial autocorrelation analysis using SPAGeDi v. 1.5 (Hardy and Vekemans 2002). To quantify FSGS, the Sp statistic (Vekemans and Hardy 2004) was calculated as Sp=− bF/(1 − F1), where bF is the regression slope of F(d) over the natural logarithm of distance, and F1 is the average kinship coefficient referred to as a co-ancestry estimate (Loiselle et al. 1995) based on all loci between pairs of individuals of the first distance class. The kinship coefficients were plotted against 10 m distance classes. The distance classes were defined so that 50% of all trees were represented at least once in the interval, and the coefficient of variation (CV) of the number of times when each individual was represented was ≤ 1. Upper and lower 95% confidence intervals of kinship coefficients were determined through 10,000 permutations of individuals among distance classes.

Results

Dendrological parameters

The height and crown width of individuals averaged 3.7 m (ranged from 0.2 to 12.0 m) and 14 m (ranged from 1 to 18 m) (Table 3), respectively. The number of alive and dead trunks varied only marginally from three to four (Table 3). DBH size classes showed a bell-shaped distribution, which was slightly right-skewed due to a few very old trees (Fig. 3). Trees smaller than 1.3 m showed an L-shaped distribution and were heavily browsed, resulting in a dwarfed habitus. DBH and diameter at base varied strongly among sites and locations, although less between mountains (data not shown). The plateau sites exhibited larger DBH values then their adjacent slopes, a trend that was not observed for diameter at base. Only one individual had fruits. The condition index averaged in total 50%, while trees from plateaus of Shams showed the lowest vitalities. In terms of cuttings (branches and stems), human influence was pronounced on Akhdar, while almost absent on Shams (Table 3).

Table 3 Average dendrological parameters measured in different groups of 362 Olea europaea subsp. cuspidata trees in the Hajar Mountains range, Sultanate of Oman
Fig. 3
figure 3

Distribution of the diameter at breast height (DBH) and diameter at base measured on 366 Olea europaea subsp. cuspidata trees in the Hajar Mountains range, Sultanate of Oman

Genetic richness, diversity, and differentiation

In total, 147 different microsatellite alleles were found for all 12 microsatellite markers. Probability of observing identical genotypes by chance was extremely low (4.3−13), and no identical multi-locus genotypes were found in the entire sample. The mean number of alleles per locus was 12.3 with the lowest number of alleles (Na = 2) at both EST-SSR loci OeUP-42 and -45 and the highest number of alleles (Na = 31) at locus ssrOeUA-DCA9 (Table 4). The effective number of alleles per locus averaged 5.6. The observed heterozygosity (Ho) ranged from 0.03 (OeUP-45) to 0.89 (ssrOeUA-DCA18) with a mean value of 0.57. The average expected heterozygosity was higher than the observed one (He = 0.64 vs. Ho = 0.57). Fixation index (FIS) and genetic differentiation (FST) for the 12 markers averaged 0.09 (ranging from − 0.41 to 0.15) and 0.02 (ranging from 0.02 to 0.04), respectively (Table 4). The standardized differentiation measures FST and Jost´s Dest averaged 0.03 (ranging from 0.003 to 0.18) and 0.02 (ranging from 0.0001 to 0.17), respectively (Table 4). PIC ranged between 0.28 and 0.82 (Table 4).

Table 4 Genetic diversity and differentiation parameters for 12 microsatellite loci used for the genetic analysis of 188 Olea europaea subsp. cuspidata trees in the Hajar Mountains range, Sultanate of Oman

The HWE tests per locus and in the six locations showed overall low indication of homozygote excess (about 25% of the markers demonstrated homozygote excess, mostly statistically insignificant and likely due to null allele presence) (Table 5). The OeUP-16 marker indicated a pronounced excess of homozygotes with significant deviation from HWE in all six locations, except for Akhdar 2 (Table 5). The ssrOeUA-DCA4, ssrOeUA-DCA9, GAPU71, UDO99-011, and OeUP-41 markers were only occasionally indicating deviations from HWE and null allele excess. Deviations from HWE and excess of homozygotes were observed for the same markers also for the mountain and site groups. FIS ranged from − 0.064 to 0.396.

Table 5 Combined summary of Chi square tests for Hardy–Weinberg (GenAlEx) and detections of excess of homozygotes (grey shades, MICRO-CHECKER) for 12 microsatellite markers in six locations of Olea europaea subsp. cuspidata trees in the Hajar Mountains range, Sultanate of Oman

Microsatellite loci were polymorphic for any grouping (Table 6). The average number of alleles (Na) per grouping was nearly twice as high as the number of effective alleles (Ne; Table 6). The number of private alleles (NPA) ranged between 3 and 12 (location), 36–51 (mountain), and 24–41 (site) (Table 6). Expected (He) and observed (Ho) heterozygosities showed nearly identical values ranging between 0.61 and 0.65 and 0.61 and 0.66, respectively (Table 6). Genetic diversity parameters declined with increasing age, while number of private alleles did not follow this trend (Table 6).

Table 6 Average genetic diversity parameters (± standard error) based on different geographic grouping of 188 Olea europaea subsp. cuspidata trees in the Hajar Mountains range, Sultanate of Oman

An L-shaped distribution of allele frequencies was observed for any grouping, indicating the absence of pronounced bottleneck effects (data no shown). Pairwise Nei´s genetic distance ranged from 0.03 to 0.09, being highest for Shams 3 against all others (Table 7). The F-index values were low, but differed significantly from zero for any grouping (Table 8). While FIS and FIT between groupings differed only marginally, the FST value between the two mountains (Akhdar and Shams) and among the six locations was about two and four times higher, respectively, as compared to plateau and slope sites (Table 8). However, the FST values were very small in general.

Table 7 Pairwise genetic distance between locations of 188 Olea europaea subsp. cuspidata trees in the Hajar Mountains range, Sultanate of Oman
Table 8 AMOVA results based on different geographic partitioning of 188 Olea europaea subsp. cuspidata trees in the Hajar Mountains range, Sultanate of Oman

Population structure

The PCoA did not reveal any clear substructure among the sampled individuals (Fig. 4) demonstrated by the low discriminatory power of the first two coordinates that explained 12.9% of the total genetic variation observed. Together with the third coordinate they still explained only 17.8% of the total variation. These data were in agreement with the STRUCTURE analysis results, indicating little or no genetic structure. The most likely number of clusters (K) was eight, although less likely it could also equal four or six (Fig. 5). However, most individuals expressed some degree of balanced share of cluster assignments irrespective of the actual number of clusters and grouping. Therefore, neither PCoA and STRUCTURE nor FST values revealed any obvious population substructure or strong differentiation among O. europaea subsp. cuspidata trees.

Fig. 4
figure 4

Principal Coordinates Analysis (PCoA) based on genetic distances between 188 Olea europaea subsp. cuspidata individual trees from four sample sites in the Hajar Mountains range, Sultanate of Oman

Fig. 5
figure 5

The STRUCTURE analysis for all 188 Olea europaea subsp. cuspidata individuals in the Hajar Mountains range, Sultanate of Oman. DeltaK (mean (|L″(K)|)/s.d.(L(K)) and mean log-likelihood Ln(K) ± s.d. (dashed line) inferred by Evanno method and visualized by STRUCTURE HARVESTER (upper plot). The three lower plots present distribution of admixture (Q-values) for 12 studied sites based on the most plausible number of clusters (K) equalled four, six and eight clusters with vertical multi-coloured bars representing likelihood of individual tree genotypes to belong to one of the clusters

Spatial genetic structure

Large-scale spatial analyses based on a Mantel test revealed a low but significant correlation between genetic and geographic distances (r = 0.139, p = 0.04). Fine-scale spatial analyses revealed a positive significant spatial autocorrelation at the Akhdar 3 slope and Shams 2 plateau sites in the first (0–10 m) and second (10–20 m) distance intervals, respectively (Fig. 6). A negative significant autocorrelation was observed for the Shams 2 plateau site at the third (20–30 m), for the Akhdar 3 slope site at the seventh (70–80 m), and for the Shams 3 slope site at the second (10–20 m) distance intervals. Overall, the observed patterns demonstrated no or weak autocorrelation.

Fig. 6
figure 6

Fine-scale spatial genetic structure autocorrelograms of the average kinship coefficient (F(d), black solid line) according to Loiselle et al. (1995) with 95% confidence interval (grey area) along spatial distance (m) for each site of 188 Olea europaea subsp. cuspidata trees in the Hajar Mountains. Circles indicate significant autocorrelation (see geographic data on the sites in Table 1)

Discussion

Dendrology

The obtained dendrological data indicated clearly location and site-specific responses. Signs of human activities such as pruning of branches and logging of trees on Shams seemed to be less visible, which can be historically explained: until recently, only few and small settlements exist on Shams. This could be attributed to a scarce water availability and rather steep terrain (higher water run-off) in comparison to the neighbouring Jabal Al Akhdar. The generally sinking groundwater levels in the region (Al-Kalbani and Price 2015) and the overall more severe water approachability on Jabal Shams might thus also affect the existence of O. europaea subsp. cuspidata, visible in pronounced die-backs on Shams. Additionally, livestock browsing was apparently higher on Shams considering the number of dwarf shrubs (diameter at base) as proxy for high livestock intensities. This observation might be related to the slightly different livelihoods on both peaks; Shams serves predominantly as a rangeland area for the piedmont villages, while mountain villages on Akhdar traditionally practice terrace gardening with nowadays increasing off-farm activities such as tourism and administrative jobs.

The bell-shaped distribution of size class > 1.30 m clearly indicates an ongoing over-aging process of the entire population, which was different from O. europaea subsp. cuspidata population structures observed in Ethiopia (Bekele 2000) with typical L-shaped age class distributions indicative for relatively high proportions of young trees. In our study, trees with heights smaller than < 1.30 m indicate high recruitment rates (L-shape distribution). The older aged individuals with heights smaller than < 1.30 m were also present, but have been kept short due to browsing.

Given the different ecological conditions on both mountains, hypothesis 1 cannot be satisfactorily answered, as human influence seems higher on Akhdar, while livestock significantly affects stands on Shams.

Genetic richness and diversity

Although clonal propagation would be potentially possible and the sampling design should allow us to detect it, we did not find any evidence of vegetative propagation. Similarly, Kassa et al. (2017) did not find clonal colonies (genets) of O. europaea subsp. cuspidata in Ethiopia. However, Baali-Cherif and Besnard (2005) identified genets of O. europaea subsp. laperrinei in Algeria based on nine nSSRs, of which ssrOeUA-DCA3 and -DCA 9 have been also used in our study. The rate of clonal reproduction in Algeria was relatively high − 16% of genets were represented by two or more ramets and the area covered was more than 80 m2 in one of the locations. The absence of genets in our sample indicates an exclusive sexual reproduction mode in this population, probably also for the entire subspecies. This observation would merit further studies in additional stands of the Hajar Mountains as well as other populations, for instance in Dohfar (S-Oman) and neighbouring countries. Therefore, the clonal propagation proposed in hypothesis 2 must be rejected.

Genetic diversity was rather moderate (Ho = 0.55–0.58, He = 0.62–0.64), although slightly lower as compared to the same species in an Ethiopian forest studied by Kassa et al. (2017) using nSSR markers (Ho = 0.66–0.72, He = 0.74–0.77). Four of those SSR markers have also been used in our study, namely GAPU71 (Ho = 0.90 vs. 0.74 in our study, He = 0.92 vs. 0.80 in Kassa et al. 2017), ssrOeUA-DCA3 (Ho = 0.52 vs. 0.82, He = 0.40 vs. 0.86), ssrOeUA-DCA18 (Ho = 0.93 vs. 0.89, He = 0.86 vs. 0.88), and UDO099-011 (Ho = 0.93 vs. 0.81, He = 0.91 vs. 0.86). Erre et al. (2010) and Baali-Cherif and Besnard (2005) found also similar heterozygosity levels in Olea europaea subsp. laperrinei (He = 0.41–0.89 and 0.45–0.87, respectively). The lower heterozygosity level in our study can be explained by the inclusion of the EST-SSR markers leading to overall lower mean values. The EST-SSR markers alone generated much lower average heterozygosity levels (Ho = 0.33, ranging from 0.03 to 0.49 and He = 0.40, ranging from 0.03 to 0.83) than the nSSR markers (Ho = 0.81, ranging from 0.74 to 0.89 and He = 0.87, ranging from 0.80 to 0.94). The EST-SSR based values are also lower than values obtained by Arbeiter et al. (2017) (Ho = 0.68, ranging from 0.29 to 0.96 and He = 0.64, ranging from 0.51 to 0.79) and (Kassa et al. 2017) (Ho = 0.76, ranging from 0.40 to 0.92 and He = 0.79, ranging from 0.52 to 0.93) for the same subset of EST-SSR markers suggesting possibly a lower adaptive variation for the populations under study. Nevertheless, our data on genetic diversity are in agreement with the SSR data found in other published studies of related subspecies. The observation of decreasing diversity with increasing size classes, however, was unexpected and has been to our knowledge only rarely observed as for instance for Quercus rubra in southern New England (USA) by Gerwein and Kesseli (2006). Authors suggested some recovery of alleles when forest cover increases, which might be true also for our study indicating a formerly stronger fragmented population with reduced gene flow subjected to loss of genetic diversity. Alternatively, selected genotypes with augmented growth could have resulted in low genetic diversity with increasing size. Our results, however, do not provide evidence for selected genotypes with comparable genetic composition. Moreover, since we have used DBH as a proxy for age, but have not determined tree age by annual ring counts, these observations remain vague and, if they are not accidental, could have been caused by other factors.

Genetic differentiation and population structure

Genetic differentiation among geographic groups was very low in general, but significant in all cases. The low differentiation among the six sampling locations (FST = 0.012, FST = 0.036) and even lower differentiation between sites (FST = 0.004, FST = 0.009) and mountains (FST = 0.007, FST = 0.020) suggest effective gene flow (at least in the recent past) and a still effectively large population size, supported by an absent or at least a weak fine-scale genetic structure. As not all markers demonstrated significant excess of homozygotes, the fixation index (FIS) cannot serve as a proxy for inbreeding.

The low genetic differentiation measures found for the samples can be explained by long-distance dispersal. Extensive wind pollination and long distance seed dispersal by mammals and birds may ensure comparatively high genetic diversity even in isolated mountain populations of O. europaea subsp. cuspidata as likewise suggested for related O. europaea subspecies (Baali-Cherif and Besnard 2005; Besnard et al. 2007, 2009, 2013; Breton et al. 2006; Kassa et al. 2017). Potential mammal vectors for O. europaea subsp. cuspidata seeds (fruits) dispersal in the Hajar Mountains are livestock (goats and sheep) and feral donkeys nowadays. Livestock species very likely contributed to the dispersal activity in the last decades and still do. However, other taxa such as birds of the genera Blackcaps (Sylvia sp.) and Thrushes (Turdus sp.) are known to be likely effective seed dispersal agents for O. europaea (Rey et al. 1997). Although a reduced gene flow can be expected due to a rough terrain, absence of sufficient flowering trees, and the fragmentation of stands (Abiyu 2012; Kassa et al. 2017), we did not find any environmental or demographic effect on subpopulation structure. Even a ~ 20 km distance between the mountain peaks seems to have no effect on the substructure. Perhaps, it is due to the connection via a mountain crest that hardly goes beyond 1800 m. Wadis (deep valley occasionally filled with water) between mountains are narrow and are unlikely to be major barriers for anemochory and zoochory. This finding was furthermore substantiated by the low correlation between genetic and geographic distances. However, this correlation was significant demonstrating that gene flow is or was relatively limited and that isolation by distance might have played (and certainly still does) an important role in differentiation of olive populations.

In comparison to other population genetic studies of olive subspecies that mostly employed random sampling approaches, an effect of our sampling design on the obtained results should also be considered. Although no closely related or clonal trees have been found, neighbouring trees were slightly more related than more distant trees based on the spatial autocorrelation analysis. Finally, the STRUCTURE analysis did find an ambiguous number of clusters, which can be interpreted as a lack of a strong population substructure. A large individual genetic admixture and rather arbitrary assignment of individuals to different clusters mean no obvious substructure except for the Shams 3 slope site that behaved slightly different. However, it is known that population structure algorithms in BAPS, GENELAND and STRUCTURE computer programs tend to overestimate K, especially when isolation-by-distance is strong (Frantz et al. 2009; Guillot et al. 2005; Pritchard et al. 2000). Therefore, the low, but nonetheless significant isolation-by-distance found in our study can partly explain the rather large number of clusters inferred in our sample by STRUCTURE analysis, although those numbers could be rather illusive. Nevertheless, simulations by Guillot and Santos (2009) found that such interferences are stronger particularly in populations with small dispersal distances and spatially irregular patterns (clumped sampling as ours). Moreover, Pritchard et al. (2010) even stated that they permit allele frequencies to be correlated across populations in STRUCTURE to allow accurate assignments of individuals in very closely related individuals, though possibly at the cost of making it likely to overestimate the number of clusters. Hence, we should not overemphasize the number of clusters, especially when population structure and overall differentiation measures are low. However, effects of more recent fragmentation events could be reflected in the next offspring generation suggesting the need for additional studies, particular with respect to a very low recruitment and die-backs that seems to be chronic for the population in the Hajar Mountains. Finally, hypothesis 3 must be rejected as no evidence on genetic bottlenecks and signatures of inbreeding were found.

Conclusions

We could not identify the driving forces of low regeneration and die-back of trees, but suggest cumulative abiotic and biotic effects as the most plausible cause for demographic change. Remoteness alone appears to be not an explanatory factor for the viability of stands. Therefore, we failed to answer hypothesis 1. The absence of identical multi-locus genotypes led us conclude that clonal growth is non-existent at least for the studied individuals and reject hypothesis 2. The studied populations contained relatively high levels of genetic variation and low differentiation based on both nSSR markers that are likely selectively neutral and EST-SSR markers that could be under selection. Overall, bottleneck effects and inbreeding expected under the prevalent conditions have not been found prompting us to reject also hypothesis 3. However, comparatively high levels of genetic variation do not preclude that this population could become vulnerable, especially considering their fragmentation, the lack of regeneration, over-aging, and ongoing climate variations. The unexpectedly higher diversity found in younger size classes merits further studies of sibling generations to better address the questions regarding sustainability of this population, and whether they need special protection.