Background

Biodiversity patterns are formed by the co-distribution of resident species in space and time. Global and regional conservation relies on understanding the mechanisms that generate and maintain these biodiversity patterns (Condit et al. 2002; Myers & LaManna 2016; Socolar et al. 2016), especially when facing mounting challenges from man-made environmental change, such as climate change, urbanisation and biological invasions (Latombe et al. 2017; Hui & Richardson 2017). As the foundation of these biodiversity patterns, the species-by-site matrix allows the immediate measurement of biodiversity composition and turnover within and across sites so that changes can be traced, assembly processes tentatively inferred, and environmental drivers identified (Dornelas et al. 2014; McGill et al. 2015). To this end, measures of species compositional turnover over space and time, best known under the banner of beta diversity, have been rapidly developed over the past decades, with their performance hotly contested (Magurran & Henderson 2010; Anderson et al. 2011; Shimadzu et al. 2015). Indeed, a good understanding of beta diversity could allow us to partition diversity into within- and between-site dissimilarity (i.e. alpha and beta components) (Jost et al. 2010; Tuomisto 2010). When combined with environmental and trait measures, such diversity partitioning allows us to differentiate drivers or sensitive traits that are crucial to maintain within-site species coexistence from those that are responsible to cross-site differentiation (Lebrija-Trejos et al. 2010; Hui et al. 2013; Latombe et al. 2017).

With these pairwise beta diversity metrics, we could still miss important information on the full spectrum of biodiversity, due to the inability of beta diversity to partition biodiversity of three or more sites (or communities). For instance, it is not possible to express biodiversity components that are unique to a particular site or shared by all sites. Moreover, pairwise turnover is predominately driven by missing or gaining those rare and satellite species between communities, whereas the widespread common species are often shared between communities thus contributing little to beta-diversity turnover. Beta diversity is thus biased towards turnover caused by rare species with a limited distribution. Studies on beta diversity are thus overemphasising the role of specific drivers (including chance) in determining the presence or absence of rare species, and missing the spatial drivers of common species that often dominate ecosystem functioning (McGill et al. 2015; McGeoch & Latombe 2016). To this end, multiple-site similarity/dissimilarity measures have been developed (Koch 1957; Diserud & Odegaard 2007; Baselga et al. 2007). However, these multiple-site metrics provide a composite summary of turnover across all sites that are difficult to interpret (Diserud & Odegaard 2007; Ricotta & Pavoine 2015). Because these metrics are largely a composite summary of alpha, beta and gamma components, they are sensitive to the number of sites concerned. For instance, with increasing sample size, Koch’s and Disrerud & Odegaard’s metrics approach zero, while Baselga et al.’s metric approaches one, making them insensitive for comparing species turnover among large numbers of communities.

Zeta diversity was recently developed as a concept for quantifying the number of species shared by multiple sites (Hui & McGeoch 2014). This metric allows us to calculate all components in multi-site diversity partitioning and provides a common currency for building and connecting other incidence-based diversity measures and patterns. Unlike incidence-based beta diversity metrics that are biased towards identifying the contribution of rare species to turnover, zeta diversity provides information on the full spectrum of rare to intermediate and common species as they contribute to driving compositional turnover. Instead of focusing on comparing multi-site metric performance, the concept of zeta diversity provides an alternative by embracing the n-dependency in multiple-site diversity partitioning (where n represents the number of sites/communities), similar to the argument on embracing the scale-dependency in cross-scale diversity partitioning (Hui & McGeoch 2007, 2008). Although zeta diversity has a number of conceptual and analytical advantages over related multiple-site diversity metrics, it has only just started to be used for quantifying and predicting biodiversity patterns in empirical data (Roura-Pascual et al. 2016; Roigé et al. 2016; Latombe et al. 2017; McGeoch et al. 2017; Vaz et al. 2017; Kunin et al. 2018). To this end, we will illustrate how the outputs from zeta diversity analyses can provide valuable and novel insights about the species turnover and community assembly processes in an Afromontane forest.

In particular, we aim to test whether the observed multi-site species turnover is driven primarily by ecological differences (e.g. species-specific occupancies) or by habitat heterogeneity (i.e. site-specific alpha diversity as a proxy of site quality). Community assembly can be in principle governed by four high level forces: dispersal, drift, selection and speciation (Vellend 2016), with environmental filters often identified as an important driver of community assembly in tropical forests (Lebrija-Trejos et al. 2010). Although the neutral theory of ecological communities (Hubbell 2001) has successfully explained a number of community patterns, notably the lognormal-like relative abundance distribution (Volkov et al. 2003, 2007), it has failed to explain beta turnover in Panama and Amazonian forests (Condit et al. 2002). That is, the selection force (e.g. competition and consequently niche partitioning; Tilman 2004) that is missing in the neutral model could have driven the species turnover in those forests. Consequently, we further refine our search for signs of niche differentiation and competition dominance in driving observed ecological differences and species turnover. Specifically, we examine the parametric form of zeta diversity declining with zeta order (power law [signalling niche differentiation] versus negative exponential [signalling the dominance of stochastic force]; Hui & McGeoch 2014).

Methods

Data and study area

Forest data have greatly contributed to the development of ecological theories and debates in the past (Condit et al. 2000; Hubbell 2001; Liang et al. 2016). Here, we intended to explore species turnover patterns in an Afromontane forest in the southern Cape, part of the largest forest complex in southern Africa (Von Maltitz et al. 2003; Mucina and Rutherford 2006). These forests are scattered on the narrow coastal strip south of the Outeniqua and Tsitsikamma mountain ranges (approximately 33°45’ S latitude) and cover a total area of ca. 600 km2 (Geldenhuys 1991). The Southern Cape Forest enjoys high conservation status (Mucina and Rutherford 2006), with more than half of its extent located in the Garden Route National Park (GRNP). It comprises of 470 species that belong to 280 genera and 106 families (Geldenhuys 1993), of which 47 and 27 are canopy and sub-canopy species respectively. This relatively high richness has been attributed to the range of diverse habitat types and dispersal corridors that linked these forests to the Afromontane Eastern Escarpment (Geldenhuys 1992). Most of the canopy and sub-canopy tree species are widespread, and common or scattered in moist to dry forest. Regeneration is generally good, and mast fruiting occurs in several species (Von Maltitz et al. 2003). Most species are shade tolerant and locally persistent (Midgley et al. 1990).

The study area forms part of the Diepwalle Forest Dynamics Research Site, also known as the Diepwalle French Volume Curve (FVC) Research Area, which was established in 1937 and located to the north of the town of Knysna on the southern coastline of South Africa, at 33°56’ S, 23°09′ E (Gadow et al. 2016). The site is at an altitude of 400 m above mean sea level and has a predominantly southerly to south-westerly aspect. The climate can be described as moist, warm temperate. Rain occurs throughout the year, usually with peaks during the autumn and spring months. The mean annual rainfall at Diepwalle is about 1200 mm. The summers are warm (mean daily maxima 23.2 °C) and winters mild (16.6 °C). The site was used until 1954 for testing experimental management systems, but no further treatments have been applied since. The current forest dynamics monitoring programme was initiated in 1972 when all trees with a diameter at breast height (dbh) of 10 cm and greater were numbered, the species identified and the diameters measured and recorded. The point of measurement was permanently marked by means of a painted line (Van Daalen 1991). Areas covering a total of 67.7 ha and containing more than 50,000 trees have been re-measured periodically and have been managed as part of the Garden Route National Park since 2005.

This study makes use of 1987 data, covering an area of 4.56 ha (approximately 380 m by 120 m) with the main canopy 18 to 22 m high (Van Daalen 1991). The forest plot contains 3215 trees of 25 species (Fig. 1), with Olea capensis subsp. marcocarpa the most common canopy species (731 individuals). It is slow growing and shade tolerant (Van Daalen 1991). Flowers are self-pollinated and by insects, and fruit dispersed by birds and mammals, with mast fruiting events often occurring (Von Maltitz et al. 2003). The top 5 most abundant species account for 72% of the total number of individuals. We divided the forest plot into 114 sites using a 20 m grid mesh and conducted zeta diversity analyses for the matrix of 25 species by 114 sites (Fig. 1).

Fig. 1
figure 1

A snapshot of the 3215 individual stems of 25 species. Different colours represent separate species and size proportional to the diameter at breast height, overlaid with a 20 m-resolution grid mesh, in the forest plot of the Diepwalle State Forest surveyed in 1987 used in this study

Zeta diversity

Similar to alpha and beta diversity, zeta diversity can be used in a variety of analyses so to capture multiple facets of biodiversity. Simply put, zeta diversity of n sites, ζ n (using the lower case Latin letter of zeta), is defined as the number of species shared by n sites, with n named zeta order hereafter (Hui & McGeoch 2014). Computation can be easily implemented by using the R package zetadiv (Latombe et al. 2016). By definition zeta diversity declines with zeta order. For instance, ζ1 is the average number of species in one site (alpha component), and ζ2 is the average number of species shared by any two sites (ζ1ζ2 normally represents beta component). Because species shared by n + 1 sites are a subset of species shared by n sites, the ratio of zeta diversity of adjacent orders (ζn + 1/ζ n ) represents the probability of retention that a species already shared by n sites is also expected to occur in one extra site, or equivalently that the proportion of species known to occur in n sites is also shared by n + 1 sites. This probability of retention tells us the chance of a species with a particular level of occupancy (e.g. occurred in n sites) occurring in an extra site. By calculating spatially explicit zeta diversity, say, using the moving window technique, we could visualise and identify how species similarity decays with geographical distance or environmental gradients, reflecting the distance decay patterns named isolation by distance and isolation by resistance, respectively (e.g. Berthouly-Salazar et al. 2013). Key variables can then be identified using multivariate statistics, such as multi-site generalised dissimilarity modelling (Latombe et al. 2017), to discern different combinations of determinants behind species turnover driven by species of different levels of occupancy.

We used two null models to test drivers of species turnover (measured by zeta diversity of different orders). The two null models permute elements of the species-by-site (row × column) matrix of species occurrence (1: presence; 0: absence) under constraints of keeping the observed row sums or column sums the same as observed (Gotelli & Graves 1996). When the row sums were kept, significant difference between the observed zeta diversity turnover and the null model expectation would signal mechanisms other than those determining ecological differences (here, species occupancies; i.e. row sums) driving the multi-site species turnover in the community. When the column sums were kept, significant difference between the observation and the null model expectation would suggest mechanisms other than those determining local species richness (most possibly site/habitat heterogeneity) driving the multi-site species turnover. As has often been argued in the literature that abiotic gradients between sites (and thus environmental filters) are the primary drivers of species turnover (e.g. Lebrija-Trejos et al. 2010; Soininen 2010), we should expect to see that the observed zeta turnover pattern conforms to the null model expectation with column sums constrained. Alternatively, more common species are increasingly more likely to be shared among sites (see the scale-heritage assumption; Fig. 2 in Hui & McGeoch 2008), which predicts that species turnover driven more by ecological differences of occupancy frequencies and thus expecting the observed pattern to conform to the null model with row sums constrained.

Fig. 2
figure 2

Zeta diversity declines with zeta order. Zeta diversity represents the average number of shared species (red dots; grey band with yellow bounds: ±0.5 Standard deviation); zeta order represents the number of sites in the calculation of zeta diversity. Green curves represent the ±0.5 Standard deviation range of zeta diversity from the null model (1000 runs) keeping row sums (species occupancy) the same as observed. Red curves represent the ±0.5 Standard deviation range of zeta diversity from the null model (1000 runs) keeping column sums (site richness [habitat heterogeneity]) the same as observed. The observed zeta diversity decline follows a power law (blue curve)

To further infer the possible mechanisms behind the likely ecological differences between species that drive species turnover in the community, we explored the parametric form of zeta diversity decline. Zeta diversity normally declines with increasing zeta order in two forms: power law and negative exponential (Hui & McGeoch 2014). A power law decline signals niche differentiation between species (e.g. species A is more likely to occur in site 1 than in site 2, while species B prefers site 2 over site 1). A negative exponential decline represents stochastic occurrence, even when sites are heterogeneous (hosting different alpha diversities). Once the parametric form of zeta diversity decline is determined, we could further calculate a species accumulation curve and thus estimate species richness in the regional pool through extrapolating the total number of species by extending the number of sites n to a large number (Hui & McGeoch 2014). When the decline of zeta diversity follows the negative exponential form, the zeta-based richness estimator becomes the same as the incidence-based Chao II estimator which estimates richness based on the frequency of singletons and doubletons in samples. However, the zeta-based richness estimator allows us to calculate the expected richness when the decline does not follow the negative exponential form and when all resident species are relative common (i.e. lacking of either singletons or doubletons in samples).

Results

We calculated zeta diversity for the first 20 zeta orders, with the mean and standard deviation steadily declining from ζ1 = 9.39 ± 1.68, ζ2 = 6.19 ± 1.39, to ζ5 = 3.80 ± 1.06, ζ10 = 2.55 ± 0.96 and ζ20 = 1.62 ± 0.85 (Fig. 2). Importantly, the null model that constrained the row sums predicted perfectly the species turnover patterns, while the null model that constrained the column sums underestimated the zeta diversity at higher orders (Fig. 2), with the null model expectation close to zero at order 5. Consequently, we can argue that species turnover in this community is completely determined by factors of ecological differences (species occupancies) and has little to do with site/habitat heterogeneity. Moreover, the probability of retention increased with zeta order and increased to above 0.9 for species occupying more than 5 sites (Fig. 3), indicating that common species are more likely to occur than rare species on extra sites and thus signalling ecological differences between species with different levels of occupancy.

Fig. 3
figure 3

Probability of retention for species of a given level of occupancy. The probability is calculated as ζn + 1/ζ n (red dots), representing the probability that a species of a given level of occupancy (n) will be retained in one extra site. The blue curve represents the prediction from the power law decline of zeta diversity (see Fig. 4)

Spatial mapping of zeta diversity for the Afromontane forest plot supported the claim on the insignificant effect of habitat heterogeneity on species turnover, by revealing the fine-grain structure and thus little noteworthy spatial patterns (Fig. 4). However, looking closely, we can see that areas with greater higher-order zeta diversity, representing locally common species in the calculation (when using the moving window technique; e.g. see lighter areas in Figs. 4c and d), seemingly coincides with areas of high alpha diversity (local species richness; lighter areas in Fig. 4a). However, after normalising higher-order zeta diversity by local alpha diversity (i.e. the proportion of locally common species; Fig. 4h), the above assertion seems disappeared: areas with many locally common species (Fig. 4c) do not overlap well with areas having a higher proportion of locally common species (Fig. 4h). This could be explained by the spatially contrasting probability of retaining species at different local levels of commonness (Fig. 4eg). For instance, it is more likely that a locally rare species is retained than a locally common species in the top-left corner of the landscape (compare top-left corner of Fig. 4e with Fig. 4g). Consequently, we suspect that fine-scale processes such as seed dispersal/recruitment with a characteristic scale of 20 to 50 m could be driving such a fine-grained community structure.

Fig. 4
figure 4

Spatial patterns of local zeta diversity. a-d local zeta diversity of zeta order 1 to 4, respectively; e-g the probability of retention (ζn + 1/ζ n ) for zeta order 1 to 3; h normalised zeta diversity (local zeta diversity of order 3 divided by local zeta diversity of order 1 [i.e. by local alpha diversity])

As factors of species occupancies, other than habitat heterogeneity, determine the pattern of species turnover in this community, we further explored the parametric form of zeta diversity declining with zeta order. The decline of zeta diversity with zeta order followed closely a power law (Fig. 2), ζ n  = 9.432·n–0.567, R2 > 0.999, AICc = − 49.06; nonlinear least squares), instead of the negative exponential form (R2 = 0.956, AICc = 51.42), supporting the assumption of a non-stochastic assembly force. We noticed that the exponent of the power law (0.567) lies within but close to the lower bound (0.38) of the 95% confidence intervals for reported communities in literature (Hui & McGeoch 2014). Using the richness estimator derived from the framework of zeta diversity, we estimated 25.83 species in 128 samples (note, there are 25 species observed in 114 samples [sites] of the entire forest plot), 31.05 species in 41 ha, 40.20 species in 52.43 km2, and 43.11 species in 419.43 km2 (Fig. 5).

Fig. 5
figure 5

Estimated number of species from zeta diversity (curve) and observed species accumulation curve (red dots). Note, due to the commonness of the species (lacking of singletons or doubletons), traditional species richness estimators (e.g. Jack-knife or the Chao II estimator) cannot be used for the dataset. Sample grain is 20 m × 20 m, and the extent of 106 samples represents an area of 400 km2

Discussion and conclusions

We have derived three propositions from the zeta diversity analysis of the forest snapshot data: (i) species turnover in this community is completely determined by factors driving ecological differences of species occupancies, and has little to do with site/habitat heterogeneity; (ii) non-stochastic assembly force, although not clearly identified in this study, could explain the increasing probability of retention of locally more common species; (iii) fine-scale processes such as seed dispersal/recruitment with a characteristic scale of 20 to 50 m could be driving such fine-grained community structures. These three propositions are consistent with results on community assembly in Afromontane forests. Indeed, although still poorly understood in these forests, the non-stochastic force of competition between trees of different species plays an important role in forest dynamics (Van Daalen 1993), with the growth of most species below the canopy influenced more by crown form and less by crown position. Canopy and sub-canopy species display distinct differences in their reactions to competition (Seifert et al. 2014). Growth is strongly influenced by light availability for canopy species in the lower canopy and subcanopy levels, while trees at the canopy level of inherently faster growing species usually reach larger sizes and are more responsive to light, with slower growing species occurring below faster growing species and vice versa (Seydack et al. 2011). The growth of subcanopy trees of canopy species is also constrained by the proximity of adult trees of the same species, particularly for the two dominant canopy species, viz. Olea capensis subsp. macrocarpa and Podocarpus latifolius (Seydack 2000). Moreover, species-level analyses of southern Cape forests in the region abound on regeneration and competition (e.g. Midgley et al. 1990; Seifert et al. 2014) and have revealed the pattern of being fine grained indicating that current tree canopy species in a mixed stand can regenerate under the same canopy (Geldenhuys 2009). Most of the canopy tree species in the forests are shade tolerant and can regenerate and establish continuously without major disturbances (except for Olinia ventosa), with the ironwood Olea capensis subsp. macrocarpa dominating both the canopy and regeneration of the fine-grain platform forests, where the disturbance regime is regular, generally with small, natural gaps (Geldenhuys & Maliepaard 1983; Midgley et al. 1995).

As a side-line contribution, due to the lack of rare species (e.g. singletons and doubletons), zeta diversity framework still allows us to extrapolate and estimate the number of species in the regional species pool (Fig. 5). This extrapolation is obviously not a process-based method but based on extending zeta diversity to higher orders using the observed power law. As higher order zeta diversity is only a subset of lower order ones, this declining trend constrains the variation of zeta diversity at higher orders and in turn makes extrapolation of higher order zeta diversity less problematic. Importantly, the formulation of zeta diversity allows the use of any forms of zeta diversity decline for extrapolation. Such non-process based methodology emphasizes on cross-scale pattern consistency and constraints (e.g. Hui et al. 2006; Hui 2009; Harte 2011) and has been shown to outperform many process-based methods for inferring fine scale occupancies and abundances (Hui et al. 2009; Barwell et al. 2014). Such inferences reflect the most probable or likely predictions without additional information/processes assumed. Specifically, the estimator of regional species richness from zeta diversity (Hui & McGeoch 2014) is mathematically related to the theory of finite differences (Ruiz 1996). Zeta diversity for upscaling extrapolation has been compared to more than ten species-area methods and performed reasonably well (Kunin et al. 2018). To this end, we would advocate the use of zeta diversity as a new or unified framework of formulating macroecological patterns, rather than a specific structure or turnover metric, for biodiversity upscaling and partitioning.

We intend to propose zeta diversity as a common language or set of bricks for building the full spectrum of biodiversity structures, but not to specify what the biodiversity structures should look like. Arguably, such non-process based methods could only provide the bound estimates for realised patterns. Obviously, community assemblage patterns are driven by the four high-level processes (Vellend 2016): speciation, drift, dispersal, and selection. However, these four processes could manifest into a plethora of theories, such as the neutral theory which emphases only the drift, dispersal and the possibility of speciation in a metacommunity (Hubbell 2001) and can explain observed skewed species abundance distributions (Volkov et al. 2003, 2007). Such process-based models could further support specific estimations within the derived pattern-based bounds. To date, the marriage between pattern and process based methods has yet to produce cohesive and practical joint theories, largely due to the conundrum in process-to-pattern projection (not simply 1-to-1; rather, could often be 1-to-many and many-to-1).

We only studied a snapshot of the forest community, where the Afromontane community exhibits a rather fine-scale pattern or the lack of notably spatial structure. As a promising future research area, considering more realistic temporal and recruitment dynamics at landscape scales could be fruitful (Condit et al. 2006), for instance using the framework of landscape demography (Hui et al. 2017) which could handle the covariance in local patch dynamics caused by both dispersal and the Moran effect. Indeed, many works have pointed toward the role of demographic rates in space and time as the potential framework for unifying community assemblage patterns and methods (Law et al. 2003; Condit et al. 2006). Extending the zeta diversity framework from a platform of spatial turnover analysis to temporal turnover investigation is an ongoing research area. The extremely slow rate of zeta diversity decline in this community suggests a number of extremely common species dominating the landscape, while the rare species could have a large temporal turnover across years. In particular, the four most common species Ironwood (Olea capensis subsp. Macrocarpa), Kamassi (Gonioma kamassi), Real Yellowwood (Podocarpus latifolius) and Assegai (Curtisia dentata), were also most common in the community in 1972 (15 years before the current survey). In contrast, the 10 most rare species include White Silky Bark (Cassine eucleiformis), Cape Plane (Ochna arborea), Wild Pomegranate (Burchellia bubalina), Cape Beech (Rapanea melanophloeos), Coalwood (Lachnostylis hirta), Cape Holly (Ilex mitis), Dune Olive (Olea exasperata), and three newly emerged species in 1987: Cape Pock-ironwood (Chionanthus foveolatus), Glossyleaf (Rhamnus prinoides) and Wild Gardenia (Rothmannia capensis). Three rare species found in 1972 disappeared in the 1987 community: Hard Pear (Olinia ventosa), Wild Peach (Kiggelaria africana) and White Alder (Platylophus trifoliatus). In particular, Olinia ventosa is fast-growing and regarded to be more light demanding, requiring bigger disturbances for establishment. This could explain its absence during the 1987 re-measurement. As predicted from the zeta diversity framework (Fig. 5), the probability of retention for rare species (roughly occupancy < 5 sites) is much lower than for common ones (occupancy greater than 100 sites). This suggests that conservation planning should differentiate common from rare species as different processes could drive their recruitment and persistence.

By extending the pairwise diversity partitioning into alpha and beta components, the framework of zeta diversity, together with standing tools in community ecology, provides a more complete picture of how diversity is organised in a community. Importantly, zeta diversity allows us to differentiate the role of common versus rare species in driving the compositional turnover patterns, as well as the potentially different sets of environmental determinants behind species turnover [not explored here]. This provides a more refined target and tool for conservation planning. The spatial analysis of zeta diversity points towards some potential of spatially refined site-based conservation planning to balance the conservation of species-rich sites versus sites with more locally common (or rare) species. Indeed, of the five management programmes in the management plan for the Garden Route National Park (SANParks 2012), the Terrestrial Vegetation Conservation Programme is aimed at ensuring the long-term persistence of biodiversity patterns and processes, enabling natural variation in structure, function and composition over space and time, and the Species of Special Concern Programme of SanParks is aimed to identify and maintain viable populations of selected species. These management programmes are supported by a research and monitoring programme, with forest dynamics, forest gap dynamics, forest fire disturbance, vegetation surveys and species of special concern, together with the sustainable use of timber and non-timber forest products, representing important focus areas for forest research (McGeoch et al. 2011; SANParks 2014). Our results further emphasise the potential and necessity of developing different target-species strategies for conserving common versus rare species, and target-site strategies for conserving species-rich sites versus sites with disproportionally more locally common or rare species. Specific objectives (e.g. conserving alpha diversity versus ensuring a certain level of species turnover) may demand different strategies.