Introduction

Species diversity is one of the key measures of ecosystem functioning used in ecological sciences. It is a fundamental feature of biotic communities and an attractive topic in the long-lasting debate of community ecology, as it effectively integrates multiple ecosystem processes and drivers (Wardle et al. 2001; Carli et al. 2018). Determining the number of species present in a plant community and identifying the causal factors behind this pattern is crucial from both theoretical and management perspectives.

Temperate forests range from plant species-poor to species-rich, but these diversity values commonly depend on the geographical scale and habitats analysed (Huntley 1993; Večeřa et al. 2019). Plant species richness in these forests reflects the effects of abiotic factors, biotic interactions, management and human pressures, but the number of plant species is also driven by evolutionary processes. The role of species dispersal strategies for local plant assemblages is also frequently discussed in biodiversity observational studies and syntheses (Zobel 1997; Paillet et al. 2010; Corlett 2016; Schulze et al. 2016; Večeřa et al. 2019; Šipoš et al. 2020). Plant species diversity in floodplain and swamp forests strongly vary, with locally higher richness than in other types of forests such as beech, oak or ravine forests (e.g., Pielech 2021; Hrivnák et al. 2022).

Floodplain forests and alder carrs are dynamic ecosystems with high ecological and conservation values. While riparian forests belong to habitats with strong conservation status according to the European Habitats Directive 92/43/EEC, alder carrs do not share a similar legislative position (see Evans 2010). However broadleaved swamp woodland on non-acidic peat, including alder carrs (EUNIS habitat G1.4), are considered vulnerable habitats at the European level (Janssen et al. 2016). Riparian and swamp alder forests are included in several Central European habitat Red lists (e.g. Rennwald 2000; Chytrý et al. 2019). Both forest types are generally threatened by the synergistic effects of human-induced activities. They are mainly subjected to modifications of natural hydrological regime and associated plant invasions, whose negative effects could be amplified by ongoing climate change (Nilsson & Berggren 2000; Richardson et al. 2007, Attore et al. 2011). These habitats cover a broad scale of flooded and permanently waterlogged broad-leaved forests occurring along water courses, in spring areas and river valleys, as well as in swamps, mires and edges of water reservoirs from lowlands to montane areas (Ellenberg 2009; Douda et al. 2016). Their species richness is influenced by a complex of ecological drivers represented by both local and regional factors, including site heterogeneity and environmental history (Douda et al. 2016; Slezák et al. 2017; Pielech 2021). Furthermore, plant invasions may be increasingly influential in shaping the patterns of species richness within different forest vegetation types. This trend seems to be pronounced especially in riparian forests, which have been identified as one of the most invaded plant communities in Europe (Chytrý et al. 2009; Pyšek et al. 2010; Wagner et al. 2017), including Slovakia (Medvecká et al. 2014; Slezák et al. 2022).

Robust vegetation-plot and ecological data sets of black alder-dominated forests recently collected throughout biogeographical regions of Central Europe provide an opportunity to explore the link between plant species richness and ecological variables. It has been previously found that the High Western Carpathians situated on the border between Slovakia and Poland (belonging to the Alpine bioregion) are the hotspot of native plants in these stands. In contrast, low-altitudinal regions, the Pannonian lowland and southern part of the Western Carpathians (Hungary and Slovakia) and the Polish Plain were considered to be the richest in alien species (Hrivnák et al. 2022). However, the environmental background of these patterns was not evaluated and mechanisms affecting variability in species richness remain poorly understood. Therefore, the present study aimed to identify the environmental factors controlling the species richness in the studied area and especially in the given regions, separately for native and alien plants. In particular, knowledge of the factors that support the species richness of alien plants can help manage activities to limit their occurrence.

Materials and methods

Study sites

The study area encompasses a latitudinal gradient ranging from 45.8085° to 52.6115 °N, with four regions: Pannonian lowland (Pannonia; 45.8085°–47.1271 °N), Matricum as the southern part of the Western Carpathians (MatrWeCa; 47.8565°–48.2861 °N), the High Western Carpathians (HighWeCa; 48.8737°–49.7251 °N) and the Polish Plain (PolPla; 50.9123°–52.6115 °N; Fig. 1). Elevation increases from the Polish and Pannonian lowlands through the Matricum to the (sub)montane areas of the High Western Carpathians. The latter region has the highest amount of precipitation and the lowest air temperature values, while the climatic characteristics of the other regions vary considerably (Table 1). Pannonian lowland is a part of the Pannonian biogeographical region (hereafter referred to only as bioregions) and PolPla refers to the Continental bioregion. MatrWeCa and HighWeCa belong to the Matricum and Carpaticum occidentale within the Alpine bioregion (for more details see Futák 1984; EEA 2019; Cervellini et al. 2020).

Fig. 1
figure 1

Studied area with 140 vegetation plots (black points)

Table 1 Basic statistics of environmental variables used in the modelling of species richness in the studied regions

The study was conducted in floodplain forests (phytosociological alliance Alnion incanae Pawłowski et al. 1928; class Alno glutinosae-Populetea albae P. Fukarek et Fabijani 1968) and alder carrs (alliance Alnion glutinosae Malcuit 1929; Alnetea glutinosae Br.-Bl. et Tx. ex Westhoff et al. 1946), both dominated by Alnus glutinosa. These forests cover the vegetation of periodically flooded floodplains and waterlogged sites.

Field sampling and laboratory analyses

Vegetation data sampling was carried out in physiognomically and structurally homogeneous old-growth forest stands with a dominant cover of Alnus glutinosa in the tree layer (i.e. canopy cover of more than 50%). We used a uniform sampling size (400 m2) and square or rectangular shape that was driven mainly by the river valleys’ morphology. All vascular plants were recorded in each plot for three layers (tree, shrub and herb). The geographical position and elevation of the plot centre were measured using a Global positioning system receiver.

Altogether, we sampled 173 plots using the traditional European phytosociological approach (Dengler et al. 2008) during the vegetation season (June–August) of 2010–2019. Field sampling was carried out with respect to spatial, environmental and vegetational heterogeneity (Hrivnák et al. 2015, 2022; Slezák et al. 2017). The distance between the two nearest plots of the same vegetation type was usually at least 4 km.

Vegetation records were stored in Turboveg database software (Hennekens & Schaminée 2001) and exported to Juice software (Tichý 2002). Then, 35 plots per region (140 plots in total) were randomly selected to create equal datasets for the regions. Alder carrs with 64% of plots prevailed in the whole dataset, but each region was composed of both types of black alder forests. The proportions of vegetation plots were as follows: Pannonia 71% and 29% (alder carrs and floodplain forests), MatrWeCa 57% and 43%, HighWeCa 46% and 54%, and PolPla 83% and 17%. Species nomenclature was standardised according to the Euro + Med (2006). Vascular plants not determined to species level were deleted (i.e. 25 taxa at the genus level with 57 occurrences), and few taxa were merged into aggregates or a broadly defined level (for details see Hrivnák et al. 2022). The remaining plant species were classified as native and aliens (archaeophytes and neophytes) according to Medvecká et al. (2012) for the MatrWeCa and HighWeCa, Zając and Zając (2011), and Tokarska-Guzik et al. (2012, 2014) for the Polish Plain, and Csiky et al. (2023) for the Pannonian region. Only presence-absence data were used in the analyses. Local alpha (α) diversity (species richness) was defined as the number of plant taxa recorded in a single sampling plot.

Three soil samples were taken from the topsoil mineral horizon without the litter layer at a depth of 0–10 cm and pooled into a single sample per plot to reduce soil heterogeneity. Soil samples were air-dried at laboratory temperature, then crushed and passed through a 2 mm sieve. Soil pH was measured in distilled water (soil/water ratio of 1/5) using a pH meter (WTW Inolab pH 720). Total carbon (C) and nitrogen (N) contents were determined using an NCS-FLASH 1112 analyzer (CE Instruments, UK), and plant-available phosphorus (P) was extracted in the Mehlich II solution and measured by spectrophotometry (AES-ICP). Cations of calcium (Ca), magnesium (Mg) and sodium (Na) were extracted in the Mehlich III solution and determined using an atomic absorption spectro-photometer (SensAA, GBC; UK). Mean annual temperature, total annual precipitation and precipitation of the warmest quarter were retrieved from WorldClim version 2 [http://www.worldclim.org] on a grid background with a spatial resolution of 30 arc-seconds. The catchment slope, stream power index (SPI) and topographic wetness index (WI) were derived from a digital elevation model and calculated in GRASS GIS (Neteler et al. 2012). CORINE Land Cover 2000 basic types in the surroundings of the plots (circle with a radius of 500 m) were obtained from GeoModel Solar (2014). Subsequently, the number of these habitats for a given plot was counted and the proportion of the following habitats, artificial surfaces, agricultural areas, forests and semi-natural habitats, and wetlands, were identified. Altogether 20 environmental variables were selected (Table 1).

Data analysis

To identify significant environmental drivers of native and alien species richness in Central European regions, we used model-based regression trees (Zeileis et al. 2008). Trees with Poisson distribution were used to fit species count data. To avoid overfitting and selection bias during recursive splitting, we applied a conditional inference approach where the results of structural change tests (5% significance level) were employed for variable selection and as the stopping criterion for growing the trees (Hothorn et al. 2006). Measured environmental characteristics were used as potential predictors in the tree models. Initially, we screened the environmental data for collinearity using pairwise Spearman correlations (Supplementary information S1). In cases where two predictors exhibited high collinearity (r ˃ 0.65), only one predictor was retained for further analyses. Elevation, annual precipitation, soil C and Mg, and cover of forests were excluded from the analyses due to collinearity. Finally, 15 environmental (explanatory) variables (Table 1) were used to examine the relationships between species richness and the environment. Trees were independently developed for native and alien species, both for all regions (pooled dataset) and individually for each specific region (Pannonia, MatrWeCa, HighWeCa, and PolPla). Residuals of each model were screened for spatial autocorrelation using correlograms (Bjørnstad & Falck 2001); no significant spatial dependence pattern was observed. The analysis was performed in R (R Core Team 2022) using the packages ggparty (Borkovec and Madin 2019), ncf (Bjørnstad and Falck 2001; Bjørnstad 2022) and partykit (Hothorn and Zeileis 2015).

Results

Altogether 432 plant taxa were recorded in all regions including 57 alien plants (~13%). Native plant species richness was the highest in the region HighWeCa (mean value 45.3, min. 20 species—max. 68 species), followed by MatrWeCa (39.5, 19–80), PolPla (35.6, 13–54) and Pannonian (29.6, 13–50). Alien plant species richness showed an opposite pattern. The highest values in the Matricum (3.4, 0–9) contrasted with the lowest values of the HighWeCa (0.7, 0–3), while species richness in the PolPla (2.4, 0–7) and the Pannonian (2.1, 0–6) was similar.

Soil P and C/N ratio showed similar values across all regions. Soil reaction, concentrations of Ca and Mg were higher in the southern regions (MatrWeCa and Pannonian) compared to the northern regions (HighWeCa and PolPla), but the opposite trend was found for C and N. Cover of artificial surfaces was equally low in all regions. PolPla and Pannonian had higher values than the other regions for agricultural areas and wetlands, respectively. Forests showed the highest and lowest values in MatrWeCa and PolPla. The variability of habitats was similar across regions. The mean catchment slope decreased from HighWeCa and MatrWeCa to Pannonian and PolPla regions. The catchment area increased from PolPla to Pannonia, while MatrWeCa and HighWeCa had similar values for this predictor. Topographic indices differed between the regions studied, i.e. SPI increased from PolPla, Pannonia and HighWeCa to MatrWeCa, whereas WI showed much more homogeneous values (Table 1).

The regression tree analysis identified five significant predictors of species richness in the whole dataset. The diversity of native species was divided into two groups based on slope (6.4°) (Fig. 2a). Steeper plots were further subdivided by soil pH into communities with lower and higher diversity on alkaline and acidic soils, respectively. Plots with gentler slopes were partitioned into richer communities of drier areas (precipitation of the warmest quarter ≤ 267 mm) and more humid sites with lower diversity. Alien species also showed the highest richness under drier climatic conditions (precipitation of the warmest quarter ≤ 227 mm) (Fig. 2b). More humid conditions favoured a higher number of alien species in plots with soil phosphorus levels lower than 13 mg/kg. Almost no aliens dwelled on soils with higher P concentrations, and if they did, they were restricted to plots with annual temperatures above 6.5 °C.

Fig. 2
figure 2

Poisson regression trees partitioning the native species richness (a) and alien species richness (b) in all studied regions based on significant environmental predictors. Threshold values for each partition, probabilities based on structural change tests (p), number of plots in a given group (n) and model determination coefficients (R2) are displayed

Regression tree analysis identified significant predictors of plant species richness in all regions except MatrWeCa. In HighWeCa, soil reaction (≤ 6.2) and the presence of artificial surfaces around the plots correlated with the highest number of native plant species (Fig. 3a). In contrast, alien richness significantly increased with the high proportion of agricultural areas (> 68%) (Fig. 4a). In PolPla, higher alien plant richness was associated with a higher stream power index (> 1.54; Fig. 4b). Native species richness was driven by air temperature (threshold value 10.6 °C) in the Pannonian region, where lower temperature increased the richness pattern (Fig. 3b). Richness of alien plants was higher at sites with soil reaction values below 7.2 (Fig. 4c).

Fig. 3
figure 3

Poisson regression trees partitioning the native species richness in High Western Carpathians (a) and Pannonian lowland (b) based on significant environmental predictors. Threshold values for each partition, probabilities based on structural change tests (p), number of plots in a given group (n) and model determination coefficients (R2) are displayed

Fig. 4
figure 4

Poisson regression trees partitioning the alien species richness in High Western Carpathians (a), Polish Plain (b) and Pannonian lowland (c) based on significant environmental predictors. Threshold values for each partition, probabilities based on structural change tests (p), number of plots in a given group (n) and model determination coefficients (R2) are displayed

Discussion

The use of ecological predictors and regression tree analyses to assess species richness in black alder-dominated forests in Central Europe yielded only partially satisfactory outcomes, especially when considering individual regions. Our models explained between 22 and 36% of the variation in species richness data using recent landscape cover, soil and climate as explanatory variables. These results are in line with the range of variation explained in other recent biodiversity studies conducted in floodplain forests (e.g. Slezák et al. 2017, 2022). However, we are aware that the predictive power of the statistical models is strongly controlled by the set of predictors and the modelling algorithm used (Večeřa et al. 2019).

Predictors responsible for variation in species richness patterns slightly overlapped between the full dataset and the regional subsets. The richness of native and alien plant species was controlled by at least a single predictor in almost all regions studied, with a more structured model recognised for the native richness in the HighWeCa. Our results did not find significant variables only for the richness of native and alien plants in the MatrWeCa, and for the richness of native species in the PolPla. There are several mostly methodological explanations for these findings in individual regions. Although we focused on environmental characteristics known to shape community diversity in the floodplain and swamp forests (e.g. Douda 2010; Slezák et al. 2017, 2020; Hrivnák et al. 2020; Pielech 2021), several additional and not analysed factors may also be important for overall species richness. For example, water regime and associated site productivity measures (e.g. light conditions) contribute to micro-site heterogeneity of floodplain forests (Douda et al. 2012), which in turn could play a significant role in plant species assemblages (Pielech 2021; Slezák et al. 2022). Generally, environmental heterogeneity has been recognized as another important factor governing species richness variation (Stein et al. 2014). We also suppose that among-site heterogeneity in species composition (beta diversity), the length of the environmental gradients studied and the limited number of plots sampled per region may influence our results. These may affect the explanatory power of statistical models and hinder the ability to detect more complex species richness-environmental relationships. Beta diversity explained by Jaccard´s dissimilarity of black alder stands is relatively high in all studied regions, with the lowest value found in the HighWeCa (Hrivnák et al. 2022). The combination of community heterogeneity and broad gradients of several environmental variables in this region, compared with other studied regions (Table 1), leads us to assume that ecological thresholds of species diversity are relatively well-defined in the HighWeCa. In addition, environmental prediction of local-richness patterns could be limited by the number of sampling plots.

Eight environmental variables identified as determinants of thresholds for plant species richness helped us to interpret previous diversity trends in both plant groups, alien and native species. In our case, the effects of four landscape characteristics, namely artificial surfaces, agricultural areas, catchment slope and SPI were accompanied by the well-established role of climate (air temperature and precipitation in the warmest quarter) and two soil characteristics (reaction and concentration of P). The artificial surfaces and agricultural areas are closely related to human activities. Intensely used human-made habitats such as arable land/permanent crops, gardens or grasslands are frequently reported sources of the spread of alien plants into native forest habitats (Chytrý et al. 2005, 2009; Wagner et al. 2017). The plant species composition of secondary habitats considerably differs from plant assemblages in floodplain forests (Chytrý et al. 2020). On the other hand, river corridors promote a rapid species migration (Holeštová and Douda 2022) and many plant species of the artificial habitats thus can successfully colonise various microhabitats developed in riparian areas (Havrdová et al. 2023). For example, the native plants Bellis perennis, Rubus fruticosus s. lat. or the alien species Bidens frondosa, Echinocystis lobata, Impatiens glandulifera, Solidago canadensis and S. gigantea are common species of secondary, man-made habitats and European floodplain forests (Douda et al. 2016; Skokanová et al. 2023), and were also included in our vegetation-plot dataset.

Catchment slope and SPI were important predictors for alien (PolPla) and native (all regions together) plant richness. Both characteristics are proxies for the erosive power of flowing water (Moore et al. 1991) and both are frequently mentioned as predictors of plant richness and composition in riparian forests (e.g. Parker and Bendix 1996; Slezák et al. 2017; Pielech 2021). Large catchment areas with shallow slopes in the continental part of Poland are associated with low erosion activity. This commonly indicates human-altered riparian areas and regulated river networks (e.g. Richardson et al. 2007; Perkins et al. 2016; Slezák et al. 2022), which facilitate the spreading of alien plants. On the contrary, steep slopes in mountainous landscapes compared to flat areas in lowlands, positively influence the richness of native species by creating new habitats (e.g. gravel bars, exposed soil bottoms) that are suitable for different, mainly native stress-tolerant and annual plants. Slopes in river segments also overlap with elevational (climatic) gradients (Supplemetary information S1), which provide additional explanation for richness patterns, for example in the case of alien species (Medvecká et al. 2018). Climate and soil reaction are generally recognised as fundamental ecological drivers shaping plant diversity in forests (Večeřa et al. 2019; Wagner et al. 2021), including forest ecosystems in riparian areas (Douda et al. 2016; Slezák et al. 2017). Plots characterized by low precipitation in the warmest quarter of the year were found to be richer in alien plant species, while plots with low temperatures had higher richness of native plant species. Nevertheless, this fact can be underlined by other environmental characteristics, the biogeography of the territory or by history of landscape utilization, which differ in HighWeCa and Polish Plain (e.g. Lieskovský et al. 2018). Soil reaction plays different roles in richness patterns depending on gradient length, geographical areas, habitat types and evolutionary history (Pärtel 2002; Chytrý et al. 2003). In our study, soil reaction values higher than 6.2 reduced the richness of native species in HighWeCa, while values higher than 7.2 decreased alien plant richness in Pannonia. The pH threshold was set to 7.0 for all regions evaluated together. For individual regions, the threshold values were slightly higher than their mean in the given region and observed richness data thus peaked at mid-values of the pH gradient. Apparently, a broader range of soil pH values recorded in HiWeCa (threshold ± min—max: 6.2 ± 4.3—7.2) and Pannonia (7.2 ± 4.7—7.8) favour abundant groups of plants, native and alien, respectively (see Table 1, Figs. 3a and 4c). It has been previously found that the relationship between species richness and productivity or soil reaction frequently showed an unimodal pattern (Mittelbach et al. 2001; Duprè et al. 2008), with a decreasing trend towards both ends of the gradient (i.e. acidic and highly productive sites). However, this non-linear response of diversity can be confounded by the role of other environmental drivers (Schuster and Diekmann 2003; Palpurina et al. 2017; Slezák et al. 2017). Soil reaction values in the margins of the gradient are associated with eco-physiological constraints that limit numerous vascular plants (see intermediate stress hypothesis, Grime 1979 or physiological tolerance hypothesis, Currie et al. 2004). Soil phosphorus, as an important nutrient with direct implications for plant growth, showed a diversity-reducing effect on alien species in the whole dataset. Biodiversity studies from temperate forests, including floodplain forests, suggest that herb richness decreased with increasing P values. Higher soil P concentrations promote the development and high abundance of only a few strong plant competitors in the local community, whose high biomass production and dominance superiority suppress the richness of co-occurring species (Dumortier et al. 2002; Hrivnák et al. 2015).

Our findings indicate that the predictability of plant richness in black alder-dominated forests is regionally-specific. Species richness was controlled by landscape characteristics, climate and soil-induced drivers, with better predictability found for the richness of native than alien plants. However, the observed environmental thresholds for species diversity patterns should be interpreted with the caution due to discrepancies in the number of vegetation plots corresponding to floodplain and swamp forests in each region. More robust vegetation and environmental datasets would therefore provide more complex insights into the role of predictors operating at local and regional scales. Another shortcoming arises from the geographical framework of our sampling, which was conducted only in the river networks of temperate regions. Arid or semi-arid areas with different water-energy dynamics would likely produce other important variables and thresholds responsible for the variation in species richness data. In spite of these limitations of our study, the present results would be useful tools for a conservation strategy that could guide landscape management in the vicinity of riparian and swamp forests.