Background

Although riverine ecosystems are among the most species-rich ecosystems in Central Europe [1,2,3,4], they have been degraded anthropogenically for centuries to provide flood protection or to gain land for agricultural use and human settlement [5]. In consequence, the biodiversity in and around water bodies declined massively [6,7,8,9]. In Germany, restoration measures have therefore been increasingly implemented for more than 20 years to improve habitat diversity and thus promote biodiversity [10, 11]. Particularly with the implementation of the European Water Framework Directive (EU-WFD), the number of restoration projects increased considerably [12, 13]. However, their ecological efficacy is often limited [14,15,16,17]. While some studies report an increase in species diversity [6, 18,19,20,21,22,23,24] others show little evidence of improvement even many years after implementation of restoration measures [8, 11, 25,26,27,28,29,30,31]. Sundermann et al. [32] examined data from 24 restoration projects that dated back between 1 and 12 years and were able to demonstrate that none of the restoration measures examined improved the benthic invertebrate community sufficiently to achieve a good ecological status. Palmer et al. [8] obtained similar findings in their analysis of 78 restoration projects; they were able to demonstrate a statistically significant increase in benthic invertebrate diversity in only 2 of 78 restoration projects. In contrast to these findings, other studies reported a significantly higher richness of floodplain vegetation in restored compared to degraded river stretches [33, 34]. Hering et al. [35] proved that not all organism groups benefit equally from hydromorphological in-stream restoration measures. In general, ecological efficacy of hydromorphological restoration measures (e.g., river widening) is highest for terrestrial and semiaquatic groups of organisms such as floodplain vegetation and ground beetles [15, 33, 34, 36]. Macrophytes responded most strongly to in-stream restoration measures [19, 20, 37,38,39], whereas fish populations and diversity were only slightly improved [19, 22, 37, 40]. Macroinvertebrates showed little or no effect in terms of species richness and diversity [8, 19, 23, 29, 33, 37, 38]. In addition, hydromorphological restoration measures increase the abundance of individual groups of organisms to a greater extent than species diversity [19, 21]. These findings lead to the question why structural restoration measures often resulted in a comparatively low ecological efficacy. In the following, the potential reasons for the lack of restoration success are shown based on a literature review and a case study on the river Horloff (central Germany) in order to derive recommendations for water management practice.

Methods

A literature search was conducted in Web of Science and Google Scholar using the following keywords and their combinations: restoration, renaturation, remediation, revitalization, restoration efficacy, ecological efficiency, ecological efficacy, restoration success, hydromorphological restoration, ecological status, water framework directive, recolonization, biodiversity, stressor, time scale, catchment scale, reach scale, local scale.

Both German and English references, including peer-reviewed papers and gray literature, were included in the review if they contained the mentioned keywords in title or body text. Subsequently, the collected literature was evaluated regarding stressors that affect the ecological efficacy of restoration measures. Stressors were sorted in terms of their effect at the time, catchment, reach, or local scale.

Results

Using the keywords listed in the “Methods” section, 155 studies were found and included in the present evaluation. We identified 20 overarching stressors in the literature review that affect the ecological efficacy of hydromorphological restoration measures and ranked them according to their impact at the catchment, reach, or local scale. Table 1 provides an overview of the stressors and their respective scales. In the following, the stressors and their effects on the ecology of river systems are described.

Table 1 Summary of stressors and the respective scales on which they operate

Time-scale stressor

The term “time-scale stressor” refers to the time since the implementation of a restoration measure. A meta-analysis within the REFORM project identified project age as the most important variable affecting restoration success [19]. Nevertheless, most projects examine restoration success within 1–16 years after the implementation of restoration measures, although it is known that this time period is too short for full community recovery [35, 41, 42]. For instance, Hasselquist et al. [41] used plant communities in restored Swedish rivers to demonstrate that at least 25 years must have elapsed since the implementation of restoration measures to record an increased species richness. For some of the biological quality elements (BQEs), recolonization of restored river stretches can take several years to decades, thus significantly longer than the 6-year management cycles foreseen by the EU-WFD [43,44,45,46,47,48]. Therefore, the time frame of the EU-WFD management cycles is not sufficient to achieve an improvement in the ecological status after a restoration measure.

Large-scale stressors (catchment scale)

Large-scale stressors are those that operate at the catchment scale, such as the input of chemicals and (fine-)sediments into streams.

A question of scale

According to Friberg et al. [15] the scale of the restoration measure should correspond to the scale of the problem, i.e., if water quality is insufficient due to inputs from wastewater treatment plants, these inputs should be reduced on the catchment scale. In other words, the ecological efficacy of restoration measures is impaired if large-scale stressors, e.g., as a result of intensive land use or insufficient water quality, are not adequately taken into account and reduced in parallel.

Massive input of (contaminated) fine sediments

One important factor affecting the ecological efficacy of hydromorphological restoration measures is an increased input of fine sediments following soil erosion, mostly originating from intensive agriculture [15, 49]. This non-intended soil displacement can lower habitat diversity by covering valuable substrates or clogging the hyporheic interstitial [15, 16, 49,50,51,52,53]. Sarriquet et al. [54] demonstrated that some species of Ephemeroptera and Plecoptera do not disperse in substrates with high proportions of fine sediments because they rely on the intact gap system of the hyporheic interstitial. According to Pander et al. [16], a fine sediment (grain size < 1 mm) fraction of 12% to 18% can be considered an upper threshold for favorable habitat conditions. Li et al. [6] also conducted a correlation analysis based on macroinvertebrate structure data collected in 2010, 2011, and 2014 and the respective substrate compositions and were able to show that the proportion of coarse substrate (grain size > 5 mm) correlates significantly positively with species richness. In addition, fine sediment inputs in combination with increased runoff or strong current velocities due to narrowed streambeds can lead to a permanent transport of sediments across the riverbed (sand drift), which intensifies the hydraulic stress for macroinvertebrates [55]. Furthermore, nutrients and pollutants are often associated with fine sediments due to their comparatively large surface area [7, 53, 56, 57]. For example, heavy rainfall events, changes in redox potential, or bioturbation processes can subsequently remobilize these sediment-associated nutrients and contaminants, so that chemical contamination in the water phase often increases with fine sediment input [7, 56]. For long-term enhancement of habitat conditions in streams impacted by fine sediment, Denic and Geist [50] recommend an integrative sediment management on the catchment scale, including land-use changes and extensification of agricultural land. Furthermore, riparian forests and retention areas should be reconnected to their water bodies, as they serve for flood protection and as sediment and nutrient traps [50].

Acidification and bioavailability of metals

Acidification in siliceous mountain streams (type 5 and 5.1 according to [58, 59]) may be particularly relevant if these streams are not buffered by saprobic loads such as wastewater discharges [60, 61]. In these streams, acidifying compounds from the soil and from the air (acid rain) enter the water bodies. This can lead to a drop in pH and a displacement of acid-sensitive taxa such as gastropods, bivalves, and gammarids in favor of acid-tolerant taxa such as plecopterans and trichopterans [60, 62]. In addition, toxic heavy metals are remobilized in the acidic environment and thus become bioavailable with negative impact on the local biocenosis [60]. In addition, the discharge of mining and industrial wastewater as well as from intensive agriculture may lead to an increased input of metals and salts into streams [55, 63, 64].

Increasing salinization

The increasing salinization of water bodies is also promoted by the use of deicing salts, point source discharges and stormwater runoff. Thus, the conductivity of river water is increasing in relation to the degree of urbanization [65]. Increased salinity or conductivity is an additional stressor for sensitive species and can lead to osmotic stress [65]. For example, species of Baetidae, Chironomidae, Gastropoda, Oligochaeta, Nematomorpha, Tricladida, and Hirudinea are especially sensitive to increased salinity [66]. As a result of long-term salinization of rivers, the invertebrate community may change in favor of more salt-tolerant taxa, thereby limiting the ecological efficacy of hydromorphological restoration measures [67,68,69].

Iron hydroxide deposition

Iron hydroxides can precipitate under aerobic conditions if concentrations of 2 mg iron-II per liter are exceeded [55]. Often, elevated iron levels occur due to groundwater lowering caused by agricultural drainage or by the entry of iron-containing groundwater into streams [55]. Iron hydroxide deposition can cause both direct (e.g., toxic effects, gill sticking) and indirect effects (e.g., elevated water turbidity, covering of key habitats) on the aquatic biocenosis. Rasmussen and Lindegaard [70] reported that an increase from 0.2 to 0.3 mg iron-II per liter decreased the number of invertebrate taxa by 21%. This decrease continued successively up to an iron-II concentration of 50 mg/l. Especially grazers which feed predominantly on biofilms as well as ephemeropterans and plecopterans proved to be sensitive [70]. This study clearly shows that elevated iron levels in rivers may change the typical species composition and thus impair the ecological efficacy of hydromorphological restoration measures.

Eutrophication by nitrogen release originating from organic compounds

Organic pollution no longer plays a major role in German rivers since almost 100% of the German population is connected to the public sewerage system, but local pollution still occurs in individual water bodies [71,72,73]. According to Bunzel et al. [71] WWTPs still represent an important source of oxygen-depleting organic contaminants despite extensive technological upgrades. As a result of increasing saprobic loads, the oxygen content in water bodies decreases. This directly affects the species composition of the aquatic community [71]. According to Hering et al. [73] hydromorphological restoration measures are completely ineffective in the presence of organic contaminants, since saprobic pollution overrides all positive factors influencing a restoration measure.

General chemical contamination

In addition to the above-mentioned organic pollution, chemical contamination by priority, river basin-specific and other non-regulated contaminants is also critical to the efficacy of restoration measures [15]. If the EQS derived for priority contaminants or river basin-specific pollutants is exceeded in a water body, this can have negative effects on the biocenosis. These are reflected in a changing water body type-specific species composition and thus impair the ecological efficacy of hydromorphological restoration measures [55]. Sommerhäuser and Hurck [74] postulate that good water quality is a prerequisite for the successful recolonization of restored river stretches. A number of other studies also attribute overriding importance to water quality in the process of recolonization [8, 25, 32, 37, 44, 75, 76]. In the presence of chemical contamination, hydromorphological restoration will not improve the ecological status [75]. In fact, Sundermann et al. [25] demonstrated that only three of 25 studied restoration projects achieved good ecological status. As a result, they concluded that poor water quality prevented recolonization and hence the achievement of good ecological status. Thus, recolonization of restored river stretches appears unlikely as long as stressors are present that impair recolonization [25]. In particular, these include water pollution and inadequate sediment quality [77]. Furthermore, Wagner and Arle [78], who evaluated extensive data sets on the ecological status of macroinvertebrates and fish on water body structure and chemical contamination found that chemical contamination prevents potential positive effects of valuable river structures. The authors explain that structurally valuable river stretches have a higher self-purification capacity and are therefore able to counteract chemical contamination more strongly than structurally degraded river sections. However, the partial compensation of chemical contamination by well-structured river sections is only possible in low-polluted river sections. Consequently, the good ecological status can only be achieved by restoration and revitalization measures if the water body already exhibits a chemically good condition [78].

Although these large-scale stressors operate at the catchment scale, they differ in terms of their impact on the ecological efficacy of restoration measures. Inputs of nutrients, pollutants and fine sediments have a stronger influence on biodiversity [79] and thus the ecological efficacy of restoration measures than, for example, the iron hydroxide content, and hence should be prioritized in restoration measures.

Medium-scale stressors (reach scale)

Medium-scale stressors are those that act at the reach scale, such as the absence of source populations, which are essential for a successful recolonization of restored river stretches.

Altered flow regime, bedload dynamics and depth variance

In particular, the flow regime, the bedload dynamics and the depth variance should be specific of water body type. These parameters shape the formation of habitats and the structure of the riverbed and can thus significantly promote or impair the efficacy of hydromorphological restoration measures [15, 55]. Water discharge has been proven to cause hydraulic stress (e.g., increased bed shear stress), which in particular leads to bed and bank erosion [80]. In addition, bedload deficits caused by river engineering and cross-bank structures also promote bed erosion and channel deepening. As a result of hydraulic stress and bedload deficits, catastrophic drift of macroinvertebrates is often triggered and the formation of specific habitats is prevented [55, 80]. Accordingly, abundance and number of macroinvertebrate taxa are often reduced in affected streams [80]. In addition, massive water withdrawals, e.g., for agricultural irrigation, can also lead to an increase in water temperature in the residual water volume and to the drought of water bodies. Aquatic species that depend on a permanent, low-temperature and oxygen-rich flow of water may become extinct locally due to water shortages or be displaced by more tolerant species. Consequently, the biocenosis specific to the type of water body changes.

Weak self-dynamic development

According to Kail and Wolter [81], morphological restoration measures, i.e., the introduction of substrates and structural elements, are implemented in a majority of water bodies, although the support of self-dynamic development and the initiation of morphodynamic processes are ecologically more effective (process-oriented restoration instead of local-scale interventions) [15, 82,83,84,85]. Sommerhäuser and Hurck [74] report that restored river stretches must first develop their own dynamics over a longer period of time before a biocenosis can form and establish itself.

Low lateral connectivity

The floodplain adjacent to the water body should ideally be type-specific and structurally valuable [83]. By interrupting the watercourse–floodplain connectivity, e.g., through embankment of watercourses, floodplains can no longer serve as water retention areas as well as habitats for a variety of organisms [86,87,88,89]. However, many aquatic and semiaquatic organisms, including macroinvertebrates and fish, require both habitats to develop [55]. In addition, as previously addressed, intact floodplains and riparian corridors can facilitate the recolonization of restored river reaches [90].

Absence of riparian strips

An important aspect of restoring lateral connectivity is the establishment of riparian strips. These support self-dynamic development processes of water bodies, the succession of riparian forests and flood protection, provide habitats for semiaquatic and terrestrial organisms, contribute to shading as well as to the input of woody debris and fallen leaves, and reduce the input of pollutants from intensive agriculture as well as the sediment input from surface runoff and bank erosion [55, 83, 91]. Larson et al. [92] showed in a before–after comparison that clearing all riparian vegetation along a 5 km stretch of a river in an intensively farmed environment massively increased sediment and nutrient inputs. In particular, the mean nitrate concentration increased 10- to 100-fold in the stream after the riparian strip was cleared. In addition, pulse loadings with high nitrate levels were still detectable in the stream 3 years after logging. Lind et al. [91] reported that a 3 m wide riparian strips acts as a basic nutrient and sediment filter. However, to trap more than 75% of nutrient and sediment inputs, 9 to 11 m wide riparian strips would be necessary [91]. To maintain fish and insect diversity in intensively used agricultural areas, riparian strips with of around 25 m width are required, and to maintain bird biodiversity, riparian strips as wide as 144 m are required [91]. The width of riparian strips required to maintain biodiversity increases with increasing slope of the bank [91].

Intensive river maintenance measures

In addition, intensive river maintenance also affects riparian vegetation and thus impairs the ecological efficacy of restoration measures [55, 93, 94]. River engineering measures can create comprehensive maintenance requirements, since the engineering and the input of nutrients lead to a strong plant growth, especially of macrophytes and algae [55, 93]. As a result of eutrophication and mass development of plants, flow velocity is reduced and the water body is increasingly dammed. This results in a decrease of rheophilic species and thus in a direct change in the biocoenosis of the water body [55]. The weedage of water bodies is therefore counteracted by regular weeding of the riverbed in order to support the ongoing flow [55, 93, 94]. In addition to bed weeding, regular river maintenance includes mowing of river beds, embankments, and dikes, clearance profile works, and desilting of river beds [55, 93]. Regular mowing or grazing of riparian areas can also negatively affect water quality, increase sediment input from soil erosion and surface runoff, and lead to warmer riparian conditions [91, 92, 95]. From an ecological perspective, maintenance measures therefore predominantly imply interventions in aquatic and semiaquatic habitats, which are associated with habitat damage and removal of food sources [55, 93, 94]. In addition, the self-dynamic development of water bodies is massively restricted by maintenance measures [83]. As a consequence, the formation of a water body-typical biocenosis is prevented and the ecological efficacy of the respective restoration measure is impaired [93, 94]. Nevertheless, it is not beneficial to expose a watercourse exclusively to its own dynamic development and completely dispense river maintenance measures, as habitats can be destroyed by massive weed encroachment. For this reason, the DWA (German Association for Water, Economy, Wastewater and Waste) [55] recommends a demand-oriented river maintenance, e.g., in form of alternate mowing of banks.

Inappropriate shading and temperature regime

Sufficiently wide riparian strips and intact floodplains also contribute to shading and thus to a more constant temperature regime of water bodies [55, 96,97,98]. Compared to unshaded waterbodies, shading of small and medium-sized waterbodies leads to cooler summer temperatures and thus to higher oxygen levels [99]. This is important for the permanent establishment of particularly sensitive, cold-stenothermic species and thus has a massive influence on the species composition of the biocenosis [55, 99, 100]. For example, Dohet et al. [99] compared the macroinvertebrate composition of three headwaters that differed significantly in terms of forest cover, i.e., shading, and anthropogenic influence. They found that all sensitive stenotic taxa (e.g., preference for headwater streams and low water temperatures) occurred exclusively in the forested, anthropogenically unaffected stream. Therefore, Lind et al. [91] recommend riparian strips with an average width of 21 m in small and medium-sized streams to provide stable water temperatures through shading. In addition to insufficient shading, an altered temperature regime of water bodies can also result from the discharge of waste heat from power plants or cold deep water from dams, leading to a shift in the typical biocenosis of water bodies. Corresponding negative effects are additionally intensified by climate change. As a consequence, the efficacy of implemented structural measures is impaired [55]. For example, Niedrist and Füreder [100] demonstrated a shift in the macroinvertebrate composition due to the immigration of heat-tolerant taxa and a decrease in cold-tolerant taxa in alpine waters in the wake of rising water temperatures due to climate warming. The warmer water temperatures favor in particular competitive, thermophilic neobiota [101, 102]. This may inhibit or even completely prevent the recolonization of restored river stretches by native taxa. Some studies even predict that global warming will cause the extinction of cold-stenothermic species as well as crenal and other species typically present in the source area, because they cannot find a refugial space that fits their thermal requirements [103, 104]. As a result, eurythermal species will increasingly spread upstream to the headwaters and occupy the ecological niches that have been vacated [105, 106]. Furthermore, long-term studies have already demonstrated a decrease in species diversity as a result of climate warming [107, 108].

Insufficient longitudinal continuity

Another factor impairing the efficacy of restoration measures is the disruption of longitudinal connectivity [15, 25, 84]. The lack of stream continuity for fish and hololimnic invertebrates, for example as a result of the construction of migration barriers, has a direct impact on the migration of aquatic organisms, the species composition of the biocenosis, and the recolonization of restored river stretches [15, 109,110,111]. For example, Ramírez et al. [112] did not detect native fish species in streams of Turabo (Puerto Rico, USA) due to a lack of continuity and migration routes. To avoid compromising the efficacy of restoration efforts by disrupting longitudinal connectivity, migration barriers should be made passable to macroinvertebrates and fish, for example by creating fish ladders or bypass channels [113]. In addition to migration barriers, impoundments caused by transverse structures such as weirs also affect the composition of the macroinvertebrate community and thus the recolonization of restored river stretches [55, 65, 69, 114]. The altered flow and depth conditions, as well as the lower oxygen levels as a result of decreasing flow velocities and increasing water temperatures, change the local species composition in favor of stagnant water species [55, 114].

Absence of source populations

The absence of source populations can significantly decrease the efficacy of hydromorphological restoration measures, as organisms cannot migrate unimpeded from the recolonization sources into the restored river sections [55, 74, 115, 116]. Lake et al. [110] and Friberg et al. [15] postulate that a diverse regional species pool is a prerequisite for the successful recolonization of restored river sections. Furthermore, Sundermann et al. [32] explain that the likelihood of species establishment in restored river stretches increases with the species richness of the immediate surrounding. Conversely, source populations that are too far away can also negatively impact the ecological efficacy of restoration measures [32, 117, 118]. Furthermore, Sundermann et al. [32] demonstrated in a meta-analysis that restoration measures did not improve the species composition in river stretches that were not surrounded by watercourses with a high species inventory. If the distance between the restored river stretch and the colonization source is more than 5 km, recolonization is considered unlikely [32].

Biotic interactions

Biotic interactions such as predation or competition for resources have been proven to shape freshwater communities, but have rarely been considered in the planning of restoration measures, so far. Neobiota, in particular, are generalists characterized by low environmental requirements and wide tolerance to factors such as physicochemical parameters, and thus can colonize restored river stretches particularly quickly compared to more specialized taxa [119, 120]. The ecological niches occupied by neobiota or invasive species are then no longer available to native taxa, so that the composition of the aquatic biocoenosis changes and the ecological efficiency of restoration measures is impaired.

White et al. [121] demonstrated that communities in degraded river stretches are often dominated by organisms that had shell or case protections (e.g., snails), making them less vulnerable to predation in comparison to unprotected organisms (e.g., mayflies). Both lethal and sublethal mechanisms between predators and their prey can impair or even completely prevent colonist establishment and thus have an impact on the ecological efficacy of restoration measures. In addition, as strong competitors, protected prey often affect the growth of co-habitants [122]. This has also a direct impact on the species assemblage of the biocoenosis and thus the ecological status according to EU-WFD.

Small-scale stressors (local scale)

Small-scale stressors are those that operate at the local scale, such as the substrate composition at a particular river site.

Inappropriate design of restoration measures

Low ecological efficacy of restoration measures may be due to an inappropriate restoration design [30, 37]. Friberg et al. [15] conducted a meta-analysis of peer-reviewed studies and concluded that channel widening had a significantly higher effect on macrophytes than on fish or macroinvertebrates. Especially, macroinvertebrates did not profit from channel widening, probably because substrate diversity was not increased with the restoration measure [123]. Moreover, Lepori et al. [30] demonstrated that structural improvements may have only minor positive effects on fish and macroinvertebrate biodiversity if the restoration measure solely improves structures that are irrelevant for the target organisms, i.e., does not create new habitats. Thus, improvement of small-scale structures (e.g., substrate composition, presence of key habitats) is required to generate microhabitats and increase macroinvertebrate diversity [30, 37, 124, 125]. For example, Miller et al. [21] showed that diversity of macroinvertebrates was increased by typical in-stream measures such as the implementation of woody debris and the establishment of boulders. These small-scale structures are often inadequately addressed by restoration measures, which frequently focus on the demands of fish species. This is why fish diversity increases in many studies, but species richness of macroinvertebrates does not [30, 37]. Moreover, it is quite obvious that the restoration of a short river section creates less habitat than the restoration of longer stretches [23, 29, 30, 55, 126,127,128]. Therefore, a restored section should always be of an appropriate length.

Insufficient substrate diversity

The absence of key substrates reduces habitat diversity and thus hampers ecological efficacy of hydromorphological restoration measures [15, 129]. The probability of recolonization by typical species increases with increasing habitat diversity [130]. For example, Miller et al. [21] used a meta-analysis of 24 restoration projects to show that macroinvertebrate diversity generally increases with rising habitat diversity. Hering et al. [35] were also able to demonstrate that the species diversity of fish, macrophytes and aquatic invertebrates increases with substrate diversity. In particular, woody debris appears to be an important structural element that contributes significantly to habitat and flow diversity [82, 83, 129]. In order to provide a natural input of woody debris, riparian strips should be established [55] as they help residual populations of species specific to the type of water body to find refuge, from which recolonization of restored river stretches can proceed [90].

Structure of watercourse

According to the DWA [55] the structure of the watercourse must correspond at least to structure class 3 (moderately modified) so that type-specific biocenoses can form and establish. This requires that no other stressors (e.g., chemical contamination) have a local impact on the biocenosis. In particular, the structure of the riverbed and the riparian areas play a decisive role for the biological quality elements (BQEs) fish, macrozoobenthos and macrophytes [78].

Current barriers in the implementation of restoration measures

In addition to the aforementioned biotic and abiotic stressors that impede a successful recolonization of restored river stretches, there are several other deficiencies and hurdles that must be overcome before restoration measures can lead to improvements in the local species diversity.

For example, a common problem in implementing measures is that they are often conducted solely on short, easily accessible or available river stretches. As a result, the holistic view on the water body is missed, not all relevant stressors are identified and appropriately addressed by the restoration measure, which reduces the ecological efficiency and the expected positive effects on biodiversity. Therefore, either combinations of measures that include both structural improvements and the reduction of chemical contamination should be implemented instead of local single measures, or the water bodies should be increasingly managed and restored in a more comprehensive way from source to mouth, if necessary transnationally, in so-called river basin communities [131].

Furthermore, insufficient human and financial resources, competing land uses, and excessive planning, among other factors, are implementation deficits that often result in ecologically ineffective restoration measures [126]. Due to the lack of human and financial resources, measures can be planned and implemented less extensively. For example, the European Commission [43] criticizes that measures proposed by member states are often based solely on what can be achieved with existing budgets. However, the planning and implementation of measures should be based primarily on the present deficits and financing should only be considered secondarily (cost–benefit analysis).

Moreover, the insufficient availability of land complicates and delays the implementation of restoration and revitalization measures nationwide [47] and is therefore considered a “challenge in the implementation of the WFD” [132]. Since sufficient land availability is a prerequisite for the implementation of hydromorphological restoration measures as well as the self-dynamic development of water bodies [47], land adjacent to water bodies must increasingly be made available for water protection through compensation (e.g., land swaps) or purchase.

In addition, the time frame set by the EU-WFD for the implementation of measures and the evaluation of their ecological success appears to be insufficient. For example, measures comprising land use change and/or extensification require long time periods for the implementation due to lengthy planning and approval procedures [44, 131]. Therefore, additional river basin management plans are definitely required beyond 2027 [133].

Case study: restoration at the river Horloff

In the following, we use the river Horloff as an example to show how multiple factors can affect hydromorphological renaturation success, ultimately resulting in the desired improved ecological status not being achieved.

In the following, we use the river Horloff as an example to show how multiple factors can affect hydromorphological restoration measures so that the desired improved ecological status is not achieved.

Characterization of the river Horloff

The river Horloff with its length of 44.5 km is part of the river Nidda catchment area in Hesse (Germany). It rises at an altitude of 524 m above sea level 65 km northeaster of Frankfurt am Main and flows into the Nidda at an altitude of 119 m above sea level near the town Ober-Florstadt. The Horloff catchment extends over 279 km2 and consists of two river types. In its upper reaches the Horloff flows through forests and permanent grassland in the low mountain range Vogelsberg and is assigned to river type 5 as a small siliceous low mountain stream rich in coarse material (Fig. 1a). The upper Horloff has a moderate ecological status (ecological status class 3, [134]). Further downstream, it passes through arable and grassland areas, which often come close to the water’s edge (Fig. 1b). The Wetterau region with its fertile loess soil, through which the lower Horloff flows, is heavily influenced by agriculture. Here the Horloff is a mid-sized siliceous upland river rich in fine-to-coarse material (river type 9). The lower reaches only achieve a bad ecological status (ecological status class 5, [135]).

Fig. 1
figure 1

Characteristic water body structures of the upper (a: sampling site 2) and lower reaches (b: sampling site 23) of the river Horloff (photos: courtesy of Simone Ziebart)

In the past, the course of the Horloff was repeatedly altered, diverted, and straightened over long stretches by anthropogenic interventions, resulting in 63.5% of the upper Horloff and 98.8% of the lower Horloff being assigned a structurally deficient state [134, 135]. Especially the lower Horloff flows as a stretched channel with a trapezoidal cross-section (Fig. 1b). Additionally, the longitudinal connectivity is reduced by artificial instream barriers such as small dams and weirs. Seven of the weirs are impassable for fish and benthic invertebrates (see Fig. 2).

Fig. 2
figure 2

Catchment area of the river Horloff with main tributaries, location of sampling sites, wastewater treatment plants, seven large weirs that act as artificial instream barriers and the analyzed hydromorphological restoration section

Figure 2 also shows the location of the wastewater treatment plants (WWTPs) which can act as further stress factors by discharging treated wastewater into the Horloff or its tributaries (geographical coordinates provided in Table 2). There are four small WWTPs in the upper reaches of the Horloff: Einartshausen with 850 population equivalents (PE), Gonterskirchen (990 PE), Friedrichshütte (100 PE) and Ruppertsburg (990 PE). Further downstream in the lower Horloff follow the large WWTP Hungen-Utphe (78,000 PE) and the WWTP Wölfersheim (6000 PE).

Table 2 Geographical coordinates and characterization of sampling sites and wastewater treatment plants

To improve the poor ecological condition of the lower Horloff, a hydromorphological restoration was realized in two consecutive phases in the years 2002/2003 and 2006/2007 near Echzell, a small village about 35 km north of Frankfurt am Main. The restoration measure comprised a restored river stretch of 1.6 km (Fig. 2) and is characterized by a low depth of intervention (for details: [76]). Even 20 years after the restoration measure was implemented the ecological status in this section is still attributed to status class 5 (bad). This assignment to status class 5 is based on the result for the BQE with the worst evaluation result according to the one out all out principle of the EU-WFD. In the present case, this is the BQE macrozoobenthos, which was consistently rated 5 (bad status) in 2006, 2014, 2017 and 2019. In contrast, the result for the BQE fish community improved from 5 (bad status) to 3 (moderate status) in the same period.

Study sites and sampling

A field and monitoring study with 25 sampling sites along the Horloff was performed between March 2017 and April 2018 to give insights into the factors that determined the poor success of the above-mentioned restoration measure. The sampling sites are indicated in Fig. 2 and characterized in Table 2. In a space-for-time-substitution approach [37], site 16 served as an unrestored reference site upstream of sites 17 and 18 within the hydromorphological restoration section and for the transect sites 19 to 25 downstream of the restoration.

Sediment structure and fine sediment input

To determine the particle size distribution of sediments and the thickness of the fine sediment layer, samples were taken at 23 of the 25 sampling sites in March and April 2018 with a penetrometer (STEP Systems GmbH, Nuremberg, Germany). Only at sampling sites 15 and 23 no samples were collected. Particle size distribution of sediments was determined by sieving 100 g of dried sediment samples with a vibration sieve machine (Retsch AS200 basic, Retsch GmbH, Haan, Germany). The sediments were separated into the appropriate fractions using sieves with mesh sizes of 4 mm, 2 mm, 1 mm, 630 μm, 500 μm, 250 μm, 125 μm, 63 μm and 20 μm and the weight of the fractions was determined. The mean grain size was calculated and classified according to [136]. The thickness of the fine sediment layer was determined by pushing the probe rod of the measuring instrument vertically into the sediment until the resistance reached 200 psi (= 13.8 bar). The penetration depth was measured at four points of the corresponding sampling site to account for local variations and the mean value was calculated (see [137] for details).

The results for both parameters, mean grain size and fine sediment depth, are shown in Fig. 3. In the upper Horloff (sites 1–12) only 6 of the 12 analyzed sediments are characterized as coarse sand and none as an even coarser substrate, such as gravel or stones. At two sites of the upper Horloff (8 and 12), the mean grain size is so small that the substrate is characterized as silt (< 63 µm). The thickest fine sediment layer of 35 and 80 cm in the upper Horloff was determined at the respective two sites. In the lower Horloff (sites 13–25) the situation is even worse, as all sediments except no. 14 (medium sand) fall into the categories fine sand (63–200 µm) and silt (< 63 µm), with the fine sediment layer reaching a depth of up to 111 cm at site 14.

Fig. 3
figure 3

Mean grain size and classification of sediments (a) according to [136] and mean fine sediment depth (b) in the river Horloff. Dashed arrows mark the outflow of the last of the four small wastewater treatment plants (WWTPs) downstream of site 4 in the upper reach of the Horloff and of the WWTP Wölfersheim downstream of site 19. The solid arrow marks the outflow of the large WWTP Hungen-Utphe downstream of site 14. Hatched bar: reference site 16 in the space-for-time-substitution approach to account for differences to sites 17 and 18 within the restoration section (dark green bars) and to transect sites (light green bars) downstream of the restoration section

Our findings contrast with expectations for the two river types the Horloff is assigned to. The upper Horloff (river type 5) should have a predominantly coarse-grained and less fine-grained substrate overall according to [138] as prerequisite for the good ecological status, while the lower Horloff (river type 9) should have a predominantly coarse-grained substrate with fine-grained sediment only in areas with calmed currents. The clear deviation from the expected substrate composition and the massive fine sediment load impairs the development of a river type-specific biocenoses, which means that the ecological efficacy of the hydromorphological restoration measure is already prevented by these factors.

The sources of the massive fine sediment input into the Horloff were analyzed by [139]. The investigation quantified the input via the city drainage with 419 tons per year with a comparably large share from storm water overflow discharges and wastewater treatment plant effluents with 234 and 185 t/a, respectively. In addition, rainwater channels provide a further input of less than 65 t/a. Although this input may appear relevant at first glance, it is almost negligible compared to the erosive fine sediment input from the agricultural areas in the Horloff catchment. Based on the crop species cultivated in the catchment area per field in 2018, the authors quantified the input at 27,100 t/a. In a “worst-case” scenario with widespread maize cultivation, the erosive fine sediment input from arable land into the Horloff even increases to 70,000 t/a. The authors conclude that the fine sediment input into the Horloff makes it difficult, if not impossible, to achieve the good ecological status for the surface waters. To reduce the input of fine sediment, they propose a complex catalogue of measures which, in addition to adapted management practices on agricultural land, include actions to retain fine sediments outside of the river system and the revitalization of the floodplain area with the designation of additional floodplains.

However, the success of restoration measures at the Horloff is not only endangered by the fine sediment input with the resulting fine-grained sediment structure, but also by the pollutant input, which is reflected in both the water (Fig. 4) and the sediments (Fig. 5).

Fig. 4
figure 4

Mixture toxicity based on the sum of toxic units (TU) in Daphnia spec. for 57 analyzed chemicals (a, data from [140]) and mean and standard error of the mean of EC50 values for baseline toxicity, determined with the Microtox assay (b), and of dioxin-like activity, determined with the yeast dioxin screen (c), in water samples from the river Horloff. Dashed arrows mark the outflow of the last of the four small wastewater treatment plants (WWTPs) downstream of site 4 in the upper reach of the Horloff and of the WWTP Wölfersheim downstream of site 19. The solid arrow marks the outflow of the large WWTP Hungen-Utphe downstream of site 14. Hatched bar: reference site 16 in the space-for-time-substitution approach to account for differences to sites 17 and 18 within the restoration section (dark green bars) and to transect sites (light green bars) downstream of the restoration section

Fig. 5
figure 5

Mean and standard error of the mean of EC50 values for baseline toxicity (a) and dioxin-like activity (b) in sediment samples from the river Horloff. Mean and standard deviation of worm numbers in the test with the annelid Lumbriculus variegatus following a 28-days exposure to Horloff sediments (c). Dashed arrows mark the outflow of the last of the four small wastewater treatment plants (WWTPs) downstream of site 4 in the upper reach of the Horloff and of the WWTP Wölfersheim downstream of site 19. The solid arrow marks the outflow of the large WWTP Hungen-Utphe downstream of site 14. Hatched bar: reference site 16 in the space-for-time-substitution approach to account for differences to sites 17 and 18 within the restoration section (dark green bars) and to transect sites (light green bars) downstream of the restoration section. Asterisks indicate significant differences to reference site 16 (one-way ANOVA with Dunnett’s multiple comparison test; ⋆⋆: p < 0.01; ⋆⋆⋆: p < 0.001)

Pollution and biological effects

Water phase

Ref. [140] collected water grab samples between July 2015 and July 2016 in eight sampling campaigns at 13 sites of the river Horloff which are identical with sites 3–5, 14–21, 23 and 25 in our study. These samples were analyzed by liquid chromatography coupled to tandem mass spectrometry (LC–MS/MS) for 161 medium polar emerging pollutants (54 pharmaceuticals, 18 biocides and pesticides, 10 industrial compounds, 4 X-ray contrast media, 3 artificial sweeteners, 2 corrosion inhibitors, 1 repellent, 1 aversive agent and 1 stimulant), including 67 transformation products. As an integrative measure for the potential toxicity caused by these compounds, the authors calculated in a first step toxic units (TUs, dimensionless) for those 57 of the 161 analyzed compounds with available toxicity data by dividing the arithmetic mean concentration from all sampling campaigns at a given site for the respective compound by its 50% effect concentration (EC50) for immobilization in acute tests with daphnids. According to the concentration addition concept of [141] the authors calculated in the second step the potential mixture toxicity as the sum of the individual TUs at a given station (see [140] for details).

The increase in the calculated values for mixture toxicity in the longitudinal course of the Horloff correlates clearly with the inflow of treated wastewater from the WWTPs (Fig. 4a). In the upper reaches of the Horloff, the sum of TUs increased by factor 120 from 0.006 at site 4 to 0.72 at site 5 downstream of the four small WWTPs. After a decrease to 0.02 at site 14 upstream of the large WWTP Hungen-Utphe, this value increased again by factor 80 to 1.60 at site 15 downstream of the WWTP. In the further course of the Horloff, the sum of TUs varied between 1.62 and 2.58. There is little indication for major changes at the restoration sites 17 and 18 compared to the reference site 16 or for a further increase at site 20 downstream of the WWTP Wölfersheim.

The pattern observed for the potential mixture toxicity in the water phase based on the analyzed chemicals is only partially mirrored by the results for two effect-based in vitro bioassays, the Microtox assay (Fig. 4b) and the yeast dioxin assay (YDS, Fig. 4c). Both assays were performed with solid-phase extracted water samples from 3 and 4 sampling campaigns for the YDS and Microtox assay, respectively, according to [142] and [143]. The Microtox assay represents the baseline toxicity of the water samples and measures the luminescence inhibition in the bacterium Aliivibrio fischeri. The results are expressed as EC50 values, referring to the relative enrichment factor (REF) of the respective water sample. An EC50 threshold of 750 REF was set for water samples that reached less than 20% luminescence inhibition and were therefore non-toxic. The measured activities in the YDS were expressed as equivalent concentrations of β-naphthoflavone (β-NF-EQ) and have been corrected for dilution and enrichment so that equivalent concentrations in Fig. 4c refer to native water samples.

The highest baseline toxicity in the water samples was found at site 18 within the restoration section with the lowest EC50 value of 76.6 ± 24.8 related to the REF (Fig. 4b). Although this value does not differ significantly from the reference site 16 (EC50 = 233 ± 79.4 REF), the increase of toxicity by factor 3 is striking. In contrast to the results for the potential mixture toxicity, there was only a slight and statistically insignificant increase in the baseline toxicity at site 15 downstream of the WWTP Hungen-Utphe (EC50 = 164 ± 18.1 REF) and a marked and statistically significant decrease at site 20 downstream of the WWTP Wölfersheim (EC50 = 575 ± 175 REF). The YDS also shows no significant changes caused by the WWTPs. Overall, the dioxin-like activity in the lower Horloff is relatively low compared to other rivers in the Nidda catchment area [144]. The different results for the potential mixture toxicity on the one hand and for the Microtox assay and the YDS on the other hand may be due to the fact that the potential mixture toxicity was determined on the basis of measured concentrations of substances with a very specific mode of action such as pharmaceuticals, biocides and pesticides, but for which neither a high baseline toxicity nor a dioxin-like effect is to be expected.

Sediments

A consistent pattern of increasing in vitro and in vivo toxicity along the course of the river Horloff is achieved if sediments are analyzed (Fig. 5). For the Microtox assay and the YDS sediments from up to seven sampling campaigns were analyzed. Sediments were collected from the upper layer (2 cm), freeze-dried (Martin Christ Gefriertrocknungsanlagen GmbH, Alpha 1–4 LSC plus, Osterode, Germany) and extracted according to [76]. The sediment extracts were analyzed in the same way as described for the solid-phase extracted water samples. An EC50 threshold of 30 mg sediment-equivalents (SEQ) was defined for non-toxic sediment samples in the Microtox assay, i.e., samples that reached less than 20% inhibition of luminescence. Aliquots of surface sediments from sites 2, 7, 10, 14, 18, 19, and 25 sampled in April 2018 were used for a 28-days sediment test with the blackworm Lumbriculus variegatus (Annelida: Oligochaeta) according to OECD guideline 225 [145]. 250 mL glass beakers (6 cm diameter) were filled with 2 cm of sediment and 200 mL river water from the respective sites. On termination of the test on day 28, the reproduction of the blackworms, expressed by the number of individuals per test vessel, and the mean body weight per individual were assessed.

The lowest baseline toxicity of sediments was found in the upper reaches of the Horloff at sites 1 to 3 with EC50 values between 8.49 and 12.4 mg SEQ (Fig. 5a). At site 4, downstream of the WWTPs Einartshausen, Gonterskirchen and Friedrichshütte, the toxicity increased significantly to an EC50 of 4.02 mg SEQ and remained relatively constant over the further course of the Horloff with EC50 values ranging from 2.13 mg SEQ at site 15 to 4.89 mg SEQ at site 18. The main reason why the effluents of the WWTPs Ruppertsburg, Hungen-Utphe and Wölfersheim only had a minor impact on the EC50 values in the sediments was that the baseline toxicity at sites 4 to 25 was already very high with values not exceeding 5 mg SEQ [146].

The effluent of the WWTP Hungen-Utphe only caused a slight increase of dioxin-like activity in the sediments from 20.3 at site 14 to 24.4 µg β-NF-EQ/kg at site 15 (Fig. 5b). In the further course of the Horloff, the dioxin-like activity increased to 39.5 µg β-NF-EQ/kg at site 21 and then fell again to 13.9 µg β-NF-EQ/kg at site 25 just before the confluence with the river Nidda. The inflow from the WWTP Wölfersheim downstream of site 19 did not alter the dioxin-like activity of the sediments. The limited influence of the two WWTPs in the lower Horloff is not surprising, since the dioxin-like activity in the water phase and in sediments of rivers primarily reflects the content of polycyclic aromatic hydrocarbons (PAHs), which enter water bodies primarily via atmospheric deposition and surface runoff (e.g., from motorways) rather than WWTP effluents [147, 148]. Accordingly, the increase in dioxin-like activity at site 19 is likely to be caused by the surface runoff from the A45 motorway bridge and another road runoff further upstream.

The increasing sediment toxicity along the course of the Horloff can be demonstrated not only in the Microtox assay and YDS, but also in the in vivo testing of the sediments with the blackworm L. variegatus (Fig. 5c). While a mean of 39 worms was found at site 2 in the upper reaches of the Horloff after 28 days of exposure to the native sediment, this number gradually decreased to 25.8 at site 25 further downstream.

Active monitoring of biological effects

To investigate potential pollutive effects on the invertebrate community, we conducted an active monitoring study with a crustacean (Gammarus fossarum) and a gastropod (Potamopyrgus antipodarum) species in 2017. G. fossarum is an indicator species of a low-mountain range biocenosis [149]. P. antipodarum is a euryoecious gastropod species which typically inhabits running waters from small creeks to streams, lakes, and estuaries [150]. Individuals of G. fossarum and P. antipodarum were exposed in cages for 28 days in restored and degraded river stretches. After the exposure period, animals were analyzed with respect to mortality and reproduction, which was assessed through the fecundity index (ratio of number of eggs and body length of the female) in G. fossarum and the number of embryos in the brood pouch in P. antipodarum [76, 151].

Mortality of G. fossarum was low in the upper reach of the Horloff at site 3 (19.1%) but gradually increased to 41.7% at site 14 and even 63.3% at site 21 (Fig. 6a). Lower mortalities were only found at sites 19 (11.7%) and 23 (16.7%) while in the restored river stretch (site 18: 26.7%) the mortality was not significantly different from the upstream reference site 16 (36.7%). Overall, the high mortality of an indicator species for low-mountain range rivers within an exposure period of only 28 days is a strong indicator for the unsuitable water quality of the Horloff after the inflow of the first WWTP effluents upstream of site 5. In contrast to the mortality, the fecundity index of G. fossarum was not affected and showed only little variation over the course of the Horloff with values comparable to other monitoring studies [144, 152].

Fig. 6
figure 6

Gammarus fossarum. Mean and standard error of the mean of the percentage mortality (a) and mean and standard deviation of the fecundity index (b) after 28 days of exposure in the active biomonitoring campaigns at the river Horloff. Dashed arrows mark the outflow of the last of the four small wastewater treatment plants (WWTPs) upstream of site 5 in the upper reach of the Horloff and of the WWTP Wölfersheim downstream of site 19. The solid arrow marks the outflow of the large WWTP Hungen-Utphe downstream of site 14. Hatched bar: reference site 16 in the space-for-time-substitution approach to account for differences to the site 18 within the restoration section (dark green bar) and to transect sites (light green bars) downstream of the restoration section. Asterisks indicate significant differences to reference site 16 (Fisher’s exact test; ⋆: p < 0.05; ⋆⋆: p < 0.01)

An inverse pattern for mortality and reproduction is seen in P. antipodarum compared to G. fossarum. While mortality of the snail species was generally low over the course of the Horloff ranging between 4.0% at site 15 and 19.7% at the unrestored reference site 16, it increased significantly to 53.3% at site 18 in the restored river section (Fig. 7a). In contrast to G. fossarum, the reproduction was high at sites 3 and 5 with mean values of 12.5 and 17.6 embryos, respectively, but dropped to mean embryo numbers between 3.48 (site 18) and 6.44 (site 15) further downstream. Again, site 18 with the lowest number of embryos was conspicuous, but the difference from the reference site 16 was not statistically significant.

Fig. 7
figure 7

Potamopyrgus antipodarum. Mean and standard error of the mean of the percentage mortality (a) and mean and standard deviation of the embryo number in the brood pouch (b) after 28 days of exposure in the active biomonitoring campaigns at the river Horloff. Dashed arrows mark the outflow of the last of the four small wastewater treatment plants (WWTPs) upstream of site 5 in the upper reach of the Horloff and of the WWTP Wölfersheim downstream of site 19. The solid arrow marks the outflow of the large WWTP Hungen-Utphe downstream of site 14. Hatched bar: reference site 16 in the space-for-time-substitution approach to account for differences to the site 18 within the restoration section (dark green bar) and to transect sites (light green bars) downstream of the restoration section. Asterisks indicate significant differences to reference site 16 (Fisher’s exact test in a, one-way ANOVA with Dunnett’s multiple comparison test in b; ⋆: p < 0.05; ⋆⋆: p < 0.01; ⋆⋆⋆: p < 0.001)

In summary, the results of the chemical analyses with the calculated TUs, of the in vitro and in vivo tests and of the active monitoring show a marked increase in both the chemical contamination level and biological effects along the course of the Horloff. For some of the parameters examined particularly stressful conditions were determined in the restored section of the river, such as the sediment structure and thickness of the fine sediment layer, the potential mixture toxicity, the baseline toxicity and the biological effects in P. antipodarum. A possible explanation for these results was proposed by [76], who linked the two aspects of deposition of fine sediments and increased toxicity in restored river sections: in contrast to the unrestored sections with their straightened structure and uniform flow pattern, the restored sections are characterized by a higher flow diversity. This allows the deposition of contaminated fine particulate matter in the flow-calmed zones [153]. In these zones the reduced flow velocity supports an increased exchange between sediment and water, so that sediment-bound chemicals are more likely to be remobilized and may affect the macrozoobenthic community [154, 155]. Overall, the Horloff case study underlines that hydromorphological restoration measures must go hand in hand with measures to reduce the input of fine particulate matter and the chemical contamination of water phase and sediments to sustainably improve the ecological status of the body of water.

Conclusion

The restoration of processes (i.e., self-dynamic development) is of overriding importance to achieve the good ecological status compared to the restoration of river morphology (e.g., introduction of substrates), since introduced substrates and created habitats might be destroyed by channel dynamics. Therefore, process-oriented restoration should be preferred over local-scale interventions. Moreover, larger scale projects are more likely to be successful compared to small-scale restoration measures (i.e., measures at local scale and reach scale vs. measures at catchment scale). To ensure that both the water body as well as the local biocenosis can benefit to the greatest extent from a planned restoration measure, individual measures must be implemented in a specific order, which range from measures at catchment scale to measures at reach and local scale.

If a water body suffers from hydrological deficits, these should initially be eliminated and the natural hydrological regime should be restored on the catchment scale (e.g., through reconnection of floodplains and cut-off meanders, guarantee of minimum water flow, removal of dams and weirs). This measure supports important ecological functions such as the temperature and flow regime as well as the substrate diversity.

According to the hierarchical order of measures to reduce stressors and thus restore rivers, chemical contamination from diffuse and point sources should subsequently be removed. Reducing inputs from diffuse (e.g., by sufficiently wide riparian strips, larger retention volumes in stormwater retention basins) and point sources (e.g., via upgrading of large and intermunicipal merging of small wastewater treatment plants) supports a variety of key ecological functions and processes. For example, water quality is improved by the reduction of nutrients and pollutants and substrate diversity increases due to reduced inputs of fine sediments. Only if the water body exhibits sufficient water quality, hydromorphological restoration measures are ecologically effective.

Small-scale hydromorphological measures, such as re-meandering river stretches, and increasing local substrate diversity, or reducing migration barriers at a specific site are beneficial but not effective until hydrology is near-natural and diffuse and point sources are eliminated at the catchment scale. Consequently, local restoration measures can only achieve their full potential, when higher priority stressors are removed at the catchment or reach scale.

In summary, after 20 years of the EU-WFD, it can be stated that conceptual shortcomings, implementation and enforcement deficiencies as well as chemical pollution and structural deficits contribute to the failure to achieve the ecological objectives of the EU-WFD and that the ecological status of our water bodies, which are subject to diverse and partly competing user interests, can only be improved by interlocking measures at all management levels.