1 Introduction

Hydrodynamic models are more and more used to simulate not only the flow and flooding areas of surface waters but also rainfall-induced overland flow in small catchments [3,4,5,6,7], the propagation of flash floods [8,9,10,11], and flood inundation in urban areas [12,13,14,15]. Additionally, to the calculation of the flow field, appropriate methods to represent the runoff generation are needed to establish an integrated hydrological-hydrodynamic model. As 2D shallow water models are typically used for the simulation of relatively short flooding events of some hours up to a few days rather than long-term rainfall–runoff simulations, infiltration represents the most important water loss [16], while evapotranspiration can be neglected in many cases or taken into account in a very simplified way. Infiltration describes the process when rainwater or ponding water is absorbed by the soil. The infiltrated water can either flow further downstream relatively close and parallel to the surface (interflow), be stored in the unsaturated soil zone, or percolate into deeper soil layers and finally contribute to groundwater recharge. For flood modeling it is specifically interesting how much water of the rainfall will lead to overland runoff and how much is “lost” by infiltration. Especially for rural areas and green urban infrastructure, the infiltration losses have to be taken into account. A good understanding of infiltration and its representation in models is crucial not only for the appropriate risk management of heavy rainfalls and flash floods but also for successful solutions of water harvesting and groundwater recharge [17]. Concerning the global trends of ongoing urbanization and climate change coming along with more frequent and more intense hydro-meteorological extremes in terms of floods and droughts, damages from inundations on the one hand and lowered groundwater tables and water stress, on the other hand, will increase and get more severe in future [4, 10, 18, 19]. To mitigate these effects, so-called water-sensitive or climate-adapted urban planning and water resources management strategies considering sustainable urban drainage systems (SUDS) [20, 21] and low impact developments (LIDs) [22,23,24] are gaining increasing importance. Measures such as infiltration basins, swales, raingardens, or permeable pavement should lead back to a more natural water cycle to mitigate flood risks, store water, enhance evapotranspiration, and stabilize groundwater tables. To investigate the effectiveness of such sustainable stormwater management measures with regard to flood mitigation, an appropriate representation of infiltration in 2D hydrodynamic rainfall–runoff models is also needed.

The infiltration process is influenced by many factors, such as soil texture class, soil moisture, soil surface condition, soil bulk density, content of organic matters and litter in the soil, land cover, land use, topography, rainfall characteristics as well as spatial variability of soil properties [17, 25,26,27,28]. It has been, for example, observed that infiltration capacities are usually higher on vegetated than on bare soil surfaces, which is caused among other things by the protection of the surface against the kinetic energy of raindrops preventing the rearrangement of soil particles and thus surface sealing or crust formation. Observations of Ribolzi et al. [29] confirmed the hypothesis that higher effective rainfall intensity was responsible for the formation of less permeable erosion crusts on a 30 % slope than the structural crusts that developed on a 75 % slope. Mohammadzadeh-Habili and Heidarpour [30] carried out column experiments and numerical simulations to study various effects of infiltration into layered soils. If, for example, water infiltrates through an upper layer of finer and less permeable soil than the coarser sublayer, the wetting front, which is usually considered to be stable, becomes unstable forming narrow wetting columns or fingers. Deng and Zhu [31] stated—that similar to saturated flow—the harmonic average of hydraulic conductivities is the best approach to calculate infiltration in layered soils. In their study, they showed that this is true for both, coarse-over-fine and fine-over-coarse layer formations [31]. Assouline and Mualem [17] concluded from their study on heterogeneous small bare catchments, that the impact of surface sealing is much more important than the one of spatial soil variability. Another important factor that can influence infiltration processes is the microtopography, which was, for example, investigated by Esteves et al. [32], Fiedler and Ramirez [33], Mallari et al. [34], Thompson et al. [35], and Xiang et al. [36]. Also, friction can influence infiltration, as higher friction leads to lower flow velocities and more water can infiltrate during the decelerated propagation of the flood wave. Ries et al. [28] conducted 120 experiments with a rainfall simulator on different land covers and soil types with different initial water contents, rain intensities and durations. Based on their observations they stated, that simplified approaches, which are still often applied in models used for the risk management of flash floods—up to the strongest simplification of completely neglecting the runoff reduction during heavy rainfalls—are not suitable. This is shown among others by the fact, that in their conducted experiments even saturated soils still showed significant infiltration rates [28].

There exist many different approaches to model the water losses caused by infiltration, starting from simplified empirical approaches such as the well-known runoff-coefficient, loss-rate, and SCS-CN (Soil Conservation Service Curve Number) methods as well as the Horton equation, more physically-based methods like the Philip infiltration model and the Green–Ampt model, up to the Richards equation. In investigations of Caviedes-Voullième et al. [3], the SCS-CN method was found to be inadequate to be coupled with a distributed model for runoff computations, and as the Richards equation is very complex and the solution needs a lot of computational effort as well as many measured data [37, 38], simplified equations are often used in rainfall–runoff models, where the Green–Ampt model is one of the most popular ones [4, 16, 39, 40]. Developed by Green and Ampt already in 1911 [41], many different applications, modifications, and extensions have been carried out in the last more than 100 years, and in many test cases, it was proven that this model is generally able to appropriately represent infiltration processes [16, 42,43,44].

The main problem for real-world applications consists in estimating the Green–Ampt parameters, namely the hydraulic conductivity, capillary suction head at the wetted front, and effective porosity (representing the saturated soil water content), additionally to the initial soil water content of the soil. The capillary suction in the fine pores of a soil dominates the infiltration process in dry soils, especially at the beginning of a rainfall event, if there is an initial soil moisture deficit which is expressed as difference between saturated and initial soil water content. With increasing water saturation of the soil, the capillary suction reduces and drops to zero, if the soil is completely saturated with water. Then the infiltration depends only on gravity and the infiltration rate is usually assumed to be equal to the saturated hydraulic conductivity [45]. For simple test cases or laboratory experiments, the Green–Ampt parameters might be measured directly, but due to the high effort and costs, it is usually not feasible to directly measure the needed soil properties at enough locations within large areas as it would be needed for real-world applications [46, 47]. If available runoff and sometimes even infiltration time series exist, the Green–Ampt parameters can be considered as calibration parameters [6, 16, 43]. As also observed runoff data is often not available for real-world applications, e.g., considering flash floods or ungauged catchments, the Green–Ampt parameters have to be estimated based on relations to more easily available soil properties such as the soil texture class. Many different of such methods have been developed over the past decades. The probably best known and most cited contribution is the study of Rawls et al. [1], where they developed a table of parameter sets in dependence on soil texture class and soil horizon. These average parameter values are the result of analyzing 5000 soil horizons in the USA, for which the Green–Ampt parameters were derived from the Brooks and Corey parameters that were fitted before to the available water retention data. For a more accurate estimation than taking those average values for the given soil texture class and if more detailed soil information is available, they recommend to predict the water retention matrix potential curve by a regression equation based on particle size distribution, organic matter, bulk density, and 0.33 bar and 15 bar moisture retention values. The best option, of course, would be to determine the Green–Ampt parameters based on measured water retention matrix potential data [1], but as mentioned before this is usually too costly and too much effort for large areas in real-world applications. Other typical values for the Green–Ampt parameters are, for example, presented in the manual of the modeling software company Innovyze [2], referring to different sources, where the hydraulic conductivity is based on minimum infiltration rates observed from measured rainfall, runoff and infiltrometer data, and the initial soil moisture deficit is given for dry conditions as typical moisture deficit at the wilting point, which should be decreased for moist or very wet antecedent conditions [2]. Furthermore, there have been several studies on finding suitable regression equations, where the Green–Ampt parameters can be derived from more easily measured parameters such as percentages of clay, sand, gravel, and the bulk density [36], or from the percentage of surface cover and crusting or initial soil moisture and antecedent rainfall [48].

Especially, on bare soils with no vegetation, the kinetic energy of raindrops can disturb and rearrange soil aggregates resulting in the formation of a surface crust or sealing [49,50,51]. Such layers can lead to a significant decrease in the infiltration capacity by 20 to 2000 times [52, 53]. The thickness of such layer has been reported to vary from 1 to 5 mm [53, 54]. Different adaptions and extensions of infiltration models to account for the effects of a surface crust have been developed [17, 55]. A very simple one is a modified Green–Ampt model, where the hydraulic conductivity is calculated as effective hydraulic conductivity of the crust and subcrust soil [1732, 51, 56].

For practical applications, tabulated standard parameters depending on the soil texture class as represented by Rawls et al. [1] or Innovyze [2] are very interesting and might sometimes be assumed without further knowledge about the actual infiltration behavior, especially when investigating ungauged areas. As these tabulated parameter values are derived from a limited number of field and lab experiments with soils from limited areas, and as it is not clear whether these literature values represent a good assumption of the actual infiltration in larger-scale rainfall–runoff simulations, further analysis is desirable. This study aims to analyze the applicability of tabulated values from two different literature sources to consider infiltration with the Green–Ampt model in 2D shallow water models. Recommendations should be given for rainfall–runoff simulations in ungauged areas in terms of in which cases the tabulated values of which source are suitable to estimate infiltration or giving tendencies how to adapt them. After describing the applied methods, the characteristics and setups of different test cases with measured data are represented. For one test case, a sensitivity analysis is carried out to study the effects of different parameters on infiltration. Afterwards, the results of several simulations with different parameter sets are presented for each test case to evaluate the performance of the model when taking into account the average parameter values compared to the calibrated ones. Three of the test cases are laboratory experiments and one is a field experiment on a small plot in the Thiès catchment in Senegal, where also the extension for crusted soils is exemplarily taken into account to evaluate its effect. Finally, a case study on flash floods in an ungauged desert region in Egypt is shown and the plausibility of the results based on the average values for the Green–Ampt parameters and the impact of a surface crust and friction on infiltration is analyzed.

2 Material and Methods

2.1 2D Shallow Water Model for Overland Flow

The Hydroinformatics Modelling Sytem (hms) is used to simulate the flow field in terms of flow velocities and water depths. It is a Java-based flexible and extendable modeling framework, which has been developed at the Chair of Water Resources Management and Modeling of Hydrosystems of the Technische Universität Berlin, Germany. Previous studies using hms have been carried out, for example, by Hassan et al. [57], Özgen et al. [58], Simons [59], and Tügel et al. [11]. In hms, the depth-averaged 2D shallow water equations are solved with an explicit cell-centered finite volume method, and incorporates robust numerical methods such as the HLLC Riemann solver and a sophisticated total variation diminishing method (TVD) to deal with the numerical challenges, which are associated, for example, with the simulation of very small water depths over complex topography or propagating wet-dry fronts [6]. The general form of the 2D conversation law can be expressed as follows:

$$\begin{aligned} \frac{\partial \mathbf {q}}{\partial t} + \frac{\partial \mathbf {f}}{\partial x} + \frac{\partial \mathbf {g}}{\partial y} =\mathbf {s}, \end{aligned}$$
(1)

where q is the vector of conserved state variables, t is the time, f and g denote the vectors of advective and diffusive fluxes in x- and y-direction, respectively, and the vector s represents the source terms. This equation describes mathematically, that a temporal change of the conserved variables in the control volume can only be caused by a net flux over the surface of the control volume and/or by sinks/sources within the control volume. Inserting the following vectors in the general conservation law (Eq. 1) yields in the shallow water equations (Eq. 2):

$$\begin{aligned} \begin{aligned} \mathbf {q} = \begin{bmatrix} h\\ uh\\ vh\\ \end{bmatrix}, \quad \mathbf {f} = \begin{bmatrix} uh\\ uuh + gh^{2}/2\\ uvh\end{bmatrix}, \quad \\ \mathbf {g} = \begin{bmatrix} vh\\ vuh\\ vvh + gh^{2}/2 \end{bmatrix} \mathbf {s} = \begin{bmatrix} r\\ -g h \frac{ \partial z_B}{ \partial x} - \mathrm{s}_\mathrm{f,x}\\ -g h \frac{ \partial z_B}{ \partial y} - s_{\mathrm{f},y}\end{bmatrix}. \end{aligned} \end{aligned}$$
(2)

Here, the first row of each vector contains the mass balance equation and the second and third rows represent the momentum balance equations in x- and y-direction, respectively. h is the water depth, u and v are the velocity vector components in x- and y-direction, respectively, and \(\mathrm{z}_\mathrm{B}\) is the bottom elevation above datum. uh and vh represent the specific discharge in x- and y-direction, respectively. r is a mass source/sink term accounting for precipitation, infiltration or injection/abstraction of water, and g denotes the gravitational acceleration. The bottom slope terms are \(-gh\frac{ \partial z}{ \partial x}\) and \(-gh\frac{ \partial z}{ \partial y}\) and the momentum sinks due to bottom friction are denoted with \(\mathrm{s}_\mathrm{f,x}\) and \(\mathrm{s}_\mathrm{f,y}\) in x- and y-direction, respectively. The well-known friction law after Manning was used for test cases 2 and 3 as well as for the case study about flash floods in El Gouna:

$$\begin{aligned} \mathrm{s}_\mathrm{f,x} = \frac{g\cdot n^2}{h^{\frac{1}{3}}}u\sqrt{u^2+v^2}, s_\mathrm{f,y} = \frac{g\cdot n^2}{h^{\frac{1}{3}}}v\sqrt{u^2+v^2}, \end{aligned}$$
(3)

where n denotes the Manning roughness coefficient. In the test case 4, the depth-dependent Manning‘s coefficient after Jain et al. [60] was used, as Mügler et al. [5] showed that the application results in the best representation of the velocity field for the considered area:

$$\begin{aligned} n(h) = {\left\{ \begin{array}{ll} n_{0}\frac{h}{h_{0}}^{-\varepsilon } &{} \quad \text{ for } h < h_{0}\\ n_{0} &{} \quad \text{ for } h \ge h_{0}, \end{array}\right. } \end{aligned}$$
(4)

where n(h) denotes the depth-dependent Manning‘s friction coefficient, \(n_{0}\) the minimum land surface-dependent Manning‘s friction coefficient corresponding to flow depth \(h_{0}\) beyond which n is assumed to be constant, and \(\varepsilon\) is a parameter accounting for drag due to vegetation.

As laminar flow conditions were proven in the laboratory experiment after Smith and Woolhiser [61], the laminar friction law as described in Smith and Woohlhiser (1971) and Delfs et al. [62], was used in the first test case:

$$\begin{aligned} \mathrm{s}_\mathrm{f,x} = \frac{g}{{C_{\mathrm{lam}}}^2 h^3 \frac{ \partial z_B}{ \partial x}}u\sqrt{u^2+v^2}, s_{\mathrm{f},y} = \frac{g}{{C_{\mathrm{lam}}}^2 h^3 \frac{ \partial z_B}{ \partial y}}v\sqrt{u^2+v^2}, \end{aligned}$$
(5)

where \(C_\mathrm{lam}\) denotes the laminar friction coefficient. The general form of the balance equation is discretized with a cell-centered finite volume method in space and the forward Euler method in time to calculate the conserved variables at the next time step qn+1 as follows:

$$\begin{aligned} \mathbf {q}^\mathrm{n+1} = \mathbf {q}^\mathrm{n}- \frac{\Delta t}{A}\sum _{k=1}^{n_b}\mathbf {F}_k^n \mathbf {n}_k l_k + \Delta t \mathbf{s}^\mathrm{n}, \end{aligned}$$
(6)

where n + 1 and n denote the new and the old time level, respectively, \(\Delta\)t is the time step and A the area of the considered cell. F denotes the vector of advective and diffusive fluxes over the edge k of the considered cell, \(n_{b}\) is the number of cell edges, n is the normal vector pointing outward of a face, and l is the length of a face. s denotes the source vector at time level n, while for the friction source term the splitting point-implicit method is used to avoid numerical instabilities. A more detailed description can be found in [59].

2.2 Green–Ampt Model for Infiltration

To calculate the mass sink due to infiltration, the Green–Ampt model is used, in which the cumulative infiltration and the infiltration rate are calculated with the following equations:

$$\begin{aligned} F(t) = Kt + (h_{0}-h_\mathrm{f})\Delta \theta \ln \left( 1+\frac{F(t)}{(h_{0}-h_\mathrm{f})\Delta \theta } \right) , \end{aligned}$$
(7)
$$\begin{aligned} f(t) = K\left( 1+\frac{(h_{0}-h_\mathrm{f})\Delta \theta }{F(t)} \right) = \frac{dF}{dt}, \end{aligned}$$
(8)

where F(t) denotes the cumulative depth of infiltration, f(t) the infiltration rate, K the hydraulic conductivity at residual air saturation; according to Whisler and Bouwer [63], K is assumed to be 50 % of the saturated hydraulic conductivity \(K_{s}\). \(h_\mathrm{f}\) is the wetted front capillary suction head, and \(h_{0}\) is the ponding water depth, which is provided by the surface runoff calculation. \(\Delta \theta\) denotes the soil moisture deficit, which is the difference between the saturated soil water content \(\theta_\mathrm{s}\), usually considered with the effective porosity \(n_\mathrm{eff}\), and the initial moisture content \(\theta_\mathrm{i}\). The wetted front capillary suction head, effective porosity, and hydraulic conductivity are called Green–Ampt parameters. The infiltration rate is calculated for each cell and time step depending on the water depth and water saturation of the soil of the previous time step in the considered cell. The calculated infiltration rates for all cells are then taken into account as mass sinks in the mass balance equation (see r in Eq. 2, vector s, first row), while rainfall is represented by a mass source.

When the considered soil tends to generate a surface crust of lower hydraulic conductivity, Brakensiek and Rawls [55] proposed to calculate the effective hydraulic conductivity of a two-layer soil—crust and subcrust—by a harmonic mean [56]:

$$\begin{aligned} K_\mathrm{e} = {\left\{ \begin{array}{ll} K_\mathrm{e} &{} \quad \text{ for } \mathrm{Z}_\mathrm{f} \le \mathrm{Z}_\mathrm{c}\\ \frac{\mathrm{Z}_\mathrm{f}}{\frac{\mathrm{Z}_\mathrm{f}-\mathrm{Z}_\mathrm{c}}{K}+\frac{\mathrm{Z}_\mathrm{c}}{K_\mathrm{c}}} &{} \quad \text{ for } \mathrm{Z}_\mathrm{f} > \mathrm{Z}_\mathrm{c}, \end{array}\right. } \end{aligned}$$
(9)

where \(K_{e}\) is the effective hydraulic conductivity, \(\mathrm{K}_\mathrm{c}\) is the hydraulic conductivity of the crust, K is the hydraulic conductivity of the subcrust soil as used in Eqs. 7 and 8. \(\mathrm{Z}_\mathrm{c}\) is the crust thickness, and \(\mathrm{Z}_\mathrm{f}\) denotes the wetted depth which is calculated by the cumulative infiltration depth from the previous time step divided by the soil moisture deficit. Furthermore, they proposed an equation for the prediction of the steady-state crust conductivity depending on tabled values for a reduction factor for the subcrust conductivity, and the steady state matric potential drop at the crust/subcrust interface (Eq. 10). For more details on this approach, the reader is referred to Rawls et al. [56].

$$\begin{aligned} K_\mathrm{e} = \frac{SC}{1+\Psi _\mathrm{i}/L} \cdot K_\mathrm{s}, \end{aligned}$$
(10)

where SC denotes the reduction factor for subcrust conductivity, \(\Psi_\mathrm{i}\) stands for the steady state capillary potential drop at the crust/subcrust interface, and \(K_{s}\) is the saturated subcrust conductivity, which was set in this study to the same values as the hydraulic conductivity of the subcrust soil as given in the literature values for Green–Ampt parameter K. The approach of calculating the effective conductivity in unsaturated layered soils with a thickness-weighted harmonic average has been considered as suitable approach in several studies [31, 64, 65]. But in formations with fine soil over coarse layers, non-piston flow might be dominating, which is not included in the simplifications of the Green–Ampt model. Zhu and Warrick [65] observed that the harmonic average tends to overestimate infiltration in these cases. Nevertheless, we applied this simplified approach given in Eqs. 9 and 10, to evaluate its performance in our cases.

2.3 Literature Values of Green–Ampt and Crust Parameters

Rawls et al. [1] analyzed different soils and determined average Green–Ampt parameters based on soil texture classes, where seven of them are shown in Table 1.

Table 1 Average values and ranges of the Green–Ampt parameters after Rawls et al. [1]

Other values are, for example, given in the help documentation of the hydraulic and hydrologic modeling software XPStorm and XPSWMM of the Innovyze company [2]. The values for seven different soil texture classes are given in Table 2.

Table 2 Typical values of Green–Ampt parameters given in Innovyze [2]

They refer to typical values for the minimum (asymptotic) infiltration rate corresponding to the saturated hydraulic conductivity for different soil texture classes after Akan [67], which were determined from measured rainfall, runoff and infiltrometer data [68, 69], typical values for the average capillary suction head from several published values, as well as typical values for the initial soil moisture deficit at wilting point [66]. The authors state that these values for the initial soil moisture deficit would apply for very dry conditions, and lower values should be used when wetter initial conditions occur. When comparing these values with the ones after Rawls et al. [1], it is striking that for sand the maximum value for the hydraulic conductivity after Akan [67] is about one order of magnitude lower, and roughly halved for loamy sand, but approximately doubled for loam, while the maximum value for sandy loam and the minimum value for loam after Akan [67] are very similar to the ones after Rawls et al. [1]. For sandy clay loam the minimum value and for clay loam the maximum value after Akan [67] are very similar to those after Rawls et al. [1], and for clay the value after Rawls et al. [1] is closer to the minimum value after Akan [67]. The average capillary suction head after Akan [67] is higher than after Rawls et al. [1] for all texture classes except clay.

Considering a crust with Eq. 10 for \(K_\mathrm{e}\), Rawls et al. [56] determined the reduction factor for the subcrust conductivity and the steady state matric potential drop at the crust/subcrust interface depending on the soil texture class as shown in Table 3.

Table 3 Reduction factor for the subcrust conductivity and the mean steady state matric potential drop at the crust/subcrust interface [56]

2.4 Sensitivity Analysis

For the test case in the Thiès catchment in Senegal, a sensitivity analysis of the Green–Ampt parameters, the crust parameters, and the friction parameters is carried out to see their effects on the runoff hydrograph. For the Green–Ampt parameters the values after Rawls et al. [1] for loamy sand are used for the reference case, and each parameter value is varied one after another to be set to the values for sand and sandy loam after Rawls et al. [1] (Table 1). For the crust approach after Eq. 9, the crust conductivity is varied between the values for loam, sandy clay loam and clay loam after Rawls et al. [1], and the crust thickness is varied between 2, 5 and 10 mm. For the crust approach after Eq. 10, the tabled values after Rawls et al. [56] (Table 3) for sand, loamy sand, and sandy loam are compared. For the friction parameters, the calibrated values after Simons [59] are used for the reference case and are varied within reasonable ranges or set to much higher values, to see that a significant effect is only visible, when using parameter values out of reasonable ranges.

2.5 Optimization Techniques

Especially, if several parameters need to be calibrated, automatic calibration is required as a manual calibration would be very time-consuming, because many simulations have to be carried out varying all calibration parameters sequentially, and possible dependencies between several parameters have to be considered as well. Automatic calibration makes use of an optimization algorithm, to find the optimum parameter combination as fast as possible. Several simulations with varying parameter values are carried out automatically, where the ranges for each parameter can be defined by the user. To carry out an automatic calibration of the Green–Ampt parameters and in some cases also initial water content and the friction within predefined ranges, optimization techniques that are implemented in the SciPy package optimization were used in this study. For the results presented in this work, the simplicial homology global optimization (SHGO) algorithm was used [70]. Within this algorithm, the global minimum of a chosen objective function is searched by varying the different calibration parameters within predefined reasonable ranges. The well-known Nash–Sutcliffe efficiency [71] as well as the nonparametric Kling–Gupta efficiency [72], which are both included in the PyPI package hydroeval [73], were used as objective functions:

$$\begin{aligned} NSE = 1 - \frac{\Sigma (Q_\mathrm{sim}-Q_\mathrm{obs})^2}{\Sigma (Q_\mathrm{obs, \ mean}-Q_\mathrm{obs})^2}, \end{aligned}$$
(11)

where NSE denotes the Nash–Sutcliffe efficiency, Qsim and Qobs are the simulated and observered discharge values at different times respectively, and Qobs, mean is the mean value of the observed values.

$$\begin{aligned} KGE_\mathrm{np} = 1 - \sqrt{(\beta -1)^2 + (\alpha _\mathrm{np}-1)^2 +(r_\mathrm{s}-1)^2},\end{aligned}$$
(12)

where \(KGE_\mathrm{np}\) is the nonparametric Kling–Gupta efficiency, \(\beta\) denotes the bias between simulated and observed mean discharge calculated as simulated mean divided by observed mean, \(\alpha_\mathrm{np}\) is the nonparametric form of the discharge variability, and \(r_\mathrm{s}\) the nonparametric correlation term, calculated with the Spearman rank correlation on the ranks of the observed and simulated discharge time series. For both indicators - NSE and \(KGE_\mathrm{np}\) - a value of one would express a perfect fit of simulated to observed discharges. Within the optimization algorithm, the calibration parameters are varied to reach the minimum of the combined error from NSE and \(KGE_\mathrm{np}\):

$$\begin{aligned} error = (1 - NSE) + (1-KGE_\mathrm{np}). \end{aligned}$$
(13)

2.6 Experimental Setups and Study Areas

Four different test cases with available data for calibration and one case study about flash floods in the ungauged area of El Gouna, Egypt are investigated. The basic information on each case is summarized in Table 4 and the experimental setups (Test 1–4) and study area characteristics (Case study El Gouna) are briefly described in the following sections.

Table 4 Overview of studied test cases

2.6.1 Test Case 1 – Laboratory Experiment after Smith and Woolhiser [61]

As a first test case, the well-known rainfall–runoff experiment of Smith and Woolhiser [61] was chosen. They equipped a laboratory-scale soil flume of 12.2 m length, 0.051 m width, 1.22 m depth with river-deposit sand (Poudre fine sand) and porous flume ends to collect the seepage water to get a prototype infiltrating slope (Fig. 1). To create artificial rainfall, they installed drop-producing manifolds, and to prevent splash erosion, the soil surface was covered with gauze. In the considered experiments, the adjustable slope was set to 1 %. The runoff was measured continuously by a pressure transducer and to observe the moving soil moisture front, gamma-ray attenuation was used. The soil properties were experimentally determined. Through comparison of the observed surface runoff hydrograph and simulations with their coupled model of the one-dimensional Richards equation for infiltration and the kinematic-wave model for overland flow (solved with the finite-difference method), they figured out that the observed data were fitted well when a laminar friction law was taken into account in the model (see Eq. 5).

Fig. 1
figure 1

Sketch of soil flume of the laboratory experiments in Test cases 1, 2 and 3

2.6.2 Test Cases 2 and 3 – Laboratory Experiments after Lima [74]

As the second and third test cases, two laboratory experiments after Lima [74] were chosen. A soil flume according to Fig. 1 of 1 m length, 0.5 width, 0.08 m depth, a slope of 10 %, and with percolated metal plates at the bottom to collect the percolation water was used together with a programmable rainfall simulator to carry out rainfall–runoff experiments with different soils. The first soil was a loam collected from Limburg in the Netherlands (Test 2) and the second soil was a clay loam collected from Alentejo in Portugal (Test 3). The initial and saturated soil water contents as well as the average particle densities and average soil bulk densities were measured by Lima, other parameters such as the hydraulic conductivities were estimated.

2.6.3 Test Case 4 – Field Experiment in Thiès Catchment, Senegal [75]

The field experiment on a plot of 10 m x 4 m within the Thi‘es catchment in Senegal (\(14^{\circ }\)45‘43”N, \(16^{\circ }\)53‘16”W) was considered as the last test case (Fig. 2). The details of the experimental setup are presented in Tatard et al. [75], and Mügler et al. [5], while Simons et al. [6] already presented previous simulations with hms and the Green–Ampt model. The plot consisted of sandy soil and the average slope was approximately 1–1.5 %. Rainfall–runoff tests have been carried out by using a rainfall simulator.

Fig. 2
figure 2

Left: Location of the Thiès catchment in Senegal (made with Natural Earth), right: elevation of the tested basin [76]

2.6.4 Case Study–Flash Flood Simulations in El Gouna, Egypt

El Gouna is a touristic town in the Eastern Desert of Egypt. Although the climate is extremely dry, from time to time strong rainfall events especially in the mountainous areas can generate fast and devastating flash floods, as the usually dry wadi systems turn into rivers with very fast flows. Many settlements, infrastructure, and cities have been constructed inside those usually dry wadi catchments. During flash flood events, these areas are strongly affected and the flooding can cause severe damages to the environment, infrastructure, and properties, and in the worst-case endanger human lives. Consequently, the areas have to face economic, ecological, and social problems due to flash floods. In case of a flood, some streams reach El Gouna, and others drain into the Red Sea north of El Gouna. Heavy rainfall that generates flash floods in the region of El Gouna occurs sporadically in the winter half-year between October–April and often comes from the direction of the Red Sea. Several flash floods and resulting infrastructure damages have been reported in the recent years, e.g. in December 2010, March 2014, October 2015, and October 2016, where total rainfall amounts of approximately 7, 34, 10, and 20 mm, respectively, have been observed by satellite data (for the event in 2010) or recorded by the weather station in El Gouna (for events in 2014, 2015, and 2016) [77]. As a very exceptional occasion, manual discontinuous velocity measurements and corresponding discharge estimations could have been carried out during the flash flood event in March 2014, and are published in Hadidi [78]. Figure 3 represents the location of El Gouna in Egypt, the topography, catchment boundary and streams of Wadi Bili until the point where the velocity measurements have been carried out in 2014, and the detail of the model domain of El Gouna between the measurement point in the Wadi Bili catchment and the Red Sea.

Fig. 3
figure 3

Location of El Gouna and the Wadi Bili catchment in Egypt, topography (AW3D30 \(\copyright\)Jaxa), and streams in the Wadi Bili catchment and model domain for the area of El Gouna as well as the location of discharge measurements in 2014 published in Hadidi [78] (map: \(\copyright\)OpenStreetMap contributors)

2.7 Procedure and Numerical Model Setups

For the four test cases with available data for calibration (4), simulations with different parameter sets are carried out:

  • With average values for the Green–Ampt parameters for the given soil after Rawls et al. [1] (see Table 1 for parameter values) to check the performance if these values would be taken without calibration in case of ungauged areas.

  • With optimum parameters values as result of the automatic calibration, where the ranges are set for the given soil type after Rawls et al. [1] according to Table 1, and/or within larger ranges (for example, the combined ranges of two consecutive soil types) to reach the best fit between model and observations within reasonable ranges.

  • If available: with parameters values that were derived from direct measurements or calibration given in the literature corresponding to the experiments to check the performance when only the remaining unknown parameters are calibrated.

  • With typical values for the Green–Ampt parameters after the help documentation from Innovyze [2] (see Table 2) to check their performance if these values would be taken without calibration in case of ungauged areas.

  • In Test 4: exemplarily considering a thin surface crust of lower hydraulic conductivity with the approaches given in Eqs. 9 and 10, as infiltration with the parameters after Rawls strongly overestimate infiltration and as surface sealing may occur on bare soils during heavy rainfalls.

The runoff hydrographs from different parameter sets are evaluated through comparison to the observed ones. Additionally, a case study with flash flood simulations in Egypt is represented under consideration of different values for the infiltration and friction parameters, analyzing the plausibility of the results and the effect of friction and surface crusts on the infiltration processes. Table 5 gives an overview of the model setups and the overall ranges for friction and Green–Ampt parameters covering all ranges of different optimization and simulation runs for each case. The subsequent sections briefly describe the setup of each case.

Table 5 Overview of model setups and parameter ranges used during the calibration process

2.7.1 Test Case 1 – Laboratory Experiment after Smith and Woolhiser [61]

The domain was discretized in rectangular cells of 1 cm x 1 cm leading to 6100 cells, and all boundaries were defined as open. Artificial rainfall with a rate of 4.2 mm/min (252 mm/h) was imposed over 15 minutes, and the total simulation time was set to 17 minutes. The initial conditions for water depths and flow velocities were set to zero. From the initial soil moisture profile measured by Smith and Woolhiser [61], an average initial saturation of \(S_\mathrm{i} = 0.2\) can be assumed as the initial condition for the soil, which was transferred here into the initial soil water content by multiplying it with the effective porosity : \(\theta_{i} = S_\mathrm{i} \cdot n_\mathrm{eff}\) (neglecting the residual water content), which therefore depends on the assumed saturated soil water content, namely the effective porosity. In one optimization case, the initial and saturated water contents have been used as independent calibration parameters, while in the other cases they were changed together to keep the initial saturation of 0.2. Except for one optimization case, the friction coefficient \(C_\mathrm{lam}\) was set to the value of 1333333 \(\mathrm{m}^{-1}\mathrm{s}^{-1}\) according to Delfs et al. [62].

2.7.2 Test Cases 2 and 3 – Laboratory Experiments after Lima [74]

The domain was discretized in rectangular cells of 5 cm x 5 cm leading to 200 cells, and all boundaries were defined as open. In both cases after Lima [74], a constant rainfall of 2.25 mm/min (135 mm/h) was imposed for 15 minutes and the total simulation time was set to 17 minutes. The initial conditions for water depths and flow velocities were set to zero. The initial soil water content was 0.0107 for the loam from Limburg (Test 2), and 0.006 for the clay loam from Alentejo (Test 3), and were not varied during the simulations. In Test 2, Manning‘s friction coefficient n was varied between 0.010–0.033 \(\mathrm{sm}^{-1/3}\), representing the range from bare sand to bare clay loam after Engman [79], as there is no value given for loam. In Test 3, Manning‘s friction coefficient was varied between 0.012–0.033 \(\mathrm{sm}^{-1/3}\) corresponding to the range for bare clay loam after Engman [79].

2.7.3 Test Case 4 – Field Experiment in Thiés Catchment, Senegal [75]

The domain was discretized in rectangular cells of 10 cm x 10 cm leading to 4141 cells. The upstream boundary was set as open, all other boundaries were set as closed according to Simons [59]. The experimental results in terms of measured rainfall and runoff time series of an 86 minutes long experiment, where the average rainfall intensity was 70.76 mm/h (see Fig. 14 in Sect. 3.5), were used for the numerical investigations. The initial conditions for water depths and flow velocities were set to zero. The initial soil water content was varied between zero as calibrated in Simons [59] and 0.2. Manning‘s friction law with depth-dependent Manning‘s coefficient (Eq. 4 was used with the calibrated parameters from Simons [59]: \(n_{0} = 0.014\) \(\mathrm{sm}^{-1/3}\), \(h_{0} = 0.0045 \ \mathrm{m}\) and \(\varepsilon = 0.1\).

2.7.4 Case Study – Flash Flood Simulations in El Gouna, Egypt

The model domain of about 11 km x 8 km was discretized into 438372 rectangular cells of 15 m x 15 m. The topography is represented by the digital surface model (DSM) ALOSWORLD 3D (AW3D30) with a horizontal resolution of 30 m [80], which was interpolated to a cell size of 10 m. Buildings were incorporated into the DSM by increasing the elevations inside building polygons extracted from OpenStreetMap (see Fig. 3 in Sect. 2.6.4). All boundaries were considered to have a free outflow except for 30 m at the western boundary, where the inflow from the upstream wadi catchment enters the coastal plain. The hydrograph from a hydrological model, which was calibrated with the measured hydrograph by Hadidi [78], was implemented as boundary condition to simulate the flash flood event from 09 March 2014 [81]. Furthermore, a time series of rainfall intensities (Fig. 4) that has been measured at the weather station located at the TU Berlin Campus El Gouna, was imposed as source term in each cell. The accumulated rainfall amount was 34 mm. The initial conditions for water depths and flow velocities were set to zero. The initial soil water content was set to 0.03 \(\mathrm{m}^3/\mathrm{m}^3\) and not varied during the simulation, assuming very dry conditions, which represents the typical conditions in the study area. The friction was varied between 0.01 and 0.02 \(\mathrm{sm}^{-1/3}\) according to the average values for bare sand and bare clay loam after Engman [79].

Fig. 4
figure 4

Rainfall and inflow imposed as source and boundary condition in the model, respectively

As most parts of the model domain consist of natural surfaces, mainly in terms of non-vegetated bare sand, infiltration cannot be neglected. After the Harmonized World Soil Database [82] as well as from field investigations, which were carried out within a student research project at TU Berlin Campus El Gouna, the dominant soil type in that area is sand (Fig. 5). Although no measurements are available for model calibration, former studies showed that taking into account the average Green–Ampt parameters for sand after Rawls et al. [1] leads to a strong overestimation of infiltration, as the entire rainfall inside the domain and almost the whole incoming flood wave from the large wadi catchment completely infiltrated before reaching the city of El Gouna [11]. As there have been large flooding areas observed by the population—reported in personal conversations with shop owners and staff members of the desalination plant as well as through photographs of inundated areas and resulting damages—it is obvious, that the infiltration is not represented realistically. In addition to the overestimation of infiltration due to the assumed Green–Ampt parameters, also spatial rainfall patterns, which are not taken into account as the rainfall data of only one weather station was available, could also contribute to this wrong representation. In particular, bare soils in arid areas are lacking vegetation cover and have therefore no protection from raindrop impact, which might lead to soil crusting coming along with a large reduction of the infiltration capacity [49]. Therefore, the approach of a Green–Ampt model for crusted soils as described in Eq. 9 is taken into account, and the impact of a crust as well as friction is studied. As for sand and loamy sand, almost all water infiltrated very fast, loam was also considered, as this texture class is also occurring in this area according to the Harmonized World Soil Database.

Fig. 5
figure 5

Soil texture classes according to the Harmonized World Soil Database and determined from double-ring infiltrometer tests (map: ©OpenStreetMap contributors)

3 Results

3.1 Sensitivity Analysis

For the test case of the Thiès catchment, a sensitivity analysis of the Green–Ampt parameters, different crust parameters for the two approaches presented in Eqs. 9 and Eq. 10, as well as the parameters of the depth-dependent Manning approach as given in Eq. 4 is presented in the following. Table 6 shows the parameter values for the reference case, which is represented with a solid black line in Figs. 6, 7, 8, as well as for the basic crust setting, when the different crust parameters are varied.

Fig. 6
figure 6

Variation of Green–Ampt parameters: hydraulic conductivity (K), capillary suction at wetted front (\(h_\mathrm{f}\)), saturated soil water content (\(\theta_\mathrm{s}\)), and initial soil water content (\(\theta_\mathrm{i}\))

Fig. 7
figure 7

Variaton of different crust parameters: hydraulic conductivity of the crust (\(\mathrm{K}_\mathrm{c}\)), depth of the crust (\(\mathrm{Z}_\mathrm{c}\)), reduction factor of subcrust conductivity (SC), and steady-state matric potential drop at crust/subcrust interface (\(\Psi_\mathrm{i}\))

Fig. 8
figure 8

Variation of the subcrust hydraulic conductivity (\(K_{SC}\)) for the approach with SC and \(\Psi_\mathrm{i}\) for loamy sand, and the parameters of the depth-dependent Manning law: minimum Manning coefficient (\(n_{0}\)), depth beyond which n is assumed to be constant (\(h_{0}\)), and the parameter accounting for drag (\(\varepsilon\))

Table 6 Parameter values for the reference case and basic case when a crust is considered either after Eqs. 9 or 10

The Green–Ampt parameters for the reference case correspond to the ones after Rawls et al. [1] for loamy sand and are varied sequentially with the values for sand and sandy loam. From Fig. 6, it becomes obvious that the hydraulic conductivity of the three consecutive soil types has the strongest influence on the hydrograph, the capillary suction head and the initial soil water content also have significant influence especially at the beginning, while the saturated soil water content is neglectable in the presented cases. It has to be mentioned that this parameter also has the lowest variation between the three selected consecutive soil types. As expected, the infiltration is higher with increasing hydraulic conductivity as well as with increasing capillary suction leading to decreased surface runoff, while a higher initial soil water content leads to less infiltration and a higher surface runoff. Figure 7 shows the results of varying the crust parameters \(\mathrm{K}_\mathrm{c}\) and \(\mathrm{Z}_\mathrm{c}\) after Eq. 9, as well as SC and \(\Psi_\mathrm{i}\) after Eq. 10. For the approach after Eq. 9, the hydraulic conductivity of the crust was varied between values for loam, sandy clay loam, and clay loam, while the crust thickness was varied between 2, 5 and 10 mm. For the the approach after Eq. 10, the values of the reduction factor for the subcrust conductivity and the steady-state matric potential drop at the crust/subcrust interface were varied corresponding to the values for sandy loam, loamy sand, and sand after Rawls et al. [56]. As expected, both crust approaches significantly reduce the infiltration and therefore lead to higher surface runoff. Lowering the hydraulic conductivity of the crust \(\mathrm{K}_\mathrm{c}\) or increasing the depth of the crust \(\mathrm{Z}_\mathrm{c}\) lead to increased runoff. Changing SC between the selected soil types has almost no influence on the hydrograph, while increasing \(\Psi_\mathrm{i}\) significantly increases surface runoff and therefore decreases the infiltration.

The upper left diagram in Fig. 8 shows the effect, when the crust consideration after Eq. 10 is used with values for loamy sand and the subcrust conductivity is varied. Here, it becomes obvious that the subcrust conductivity has still a very strong effect on the infiltration and surface runoff, when using this approach for the crust, as for sand (11.78 cm/h) the crust effect is not visible at all as the water directly infiltrates, and for sandy loam (1.09 cm/h) the surface runoff is much higher. The three graphs for the parameters of the depth-dependent friction law show, that all parameters have almost no influence on the hydrograph when taking the double or half of the values. Only, when setting the minimum Manning coefficient \(n_{0}\) to much higher values than the range for sand after Engman [79] (0.010–0.016 sm1/3), as for example here to 0.15 sm1/3 corresponding to the recommended value for short grass prairie, an influence can be observed. It has to be mentioned, that in larger systems as for example the model area of El Gouna, the friction has a significant influence on the infiltration, as shown later in Sect. 3.6.

3.2 Test Case 1 – Laboratory Experiment after Smith and Woolhiser [61]

The simulated runoff hydrographs for the different parameter sets (as described in Sect. 2.7) are shown in Fig. 9, and the corresponding parameter sets as well as indicators for the goodness-of-fit are summarized in Table 7. When the average Green–Ampt parameters after Rawls et al. [1] for sand are considered in the model, the surface runoff is strongly overestimated for the whole rising limb of the hydrograph, and runoff starts about 180 seconds before the observed runoff. However, the peak discharge is well captured in magnitude and occurs only 9 seconds earlier than the observed one. The NSE for the overall hydrograph is 0.245, indicating a bad agreement between simulated and observed hydrograph (see Table 5, line 1; green line with circles in Fig. 9. When using the Green–Ampt parameters that resulted from the automatic calibration for the ranges of effective porosity and capillary suction head at the wetted front for sand and the hydraulic conductivity as given in Rawls et al. [1], the runoff is only slightly overestimated at the first part of the rising limb and the runoff starts 80 seconds later than in the observations. But between the times of 580 and 921 seconds, the simulated hydrograph underestimated the observed one, leading to a peak discharge which is only 82 % of the observed one (see Table 7, line 2; purple line with triangles in Fig. 9). With a NSE of 0.871, the overall agreement with the observed hydrograph is much better than with the average parameters for sand. Finally, optimization was carried out, where all Green–Ampt parameters (effective porosity, hydraulic conductivity, and capillary suction head) were varied in larger ranges, where the minimum and maximum values of the ranges from sand and loamy sand after Rawls et al. [1] were taken into account. Furthermore, also the initial soil water content as well as the friction coefficient were varied within the ranges given in Table 8. The starting time of surface runoff as well as the peak discharge magnitude and time, and the overall shape of the observed hydrograph could be captured very well (see Table 7, line 3; blue line with diamonds in Fig. 9. With a NSE of 0.986 the performance is very good and comparable to the results, which were reported in Delfs et al. [62] (using a diffusive wave model for overland flow, Richards model for fluid movement in the unsaturated soil, and an interface layer concept for coupling) showing a NSE of 0.982 (see Table 7, line 4; black dots in Fig. 9. All simulations could not capture the slightly slower decrease in discharge towards the end of the recession limb, which was also reported in the simulation discussion in Smith and Woolhiser [61]. When using values for sand after Innovyze [2] surface runoff was even more overestimated, and the resulting NSE of -33.395 reflects the bad performance. Therefore, the values after Innovyze [2] were not further taken into account in Fig. 9 and Table 7. In Fig. 10 the infiltration rates for the different parameter sets are shown, and as there are no observed infiltration rates available, compared to the ones simulated from Delfs et al. [62] (black dots in Fig. 10). Again, the underestimation of infiltration when considering average Green–Ampt parameters for sand after Rawls et al. [1] is visible in the lower infiltration rates (green line with circles in Fig. 10). The infiltration rate with optimized parameters for the ranges of sand (purple line with triangles in Fig. 10) agrees well with the one simulated by Delfs et al. [62] until the time of 480 seconds and overestimates it afterward. With optimized parameters for the larger ranges between loamy sand and sand, it is the other way round (blue line with diamonds in Fig. 10).

Fig. 9
figure 9

Different simulated runoff hydrographs for for the laboratory experiment after Smith and Woolhiser [61]

Table 7 Different sets of Green–Ampt parameters for simulations of the rainfall–runoff experiment after Smith and Woolhiser [61]
Table 8 Parameter ranges for the optimization routine to calibrate the Green–Ampt parameters between loamy sand–sand and the friction coefficient for laminar flow \(C_\mathrm{lam}\)
Fig. 10
figure 10

Infiltration rates over time for different simulations of the experiment after Smith and Woolhiser [61]

3.3 Test Case 2 – Laboratory Experiment for Loam (Limburg) after Lima [74]

Figure 11 shows the hydrographs for the loam soil of Limburg when considering different Green–Ampt parameters compared to the observed one. The corresponding parameter sets as well as indicators for the goodness-of-fit are given in Table 9. Similar to the test case of the Smith and Woolhiser experiment (Sect. 2.6.1), surface runoff is strongly overestimated when considering the average Green–Ampt parameters for loam after Rawls et al. [1] and starts about 180 seconds earlier than the observed one. The peak discharge is about 17 % higher than the observed one, and the NSE for the overall hydrograph is 1.186, and the \(KGE_\mathrm{np}\) is 0.2 indicating both a bad agreement to the observations (see Table 9, line 1; green line with circles in Fig. 8). The Manning coefficient was set to 0.020 \(\mathrm{sm}_{-1/3}\), which is the recommended value for bare clay loam after Engman [79]. The result when using the optimized parameters within the ranges for loam after Rawls et al. [1] agrees well with the measurements. The start time of surface runoff as well as the overall hydrograph shape are well captured, but the peak discharge is underestimated and only 86 % of the observed one. The NSE is 0.928 and the \(KGE_\mathrm{np}\) is 0.924 (see Table 9, line 2; purple line with triangles in Fig. 11). Almost the same hydrograph with an NSE of 0.930 and \(KGE_\mathrm{np}\) of 0.927 was simulated when using the measured and estimated values of saturated soil water content and the hydraulic conductivity after Lima [74], and calibrating only the capillary suction head hf and Manning‘s friction coefficient (see Table 9, line 3; blue line with diamonds in Fig. 11). For both cases, Manning‘s friction coefficient was calibrated within the range of 0.010–0.033 sm1/3, representing the range from bare sand to bare clay loam after Engman [79], and resulting value was 0.021 \(\mathrm{sm}_{-1/3}\) in both cases, and therefore very close to the recommended value of 0.020 sm1/3 for clay loam after Engman [79]. The capillary suction head was calibrated within the range for loam after Rawls et al. [1]. The fact that two parameter sets with significantly different values for hydraulic conductivity and capillary suction head generated almost the same hydrographs, emphasizes the problem of equifinality during the calibration process of several calibration parameters that depend on each other. A higher hydraulic conductivity can be sort of “compensated” by a lower capillary suction head, although the different processes of capillary-driven and gravity-driven infiltration are dominating at different states of soil water content and therefore different times of the rainfall event. Taking into account the average values for loam after Innovyze [2] with the mean value for the hydraulic conductivity after Akan [67], lead to better results than with average values after Rawls et al. [1] (see Table 9, line 4; orange line with plus signs in Fig. 11). But with a NSE of 0.045 the results are still much worse than the results with calibrated parameters, where the NSE is 0.928. When taking instead of the mean the maximum value for the hydraulic conductivity after Akan [67], the results improved but still strongly underestimate infiltration leading to a NSE of 0.429 (see Table 9, line 5; black dashed line Fig. 11).

Fig. 11
figure 11

Runoff hydrographs for different simulations of the experiment for loamy soil after Lima [74]

Table 9 Different sets of Green–Ampt parameters for simulations of the rainfall–runoff experiment on loamy soil after Lima [74]

Figure 12 presents the infiltration rates, and it clearly shows, that in the case of average values for loam (green line with circles in Fig. 12), the infiltration rate shortly reaches the maximum value of rainfall intensity, but directly starts to drop after the first time step. For the two cases with calibrated parameters the infiltration rates just start to decrease after 180 seconds and decrease much slower (purple line with triangles and blue line with diamonds in Fig. 12). When considering the values for loam after Innovyze [2] with the mean value for the hydraulic conductivity, the infiltration rate starts to decrease after 80 seconds (orange line with plus signs in Fig. 12), and after 120 seconds if the maximum value for the hydraulic conductivity was taken (black dashed line in Fig. 12). As the infiltration rate was not directly measured, the difference between rainfall and observed runoff was taken as reference.

Fig. 12
figure 12

Infiltration rates over time for different simulations of the experiment for loamy soil after Lima [74]

3.4 Test Case 3 – Laboratory Experiment for Clay Loam (Alentejo) after Lima [74]

Figure 10 shows the hydrographs for the clay loam soil of Alentejo, and the corresponding parameter sets as well as indicators for the goodness-of-fit are given in Table 8. When having in mind the two other test cases presented before, it is remarkable that in this case the simulated hydrograph agrees well with the observed one when considering the average values for clay loam after Rawls et al. [1]. The surface runoff is only slightly overestimated and starts about 60 seconds earlier than the observed one. The NSE of 0.724 and of 0.808 indicate a relatively good agreement as well (see Table 10, line 1; green line with circles in Fig. 13). The Manning coefficient was set to 0.020 sm1/3 corresponding to the recommended value for bare clay loam after Engman [79]. Another simulation was done considering the values from optimization when taking into account the ranges for clay loam for the effective porosity and the capillary suction head and the average value of hydraulic conductivity after Rawls et al. [1], and for the Manning coefficient a range between 0.012 and 0.033 sm1/3 corresponding to the range for bare clay loam after Engman [79]. Surface runoff starts approximately 20 seconds before the observed one, leading to a slight overestimation of runoff at the beginning. After about 100 seconds until the end, the simulated hydrograph slightly underestimates the observed one. The NSE of 0.924 and of 0.862 are a little bit higher than for the average values for clay loam (see Table 10, line 2; purple line with triangles in Fig. 13). When taking the measured and estimated values for the saturated soil water content and the hydraulic conductivity after Lima [74] and calibrating only the Manning‘s coefficient and capillary suction head, the results show a very similar hydrograph, but surface runoff starts about 40 seconds before the measured one, first slightly overestimates the measured runoff and after about 120 seconds underestimates it. The NSE of 0.888 is slightly below the one for the optimized parameters, while the is slightly higher (see Table 10, line 3; blue line with diamonds in Fig. 13). For these two cases, where also Manning‘s coefficient was calibrated, values of 0.033 and 0.032 sm1/3, respectively, were obtained being at the upper bound of the range for bare clay loam after Engman [79]. Taking into account the average values for clay loam after Innovyze, the results are slightly worse than with average values after Rawls et al. [1] when taking the mean value for the hydraulic conductivity after Akan [67] (see Table 10, line 4; orange line with plus signs in Fig. 13), and slightly better when taking into account the maximum value for the hydraulic conductivity after Akan [67] (see Table 10, line 5; black dashed line in Fig. 13). Manning‘s coefficient was not calibrated here but fixed to be 0.020 sm1/3 representing the recommended value for bare clay loam to simulate the case that there is no runoff data available for calibration. If Manning‘s coefficient was calibrated also here, it resulted also in a value of 0.033 sm1/3, but the impact on the hydrograph was minor.

Fig. 13
figure 13

Runoff hydrographs for different simulations of the experiment after Lima for clay loam [74]

Table 10 Different sets of Green–Ampt parameters for simulations of the rainfall–runoff experiment on clay loam after Lima [74]

3.5 Test case 4–Field experiment in Thiès catchment, Senegal [75]

The simulated hydrographs of different parameter sets are compared to the measured data in Fig. 14, the different parameter sets as well as goodness-of-fit indicators are given in Table 11. If the average values for sand after Rawls et al. [1] are considered, infiltration is completely overestimated as no surface runoff occurs, which leads to an NSE of 5.05 (see Table 11, line 1; green dotted line in Fig. 14). When considering instead the average values for loamy sand, the simulated hydrograph agrees much better with the observed one, but still underestimates it most of the time. The overall NSE is 0.520 and the KGEnp is 0.656 (see Table 11, line 2; orange dashed line in Fig. 14). When taking instead the values for sand after Innovyze with the mean value for the hydraulic conductivity after Akan [67] into account, the second one of the two smaller peaks in the beginning of the observed hydrograph is relatively well captured, while the rest of the hydrograph is strongly overestimated (see Table 11, line 3; purple plus signs in Fig. 14). The overall NSE is 0.417 indicating a much better fit than with the average values for sand after Rawls et al. [1]. The KGEnp of 0.660 gives a better agreement than with the average values for loamy sand after Rawls et al. [1]. Considering the values after Innovyze with the maximum value for the hydraulic conductivity after Akan [67] results in an improved hydrograph with a NSE of 0.628 and a KGEnp of 0.731, but still clearly underestimates infiltration (see Table 11, line 4; light blue rectangles in Fig. 14). The optimization was carried out for the range of sand after Rawls et al. [1], however a decreased lower threshold according to the calibrated value in Simons [59] was used for the hydraulic conductivity. The hydrograph resulting from the optimized parameters does not capture the measured two smaller peaks in the beginning of the observed runoff, but later on it fits the observations relatively well, leading to an overall NSE of 0.810 (see Table 11, line 5; dark blue solid line in Fig. 14). A last simulation was done considering the modified Green–Ampt model with an effective hydraulic conductivity of crust and subcrust soil as given in Eq. 9 in Sect. 2.2. To exemplarily see the effect of this modification accounting for a surface crust, one simulation with the average values for sand after Rawls et al. [1] for the subcrust soil, and a crust of 5 mm thickness and with an hydraulic conductivity of 0.39 cm/s, which is 30 times lower than the one for the subcrust soil (11.78 cm/s), was carried out. The simulated hydrograph considering such crust captures very well the beginning of surface flow in terms of the first smaller peaks, although it overestimates these peaks by about 35 %. After about 800 seconds, the simulated hydrograph starts to continuously decrease and reaches zero after about 1900 seconds, while the observed one is continuously increasing. Therefore, the simulated values strongly underestimate the observed ones, which leads to an overall bad performance with a NSE of 4.823 (see Table 11, line 6; grey triangles in Fig. 14). As shown in Eq. 9, the effective hydraulic conductivity equals the crust hydraulic conductivity as long as the wetted depth is smaller than the crust thickness, and is calculated by a thickness-weigthed harmonic mean between the hydraulic conductivities of the crust and subcrust soil, when the wetted depth is larger than the crust thickness. Therefore, with increasing time, the infiltration increases due to a stronger impact of the hydraulic conductivity of the subcrust soil on the effective hydraulic conductivity.

Fig. 14
figure 14

Runoff hydrographs for different simulations of the rainfall–runoff experiment in the Thiès catchment in Senegal

Table 11 Different sets of Green–Ampt parameters for simulations of the rainfall–runoff experiment in the Thìès catchment in Senegal

3.6 Case Study – Flash Flood Simulations in El Gouna, Egypt

Figure 15 shows the water depths in the model domain after 9 hours of simulation time considering infiltration with 1) average values for loam after Rawls et al. [1] and 2) average values for sand after Rawls et al. [1] combined with a surface crust after Eq. 9 of 5 mm thickness with a hydraulic conductivity for clay after Rawls et al. [1] and a Manning‘s friction coefficient of n = 0.01 sm1/3 according to the average value for bare sand, and n = 0.02 sm1/3 according to the average value for bare clay loam after Engman [79], respectively. Comparing the results with infiltration for loam and for sand with a crust of 5 mm, it becomes obvious that for loam the local rainfall inside the model domain almost completely infiltrated, while for sand with a 5-mm-thick crust, flooding areas apart from the main stream coming from Wadi Bili were generated, which better agrees with the observations reported by the community. Strong damages were, for example, reported at construction sites along the shore of one lagoon in the South of El Gouna, where the flow was draining into the lagoon. These inundation areas and the stream draining into the lagoon are captured when considering sand with a crust of 5 mm, but are completely dry when considering loam. Also, other inundated areas in El Gouna have been documented by photographs and confirmed by shop owners. These inundated areas are not represented in the case of loam as the water directly infiltrates, while they are captured to a certain extent when considering sand with a crust. When comparing the results from different friction coefficients, the significant effect of the friction becomes obvious, as the flood wave from Wadi Bili propagates much slower, and therefore more infiltration can take place leading to overall smaller water depth. This becomes even clearer in Fig. 16, where the temporal development of water depth at the location indicated with a red circle in Fig. 15 is shown for different simulations. While the effect of friction is less pronounced when no infiltration is considered, it gets more important when considering either infiltration for loam or for sand with a surface crust of 5 mm. With a friction coefficient of n = 0.02 sm1/3, the peak water depth for sand with crust (orange line with rectangles in Fig. 14) is about 60 % smaller compared to the result with n = 0.01 sm1/3 (purple line with triangles in Fig. 16) and occurs about three hours later. For loam, the peak water depth with n = 0.02 sm1/3 (dark blue dashed line in Fig. 16) is about 58 % smaller and 10 hours later than with n = 0.01 sm1/3 (light blue line with crosses in Fig. 16). For the consideration of sand with a crust, the infiltration in the first hours of simulation time is lower compared to loam, so that the local rainfall around that location already generated some surface runoff leading to water depths of a few centimeters at the considered location before the flood wave reaches the location.

Fig. 15
figure 15

Water depth distributions in the model domain after 9 hours of simulation time considering different values for Infiltration and friction parameteters. Zc: crust thickness, Kc: hydraulic conductivity of crust, Kclay: hydraulic conductivity for clay after Rawls et al. [1], see Table 1, n: Manning‘s friction coefficient (background map: \(\copyright\)OpenStreetMap contributors)

Fig. 16
figure 16

Temporal development of simulated water depths at location of maximum water depth indicated with a red circle in Fig. 15, considering different infiltration settings and values of Manning‘s friction coefficient. K: hydraulic conductivity, Kc: hydraulic conductivity of crust, n: Manning‘s friction coefficient in sm–1/3

Shortly after the flood wave has passed the location, the water depth drops to zero, which can be explained by the fact that the wetted depth increased so that the impact of the hydraulic conductivity of the underlying sand is dominating rather than the one of the surface crust. When considering loam, which has a lower hydraulic conductivity than sand but a higher one than the clay of the crust, the infiltration in the beginning is much higher and almost no surface runoff is generated by the local rainfall, as almost all water directly infiltrated. In the case of a friction coefficient of n = 0.01 \(\mathrm{sm}_{-1/3}\), the peak water depth occurs about three hours later for loam than for sand with crust and is about 23 % smaller. On the other hand, the water stays much longer at the location also after the flood wave is over, which is due to the higher volume of water that infiltrated already at the beginning causing a faster saturation of the soil. As the hydraulic conductivity is the limiting factor when the saturation increased and the capillary suction has no strong influence anymore, the water from the flood wave infiltrates much slower than in the case of sand with crust, where at later time steps the effective hydraulic conductivity is dominated by the one of the sand below the crust. Taking into account the values for sand after Innovyze with the minimum value for the hydraulic conductivity (grey line with dots in Fig. 16) results in a peak water depth of 16 cm after about 15.5 hours, which is much more plausible than the result for average values for sand and loamy sand after Rawls et al. [1] (black line with plus signs in 16), where no water depth occurred at the considered location. But similar to the case of loam, the local rainfall around that location directly infiltrated, so that only the flood wave coming from Wadi Bili generated water depths, while the flooding areas induced from the locally fallen rainfall is not captured, and infiltration is therefore still overestimated. On the other hand, similar to the results for loam, the water stays for a longer time after the flood and rainfall event, which in turn agrees better with the observations than the results of sand with a crust, where the water depth drops to zero almost directly after the flood wave has passed. Therefore, a combination of the average values after Innovyze [2] combined with a thin surface crust of lower hydraulic conductivity could lead to the overall most plausible results.

3.7 Percentual Differences of Optimum Values from Average Values after Rawls et al. [1]

Figure 14 shows the improvements of the Nash–Sutcliffe efficiency as well as the corresponding percental changes of initial soil moisture deficit (\(\Delta \theta\)), hydraulic conductivity (K) and capillary suction head at the wetted front (\(h_\mathrm{f}\)) for the two best performing parameter sets of each test case when compared to the average values after Rawls et al. [1], called reference simulation in the following. The NSE could be significantly improved by 112 % and 123 % for Test 1, 211 and 212 % for Test 2 and by even 586 and 568 % for Test 4, while the improvement for Test 3 was with 20 % and 16 % much lower as the reference simulation already gave relatively good results.

As a result of the automatic calibration, the initial soil moisture deficit negligibly decreased in both shown parameters sets for the laboratory experiment after Smith and Woolhiser [61] (Test 1). The hydraulic conductivity was fixed in the case where the automatic calibration was carried out within the ranges for sand after Rawls et al. [1] (see Table 1), and decreased by 36 % when calibration was carried out within the ranges for loamy sand to sand after Rawls et al. [1]. The capillary suction head at the wetted front significantly increased in both parameter sets leading to higher infiltration rates, and therefore better results as the overall infiltration was underestimated in the reference simulation. In the case of lower hydraulic conductivity, it increased by 286 % and therefore much stronger than with a fixed hydraulic conductivity (59 %), balancing the reduced hydraulic conductivity and resulting in a similar infiltration as with the higher hydraulic conductivity.

For the two laboratory experiments after Lima [74] (Test 2 and 3), the initial soil moisture deficit slightly increased leading to an increased infiltration, while the initial soil water content was fixed (taken from measurements from Lima [74]), meaning that effectively the saturated soil water content increased. The hydraulic conductivity was not calibrated but fixed either according to the value after Rawls et al. [1] for the given soil type (no change compared to the reference simulation) or after Lima [74] respectively, where it was 77 % higher in Test 2 and 59 % lower in Test 3 compared to the reference simulations. As infiltration was strongly underestimated in the reference simulation of Test 2, the capillary suction head strongly increased by 506 % in the case of the same hydraulic conductivity, and still significantly increased by 242 % in the case of higher hydraulic conductivity, leading to a stronger infiltration than in the reference simulation. In Test 3, the capillary suction head also increased in both cases, while it is higher in the case of decreased hydraulic conductivity (310 % compared to 129 %).

For the experiment in the Thiès catchment (Test 4), the initial soil moisture deficit slightly decreased as a result of the automatic calibration leading to less infiltration as in the reference simulation where infiltration was strongly overestimated. Using the fixed values after Innovyze [2] led to an increase of the initial soil moisture deficit by 57 % compared to the reference simulation. The hydraulic conductivity was strongly reduced by 86 % in the case of automatic calibration, while the lower bound of the calibration range was decreased according to the calibrated value after Simons [59]. When the fixed value of the maximum hydraulic conductivity for sand after Innovyze [2] was used, it was decreased by 90 % compared to the reference simulation, therefore infiltration induced by hydraulic conductivity was reduced in both cases. Remarkably, the capillary suction head was strongly increased by 288 % during calibration and by 105 % with the fixed value after Innovyze [2], causing an enhanced capillary suction induced infiltration compared to the reference simulation. But the overall infiltration (dependent on initial soil moisture deficit, hydraulic conductivity, and capillary suction) was less than in the reference simulation and therefore led to significantly improved results represented by much higher values of NSE. The fact that for all test cases two considerably different parameter sets led to similar results and therefore similar values of NSE emphasizes again the problem of equifinality during the calibration process, which makes it difficult to give clear recommendations for the single parameters.

In the case study about flash floods in Egypt, a reduction of the hydraulic conductivity by 97 % combined with an increase of the capillary suction head by 80 % (corresponding to the average parameters after Rawls et al. [1] for loam), led to much more plausible results of inundation areas and water depths than with the average parameters after Rawls et al. [1] for the dominant soil type sand.

As shown in Test 4, choosing the values after Innovyze [2] considering the maximum hydraulic conductivity improved the NSE by 568 % compared to the average values after Rawls et al. [1]. In Test 2 and 3, the NSE also increased by 162 % and 8 %, respectively, when the fixed values after Innovyze [2] with the maximum hydraulic conductivity were chosen instead of the average values after Rawls et al. [1] (not represented in Fig. 17). Hence, it can be concluded that the literature values after Innovyze [2] might be generally more suitable to estimate the Green–Ampt parameters in ungauged areas than the average values after Rawls et al. [1]. In the case study about flash floods in El Gouna (see 3.6), the values for sand after Innovyze [2] under consideration of the lowest value for the hydraulic conductivity led to much more plausible results than the average values for sand after Rawls et al. [1]. As using the maximum value instead of the mean value for the hydraulic conductivity after Innovyze [2] resulted in a better performance for the Test cases 2, 3 and 4, it could be concluded to better choose the hydraulic conductivity within the upper part of the range after Innovyze [2] (according to Akan [67]). On the other hand, the results of the case study in El Gouna showed, that even when choosing the lowest value for the hydraulic conductivity after Innovyze [2] infiltration was still overestimated. Overall, according to the presented results, the values after Innovyze [2] can be rather recommended than the ones after Rawls et al. [1], while the selection of the hydraulic conductivity should be checked individually according to the plausibility of the results for each considered case.

Fig. 17
figure 17

Percental changes of the Green–Ampt (GA) parameters for each test case compared to the average values after Rawls et al. [1]. NSE: Nash–Sutcliffe efficiency, \(\Delta \theta\): initial soil moisture deficit, K: hydraulic conductivity, \(h_\mathrm{f}\): capillary suction head and wetted front, n: Manning‘s friction coefficient, max K: maximum hydraulic conductivity after Innovyze [2] according to Akan [67] (see Table 2)

3.8 Overview of Calibrated Parameter Sets for All Cases

Table 12 summarizes the optimum parameter sets resulting from the calibration process in test cases 1–4 and the plausibility analysis for the case study in El Gouna. For the case study of El Gouna, the given parameter set did still not lead to satisfying results, as no flooding areas from the local rainfall have been simulated although they have been observed by the community.

Table 12 Overview of calibrated parameter sets for all cases

4 Discussion

The result of the different test cases as well as the case study in Egypt emphasized that taking into account the average values for the Green–Ampt parameters after Rawls et al. [1] does often not lead to appropriate results. In the laboratory experiments after Smith and Woolhiser [61] for sandy soil and after Lima [74] for loam, the average values after Rawls et al. [1] led to a significant underestimation of infiltration, while in the field experiment in the Thiès catchment in Senegal as well as in the case study in Egypt, they strongly overestimated infiltration, resulting in no surface runoff which is not representing the observed incidents. For the second laboratory experiment after Lima [74] for clay loam, the simulation with average Green–Ampt parameters after Rawls et al. [1] resulted in a relatively good agreement to the observations. When considering the typical values for the Green–Ampt parameters after Innovyze [2], the results for the experiment after Lima [74] for loamy soil as well as for the Thiès catchment and the case study in El Gouna for sandy soil were much better than with the average values after Rawls et al. [1]. Nevertheless, infiltration was still underestimated in the experiment after Lima [74] with loamy soil, and also in the experiment in the Thiès catchment in Senegal, while it was still overestimated for El Gouna, even when using the minimum value for the hydraulic conductivity after Akan [67]. The consideration of a surface crust of a few millimeters generated much more plausible results for the first hours of simulation time, while the fast decrease in water depth after the flood event does not represent the real conditions appropriately. Furthermore, it was shown that the effect of friction on the infiltration process can also be significant, especially in larger-scale applications.

As the performance of average Green–Ampt parameters for sand after Rawls et al. [1] for laboratory experiments tend to go in the reverse direction than for field experiments, it could be argued that the conditions of the laboratory experiments did not represent similar conditions as occurring in the field. One reason could be that during the laboratory experiments, the formation of a crust could have been prevented due to covering the soil with gauze as described in the experimental setup of Smith and Woolhiser [61], that protected the soil surface from raindrop impact. In addition, the soil crust in lab experiments would develop gradually due to the rainfall splash, while in field studies the soil crust might have been formed already before the event itself, for example, due to prior rainfall events and other processes. Furthermore, it might have been the case that in the lab experiment the soil was more homogenous so that no finer particles could lead to clogging of the pores. On the other hand, the trend to overestimate infiltration by average Green–Ampt parameters after Rawls et al. [1] under natural and field conditions was studied here only for sandy soils in semi-arid to arid areas. Case studies in areas with different soil texture classes as well as other climate conditions might lead to other results.

Prado Hernández et al. [83] investigated six different texture classes from loam to clay in Mexico and compared different approaches to estimate the Green–Ampt parameters with measured infiltration rates from double-ring infiltrometer tests. Their results show a strong underestimation of infiltration when considering the average values after Rawls et al. [1]. This observation agrees well with the results from the laboratory experiments after Smith and Woolhiser [61] as well as Lima [74] for loam, where infiltration with average values after Rawls et al. [1] was also strongly underestimated. On the other hand, ponded infiltration rates as measured with double-ring infiltrometers tend to be higher than the actual infiltration rates during natural or simulated rainfall events [50, 84]. Therefore, it might be questionable to evaluate the results from the Green–Ampt model with the ponded infiltration rates from double-ring infiltrometer tests, when the aim is to estimate infiltration during rainfalls. In other studies, such as Xiang et al. [36], the Green–Ampt parameters were derived from rainfall simulator tests, where the observed infiltration probably agrees better with the one during natural rainfalls than the ponded infiltration measured from double-ring infiltrometer test.

5 Conclusions and Outlook

A robust depth-averaged 2D shallow water model incorporating the Green–Ampt model for infiltration was used to study the performance of tabulated values for the Green–Ampt parameters taken from different sources in the literature. For one test case, a sensitivity analysis for the Green–Ampt, crust and friction parameters was carried out. The results show that the hydraulic conductivity, the capillary suction at the wetted front, and the initial soil moisture as well as the crust conductivity, crust thickness and matric potential drop at the crust/subcrust interface significantly affect the infiltration and therefore the surface runoff. Afterwards, four different test cases with observed data to enable an evaluation of model results as well as one real-world application in an ungauged area were investigated, where average values for different soil texture classes after Rawls et al. [1] on the one hand and after Innovyze [2] on the other hand resulted in different performances. The average values after Rawls et al. [1] led to underestimations of infiltration in laboratory experiments with sandy and loamy soil, respectively, while showing a relatively good agreement in a laboratory experiment with clay loam, and strongly overestimated infiltration in a field experiment in Senegal as well as in a case study in Egypt, both with almost bare sandy soil. For the laboratory experiment with loam, the field experiment in Senegal and the case study in Egypt, the values after Innovyze [2] resulted in better performances than the ones after Rawls et al. [1], while they strongly underestimated infiltration in the laboratory experiment after Smith and Woolhiser [61], still underestimated the infiltration for the experiment with loam, and also for the field experiment in Senegal, and overestimated the infiltration in the case study in Egypt. Based on the obtained results for the two cases in real catchments with sandy soil (Test 4 in Thiès catchment, Senegal and the Case study El Gouna, Egypt), it is recommended to use lower values for the hydraulic conductivity than the values for sand given in Rawls et al. [1] to prevent an overestimation of infiltration. In both cases, the hydraulic conductivity was decreased by about 90–100 %, while the capillary suction head at the wetted front was increased by about 80–288 % compared to the average values after Rawls et al. [1]. Despite of this strong increase of the capillary suction, which enhances infiltration particularly at early time steps when the soil moisture deficit is high, the overall infiltration rates are much smaller and more realistic than the ones with average values. This emphasizes the stronger sensitivity of surface runoff on the hydraulic conductivity than on the capillary suction.

Overall, it can be recommended to use the values after Innovyze [2] rather than the average values after Rawls et al. [1] as they led to better results in all studied cases except for the laboratory experiment after Smith and Woolhiser [61] (Test case 1). If the given soil of the considered study area might tend to form a surface crust during heavy rainfalls, as it is, for example, the case for bare soils in arid areas, a thin layer of a few millimeters with reduced hydraulic conductivity to account for this crust and the application of an effective hydraulic conductivity between the crust and subcrust soil, might be suitable to represent the infiltration behavior more realistically. But in the represented results, this approach also led to an overestimation of infiltration at later times steps if the parameter values of the subcrust soil are taken from Rawls et al. [1]. As for both cases in natural areas (in Senegal and Egypt) the soil texture class was sand, further investigations should be carried out for field tests with other soil texture classes to give recommendations also for ungauged areas with other dominant soil texture classes.

Currently, a study on simulating the rainfall simulator experiments represented in Ries et al. [28] is carried out to evaluate the performance of the tabulated Green–Ampt parameters for different land use and soil types. For the case study in Egypt, it is planned to carry out rainfall simulator tests to get a better estimation of the infiltration behavior occurring during natural rainfall events in that area. Through measuring the surface runoff and soil moistures, as well as recording the plot areas with digital photos to generate high-resolution DEMs using photogrammetry, the plot-scale field tests can be simulated and calibrated afterwards to derive appropriate Green–Ampt parameters for the model area. Furthermore, the possibly occurring surface sealing as well as microtopography and their effect on infiltration will be studied, in addition to the influence of the DEM resolution on modeling infiltration. The overall aim is to define the most important determinants that influence infiltration under different circumstances and to find suitable values of Green–Ampt parameters for different cases.