1 Introduction

Flooding is a natural phenomenon and will become a serious natural hazard when its water volume reaches a certain magnitude (Sayers et al. 2002; Alfieri et al. 2014). Floods frequently affect many basins and regions worldwide (Norbiato et al. 2007; Sampson et al. 2015; Curebal et al. 2016; Esposito et al. 2018). It is believed that the risk of floods at the global scale will increase greatly around 2050, and the frequency of floods in 100 years will be doubled in 40% of the world because of the impact of climate change (Arnell and Gosling 2016).

A flooding event includes both the rising period and the recession period. The former is closely connected to rainfall, especially the maximum rainfall rate and the time to the centroid of a rainfall event (Shuster et al. 2008). Shuster and his colleagues proposed that a more precise time step was needed to note the rising period, which means that less precise temporal resolution data could not capture the variability in the stage that was to be expected with the short-term flow phenomena of the rising limb rate. The flood recession period, which typically is longer than 1/2 of an entire flooding event, however, is more important for water use (Shorr 2000; Guan et al. 2014), because a flood could provide abundant water resources (Shao et al. 2009; Ahmad et al. 2014; Liu et al. 2015).

Many studies, therefore, have explored the pattern of flood recession processes based on different physical factors. Rainfall and underlying surface conditions are the two main causal factors that influence the flood recession process in mountainous areas, on the precondition of weak human activity impacts (Khaleghi et al. 2011). Rainfall is the source of flood discharge and is critical in the flood recession process at different temporal resolutions. A higher-than-usual rainfall may lead to a rapid drop in the daily streamflow (and water level) (Ronchail et al. 2018). Similarly, rainfall is also a critical factor that causes changes in monthly flood recession processes (Mishra et al. 2014).

Different underlying surface conditions also cause differences in flood recession processes. Many studies have investigated the impact of land use/cover change (LUCC) on the flood recession process. For example, an increase in forest cover would lead to a substantial reduction in flood peak flows (Costa et al. 2003; Chang and Feng 2017). The conversion of cultivated land or grassland into urban land would increase the flood peak flow (Li and Feng 2011). The amount and rate of flood recession in bare soil are smaller than those in straw-covered fields (Zhang et al. 2007). Terrain is also one of the factors that affect the flood recession process. Fan and Han (1991) conducted a slope experiment in a field and proposed that a 9° slope is a critical point—slopes less than 9° always had a greater impact on the recession flow, and when they exceeded 9°, this effect decreased quickly. Terrain indices are a typical representation of underlying surface factors. Jin et al. (2017) proposed that watersheds with high average topographic index values and their standard deviations have slower water falling rates. The shape of the river cross-section (Ye et al. 2019) and the geomorphological structure (Amit et al. 2002; Li et al. 2006) also have a great impact on the flood recession process.

The control factors of the recession process in various regions are also different. The basin recession process in the eastern part of the United States is greatly affected by the terrain factors, while the basin recession process in the western part is greatly affected by the soil factors (Patnaik et al. 2018). In northern Sweden (Karlsen et al. 2019), the basin topography is the main control factor of the recession process at a relatively high flow, while the basin area becomes important at relatively low flow.

In general, the flood recession process could be inferred by assuming a certain relationship between the aquifer water storage and its discharge. This relationship often refers to linear (Maillet 1905; Tallaksen 1995), exponential (Beven and Kirkby 1979), power functions (Wittenberg 1999; Li et al. 2010; Charron and Ouarda 2015), or other types. There are two key parameters of power-law storage-discharge function and it has been proven that the top five significant factors for recession are location, soil infiltration capacity, channel length, forest cover, and precipitation (Krakauer and Temimi 2011). The linear storage-discharge function originated from groundwater research; thus at present it is widely used in baseflow or dry season streamflow. In the baseflow recession process, the top four factors that have great influence on the key parameters are the mean annual precipitation, mean annual air temperature, soil type, and mean surface slope of a catchment (Beck et al. 2013).

Exponential decay in the recession process could be deduced with a linear storage and drainage relationship assumption. The exponential equation is also the most common fitting curve employed to fit the recession process (Saboe 1966; Amit et al. 2002; Shaw et al. 2013; Zhai et al. 2015). In the equation, a flood recession coefficient could be used to represent the characteristics of a flood recession process. It is also a key factor in flood forecasting in many hydrological models (Yan and Zhang 2014; Song et al. 2016; Kim and Han 2017).

Although the flood recession coefficient has a certain physical meaning (Young and Beven 1991), the physical change mechanism that determines the flood recession coefficient is still unclear. In hydrological models, it is regarded as a parameter that must be estimated using observed data where the phenomenon of “equifinality for different parameters” would easily lead to a “fake” flood recession coefficient. It is critical to clarify what causal factors determine the flood recession coefficient, and further control the flood recession process as a basis of reasonable and scientific management of water resources at the basin scale.

This study aimed to quantify the influence of rainfall and the underlying surface on flood recession processes, with a case in the upper Huaihe River Basin, China. We used the rainfall and runoff data over the period 2006–2015 at eight watersheds in the upper mountainous region of the Huaihe River Basin, an area with little influence by human activities. The underlying surface factors were extracted from the Digital Elevation Model (DEM) dataset with a spatial resolution of 250 m. The Kendall tau test and Spearman rank-order correlation test were used to analyze the correlation between flood recession processes and rainfall, as well as underlying surface conditions. We also used backward elimination to quantify the relative importance of rainfall and underlying surface in flood recession processes.

This article is organized as follows. Section 2 presents the details of the study area and the data used. Section 3 introduces the method for fitting the flood recession process, two correlation tests, and backward elimination. The results are presented in Sect. 4. Following a discussion of the results and their implications in Sect. 5, the study’s conclusions are presented in Sect. 6.

2 Study Area and Data

The Huaihe River Basin is situated between 30°55′–38°20′N and 111°55′–120°45′E, between the Yellow River and the Yangtze River, and spreads from the Tongbo Mountain and ends in the Yellow Sea (Fig. 1). We selected eight watersheds as the study area, which are in the mountainous and hilly areas around the edge of the Huaihe River Basin, with weak influences from human activities (red line in Fig. 1). Their area ranges from 17.9 to 1640 km2. It can be seen from Fig. 1 that the watershed elevation and relief amplitude of the eight watersheds are quite different. All eight watersheds belong to the monsoon climatic zone, the average annual temperature is 16.4 °C (from 2006 to 2015), and the average annual rainfall is 945 mm for the same period.

Fig. 1
figure 1

Location and catchment outlets (shown as the flood stations in the figure) of the Huaihe River Basin study area. The eight catchments are indicated with a red line. The sky-blue area in the map inset in the upper right represents the Huaihe River Basin, and the area with DEM is part of the upper and middle regions of the Huaihe River Basin

Owing to the monsoon and windward mountain terrain conditions, flooding has become the most serious natural hazard in the Huaihe River Basin (Wu et al. 2011; Chen et al. 2016; Zhang 2019). During the decade from 2006 to 2015, 58 flooding events occurred in the study area. Of these events, 20 occurred in the northern tributaries, 30 in the southern tributaries, and eight in the mainstream. The peak flood ranged from 6.65 to 3370 m3/s, and the maximum rainfall intensity changed from 0.22 to 16.4 mm/h.

All the rainfall data are recorded by gauges of the Hydrology Bureau at the Ministry of Water Resources of China and published in the Hydrological Data of Huaihe River Basin, Annual Hydrological Report (China 2006–2015). The time series is from 2006 to 2015. Since the time step of rainfall and runoff data recorded in the report are different, the original data must first be processed to the same temporal resolution. In this study, we unified the time step to 1 h for correspondence and comparison.

The rainfall data are recorded in the form of a rainfall depth over a period. For those periods of more than 1 h, we denoted the rainfall at the first moment as 0 and denoted the rainfall per hour in the period as the average rainfall depth. The runoff record is expressed differently than the rainfall data. It represents a flow at the recorded moment. The recorded time, however, might not be on the hour. Therefore, to match it with the rainfall data, it is processed by simple linear interpolation to the nearest hour. Then an hourly rainfall and runoff data set between 2006 and 2015 was created. We extracted the data of the 58 rainfall and flooding events that occurred in the study area from this data set.

The underlying surface factors were computed by using the DEM from the Resource and Environment Science and Data Center (RESDC)Footnote 1 with a spatial resolution of 250 m. The DEM data were generated by resampling the data based on the latest SRTM-V4.1, and the source data are projected by a WGS84 ellipsoid.

3 Methods

First, the potential impact factors of recession process were extracted based on rainfall data and DEM. Then, we used exponential equation to fit the 58 recession processes collected, and derived 58 recession coefficients. Next, the Spearman rank-order correlation test and Kendall tau test were used to analyze the correlation between the potential impact factors and recession coefficient. Backwards elimination was further used to compare the importance of each potential impact factor on recession coefficient.

3.1 Extracting the Impact Factors

The flood recession process occurs after the maximum rain intensity. Two factors, the distance from the storm center to the outlet of the basin (D) and the decrease rate of rainfall intensity (Vi), therefore, were selected to describe the impact of the temporal and spatial characteristics of rainfall on the flood recession process. The Vi is defined as the average decrease rate of rainfall intensity from the moment when the maximum rainfall intensity occurs to the end of the rainfall event. In general, the rainstorm center always moves during a rainfall event. Therefore, the rainstorm center referred to in this article is the rainstorm center at the moment of the maximum rainfall intensity on the basin. The D is the straight-line distance between the storm center and the outlet of the basin. Furthermore, four factors, namely the basin area (Area), basin shape coefficient (Bs), basin average slope (Slope), and basin relief amplitude (RA) are selected to reflect the underlying characteristics of the eight basins.

The extraction of the underlying surface factors of the basin was based on the DEM data and processed by ArcGIS. First, we divided the Huaihe River Basin by DEM data into several small watersheds and then extracted their area. Next, we compared the extracted subwatershed area with the area recorded in the Hydrological Data of Huaihe River Basin, Annual Hydrological Report (China 2006–2015). If the data of the area extracted by DEM are consistent with those recorded in the report, it can be considered as correct. After that, confirmed width (W) can be calculated by W = Area/L. According to the equation Bs = W/L, RA = Hmax − Hmin, the remaining underlying surface factor values can be calculated. Table 1 shows the value of the underlying surface factors.

Table 1 Underlying surface features of the eight watersheds in the Huaihe River Basin, China

Vegetation and soil are also critical underlying factors that affect the flood recession process. We collected two sets of LUCC data between 2005 and 2015 from RESDC, but it changed little across space; thus, land use was not considered an influencing factor in this study. Factors such as vegetation type, soil type, and soil texture were highly similar in the research area; thus, their influence on flood recession was almost the same in different watersheds, and we did not consider them in this study either.

3.2 Recession Process Fitting

In total, 58 unimodal flooding events occurred in the eight catchments between 2006 and 2015. It has been demonstrated that the peak flow and magnitude of floods are affected by rainfall volume and rainfall intensity (Gottschalk and Weingartner 1998; Zhang et al. 2005). Therefore, in order to reduce the impact of rainfall volume on the flood recession process simulation, the flood recession data were first normalized by dividing the flow that occurred after the peak flow by the peak flow and making the recession curve a decreasing curve with a peak value of 1. Since the exponential equation is one of the common recession curve fitting equations, it was selected to fit each regression curve:

$$y = e^{Rt} ,$$
(1)

where R is the recession coefficient (1/h), a negative value, t is the recession time (t = 0 refers to the time of peak flow), and R controls the shape of the recession curve, indicating the rate of the flood recession. The larger the R is, the smoother the recession curve and the slower the recessional speed.

3.3 Correlation Tests

Because the correlation form between each factor and R is unknown, two rank correlation tests, the Kendall tau test, and the Spearman rank-order correlation test, were used to evaluate their relationship. Both the Kendall tau test and Spearman rank-order correlation test are classic nonparametric statistical methods that use a monotone equation to evaluate the correlation between two variables (Spearman 1904; Kendall 1938). The correlation coefficient of the Spearman rank-order test is expressed by ρ, and the correlation coefficient of the Kendall tau test is expressed by τ. Their values range from − 1 to + 1, and 0 indicates that the two variables are uncorrelated. A positive value indicates a positive correlation, a negative value indicates a negative correlation, and a larger value indicates a stronger correlation.

3.3.1 Spearman Rank-Order Correlation Test

Assume that the two random variables are X and Y, and the number of their elements is N. Xi and Yi represent the ith value of X and Y(1 ≤ i ≤ N). Sort X and Y in ascending or descending order at the same time to get two ranking sets x and y. Use xi to represent the rank of Xi in X, and so do yi. Then, calculate the Spearman rank-order correlation coefficient as follows:

$$\rho = \frac{{\sum\nolimits_{i = 1}^{N} {(x_{i} - \bar{x})(y_{i} - \bar{y})} }}{{\sqrt{\sum\nolimits_{i = 1}^{N} {(x_{i} - \bar{x})^{2} \sum\nolimits_{i = 1}^{N} {(y_{i} - \bar{y})^{2} }} } }},$$
(2)

3.3.2 Kendall Tau Test

Assume that the two random variables are X and Y, and the number of their elements is N. Xi and Yi represent the ith value of X and Y (1 ≤ i ≤ N). If is correct for any i < j  N, then note it as a member of concordant pairs; otherwise, note it as a member of discordant pairs. Record the total number of concordant pairs as Nc and the total number of discordant pairs as Nd. Then, calculate the Kendall tau correlation coefficient as follows:

$$\tau = \frac{{N_{c} - N_{d} }}{N(N - 1)/2}.$$
(3)

3.4 Backward Elimination

Backward elimination is an independent variable selection method for the linear regression model. In this study, it was used to compare the importance of each variable on R. First, enter all variables into the linear regression equation and then calculate the contribution of each variable and remove the variables that do not meet the confidence level. Next, calculate the contribution again, and repeat the steps until all the remaining variables meet the significance level. In this study, the confidence level is set at 95%.

4 Results

All 58 flood recession processes were fitted by the exponential equation (Eq. 1). The results show that 95% of the total flood recession events can be well fitted with R2 values greater than 0.75. Table 2 shows the average R and average R2 of fitting results. All the average R2 values exceed 0.8, proving that the exponential equation matches these flood recession processes.

Table 2 Fitting result of the eight watersheds in the Huaihe River Basin, China

4.1 Rainfall Factors’ Impacts on Flood Recession Coefficient

Figure 2 shows the effect of a single rainfall factor on R. It shows that the value of Vi is between 0 and 0.8 mm/h2. In this Vi interval, the flood recession will be faster with the increase of Vi, as shown in Fig. 2a. This relationship, however, seems discrete and not monotonous since Vi is not the only factor impacting the flood recession process. Some abnormal points can be seen in Fig. 2a. For example, R in Vi between 0.2 and 0.5 mm/h2 is much smaller than in other ranges of Vi. Specifically, at the two points farmed in Fig. 2a, Vi is only 0.4 mm/h2, but R is very small. The relationship between D and R should be positive, which is consistent with the trend shown in Fig. 2b. It also can be seen, however, that even if the rainstorm center is close to the basin outlet, a slower flood recession exists. From this viewpoint, the results obtained by analyzing the influence of a single factor are partially correct. Vi and D, therefore, must be considered in combination.

Fig. 2
figure 2

Scatter plots of two rainfall factors versus flood recession coefficient (R). The two rainfall factors are the decrease rate of rainfall intensity (Vi) and distance (D). R decreases with increase in Vi nonmonotonically (a) and increases with increase in D nonmonotonically (b). The dash line in a shows some abnormal points with medium Vi and smaller R

Considering the impact of Vi and D simultaneously, ViD–R change is plotted (Fig. 3). In Fig. 3, the vertical axis is Vi, the horizontal axis is D, and the color bar represents the value of R. It can be seen that except for the flooding events (D = 0), when Vi is similar, the color of the points in the close horizontal line from left to right change from deep blue to yellow or dark red. This color change means that the farther the distance between the storm center and the outlet, the slower the recessional speed. The recession coefficient R shows a clear division with the change of Vi and D. It is mostly red when Vi is less than 0.2 mm/h2 (which indicates the recessional speed is slow), and blue when the Vi value is between 0.2 and 0.5 mm/h2 (which indicates the recessional speed is much faster). It is mostly green and yellow, however, when Vi is greater than 0.5 mm/h2, indicating that the recessional rate changes slowly compared to the previous stage.

Fig. 3
figure 3

Scatter plots of the decrease rate of rainfall intensity (Vi) and distance (D) versus flood recession coefficient (R). The color bar represents the value of R. When Vi is between 0.2 and 0.5 mm/h2, the flood recession coefficient is much different from others

From the overall relationship of the data, under the same premise of a similar D, when Vi is less than 0.2 mm/h2, R is decreasing with the increase of Vi, but when Vi is greater than 0.2 mm/h2, R is increasing with the increase of Vi. It is difficult to say which part is abnormal. It could be explained, however, that in addition to rainfall, the underlying surface plays a pivotal role in the flood recession process.

4.2 Underlying Surface’s Impacts on Flood Recession Coefficient

Since each watershed experienced more than one flooding event, for the convenience of analysis, an average value of R in each watershed was calculated to explore the impact of the underlying factors on R (Table 2). Figure 4 shows the graph of the average R and the characteristics of the underlying surface condition factors.

Fig. 4
figure 4

Scatter plots of a Area; b basin shape coefficient (Bs); c Slope; and d relief amplitude (RA) versus flood recession coefficient (R). In this graph, R increases with Area and RA, and decreases with Bs and Slope. The positive relationship between RA and R is not really conspicuous

Figure 4 shows that R increases with Area, indicating that flood recession processes are faster in small watersheds than in big watersheds. This is consistent with our general perception. The confluence path of a large watershed is larger than the confluence path of a small watershed; therefore, flood recession processes in the large watershed will be slower.

As for the influence of the average slope of the watershed on R, there is a tendency that R decreases as the slope increases. Steep slopes, therefore, increase the velocity of the flow, and the flood decreases quickly.

There is no significant relationship between Bs and R. The relationship between RA and R is also complex but a little positive, indicating that a greater RA leads to a slower recessional rate. Higher RA may indicate that the terrain in the basin is more complicated. The path of the outflow might then be more tortuous, so the flood recession process becomes slower.

4.3 Dominant Factors in Different Stage of Vi

According to the result of Sect. 4.1, the flooding events could be divided into three types by the value of Vi. The first type is the floods with Vi less than 0.2 mm/h2, the second type is the floods with Vi between 0.2 and 0.5 mm/h2, and the last type is the floods with Vi greater than 0.5 mm/h2. The relationship between the recession coefficient and impact factors of the floods in each category was calculated by the Kendall tau test and the Spearman rank-order correlation test, and the results are shown in Table 3.

Table 3 Results of correlation tests of R and the six impact factors

It shows that the correlation coefficient between watershed area and R is positive, which is consistent with the results in Fig. 4. Similarly, the relationship between Slope, Vi, D and R in Table 3 are also the same as that in Fig. 4. The difference of the sign of the correlation coefficient, however, indicates that the influence of Bs and RA on R differs at different ranges of Vi. Not all of the results, however, satisfy the significance condition. Therefore, we cannot judge the relationship between the influencing factors and R by the numbers in Table 3 alone. Focusing on correlation coefficients that satisfy the significance condition, Vi and D have a great correlation with R when Vi is less than 0.2 mm/h2. The Kendall correlation coefficient and the Spearman correlation coefficient of Vi and R are − 0.368 and − 0.528. The Kendall correlation coefficient and the Spearman correlation coefficient of D and R is 0.280 and 0.396. In type two, R is closely related to underlying surface condition factors, especially the average slope of the watershed. The Kendall correlation coefficient and the Spearman correlation coefficient of Slope and R are − 0.337 and − 0.419. In type three, R has a close connection with RA of the watershed, the Spearman correlation coefficient between them is 0.812.

In summary, in the first type, R has a strong correlation with rainfall, in the second type, R has a strong correlation with underlying surface factors, and the average slope of the watershed is the factor most closely related to R. Although R is also closely related to underlying surface condition factors in the third type, the main role in this type is played by RA of the watershed.

To quantify the influence of each factor on R, backward elimination analysis was used to calculate the coefficient of the six factors and R, where confidence level of 95% was considered. Table 4 shows the fitting results of those meeting the confidence level. Since the unit and magnitude of each factor are different, Table 4 only shows the normalized coefficients. It indicates that if we calculate R by backward elimination, different dominate factors are found in different types. In type one, R is controlled by both rainfall and underlying surface conditions. Among them, the two rainfall factors Vi and D impact R greatly, and their normalization coefficients are − 0.348 and 0.274, respectively. In this type, the main underlying surface factor Bs controls R, and the normalization coefficient is 0.414. Compared with the results of the correlation analysis, the results of backward elimination show that Bs also has a big impact on R, but this effect was ignored in the correlation analysis. In type two, Slope has the main effect on R, and its normalized coefficient is − 0.342. This is consistent with the results of the correlation analysis. In type three, Area has the main effect on R, and its normalized coefficient is 0.760. Compared with the results of the correlation analysis, the fitting results show that RA has no significant impact on R.

Table 4 Normalized coefficients of the factors of fitting results (confidence level = 95%)

5 Discussion

From the results obtained in Sect. 4, we find that the flood recession process is controlled by rainfall, basin average slope, and basin shape, which are similar to the results reported in Song et al. (2016). The significance of watershed area, however, is different in different situations in the study area in the flood recession process, which differs from the studies in Zillgens et al. (2007). Zillgens et al. (2007) indicated that the hydrographs at the larger scales exhibited significantly attenuated recession behavior than in smaller catchments in all bimodal event cases—this behavior could be related to a nested relationship among the watersheds, which did not exist in this study. Since our study did not include random factors such as human activities, the results should also be applicable to other basins with similar natural conditions.

Comparing the results in Tables 3 and 4, it can be found that although sometimes RA has a strong relationship with R, this relationship is not a major impact factor after backward elimination. It can be explained as follows. The fitting curve in Fig. 4 shows that the connection between RA and R is more chaotic than the other underlying surface factors with R. From Table 3, we can see that the relationship between RA and the flood recession coefficient varies with positive and negative correlations in different situations. The uncertain correlation relationship indicates that RA is not a critical causal factor of change in R. RA, therefore, was removed after the backward elimination analysis.

Vi = 0.2 mm/h2 is a key point of the flood recession process. The following example shows what Vi = 0.2 mm/h2 means: According to China’s National Meteorological Center,Footnote 2 a rain event exceeding 15 mm per 12 h could be called a heavy rain. A rainfall event with a uniform decrease in rain intensity at a total rainfall depth of 15 mm within 12 h (its rainfall intensity decreases from 2.5 to 0 in 12 h) is a rainfall event with Vi = 0.2 mm/h2. Our conclusion, therefore, indicates that when heavy rain occurs in a basin, the flood recession process in the river basin is no longer affected by rainfall factors, but controlled by underlying surface factors, which is consistent with Hua and Wen (1980).

6 Conclusion

By selecting eight watersheds in the upper mountainous area of the Huaihe River Basin, we examined the impacts of rainfall and the underlying surface on flood recession processes. The exponential equation was used to describe the flood recession processes, and it performed well for 95% of all flood recession processes, with the R2 values higher than 0.75.

Six factors were selected to describe the rainfall and land surface properties associated with the flood recession processes, namely decrease rate of rainfall intensity (Vi), distance from the storm center to the outlet of the basin (D), basin area (Area), basin shape coefficient (Bs), basin average slope (Slope), and basin relief amplitude (RA). The results of the Kendall tau test and the Spearman rank-order correlation test show that the flood recession coefficient is influenced by different factors in different Vi. When the decrease rate of rain intensity is less than 0.2 mm/h2, the relationship between rainfall and flood recession is the closest. When the decrease rate of rain intensity is higher than 0.2 mm/h2, the flood recession process is most closely related to the underlying surface factors.

Backward elimination was used to compare the importance of each variable on R. The results show that the decrease rate of rain intensity is the most influential factor in flood recession when Vi is less than 0.2 mm/h2. The average slope of the watershed is the most influential factor in the flood recession process when Vi is between 0.2 and 0.5 mm/h2. The area of the watershed has the greatest influence on the recession process when Vi is beyond 0.5 mm/h2.

In summary, although both rainfall and underlying surface conditions impact flood recession processes, they play different roles in different rainfall conditions at different watersheds. We classified flood recession processes in small watersheds based on diverse rainfall characteristics and clarified the dominant factors in different categories. The results of this study are useful for determining the flood recession coefficient and could be used as a reference for flood forecasting and floodwater resource use. When heavy rain or great amount of rainfall occurs, basin slope and area will successively become the dominant factor of flood recession processes. For small watersheds with large average slopes, the recession process is very fast, and special attention should be paid to flood risk control. For large watersheds with small average slopes, however, the recession process will be slower, and more consideration should be given to the use of floodwater resources. More research is needed to further strengthen the view of this study.