1 Introduction

Observed flood flows have for years constituted the basis for the probability estimation of high flood quantiles or exceedance probabilities, values used in the design of hydrotechnical structures, and also in the protection of river-side areas against floods. In the literature one can find numerous studies devoted to that problem. Starting from the nineties of the last century, from moment of publication of the Hosking (1990) paper on probability weighted moments, the number of publications in which distributions of maximum flows were estimated grew rapidly. Their extensive analysis, supported with numerous application examples, is described in papers from the turn of the 20th and 21st centuries. Vogel and Wilson (1996) demonstrated the application of probability weighted moments at nearly 1500 catchment basins in the United States. Katz et al. (2002) used the method of the maximum likelihood to demonstrate that trends in extreme hydrological conditions can be routinely included in analyses of extreme values, with predicted intensification of hydrological cycle within the framework of global climate change. Subsequent studies devoted mainly to non-stationarity of annual maximum flows also include significant introductions describing the applied methods of estimation of maximum flows Xiong et al. (2015).

The methodology of estimation of maximum flows can be divided into two parts (Kidson and Richards 2005):

  1. 1.

    Flood Frequency Analysis (FFA) – only one observation of maximum flow is taken from each hydrological year. On the basis of observations from a multi-year period, assuming their homogeneity, distributions of probability of maximum flows are estimated. Well fitted distributions allow the determination of high quantiles of non-exceedance, treated as n-year waters – flows occurring on average once per n years.

  2. 2.

    Peak Over Threshold (POT) – the method proposed in 1970 by Teodorović and Zelenhasić (1970) (see also Hosking and Wallis 1987 or Mudelsee 2010) requires the determination of all flood events above an adopted threshold. More about threshold level selection can be found in the next manuscripts (Gharib et al. 2017) or (Roth et al. 2015). In this method the result of the analysis can be either the volume of the flood event or the highest flow value in the flood. The POT method requires the determination of parameters that exclude dependent flood events or those of small volume, insignificant from the viewpoint of maximum flow analysis (Teodorović and Zelenhasić 1970). The determination of quantiles of annual maximum flows requires also the estimation of the distribution of the number of flood events in a year.

A detailed review of the application of the above methods is given in reference (Khaliq et al. 2006).

Both methods require the determination of the class of theoretical distributions that correctly describe the maximum flows. More information on various kinds of uncertainties that appear during the estimation of extreme flow distributions can be found in reference (Yen 2002). In the case of the POT method the choice of distribution is limited to either a generalised extreme value distribution GEV or a generalised Pareto distribution (Wang 1991). Both of those distributions are related with the probabilistic theory of extreme distributions (Coles 2001). In the case of the first of those models, the list of the distributions used is more extensive (Kidson and Richards 2005; Maidment 1993; Cassalho et al. 2018) and includes about 10 distributions of various types, beginning with the shifted gamma distribution (Pearson type III), and ending with GEV type distributions. The choice of the class of distributions depends largely on the local conditions. In the USA, the basic distribution in use is the log-Pearson distribution (Bobee 1975), in Poland the set of allowable distributions includes the following: Pearson type III, log-normal, Gumbel (Fisher-Tippett type I) and GEV (Węglarczyk 2015). The choice of a distribution for further use in e.g. designing a hydrotechnical structure is, unfortunately, a subjective one, and in ambiguous cases additional measures of fit are applied, e.g. the Akaike criterion (Akaike 1974).

In standard procedure the choice of distribution is always preceded by the assumption of sample homogeneity. So far it has been assumed that observed maximum flows originate from a simple sample, and if not – the factor affecting maximum flows is primarily climate change (Xiong et al. 2015; Kundzewicz et al. 2005; Yang and Hill 2012; Cannon 2010). The determination of existing trends allows their filtering and processing the data as homogeneous data. This means that it is assumed that all kinds of trends related with changes in the supply of a water course with precipitation and underground waters, changes in the land use of the catchment basin, long-term fluctuations related with climate change, and in the case of POT method also seasonal changes, have no effect on the observed maximum flows. However, the observed annual maximum flows include distinct freshets generated by various processes occurring in the catchment basin. There appears the problem of genetic heterogeneity. To take it into account in a probabilistic model, it is assumed that the phenomenon of a maximum flow is caused by two different processes. This means that the sought distribution is either a mixture of two probability distributions (Hess et al. 2005) or their product (Frances 1998). The above methodology is recently more and more frequently used in statistical modelling of population distributions in various branches of science, e.g. in economics – (Engle and Lunde 2003), in environmental sciences – (Kollu et al. 2012), or in ecology – (żyromski et al. 2016) or (Szulczewski et al. 2018). In this study we limited ourselves to the analysis of mixture distributions of the form:

$$ F(x)=pF_{1}(x)+(1-p)F_{2}(x) $$
(1)

estimating them for annual maximum flows obtained with the FFA method; F1, F2 are component probability distributions, while p ∈ [0,1] is a mixture parameter.

In the literature, for the estimation of maximum flows with the use of formula (1) various sets of component distributions are used. However, every time it is assumed that both of them describe events with their full variation. One of the first studies using a mixture distribution was presented by Hawkins (1974). In that study it was assumed that the components have a normal distribution. In subsequent studies the assumption on the form of the distributions was changed. In 2002, Alila and Mtiraoui (2002) assumed that both distributions in the mixture were log-normal. Their fitting to empirical data from Arizona (USA) was compared by fitting to standard distributions (Pearson type III, log-normal, GEV, and five-parameter Wakeby). In 2007, in two publications, Escalante Sandoval estimated distributions of maximum annual flows by a mixture of two Weibull distributions – (Escalante-Sandoval 2007a), and using EV1 and GEV – (Escalante-Sandoval 2007b). The mixture of EV1 and GEV was estimated with the maximum likelihood method, using the Rosenbrock algorithm. In 2009, in a study by Calenda et al. (2009), a mixture of normal distribution and EV1 was used for the estimation of maximum flows in river Tiber. And finally, in a publication of 2017 (Stojković et al. 2017), the authors proposed a set of mixtures of two distributions of the same type with each other. They studied mixtures of GEV, EV1, Pearson type III, and log-Pearson distributions.

The basic problem in the estimation of mixture distribution is the method of estimation of the parameters. In the case of the FFA method the number of observations is limited, while the number of estimated parameters of distribution (1) is doubled, relative to the classic methods. The literature provides examples of the application of various methods, beginning with the method of moments (Hawkins 1974), through the maximum likelihood method, making use of the Rosenbrock algorithm for the estimation of parameter p of the mixture, to Expectation Maximisation algorithm (Vaidyanathan and Vani Lakshmi 2017) in the estimation of distribution (1) as a mixture of two gamma distributions.

In this paper we propose a new method for statistical description of maximum flows with the FFA method. They are modelled with the use of distribution (1), constructed so as to comply additionally with a condition of genetic character. It is postulated that maximum annual flows are the result of operation of two hydrological processes with significantly different, not necessarily separable, intervals of flow variation.

2 Material and Method

The proposed probabilistic model of maximum annual flows has been subjected to verification in catchment basins of upper and central river Odra (Fig. 1). The area is particularly interesting (Migoń 2010). The left-hand tributaries flowing out of the massif of the Sudetes are mountain rivers, and in their lower section – submontane rivers. The right-hand tributaries, on the other hand, are lowland rivers. In addition, above the city of Wrocław, river Odra itself is partially regulated. This causes that in the area of that section of river Odra there is a frequent occurrence of extreme hydrological events that are the result of the effect of variable natural and anthropogenic environment. Some of them take place in the higher parts of the Sudetes, and thus they have a limited range, but other ones affect developed and densely populated regions. The catalogue of extreme events includes various components of the hydrological cycle. The extreme meteorological events include torrential rains, or rainfalls of moderate intensity but long duration. The catchment basin of upper and central river Odra is also known for its rapid disappearance of snow cover in the winter period, caused by the foehn effect.

Fig. 1
figure 1

Catchment basin of upper and central river Odra

In consequence there appear extreme hydrological phenomena, and under certain circumstances also geomorphological ones. Freshets on the mountain tributaries of river Odra, caused by precipitations or thaws, are frequent and often cause serious damage in riverside localities and areas. Violent floods cause also events of geomorphological character, leading even to remodelling of the relief of the valley bottoms. Rivers of the lowland part of the catchment basin, flowing through agricultural areas, flood mainly in the spring, during the dynamic process of disappearance of snow cover over large areas.

The paper presents nine examples of specially selected rivers and water gauges, chosen so as to represent the diverse conditions of meteorological, hydrological and geomorphological character (Table 1).

Table 1 Characteristics of the selected gauge stations

The water gauges in Międzylesie, Wilkanów and Kamienna Góra are situated in mountain catchment basins in which in recent years violent floods were noted that were underestimated with the standard methods.

In this study it was assumed that the maximum annual flows obtained with the FFA method (input data for the model) are not homogeneous. It was assumed that their values are the result of two different kinds of hydrological processes. They can be classified as maxima, the value of which is related primarily with the long-term condition of the catchment basin. Such maximum flows usually fall within the ranges of medium water levels and do not have a flood character. The second class are actual flood flows, often related with water overflowing the river bed. The two classes are not separable, and in a majority of the water gauge transects it is difficult to indicate a threshold that would separate them univocally. In relation to the above, we sought a mixture distribution that would be universal enough to take into account two kinds of maximum annual flows, and flexible enough to comprise possibly the broadest spectrum of distributions used so far. A detailed analysis concerning the distributions of the mixture components was conducted. Due to the character of the analysed catchment basin it was assumed that the GEV distribution is one of the components, responsible for high flows. The second component of mixture (1) that was subjected to analysis included the following two-parameters distributions: normal, log-normal, gamma, and the Weibull distribution. The compatibility tests conducted demonstrated that only a mixture of gamma and GEV distributions has the best statistical fit. In the cases under analysis, 100% fit effectiveness was obtained for the mixture of gamma and GEV distributions, while the other mixtures did not exceed the level of 50%. In such a case, distribution (1) can be reduced to the form:

$$ F_{\text{MIX}}(x)=p{\Gamma}(x,\alpha,\beta)+(1-p)\text{GEV}(x,\mu,\sigma,\xi) $$
(2)

The proposed distribution (2), further on referred to as MIX, requires the estimation of 6 unknown parameters.

As it is easy to notice, the analysed distribution (2) determines a certain family of distributions. Depending on the values of the estimated parameters, it can be reduced to a two-parameter gamma distribution, a three-parameter GEV, or a two-parameter Gumbel distribution (Fig. 2). With the engineering postulate of choosing one type of distribution for the whole country (Węglarczyk 2015), the MIX distribution (2) appears to be a good proposal. Its flexibility, presented in Fig. 2, allows the estimation of distributions of maximum annual flows in mountain, submontane and lowland rivers, characterised by a high diversity of kinds of flows.

Fig. 2
figure 2

Scheme of possible distributions realized by (2)

The estimation of the unknown parameters of distribution (2) was conducted with the maximum likelihood method. In this case, the likelihood function

$$ L(\theta)=L(p,\alpha,\beta,\mu,\sigma,\xi)=\prod\limits_{i = 1}^{n} f_{\text{MIX}}(x_{i}) $$
(3)

depends on six parameters, where n is the number of observations, and xi, i = 1,…, n the observed maximum annual flows.

The global maximum of the likelihood function was determined using the genetic algorithm of searching for the global extreme of multivariate functions developed by Price et al. (2005). The very definition of distribution (2) indicates that the likelihood function can be multimodal, which was also demonstrated by the calculations performed. Figure 3 presents a two-dimensional graph obtained in the estimation of the distribution of maximum flows at water gauge Międzylesie (Nysa Kłodzka river).

Fig. 3
figure 3

Likelihood function (3) plotted with the assumption of constant values of estimators \(\hat {\alpha }, \hat {\beta }, \hat {\sigma }, \hat {\xi }\); water gauge Międzylesie, Nysa Kłodzka river

The behaviour of the estimated mixture distribution (2) on the example of water gauge Trestno (Odra river) is presented in Fig. 4. Part (a) presents graphs of the density functions of the mixture distribution and its components. One can easily note the range of maximum flows in which the population is estimated by both components of the mixture. Part (b) presents the distribution functions of the mixture, its components, and the fit to observations.

Fig. 4
figure 4

Estimated distribution of the MIX (2) of maximum flows, river Odra, water gauge Trestno

The same example of water gauge Trestno was used to present the transition zone between flows generated by medium flows – estimated by a gamma distribution, and flood flows, estimated by a GEV distribution (Fig. 5). That interval (approximately in the range from 950 to 1100 m3) is situated above the zone of medium states.

Fig. 5
figure 5

Analysis of the influence of component cumulative distributions on the resulting cumulative distribution of the MIX (2), river Odra, water gauge Trestno

With regard to hydrological applications, it is interesting to determine the intervals of confidence for quantiles of exceedance. They were determined with the use of the delta method (Coles 2001). In accordance with that method, the estimator \(\hat {x}_{q}\) of the quantile of maximum annual flow has normal distribution with a mean xq and variance V. In the formal notation, for probability π we obtain:

$$ \hat{x_{q}}=F_{\text{MIX}}^{-1}(\pi) \sim N(x_{q},V). $$
(4)

In this method the variance V in the formula (4) is calculated as follows:

$$ V=\bigtriangledown {x_{q}^{T}}W_{\theta}\bigtriangledown x_{q},\qquad \bigtriangledown x_{q}=\left[\frac{\partial x_{q}}{\partial p},\ldots,\frac{\partial x_{q}}{\partial \xi}\right]^{T}, $$
(5)

where the matrix W𝜃 is approximated by a matrix reverse to the expected information matrix:

$$ I_{O}(\theta)=\left[\begin{array}{lllll}-\frac{\partial^{2}}{\partial p^{2}}l(\theta)&\ldots & -\frac{\partial^{2}}{\partial p \partial\xi}l(\theta)\\ \vdots &&\vdots\\ -\frac{\partial^{2}}{\partial p\partial\xi}l(\theta)&\ldots & -\frac{\partial^{2}}{\partial\xi^{2}}l(\theta) \end{array}\right] $$
(6)

calculated for the estimated values of the parameters.

In the applications presented below, the values of the information matrix (6) were determined numerically, while the derivatives in formula (5) were calculated on the basis for formulae presented in the attached AAppendix. The calculations were performed using the program Mathematica 9, Wolfram Research Inc., and our own original software.

3 Results and Analysis

In the nine water gauges described above, the parameters of standard distributions (log-normal, Pearson type III and GEV) were estimated with the method of the maximum likelihood, and compared with the distribution of the MIX, also estimated with the method of the maximum likelihood. The results of the estimation – adequate p-values of the χ2 goodness-of-fit test and corresponding values of mean absolute relative error (MARE) given in %, are presented in Table 2. The MARE is determined between observed flows exceeding the value of the median and their equivalents calculated from the estimated MIX distribution. The measure of model fit error defined in this manner estimates quantitatively high flows that are of the highest importance in engineering practice. For maximum annual flows observed on a given water gauge the goodness-of-fit χ2 tests were conducted for a predetermined and constant division into classes, irrespective of the tested distribution. The p-test values given in Table 2 are obviously dependent on the number of estimated parameters. In the case of the MIX distribution, it is much more difficult to work with the doubled number of parameters in trying to fit the mixture distribution.

Table 2 Goodness-of-fit of the χ2 tests and mean absolute relative error (MARE) of the estimated distributions with maximum flows at selected water gauge stations

In addition, also indicated in Table 2 are those water gauges for which the estimated parameter \(\hat \xi \) of the GEV and MIX distributions equalled 0, was higher than 0.5, or higher than 1.0. This is significant in view of the existence of GEV distribution moments. In the case when \(0.5\le \hat \xi <1.0\) is it not possible to determine its variance, while when \(\hat \xi \ge 1.0\) it is also impossible to determine its mean value.

Analysis of the results obtained allows to draw the conclusion that the application of the MIX distribution improved the goodness of fit. Contrary to the standard distributions, at none of the analysed water gauges there were grounds to reject the hypothesis of good fitting of annual maximum flows with the mixture distribution. This is firmly supported by the MARE measure of fit, an index which is very close to the engineering intuition. Regarding the goodness of fit of the applied distributions, the water gauges presented in Table 2 can be classified in three groups.

  1. I

    One of them groups those gauging stations at which the standard methods work correctly. In this group the level of p-values of χ2 test, calculated for the standard distributions, did not provide justification for rejecting the goodness-of-fit hypothesis. In such a situation it was up to the decision maker to choose which distribution to use in engineering practice. In the examples presented in Table 2, this group includes the water gauges at Kamienna Góra, Ścinawa and Kłodnica, and those at ŁaŻany and Zbytowa.

  2. II

    The second one groups two water gauges (Międzylesie and Wilkanów), for which the hypothesis of good fitting of maximum flows with the Pearson type III distribution should be rejected. The tests of conformance of the remaining standard distributions, as well as the MIX distribution, did not provide grounds for the rejection of the conformance hypothesis. In this group the value of the estimator of parameter \(\hat \xi \) of the MIX distribution was alarming. In both cases it significantly exceeded the value of 1.0. This means a lack of the expected value in the GEV distribution, and thus serious difficulties in the assessment of variability of the estimated quantiles.

  3. III

    The third group, i.e. those water gauges where all the standard methods indicated the necessity of rejecting the tested hypotheses. In the selected set of water gauges, the third group includes two highly different lowland catchment basins: Trestno, Odra river, and Korzeńsko, Orla river. Only the MIX distribution provides a correct fit.

The most important application of the proposed probabilistic model is the inference from analysis of quantiles determined for the purposes of design of hydrotechnical structures, flood protection etc. Figure 6 presents graphs of probability distributions from four water gauges included in Table 2. They provide a comparison of the best fitting standard distribution and the MIX distribution with appended one-sided area of confidence determined by a quantile of the order of 0.84 (standard value in engineering applications). In the case of water gauge Trestno, due to the lack of fit none of the standard distributions was plotted. The horizontal axis in Fig. 6 was specially formatted in the scale of probability so as to facilitate the interpretation of the results for engineering applications. In Fig. 6a – water gauge ŁaŻany, on the graph of the boundary of the area of confidence determined by the MIX distribution one can observe its local maximum. It is related to the area of determinacy of the GEV distribution when the estimated value of its lower limit falls within the range of variation of low maximum annual flows.

Fig. 6
figure 6

Distribution curves and confidence areas of the non-exceedance quantile of the best-fitting standard distributions and the MIX distribution for chosen water gauges: a-ŁaŻany, b-Trestno, c-Ścinawa, d-Zbytowa

Frequently considered in engineering practice are qualtiles of flow corresponding to the probability of occurrence of 1% of maximum flow, together with its interval of confidence. Those values imply the costs of various kinds of hydrotechnical, flow protection and similar investment (see e.g. Mogollon et al. 2016). Therefore, it is important that they are estimated correctly, so that the estimate is in conformance with engineering intuition and practical knowledge. In this context, Table 3 compiles results obtained with the use of the standard models and the MIX for nine test water gauges. For each of those cases one column presents the 1% quantiles of maximum flow, while another column shows their 84% limits of intervals of confidence. In the case of the standard methods, according to the results presented in Table 3, if there were no grounds for the rejection of the goodness-of-fit hypothesis, a maximum of two best-fitting distributions were selected. In the case of a lack of possibility of estimating a quantile, a dash sign was used. In addition, the last column of the Table 3 presents the highest observed values in the analysed set of maximum annual flows. Those values are given as a kind of reference to the one-hundred-year flood obtained from estimation, and in consequence – to the possibility of conducting a qualitative analysis of fit that is obtained using the standard methods and the proposed MIX distribution. Comparison of those values in the particular cases allows to note that the standard methods often cause a large underestimation or a far-reaching extrapolation. Each of those cases is undesirable in practice. The first may cause e.g. the building of flood levees that will not perform their role, while the second – an absurd increase of the costs of the investment. Apart from a few exceptions, the basic conclusion that follows from Table 3 is that the behaviour of the MIX is characterised by a high stability relative to the results obtained by means of the 3 classic three-parameter distributions. This is supported by the MARE index which is closer to engineering intuition (Table 1).

Table 3 One-percent quantiles and their 84% confidence areas of maximum flow at selected water gauges

However, in the case of the MIX a big problem is the occasional load with the heavy tail of the GEV distribution. In this case there is theoretically no possibility of determination of the areas of confidence for maximum flows with a high probability of non-exceedance.

Unfortunately, an important aspect in the use of the standard methods is the necessity of subjective selection of a statistically substantiated distribution for the estimation of values that determine investment decisions. This is not an easy situation for the decision-makers. Table 3 presents two examples of such water gauges: ŁaŻany and Kłodnica, where depending on the choice of distribution the estimations of quantiles of one-hundred-year flood and their areas of confidence differ at least two-fold. The model of the MIX distribution does not permit such extremely different estimations and, what is more, within the range in which it has been tested, it is much more effective in operation.

4 Conclusions

The MIX distribution (2) proposed in the paper has been analysed against the background of distributions used so far. The comparison was made on the base of observations of maximum annual flows in specially selected water gauges in the upper and central river Odra. They are characterised by a large diversity of natural and anthropogenic conditions. Based on the results obtained we can formulate the following conclusions:

  1. 1.

    The MIX distribution, even in hydrologically difficult cases, where standard methods fail, provides statistically correct estimates, although there are situations in which it is not possible to determine the area of confidence of the tested quantile.

  2. 2.

    In the case of water gauges where the standard distributions give correct estimates, the MIX distribution provides similar values.

  3. 3.

    The MIX distribution takes into account the genetic heterogeneity of observed maximum annual flows – it can be the basic tool in the work of the hydrologist.

  4. 4.

    The MIX model allows to unify the statistical procedure and inference.

  5. 5.

    As opposed to the standard methods, in which statistically justified estimations of high quantiles of non-exceedance are characterised by unacceptably high variation, the MIX model displays a more rational behaviour.

In spite of having conducted an analysis of the effectiveness of the proposed method on several dozen water gauges on Polish and European rivers, the authors realize that the method still needs further testing and verification. The applied genetic algorithm of estimation of the parameters of the MIX, employing the search for the global maximum of a multivariate function, does not ensure the obtainment of the maximum value of the target function. As in this algorithm the initial population is selected at random, the chances of obtaining erroneous values of the estimated parameters were reduced by multiple repetition of the calculation process.