Study area
The study area is situated in south-western Poland, in the upper Nysa Kłodzka River catchment, controlled by gauge Kłodzko (Fig. 1). In terms of geology, KV is a longitudinal tectonic ditch, separating the Central and Eastern Sudetes (Kondracki 2013). To the west KV borders with the Bystrzyckie Mountains, and to the east with the Śnieżnik Massif, the Złote Mountains, and the Bardzkie Mountains. The northern border of the study area is not clearly defined, and the Lower Ścinawka River and the Noworudzkie Lowering are considered extensions of KV (Kondracki 2013). The study area shows considerable differences in altitude: its highest point is the Three-seas Peak (1145 m a.s.l.), while the lowest is located in the Kłodzko town (280 m a.s.l.) (Fig. 1). KV has an undulating and hilly mid-mountain relief, the important features of which are clearly marked morphologically river valleys and various stages of development of the river channel system (Szalińska et al. 2008). Its geology exhibits little variability; the research area is mainly built of the pre-Cambrian metamorphic rocks and sedimentary rocks of the Cretaceous period (Szalińska et al. 2008).
The 182-km long Nysa Kłodzka is the trunk river of KV. It is the left tributary of the Odra (Oder) River, which is the second-longest river of Poland. The Nysa Kłodzka River originates on the slopes of the Three-seas Peak. Initially, the river runs along the Upper Nysa Ditch, which is its natural drainage channel (Staffa 1993). Then, it cuts through KV as a mountain river, and after breaking through the Bardo Mountains it flows from the study area and becomes a meandering, lowland river. Its main tributaries in KV are: the Wilczka (18.2 km), Bystrzyca (25.5 km), Biała Lądecka (52.7 km), and Bystrzyca Dusznicka (33 km) rivers. Their regime is nival-pluvial and pluvial-nival (Wrzesiński 2016, 2021). According to Perz (2019), rivers of KV have two of five types of regimes: type 2—nival moderately developed (the upper section of the Nysa Kłodzka above gauge Międzylesie, the Bystrzyca and the Bystrzyca Dusznicka rivers), and type 4—nival-pluvial regime (the Nysa Kłodzka River below gauge Międzylesie, the Biała Lądecka and Wilczka rivers) (Fig. 1, Table 2).
The KV rivers are susceptible to catastrophic floods triggered by sudden or prolonged rainfall and thaws, and enhanced by a large gradient of the river channels, and also the topography and geology of the sub-catchments. For example, studies of Rutkowska et al. (2016) reveal that the subsoil in the Bystrzyca River sub-catchment is largely made of low-thickness loams with moderate water permeability, which contributes to increasing surface runoff and faster formation of flood waves. Among the most disastrous floods in KV, the “millennium flood” in July 1997 resulted in a huge material losses and a number of victims not only in the Kłodzko region, but also in areas located along the Odra River, including the city of Wrocław.
According to RZGW in Wrocław (2013), the upper part of the Nysa Kłodzka River catchment is situated in the so-called Kłodzko climatic region of the Sudetes climate district. The lowest annual average temperature (4.9 °C) in KV is recorded in the Bystrzyckie Mountains, while the highest (above 8 °C) in the foreland of the Opawskie Mountains (RZGW in Wrocław 2013). Precipitation is differentiated spatially, with its relatively higher annual values in the mountainous southern, western and eastern parts of the study area, in particular in the Bystrzyca Dusznicka and Wilczka and the upper Biała Lądecka sub-catchment (Figs. 1, 2A). Noticeably lower precipitation is recorded in the less elevated, central and northern parts of KV (Fig. 2A). Precipitation is also diversified in terms of the deviation of its values recorded at individual rain gauge stations in relation to the average precipitation in the whole catchment area (Fig. 2A); the highest annual precipitation totals are recorded in rain gauge station Zieleniec (1250.9 mm), while the lowest in Kłodzko (591.4 mm) (Table 1). Moreover, precipitation in KV exhibits apparent variations on the multi-annual and monthly bases.
Table 1 Basic characteristics of analysed rain gauge stations in 1974–2013 Runoff in the whole Nysa Kłodzka catchment controlled by gauge Kłodzko is 382 mm (Table 2), and it is spatially diversified. Among the analysed sub-catchments, the highest runoff is typically recorded in the Wilczka River sub-catchment (740 mm), while the lowest in the Bystrzyca Dusznicka River sub-catchment (401 mm) (Table 2). Taking into account, the differential catchments (see explanation under the Fig. 2 caption), the lowest runoff is in the differential catchment of the Nysa Kłodzka River between gauges Bystrzyca Kłodzka II and Kłodzko (only 108 mm) (Fig. 2B).
Table 2 Basic characteristics of analysed river gauges in 1974–2013 Figure 2B also shows the runoff structure, divided into the surface and underground runoff. In most of the studied catchments the structure of runoff is similar—the percent share of the surface and underground runoff is about 50% each. However, the upper section of the Nysa Kłodzka River, controlled by gauges Międzylesie and Bystrzyca Kłodzka II, clearly differs from this pattern—in that area the surface runoff noticeably prevails, accounting for over 65% of the total runoff.
Data sets
Values of precipitation and runoff collected in KV in the multi-annual period 1974–2013 are the basis of this research. The data were recorded at 11 rain gauge stations (Fig. 1, Table 1) and at eight water gauge stations (Fig. 1, Table 2).
It has to be noted that for the purpose of interpolation of the average precipitation totals, data from the Niemojów rain gauge station were used, however, they were not included in the synchronicity analysis because of location of that gauge beyond the analysed area (Fig. 1).
All data sets were obtained from the resources of the Institute of Meteorology and Water Management—National Research Institute in Warsaw, Poland.
Methods
Interpolation of data
Precipitation values were interpolated using open-source R package MACHISPLIN (Brown 2020). This R package interpolates noisy multivariate data through machine learning ensembling of up to six algorithms: boosted regression trees, neural networks, generalized additive model, multivariate adaptive regression splines, support vector machines, and random forests. It allows to simultaneously evaluate different combinations of the six algorithms to predict the input data. During model tuning, each algorithm is systematically weighted from 0 to 1 and the fit of the ensembled model is evaluated. The best performing model is determined through k-fold cross validation (k = 10) and the model that has the lowest residual sum of squares of test data is chosen. After determining the best model algorithms and weights, a final model is created using the full training dataset. Residuals of the final model are calculated from the full training dataset and these values interpolated using thin-plate-smoothing splines. This creates a continuous error surface and is used to correct most the residual error in the final ensemble model (Brown 2020). Such a described method (based on machine learning algorithms) has been used in recent research regarding precipitation interpolation and found to be reliable (Guo et al. 2020).
In the first step, the annual values of precipitation recorded in 1974–2013 at individual rain gauge stations were transferred into a shapefile, in which each of the stations was properly designated spatially. Then, the precipitation data were interpolated, separately for each year. In that way, 40 raster files were obtained, covering the area larger than KV. Each of them was used to receive the areal sum of precipitation, independently for each analysed sub-catchment (Fig. 1, Table 2). The calculated in this way areal annual precipitation totals were then arranged in the chronological data sequences, reflecting the variability of precipitation in 1974–2013, used in further analyses.
Synchronicity values were interpolated using simpler method, i.e. the Inverse Distance Weighted (IDW) interpolation method. This method is based on the functions of the inverse distances, in which the weights are defined by the opposite of the distance and normalized, so that their sum equals one (Ly et al. 2013). The weights decrease with the increase of the distance.
Synchronous and asynchronous occurrence
In this study, the two-dimensional (bivariate) Archimedean copula functions have been used. Copulas have been defined by Sklar (1959), as a joint distribution function of standard uniform random variables. Copulas are proven as a powerful tool for multivariate analysis of nonlinearly interrelated hydrological and meteorological data (Fan et al. 2017). The Copula functions have been widely applied in hydrological analyses, including recent studies on synchronicity of the maximum runoff and its spatial differentiation in the Warta River catchment (Perz et al. 2020), synchronicity of the maximum and mean flow in the Upper Indus River Basin (Sobkowiak et al. 2020), and the rainfall–runoff relations in the Nysa Kłodzka River catchment in Poland (Perz et al. 2021).
In the research, copulas have been applied to analyse the synchronicity and asynchronicity (probability of synchronous and asynchronous occurrence) between:
-
Precipitation totals recorded in rain gauge stations (PRGS) and average areal precipitation totals for the whole KV (PKV), acquired through interpolation (see Sect. “Interpolation of data”)—results are presented in Sect. “Synchronicity of precipitation”,
-
Runoff totals recorded in sub-catchments (RSC) and runoff totals recorded in water gauge Kłodzko (RKV, describing KV as a whole)—results are presented in Sect. “Synchronicity of runoff”,
-
Areal precipitation totals acquired through interpolation for each sub-catchment (PSC) and runoff: (1) recorded in water gauge closing the same sub-catchment (RSC), and (2) recorded in water gauge Kłodzko (RKV)—results are presented in Sect. “Synchronicity of precipitation and runoff”.
The first step was to select the best matching statistical distributions (among Weibull, Gamma, Gumbel, and log-normal) for the analysed data sets. To estimate values of distribution parameters the maximum likelihood method was used. The goodness-of-fit was checked with the help of the Akaike information criterion (AIC) (Akaike 1974):
$$ {\text{AIC}} = {\text{Nlog}}\left( {{\text{MSE}}} \right) + {2}\left( {{\text{No}}.{\text{ of fitted parameters}}} \right), $$
(1)
where MSE is the mean square error, and N is the sample size, or
$$ \begin{aligned} {\text{AIC}} & = -\, {\text{2log}}\left( {{\text{maximum likelihood for model}}} \right) \\ & + {\text{2}}\left( {{\text{No}}.{\text{ of fitted parameters}}} \right). \\ \end{aligned} $$
(2)
The distribution type with the minimum AIC value is the best fitted (Akaike 1974).
In the next step, the joint distribution of compared data sets was constructed. It was made for paired data sets mentioned in bullets above. Analysis was made only for pairs of hydrologically connected precipitation and runoff data sets, to avoid a situation in which accidental statistical relations would be analysed.
In general, a bivariate Archimedean copula can be defined as:
$$ C_{\theta } \left( {u,v} \right) = \, \phi^{ - 1} \left\{ {\phi \left( u \right) + \phi \left( v \right)} \right\}, $$
(3)
where u and v are marginal distributions, the θ, subscript of copula C, is the parameter hidden in the generating function ϕ, and ϕ is a continuous function called a generator that strictly decreases and is convex from I = [0,1] to [0, ϕ(0)] (Nelsen 1999).
Many copulas belonging to the Archimedean copula family can be used when the correlation between analysed data sets is both positive or negative, what was proved e.g. by Genest and Favre (2007). For this reason, the Clayton, the Gumbel–Hougaard and the Frank copula families (which are one-parameter Archimedean copula functions) were applied in this research. Equations of copula functions, parameter space, generating function ϕ(t), and functional relationship of Kendall’s τθ with a copula parameter for selected single-parameter bivariate Archimedean copulas can be found e.g. in paper of Perz et al. (2021).
The AIC was used to select the best-fitted joint distribution through comparison to the empirical joint distribution.
For each pair of compared data series, 5000 hypothetical values were generated at random, on the basis of previously computed statistical distribution parameters of marginal data sets. These values were used for selecting of the best-fitted copula family for each pair of compared data sets and, in consequence, for the forming of an appropriate function.
The above-described steps resulted in calculating the synchronicity and asynchronicity, i.e. the degree of probable synchronous and asynchronous occurrence, of compared data sets. The generated hypothetical value pairs were analysed in terms of 62.5% and 37.5% probability levels (Gu et al. 2018; Zhang et al. 2014), what led to designation of nine sectors (Table 3). These sectors show different relations between calculated probable values of compared data sets—three sectors (No. 1, 5, 9) with the synchronous occurrences and six sectors (No. 2, 3, 4, 6, 7, 8) with the asynchronous occurrences of compared data sets were designated (Table 3).
Table 3 Designation of sectors The degree of synchronicity (e.g. between compared precipitation and runoff data sets) is the percentage share of generated points in sectors 1, 5, and 9 in total amount of generated points. The asynchronicity was divided into two types:
-
Moderate, which shows “low-medium”, “medium–low”, “medium–high” and “high-medium” relations (sectors 2, 4, 6, 8) and
-
High, which shows “high-low” and “low–high” relations (sectors 3 and 7).
In other words, the synchronicity and asynchronicity (i.e. probability of synchronous and asynchronous occurrences) of analysed variables were determined through a calculation of threshold values of probability ranges:
-
Probable values with a probability of exceedance of < 62.5% were designated as LA/LB;
-
Probable values with a probability of exceedance in a range > 62.5% and < 37.5% were designated as MA/MB;
-
Probable values with a probability of exceedance > 37.5% were designated as HA/HB.
The sum of degrees of synchronicity and asynchronicity is always 100%.
For example, the occurrence of “high” areal precipitation in a given sub-catchment (HPSC) is a synchronous event if in the same time unit “high” runoff from the sub-catchment (HRSC) occurs.
If the synchronicity of PSC and RSC in a given catchment is 70%, this means that in seven out of ten years, the probable PSC is within the same probability range as the probable RSC.
In turn, the asynchronous event can be exemplified by the occurrence of HPSC (e.g., a “20-year precipitation”, p = 5%) and the occurrence of LRSC (e.g., at the level of exceedance probability p = 80%) in the same catchment. As in example above, if the synchronicity is 70%, so the asynchronicity is 30%, what means that statistically the asynchronous event should occur average three times for every ten years.
In the “Results” section, the term “synchronicity”/”asynchronicity” refers to the synchronous/asynchronous occurrence (co-occurrence probability) of the analysed values.
Structure of water balance
The structure of the water balance can be estimated based on the runoff coefficient (C) (Dynowska and Pociask-Karteczka 1999), which represents the integrated effects of infiltration, evaporation, retention, and interception, all of which affect the runoff volume. The runoff coefficient is the ratio of the amount of water flowing out of the catchment area in the time unit to the amount of water that at the same time falls down in the form of precipitation within the catchment boundaries. It determines the percentage of precipitation that flows out of a catchment. These calculations were made not only for the empirical average values, but also for the probable ones.