1 Introduction

Hydrological projections of climate change for the Rhine basin are a valuable asset in water management, impact modeling, and decision-making and climate adaptation studies. To quantify projected changes in hydrological impact studies, trends and changes in past and future climate are usually assessed using global climate models (GCMs). A bias correction on the GCM outputs is essential to obtain realistic, useful and usable hydrological simulations (Wilby et al. 1998a; Fowler et al. 2007; Maraun et al. 2010). Among others (Buishand and Brandsma 1996; Hagemann et al. 2011; Goodess et al. 2012; van Pelt et al. 2012) applied downscaling methods to correct climate model outputs of meteorological variables using observation data. Although uncertainties in precipitation and temperature outputs for future periods are generally reduced by bias correction methods, a systematic analysis of the origin of the bias for present day conditions is usually not included. One known cause of systematic precipitation or temperature biases in GCMs is an error in the representation of atmospheric circulation statistics, with a potentially large implication for the reliability of the future projections of these modeling systems (Van Haren et al. 2012; Wang et al. 2014).

Precipitation bias in climate models is primarily related to coarse resolution, lack of ability to simulate explicitly local processes, misrepresentation of physical processes, all of which can in some areas be amplified by feedbacks between climate components (Wang et al. 2014). In Central Europe, model biases in sea surface temperature contribute to the precipitation bias, although precipitation changes in this area are primarily caused by circulation changes (van Ulden and van Oldenborgh 2006; van Haren et al. 2013). Depending on the area of interest and the physical processes dominating the precipitation biases, we set up a framework to perform a correction of the circulation dependent precipitation bias.

The climate in Europe—and subsequently the local climate and the hydrological response of the Rhine basin—is strongly influenced by the variability of the atmospheric circulation and the unstable nature of the North-Atlantic dynamics—especially in wintertime (Stahl and Demuth 1999; Frei et al. 2001; Haylock and Goodess 2004; Pfister et al. 2004; Tu 2006; Bárdossy 2010b; Cattiaux et al. 2012). Circulation variability plays an important role in determining changes in the temporal and spatial distributions of climatological variables such as precipitation and temperature (Wibig 1999; Slonosky et al. 2000).

Previous studies have included atmospheric variability in methods for model bias correction by conditioning existing methodologies such as quantile regression, quantile mapping and weather generators (resampling) on the governing circulation regime (Wilby et al. 1998b; Huth 1999; Clark et al. 2004; Friederichs and Hense 2007; Hundecha and Bárdossy 2008; Jagger and Elsner 2009; Bárdossy and Pegram 2011).

In this study we adapt the well-known delta factor bias correction method to distinguish between different atmospheric circulation patterns (CPs) for winter and summer precipitation variability. The proposed correction is labeled as the CP-method due to the incorporation of circulation pattern dependent bias correction coefficients. The prevailing circulation regimes in the North European region, responsible for the majority of precipitation variability in the Rhine basin, are derived using an observational record encompassing the period 1961–2005. Using the CPs, we introduce a correction to model precipitation output: the background of the bias may depend on the particular CP, and thus require a different correction. For example, high-pressure precipitation bias can be associated to model bias in wet-day frequency (Suklitsch et al. 2011), while persistent low pressure conditions may generate errors related to advection or precipitation formation (Christensen et al. 1997). The frequency of the circulation patterns from the climate models is compared to the ERA-I frequency distribution. However, no attempt was made to apply a correction to a bias in the CP frequency distribution, as we focus on the correction of the precipitation output for a specific circulation regime.

A method which does not consider circulation regimes but only corrects model precipitation outputs using the standard delta method is labeled here as NoCP-method. The CP-method introduces extra parameters in the correction because of the use of circulation statistics for each precipitation bias-depended subset. The dependence of bias correction coefficients on circulation patterns reduces the overall statistical performance of the CP-method due to the reduction of the sample size used to calibrate every bias correction coefficient. This trade-off between case-specific bias correction coefficients and reduced sample size will be discussed further below.

The circulation patterns are derived through the Maximum Covariance Analysis (MCA, Von Storch and Zwiers 1999; Polade et al. 2013), which objectively finds patterns that optimally explain the winter/summer precipitation variability of the entire Rhine basin. The frequency of occurrence of these CPs is evaluated in two ensembles of GCM simulations: the ESSENCE climate model ensemble (Sterl et al. 2008) and the CMIP5 ensemble (Taylor et al. 2012). For each model non-linear precipitation correction factors are derived. The robustness of the CP-method is tested regarding improvements in precipitation and estimated extreme discharges for the Rhine basin. First, to assess the effect of the CP-method, we compare the spread of the GCM ensembles to the uncorrected model data and the results from the NoCP method. Second, we investigate the impact of the bias correction from the CP-method on estimated discharges by comparison with discharge calculations based on uncorrected and corrected GCM precipitation data. The correction is applied on 5 day precipitation sums. The selection of 5 day sums is in alignment with the targeted hydrological application in the Rhine basin. The CP-method accounts for changes in the mean and extreme precipitation averaged over a given time interval. Kew et al. (2011) found that the bias present in ESSENCE precipitation in the Rhine basin varies with the averaging length. In particular, biases in the wet day frequency are present in 1 day precipitation sums, while in 10- and 20-days sums these biases are insignificant. Biases for the 50 and 99 % quantiles are distinct for 10- and 20-day sums. In hydrological applications, extreme discharges in the Alpine region generally result from extreme multi-day precipitation amounts in the river basin (Ulbrich and Fink 1995; Disse and Engel 2001). This has motivated our selection of 5-day intervals.

A detailed description of the MCA, the data sets used, and the bias correction method follows in Sect. 2. Section 3 presents the derived circulation patterns from the MCA analysis for winter and summer for the period 1961–2005. Also shown are the results from the bias correction and the contribution of the proposed CP correction. Section 4 discusses the results and summarizes the conclusions.

2 Materials and methods

2.1 Area and data details

2.1.1 Precipitation observations

The precipitation data set used in this study is a new high-resolution set covering 134 sub-catchments of the Rhine basin for the period of 1961–2008. The CHR precipitation set (covering 1961–1995; Sprokkereef 2001) was extended with more recent data sets from Germany, Switzerland and France and is referred to as CHR08 (Photiadou et al. 2011). Photiadou et al. (2011) showed that with the HBV-96 hydrological model, the CHR08 data set generates significantly improved extreme discharge statistics compared to alternative precipitation data sets (E-OBS and ERA-Interim precipitation data sets).

2.1.2 Sea level surface pressure data

An extended daily sea level pressure (SLP) record for 1961–2005 is constructed by combining ECMWF ERA40 (1961–1978; Uppala et al. 2005) and ERA-Interim (1979–2005; Dee et al. 2011) SLP reanalysis data. Prior to this, ERA40 SLP data were interpolated to the ERA-I grid and their SLP characteristics were compared for the overlapping years. A comparison of the pressure anomalies and their frequencies between ERA40 and ERA-I show that they are very similar, allowing the combination of the two datasets (not shown here). The extended SLP record is referred to as ERA-SLP and contains daily averaged SLP data on a common 1.5 × 1.5 latitude/longitude grid. The SLP fields cover the area −4.5°W to 25°E longitude and 32°N–63°N latitude (Fig. 1). The time period of 1961–2005 was selected to match the observed precipitation record (see below) and the historical record of CMIP5 outputs.

Fig. 1
figure 1

Sea level pressure interest domain (large rectangle enclosing gridded area), with the 134 sub-catchments of the Rhine river (grey shading, black outline) used for the Maximum Covariance Analysis. Grey dashed lines indicate the grid boxes onto which all GCMs were interpolated and the bias correction was applied. Thick solid black lines indicate the 8 grid cells covering the Rhine basin where the bias correction method is applied

2.1.3 ESSENCE

ESSENCE (Ensemble SimulationS of Extreme weather events under Nonlinear Climate changE, Sterl et al. 2008) is a 17-member ensemble global climate simulation covering 1950–2100, generated using the ECHAM5/MPI-OM coupled climate model. Simulations are carried out at a horizontal resolution of T63 (roughly 2.5°) and 31 vertical atmospheric levels, and are forced by the SRES A1b scenario (Nakicenovic and Swart 2000). Perturbing the initial state of the atmosphere, different ensemble members are formed. Using a large ensemble of climate simulations helps to distinguish internal variability from systematic trends induced by changed external forcing. The large sample also allows a better quantification of changes in extreme values of climate variables (large return periods; Sterl et al. 2008). Precipitation and sea level pressure data outputs were retrieved from the climate model.

2.1.4 CMIP5 climate models

Precipitation and sea level pressure data outputs were also retrieved from the multi-model World Climate Research Programme (WCRP) Coupled Model Intercomparison Project phase 5 (CMIP5; Taylor et al. 2012), prepared for the IPCC assessment report AR5. Here, 19 models from the CMIP5 dataset were used, having no missing values in either SLP or precipitation for the historical period (as for ESSENCE also), chosen to overlap the observational record 1961–2005 (similar climate forcings).

2.2 Methods

The proposed methodology is split into three steps (Fig. 2). The first step concerns the MCA, also known as singular value decomposition (SVD), the retrieval of the circulation patterns and their frequency, while the second step incorporates the circulation statistics derived from the MCA into the bias correction method. The third step is the hydrological application. Here, the uncorrected and the bias corrected (CP and NoCP) precipitation model outputs are used as forcing to the HBV-96 hydrological model to estimate extreme discharge for the Rhine basin.

Fig. 2
figure 2

Diagram of the bias correction and its subsequent analysis steps. Symbols and equations are explained in the methodology section. a Maximum Covariance Analysis and pattern frequency, b CP delta factors correction, c hydrological application

2.2.1 Maximum Covariance Analysis and principal components

The circulation patterns that are related to the hydroclimatological variables of the Rhine basin are obtained using MCA. MCA identifies pairs of SLP and precipitation fields, characterized by high covariance, and the time evolution of their expression in the source data [principal components (PCs)]. Several studies applied MCA to a combination of meteorological and hydrological fields to link large-scale circulation patterns to local rainfall/temperature variability (Bertacchi Uvo et al. 2001; Castaings et al. 2009; Martín et al. 2011). Extended descriptions on MCA analysis can be found in Bretherton et al. (1992), Von Storch and Navara (1995), and Von Storch and Zwiers (1999).

In our study, MCA is applied to the cross-covariance matrix of two spatial–temporal fields of daily mean ERA SLP and CHR08 precipitation in the time period 1961–2005. These fields encompass different spatial areas and resolutions (Fig. 1). The SLP domain selection encompasses atmospheric circulation systems that are potential drivers of precipitation events over Central Europe and potential flooding events in the Rhine basin (Jones and Lister 2009). To test the robustness of the MCA, the calculation of PCs using the geopotential height at 500 hPa instead of SLP led to similar PCs. The CHR08 data were weighted with the area size of each sub-catchment. Five day mean and standard deviation was selected to standardize the SLP and precipitation data.

Two seasonal periods are distinguished: winter (October, November, December, January, February, March) and summer (April, May, June, July, August, September). The MCA was implemented for both seasons separately. The relation between the two variables is explained through the constructed covariance matrix, explained below.

The notation used here is as follows: a time series is denoted by (t), a vector by a boldface letter and a matrix by a capital boldface letter. \({\mathbf{X}}_{ml}\) denotes the ERA SLP data grid points \(m = 1, \ldots ,400,\,{\mathbf{Y}}_{nl}\) refers to the CHR08 precipitation data with sub-catchments \(n = 1, \ldots ,134\) and l is the number of time steps. The cross-covariance matrix \({\mathbf{C}}_{mn}\) of \({\mathbf{X}}_{ml}\) and the transposed (T) \({\mathbf{Y}}_{nl}\) is constructed through:

$${\mathbf{C}}_{mn} = {\mathbf{X}}_{ml} \cdot T\left( {{\mathbf{Y}}_{nl} } \right).$$
(1)

The MCA of \({\mathbf{C}}_{mn}\) solves the following equation:

$${\mathbf{C}}_{mn} = {\mathbf{U}}_{mm} {\mathbf{D}}_{diag\_nn} {\mathbf{V}}_{nn}^{T},$$
(2)

where \({\mathbf{U}}_{mm}\) and \({\mathbf{V}}_{nn}\) represent the spatial anomalies of SLP and precipitation respectively, and \({\mathbf{D}}_{diag\_nn}\) the array of singular values at the diagonal of the matrix, describing the fraction of the explained covariance (EV) between the two variables. The column vectors of U and V are the singular vectors and satisfy the orthogonal relation \({\mathbf{u}}_{i} {\mathbf{u}}_{j}^{T} = {\mathbf{u}}_{j}^{T} {\mathbf{u}}_{i} = {\mathbf{I}}\) and \({\mathbf{v}}_{i} {\mathbf{v}}_{j}^{T} = {\mathbf{v}}_{j}^{T} {\mathbf{v}}_{i} = {\mathbf{I}},\) with I the identity matrix. The circulation patterns (CPs) that are well correlated to fields of precipitation variability in the CHR08 database are thus contained in U.

To construct the principal components of each mode the following equation is applied:

$$PC_{ERA} = \sum {{\mathbf{X}}_{ml} } \cdot {\mathbf{U}}_{mm}$$
(3)

where the PC ERA is the principal component of the SLP modes.

2.2.2 Projections of ESSENCE and CMIP5 SLP on MCA-derived spatial patterns

Here, we used 5 day precipitation sums and SLP fields, of ESSENCE and CMIP5 ensemble, all interpolated to the same grid (Fig. 1). The circulation dependent bias correction of the climate model ensembles ESSENCE and CMIP5 is driven by the CPs, which were derived using MCA. The PCs of the spatial SLP anomalies in the climate model ensembles are constructed by projecting U on the gridded SLP data of ESSENCE and CMIP5 models, \({\mathbf{E}}_{ml} ,\) similar to Eq. (3):

$$PC_{Mod} = \sum {{\mathbf{E}}_{ml} } \cdot {\mathbf{U}}_{mm} .$$
(4)

For each day in the climate model records a dominant CP is identified by selecting the pattern with the largest value of its principal component (PCMod). Each day is categorized according to its dominant PC (selected from the first three PCs) and its amplitude sign. This leads to 6 circulation pattern classes, CP1–CP6. Days for which another PC than one of the first three appears to be the dominant pattern are classified in a “residual” class, CP7. The most frequently occurring CP in a 5 day interval defines all the 5 days within the interval.

This leads to a frequency distribution of dominant CPs that can be compared to a respective distribution in the ERA-SLP database, allowing an evaluation of the models ability to reproduce the circulation statistics associated with the local precipitation of the Rhine basin. We derive 7 CP categories for both the observations, ERA-SLP and the model ensembles, ESSENCE and CMIP5; PC 1–3 (positive amplitudes) are CP 1–3 and CP 4–6 are the negative amplitudes of the PCs 1–3. The rest of the PCs are classified into one category as CP7 (residuals).

2.2.3 The bias correction method: CP-method

A non-linear bias correction is used in which the modeled daily precipitation P is transformed by:

$$P^{*} = aP^{b} ,$$
(5)

where a and b are the transformation coefficients (a, b > 0). Leander and Buishand (2007) used this type of transformation to correct for bias in regional climate model simulations for the Meuse basin. Van Pelt et al. (2012) applied this equation in observed precipitation to account for future changes in precipitation for the Rhine basin, where an excess correction accounted for days with precipitation above a given (high) quantile to correct extreme events. Here we apply this excess correction for model precipitation \(P > q_{\bmod }^{95}\) and \(b > 1,\) where \(q_{\bmod }^{95}\) is the 95th quantile of the model precipitation.

In this study, we adopt the procedure described in Leander and Buishand (2007) to retrieve a and b coefficients specified for each collection of days where one of the seven CPs categories is found to be dominant. Doing so we obtain 7 pairs of transformation coefficients a and b, one pair for each CP. The coefficients a and bare derived using the equations:

$$\begin{aligned} b_{CP} & = \log \left( {q_{{obs\_CP_{i} }}^{95} /q_{{obs\_CP_{i} }}^{60} } \right)/\log \left( {q_{{\bmod \_CP_{i} }}^{95} /q_{{\bmod \_CP_{i} }}^{60} } \right)\,{\text{and}} \\ a_{CP} & = q_{{obs\_CP_{i} }}^{60} /\left( {q_{{\bmod \_CP_{i} }}^{60} } \right)^{b} , \\ \end{aligned}$$
(6)

where \(q_{obs}^{60} ,\,q_{obs}^{95}\) are the 60th and 95th quantiles of the observations (CHR08), \(q_{\bmod}^{60}\), \(q_{\bmod}^{95}\) are the corresponding quantiles of the model precipitation and each category is represented by the index \(CP_{i} = 1, \ldots ,7.\) For extreme events (\(P > q_{\bmod}^{95}\)) Eq. (5) is transformed into:

$$P_{CP}^{*} = \bar{E}_{\bmod \_CP} /\bar{E}_{obs\_CP} \left( {P_{CP} - q_{{\bmod \_CP_{i} }}^{95} } \right) + a_{CPi} \left( {q_{{\bmod \_CP_{i} }}^{95} } \right)^{{b_{CPi} }} ,$$
(7)

with \(\bar{E}_{\bmod} ,\bar{E}_{obs}\) the mean model and observed precipitation excess respectively. The bias correction descripted here is further refer to as the CP-method. The classic delta factors correction is labeled here as the NoCP-method and used in the results section for comparison purposes.

2.3 Hydrological application

The uncorrected, NoCP and CP corrected precipitation outputs are used as forcings in the HBV-96 hydrological model (Bergström and Forsman 1973; Bergström 1976; Lindström et al. 1997) to estimate the annual maximum discharges for the Rhine River. The HBV-96 hydrological model is a semi-distributed conceptual hydrological model originally developed at the Swedish Meteorological and Hydrological Institute (SHMI) in the 1970s. The HBV-96 precipitation-runoff model of the Rhine river basin has been successfully used, for instance, to estimate extreme runoff from catchments or to quantify the impacts of predicted climate changes (Berglöv et al. 2009). HBV-96 describes the most important runoff generating processes. The model consists of subroutines for snow accumulation and melt, a soil moisture accounting procedure, routines for runoff generation and a simple routing procedure. A complete description of the HBV-96 calculation scheme and model structure for the Rhine basin can be found in Eberle et al. (2005) and Sprokkereef (2001). Photiadou et al. (2011) showed that in the Rhine basin, the temperature effect did not have a significant influence on the estimation of extreme discharges. Following this study, temperature forcings in all simulations were derived from E-OBS Version 7 gridded data (Haylock et al. 2008). The E-OBS gridded temperature data set was made available from the ENSEMBLES project (Haylock et al. 2008), where it was constructed for validation of RCMs and for climate change studies. The spatial resolution of this data set is 0.25° by 0.25° on a regular latitude–longitude grid.

3 Results

3.1 SLP Patterns and their frequency distribution

3.1.1 Principal components of co-varying SLP and precipitation patterns

The co-varying SLP and precipitation patterns found by the MCA are ranked with respect of their contribution to the total explained (co)variance (EV). Table 1 presents the explained variance (EV) of the first nine pairs of SLP and precipitation principal components for winter and summer seasons. Summed together, the first three pairs of the principal components already cover a significant fraction of EV. The remaining patterns show little coherence and can therefore be considered to reflect noise. Three principal components are therefore retained for winter and summer season, respectively, explaining approximately 70 % of the total co-variance in each season. Furthermore, the physical representation of the PC showed insignificant patterns.

Table 1 Explained variance (EV %) as derived from the MCA for the 9 first pairs of the principal components of SLP and precipitation for winter and summer season

3.1.2 Frequency distribution of Principal Components

In Fig. 3, the frequency distribution of the principal components for winter and summer season for ERA-SLP is shown together with the respective frequency in the CMIP5 and ESSENCE climate model ensembles diagnosed by their dominant correspondence to the SLP patterns. Also shown is the 95 % range of the probability density obtained from the individual ESSENCE members, and the individual values of the CMIP5 models. In winter (Fig. 3a), ESSENCE displays a lower fraction of days attributed to PC1 than ERA, while CMIP5 models show a good frequency for most models, a number of them showing lower frequencies. For PC2, ESSENCE has the right ensemble mean fraction of days, while the ensemble spread is smaller than the CMIP5 ensemble, where a large number of models have higher fractions of days assigned to PC2 than ERA. For PC3, the CMIP5 ensemble has a very good agreement with the reference data set, while the ESSENCE mean is biased. For the summer season (Fig. 3b) CMIP5 models tend to overestimate the presence of PC1 and PC3. ESSENCE also overestimates PC1 but has a very good agreement with ERA concerning PC3. Both ESSENCE and CMIP5 underestimate the mean fraction of days where PC2 is dominant, with a similar ensemble spread. In summary, the ESSENCE ensemble shows a smaller bias in the frequency distribution of PC’s than CMIP5 in both seasons.

Fig. 3
figure 3

Frequency of the dominant principal components of 5 days of SLP for ERA (black bars), ESSENCE ensemble (grey bars) and CMIP5 models (different colors and shapes) for a winter and b summer season. Error bars indicate the 95 % range of probability from the individual ensemble members

3.1.3 Circulation patterns derived from the MCA for winter and summer

The leading CPs and their corresponding precipitation fields for the winter half year are shown in Fig. 4. The first field (Fig. 4a) illustrates a low-pressure anomaly (LA) in Central Europe and a high-pressure anomaly in Southwestern Europe. This implies a suppressed zonal Westward flow over the Rhine basin, producing relatively wet winter weather. The associated precipitation pattern (Fig. 4b) has overall positive anomalies across the basin with an exception of the southeastern catchment in the Swiss basin which is influenced by local topography. This resulting pattern is linked to past flood events in the Rhine basin (Bárdossy and Pegram 2011). The pair of fields with opposite sign is a high-pressure anomaly covering Central Europe, is associated with negative precipitation anomaly. This first pair of the MCA explains around 55 % of the total EV (Table 1).

Fig. 4
figure 4

SLP anomalies (left column) with associated sub-catchment precipitation anomalies (right column), both derived from MCA. Blue lines indicate negative SLP anomalies while red lines show the opposite. Both fields are normalized by subtracting the mean and dividing by the standard deviation of the 5 day running mean and standard deviation. Results shown are for winter half year season (1961–2005). a Low pressure anomaly (LA), b precipitation anomaly, c NAO+ pressure anomaly, d precipitation anomaly, e MerSN pressure anomaly, f precipitation anomaly

In the second winter pattern (explaining almost 16 % of the EV, Fig. 4c, d), a strong zonal flow similar to the positive phase of the North Atlantic Oscillation (NAO+) (Hurrell 1995; Zveryaev 2009) is associated with precipitation variations in the Rhine basin dominated by enhanced atmospheric moisture advection with excessive precipitation over northern Europe and deficient precipitation over southern Europe. Many studies identified the NAO as a driver of winter precipitation in Northern Europe (Haylock and Goodess 2004). High streamflow anomalies during autumn season were associated with the positive phase of the NAO (Ionita et al. 2011).

The last winter pattern (Fig. 4e, f) we consider here represents a meridional flow with south to north air transport (MerSN). It is characterized by positive precipitation variations in the Moselle basin and negative anomalies in eastern Swiss and German basins (Main).

For summer, the first SLP variation (Fig. 5a) explains around 47 % of the total co-variance and depicts a strong high-pressure anomaly (HA) with westward flow dominating the European continent, and positive precipitation anomalies (Fig. 5b). The second SLP pattern (Fig. 5c) represents the analogy of a positive phase of NAO (NAO+). The Southern European high-pressure anomaly dominates the regional precipitation variations of the Rhine; negative precipitation anomalies for the majority of the (mainly southern) sub-catchments and positive anomalies for the Lower Rhine (Fig. 5d). The meridional flow (Fig. 5e) with south to north air transport (MerSN) produces negative precipitation anomalies for eastern Swiss sub-catchments and positive precipitation anomalies mainly for the Moselle basin (Fig. 5f).

Fig. 5
figure 5

SLP anomalies (left column) with associated sub-catchment precipitation anomalies (right column), both derived from MCA. Blue lines indicate negative SLP anomalies while red lines show the opposite. Both fields are normalized by subtracting the mean and dividing by the standard deviation of the 5 day running mean and standard deviation. Results shown are for summer half year season (1961–2005). a High pressure anomaly (LA), b precipitation anomaly, c NAO+ pressure anomaly, d precipitation anomaly, e MerSN pressure anomaly, f precipitation anomaly

3.2 Bias correction of precipitation

3.2.1 Coefficients used in the correction algorithm

Five-day precipitation sums of ESSENCE and CMIP5 are corrected using the CP-method. The method is applied both to gridded (8 grid points, Fig. 1) and domain averaged precipitation outputs. The summary results of the domain average correction are shown below (Table 2; Figs. 7, 8, 9). The correction at grid level is performed to allow the use of the hydrological model, which demands gridded input data. Corresponding results of the NoCP-method are derived similarly using the same CP classifications as in the CP-method.

Table 2 Mean and standard deviation of a and b coefficients for ESSENCE (where the spread is over the individual ensemble members), for winter and summer season

Table 2 shows the statistical summary of the transformation coefficients for area averaged winter and summer precipitation for each of the ESSENCE members, including their mutual standard deviation. The frequency of each CP in ERA is presented for both winter and summer. The spread between the coefficients derived for the different CPs (CP-method) is considerable: it well exceeds the internal variability of the coefficients in the NoCP method. This variability of regime specific correction coefficients will introduce an extra source of variability in the corrected time series. In particular, CPs with a high frequency of occurrence (e.g. CP1 and CP4 from PC1 in winter and summer) and large deviations of the coefficients from the overall mean will have a relatively large contribution to the overall bias correction.

An illustration of how the CP-method works is given in Fig. 6, where biases in the 60th and 95th percentile of the raw model data are corrected while considering all data in one sample (NoCP) versus applying the correction to a CP subsample (CP). Figure 6a shows the correction of all 5 day precipitation intervals, which are categorized as a CP1 (low pressure anomaly in winter; Fig. 4a), together with the transformation of the 60th and 95th percentile for both methods. The CP method improves the distribution of each CP (Fig. 6a) by matching the two quantiles to the observations better than the NoCP method. For the whole ensemble (Fig. 6b) both CP and NoCP methods make a significant correction for the quantiles, but these corrections are more similar than for the CP1 subsample.

Fig. 6
figure 6

Illustration of the CP-method for winter precipitation in a single uncorrected ensemble member (raw data shown with grey circles). Black lines denote the observations and black crosses are 60th and 95th percentiles of the observations. In (a) the cumulative distribution of one CP precipitation category is shown, with the corresponding quantiles of each method (blue square for the CP-method; flipped triangle for the NoCP-method). The cumulative distribution of a random entire ensemble member is shown in (b) with the respective 60th and 95th percentiles for both NoCP and CP methods

The example in Fig. 6 is further elaborated in Fig. 7, where the two-sided Kolmogorov–Smirnov (K–S) test is applied to the entire data set and the CP subsamples. The K–S test is used to test the null hypothesis that the tested distributions have the same characteristics as the distribution of the observations. High p values (in Fig. 7, y-axis) show the significance of the K–S test, where p values below 0.05 are insignificant. The CP-method results in both summer and winter are lower than the NoCP results, with the winter results being extremely low. Results are shown for uncorrected model data and corrections according to NoCP and CP for winter (Fig. 7a) and summer (Fig. 7b). For both seasons, the CP-method shows higher p values than the NoCP method for nearly all CP categories, which demonstrates a good reproduction of the observed cumulative distribution in each CP category. However, for the entire record the NoCP shows a better performance (particularly in winter), due to the stability in calculating the q95 while the CP-method underperforms.

Fig. 7
figure 7

Whisker plots of Kolmogorov–Smirnov test statistic of the cumulative precipitation distribution of the 7 CPs using CHR08 reference for winter (a) and summer (b), with uncorrected ESSENCE output (black), results from the NoCP-method (grey) and the CP-method (dark grey). Black thick horizontal lines in each whisker plot indicate the median of the 17 members while the solid box shows the 25th to 75th percentiles range of the sample and the whisker show the data range. The whiskers extend to 1.5 times the interquantile range above the top and below the bottom of the box, while the outliers (black dots) mark individual data points that lie beyond the whiskers

3.2.2 Application of the method to the CMIP5 model ensemble

The bias correction was applied to two ensembles, the ESSENCE, as it was mentioned before, and to CMIP5. High return periods (50 years) of 5 day precipitation sums of the observations (CHR08), the ESSENCE ensemble and CMIP5 models for both winter and summer are shown in Figs. 8 and 9. The uncorrected ESSENCE precipitation data (Fig. 8a) show a distinct overestimation at low return periods (<2 years) with a small underestimation of precipitation levels at high return periods and a small spread of the 50 year return levels across the ensembles. The CP correction improves the low return periods but reduces high precipitation rates for the 10 and 20 years return periods (Fig. 8b). The method also decreases the spread at longer return levels, in respect to the spread of the NoCP method. In the summer season, ESSENCE is performing quite well, with a better behavior for the low return periods than in winter. An overestimation in precipitation levels is present for return periods >5 years (Fig. 8c). Both CP and NoCP methods retain the good behavior and the original spread of the ensemble members at return periods >20 years (Fig. 8d).

Fig. 8
figure 8

Return periods of 5-day averaged precipitation sums over the Rhine basin for the ESSENCE ensemble (grey dots) and the CHR08 observations (black line). Top row shows winter ESSENCE uncorrected outputs (a) and corrected using the CP method (light blue crosses) and NoCP (light grey) (b), while bottom row shows summer uncorrected outputs (c) and corrected (d)

Fig. 9
figure 9

Return periods of 5-day averaged precipitation sums over the Rhine basin for the CMIP5 ensemble (grey dots) and the CHR08 observations (black line). Top row shows winter CMIP5 uncorrected outputs (a) and corrected using the CP method (light blue crosses) and NoCP (light grey) (b), while bottom row shows summer uncorrected outputs (c) and corrected (d)

Figure 9 presents the comparison between return times of 5 day precipitation sums from observations and uncorrected and corrected outputs from the CMIP5 ensemble averaged over the Rhine basin for the winter season and summer season. The CMIP5 models display a behavior similar to the ESSENCE model concerning the performance in winter and summer. The CMIP5 ensemble has a smaller spread in the summer than in winter for large return periods (Fig. 9c). The uncorrected CMIP5 precipitation outputs in winter (Fig. 9a) overestimate the observations for low return periods and significantly underestimate for moderate and high return periods. Both CP and NoCP methods in Fig. 9b correct the low returns periods satisfactory, with the CP method decreasing slightly more the spread of the ensemble. In the summer season, the CP-method corrects the bias in low precipitation sums similar to the NoCP method. Concerning high precipitation levels, both NoCP and CP increase the spread; while at return levels between 5 and 20 year the CP performs better than the NoCP (Fig. 9c, d).

The changes in distribution characteristics between uncorrected and corrected ensembles show that in winter these changes are more obvious than in the summer due to larger correction factors imposed by the correction methods (CP & NoCP). This is especially obvious in the ESSENCE ensemble and not so distinct in the CMIP5 ensemble. In the CMIP5 ensemble the changes are distinct and similar in both seasons, with the biggest improvement in the lower quantiles. The different changes between winter and summer are related to a stronger dependence of precipitation on circulation in winter, when the pressure gradients are steeper than in summer.

3.3 Evaluation of simulated discharge

To demonstrate the effects of the bias correction methods on the estimated discharge for the Rhine basin, the HBV-96 hydrological model was forced with the uncorrected ESSENCE and CMIP5 models precipitation and the respective CP and NoCP corrected precipitation. Here we present results for Lobith, the entrance point of Rhine in the Netherlands, and Basel, representing the Swiss basin. For winter and summer two different basins were analyzed in more detail, corresponding to the spatiotemporal characteristics of climatic changes in these subbasins. In the alpine part of the Rhine (upstream of Basel) winter discharges increase and spring flows decline between May and October (summer season in this study: April to September), due to a reduction in snowfall, an increase in glacier melt, and an increase in winter precipitation. In the central part of the Rhine basin (upstream of Lobith), flows are simulated to increase between February and July, and to decrease at other times. This pattern reflects the annual changes in precipitation.

Figures 10 and 11 present the calculated annual maximum discharges and the fitted peak levels with a return time up to 1/100 year using the discharge obtained by using CHR08 as a forcing as reference. This discharge time series is referred to as “observed” in the following. To match the record length of CHR (1961–2008) HBV-96 model runs were performed for a period of similar length (corresponding to the climate forcing imposed during the period 1961–2005). The extreme discharges levels with long return times are estimated from the data using a Gumbel fit (Coles 2001). In Fig. 10a the 95 % confidence interval of the 100 year return period of the observed and the corrected ESSENCE-driven results for Basel are shown. In winter, the CP method leads to a reduced 95 % confidence range of the 100 year return time compared to the NoCP, which underestimates this range. For the summer season at Lobith, both correction methods lead to a similar and satisfactory reduction of the uncertainty range.

Fig. 10
figure 10

Gumbel fits of return time of simulated discharges at Lobith for a summer and b winter, estimated using HBV-96 calculations driven by CHR08 data (black solid line), uncorrected ESSENCE model outputs (red dotted lines), ESSENCE NoCP-corrections (green lines) and CP-corrections (blue lines). The grey range indicates the 95 % confidence intervals for the ensemble for the CP-method

Fig. 11
figure 11

Gumbel fits of return time of simulated discharges at Lobith for a summer and b winter, estimated using HBV-96 calculations driven by CHR08 data (black solid line), uncorrected CMIP5 model outputs (red dotted lines), CMIP5 NoCP-corrections (green lines) and CP-corrections (blue lines). The grey range indicates the 95 % confidence intervals for the ensemble for the CP-method

The estimated annual maximum discharge derived from CMIP5 model output has a wider range of uncertainties for both winter and summer than the ESSENCE ensemble (Figs. 10, 11). Concerning the effect of the bias correction, in winter at Basel (Fig. 11a), both correction methods have larger ranges than the observed, with the CP correction having a smaller range of uncertainty than the NoCP method. In the summer season at Lobith (Fig. 11b) both methods behave similar; they show larger ranges than the observed, with both methods underestimating the lower margin of the confidence interval. Both NoCP and CP for both locations and half-year seasons lead to a significant reduction in the confidence range compared to the uncorrected discharge record. For CMIP5 and ESSENCE, the uncorrected discharge record shows a different behavior. In Fig. 11a, the observations are situated in the middle of the confidence interval, while in Fig. 10a the ESSENCE ensemble shows an excellent agreement with the observed range. At Lobith, however ESSENCE overestimates the range of the confidence intervals for all return levels.

4 Discussion and conclusions

This study assesses the effects of introducing circulation patterns into a bias correction method for precipitation generated by two ensembles of climate model simulations. Specifically an evaluation of simulated discharge from the Rhine basin is carried out. The (co)variability of precipitation over the Rhine basin with regional circulation patterns was analyzed using a MCA for winter and summer half-year data for the period 1961–2005. For each season three leading principal components (PCs) were derived using ERA daily averaged sea level pressure (SLP) and a new high-resolution regional daily precipitation dataset (CHR08). The selected PCs explain a large fraction of the variability of precipitation in the Rhine basin, and are in agreement with previous studies (Zveryaev 2009; Bárdossy 2010a). Each 5 day interval was categorized to a given PC according to the sign and amplitude of PC time series, resulting in 6 classified and one unclassified (containing the residuals) sets of circulation patterns (CPs). The CP categories reflect local and regional pressure gradients; NAO-related westerlies and easterlies in both winter and summer season can be described successfully using the MCA and can explain a large fraction of the seasonal precipitation variability of the Rhine basin. In the CP-method transformation coefficients are estimated using the quantiles of observed and model precipitation distribution of each CP precipitation category (Eq. 6).

In both seasons, the frequency of occurrence of the classified CPs in an ECHAM5 climate model ensemble (referred to as the ESSENCE ensemble) shows a smaller bias than CMIP5. The CP-bias correction that makes use of these circulation characteristics thus has a stronger effect in the CMIP5 ensemble than in ESSENCE, implying that the CP-specific bias correction has a larger impact for models that have a stronger bias in the circulation distribution statistics.

Changes in extreme precipitation statistics result from a mixture of processes, including changes in the frequency distribution of circulation patterns. Biases in these processes in current climate models form a major source of uncertainty (Cattiaux et al. 2012, 2013; Chen 2013). An attempt for a physical interpretation of the circulation-specific bias of precipitation and the frequency of these patterns did not lead to a clear picture. However, we demonstrate that adding circulation bias into the correction can improve the overall extreme precipitation and estimated extreme discharges. For both winter and summer a non-linear bias correction not dependent on CPs (referred to as NoCP) has a higher overall skill (as measured by a Kolmogorov–Smirnov test) than the CP-method when the complete cumulative distribution of 5-day precipitation data is considered. The CP method is based on 60 and 95-percentile values in subsets of the sample, and the according precipitation values have different percentile values in the whole sample, causing non-representative transformation coefficients for the entire distribution. This limitation, however, does not lead to an overall deterioration of the performance of the CP-method: especially for extreme discharge over the Rhine the good performance of the CP-method is retained. In addition, for episodes subject to a particular circulation regime, the CP correction shows a clearly larger K–S significance than NoCP, which demonstrates the influence of circulation regimes on precipitation model bias. In particular, the CP-method recognizes the relationship between low/high precipitation rates and specific circulation patterns, where the NoCP-method does not make this explicit attribution. Our results are in line with the findings of (Lisniak et al. 2012), who included an atmospheric circulation index to downscale coarse resolution precipitation data using a Multiplicative Random Cascade method.

The hydrological application in ESSENCE and CMIP5 models presented here shows a significant improvement of both NoCP and CP-corrections for winter in Basel and summer in Lobith. The NoCP performs better than the CP in Lobith, while the CP correction performs better in winter in the Alpine region. The different performances between CP and NoCP show primarily the dependence of the precipitation on the circulation regimes in the Alpine (due to the topography).

The CP-method allows improvements in the simulation of precipitation changes that are associated to changes in the circulation regime and the frequency of occurrence of the corresponding CPs. The improvement in the precipitation correction leads to improvements in the estimation of extreme discharges in the Rhine basin. In this framework, we allow for physical connection of the bias in precipitation with circulation patterns without violating the good performance of the delta factor correction. However, a simple bias correction (NoCP) still remains a better option, since with less calculation of parameters the NoCP produces similar improvements.

Physics and feedbacks between precipitation, evaporation and atmospheric dynamics vary widely across circulation regimes and may lead to circulation dependent biases (Christensen et al. 1997; Findell and Eltahir 2003; Seneviratne et al. 2006; Suklitsch et al. 2011). One potential key mechanism to consider in hydrological application in such studies is the interception evaporation process that plays a significant role during low flow conditions in summer and fall (Hurkmans et al. 2009). CP-specific bias corrections may appear beneficial for studding these feedbacks, but this is subject for future research.