1 Introduction

Flooding is the most common natural hazard globally, causing significant loss of life and destruction of property and infrastructure. Between 2000 and 2022, floods worldwide resulted in inflation-adjusted economic losses equivalent to US $ 929 billion, representing 3852 floods (Centre for Research on the Epidemiology of Disasters (CRED) 2023). Storms and flooding events are the most common types of natural hazards in South Africa, constituting 71.2% of the total hazards in the country (CRED 2023). In the Western Cape, flooding is driven primarily by cut-off lows, resulting in persistent annual financial losses across the province. Between 2003 and 2014, government departments and municipalities in the province experienced ZAR 4.9 billion in flood-related damages (Pharoah et al. 2016).

Flood frequency analysis (FFA) is used to estimate flood magnitudes associated with specific return periods beyond the observed record (Cunnane 1988). FFA is referred to as design flood estimation. The understanding of the magnitudes of floods and their exceedance probabilities is of prime importance for various aspects of water resources management and planning such as the design of hydraulic structures (Smithers et al. 2015), flood insurance studies (Kjeldsen et al. 2014), flood plain management (Nagy et al. 2017), river ecological studies (Zaman et al. 2012), and stormwater management (Leščešen et al. 2019). FFA faces several challenges in its implementation, with issues such as the selection of theoretical probability distributions and the short record length of gauge data being particularly problematic (Nagy et al. 2017).

FFA can be undertaken at the site or regional level. At the site level, FFA is the most direct method of estimating design floods (Zaman et al. 2012) and uses a record of observed flows at a specific location. Regional FFA refers to the assessment of magnitudes of peak flows at sites clustered as homogenous flood regions owing to similar at-site characteristics (Hosking and Wallis 1997). Regional FFA approaches are advantageous in overcoming issues such as short record lengths and the spatial density of river gauges. At-site FFA is usually undertaken on gauged flow data at a given location if flow data are available for the site under investigation.

FFA can be undertaken using the AMS or partial duration series (PDS). AMS is derived from the highest peak discharge value for each year in the time series (Langbein 1949). The AMS approach is advantageous as events are easily identified, and the approach ensures that selected events are independent of each other. The AMS approach can ignore larger peaks that are not the maximum in their years and be problematic in semi-arid and arid regions where annual peaks can be low (Zaman et al. 2012). In contrast, the PDS approach includes all flood events over a pre-defined level for each year. The PDS approach is suitable when time series records are short and less reliable (Nagy et al. 2017). The PDS method is limited by its ability to ensure the independence of the flood peaks and the absence of a standardised approach for the selection of appropriate threshold values. In South African studies, observed flows are usually fitted to the AMS extracted from the discharge series (Nathanael et al. 2018; Ngongondo et al. 2013; Singo et al. 2016; Smithers et al. 2015; Kjeldsen et al. 2002).

There is no one theoretical probability distribution that applies to all sites, with most research suggesting that distributions be selected based on goodness of fit tests (Markiewicz et al. 2015). In South Africa, the most common probability distributions used in FFA are the Log-Normal (LN), Log Pearson Type III (LP3) and Generalized Extreme Value (GEV) distributions (Van der Spuy and du Plessis 2022). Despite concerns regarding the length and reliability of discharge series records in South Africa, the GEV and LP3 distributions generally produce acceptable results (Van der Spuy and du Plessis 2022).

Precipitation across South Africa is highly variable (MacKellar et al. 2014). While rainfall climate projections remain uncertain, the majority of climate models suggest that annual rainfall will decline across the country (World Bank Group 2021). Despite decreases in average precipitation, rainfall intensity is expected to increase under future climate scenarios in South Africa (Abiodun et al. 2020; Kruger and Nxumalo 2017; Engelbrecht et al. 2013; Westra et al. 2013). Intense rainfall is the primary driver of floods in South Africa in smaller catchments compared to widespread rainfall in larger catchments (Smithers and Schulze 2004). Therefore, with the projected increase in rainfall intensity, the prevalence of flooding will likely increase across the region.

The Western Cape province of South Africa experiences Mediterranean winter rainfall due to the influence of mid-latitude cyclones (du Plessis and Schloms 2017; Roffe et al. 2019). Winter rainfall in parts of the Western Cape of South Africa is expected to decline due to climate change (Engelbrecht et al. 2013). Synoptic features which drive extreme weather events such as floods are potentially increasing in the Western Cape (Engelbrecht et al. 2013; Abiodun et al. 2016). Pharoah et al. (2016) note that extreme daily cut-off low-induced rainfall has increased across the Western Cape province, coupled with record flood peaks in several rivers between 2003 and 2014. De Waal et al. (2017) show that 63% of rainfall stations in the Western Cape exhibited an increase in 20- and 50-year, 1-day rainfall extremes.

Numerous studies have been conducted on the linkage of climate change to flood frequency (Dong et al. 2018; Iqbal et al. 2018; Maurer et al. 2018; Quintero et al. 2018; Almasi and Soltani 2017; Camici et al. 2014; Qin and Lu 2014; Shaw and Riha 2011; Mareuil et al. 2007; Kay et al. 2006; Muzik 2002). In the South African context, traditional FFA studies have been undertaken in several provinces, including KwaZulu-Natal (Kjeldsen et al. 2002); Mpumalanga (Heritage et al. 2001); Limpopo (Singo et al. 2016); and the Western Cape (Tempelhoff et al. 2009). However, few studies have focused on flood frequency estimation under future climate change scenarios in South Africa. Studies undertaken by Abiodun et al. (2020), Du Plessis and Scholms (2017), and De Waal et al. (2017) have focused on flood design estimations based on extreme rainfall under future climate change in the Western Cape; however, no studies have been undertaken using discharge series from river gauges across the province.

The assessment of flood frequency as a result of climate change is typically assessed using global circulation models (GCMs), downscaling techniques, and hydrological models (Xu et al. 2005). The coarse resolution of GCMs (approximately 150–300 km2) cannot serve as direct inputs into hydrological models (Ougahi et al. 2022). Therefore, downscaling techniques must be applied to GCMs to obtain a finer spatial or temporal resolution (Dong et al. 2018; Iqbal et al. 2018; Maurer et al. 2018; Quintero et al. 2018; Almasi and Soltani 2017; Camici et al. 2014; Qin and Lu 2014). There are two primary categories of downscaling: dynamical downscaling and statistical downscaling. In dynamical downscaling, a high-resolution climate model (a regional climate model (RCM)) that is limited to a specific region is run using one or more GCM grids, with the boundary conditions driven by the underlying GCM (Seidou et al. 2012). RCMs provide outputs at a finer solution resolution (20–60 km2, compared to GCMs (150–300 km2) (Chokkavarapu and Mandla 2019). Statistical downscaling refers to the statistical relationship that is developed between historical observational data and outputs of climate models (both regional and global) (Copernicus Climate Change Service 2021). This study is limited to statistical downscaling methods as these approaches are computationally inexpensive. In addition, statistical downscaling methods are helpful in downscaling to point locations, such as a weather station or river gauge.

This paper examines the changes in flood frequency curves (FFC) across selected sites in the Western Cape province in South Africa. The specific objectives of the study include:

  • Evaluating FFCs under three statistical downscaling methods (delta change (DC), linear scaling (LS) and quantile delta mapping (QDM) approaches) and two instantaneous peak flow (IPF) estimation methods: the Sangal and Fuller methods.

  • Comparing FFCs generated between individual GCMs and the GCM ensemble mean FFC.

  • Investigating changes in FFCs under future climate scenarios (representative concentration pathway (RCP) 4.5 and RCP 8.5) for two periods (2030–2060 and 2070–2100).

2 Methodology

2.1 Study area

The Western Cape province is situated in the southwestern corner of South Africa and is one of the country’s nine provinces (Fig. 1). The province is bordered by the Eastern Cape and Northern Cape provinces. The Western Cape has a total area of 129 462 km2 and is characterised by a semi-arid climate with an annual average rainfall of 450 mm with significant variability across the province (Western Cape Government 2018). The Western Cape comprises two water management areas (WMA): the Breede-Gouritz and the Berg-Olifants (Western Cape Government 2018).

Fig. 1
figure 1

Map of the Western Cape province illustrating selected river gauges and weather stations across water management areas

2.2 Data acquisition

Daily precipitation data was acquired from the South African Weather Service (SAWS) climate database and the hydrological services platform of the Department of Water and Sanitation (DWS). Average daily discharge and monthly evaporation data were obtained from the hydrological services platform of the Department of Water and Sanitation. Rainfall stations and river gauges were assessed to ensure a sufficiently long data record was present. To ensure consistency across sites, the data period of 1970–2000 was selected. Data completeness was evaluated for each river gauge and rainfall station. The proportion of missing observations was less than 1% on average across all sites. Missing data was interpolated using the Kalman smoothing function which is recommended for long time series correction (Moritz 2017). This was undertaken using the R-based ImputeTS package. This approach has been applied successfully in other studies focused on flood frequency and precipitation changes under future climate change (Bulti et al. 2021; Brönnimann et al. 2019; Ngongondo et al. 2011). The location of precipitation stations and river gauges are detailed in Tables 1 and 2, and As precipitation stations and river gauges seldom occur in the same geographical location, pairs of precipitation stations and river gauges were matched based on a least distance criterion. In addition, pairs were assessed to ensure their joint presence in the tertiary and quaternary catchments and similarity with respect to biophysical variables such as topography and climate. SAWS data sets are available upon request at https://www.weathersa.co.za/home/equiries_climatedata, while DWS datasets can be downloaded from https://www.dws.gov.za/Hydrology/Verified/hymain.aspx.

Table 1 Summary of rainfall station locations and characteristics
Table 2 Summary of river gauge locations and characteristics

2.3 GCM acquisition and clustering, scenario selection and downscaling

Climate projections used within this study were based on eight GCMs of the World Climate Research Project Coupled Model Intercomparison Project—Phase 5 (CMIP5) (Table 3). Suitable GCMs were identified using the GCMeval tool developed by Parding et al. (2020). The tool was used to rank, contrast and filter CMIP5 climate models that best represent present precipitation trends in the Southern Africa region while considering bias, spatial correlation and other skill scores. In addition to the models selected using the GCMeval tool, this study integrated two additional climate models (MIROC5 and GFDL-ESM2G) into the ensemble in order to be consistent with a prior investigation conducted in the same location by Pillay (2023). The CMIP5 data was downloaded from the Earth System Grid Federation platform (https://esgf-node.llnl.gov/projects/esgf-llnl/). For all GCMs, historical precipitation data from 1961 to 2005 and future-projected precipitation data from 2006 to 2100 under RCP 4.5 and RCP 8.5 scenarios were acquired.

Table 3 Description of the CMIP5 climate models used (CEDA 2024)

Raw GCMs must be downscaled to be viable for applications at a local scale. Three statistical downscaling methods were applied in this study. These include the DC, LS and QDM methods. The control period for statistical downscaling was 1970–2000, while the future periods under investigation were 2030–2060 and 2070–2100. The DC and LS approaches are described by Eqs. 1 and 2.

$${\text{P}}_{{\text{DC}}} = {\text{P}}_{{\text{obs}},{\text{d}}} \times \frac{{{\mu }\left( {{\text{P}}_{{\text{sim}},{\text{m}}} } \right)}}{{{\mu }\left( {{\text{P}}_{{\text{hst}},{\text{m}}} } \right)}}$$
(1)
$${\text{P}}_{{\text{LS}}} { } = {\text{P}}_{{\text{sim}},{\text{d}}} \times \frac{{{\mu }\left( {{\text{P}}_{{\text{obs}},{\text{m}}} } \right)}}{{{\mu }\left( {{\text{P}}_{{\text{hst}},{\text{m}}} } \right)}}$$
(2)

where P is precipitation, obs is the observed time series, hst is the GCM output of the control period, and sim refers to the GCM output of future scenarios. µ refers to the mean, while m and d refer to the monthly and daily time step.

The QDM approach differs from the DC and LS techniques as it allows the climate change trend to be extracted from projected quantiles. This is advantageous as it allows for quantile corrections across the full distribution of values, thereby matching the reference distribution (Camici et al. 2014). As Xavier et al. (2022, p. 178) outlined, the QDM approach “preserves model-projected relative changes in precipitation data, and it corrects possible systematic biases in the quantiles of modelled data in respect to observed values.” The QDM implementation is broken down into two steps. First, the absolute or relative changes in the quantiles between the reference and future periods are calculated. This is represented by Eq. 3. Thereafter, the bias-corrected precipitation dataset is attained by multiplying the relative changes by the historical corrected value—Eq. 4. Further details on the QDM are outlined in Cannon et al. (2015).

$$\Delta_{{\text{sim}}} ({\text{i}}) = \frac{{{\text{Q}}_{{\text{sim}}({\text{i}})} }}{{{\text{F}}_{{\text{hst}}}^{ - 1} \left[ {{\text{F}}_{{\text{sim}}} \left[ {{\text{Q}}_{{\text{sim}}({\text{i}})} } \right]} \right]}}$$
(3)
$${\text{P}}_{{\text{QDM}}} = {\text{ F}}_{{\text{obs}}}^{ - 1} \left[ {{\text{F}}_{{\text{sim}}} \left[ {{\text{Q}}_{{\text{sim}}\left( {\text{i}} \right)} } \right]} \right] \times \Delta_{{\text{sim}}} ({\text{i}})$$
(4)

where Δsim (i) is the relative changes in the ith quantiles between historical and future GCM simulation, and Qsim(i) is the ith quantile of future simulated data. Fsim and Fhst−1 represent the cumulative distribution function (CDF) for the future period and the inverse CDF of the simulated historical period, respectively.

2.4 Hydrological modelling

2.4.1 Overview of the water resources simulation model (WRSM)/pitman daily time step model

The WRSM/Pitman daily time step model was used to simulate future average daily discharge estimates. The WRSM/Pitman model is a mathematical model that tracks water movement through a network of catchments, irrigation areas, reservoirs, mining activities and river channels (Bailey and Pitman 2016). The mathematical framework of the model was initially developed and communicated in Pitman (1976). The mathematical model was coded and run in Python 3.11. Daily precipitation and monthly evaporation were inputs to the WRSM/Pitman model. Mean daily evaporation estimates were derived from monthly evaporation estimates.

The WRSM/Pitman model is advantageous for the simulation of future average daily discharge data as less uncertainty is introduced into the modelling framework owing to fewer input data requirements. Hydrological simulations for future climate change scenarios were performed using the parameters estimated based on the historical calibrations of observed and simulated average daily discharge for the control period (1970–2000).

2.4.2 Mathematical framework

The WRSM/Pitman daily time step model initially disaggregated daily precipitation into hourly intervals. The mathematical relationship between hourly and daily rainfall amounts was assessed using data from weather stations in Pretoria in the Gauteng province (Pitman 1976). Equation 5 was used to determine the storm event duration in hours, while Eq. 6 was used to calculate the hourly precipitation amounts and was based on a typical S-shaped rainfall mass curve. Evaporation and interception losses were removed from hourly precipitation amounts, with the remaining daily precipitation available for surface runoff or infiltration. Surface runoff was calculated using a symmetrical triangular frequency distribution and is described in Eqs. 712 with the remaining water available as soil moisture. Soil moisture percolates into groundwater before entering the river channel. These processes are described in Eqs. 13 and 14. Lastly, total discharge was calculated by adding groundwater outflow and surface water runoff while accounting for time delays and attenuation using the Muskingum method.

$$d = 0.964 + 0.13736 \times p$$
(5)
$$y = \frac{x^n }{{x^n + (1 - x)^n }}$$
(6)

where y is the cumulative precipitation for the event duration (mm) divided by total precipitation (mm per event), x is the cumulative time (hours) divided by total event time (hours), n = exponent related to the non-uniformity of precipitation input (mm), d represents the duration of a storm event (hours), and p is the precipitation (mm/day).

$$R_h = \frac{2 (P_h - Z1)^3 }{{3 (Z3 - Z1)^2 }};\quad Z1 \le P_h \le Z2$$
(7)
$$R_h = P_h - Z2 + \frac{{2 \left( {P_h - Z1} \right)^3 }}{{3 \left( {Z3 - Z1} \right)^2 }};\quad Z2 \le P_h \le Z3$$
(8)
$$R_h = P_h - Z2; \quad P_h \ge Z3$$
(9)
$$Z1 = \frac{{Z_{min} Z3}}{{Z_{max} }}$$
(10)
$$Z2 = \frac{(Z1 + Z3)}{2}$$
(11)
$$Z3 = \frac{{4Z_{max} }}{{2^{\frac{2S}{{St}}} }}$$
(12)

where Ph is hourly rainfall (mm/h), Rh is the surface runoff (mm/h), Zmin is the nominal minimum infiltration rate (mm/h), Zmax is the nominal maximum infiltration rate (mm/h), S is soil moisture (mm), and St is the soil moisture capacity (mm).

$$Pe = Ft\left[ {\frac{S - Sl}{{St - Sl}}} \right]^{Pow}$$
(13)

where Pe is the percolation flow rate (mm/day), Ft is the percolation at soil moisture capacity (mm/day), Sl is the soil moisture storage below which no percolation occurs (mm), and Pow is the power of soil moisture-percolation curve.

$$Gwft = \frac{{W_G^\frac{2}{3} }}{{Gl\sqrt {St} }}$$
(14)

where Gwft is the groundwater flow rate (mm/day), WG is the groundwater storage (mm), and Gl represents the groundwater delay constant (day).

Bailey and Pitman (2016) outline prescribed ranges for model variables. Internal model variables and suggested variable initial values and ranges described in Eqs. 714 are described in the Supplementary Information (Table S16).

2.4.3 Hydrological model calibrations

In the Pitman/WRSM hydrological model, calibration statistics include the mean and standard deviation of annual runoff, log mean and standard deviation of annual runoff and the index of season flow (Bailey and Pitman 2016). These calibration statistics were deemed to be insufficient for this study. Considering the application of peak flows within FFA, metrics such as the percentage error between the simulated and observed mean annual peak flows were assessed in addition to other calibration performance statistics typically used in hydrological studies. These included the Pearson’s correlation coefficient (r), the coefficient of determination (R2), Kling-Gupta Efficiency (KGE) and the Nash–Sutcliffe Efficiency (NSE). Bailey and Pitman (2016) outlined a set of acceptable ranges for each variable used within the WRSM/Pitman model. Parameter adjustments were made based on these acceptable ranges after each simulation. In certain cases, adjustments were made beyond the suggested ranges. This was undertaken to evaluate the limitations of the hydrological model in projecting discharge under future climate scenarios. Further investigation was undertaken by visual inspection of simulated hydrographs.

During the calibration of the WRSM/Pitman model for selected sites, the lag of surface runoff and percolation parameters was set to 0 days and 0.01 mm per day, respectively (Reinwarth et al. 2018). This is recommended from ephemeral rivers (Pitman 1976). In this study, the suppression of the lag variable during parameterization was implemented to maintain the peak discharge statistical profile within the simulated time series. The WRSM/Pitman model defines lag as the time delay in measuring runoff at the catchment outlet (Bailey and Pitman 2016). It was found that a higher lag parameter resulted in peak discharges being underestimated, causing a smoothing effect of daily discharge hydrographs.

This study used the Kling-Gupta Efficiency (KGE) metric as the primary calibration performance measure. The KGE statistic is advantageous in comparison to the Nash–Sutcliffe Efficiency (NSE) metric. Gupta et al. (2009) note that discharge variability is not correctly considered in the NSE. The KGE metric provides a more balanced form of the combined components of the NSE (correlation, bias, ratio of variances or coefficients of variation) (Liu 2020). In addition, the NSE metric resulted in variable performance and sensitivity in assessing maximum daily runoffs (Lin et al. 2017). As outlined in Kling et al. (2012), the KGE metric can be assessed by the following scheme:

  • “Good” (KGE > 0.75),

  • “Intermediate” (0.75 > KGE > 0.5),

  • “Poor” (0.5 > KGE > 0),

  • “Very poor” (KGE < 0).

2.5 Instantaneous peak flow estimation

A significant concern in FFA is the availability of precipitation data at a sub-daily time step from GCMs and hydrological models. To overcome this issue, the study adopted an empirical approach to estimate daily instantaneous peak flows (IPF) outlined in Sangal (1983) and Fuller (1914). This approach has also been taken in studies by others investigating flood frequency changes under future climate change such as Almasi and Soltani (2017). The Sangal and Fuller methods are outlined in Eqs. 15 and 16.

$$Q_{max} = \frac{{\left( {4Q_2 - Q_1 - Q_3 } \right)}}{2}$$
(15)

where Qmax is IPF (m3/s), Q2 mean daily flow at the time when peak flow occurred, Q1 and Q3 are daily mean flow in previous and next day of the peak (m3/s).

$$Q_{max } = Q(1 + 2.66 A^{ - 0.3} )$$
(16)

where Qmax is IPF (m3/s), Q is mean daily flow (m3/s), and A is the catchment area (km2).

2.6 Flood frequency analysis

FFA was undertaken using the AMS for IPFs at each site using the Weibull plotting position. The AMS datasets were extracted for individual GCMs and the GCM ensemble mean for the scenarios (RCP 4.5 and RCP 8.5) and future periods (2030–2060 and 2070–2100) under evaluation and the observations representing the control period (1970–2000). In addition, AMS datasets for the individual GCM and the GCM ensemble mean were extracted by the statistical downscaling and IPF estimation methods under investigation. The GCM ensemble mean represents the mean AMS series of IPF across individual GCMs for each period. Considering that the AMS is extracted from IPF data at the daily time step, the clustering of GCMs into an ensemble cannot be directly constructed by taking the mean across days within each GCM. This is due to the difference in the timing of wet and dry days across GCMs. Maimone et al. (2019) discuss this issue in further detail.

Three theoretical distributions, including the GEV, LN and LP3 distributions, were assessed for flood design estimation. These distributions were identified by Van der Spuy and du Plessis (2022) as the most commonly used distributions in South African FFA studies. Parameter estimation was undertaken using the maximum likelihood approach. The advantages of the L-moments approach, which has been applied extensively in hydrological applications, are acknowledged. Hosking and Wallis (1997) noted that the L-moments approach provided more robust parameter estimations in the presence of extreme values. Considering the application of FFA under future climate change scenarios, this study adopted the maximum likelihood method for parameter estimation owing to its application in both hydrology and climate change studies. The Kolmogorov–Smirnov, Cramér–von Mises and Chi-squared goodness of fit tests were conducted at a significance level (α) of 0.05 to determine the most suitable distribution. Test statistics are presented in Table 4. IPF quantiles were calculated for the 2-, 5-, 10-, 25-, 50-, and 100-year return periods using the selected distribution.

Table 4 Test statistics used to assess goodness of fit

Chow (1954) suggested that FFA by theoretical probability distributions could be assessed using frequency factors ‘K’ based on statistical parameters. Peak discharge (XT) was calculated using Eq. 17:

$$X_T = \mu + K_T \cdot s$$
(17)

With the mean (μ), standard deviation (s), and KT is the distribution-specific frequency factor for the return period T.

3 Results and discussion

3.1 Global circulation model ensembles and statistical downscaling outputs

Mean annual rainfall (MAR) trends are measured between the control period and climate scenarios (RCP 4.5 and RCP 8.5) under both the near (2030–2060) and long term (2070–2100). Figure 2 depicts the MAR based on the QDM downscaling method for the control period and future climate scenarios across evaluated locations in the Western Cape. The MAR estimates for the QDM, LS and DC methods are presented in Tables S1 and S3 in the Supplementary Information. A decreasing MAR trend is evident at several sites—Vredendal, Jonkershoek, Vogel Vallij and Eenzaamheid—irrespective of the downscaling method applied. For these sites, the percentage decrease in MAR between the near and long term for the RCP 4.5 scenario is less than percent decrease under RCP 8.5.

Fig. 2
figure 2

Mean annual precipitation (mm per year) with standard deviation under control and future climate scenarios across locations in the Western Cape (Downscaling Method: QDM)

There are conflicting results between the DC, LS and QDM sites for the Stompdrift Dam and Kammanassie sites. For the DC method, there is a decreasing MAR trend detected for both sites across all scenarios and time periods, while under the QDM method, a 0.54 and 2.14% increase is present for the Kammanassie site for the near-term period (2030–2060) under RCP 4.5 and 8.5, respectively. For the Stompdrift Dam site, under the QDM method, an increase of 10.88, 11.21 and 10.78% in MAR is evident for the RCP 4.5 2030–2060, RCP 8.5 2030–2060 and RCP 4.5 2070–2100, respectively. A smaller percentage increase of 1.21% is detected under the RCP 8.5 for the 2070–2100 period. For the LS downscaling method, percentage increases between 50.99 and 65.12% are detected across both scenarios and time periods for the Kammanassie site. Similarly, increases between 261.68 and 307.73% are detected for the Stompdrift Dam site. For the Grabouw site, there is consensus between the DC and QDM methods regarding the percentage decrease in MAR detected across all scenarios and periods in contrast to the percentage increases between 61.78 and 93.95% present under the LS method.

The decrease in mean annual precipitation detected across future climate scenarios for selected sites in the Western Cape concurs with the findings of several studies, including Hewitson and Crane (2006), Engelbrecht et al. (2019), Du Plessis and Kalima (2021) and Ngwenya and Simatele (2024). The decreasing rainfall trend for future climate projections follows a historical decreasing trend across the Western Cape detected by Lakhraj-Govender and Grab (2019).

Several factors may influence future changes in precipitation in the Western Cape. Decreases in the winter rainfall in the southwestern Cape and west coast have typically coincided with poleward dynamics of the southern hemisphere subtropical high-pressure belt and mid-latitude westerlies (Roffe et al. 2021; Lakhraj-Govender and Grab 2019). Poleward shifts of moisture corridors can be attributed to the Southern Annular Mode and the expansion of the sub-tropical anticyclones in the South Atlantic and South Indian Ocean (Lakhraj-Govender and Grab 2019). Changes in Antarctic sea ice and the expansion of the Hadley cell are likely to result in changes in rainfall for regions that derive moisture from the mid-latitudes (Pascale et al. 2016; Roffe et al. 2021). Changes in climate modes such as the Southern Oscillation Index and solar (sunspot) activity may also contribute to inter-annual rainfall variability in the Western Cape (Ndebele et al. 2020, 2022; Mazibuko et al. 2021).

3.2 Hydrological modelling

Average daily discharge simulations were run on seven pairs of precipitation stations and river gauges across the Western Cape. Calibration and parameterization of the WRSM/Pitman daily model were undertaken by comparing observed and simulated average daily discharge generated from observed daily precipitation for the control period (1970–2000). Results from the calibration are presented in Table 5 and calibration hydrographs in Fig. 3. Of the seven pairs calibrated, no calibrations within the “good” category were achieved for the KGE metric. Calibrations with “intermediate” results for the KGE metric were achieved for the Jonkershoek, Little Berg River, and Palmiet sites. The assessment of percentage error between observed and simulated mean annual peak flows resulted in acceptable thresholds (percentage error < 10%) for the Doring River, Jonkershoek, Little Berg River, Palmiet and Marnewicks River according to guidelines outlined in Bailey and Pitman (2016). Considering the findings across calibration performance statistics, only the Jonkershoek and Little Berg River were assessed further within the FFA.

Table 5 Calibration performance statistics across locations in the Western Cape
Fig. 3
figure 3

Hydrographs of observed and simulated daily discharge (m3/s) for selected sites

The influence of the selected hydrological model on climate change FFA has been noted in several studies, such as Maurer et al. (2018), Mizukami et al. (2016) and Mendoza et al. (2015). These studies suggest that the choice of hydrological model and rigour of the calibration is a more significant contributor to uncertainty than other methodological choices. This is also a significant finding in this study. The successful calibration of hydrological models is complex in Southern Africa since less than 9% of mean annual precipitation is converted into mean annual runoff, while 5% recharges groundwater (Fikileni and Wolski 2022; Edokpayi et al. 2020). Despite this regional trend, an investigation by Lakhraj-Govender and Grab (2019) detected significant correlations between annual rainfall and annual river flows at several sites across the Western Cape.

The WRSM/Pitman model had limited success in simulating flows at river gauges in the Western Cape. Of the seven sites investigated, two sites obtained acceptable calibrations; however, the strength of KGE scores for these sites were only in the “intermediate” category described by Kling et al. (2012). Several factors could be attributed to the calibration performance of the WRSM/Pitman model. First, the model was developed using rainfall stations in Pretoria in the Gauteng province of South Africa. In addition, there is a spatial and seasonal variation in rainfall in South Africa. The Gauteng province receives most of its rainfall in summer in the form of orographic or convective showers. At the same time, the Western Cape experiences winter rainfall that originates from cyclones over the South Atlantic (du Plessis and Schloms 2017; Roffe et al. 2019). Therefore, the derivation of storm event duration and rainfall amounts within the WRSM/Pitman model may be more suitable for areas with convective rainfall (locally limited), with a prevalence of cumulonimbus-induced thundershowers and frontal rainfall.

The semi-arid climate may have also influenced calibrations of the WRSM/Pitman model in the Western Cape. Andersson et al. (2011) also noted challenges using hydrological models in arid climates in Southern Africa. Flood design estimation is challenging in arid and semi-arid areas owing to the high variability in rainfall, inter-annual vegetation cover and intermittent streamflows (Zaman et al. 2012). Flow measurement may also be problematic in semi-arid areas due to siltation of inlet pipes at river gauges (Zaman et al. 2012). Cordery and Fraser (2000) stated that flooding events that occurred in arid regions were usually a result of high-intensity storms over a smaller area of the catchment. Therefore, flooding variability is much greater across arid regions than in other areas (Zaman et al. 2012; Cordery and Fraser 2000; Farquharson et al. 1992).

Poor hydrological model calibrations may also be attributed to catchment size variability. Smaller catchments are more challenging to model as the drivers of hydrological flow, beyond surface runoff and groundwater processes, may be significant. Watson et al. (2021) noted that for small catchments in the Western Cape, such as the Verlorenvlei and Bot, evapotranspiration, simulating days of zero flow and baseflow could dampen the hydrological response of catchments. Furthermore, the use of lumped hydrological models, such as the WRSM/Pitman model, can be problematic as they cannot simulate certain model elements, such as slow declines in groundwater storage (Watson et al. 2021).

3.3 Instantaneous peak flows

Figure 4 depicts changes in the mean daily IPF (m3/s) under the control and future climate scenarios and time periods based on the Fuller and Sangal IPF estimation methods. IPF estimates using the Fuller method are higher under all scenarios for both sites compared to estimates based on the Sangal method. There is agreement between both methods on the decrease in mean daily IPFs detected across all scenarios for both sites. The percentage decrease in mean daily IPF is largest under the RCP 8.5 2070–2100 scenario at both sites. The higher percent decreases in IPF are present for the Little Berg River site. Under the Fuller method, there is between a 38.95 and 85.76% decrease in mean daily IPF across all scenarios. Additionally, in the case of the Jonkershoek site, percentage decreases in mean daily IPF are lower in comparison to the Little Berg River. Under the RCP 4.5 2030–2060, RCP 4.5 2070–2100 and RCP 8.5 2030–2060 scenarios, mean daily IPF decreased by − 27.94, − 30.64, and − 31.91%, respectively (Fuller method). Tables 6 and 7 summarize the percentage changes in estimated mean daily IPF for the control and future climate scenarios across selected sites using the Fuller and Sangal methods, respectively.

Fig. 4
figure 4

Estimated mean daily IPF (m3/s) with standard deviation under control and future climate scenarios for the Sangal and Fuller methods (GCM Ensemble; Downscaling Method: QDM)

Table 6 Percentage change in Fuller estimated IPFs (m3/s) between control and future climate scenarios across selected sites in the Western Cape (GCM Ensemble; Downscaling Method: QDM)
Table 7 Percentage change in Sangal estimated IPFs (m3/s) between control and future climate scenarios across selected sites in the Western Cape (GCM Ensemble; Downscaling Method: QDM)

The finding of reduced mean daily IPFs under future climate change concurs with other studies undertaken in the region. Edokpayi et al. (2020) reported a projected reduction of 20–60% in annual runoff under an unconstrained mitigation scenario, while a 5–20% decrease could be realized under a constrained scenario. A study of the Eerste River in the Western Cape also found a projected reduction in river flows of between 8 and 18% (Du Plessis and Kalima 2021), while Cullis et al. (2015) noted that flow reductions were notable in the southwestern Cape catchments under all climate models assessed. Decreases in mean daily IPF follow the decreases in historical river flow across the Western Cape identified by Lakhraj-Govender and Grab (2019).

Studies evaluating the impact of climate change on hydrological flow have found that projected rainfall plays a significant role in flow variability (Arnell and Gosling 2013). This is also true in South Africa (Edokpayi et al. 2020). A study by Davis et al. (2010) found that an 8% reduction in annual rainfall could result in a 30 and 31% decrease in surface runoff and groundwater, respectively. In addition, reduced IPFs under future climate change can be attributed to the potential increase in evaporation rates and temperature and changes to the timing of rainfall. Arnell (1999) predicted a reduction in river flow of between 26 and 40% of the Zambezi River system and a 40% increase in evaporation. Reduced river flow cannot be solely attributed to climate change. Lakhraj-Govender and Grab (2019) stated that historical variations in river flow indicated that both anthropogenic (water abstraction for agriculture and land use change) and natural (increased evaporation, ENSO phases, catchment size) factors were influential.

3.4 Flood frequency analysis

3.4.1 Distribution fitting and selection

Based on the Kolmogorov–Smirnov, Cramér–von Mises and Chi-squared goodness of fit tests with a confidence level of 95% (α > 0.05), the LP3 distribution was selected as the most suitable distribution based on its consistency across all sites, scenarios and IPF and downscaling methods. For several scenarios under the DC method, the LN performed poorly in terms of its statistical significance under goodness of fit tests. A sample of the Kolmogorov–Smirnov goodness of fit tests are provided in supplementary material (Tables S4 to S15 in the Supplementary Information). Generally, the results demonstrate that the flood frequency distribution does not change under future scenarios. Figure 5 provides an overview of fitted distributions under control and future scenarios for the Little Berg River using the QDM downscaling and Fuller IPF methods. Figures S1S3 in the Supplementary Information depict distribution fitting for the Little Berg River using the IPF Sangal Method and both methods for the Jonkershoek.

Fig. 5
figure 5

Fitting LP3 distributions to IPFs for the control and future scenarios for Little Berg River (GCM ensemble mean; Downscaling Method: QDM; IPF Estimation Method: Fuller)

3.4.2 FFA by downscaling method

Figures 6 and 7 illustrate the change in FFCs under different statistical downscaling methods for the Jonkershoek and Little Berg River based on the Fuller and Sangal IPF methods. There is agreement between the LS and QDM downscaling methods for both sites where FFCs exhibit a decreasing trend across all future climate scenarios in comparison to the control scenario. Under the DC method, FFCs show a decreasing trend for high-probability events (1 in 2, 1 in 5, 1 in 10). At the same time, the FFC curves are higher than the control scenario for high-severity events (> 1 in 10). There is agreement in FFCs trends between the Fuller and Sangal IPF methods between both sites for the LS and QDM methods. Under the DC method, there is a difference detected between the Fuller and Sangal methods for the Little Berg River where the Sangal FFCs under future scenarios show a decreasing trend in comparison to the control scenario except for the RCP 4.5 2070–2100.

Fig. 6
figure 6

LP3 FFCs fitted under control and future scenarios for Jonkershoek and Little Berg River (GCM ensemble mean; Downscaling Method: DC, LS and QDM; IPF Estimation Method: Fuller)

Fig. 7
figure 7

LP3 FFCs fitted under control and future scenarios for Jonkershoek and Little Berg River (GCM ensemble mean; Downscaling Method: DC, LS and QDM; IPF Estimation Method: Sangal)

As noted in Figs. 6 and 7, the DC method performed inconsistently across the Jonkershoek and Little Berg River sites compared to the LS and QDM methods. The DC method assumes stationarity as the relative correction factor is applied to the control period observed rainfall. The method maintains wet days and reduces the daily climate variability associated with raw GCM outputs (Tabari et al. 2021). However, Shaw and Riha (2011) noted that the use of DC downscaling resulted in an underestimation in the range of uncertainties from the GCM or RCM outputs as it was based on changes in mean climate across periods.

In this study, there was agreement between the LS and QDM methods regarding the direction of FFCs; however, quantiles generated using the LS approach were generally lower compared to the QDM approach. This suggests that peak flows may be underestimated under the LS approach. Soriano et al. (2019) and Dobler et al. (2012). Soriano et al. (2019) note that only advanced statistical downscaling methods, such as the QDM approach, were suitable for reproducing peak flows. Quantile mapping methods are also recommended for temperature projections (Fan et al. 2021). More broadly, Chen et al. (2011) state that the choice of the downscaling method is the largest source of uncertainty with respect to specific elements that might influence FFA, such as spring mean discharge, annual low flow, and peak discharge.

3.4.3 FFA by climate model

Figures 8 and 9 depict FFCs for different climate models at the Little Berg River and Jonkershoek sites under the QDM downscaling and Fuller IPF methods. For the Little Berg River site, FFCs representing individual downscaled GCMs generally exhibit a decreasing trend for future scenarios compared to the control scenario for high-probability events. In contrast, selected individual GCMs result in an increasing flood frequency trend. For example, under the RCP 4.5 2030–2060 scenario, the CanESM2, MRI-CGCM3 and ACCESS1-0 are higher than the control scenario for return periods greater than 1 in 25.

Fig. 8
figure 8

LP3 fitted FFCs by climate model for future scenarios at Little Berg River (Downscaling Method: QDM; IPF Estimation Method: Fuller)

Fig. 9
figure 9

LP3 fitted FFCs by climate model for future scenarios at Jonkershoek (Downscaling Method: QDM; IPF Estimation Method: Fuller)

The FFCs for individual GCMs for Jonkershoek lie closer to the control scenario FFC for the RCP 4.5 2030–2060, RCP 4.5 2070–2100 and RCP 8.5 2030–2060, suggesting that there is less variability between GCMs and greater model agreement regarding the decreasing trend in FFCs under future climate scenarios. Under all future scenarios for both sites, FFCs estimated using the GCM ensemble mean are below the control scenario in addition to a majority of individual GCMs.

GCMs may vary in their ability to reproduce rainfall distribution statistics, seasonality patterns, and extremes (Hughes et al. 2014). According to Arnell et al. (Arnell et al. 2003), GCMs that overestimate rainfall also tend to overestimate runoff. Therefore, the choice of GCM is crucial in determining the direction of the FFCs across the Jonkershoek and Little Berg River sites. This finding aligns with Camici et al. (2014) and Kay et al. (2009). Furthermore, FFCs based on IPFs may introduce further complexity if GCMs cannot capture extreme rainfall statistics. Andersson et al. (2011) found decreased river flow; however, there was no consensus between climate models on increased risks of high flows.

GCMs inherently possess systematic errors because of conceptual deficiencies (Stergiou et al. 2021). In addition, the simulation of extreme precipitation continues to be an issue within GCMs owing to variations in model considerations of diurnal cycles in convective rainfall and its integration at coarse resolutions (Bulti et al. 2021; Chokkavarapu and Mandla 2019; Dedekind et al. 2016). This is also the case in the Western Cape, where despite the broad consensus regarding the projected decreases in winter rainfall, there are large areas of disagreement between models. Hewitson and Crane (2006) also suggest that this may be owing to the varied precipitation parameterization within GCMs and the integration of local conditions and feedback mechanisms (Andersson et al. 2011).

GCMs may be able to simulate extreme rainfall over the west coast regions (Mason and Joubert 1997). However, GCMs tend to overestimate rainfall over the southwest coast of South Africa (Mason and Joubert 1997). Studies evaluating GCM skill in South African catchments conclude that GCM models such as those developed by Goddard Institute for Space Studies and Max Planck Institute are more skilled than others (Hughes et al. 2014). However, these models may still vary in their consistency across coastal and inland regions. Hughes et al. (2014) noted that only a limited reduction in the overall uncertainty is attained when using skilled GCM outputs in a hydrological modelling context.

Changes in rainfall can alter the distribution of water spatially and temporally within a catchment area (Edokpayi et al. 2020). The use of coarse or low-resolution GCMs (300 km × 300 km) can be problematic as there is a scale mismatch with hydrological models, which typically operate at the catchment scale (± 50 km × 50 km) (Mareuil et al. 2007; Chokkavarapu and Mandla 2019; Watson et al. 2021). Hydrological flows depend on feedback loops and dependencies within different processes and land features (Watson et al. 2021). Finer resolution climate models would be able to capture variables at the local scale more accurately, thereby ensuring that investigations of hydrological catchments are more robust (Andersson et al. 2011). In the Western Cape, there is a strong correlation between elevation and rainfall; for example, high flows in the Breede and Berg catchments are derived from surface run-off from increased precipitation in mountainous areas (Watson et al. 2021). Simulating precipitation over pronounced topography is a notable challenge for GCMs (Andersson et al. 2011). Therefore, extreme flows and, consequently, FFCs may be underestimated in the Western Cape. Furthermore, if GCMs do not correctly capture spatial patterns of rainfall, extreme flows may be incorrectly located.

RCMs could address the scale issue if they are able to appropriately capture regional and local climate (Arnell and Jones 2003). Using RCMs is only advantageous if the driving GCM can capture regional variability in projected climate variables. As Andersson et al. (2011) mentioned, the forcing GCM is still responsible for transmitting the broad signal. The reliability of RCM projections is relatively unknown in the Western Cape (Abiodun et al. 2016). All Coordinated Regional Downscaling Experiment RCMs under-estimate extreme rainfall threshold values in the Western Cape (Abiodun et al. 2016).

To overcome the variability in extreme precipitation outputs from GCMs, a GCM ensemble is recommended (Fan et al. 2021; Camici et al. 2014; Qin and Lu 2014). The structure of the GCM ensemble is critical. In this study, FFCs related to the GCM ensemble mean mostly exhibited a decreasing trend in comparison to individual GCM FFCs and the FFC of the control period (Figs. 8 and 9). This finding is highlighted by Giuntoli et al. (2021). The multi-model GCM ensemble based on the mean across GCMs can suppress AMS for distribution fitting. This can complicate the assessment of the direction of FFCs under future climate scenarios. Other studies have attempted to overcome this issue by using the median across the GCM ensemble (Qin and Lu 2014) or more advanced methods such as Bayesian hierarchical models (Giuntoli et al. 2021).

3.4.4 FFA by IPF method

Figure 10 depicts the fitted FFCs for the GCM ensemble mean under the Fuller and Sangal IPF estimation methods. For both IPF estimation methods, a decreasing trend is evident in all scenarios across both sites compared to the control scenario. At the Little Berg River location, IPF for the 1 in 2 flood magnitude decreased by between 7.68 and 20.55% under the Fuller method. The highest percentage decreases have occurred within the RCP 8.5 2070–2100 scenario for all return periods. The percentage decreases for the 1 in 100 flood magnitudes range between 65.84 and 69.08%. The Jonkershoek site exhibits a similar trend to the Little Berg River site. Under high probability events such as the 1 in 2 return period, percentage differences between estimated quantiles are small across all scenarios, ranging from a 0.53% increase to a 3.44% decrease. In comparison, a range of percentage decreases between 39.22–83.67% is evident for the 1 in 100 return period. Quantile estimates for Little Berg River and the Jonkershoek for the 2-, 5-, 10-, 25-, 50-, and 100-year return periods under control and future climate scenarios are provided in Tables 8 and 9, respectively.

Fig. 10
figure 10

LP3 fitted FFCs under control and future scenarios for Jonkershoek and Little Berg River (GCM ensemble mean; Downscaling Method: QDM; IPF Estimation Method: Fuller and Sangal)

Table 8 Design flood estimates for the Little Berg River under control and future climate scenarios (GCM Ensemble; Downscaling Method: QDM; IPF Estimation Method: Fuller)
Table 9 Design flood estimates for the Jonkershoek under control and future climate scenarios (GCM Ensemble; Downscaling Method: QDM; IPF Estimation Method: Fuller)

A comparison of the IPF estimation method is not possible within this research as observed data is unavailable at the sub-daily timestep. Differences between the Fuller and Sangal methods in determining the direction of FFCs are discussed in Almasi and Soltani (2017). This study notes that the Fuller IPF estimation method may be less efficient in the Western Cape as it only integrates catchment area into its formulation, excluding other variables such as slope, land use and land cover, soil drainage and moisture. Hydrological flow in the Western Cape is driven by surface runoff from mountainous regions and regional ground flow (Watson et al. 2021). Therefore, slope-related variables should be integrated into IPF estimation approaches.

Lee et al. (2020) undertook a comparison of empirical IPF methods. The study found that the Fuller method had the worst performance, leading to a significant underestimation of peak flows. However, others, such as Maghsood et al. (2019) and Jimeno-Sáez et al. (2017), successfully estimated IPFs using the approach. Several studies have adjusted the Fuller equation for use in different regions (Jimeno-Sáez et al. 2017). The Sangal method may also be inefficient and has been found to underestimate (Loyeh and Bavani 2021) and overestimate peak flows (Chen et al. 2017). The issue is acute in studies focused on small catchments or large flow events. The Sangal method was developed on a peak flow dataset during snowmelt (Chen et al. 2017). Therefore, its performance in the Western Cape may not be appropriate as extreme rainfall events typically drive high flows.

4 Conclusions and recommendations

This study found that FFCs using a GCM ensemble based on mean AMS exhibited a decreasing trend at the Little Berg River and Jonkershoek sites. These results were found for the LS and QDM downscaling methods (irrespective of the IPF method). In contrast, an increasing trend was found for low probability return periods under the DC method. The QDM is deemed a suitable downscaling method for projecting extreme peak flows. Despite the consensus in the decreasing trend in FFCs for the Sangal and Fuller methods using QDM downscaling, individual GCMs were found to significantly influence the direction of FFCs. Therefore, the selection of GCMs in an ensemble is critical. The assessment of suitable GCMs for climate change FFA cannot simply rely on their projection skill; they must also consider their applicability to the regional context, their ability to integrate local scale features and their precipitation parameterization approaches. It is important to note that although empirical downscaling can allow for projections that are consistent with circulation patterns, it is not capable of integrating local-scale feedback (Hewitson and Crane 2006). Other downscaling methods could be used to enhance the FFA in South Africa. Hewitson and Crane (2006) showed that the self-organizing maps downscaling approach has been successful in capturing the spatial pattern of rainfall across the Western Cape. There is consensus between IPF methods; however, quantile estimations are lower under the Fuller method in comparison to the Sangal method for high-magnitude events.

Hydrological homogeneity in the future cannot be assumed owing to climate change drivers. Therefore, incorporating climate change aspects into FFA must be carefully considered. Climate change FFA is limited because methodologies, approaches and techniques emanate from two separate disciplines: hydrology and climate science. Variations in the direction of the FFCs are caused by uncertainty from methodological choices with respect to GCMs, downscaling approaches and hydrological models and their calibration (Xu et al. 2005). Further research is required if quantile estimations can be used for practical applications. Future studies on climate change FFA in South Africa should incorporate other advanced statistical and dynamical downscaling methods, alternative approaches to extract the GCM ensemble peak flows and GCM skill evaluations for precipitation. This study concludes with the following recommendations that aim to provide methodological insights based on this research for future studies conducted in the region.

4.1 Recommendations

Overcoming data availability challenges: Smithers (2012) notes the need for design flood estimation in smaller South African catchments. However, most of these catchments remain ungauged; therefore, observed discharge data is unavailable for these locations. Since the 1970s, there has been a notable decline in the number of operational rainfall stations providing reliable hydrological information; this is problematic (Pitman and Bailey 2021). Future studies could incorporate the use of satellite quasi-precipitation products such as the Climate Hazards Group InfraRed Precipitation with Station data (CHIRPS), which can overcome reliability and completeness issues of observed rainfall datasets in South Africa (Pitman and Bailey 2021).

GCM Uncertainty: It is widely acknowledged that the uncertainty within GCMs can play a crucial role in influencing the reliability of hydrological studies incorporating climate change considerations (Hawkins and Sutton 2011; Najafi et al. 2011). Future studies must continue to assess GCM performance within climate change FFA studies (Bannister et al. 2017; Miao et al. 2014) while incorporating the use of multi-model GCM ensembles can enhance the reliability of future climate projections (Jose et al. 2022). RCMs may be better suited to hydrological studies that cover a catchment scale or areas with complex topography (Kim et al. 2021).

Downscaling methods: Future climate change FFA studies should focus on the use of more advanced statistical downscaling methods such as quantile mapping and stochastic weather generators to better assess flood frequency trends in the Western Cape. Quantile mapping methods are advantageous as they are able to simulate peak flows in comparison to the DC and LS method. Khazaei et al. (2012) proved the feasibility of using a weather generator to simulate rainfall and temperature data as inputs of a rainfall-runoff model for climate change impact assessment on floods in Iran. In addition, the study highlighted that weather generators could produce long-time series, which can overcome the issue of short record lengths in FFA.

Hydrological Modelling: Hydrological models that exhibit the storm duration and rainfall trends of the study area must be selected. In this study, the WRSM/Pitman model limited the analysis as successful calibrations were not possible at all sites. Often, hydrological models are “black boxes” that do not disclose their mathematical framework. This results in difficulty in determining whether the hydrological model is suitable or how the modelling framework can be adapted to be locally appropriate. In terms of applying the WRSM/Pitman model in the Western Cape, mathematical equations describing the relationship between the storm event duration and rainfall would need to be redeveloped using historical data from weather stations based in the province.