Introduction

During late 2019 and early 2020, unprecedented high lake water-levels were observed in Lake Victoria, resulting in massive flooding in the lake-adjacent areas1. The high-water-levels re-ignited the scientific debate on the impacts of a changing climate on Lake Victoria’s water budget. Indeed, the impacts of a changing climate in East Africa2,3 on the biodiversity of the Lake Victoria Basin (LVB) are already being observed at multiple biological levels ranging from genes to biomes. Some of these impacts include enhanced spatio-temporal precipitation variability, a reduction in biodiversity (including fish), periodical lake water-level and quality fluctuations, and dwindling crop yields and emergence of crop diseases in the lake-adjacent farming areas4,5. Given that the lake’s fishes, and biodiversity in general, are significantly sensitive and vulnerable to climate change, adequate monitoring and appropriate conservation actions are required to minimize the impacts of climate change on the lake’s biodiversity.

Some work has been done towards understanding precipitation patterns and associated water-level fluctuations over Lake Victoria and its basin. For instance, the lake has been found to be notorious with intense lightning and convective storms6. The lake’s severe weather and water currents have also been linked to boat accidents linked to approximately 5000 deaths on the lake annually7,8. However, the understanding of weather and climate dynamics in the LVB remains limited due to, in part, the inability of current climate models to adequately resolve weather and climate dynamics over Lake Victoria9,10,11. This paper complements earlier works by using bias-corrected model data from the sixth phase of the Coupled Model Intercomparison Project (CMIP612) to generate the most realistic estimates of future precipitation patterns over the study domain.

Methods

Study area

The study focussed on Lake Victoria and its immediate adjacent catchment area (Fig. 1). The lake is located at an altitude of about 1135 m above sea-level, spanning an area of about 68,000 square kilometres13. It is Africa’s largest freshwater body and the world’s largest inland fishery supporting at least 40 million inhabitants14. The lake and its resources fuel East Africa’s economy with the lake’s catchment providing about 90% of hydropower for Uganda, Burundi, and Rwanda. The lake’s basin (LVB) is renowned for its richness and endemism in species4,15,16. It has massive natural resources including forests, fisheries, rangelands, and wetlands from which communities in and around the basin draw their livelihoods. During the dry season over the Ethiopian highlands from whence the Nile’s main tributary (the Blue Nile) comes, the White Nile—the only outlet for Lake Victoria—contributes about 70% of all water reaching Egypt17.

Figure 1
figure 1

A topographic map of the study domain produced using three primary layers, namely, elevation data sourced as a 30 × 30-m raster file from the United States Geological Survey (USGS) archive (https://earthexplorer.usgs.gov/), administrative boundaries sourced from the Global Administrative Areas (GADM) archive (www.gadm.org/download), and water bodies sourced from the World Bank archives (https://datacatalog.worldbank.org/search/dataset/0040797). All cartographical work was performed in ArcGIS Desktop 10.8.1 version 10.8.14362 (https://www.esri.com/).

The LVB region experiences complex meteorological and climatic conditions influenced by local and large-scale systems. For instance, the region is punctuated with large and small lakes that strongly modify the mesoscale atmospheric dynamics by creating a thermal contrast with the adjacent land areas. The thermal contrast contributes to convection over the lake during the night while restraining it during the day18,19. Additionally, the katabatic and anabatic winds induced by the mountainous areas in the LVB modify precipitation characteristics20 while storms on land influence the region’s temperature, humidity, and atmospheric conditions—resulting in strong nightly storms over Lake Victoria19. The mountainous areas in and around the LVB strongly influence the flow of large-scale moisture-carrying winds, resulting in modulated precipitation patterns9,21. Modelling these complex local and large-scale systems, whose understanding is still limited22, is quite challenging and efforts are underway to improve their representation in regional and global models22.

Data

Three gridded data products were used as reference datasets in the current study, including the Climate Hazards Group InfraRed Precipitation with Station data, version 2.0 (hereinafter CHIRPS). CHIRPS is a quasi-global precipitation dataset available from 1981 to near present, spanning 50°S–50°N and all longitudes. It incorporates climatology, in-situ station data, and satellite imagery to form a gridded precipitation time series23. The current study used the monthly 0.25° by 0.25° resolution dataset, downloaded from https://www.chc.ucsb.edu/data.

The second dataset used is the Multi-Source Weighted-Ensemble Precipitation version 2.8 (hereinafter MSWEP), downloaded from https://www.gloh2o.org/mswep/. MSWEP is a global 3-hour 0.1° resolution precipitation product that merges gauge, reanalysis, and satellite data to obtain high quality precipitation estimates across the globe24. Lastly, we used the Global Precipitation Climatology Centre (GPCC) full data monthly product version 2020 (v2020) at 0.5° by 0.5° resolution, obtained from the NOAA PSL, Boulder, Colorado, USA via https://psl.noaa.gov. The full data product is based on quality-controlled data from 67,200 observation stations worldwide. The dataset contains monthly totals on a regular grid with precipitation anomalies at the stations interpolated before being superimposed on the GPCC climatology v2020 in the corresponding resolution25. These datasets have been validated and used widely over the study domain (e.g.,2,26,27).

To assess future precipitation patterns over the study domain, we used 17 bias-corrected general circulation model (GCM) datasets (Table 1) from the NASA Earth Exchange Global Daily Downscaled Projections (NEX-GDDP-CMIP6). The NEX-GDDP-CMIP6 datasets are produced using a daily variant of monthly bias correction/spatial disaggregation method, resulting in a 0.25-degree horizontal product28. They are downscaled historical and future (spanning the period 1950–2100) climate projections based on CMIP6 model output, including datasets from the ScenarioMIP model runs29,30 that are based on the Shared Socioeconomic Pathways (SSPs31) Simulations from the ScenarioMIP provide the basis for investigating various science and policy questions particularly relevant to scenario-based analysis such as the role of specific forcings29. The current study used projections under the SSP5-8.5 emission scenario due to its ability (unlike lower emission scenarios) to reproduce global warming projections exceeding 2.0 °C32.

Table 1 NEX-GDDP-CMIP6 datasets used in the study. All the data have a spatial resolution of 0.25 × 0.25 degrees and are based on SSP5-8.5.

Analysis

First, we assessed the performance of the bias-corrected CMIP6 model simulations in reproducing precipitation patterns over the study domain, relative to observations. Here, we computed differences in mean spatial and temporal values between observations and model simulations. The performance assessment was done with reference to the 1985–2014 baseline corresponding to the last 30 years of the CMIP6 historical runs which cover the period 1850–2014. The 1985–2014 period is widely used in CMIP6 projection analysis studies (e.g.,49,50).

Secondly, an assessment of future precipitation patterns over the study domain was done by calculating differences in mean values between the future period and the control period (1971–1999), using the annual precipitation indices and statistics presented in Table 2. Here, the periods 2020–2049 and 2070–2099 were used to represent the “near future (MID)” and “future (FUT)”, respectively.

Table 2 A list of annual precipitation indices and annual precipitation statistics used in the current study.

Additionally, we computed mean annual (ANN) and seasonal (MAM, JJA, and OND) precipitation values over the study domain. We also used the 99p–90p statistic (Table 2) to represent the right tail of precipitation distribution over the study domain51. Here, 90p and 99p were calculated by accumulating daily precipitation values over each grid point with 99p and 90p, representing very intense and heavy precipitation events, respectively. The percentiles for 99p and 90p were calculated for all days in the data.

All statistical computations were done on the native data grids before bilinearly interpolating the observational datasets (with relatively finer resolutions than CMIP6 model data) onto CMIP6 model data grids to facilitate comparison as described in52,53. When computing differences in mean values for various variables, we used the Student’s t test54 to show significant values at 99% confidence level.

Results and discussion

Validation of CMIP6 historical simulations over LVB

A plot of precipitation climatology over the study domain (Fig. 2, top panel) shows MAM to be the wettest season followed by OND. Relatively more precipitation is recorded on Lake Victoria than the onshore areas, with the western part of the lake showing a wetter regime than the rest of the lake during MAM and OND seasons. A similar pattern is recorded for the OND season, except with lower precipitation values than those of the MAM season. The JJA season is relatively dry with most of the domain recording less than 50 mm/month of precipitation. During this season, the southern part of the domain is drier than the northern part, with precipitation values over parts of western Kenya (north-east of the study domain) exceeding 150 mm/month. Annually (ANN), most of the study domain records at least 100 mm/month with areas over the lake and western Kenya recording the highest values. The highest mean precipitation values (up to 300 mm/month) are recorded in the MAM season while the least values (less than 50 mm/month) are recorded in JJA.

Figure 2
figure 2

Precipitation climatology over the study domain using MSWEP data (top panel), differences in mean precipitation between an ensemble mean of the top five performing models (mid panel), and annual cycles for all the three reference datasets and all the 17 model datasets (bottom panel). Stippling (mid panel plots) shows significant values at 99% confidence interval. All units are in mm/month.

A plot of the precipitation climatology over the study domain showed no discernible differences between the three reference datasets (Supplementary Information, Fig. S 1). MSWEP and CHIRPS data showed similar results, with GPCC showing marginally wetter conditions. Hence, MSWEP was chosen as the reference dataset for the model validation part of the study. Here, models generally showed a dry bias relative to MSWEP data with biases ranging roughly between -50 and 50 mm/month for most of the models (Fig. 2, mid panel; Supplementary Information, Figs. S 1, S 2, S 3, S 4 and S 5). The greatest biases (below-100 mm/month) were recorded over the western part of Lake Victoria (around 0.5° South and 32.3° East). However, the biases recorded for most models were insignificant (at 99% confidence level) over most of the domain. Further, an ensemble mean for the top-five models with the least biases showed a better performance than an ensemble mean of all the 17 models used in the study. The top five models (CMCC, CNRM, GFDL, MPI_LR, and NorESM2_LM) were selected for featuring among the top performers per category (Supplementary Information, Figs. S 1, S 2, S 3, S 4 and S 5).

Apart from KACE, all other models reproduced the study domain’s annual cycles in concert with the reference datasets (Fig. 2, bottom panel). Notably, our results show a departure from the traditional rainy seasons; extending the MAM season to include February and May (with a peak in April), and the OND season to begin in September with a peak in November. This observation points to a potential influence of Lake Victoria, which tends to have a different climatology than its neighbouring areas55,56,, on the study domain's climatology.

Generally, the bias-corrected CMIP6 model data used in the current study could reproduce the study domain’s climatology with good skill. The observed minimal biases were expected due to the relatively coarse spatial resolution of CMIP6 model outputs as well as the study area’s strong local climate systems and mesoscale convective systems9,57. Additionally, the skill of the current climate models is limited by the current understanding of the climate system (including the interactions between the atmosphere and the ocean) coupled with limitations in computational technology10.

Analysis of future precipitation patterns over Lake Victoria Basin

An analysis of future precipitation patterns over the study domain shows a general increase in mean precipitation for all the seasons considered (Fig. 3, top and mid panels). Most of the domain records significant changes in precipitation with more changes likely to occur over Lake Victoria and its surroundings than the rest of the study domain. More changes are shown for the end of the century period compared to the mid-century period, for all the seasons. All the models agree on the change signal for ANN (Supplementary Information, Fig. S 6) while 88% of the models agree on the signal (wet) for OND (Supplementary Information, Fig. S 9). A lower model agreement (70%) is seen for the MAM season compared to ANN and OND while JJA records a 52% model agreement (Supplementary Information, Fig. S 8).

Figure 3
figure 3

Changes in mean annual (ANN) and seasonal (MAM, JJA, and OND) precipitation climatology for the period 2040–2069 (MID; top panel) and 2070–2099 (FUT; mid panel) relative to the 1985–2014 period (CTL), using an ensemble mean of the top-five models. Stippling (for spatial plots) shows significant values (at 99% confidence level). Bottom panel presents mean values for ANN, MAM, JJA, and OND averaged over the study domain for the CTL, MID, and FUT periods, for all the models. All units are in mm/month.

A look at the precipitation patterns averaged over the study domain (Fig. 3, bottom panel) showed a general increase in precipitation for ANN, MAM, and OND. In the 2050s (MID), precipitation over the study domain would increase by about 5% for ANN, MAM, JJA, and OND, relative to the baseline period (CTL). More changes would be expected by the end of the century for ANN (16%), MAM (10%), and OND (18%), respectively. No discernible changes are recorded for JJA.

At the daily timescale, all the three statistics analysed in our study (SDII, Rx5day, and 99p–90p) showed a significant increase by the mid and end-of-the-century periods (Fig. 4, bottom panel). Here, precipitation intensity (SDII) showed a potential increase of 6% and 16% by mid and end of the century, respectively. The changes were more pronounced over the eastern part of Lake Victoria and western Kenya (around 1° South and 34° East). Greater changes (18% and 29% for mid and end century periods, respectively) were recorded for the maximum 5-day precipitation (Rx5day). Just like in the SDII case, changes in Rx5day were more pronounced over Lake Victoria and its environs (centre of study domain). The 99p–90p statistic showed the greatest changes with a mean increase of about 22% and 47% for mid and end century periods, respectively. Again, the changes were more pronounced over Lake Victoria and its environs than the rest of the study domain.

Figure 4
figure 4

As in Fig. 3 but for SDII, Rx5day, and 99p–90p. SDII and 99p–90p are in mm/day while Rx5day is in mm.

The projected changes in precipitation patterns over the study area may be attributed to an increase in moisture content9 resulting from changes in local and mesoscale feedbacks58 as well as changes in mesoscale dynamics that affect thermal inertia of the lakes in the study area9. Additionally, increased warming may also alter large-scale atmospheric dynamics (such as patterns in moisture carrying lower tropospheric easterlies over the study area) and influence synoptic systems; potentially resulting in enhanced moisture convergence over the study domain59,60. We recommend a detailed detection and attribution study to enhance the understanding of potential drivers and their exact influence on the projected precipitation patterns over the LVB.

Conclusion

Our findings indicate that more significant precipitation changes over the LVB are likely to occur at the sub-seasonal level compared to the seasonal and annual scales. Considering the relatively high sensitivity of our study domain to weather and climate change and variability, our findings contribute to the scientific evidence that can inform disaster risk reduction and climate change adaptation strategies for the study domain. Specifically, the projected increase in the width of the right tail precipitation distribution (99p–90p) and intensity (SDII) call for more robust measures to minimize disaster impact in a region where approximately 5000 people (mostly fishermen) perish annually on Lake Victoria from extreme weather-related accidents8. The projected increase in precipitation intensity, coupled with an intensification of hazardous thunderstorms over Lake Victoria61,62, call for adaptive capacity strengthening for fishermen and lake-adjacent communities most of whom are vulnerable to disaster risk associated with extreme weather and climate events63.

Moreover, the projected changes over Lake Victoria and its adjacent land areas were more pronounced than over other parts of the study area. In a region characterized by perennial weather-related disasters and extreme weather events, the projected changes may cause further disruption to the wellbeing of the region’s communities and systems. Additionally, the projected changes have a substantial implication for the study domain, and the larger Nile Basin Region, which is currently experiencing a rapid population density increase64 coupled with conflicts over water resources65,66. Hence, enhanced production and use of climate services is recommended to minimize the risk posed by the projected changes and, ultimately, enhance the socio-economic wellbeing of communities in the LVB. Results from the current study provide a basis for further research to enhance the understanding of climate change and variability over the LVB and inform adaptive capacity strengthening measures for the region.