1 Introduction

The data set produced in the third phase of the Coupled Model Intercomparison Project (CMIP) formally known as CMIP3 data set (Meehl et al. 2007) has been widely used by the climate scientists to examine the potential future climate conditions all over the globe. Until now, there are more than 600 published documents available on different climatic issues using CMIP3 data set. A list of the publications using CMIP3 data set is available on http://www-pcmdi.llnl.gov/ipcc/subproject_publications.php. Among them, only one study addresses the potential future climate scenarios over the Arabian Peninsula using the CMIP3 data set (Almazroui et al. 2016). The previous studies (e.g., Almazroui et al. 2013, Barfus and Bernhofer 2014) examined the performance of CMIP3 models over the Arabian Peninsula in the present day climate. Some other studies also provided insight into recent climate change and extremes over the Arabian Peninsula (e.g., Almazroui et al. 2012a, b; Deng et al. 2015; Donat et al. 2014; El Kenawy et al. 2014; AlSarmi and Washington 2011, 2013, Kwarteng et al. 2009; Evans 2009). However, a comprehensive analysis of the uncertainties associated with future projections over the Arabian Peninsula for different categories of CMIP3 models including well performing models compared to the observations is missing in the literature.

The Arabian Peninsula has been neglected in the climate change research compared to the other areas in the world. The international initiatives made by Intergovernmental Panel on Climate Change (IPCC) have placed the Arabian Peninsula at the edge of Asian and African regions, and hence, very less information is available to understand the future climate projections and associated uncertainties over this region. The uncertainties in the future projections vary from region to region and model to model and, additionally, depend on various climatic parameters. For examples, over the tropics, the signal-to-noise ratio is lowest for precipitation, whilst it is highest for temperature (Hawkins and Sutton 2009). Moreover, the uncertainty of precipitation in the CMIP3 models for the 21st century is very large and different models even disagree on sign of change in trends (Biasutti et al. 2008). The precipitation will increase (decrease) in some (other) regions at the end of 21st century, whilst temperature will increase almost everywhere, though magnitude of increase/decrease varies from place to place and model to model (Meehl et al. 2005; Almazroui et al. 2016).

The decision-makers and stakeholders in the Arabian Peninsula are looking for reasonable climate projections for their adaptation and strategic planning. Since there are large uncertainties in climate projections, an alternative to reducing the spread could be useful to develop adaptation strategies for most extreme ensemble members (e.g., Daron et al. 2014; Whetton et al. 2012; Hallegatte 2009). Since there is no common sense what the best strategy is, the filtering out of well performing models for climatic parameters (such as precipitation and temperature) could be beneficial for climate change impact studies over the Arabian Peninsula. The main objectives of this study are (1) categorization of the CMIP3 models into different groups based on their individual performance in simulation of present day climate over the Arabian Peninsula and (2) assessment of uncertainties in the simulated potential future climate over the Arabian Peninsula by the different categories of CMIP3 models.

In a recent study, Almazroui et al. (2016) examined the future changes in temperature and precipitation over the Arabian Peninsula using CMIP3 models without considering the performance of individual CMIP3 models in the present day climate. There is no comprehensive study over the Arabian Peninsula that assesses the uncertainties in the future climate projections for different categories of CMIP3 models. The present study fills this gap by first analyzing the CMIP3 data sets and categorizing the models into different groups based on their performance in the present day climate. Since the IPCC has also released its latest fifth phase of CMIP data (CMIP5) through its Fifth Assessment Report (AR5), in a subsequent study, an assessment of potential future climate and associated uncertainties over the Arabian Peninsula based on CMIP5 data set is documented.

The paper is organized as follows. Section 2 describes the data and methodology used in this paper. Section 3 presents the main results of this study. Summary and conclusions are given in the Sect. 4.

2 Data and methodology

2.1 Data and models

In the present study, the data from available 22 CMIP3 models from the CMIP archives (http://www-pcmdi.llnl.gov/) are analyzed. Table 1 shows a list of models used in this study. First, we examined the performance of the CMIP3 models over the Arabian Peninsula for the present climate (1970–1999). For this purpose, we first employed the observational data for surface air temperature and precipitation from 27 stations across Saudi Arabia which covers about 80% area of the Peninsula. To compare CMIP3 model data with station observation, we extracted data from the grid point where stations are located. The model data are grid averaged values, whereas station data represent point values and this may also introduce some additional uncertainty in the results. However, this is the most common technique used to compare the model data with station observations (Islam 2009; Islam et al. 2008; Islam and Uyeda 2007). To avoid this uncertainty, we also assessed the performance of CMIP3 models using gridded (1º × 1° grid) CRU observational data set obtained from the Climatic Research Unit (CRU TS 3.23, Mitchell and Jones 2005; New et al. 2000).

Table 1 CMIP3 models used in this study. The references for these models are available in Almazroui et al. (2016)

2.2 Methodology

Prior to analyzing the CMIP3 model outputs with different horizontal resolutions (see Table 1), all 22 CMIP3 model data sets are regridded to a common 1º × 1º grid. In the next step, the temperature and precipitation biases for individual models as well as for multi-model ensemble (MME) are calculated with reference to the observations for the present climate (1970–1999). Note that the MME is calculated with equal weight. Only grid points covering the land areas are taken into consideration for this analysis.

Based on simulated temperature and precipitation biases, standard deviation, annual cycle, systematic error (SyE), the normalized root-mean-square error (NRMSE), the standard error (SE), and the CMIP3 models are categorized into four different categories, namely (1) all 22 CMIP3 models, (2) precipitation-based models, (3) temperature-based models, and (4) best performing models. The models which simulated the domain averaged precipitation (temperature) bias between ±1 intermodel standard deviation are referred to as precipitation (temperature)-based models. The precipitation (temperature)-based models are further assessed in obtaining the annual cycle, SyE, NRMSE, and SE measures compared to the observations. Only those CMIP3 models which simulated low SyE, NRMSE, and SE together with annual precipitation (temperature) cycle that closely resembles to observations are kept into the above categories. The fourth category of the models is named as the best performing models. This category includes CMIP3 models having both precipitation and temperature biases within ±1 intermodel standard deviation. In addition, the best performing models simulated the annual cycles of temperature and precipitation that closely resembles to the observations and display small values of SyE, NRMSE, and SE compared to the other three categories (see Sect. 3 for further details).

In the next step, the changes in temperature and precipitation in the future climate and associated uncertainties are computed for the mid (2021–2050) and end (2070–2099) of the 21st century relative to the present climatology (1970–1999) for both all (22 CMIP3) and best performing models. The future projections are examined for the Special Report on Emission Scenarios (SRES) B1, A1B, and A2 (IPCC 2000). To quantify the uncertainties associated with projected temperature and precipitation over the Arabian Peninsula, we used ±1 intermodel standard deviation (STD) and the intermodel range between the minimum and maximum value as metrics for both the all and best performing models. We assessed the robustness of the future climate change for both categories of model ensembles. Following the previous studies (e.g., Almazroui et al. 2016; Heansler et al. 2013a, b; Solomon et al. 2007), we also employed a measure which describes the agreement of the different model projections in the direction of change. An increase (or decrease) in the projected signal is considered to be robust if at least 66% of the models agree in the direction of change. We used the Mann–Kendall trend test (Kendall 1976) to assess the significance of the trends in the temperature and precipitation over the Arabian Peninsula in the 21st century. In addition to above, we also examined the temperature projection for the future period by adding the present day bias to the projected temperatures. This is referred as the corrected temperature. Although, it is not necessary that the bias in the present climate continues in the future, but the bias corrected future values of the parameters may provide some reasonable estimates for long-term future planning.

3 Results and discussion

3.1 Analysis of present day climate (1970–1999) climate

Prior to the analysis of CMIP3 database, a brief description on the study area is important to understand the climatic conditions over the Arabian Peninsula. The Arabian Peninsula (comprising of Bahrain, Kuwait, Oman, Qatar, Saudi Arabia, United Arab Emirates, and Yemen) is located in the southwest Asia, which is actually the junction of Asia and Africa. The world’s largest continuous sand desert the Rub Al-Khali is located in Saudi Arabia, a unique country that covers about 80% area of the Arabian Peninsula (Almazroui et al. 2012a). The peninsula is surrounded by the water bodies such as the Red Sea (in the West), Arabian Gulf (in the East), and the Arabian Sea (in the South). Its climate is influenced by the Mediterranean Sea region in the winter season and by the South Asian Summer Monsoon in the summer season. The Sudan low also influences the climate of the peninsula. The southern part of the Arabian Peninsula (Saudi Arabia to western Yemen) is covered by the mountains with maximum elevation about 2200 m (Almazroui 2011). The mountain ranges have sharp elevation to the west near the Red Sea and they gradually decrease towards the East. The western sharp edge of the mountain facing the Red Sea plays an important role in the occurrence of heavy rainfall in the peninsula (Chakraborty et al. 2006). In general, the climate of the peninsula is characterized by arid and semi-arid (Almazroui et al. 2012b). The middle-to-north of Saudi Arabia is dry (below 150 mm), while the southwest of the peninsula is wettest (above 150 mm). The annual total precipitation varies from 25 mm (at a station Wejh, 26.20°N and 36.47°E) to 230 mm (at a station Abha, 18.23°N and 42.66°E), and is markedly influenced by seasonal variations (Almazroui et al. 2012a, b). The low temperature (below 21 °C) persists in the northwest and southwest of the peninsula while relatively high temperature (24–27 °C) in the middle-to-south. In the present day climate, the observed data show a significant decreasing trend (47.8 mm/decade) in the precipitation, while the mean temperature over the Arabian Peninsula significantly increased by 0.60 °C/decade (Almazroui et al. 2012a).

The annual mean temperature and precipitation and their standard deviation over the Arabian Peninsula for present day climate show that the eastern and central-western parts of the peninsula have the highest annual mean temperature (Fig. 1). On the other hand, the northwestern and southwestern mountainous regions receive slightly low temperature compared to the eastern and central-western parts. The temperature standard deviation is larger in the northern parts of the Arabian Peninsula compared to the southern and southeastern parts. The large variability in the annual mean temperature over northern parts of the Arabian Peninsula is generally associated with the eastward moving synoptic systems from the Mediterranean region throughout the year, particularly in the winter season that influence largely the near surface temperature over this region. The larger parts of the Arabian Peninsula, particularly the northwestern and over the sand desert Rub Al-Khali (Almazroui et al. 2012a, b), observe dry weather throughout the year. Precipitation is highly variable and the southwestern mountainous region of Saudi Arabia and Yemen receive the highest annual mean precipitation which mainly occurs in the summer season (Fig. 1c, d). In addition to the above the eastern parts of Yemen, Oman and the western parts of Jordan also receive enhanced annual mean precipitation. The precipitation over Yemen/Oman occurs in the summer months, whilst over Jordan, it is from the winter months. The standard deviation of precipitation is also large over these regions. Almazroui et al. (2012b) showed that a large proportion of the annual mean precipitation over the Arabian Peninsula is received during the wet season (November–March). Due to its geographical location, the larger contribution to the annual mean rainfall comes from extreme precipitation events that occur in a short interval of time (Almazroui et al. 2012a, b).

Fig. 1
figure 1

a Mean temperature and b standard deviation, based on CRU observations for the period 1970–1999. cd same as a, b except for precipitation. The unit of temperature is °C and precipitation is mm/year

As mentioned earlier, Almazroui et al. (2013) analyzed precipitation from 22 IPCC AR4 multi-model data sets for the period (1979–2000) and found two groups of models (each composed of 5 models) performing better over the Arabian Peninsula in different seasons. In the present study, we further examined the performance of individual CMIP3 models over the Arabian Peninsula for both temperature and precipitation, and categorized them into four categories (discussed earlier). The spatial distribution of temperature and precipitation biases by all models with respect to the CRU observations over the Arabian Peninsula is shown in Figs. 2 and 3. It can be seen that the spatial bias in the temperature and precipitation varies widely among the CMIP3 models. Most of the CMIP3 models simulate warm (cold) bias over the southern (central and northern) parts of the peninsula. The intermodel range for temperature bias between the all models varies within >±7 °C over the Arabian Peninsula (Fig. 2). The MME mean shows a reduction of cold (warm) bias over the central and northwestern (southern and eastern) parts of the peninsula. Similar to the temperature, the spatial pattern of the precipitation bias also varies among the CMIP3 models. The larger portion of the Arabian Peninsula is dominated by the dry bias in most of the models simulations. However, some CMIP3 models (Fig. 3b, d, k, n, o, p, s, t; Table 1) also display wet bias specially over the southeastern parts of the peninsula. Interestingly, the MME reduces remarkably the wet bias over the southern and southeastern parts of the Arabian Peninsula.

Fig. 2
figure 2

Simulated mean temperature bias (in °C) over the Arabian Peninsula by 22 CMIP3 models listed in Table 1. The bias is computed with reference to the CRU observational data set for the present climate (1970–1999). The labels on the upper right corner in each panel correspond to the models listed in Table 1. The last panel shows the multi-model ensemble (MME) mean

Fig. 3
figure 3

Same as Fig. 2 except for precipitation (mm/month)

From the above analysis, it is difficult to assess which of the CMIP3 models perform better in simulation of the temperature and precipitation over the Arabian Peninsula. We further examined the domain averaged temperature and precipitation biases over the peninsula using both CRU data and surface observations (Fig. 4). For brevity, the results with CRU data are not shown here. It can be seen that the models performing better for temperature are not necessarily well suited for precipitation (Fig. 4a, c). Furthermore, the variability in the temperature and precipitation bias also varies largely among different models. Based on the above analysis (Figs. 2, 3, 4), the CMIP3 models are grouped into four categories. The first category, all, includes all 22 CMIP3 models irrespective of their performance over the Arabian Peninsula.

Fig. 4
figure 4

Mean bias of the CMIP3 models (listed in Table 1) with respect to the station observation a temperature (unit: °C) and b precipitation (unit: mm/month). The CMIP3 models data are extracted at observation coordinates and averaged over 27 locations. The colored lines (orange/grey) show multi-model ±1 standard deviation. The multi-model ensemble mean bias (MMEMB) is shown by the dashed line. The red (blue) colors are adopted to indicate the warm (dry) and cold (wet) CMIP3 models over the Arabian Peninsula

The second category of models named as temperature-based models. The temperature bias simulated by these models domain lies within ±1 intermodel standard deviation. These models simulated lowest SysE, NRMSE, and SE with reference to observed surface air temperature (Table 2). Results show that temperature values estimated by most of the CMIP3 models follow the annual cycle except BCC-CM1 (Fig. 5). In addition, we also applied pattern matching technique for the annual cycle to examine the performance of CMIP3 models over the Arabian Peninsula. The monthly temperature simulated by temperature-based models closely following the pattern of the observations. It is found that 7 models out of 22 follow temperature trends that are closer to the observations. These models are referred as temperature-based models and are shown with superscript T in Table 1.

Table 2 CMIP3 models simulated systematic error (SyE), normalized root-mean-square error (NRMSE), and standard error (SE) with reference to the observed temperature
Fig. 5
figure 5

Annual cycle of temperature (unit: °C) obtained from all 22 CMIP3 models and compared with station observations averaged over the period 1970–1999. The CMIP3 models’ data are extracted at nearest grid point of observation coordinate and averaged over 27 locations. The observation is shown by black dashed line

The third category of models is referred as precipitation-based models. These are the models which display the precipitation bias within ±1 intermodel standard deviation. In addition to low precipitation bias, the third category of models also reveals lowest SysE, NRMSE, and SE with reference to observed precipitation (Table 3). The annual cycle of the simulated precipitation reveals that most of the CMIP3 models overestimate the rainfall during the dry (June to September) season, while they underestimate rainfall during the wet season (November–April) (Fig. 6). Simulations of precipitation annual cycle reveals that only few models are closer to the observations (Fig. 6). By applying the pattern matching technique and a careful examination of simulated precipitation biases and errors analysis reveals that only five models out of 22 are suitable for precipitation analysis over the Arabian Peninsula. This group of model is referred to as precipitation-based model and is represented by superscript P in Table 1. The fourth category of the CMIP3 models is named as the best performing models. These are the models which display lowest precipitation and temperature bias over the Arabian Peninsula. The best performing models are filtered out from second and third categories based on their individual performance in simulations of temperature and precipitation biases as well as the annual cycle, SyE, NRMSE, SE, etc. (Tables 2, 3) over the peninsula. Based on the above analysis, the changes in future signals of temperature and precipitation and associated uncertainty are analyzed only for all and best performing models.

Table 3 CMIP3 models simulated systematic error (SyE), normalized root-mean-square error (NRMSE), and standard error (SE) with reference to the observed precipitation
Fig. 6
figure 6

Annual cycle of monthly rainfall (unit: mm) obtained from all 22 CMIP3 models and compared with observations averaged over 1970–1999. The CMIP3 rainfalls are extracted at nearest grid point of observation coordinate and averaged over 27 locations. The observation is shown by black dashed line

3.2 Changes in temperature and precipitation in the future climate

In a recent study, Almazroui et al. (2016) examined the future climate change over the Arabian Peninsula using CMIP3 models without taking into account performance of the individual CMIP3 models in simulation of present day climate. Our results for the first category (i.e., all models) are almost similar to those presented by Almazroiu et al. (2016) in their Figs. 2 and 5; however, the future climate projections are more pronounced in case of best performing models presented in this study. Figure 7 shows the future changes in temperature over the Arabian Peninsula for three SRES scenarios B1, A1B, and A2. In all cases, the central and northern parts of the Arabian Peninsula display enhanced change in future temperature compared to its surroundings. Projected temperature values over the Arabian Peninsula for three future scenarios B1, A1B, and A2 ranges between ~1 and ~2.9 °C for near future (2021–2050) and ~1.8 to ~4.2 °C for far future (2070–2099) (Fig. 7). Furthermore, it can be seen that the more than 66% of the models temperature in the CMIP3 projections agree in sign of future changes (shown by the dotted areas). Compared to the all models, the best performing models also project similar spatial pattern of future changes in temperature (Fig. 8). However, as mentioned earlier, the future changes in the temperature are more pronounced in case of best performing models compared to the all models. Similar to the all models ensemble, the best performing models also display higher temperature changes in the central and northern parts of the peninsula. For the best performing models, the projected temperature values for three future scenarios B1, A1B, and A2 range from 0.96 to ~2.2 °C for near future (2021–2050) and ~1.9 to ~5.2 °C for far future (2070–2099) (Fig. 8), which is slightly larger than the all models.

Fig. 7
figure 7

Future changes in all (22 CMIP3) multi-model ensemble mean temperature (unit: °C) for near future (2021–2050, left panel) and far future (2070–2099, right panel) with reference to the present climate (1970–1999). Dotted areas indicate regions where at least 66% of CMIP3 models agree on sign of future changes signals of temperature over the Arabian Peninsula. The SRES scenarios are B1 (top), A1B (middle), and A2 (bottom)

Fig. 8
figure 8

Same as Fig. 7 except for the selected best performing models

The projected changes in annual mean precipitation over the Arabian Peninsula show a southeast–northwest dipole like pattern with enhanced (reduced) precipitation over the southeastern (northwestern) parts of the peninsula (Fig. 9). This pattern is almost similar to that shown by Almzaroui et al. (2016) in their Fig. 5, though a time slot for the two studies is slightly different. The projected change in annual precipitation over the Arabian Peninsula ranges to ±20% for the near future (2021–2050) and nearly ±50% for the far future (2070–2099). Similar to future temperature changes, the future precipitation change signals are more intense towards the end of the 21st century. The dotted areas in Fig. 9 show the regions where at least 66% of the CMIP3 models agree in the sign of direction.

Fig. 9
figure 9

Future changes in all (22 CMIP3) models ensemble mean precipitation (in %) for near (2021–2050, left panel) and far (2070–2099, right panel) with reference to the base period (1970–1999). Dotted areas indicate regions where at least 66% of CMIP3 models agree on sign of future changes signals of precipitation over the Arabian Peninsula. The SRES scenarios are B1 (top), A1B (middle), and A2 (bottom)

For best performing models, the MME change also show nearly similar dipole like pattern as shown for the all models ensemble (Fig. 10). However, the future increase in precipitation is more pronounced over the southern parts of Saudi Arabia, United Arab Emirates, and Yemen (e.g., Fig. 10e, f) compared to the all models ensemble which shows a future intensification over eastern parts of Saudi Arabia and adjacent countries of United Arab Emirates and Oman (e.g. Fig. 9e, f). Moreover, the reduction in the future precipitation is also more pronounced in the best performing models over the northwestern parts of Saudi Arabia compared to the all models. Similar to the temperature changes, for the best performing models, the future changes of annual mean precipitation signals are more intense towards the end of the 21st century. Due to its geographical location, the Arabian Peninsula receives very low amount of annual mean precipitation. The main contribution of this annual mean precipitation comes from a few heavy rainfall events (Almazroui et al. 2014). It is not clear from this analysis whether extreme rainfall events will increase over this region in the future climate. Furthermore, to what extent the best performing global model simulations reproduce the observed precipitation extremes over this region. These questions are open for further studies.

Fig. 10
figure 10

Same as Fig. 9 except for the best performing models

3.3 Uncertainty in projections of temperature and precipitation

Figure 11 shows the evolution of the annual mean temperature anomalies from 2001 to 2100 averaged over the Arabian Peninsula. The MME mean with ±1 intermodel standard deviation (STD) and intermodel range is also shown for all (Fig. 11a, c) and best performing (Fig. 11b, d) models. It can be seen that the temperature increases continuously from the 2001 to the end of the 21st century under the B1, A1B, and A2 scenarios, with the strongest warming under A2 at the end of this century. The projected difference in the temperature values among the three scenarios is weak in the early 21st century, and gradually increases with integration of time. The B1 scenario shows lower temperature changes during the all integration period; however, A1B displays slightly higher values compared to B1 till the end of 21st century. After 2070s, the temperature values for A2 are higher compared to both B1 and A1B. It is important to note that the ±1 intermodel STD and intermodel range also show larger spread in case of all models (Fig. 11a, c) compared to the best performing models (Fig. 11b, d). Furthermore, the MME projection of temperature under A1B and A2 scenarios are slightly higher towards the end of 21st century for best performing models compared to the all models (Fig. 11a, b). Based on Mann–Kendall trend test, the increasing trends of future change in temperature for all/best models towards the end of twenty-first century are significant to 99% level for all three scenarios A2, A1B, and B1.

Fig. 11
figure 11

a Annual mean temperature anomalies (unit: °C) averaged over the Arabian Peninsula, relative to 1970–1999 for B1 (green), A1B (blue), and A2 (red) scenarios from the MME mean (solid line) with ±1 intermodel standard deviation (color shading). The ±1 intermodel standard deviation for B1, A1B, and A2 scenarios are shown by pale green, sky blue, and violet red colors. b Same as a except for best performing models. cd is same as a, b except the shading shows the intermodel range (see text for further explanation)

To further analyze the uncertainties in temperature projection, the Box-and-Whisker plot in Fig. 12 shows MME mean temperature with ±1 intermodel STD and intermodel range for near future (2021–2050) and far future (2070–2099) periods. The results show an increase in uncertainty with time under the three emissions scenarios (Fig. 12a, b). By the end of the 21st century, the annual mean temperature is projected to increase by 2.32 ± 2.45, 3.49 ± 2.49, and 3.82 ± 1.47 °C, relative to the present day climate, under the B1, A1B, and A2 scenarios, respectively, while the intermodel ranges are projected to be −3.36 to 6.08, −2.26 to 7.68, and −1.79 to 7.40 °C, respectively (Fig. 12a, b).

Fig. 12
figure 12

Box-and-Whisker plots show the domain averaged temperature change over the Arabian Peninsula for the future periods a 2021–2050 and b 2070–2099 w.r.t. the base period 1970–1999 for all models (red) and best performing models (blue). The band in the box represents the change in MME mean; the bottom (top) of the box represents the MME mean SAT change ±1 intermodel standard deviation. The ends of the Whiskers represent the intermodel range, i.e., multi-model maximum and minimum change w.r.t base period MME mean simulation values

The uncertainties associated with temperature projection decrease remarkably for best performing models compared to the all models (Fig. 12a, b). The results further show an increase in the MME mean temperature for best performing models but a decrease in uncertainty range under the three emissions scenarios. By the end of the 21st century, the annual mean temperature is noted to be increased by 3.85 ± 1.54, 4.91 ± 1.61, and 5.36 ± 1.47 °C, relative to the present climate, under the B1, A1B, and A2 scenarios, respectively, while the intermodel ranges are projected to be 0.84 to 5.96, 1.94 to 7.29, and 2.75 to 7.10 °C, respectively (Fig. 12c, d).

Figure 13 shows the evolution of the annual mean precipitation anomaly (in %) from 2001 to 2100 averaged over the Arabian Peninsula. The MME (solid lines) with ±1 intermodel STD and intermodel range (color shading in Fig. 13a, b) is also shown for all and best performing models. Unlike temperature, the domain averaged precipitation over the Arabian Peninsula does not show any clear increasing trend, but a strong interannual variability can be seen over the entire period of integration. The interannual variability is larger in the all models (Fig. 13a) compared to the best performing models (Fig. 13b). However, the spread associated with intermodel STD and intermodel range is reduced remarkably for the best performing models compared to the all models (Fig. 13c, d).

Fig. 13
figure 13

Same as Fig. 11 except for precipitation (%)

The Box-and-Whisker plot shows that MME mean precipitation increases as a result of global warming in the 21st century, with the greatest increases under A2 (Fig. 14). The uncertainty in precipitation projection increases with integration of time and emissions scenario from lower range to higher range. By the end of the 21st century, the annual mean precipitation seems to be increased by 5.16 ± 30, 10.48 ± 34, and 15.29 ± 43%, relative to the present climate, under the B1, A1B, and A2 scenarios, respectively, for all models, and it is 3.2 ± 25, 1.82 ± 28, and 5.3 ± 32% for best performing models (Fig. 14b). For all models, the intermodel range of projected precipitation varies widely from −94 to 265, −95 to 322, and −95 to 375% for the B1, A1B, and A2 scenarios, respectively, and it is −71 to 175, −74 to 205, and −75 to 235%, respectively, for best performing models (Fig. 14b). Contrary to the all models, the intermodel range for the best performing models shows less scenario dependence (Fig. 14). By the end of 21st century an increase MME mean precipitation can be seen in the A2 scenario for both best performing and all models. In comparison with the temperature, the increasing rate in precipitation during the 21st century is smaller, but with a larger uncertainty in its projection.

Fig. 14
figure 14

Same as Fig. 12 except for precipitation (%). The band in the box represents the change in the MME mean; the bottom (top) of the box represents the MME mean change ±1 intermodel standard deviation. The ends of the whiskers represent the intermodel range, i.e., multi-model maximum and minimum change w.r.t base period MME mean simulation values

Although there is no guarantee that the nature of the present day climate will be continued to the projection period (Reichler and Kim 2008; Reifen and Toumi 2009), however, comparison of model data with the base period observation is commonly used technique for screening good performing models over a particular region. Model projected temperature and precipitation are not free from uncertainties whatever the category they are in, however, corrected (projected plus the bias in base period) values of the parameters may provide reasonable estimates for long-term planning. Figure 15 shows the corrected temperature for the B1, A1B, and A2 scenarios for both all and best performing models. In case of B1 and A2 scenarios, the south and southeastern (northwestern) parts of the Arabian Peninsula are expected to receive higher (lower) temperature in future, whereas for A1B scenario, the wider east–west belt is expected to receive higher annual mean temperature. Similar to temperature, the precipitation also displays homogenous pattern for B1, A1B, and A2 scenarios with more increase in the southern and southeastern sides and less towards the north and northwestern sides. Compared to the projected temperature and precipitation values (Figure S1; Figure S2), the corrected temperature and precipitation (Figs. 15, 16) show nearly similar spatial distribution with slight differences in magnitude. These results may verify with the newly developed SAUDI-KAU coupled global climate model (Almazroui et al. 2017) simulations with high resolutions in the future study.

Fig. 15
figure 15

MME mean corrected future temperature (units: °C) for all models (first and second columns) and selected best performing models (third and fourth columns)

Fig. 16
figure 16

Same as Fig. 15 except for precipitation (mm/day)

4 Summary and conclusions

In this study, we categorized the 22 CMIP3 models into four groups based on their performance in the present day climate (1970–1999) climate and examined the uncertainties associated with future projection of annual mean surface air temperature and precipitation over the Arabian Peninsula under SRES B1, A1B, and A2 scenarios. Based on our analysis, the CMIP3 models are grouped into the following four categories, namely (1) all models, (2) precipitation-based models, (3) temperature-based models, and (4) best performing models. Out of these four categories, the models in group (1) and (4) are used for the assessment of uncertainties in future projections of annual mean temperature and precipitation over the Arabian Peninsula. The fourth category of models, i.e., the best performing models, revealed the lowest multi-model ensemble (MME) mean bias and the intermodel standard deviation values for surface air temperature and precipitation over the peninsula compared to the first three categories. The best performing models not only reproduce well the annual cycle of temperature and precipitation but also display lowest systematic error, NRMSE, and standard error compared to the observations.

The analysis of future simulations revealed a continuous increase in temperature over the Arabian Peninsula during the 21st century. For the period 2070–2099, the temperature is projected to increase by 2.32 ± 2.45, 3.49 ± 2.49, and 3.28 ± 1.47 °C, relative to the present day climate, under the SRES B1, A1B, and A2 scenarios, respectively, while the intermodel ranges are projected to be from −3.36 to 6.08, −2.26 to 7.68, and −1.79 to 7.40 °C, respectively. The best performing models remarkably reduced the uncertainties in temperature projections where annual mean temperature is increased by 3.85 ± 1.54, 4.91 ± 1.61, and 5.36 ± 1.47 °C, relative to the present climate, under the B1, A1B, and A2 scenarios, respectively; and the intermodel ranges are projected to be from 0.84 to 5.96, 1.94 to 7.29 and 2.75 to 7.10 °C, respectively. Spatially, the annual mean temperature shows a homogeneous warming pattern across the Arabian Peninsula. The warming rate is increasing more from central to the northern parts of the peninsula. Meanwhile, for the period 2070–2099, the annual mean precipitation is projected to increase by 5.16 ± 30, 10.48 ± 34, and 15.29 ± 43%, relative to the present climate, under the B1, A1B, and A2 scenarios, respectively, while the intermodel ranges are projected to be from −94 to 265, −95 to 322, and −95 to 375%, respectively, for all models. The best performing models slightly reduced the precipitation uncertainty band with MME mean projected to increase by 3.2 ± 25, 1.82 ± 28, and 5.3 ± 32%, relative to the present day climate, under the B1, A1B, and A2 scenarios, respectively, while the intermodel ranges are projected to be from −71 to 175, −74 to 205, and −75 to 235%, respectively. Although at a low confidence level (at least 66% of model agree in sign of change), a homogeneous wetting (drying) pattern is projected across southern and eastern (northwestern) parts of the Arabian Peninsula at annual scale, with a larger increase at the end of 21st century.

Although, the projected changes in the temperature and precipitation by the 22 CMIP3 models over the Arabian Peninsula closely resemble to those shown by Almazroui et al. (2016), but the present study further indicates that a better choice of models from the CMIPs (e.g. CMIP3 and CMIP5) could reduce the uncertainty range associated with the future projections of temperature and precipitation over the Arabian Peninsula. Following above results, we plan another study to examine the projected future changes and associated uncertainties in the temperature and precipitation simulated by different categories of CMIP5 models over the Arabian Peninsula. For climate change impact assessments, the potential inputs with reduced uncertainty range in future projections may help the policy makers to address better the climate-related threats in the Arabian Peninsula.