1 Introduction

Most developing countries, like Egypt, suffer from a scarcity of meteorological weather stations over space and time (El Kenawy et al. 2019a). Also, the available observations are uncertain due to the proximity of most weather stations to urban settlements, within airports, and on coasts (Peterson et al. 1998). As a result, more than the available observations are insufficient to study and understand the spatio-temporal changes of most atmospheric parameters; thus, reanalysis datasets are often used instead of observational data to overcome such limitations (El Kenawy et al. 2019b). Many operational centers worldwide have produced, developed, and updated various reanalysis datasets over the last few decades that include a variety of observations. They are freely available (e.g., European Centre for Medium-Range Weather Forecasts (ECMWF), National Centers for Environmental Prediction (NCEP), and the National Aeronautics and Space Administration (NASA)). Accordingly, many studies and applications concerned with climate extremes have used such datasets in different regions around the world, such as ECMWF/ERA-Interim over Egypt (Morsy and El Afandi 2021), ERA-40, JRA-55, and 20CR over China (Zhao et al. 2020), Observational-Reanalysis Hybrid OR above East of Africa (Gebrechorkos et al. 2019), and NASA POWER in Mediterranean region (Rodrigues and Braga 2021). The reanalysis from the fifth-generation ECMWF/ERA5 has decreased climatological temperature biases and enhanced the depiction of inter-annual variability over East Africa (Gleixner et al. 2020). NASA Prediction of Worldwide Energy Resources (NASA/POWER) dataset provided a reasonable performance for the near-surface parameters (except for relative humidity) against in situ observations, and it can be used in case of missing or unavailable observations, particularly for maximum and minimum temperature over Egypt (Aboelkhair et al. 2019). Furthermore, the Climate Forecast System Reanalysis (NCEP/CFSR) dataset for temperature products may perform well in monthly streamflow and evapotranspiration simulations, making it a viable alternative data source for hydrological simulations in certain gauge-deficient regions (Zhang et al. 2020).

Severe weather occurrences have increased in frequency and intensity, resulting in more worldwide catastrophes that significantly affect society and the economy (Zhou et al. 2020; Chikabvumbwa et al. 2022). Globally, extreme warm events increased, and extreme cold events decreased, especially in the Loess Plateau (China), where the trend amplitudes of cold extremes were higher than those of warm extremes (Sun et al. 2016). Also, the frequency of severe hot and cold weather occurrences in Nigeria has increased significantly, and the cold weather events have decreased, where the most notable trends in warm and cold nights are in Guinea and the Sahel regions (Abatan et al. 2016). Furthermore, as the earth warms, cold extremes will decrease while warm extremes increase, making the vulnerability of cold extremes to climate change more remarkable than warm extremes (Wang et al. 2017). Finally, the evaluating of the different available reanalysis datasets in measuring extreme climatology and long-term trends in extreme climate provides valuable information. However, investigating reanalysis datasets in quantifying the temperature extremes (warm and cold) under climate change conditions still needs comprehensive studies. Where most of the previous studies focused on study the changes of heatwave indices and aspects over Egypt (Morsy and El Afandi 2021), Arabian Peninsula (Almazroui et al. 2017), Middle East and North Africa region (Driouech et al. 2020), and Eastern Mediterranean and the Middle East (Lelieveld et al. 2012; Zittis et al. 2016). Therefore, this study aims the following:

  1. 1-

    To identify the performance of multiple gridded reanalysis datasets (ERA5, CFSR, and POWER) for the spatio-temporal distribution of maximum and minimum temperatures during four consecutive decades (1981–2020) over Egypt.

  2. 2-

    To assess the spatio-temporal suitability of these gridded reanalysis datasets to explain the extreme events and reproduce the changes in warm and cold extreme temperatures.

  3. 3-

    To investigate and interpret the decadal and spatial variability, anomaly, and trend for different maximum and minimum temperature indices during the climate period from 1981 to 2020 over Egypt.

2 Study area

Egypt is located in the northeastern part of Africa, with its borders roughly ranging from 22° to 32° N and from 25° and 38° E with a total area of about 1.01 million km2. The Mediterranean Sea bordered it to the north, while the Red Sea bound it to the east (Fig. 1). From the climatological point of view, Egypt has a semi-desert climate in the south and a Mediterranean climate in the north. During the summer, it is hot and dry, while it is mild and rainy in the winter. According to the Köppen climatic classification, Egypt has a hot desert climate. Consequently, the weather is often dry throughout the year (Nashwan 2019).

Fig. 1
figure 1

The geographical distribution and elevation of the selected weather stations in Egypt

3 Dataset and methodology

3.1 Data acquisition

To achieve the main objectives of this study, near-surface (2-m) maximum (Tmax) and minimum (Tmin) temperatures are downloaded from four different data sources. These datasets include observational data at 24 ground-based weather stations covering the different Egyptian climatic zones (Fig. 1), and three high resolution gridded datasets, as described in Table 1. The observed dataset is obtained from the Global Surface Summary of the Day (GSOD) on the NOAA National Centers for Environmental Information (NCEI) website (https://www.ncei.noaa.gov/maps/daily/). The observed data is used to evaluate and investigate the accuracy of the other three gridded reanalysis datasets and to assess their performance and suitability for simulating temperature extremes. The three acquired reanalysis datasets are ECMWF/ERA5 (https://cds.climate.copernicus.eu/), NASA/POWER (https://power.larc.nasa.gov/data-access-viewer/), and NCEP (CFSR: https://rda.ucar.edu/datasets/ds093.0/; CFSv2: https://rda.ucar.edu/datasets/ds094.0/).

Table 1 The main characteristics of the used observation and reanalysis datasets

The obtained hourly ERA5 and 6-h NCEP datasets are converted to daily datasets using the Climate Data Operator (CDO) to obtain a temporal resolution compatible with the observed data and the NASA /POWER dataset.

3.2 Statistical evaluation of gridded reanalysis datasets

Several robust statistical procedures are applied to measure the goodness of fit and accuracy of the ERA5, CFSR, and POWER datasets for Tmax and Tmin estimates and capture their extremes (warm and cold) against observation. The used statistical metrics are as follows:

3.2.1 Coefficient of determination (R2)

Coefficient of determination (R2) is a method of calculating the accuracy of datasets by quantifying the variance in observed data, which is explained by the variation in the estimated data from gridded datasets. It ranges from 0 to 1, where the highest values (closer to 1) show less error fluctuation and a better match, and values greater than 0.5 are acceptable (Pineiro et al. 2008; Moriasi et al. 2007).

$${R}^{2}=1-\frac{\sum_{i=1}^{n}{\left({O}_{i}-{R}_{i}\right)}^{2}}{\sum_{i=1}^{n}{\left({O}_{i}-{O}_{a}\right)}^{2}}$$
(1)

where Oi, Oa, and Ri are the observed, observed average, and reanalysis values, and n is the number of accessible observations.

3.2.2 Willmott index of agreement (WI)

Willmott index of agreement (WI) is the standardized measure of gridded reanalysis datasets error ranging from 0 to 1. The number of 1 represents the complete agreement, and the value of 0 shows no agreement at all (Willmott 1981; Abba et al. 2020).

$$\mathrm{WI}=1-\frac{\sum_{i=1}^{n}{\left({O}_{i}-{R}_{i}\right)}^{2}}{\sum_{i=1}^{n}(\left|{R}_{i}-{O}_{a}\right|-{\left|{O}_{i}-{O}_{a}\right|)}^{2}}$$
(2)

3.2.3 Mean bias error (MBE)

Mean bias error (MBE) is a measure that shows if the gridded reanalysis dataset is overestimated or underestimated by comparing it with the actual observations in terms of systematic error (Willmott and Matsuura 2005; Singh et al. 2013).

$$\mathrm{MBE}=\frac{\sum_{i=1}^{n}({R}_{i}-{O}_{i})}{n}$$
(3)

3.2.4 Root mean square error (RMSE)

Root mean square error (RMSE) is a measure of the mean standard deviation of gridded reanalysis datasets compared the observed values (Gauch et al. 2003; Bai et al. 2010).

$$\mathrm{RMSE}=\sqrt{\frac{{\sum }_{i=1}^{n}{\left({R}_{i}-{O}_{i}\right)}^{2}}{n}}$$
(4)

3.2.5 Mean absolute error (MAE)

Mean absolute error (MAE) gives equal weight to all individual deviations for the gridded reanalysis datasets compared to the actual observations throughout the test sample (Chai and Draxler 2014; Willmott and Matsuura 2005)

$$\mathrm{MAE}=\frac{1}{n}\times \sum_{i=1}^{n}\left|{O}_{i}-{R}_{i}\right|$$
(5)

3.2.6 Mean percentage error (MPE %)

Mean percentage error (MPE %), a positive MPE value indicates that the gridded reanalysis dataset values are underestimated on average. In contrast, a negative value indicates that the gridded reanalysis dataset values are overestimated (Ouda et al. 2015).

$$\mathrm{MPE}=\frac{\sum_{i=1}^{n}\left(\frac{\left({O}_{i}-{R}_{i}\right)}{{O}_{i}}\times 100\right)}{n}$$
(6)

3.3 Temperature indices, variability, and Mann–Kendall test

The climate data operator (CDO) tool is used to compute several indices that describe the changes in temperature extremes including monthly maximum value of daily Tmax (TXx °C), monthly minimum value of daily Tmax (TXn °C), monthly maximum value of daily Tmin (TNx °C), monthly minimum value of daily Tmin (TNn °C), and the coefficient of variation (COV) for these indices (TXx, TXn, TNx, and TNn). Also, the Grid Analysis and Display System (GrADS) software is used to calculate decadal average and anomaly of TXx, TXn, TNx, and TNn. Finally, the modified version of non-parametric Mann–Kendall test package (modifiedmk) in R software (Patakamuri et al. 2020) is used to perform the Mann–Kendall trend test of Pre-Whitened (pwmk) time series data in the presence of serial correlation using Yue and Wang (2002) approach. The pwmk test is employed to assess the existence of monotonic increasing or decreasing trends and the significant serial correlation (Tau) in the computed indices (TXx, TXn, TNx, and TNn). The performed sequential steps for Tau rank correlation are documented by Kendall (1970), where Kendall’s Tau varies between − 1 for decreasing trend and + 1 for increasing trend.

4 Results and discussion

4.1 Statistical evaluation

The statistical metrics of R2, WI, RMSE, MAE, MBE, and MPE are used to evaluate the daily Tmax (°C) and Tmin (°C) values from ERA5, CFSR, and POWER datasets against the observed values at the selected 24 metrological stations.

Figure 2 shows the geostatistical and spatial distribution of R2, MBE, and RMSE between the estimated Tmax (°C) and Tmin (°C) values from the three gridded reanalysis datasets (ERA5, CFSR, and POWER) and observed values. To identify the goodness of fit and accuracy of the selected three datasets, the proportional symbol circle maps for R2, MBE, and RMSE as geostatistical and spatial distribution maps covering all stations over Egypt are created and performed using ArcGIS Pro2.8.0 software.

Fig. 2
figure 2

Geostatistical and spatial distribution of Tmax (left panel) and Tmin (right panel) evaluation

As shown in Fig. 2a and b, both Tmax (°C) and Tmin (°C) have R2 higher than 0.80, suggesting that more than 80% of the observed values are explained or estimated by the three gridded reanalysis dataset (ERA5, CFSR, and POWER). R2 value for Tmax ranges from 0.93 to 0.98 °C in ERA5, from 0.90 to 0.98 °C in POWER, and from 0.89 to 0.97 °C in CFSR, while for Tmin, it ranges from 0.88 to 0.96 °C in ERA5, from 0.86 to 0.96 °C in POWER, and from 0.83 to 0.93 °C in CFSR. These results indicate that the ERA5 dataset can further explain the observed values of Tmax and Tmin followed by POWER and CFSR. As shown in Fig. 2c, Dakhla, Kharga, and Bahria stations from the three gridded datasets have the highest MBE while Shalateen, Aswan, Asyut, and Minya stations have the lowest MBE for the Tmax. For Tmin (Fig. 2d), the highest MBE values from the three gridded datasets for Tmin are found at Cairo, Aswan, and Ras Sidr while the lowest MBE values are found at Farafra, Dabaa, and Minya. On the other hand, the geospatial distribution of RMSE from the three gridded datasets for Tmax and Tmin are shown in Fig. 2 e and f respectively. The highest RMSE values for Tmax are detected at Dakhla, Kharga, and Bahariya stations, and the lowest RMSE values are found at Shalateen, Aswan, Asyut, and Minya stations. Additionally, the highest RMSE values are found at Cairo, Aswan, and Ras Sidr, while the lowest RMSE values are found at Farafra, Dabaa, and Minya. It is also noticed that the geospatial distribution and values of RMSE closely match the geospatial distribution and values of MBE.

Also, WI, MAE, and MPE (not shown) are also utilized to assess the suitability and accuracy of the three gridded datasets. The WI values vary from 0.94 to 0.99 °C in the three gridded datasets for Tmax, while for Tmin, it ranges from 0.94 to 0.99 °C in the three gridded datasets except at Aswan, Cairo, Luxor, Marsa Alam, and Ras Sidr in the CFSR. Regarding MAE values for Tmax from ERA5, POWER, and CFSR are 3.58, 2.75, and 2.44 °C respectively, while its values for Tmin are 3.86, 2.74, and 2.23 °C for CFSR, POWER, and ERA5 respectively. Furthermore, the maximum MPE for Tmax is 8% in CFSR, 8.8% in POWER, and 11.68% in ERA5, while the maximum MPE for Tmin is 13.7% in ERA5, 14.5% in POWER, and 15% in CFSR. These results show that the three gridded datasets have goodness of fit, considerable agreement, and accurate estimation for Tmax and Tmin compared to the observed data along the study climate period (1981–2020) over Egypt.

4.2 Investigation of extremes

Due to the noticed slight differences in the accuracy of the three gridded datasets for estimating Tmax and Tmin values, their performance to simulate and capture the extreme warm and cold temperatures should be investigated to be used appropriately in the discussion and analysis of extremes.

Table 2 shows the differences for the highest Tmax values (heat wave) between the observed and estimated values from the three grid datasets at five stations (Aswan, Asyut, Cairo, Arish, and Matruh) in Egypt. Nine Tmax extreme events (cases) were detected throughout the study period (1981–2020) that agreed to occur at all stations, including five in June, three in July, and one in August. These nine events are selected as each event has no less than 3 consecutive days with Tmax greater than or equal to 30 °C at the five stations. For extreme warm of Tmax, the highest positive difference is 3.95 °C at Cairo in CFSR and the lowest positive difference is 0.09 °C at Aswan in ERA5, while the highest negative difference is − 3.91 °C at Arish in POWER, and the lowest negative difference is − 0.01 °C at Cairo in POWER.

Table 2 The differences (°C) between the observed and estimated highest Tmax values

Table 3 shows the differences for the lowest Tmin (cold wave) between the observed and estimated values from the three grid datasets at the same five stations in Egypt. It is noticed that there are 12 extreme cold events that have no less than 3 consecutive days with Tmin below 10 °C during the study period (1981–2020) including ten cases in January and two in February. For the extreme cold of Tmin, the CFSR in Arish station has the greatest positive difference (3.98 °C), and the CFSR in Cairo has the lowest positive difference (0.87 °C), while the CFSR in Aswan has the greatest negative difference (− 3.93 °C) and the POWER in Matruh has the lowest negative difference (− 0.20 °C).

Table 3 The differences (°C) between the observed and estimated lowest Tmin values

For both Tmax and Tmin, the maximum positive and negative differences (errors) range from − 4 to 4 °C, revealing that the three gridded datasets can be properly used to discuss and analyze extreme warm and cold temperatures.

4.3 Extreme temperature variability

The decadal average, anomaly, variability, and trend of extreme Tmax indices (TXx, TNx) and extreme Tmin indices (TXn, TNn) computed from the three gridded datasets (CFSR, ERA5, and POWER) are discussed and analyzed in this section.

4.3.1 Average and anomaly of temperature indices

Figure 3 illustrates the spatial distribution of the decadal average of TXx computed from the three gridded datasets. The area mean of the decadal average of TXx from CFSR dataset is the greatest (44.7 °C) in comparison to POWER (43.6 °C) and ERA5 (42.3 °C). The TXx values from the three gridded datasets varied between 35 and 48 °C throughout the four decades (1981–2020). The lowest values of TXx are found in the northern coastal zone, southern Sinai, and southwestern Egypt due to the influence of the Mediterranean Sea, Mount Saint Catherine, and Mount Uweinat respectively.

Fig. 3
figure 3

Spatial distribution of the decadal average and decadal anomaly of TXx

The influence of the Mediterranean climate extends southward to about 26° N (lowest TXx), which agrees with Aboelkhair et al. 2019. The region south of 26° N has the highest values of TXx which gradually increases from 40 to 48 °C over southeast Egypt. The values of TXx vary throughout the four decades, progressively increasing from the first (− 2: + 0.25 °C) to the fourth (− 0.25: + 2 °C) decade as shown by the decadal anomaly in Fig. 3. The decadal anomaly of TXx during the first decade over most of Egypt is negative (< 0 °C), which means that the first decade contains TXx values less than the climatological values. The decadal anomaly of TXx increases (− 0.5: + 1 °C) in the second decade and reaches about + 1.5 °C in the third decade. The largest decadal anomaly of TXx (> + 2 °C) is detected during the fourth, especially over the Nile Delta in north.

Figure 4 illustrates the decadal average and decadal anomaly of TXn values during the study period (1981–2020), demonstrating that the TXn values from the three gridded datasets ranged from 5 to 20 °C. The area mean of the decadal average of TXn over Egypt is 14.2 °C, 13.7 °C, and 13.4 °C from POWER, CFSR, and ERA5 respectively. TXn values are below 10 °C in Sinai, the Mediterranean coast and southwestern Egypt, while it exceeds 20 °C in southeastern Egypt. The TXn values along the Red Sea coast exceed 15 °C due to the influence of the Red Sea Mountain ranges. The TXn values are lowest in the second decade compared to the other three decades.

Fig. 4
figure 4

Spatial distribution of the decadal average and decadal anomaly of TXn

From the climatic point of view, TXn values are low over the Western Desert during the first decade and over the Eastern Desert and Sinai during the second decade, while the last two decades have high TXn values as indicated by the decadal anomaly in Fig. 4. The TXn in eastern Egypt in the first decade is greater than in the second decade by about 2 °C, where its decadal anomalies are about + 1 °C and − 1 °C in the first and second decades, respectively. The third decade has the greatest TXn values compared to the other three decades with positive anomaly (+ 0.5 to + 1.5 °C) except for some negative anomaly (− 0.25 °C) in northern Egypt.

Figure 5 shows the decadal average and decadal anomaly of TNx, where the decadal average ranges between 20 and 30 °C. The decadal average of TNx increased steadily from the first decade to the last decade. The decadal anomaly of TNx indicates that the first decade has the highest negative anomaly values, reaching − 2 in both ERA5 and POWER. At the same time, the CFSR data showed anomaly of up to + 0.5 °C in small and sparse locations. During the second decade, the decadal anomaly of TNx gradually increased and reached more than + 1.5 °C in CFSR and + 0.75 in both POWER and ERA5. The TNx values continue to increase during the third decade as the positive anomalies (+ 0.5) cover larger areas than in the previous two decades. The highest anomalies (> + 1.5 °C) during the fourth decade from the three gridded datasets except the Eastern Desert and South Sinai (− 2 °C) from the CFSR dataset. Overall, a comparison between the first and fourth decades revealed an increase of about 2 °C in TNx values in most of Egypt during the study period.

Fig. 5
figure 5

Spatial distribution of the decadal average and decadal anomaly of TNx

The geographical distribution of the decadal average and its associated anomaly shown in Fig. 6 reveals that CFSR produces the lowest TNn values (− 5 °C) particularly along the Nile River compared to both EAR5 and POWER. Moreover, the highest values of TNn are found on the coasts of the Mediterranean and the Red Sea, exceeding 5 °C in the three gridded datasets. The ERA5 showed the largest values of TNn along the river (2 – 5 °C) compared to POWER (1 – 4 °C) and CFSR (below − 1 °C). The POWER has the highest area mean value of TNn (2.5 °C) followed by ERA5 (2 °C) and CFSR (0.8 °C).

Fig. 6
figure 6

Spatial distribution of the decadal average and decadal anomaly of TNn

In addition, the study area covered by the negative (positive) values of TNn gradually decreases (increases) from the first to the fourth decade according to the three gridded datasets, indicating that TNn increased throughout the study period. The decadal anomaly of TNn in northern and eastern Egypt did not exceed + 0.5 °C during the first decade with prevailing values ranging from + 0.5 to − 0.5 °C. While the decadal anomaly of TNn increased to more than + 1 °C in ERA5 and + 0.5 °C in both CFSR and POWER during the second decade. The positive decadal anomaly of TNn exceeds + 1 °C in the third decade especially in the ERA5 data with some negative anomaly values in both CFSR and ERA5. The decadal anomalies increased by + 2 °C in the fourth decade compared to the first decade in central and southern Egypt, although ERA5 and CFSR had the lowest anomalies in northwestern and eastern Egypt respectively. Thus, the decadal anomaly of TNn did not exceed + 0.5 °C in the first decade but increased gradually throughout the decades up to more than + 2 °C in the fourth decade.

4.3.2 Trend and COV of temperature indices

Figure 7 shows the Mann–Kendall rank correlation (Tau) and the coefficient of variation (COV) to illustrate and characterize the TXx trend and variability throughout Egypt. A positive Tau trend dominates in the first decade, with values ranging from − 0.3 to + 0.3 except southern Sinai, Eastern Desert, and southern Egypt that have Tau higher than + 0.3 and exceed + 0.9 in Aswan. The second and third decades are dominated by negative Tau values higher than − 0.6, indicating a significant negative trend, particularly in the Western Desert and the majority of the Nile Valley. Tau has positive values (+ 0.3 to + 0.6) in the second decade in the Nile Delta, southern Sinai, Red Sea coastal region, southwest Egypt, and the Eastern Desert. In the fourth decade, Tau ranges from − 0.3 to + 1 over most of Egypt with higher positive (most significant) Tau in Western Desert, northern Sinai, and northwest Egypt. From the first to the third decade, low COV values (< 2) spread across southern and southwestern Egypt and south Sinai in the fourth decade. While COV values more than 2 up to 5 are common in other areas, particularly in the northern coastal range. The COV values indicate that TXx variability increased in the second decade compared to the first decade, particularly in ERA5, while POWER showed the lowest variability. The geographical distribution of COV values is negatively correlated with Tau values, where their correlation ranges between − 0.3 and − 0.5 during the first two decades, while it ranges from − 0.1 to − 0.25 during the last two decades in the three gridded datasets.

Fig. 7
figure 7

Geographical distribution of the decadal Mann–Kendall rank correlation (Tau) and decadal coefficient of variation (COV) for TXx

Figure 8 shows the values of the spatial distribution of the decadal Tau and COV for TXn, where the first decade indicates a positive Tau throughout the study region except the Western Desert in the CFSR data and some scattered locations in northern Egypt in the ERA5. The three gridded datasets produce positive Tau values higher than + 0.3 over the Red Sea coast and southeast Egypt.

Fig. 8
figure 8

Geographical distribution of the decadal Mann–Kendall rank correlation (Tau) and decadal coefficient of variation (COV) for TXn

The second decade is marked by a general positive Tau (> 0) except over southern Nile Valley in CFSR. The three gridded datasets almost have the highest positive Tau values (> + 0.3) throughout the third decade. It is noticed that the negative Tau values increased in the fourth decade, especially in the Western Desert, to reach less than − 0.3 near the Libyan borders, where ERA5 has the greatest Tau values and POWER has the lowest Tau. Moreover, the lowest COV values (< 10) covered large area in the first and fourth decades, and the highest COV values (> 10) covered the large area in the second and third decades, where the highest variability (COV > 20) appears in the eastern delta, Sinai, and sometimes in southwestern Egypt. Sometimes sites with high COV values correspond to places with low Tau values of TXn where the correlation is negative (− 0.04 to − 0.13) during the first decade and positive (0.04 to 0.5) during the other three decades.

The spatial distribution of decadal Tau and COV for TNx is shown in Fig. 9, where the Tau values ranging from − 0.6 to + 0.9 during the selected four decades. Furthermore, the significant negative trend (Tau >  − 0.6) is found in a few regions like in the western borders from CFSR and Eastern Desert from POWER. Also, the CFSR exhibit positive Tau values (> + 0.3) and increases to + 0.9 in specific areas (middle of the Western Desert and southeast Egypt), and it rarely exceeds + 0.6 in both EAR5 and POWER datasets. During the second decade, the areas covered by the negative Tau values (< − 0.3) increased over separate locations in the Western Desert, while the positive Tau (+ 0.3 to + 0.6) still dominates most of Egypt in POWER and ERA5. The negative Tau (< 0) values of TNx in the third decade covered the largest area, while the significant positive Tau (> + 0.3) dominated most of Egypt in the fourth decade with small scattered negative Tau values (> − 0.3). The CFSR is producing higher COV values for TNx Than both ERA5 and POWER during the study period. Furthermore, the spatial distribution of the decadal COV of TNx is negatively correlated with its corresponding decadal Tau of TNx during the first three decades which ranges from − 0.01 to − 0.5, while the fourth decade has a positive correlation (0.04 to 0.2).

Fig. 9
figure 9

Geographical distribution of the decadal Mann–Kendall rank correlation (Tau) and decadal coefficient of variation (COV) for TNx

For the TNn, the geographical distribution of the decadal Tau and COV are shown in Fig. 10. The highest negative Tau values of TNn (> − 0.6) in the first decade is detected in CFSR at different locations in Sinai and Eastern Desert. While the positive Tau values (> + 0.3) are mainly found in the Western Desert and northern part of the Eastern Desert. The positive Tau values for TNn dominate in EAR5 followed by POWER, where the highest positive Tau values (significant trend) ranges from + 0.3 to + 0.9 are detected in northern Egypt and dispersed regions along the Red Sea coast. In the second decade, the negative Tau values of TNn appears in a small area in north of Egypt and gradually increase to record the highest positive Tau of TNn (+ 0.9) in southern Egypt in the CFSR.

Fig. 10
figure 10

Geographical distribution of the decadal Mann–Kendall rank correlation (Tau) and decadal coefficient of variation (COV) for TNn

In the third decade, the most negative Tau values of TNn (− 0.3 to − 0.6) are found in Sinai, while the significant trend (Tau >  + 0.6) appears in the Eastern Desert in EAR5 and POWER and in the Western Desert in CFSR. In the fourth decade, most of Egypt has negative Tau values of TNn which never occurred in any of the first three decades, where the positive Tau values of TNn are only found in Sinai, southeast Egypt, and sometimes northern Egypt as in POWER. The decadal COV values of TNn less than 5 dominate over the Mediterranean and Red Sea coasts, while COV values of 6–12 dominate over Western Desert and Sinai in the three gridded datasets during all decades.

Additionally, it is detected that the variability of TNn is higher in CFSR than in both POWER and ERA5. Finally, the correlation between the spatial distribution of decadal COV and Tau of TNn overall decades is mostly negative and ranges from − 0.6 to + 0.3. The findings for the selected indices (TXx, TXn, TNx, and TNn) are in great agreement with the results of several studies of extreme warm and cold events worldwide (e.g., Sun et al. 2016; Wang et al. 2017).

5 Conclusions

Daily data for maximum (Tmax) and minimum (Tmin) temperatures were downloaded from three gridded reanalysis datasets (ERA5, CFSR, and POWER) for four consecutive decades (1981–2020) to study the variability of extreme warm and cold events over Egypt. The Tmax and Tmin of the three reanalysis datasets were evaluated against the observations at 24 in-site stations and showed a good agreement and can be employed appropriately to interpret extreme warm and cold temperature events in Egypt. Furthermore, the monthly maximum value of daily Tmax (TXx), monthly minimum value of daily Tmax (TXn), monthly maximum value of daily Tmin (TNx), and monthly minimum value of daily Tmin (TNn) were calculated using the climate data operator (CDO) tool to investigate the variability of extremes over Egypt.

The main findings of this study can be summarized as follows:

  • The area mean of the decadal average of TXx from the CFSR dataset (44.7 °C) is higher than both POWER (43.6 °C) and ERA5 (42.3 °C).

  • The decadal anomaly of TXx increased to + 1, + 1.5, and more than + 2 °C in the second, third, and fourth decades respectively.

  • The Mann–Kendall rank correlation (Tau) for TXx is mostly positive in the first and fourth decades and mostly negative in the second and third decades.

  • The lowest coefficient of variation (COV) for TXx is less than 2 and detected in southern and southwestern Egypt and sometimes in Sinai, and ranges from 1 to 5 over Egypt across all decades.

  • The area mean of the decadal average of TXn from POWER (14.2 °C) is greater than both CFSR (13.7 °C) and ERA5 (13.4 °C).

  • The third decade has the greatest TXn values with a positive anomaly between + 0.5 and + 1.5 °C except for northern Egypt (− 0.25 °C).

  • All decades almost have positive Tau values of TXn, but the third decade has a significant positive trend (Tau >  + 0.6), and the fourth decade has a significant negative trend (Tau <  − 0.3).

  • The first and fourth decades have the lowest COV values (< 10) for TXn over most Egypt, while the second and third decades have the highest COV values (> 10) over most Egypt.

  • The area mean of the decadal average of TNx is the greatest (27.5 °C) in ERA5 as compared to both POWER (27 °C) and CFSR (26 °C).

  • The highest negative decadal anomaly of TNx (− 2 °C) is detected in first decade and increased gradually up to the highest positive anomaly in the last decade (+ 2 °C) in most of Egypt.

  • Tau values for TNx from the three gridded datasets indicate that the significant positive trends (> + 0.3) are detected in the fourth decade followed by the first decade, while the significant negative trends (< − 0.3) are found in the third decade followed by the second decade.

  • CFSR produces higher COV values for TNx (> 7) than both ERA5 and POWER throughout the selected four decades over Egypt.

  • The area mean value of the decadal average of TNn from CFSR (0.8 °C) is lower than both EAR5 (2 °C) and POWER (2.5 °C).

  • The decadal anomaly was less than + 0.5 °C in the first decade but exceed + 2 °C in the fourth decade, indicating that TNn increased gradually throughout the study period (1981–2020).

  • The significant positive trend of TNn (Tau >  + 0.3) appears during the third decade followed by the second decade, while the significant negative trend (Tau <  − 0.3) occurs during the fourth decade.

  • The lowest decadal COV values of TNn (< 5) are detected over the Mediterranean and Red Sea coasts, while the highest COV values (6–12) dominate most of the Western Desert and Sinai.

  • Generally, the COV and Tau of all indices over Egypt are often negatively correlated (> − 0.5).

  • Finally, one can conclude that the significant increase in the trend of extreme warm and the slight decrease in the trend of extreme cold are due to the climatic increase in both mean and variance of temperature over Egypt.