1 Introduction

Drought is a transitory, prolonged anomaly characterized by a period of below-normal precipitation values in a specific area. According to the American Meteorological Society (1997), drought can be classified into four interrelated categories over time (Tian et al. 2020; Yihdego et al. 2019). First, the initial cause of any drought is a shortage of precipitation (meteorological drought), leading to soil water deficiency that impacts crops when it occurs in the growing season (agricultural drought). After a prolonged period of no rainfall, there is a shortage of stream flow, groundwater recharge and reservoir volume (hydrological drought). Finally, the water resources needed to supply existing demand are affected (socioeconomic drought). Flash drought is a critical subseasonal phenomenon that can be devastating for ecosystems and, consequently, for the economy and health in general (Christian et al. 2021).

Drought impacts are often assessed using drought indices that quantify the degree of drought hazard based on climate data (Vicente-Serrano et al. 2012). It is difficult to objectively quantify the characteristics of drought episodes in terms of intensity, magnitude, duration and spatial extent (Tian et al. 2020; Vicente-Serrano et al. 2012), and previous studies have shown that there is no single drought index with the ability to comprehensively assess water stress conditions in any type of bioclimatic variant (Cai et al. 2023; Yihdego et al. 2019). Even so, we can classify the indices into different groups: climate-based indices for assessing meteorological drought; remote sensing-based indices for vegetation analysis and soil-moisture-based indices mainly for assessing agricultural drought; hydrological-based indices for controlling stream flow alterations; and mixed-based indices, as a combination of the previous indices, since the different types of drought are interconnected (Abdourahamane et al. 2022). It has been shown that combining multiple variables and indices can improve drought characterizations (Liu et al. 2020). In either case, the calculation of drought indices requires the availability of hydrometeorological, vegetation cover, crop productivity and flow regime data, depending on the type of drought to be monitored and the impacts to be assessed (Abdourahamane et al. 2022). These data come from in situ data-recording stations (weather stations and gauges) and from remote sensing data, which complement the previous data by increasing their spatiotemporal resolution and can be used in combination with in situ data or alone in regions with a lack of in situ data (Rhee et al. 2010; Xie and Fan 2021).

Within the climate-based group, several indices have been defined over the years, such as the Palmer drought severity index (PDSI) proposed by Palmer (1965), based on a soil water–balance equation. Subsequently, the widely used standardized precipitation index (SPI) (McKee et al. 1993), based on a probabilistic approach to precipitation, was defined. Finally, the standardized precipitation evapotranspiration index (SPEI) proposed by Vicente-Serrano et al. (2010) is a multiscale drought index based on climatic data. It can be used to determine the onset, duration, and magnitude of drought conditions relative to normal conditions.

In recent years, many studies have examined the relationship between different drought indices to identify the impacts of drought from different perspectives. For this purpose, the correlation between different climate-based drought indices (mostly PDSI, SPI, SPEI) and hydrological-, vegetation-, or agricultural productivity-based indices are usually analyzed (Abdourahamane et al. 2022; Alkaraki and Hazaymeh 2023; Cai et al. 2023; Kogan 1995b; Liu et al. 2020; Potop 2011; Rhee et al. 2010).

Nonetheless, with the integration of remote sensing data, new opportunities for drought monitoring have emerged by combining climate-based drought indices and satellite data with other biophysical parameters to determine negative impacts on vegetation (Yihdego et al. 2019). Regarding satellite data sources, remote sensing-based indices are frequently calculated using data from the Moderate Resolution Imaging Spectroradiometer Satellite (MODIS), mostly focused on products derived from the normalized difference vegetation index (NDVI) (Rouse et al. 1973), the most widely used index worldwide (Abdourahamane et al. 2022; Alkaraki and Hazaymeh 2023; Cai et al. 2023; Gu et al. 2007; Liu et al. 2020; Orimoloye et al. 2021; Rhee et al. 2010; Xie and Fan 2021). The MODIS instrument captures images of the earth every day or two. The extensive data products derived from MODIS observations describe characteristics of the land, oceans, and atmosphere that can be used to study both local and global scale phenomena and trends.

The information provided by the NDVI and its relationship with drought can be used directly or through other derived indices, such as the vegetation condition index (VCI) (Kogan 1995b) based on the NDVI or the vegetation health index (VHI) (Kogan 1997), which is a mixed index combining the VCI with the temperature condition index (TCI) (Kogan 1995a). These products have been widely applied for detecting and monitoring drought since they are recommended by the World Meteorological Organization (WMO) (Xie and Fan 2021).

Another option is to work directly with the NDVI value. Recent studies such as Vicente-Serrano et al. (2019) have addressed the relationship between the NDVI and drought events. They concluded that although vegetation activity over large parts of Spain is closely related to the interannual variability of drought, there are clear seasonal differences in the response of the NDVI to drought. Thus, apart from the NDVI, there are other spectral indices that are useful for determining the negative effects of drought on vegetation. These are the dry matter productivity (DMP) (Monteith 1972), which represents the overall growth rate or increase in dry biomass of vegetation and is directly related to the net primary productivity of the ecosystem, and the fraction of absorbed photosynthetically active radiation (FAPAR), which quantifies the fraction of solar radiation absorbed by living leaves for photosynthetic activity. Additionally, the fraction of vegetation cover (Fcover), which refers only to the green and living elements of the canopy, corresponds to the fraction of soil covered by green vegetation. In practice, it quantifies the spatial extent of vegetation. Finally, the leaf area index (LAI) is defined as half the total area of green canopy elements per unit of horizontal soil surface. Some of these indices have been previously correlated with meteorological indices, such as the FAPAR (Gobron et al. 2005; Rossi et al. 2008;).

It should be noted that not all vegetation types behave in the same way during drought episodes. Moreover, there are few studies assessing the effects of drought by season, although seasonality and prevailing time scales can be relevant to assess drought impacts on different vegetation types (Vicente-Serrano et al. 2013). Therefore, a detailed study by vegetation type and season is needed to understand in detail the sometimes devastating effects of drought. If a spatial analysis can also be performed by grouping areas with the same performance, concise actions for economic aid or more focused actions for preventing the effects of climate change could be envisaged. This type of analysis would have important implications for characterizing priority intervention areas and informed decision-making to drive mitigation and adaptation efforts (Khoshnazar et al. 2023).

The main purpose of this work is to test the relationship, both temporally and spatially, between meteorological and remote sensing indicators in a drought event in southwestern Spain. To this end, we seek to answer the following questions: Is there a correlation between the indicators? Does the correlation depend on vegetation type? Does the correlation depend on season? Can the correlation be spatially modeled?

2 Materials and method

2.1 Study area

The study area is the Extremadura region in western Spain (Fig. 1), which is covered mainly by forest and crops. Forested areas comprise evergreen forests, deciduous forests, shrubs, grasslands and dehesas. The dehesa is a typical cover of Extremadura and the neighboring border region with Portugal (where it is called montado) defined by Devesa Alcaraz (1995) as “pasturelands populated by holm and/or cork oaks, with an understory of open grassland, cereal crops, or Mediterranean scrub.”

Fig. 1
figure 1

Study area

The average annual precipitation of the area ranges from 400–1300 mm and is mainly concentrated in October–April, while during the summer, there is close to 0 mm of precipitation.

2.2 Indices

There are two different types of vegetation indices: radiometric indices, which are spectral transformations of two or more bands, and biophysical indices, which are based on radiative transfer models related to robust assumptions, mainly regarding canopy architecture (Huete et al. 2002).

The vegetation indices used in this work were downloaded from open access databases. The following vegetation indices from the Copernicus Global Land Service were used:

DMP: A biophysical index computed by the Monteith variant with a specific term that considers potentially important factors, such as the effect of droughts, nutrient deficiencies, pests and plant diseases. However, it can be argued that the adverse effects of disease and water or nutrient scarcity are manifested in the remotely sensed FAPAR term also used in the calculation of DMP.

FAPAR: A biophysical index dependent on the structure of the canopy, the optical properties of the vegetation elements and the lighting conditions (Smets and Lacaze 2019). The European Drought Observatory (European Commission 2023) uses the FAPAR anomaly to detect and monitor the impacts of agricultural drought on vegetation growth and productivity.

FCOVER: A biophysical index that corresponds to the gap fraction for the nadir direction. It is used to separate vegetation and soil in energy balance processes, including temperature and evapotranspiration. It is computed from the LAI and other canopy structural variables and does not depend on variables such as the geometry of illumination, in contrast to the FAPAR (Smets and Lacaze 2019). For estimation, only the green vegetation at both the overstory and understory levels will be considered.

Finally, the LAI serves to calculate the FCOVER and other indices and can therefore be considered a primary biophysical variable related to the canopy structure and is sensitive to droughts or other natural hazards (Delegido et al. 2015).

All indices that quantify green vegetation need to be calculated considering variables such as leaf water content, leaf dry matter content and leaf chlorophyll content (Weiss and Baret 2016), which are directly related to drought and are mainly related to dry or wet conditions.

Finally, the well-known NDVI is a spectral vegetation index that considers the fact that green and vigorous vegetation is reflected more in the NIR band and less in the visible part, making this index sensitive to drought.

The meteorological index SPEI was also obtained from the open database of the meteorological drought monitor of the Spanish scientific research council (Reig et al. 2023). As noted above, this index identifies different drought types and impacts.

The SPEI uses the climatic water balance. It compares the available water with the atmospheric evaporative demand (Beguería et al. 2014). The climatic water balance is calculated at different time scales, and the results are fit to a log-logistic probability distribution.

Following the classification of Zhang et al. (2023), the severity of drought is classified by grading the SPEI as follows: no drought for values above -1, light drought for values between -1.5 and -1, moderate drought for values between -2 and -1.5, and severe drought for values below -2.

On the basis of all these data, the methodology followed is shown in Fig. 2.

Fig. 2
figure 2

Workflow

2.3 Selection of study period and data collection

In terms of meteorological drought, historical precipitation time series were analyzed, and the period from October 2004 to October 2006 was selected. After limiting the study to this period of time, we downloaded all the abovementioned indices from the two databases in raster format and with a 1-km spatial resolution.

The vegetation indices were provided in 10-day periods and the SPEI in four-time steps per month, requiring the prior homogenization of the series to obtain equivalence for each period.

This resulted in a total of 360 rasters for the vegetation indices and 96 rasters for the drought index. The information was then extracted to a regular sample grid of locations regularly distributed throughout the study region. The distance between each position was 10 km, giving a total of 1737 locations to study.

2.4 Time series and correlation

A complete time series was generated with all 792,072 observations to establish correlations and comparisons between the evolution of the different vegetation indices and the SPEI. Using the severity degrees proposed by Zhang et al. (2023), we analyzed only the period with SPEI values below -1.0 (Fig. 2), which corresponds with a complete year from March 2005 to March 2006.

The time series were debugged by cleaning the outliers, such as pixels with digital levels of 255 or other atypical values.

Then, correlations were measured by Pearsons’s statistics and established according to two fundamental characteristics of the vegetation cover. First, a distinction was made between the different seasons of the year. Additionally, the correlations were investigated by vegetation type. For this purpose, three cartographic sources were used:

The ESA_WCM map, published on 24 October 2022 (Zanaga et al. 2022). This is a 10-m resolution cover map that distinguishes 11 categories, although we only worked with those that exclusively refer to the vegetation present in the study area.

The FME (Forest Map of Extremadura) vector map has a scale of 1:25 000 and is an official cartographic source appropriate for the description of the vegetation categories occurring in the study area. As in the previous case, only the vegetation categories present in the study area in considerable proportions were analyzed.

Finally, an ad hoc map was used with vegetation categories grouped according to their hydrological properties in Fragoso-Campón et al. (2021). This map is also in raster format with 10-m resolution that allows distinguishing even between deciduous and evergreen trees, a key aspect in the study of vegetation performance by season.

2.5 Space–time cube analysis

The spatial–temporal relationships between the performance of the SPEI and the vegetation indices were analyzed to verify their spatial pattern and the degree of importance of the vegetation indices in the variation in the SPEI index.

First, to analyze data distributions and patterns in the context of both space and time, spatial–temporal cubes was constructed to analyze the possible clusters with the same temporal variations of all the indices. Then, we performed partitions of time series, stored in the space–time cubes, based on the similarity of time series characteristics. The clusters were calculated separately for each studied index tending to increase and decrease at the same time and then merged to analyze the locations with the same temporal pattern.

Second, a forest-based forecast analysis, using the random forest algorithm, was performed to analyze whether temporal variations in vegetation indices could explain the performance of the SPEI index. Therefore, SPEI was chosen as the predictor variable, and all vegetation indices were chosen as explanatory variables. The forest regression model was trained using 100 trees and one-fourth of the number of time steps as time windows on each location of the space–time cube. The fit of the predicted model with the time series was further measured with the forecast root mean square error (RMSE) (Eq. 1).

$$\mathrm{Forecast}\;\mathrm{RMSE}=\sqrt{\frac{\sum_{t=1}^T{(c_t-r_t)}^2}T}$$
(1)

where T is the number of time steps, ct is the value of the forest model, and rt is the raw value of the time series at time t.

This analysis also involved a study of the lagged effects between the vegetation indices and the SPEI and the importance of the explanatory variables, according to each time lag. Each vegetation index was converted into time lagged predictor within each training time window of the forest model. This allowed estimating any lagged effects between the explanatory variables and SPEI.

3 Results

According to the decay and rise of the time series from March 2005 to March 2006 shown in Fig. 3, there is a similarity of trends between the SPEI index and the vegetation indices.

Fig. 3
figure 3

Average time series and drought severity by season

According to the classification of Zhang et al. (2023), Fig. 3 shows that severe drought was concentrated in summer and autumn, which was higher in the latter season. In the summer and winter seasons, the number of locations and periods with drought were considerable, but their intensity was classified within the light and moderate intervals.

3.1 Index correlations

Figure 4a shows the correlations between the time series. According to ESA_WCM vegetation categories, grassland obtained the highest Pearson correlations values in all the indices, with the FAPAR, LAI and NDVI having the highest correlation with the SPEI index. Furthermore, the highest correlations occurred mainly in autumn, which is when drought severity was greatest, but slight correlations were also found in spring. In contrast, the DMP index had extremely low correlations in any season of the year.

Fig. 4
figure 4figure 4figure 4

Pearson correlation values between indices for the (a) ESA_WCM map, (b) FME map, (c) ad hoc map

Regarding the Pearson correlation values for the FME vegetation categories (Fig. 4b), once again, autumn had the highest correlations, especially for FAPAR, LAI and NDVI. This time, since there is higher discrimination of vegetation categories, in addition to grasslands, good results were also obtained for Shrubland (Retama) but only in autumn, since in spring correlations are very low for all the indices in that category. As in the previous map, the DMP index obtained particularly low correlations in these two categories.

Finally, Fig. 4c shows the ad hoc map categories, and the results are more detailed and lead to better interpretations. The same seasonal trends can be observed in the best-correlation seasons and indices. However, in addition to grasslands, high correlations were also found for dehesas, which are very open woodlands with a large amount of grass between them. Additionally, when distinguishing between evergreen and deciduous trees, the correlations in autumn are completely opposite when leaves are present and absent. This is the case for evergreen and deciduous hardwood forests. Additionally, the high correlation value for deciduous hardwood forests with the DMP is remarkable, which is also totally opposite to the correlation of the rest of the vegetation indices.

3.2 Space–time cube analysis

Figure 4 shows that in more than 500 locations (dark green), all the studied indices have a similar performance (they were classified within the same cluster). In addition, most of the locations seem to be clustered very close together in specific areas, especially in the south and southeast of the region.

In contrast, more than 300 studied locations coincided in only 17% of the indices in terms of similarity of time series performance. In this case, there is considerably more dispersion, although there is a slightly more localized area in the eastern part of the region.

The clusters with the highest coincidence (100 percent of the indices) belong to grassland according to the WCM and the FME (Fig. 5). These results are consistent with the correlation analyses performed, as this is the category with the highest correlations between the vegetation indices studied and the SPEI. Regarding the ad hoc map, the coincidence between spatiotemporal clusters is more evenly distributed among all the categories, with a low coincidence of clusters in the shrub category, which also coincides with it being a category with low Pearson correlation values.

Fig. 5
figure 5

Spatial analysis. Agreement of time series and vegetation types for the ESA_WCM map, FME vegetation map, and ad hoc map

Finally, Fig. 6 shows that at most the indices are lagged by 2 periods of 10 days each and that in the case of the first 10 days of lag, the most important explanatory variables in the model are FAPAR and LAI, closely followed by NDVI.

Fig. 6
figure 6

Importance of variables according to time lags

The RMSE obtained in the forecast was only 0.058 in the SPEI index (5.8% of the SPEI index), which is within the 90% confidence level at which the model estimation was set.

4 Discussion

The time series chosen for the study is a full year (March 2005–March 2006) classified as a drought year according to the SPEI index. This characteristic is unusual since in the study area, despite the occurrence of multiple dry spells, autumn and spring are usually rainy. It also provides a continuous amount of data in time, which made the results more robust. For example, the recent study by Shahzad et al. (2023) examined variations in trends of changes in vegetation and drought indices, and although it used 4 years (1984, 1993, 2007, and 2012), none of the years were fully qualified as drought years according to the normalized vegetation supply water index (NVSWI), which was used to categorize drought severity.

As indicated above, the SPEI can be calculated at different time scales. In this work, we used a 9-month time scale. This scale is very appropriate for the period studied since, according to Vicente-Serrano et al. (2013), unusual severe droughts correspond to long SPEI time scales. This 9-month time scale was found by Zhang et al. (2017) to be the most sensitive to vegetation where grasslands are the more dominant cover.

As shown in Fig. 2, in both spring and autumn, the lines representing the average time trend of each index have a notable geometric similarity. This occurs if we compare the SPEI index with the FAPAR, NDVI, and FCOVER indices, and in contrast, we can see that the DMP index does not have this geometric similarity. These similarities are demonstrated by the generally better correlations of the autumn and spring seasons with almost all types of vegetation. This better correlation, especially in autumn, is closely linked to the fact that drought was most severe in this season than in the rest of the year. As is normal, the trends of the vegetation indices decrease in spring and increase in autumn, and we analyzed the degree of similarity of this decrease or increase with that of the drought index. Analyzing Fig. 2 again, it can be seen that when there is no drought (in autumn 2004 or spring 2006), the trend of the SPEI index has no relationship with that of the rest of the vegetation indices.

Regarding the vegetation indices, one of the most strongly correlated with the SPEI is the NDVI. In this context, Jalayer et al. (2023) presented an index closely related to the NDVI and obtained very low Pearson correlation values with the SPEI (0.258 with SPEI at a 6-month time scale). Rhee et al. (2010) used the scaled drought condition index (SDCI), which was also calculated from the NDVI, and obtained Pearson correlation values with a 6-month SPI of approximately 0.70 for an arid region in the month of May and for the grassland category. Considering that our study area also belongs to an arid climate, their results are similar, although slightly lower, to those obtained in this work for grasslands, especially those categorized by the FME. The difference could be because their vegetation categories also included pasture hay and cultivated crops. In addition, those authors only analyzed 1 month of the whole spring period, and they used the SPI, which does not include the evapotranspiration effect.

Our findings show that LAI has the best correlations with SPEI in most of the categories and mainly in autumn. The effects of drought on the LAI were also analyzed by Bai et al. (2023), and their conclusions confirm our results: the intensity of drought occurrence in autumn has a very high influence on the LAI. In general terms, Kim et al. (2017) obtained good correlations between the LAI and drought (they used a 9-month time scale SPI) based on the idea that LAI values decreased under water stress conditions.

Additionally, the FAPAR index has also been correlated with SPEI in some studies. In this sense, our results were confirmed by Rossi et al. (2008), also working in Spain, who found that FAPAR anomalies seem to be more correlated with precipitation anomalies than NDVI anomalies over a wider range of land cover types, but they recommended that the study should be performed seasonally, as in our case. The European Drought Observatory (European Commission 2019) also recommends the use of the “FAPAR anomaly” as an indicator for the SPEI.

In contrast, the worst results in almost all vegetation types were obtained by the DMP index. In this sense, Smets et al. (2019) confirmed that “Some potentially important factors, such as drought stress, nutrient deficiencies, pests and plant diseases, are omitted in the DMP product… On the other hand, it might be argued that the adverse effects of drought, diseases and shortages of nutrients are manifested (sooner or later) via the RS-derived FAPAR. The DMP algorithm does not include a water stress factor to account for short-term drought stresses. In drought sensitive vegetation types, this can lead to an overestimation of the actual plant productivity.”

The best vegetation type in terms of correlation was grassland in all studied maps. High evapotranspiration rates or reduced growing seasons caused by increased summer drought have an unfavorable impact on pasture growth (Rubio and Roig 2017), making it highly vulnerable to drought, especially in warmer climates, as confirmed in Zhang et al. (2021).

This drought impact on vegetation activity in the Mediterranean region has been assessed by Gouveia et al. (2017), who also applied a cluster analysis with the NDVI quantifying drought severity and using the SPEI, and they performed the correlations by season, although they used only data from one representative month of each season. Our results are consistent with theirs, as they found a good correlation in spring for the vegetation type grouped as Oceanic Mediterranean (OM), which is type that includes our study area. However, their study was not as detailed as ours, as we were able to distinguish the correlation of the NDVI with and other indices for up to 12 vegetation categories (at the ad hoc map).

Regarding the spatiotemporal analysis, Jalayer et al. (2023) also performed a spatiotemporal study of some indices but did not compare the trends and groupings of all of them at the same time in space. In turn, Khoshnazar et al. (2023) performed a cluster analysis for drought data, but they did so not for the time series but for the spatial vulnerability to drought.

As climate change is modifying precipitation and evapotranspiration patterns, droughts will be increasingly severe and frequent throughout the world, especially in the study area due to its aridity and vulnerability. Based on the prediction map of potential evapotranspiration in a historical scenario of the National Plan for Adaptation to Climate Change (PNACC), it can be observed in Fig. 7 that the areas of the region with the highest volume of evapotranspiration are in the cluster in which all the locations studied behave in the same way in terms of vegetation indices and SPEI. These locations with the same performance have a mean potential evapotranspiration value of 87.43 mm/month, while the average for the country is 76.77 mm/month.

Fig. 7
figure 7

Potential evapotranspiration of the time series clusters

5 Conclusions

In this work, we performed a detailed study of the correlations between the SPEI meteorological index, and some vegetation indices provided by the Copernicus Global Land Service program during a prolonged drought episode in the Extremadura region, Spain.

The indices with the highest correlation with the SPEI were the FAPAR, LAI, and NDVI. Since the indices used are freely available and come from a reliable source with good resolution and continental coverage, they are candidates as drought predictors if the SPEI values are not available in the study area.

The correlations were studied by season and according to the vegetation categories classified according to three different cartographies. The results show that grasslands have the highest similarity of performance between all the studied indices. In this sense, the importance of the cartography used is clear, since the greater the detail, the more detailed the vegetation behavior in the face of drought.

For the seasons of the year, the highest correlation was found in autumn and spring. In the analyzed period, drought was the most severe in autumn. In addition, drought has the greatest effects during vegetation growth and decay, since once the maximum senescence or decay is reached, the climatological variations do not affect the plants.

The results of the spatiotemporal study are consistent with those of the correlation study, since grasslands were found to behave the most consistently temporally.

In addition, the location with coincidence in behavior was located in an area with a prediction of very high evapotranspiration values according to predictive models of climate change.

The major significance of the outcomes of this study reveals the agreement in the spatiotemporal behavior of the indices during the drought period studied also reflects well the vulnerability to drought, therefore, the behavior of the vegetation indices can be used a predictor of drought events.