Introduction

Agricultural drought monitoring is a pre-requirement for agricultural management and is of great importance for sustainable development, especially in arid regions. Therefore, it is crucial to develop an integrated agricultural drought index to understand the evolution of agricultural drought monitoring, forecasting, risk assessment, and crisis management (Feng et al. 2019; Tian et al. 2022). Challenges related to monitoring agricultural drought include, but are not limited to, obtaining the required datasets. Global gridded and satellite products with high spatial resolution and temporally continuous datasets can be very useful in monitoring any type of drought, particularly agricultural drought (Al-Yaari et al. 2019).

One of the main challenges in using reanalysis or other types of global products for drought analysis is dealing with errors in these products compared to ground-based measurements (Ma et al. 2019; Fan et al. 2022). Despite extensive research on error analysis of global products for precipitation (Azizi et al. 2020; Saemian et al. 2021; Ghomlaghi et al. 2022; Fooladi et al. 2023), temperature (Gleixner et al. 2020; Rodrigues 2021; Zhang et al. 2021; Xu et al. 2023), evapotranspiration (Shirmohammadi-Aliakbarkhani and Saberali 2020; Elnashar et al. 2021; Ochege et al. 2021; Panahi et al. 2021; Zhu et al. 2022; Liu et al. 2023; Yao et al. 2023), and soil moisture (Li et al. 2020, 2022; Wu et al. 2021; Fan et al. 2022; Huang et al. 2022; Sion et al. 2022), there are still research gaps when it comes to the combination of global products (Allies et al. 2022; Min et al. 2022). Many studies have evaluated combination models to increase the accuracy of multi-product approaches compared to single-product approaches (Fooladi et al. 2021, 2023; Jiao et al. 2021; Chen et al. 2022). In recent years, multi-product datasets have been used for developing integrated drought indices (Jiao et al. 2021; Alkaraki and Hazaymeh 2023a).

The past few decades have been accompanied by a transformation in methods for developing drought indices, shifting from single input variables (e.g., standardized precipitation index (SPI) (McKee et al. 1993) and soil moisture index (SSI) (Hao and AghaKouchak 2013)) to multi-variate indices (e.g., standardized precipitation evapotranspiration index (SPEI) (Vicente-Serrano et al. 2010), standardized precipitation temperature index (SPTI) (Wable et al. 2019), and new multivariate standardized drought index (MSDI) (Hao and AghaKouchak 2013)). Additionally, indices based on parametric probability distribution functions have been replaced by nonparametric frameworks that describe drought more effectively (Hao and Aghakouchak 2014; Farahmand and AghaKouchak 2015; Alizadeh and Nikoo 2018). Furthermore, the development of combined condition indices (CCIs) such as vegetation health index (VHI) (Kogan 1995), scaled drought condition index (SDCI) (Rhee et al. 2010b), microwave integrated drought index (MIDI) (Zhang and Jia 2013), and integrated drought monitoring index (IDMI) (Arun Kumar et al. 2021) has been widely advanced using single condition indices (SCIs) such as precipitation condition index (PCI) (Rhee et al. 2010a), temperature condition index (TCI) (Kogan 1995), potential evapotranspiration condition index (PETCI) (Allen et al. 2007), soil moisture condition index (SMCI) (Zhang and Jia 2013), and vegetation condition index (VCI) (Kogan 1995).

Identifying suitable variables that contribute to the prediction accuracy of drought indices poses significant challenges in drought management (Ghazipour and Mahjouri 2022). To develop accurate drought prediction models, prior to selection, feature selection (FS) methods have been employed to identify the most effective predictors (Feng et al. 2019). Although feature selection methods have been widely utilized in various disciplines, their application in drought research has been limited to a few studies, such as variable selection using random forests (VSURF) (Feng et al. 2019), principal component analysis (PCA) (Arun Kumar et al. 2021), and Boruta random forest (Jamei et al. 2023). This study attempts to bridge this gap by proposing a comprehensive framework for predicting agricultural drought using feature selection methods based on different climates. Various machine learning methods such as artificial neural networks (ANNs) (Deo and Şahin 2015; Feng et al. 2019) and support vector regression (SVR) (Azamathulla and Ghani 2011; Malik et al. 2021; Ghazipour and Mahjouri 2022) have been employed thus far.

Based on the comprehensive review of the literature, there is a potential to enhance the performance of integrated agricultural drought indices. This study makes significant contributions to the literature by (1) presenting a comprehensive accuracy evaluation by combining six products (i.e., CRU TS4.05, TerraClimate, ERA5, MERRA2, GLDAS2.1, and GLEAM3.6a) for four key agrometeorological variables including mean temperature (T), precipitation (P), root-zone soil moisture (SM), and potential evapotranspiration (PET) using in situ observations under various climates in Iran; (2) determining the significance of key drought variables for predicting agricultural drought based on various FS methods; (3) developing SVR models with the best predictors for accurate agricultural drought prediction.

This study developed a machine learning feature selection-based model for predicting agricultural drought prediction multi-product datasets. The main research aims included (i) determining the superior combination of products for P, T, PET, and SM variables with the highest accuracy using ground-based measurements; (ii) finding appropriate drought-affecting variables for predicting drought indices between SCIs, and (iii) comparing new integrated agricultural drought indices based on SVR FS-based with CCIs (i.e., VHI, SDCI, MIDI, and IDMI) using statistical criteria and drought characteristics.

Material and methods

Case study

The study area, Iran, is located between the latitude of 25°–40° N and longitude 44°–63° E (Fig. 1). As the second biggest country in the Middle East, Iran has a population of ~ 82 million and covers an area of 1,648,195 km2 (Bazrafshan and Cheraghalizadeh 2021; Rezaei 2021). The lowest and the highest points in Iran are 29 m below sea level and 5597 m above mean sea level located on the southern coast of the Caspian Sea and Mount Damavand, respectively (Bazrafshan and Cheraghalizadeh 2021).

Fig. 1
figure 1

Climate classification of stations and basins based on the aridity index across the study area

The mean annual P in the 1987–2019 period was 357 mm, and the difference between the lowest (52 mm y−1) and highest (1692 mm y−1) P was 1642 mm (Madani 2014; Moshir Panahi et al. 2020). PET for the study site was computed based on the FAO 56 Penman–Monteith (PMF56) method (Allen et al. 1998). The mean difference of PET in Iran is approximately 2868 mm y−1 due to climate diversity from the Caspian Sea’s southern coast to the southeastern regions in Iran (Sharafi and Mohammadi Ghaleni 2021).

The aridity index (AI) value for the study site was calculated based on the ratio of mean annual P to the mean annual PET (AI = P/PET). This index was used to categorize the aridity of the basins, including hyperarid (AI ≤ 0.03), Arid (0.03 < AI ≤ 0.2), semiarid (0.2 < AI ≤ 0.75), and humid (AI > 0.75) regions (UNESCO 1979; Tsiros et al. 2020). In this study, AI was used to classify the hydrological sub-basin climates of the case study (Fig. 1).

Iran has 30 main hydrological sub-basins, 9 of which (37%) were classified as hyperarid (HA), 11 (42%) as arid (AR), 7 (18%) as semiarid (SA), and 3 (2%) as humid (HU) climates (Fig. 1). The specifications of these basins are presented in Table 1.

Table 1 Specifications of basins in the present study

The workflow of the research is summarized in six steps (Fig. 2).

Fig. 2
figure 2

Multistep workflow for agricultural drought prediction based on ensemble models with various feature selection methods

Sources of datasets

The research datasets were obtained from three sources including ground-based measurements, reanalysis products, and remote sensing datasets. These data sources are described in the following subsections.

Ground-based measurements

Two types of ground-based datasets, including synoptic and agrometeorological station measurements, were sourced from Iran's Meteorological Organization (i.e., IRIMO https://data.irimo.ir/login/login.aspx). The daily meteorological variables including P, minimum and maximum air temperature (Tmin and Tmax), sunshine duration (Sd), relative humidity (RH), and wind speed (U2) for the period 1987–2019 were obtained from IRIMO for 100 synoptic stations (Fig. 1). Daily PET was obtained using the FAO 56 Penman–Monteith (PMF56) method (Allen et al. 2006) and measured meteorological variables for 100 synoptic stations (Table 2).

Table 2 Ground-based measurement datasets used in the present study

Addtionally, SM data in seven depths (including, 5, 10, 20, 30, 50, 70, and 100 cm below the soil surface) with a temporal resolution of 3-h from 43 agrometeorological stations during 2014–2021 were used (Table 2). Among the 43 stations, 20 stations were selected after preprocessing, based on having the least missing data for all depths within the 2014–2019 period. To determine the average root zone soil moisture (0–100 cm), the soil moisture measurements for each station from various depths are weighted after calculating the monthly soil moisture at all depths (Zhang et al. 2020; Xu et al. 2021; Ji et al. 2022). Weights were assigned to measurement depths based on the ratio of the distance between adjacent depths and the total root-zone depth. The assigned weights were 0.05, 0.05, 0.10, 0.10, 0.20, 0.20, and 0.30 for depth intervals of 0–5, 5–10, 10–20, 20–30, 30–50, 50–70, and 70–100 cm, respectively. In this analysis, various clusters or climates were analyzed (Table 1). The variation and anomalies of P, T, PET, and SM in four climates (i.e., HA, AR, SA, and HU) are presented in Fig. 3.

Fig. 3
figure 3

Variation and anomaly of a P, b T, c PET, and d SM in four climates

The yearly changes in variables (solid black line), trend of variation (dashed cyan line), and long-term anomalies (red and green bar charts) are presented in Fig. 3. A declining trend for P was detected across all climates (Fig. 3a1–a4), which influenced the overall precipitation trend in Iran. The negative precipitation anomalies during 2016–2018 were noticeable in hyper-arid (Fig. 3a1) and arid climates (Fig. 3a2). In recent years (2014–2019), the wettest year was 2019, with positive anomalies of 21, 67, 89, and 164 mm in HA, AR, SA, and HU climates, respectively. Meanwhile, 2017 was the driest year with anomalies of -35, -27, -150, and -328 mm from the long-term mean in HA, AR, SA, and HU climates, respectively. A clear upward trend in temperature was evident across all studied climates (Fig. 3b1–b4) with positive temperature anomalies starting from 1998. The average annual temperature increased by 1.24, 0.57, 0.88, and 1.40 °C from 1987 to 2019 in HA, AR, SA, and HU climates, respectively.

As observed, the increasing trend of PET (Fig. 3c1–c4) corresponds to an increasing trend of temperature in all climates. The PET exhibited a significant rising trend, with an approximately 13, 7, 5, and 3 mm per year increase from 1987 to 2019, in HA, AR, SA, and HU climates, respectively. Positive anomalies of PET were seen in 2000 in different climates. The variation in soil moisture showed a downtrend in all climates except for the humid climate (Fig. 3d4), which had an increasing trend. The greatest decrease in SM is associated with the highest increase in T and PET, occurring in the hyperarid climate (Fig. 3d1). The negative anomaly of SM can be observed in 2017, indicating an agricultural drought.

Reanalysis products

In this study, six global gridded products including CRU TS4.05, ERA5, TerraClimate, MERRA-2, GLDAS2.1, and GLEAM 3.6a were used for two aims: (i) to evaluate the variable accuracy of these products compared to ground measurements and (ii) determine a superior combination of these products for calculating the drought condition indices. A summary of this study’s product characteristics is demonstrated in Table 3.

Table 3 Global gridded products reanalysis-based specifications

In this study, a range of 0–100 cm was defined for the root-zone depth, and different datasets with varying soil layer depths were used. To obtain the averaged soil moisture in this range, the datasets were weighted based on their corresponding depths. For example, soil moisture measurements from GLDAS at depths of 0–10 cm, 10–40 cm, and 40–100 cm were weighted with values of 0.1, 0.3, and 0.6, respectively, to calculate the root zone soil moisture (Xu et al. 2021; Ji et al. 2022). Here, the TerraClimate dataset represents soil moisture as a root zone depth due to the absence of 100 cm depth segmentation in its variables. As for the GLEAM products, the downloaded variable for root-zone soil moisture was used.

Remote sensing data

The VCI datasets used here are based on data from vegetation health products developed by the National Oceanic and Atmospheric Administration and processed by the Center for Satellite Applications and Research (i.e., NOAA STAR) (https://www.star.nesdis.noaa.gov/smcd/emb/vci/VH/vh_ftp.php).

The vegetation health products include the VCI obtained from the radiance data captured by the advanced very high-resolution radiometer (AVHRR). The vegetation health products derived from AVHRR utilize the NOAA global area coverage dataset, spanning from 1981 to the present. The data and images in the vegetation health products (4-km spatial resolution) are composited weekly. The VCI time series from 1981 to 2021 was downloaded in the NETCDF format (https://www.star.nesdis.noaa.gov/pub/corp/scsb/wguo/data/Blended_VH_4km/VH/) with a weekly temporal resolution (Fernández-Tizón et al. 2020).

Accuracy evaluation of single- and multi-products

The accuracy of different products varies with climate, elevation, and location (Fallah et al. 2020; Fooladi et al. 2023). To address this gap, it is recommended to create a combined dataset by integrating multiple products. This approach aims to enhance the accuracy of the resulting dataset. However, before combining products, it is necessary to evaluate single products against in situ observations to select the best options.

For a more accurate combination of products, observations of soil moisture as a key variable in agricultural drought prediction were classified in different climates based on AI. The accuracy of P, T, PET, and SM variables was assessed separately in each climate.

Then, the accuracy ratio of the six products (i.e., CRU, ERA, TerraClimate, MERRA, GLDAS, and GLEAM) for the four variables was assessed using three different methods, namely (i) superior single product (SSP), (ii) superior multiple products in each month from January to December (SMM), and (iii) superior multiple products in each month from first to end of time series (SMT).

In the SSP method, the product with the highest accuracy was selected for each of the four variables. In SMM, first, the time series of the variables was broken into 12 (months). Then, the product with the highest accuracy in each month was selected. In SMT, for each month from the start to the end of the time series, the products with the nearest value to the observations were selected. Figure 4 shows the violin plots of variables related to six products and three combination-based approaches (i.e., SSP, SMM, and SMT) versus in situ observations for each variable (P, T, PET, and SM) in different climates.

Fig. 4
figure 4figure 4

Probability distribution function of single and combined products compared to observed variables as violin plots in climates

Figure 4 indicates the violin plots of T, P, PET, and SM in four climates including HA, AR, SA, and HU related to six products and three combined-based approaches compared to observations.

The mean and median values (represented by the red and green lines) indicate that the ranges of variable data across all climates close estimations to the observations. This demonstrates the accuracy of the combined products in capturing the variability in the variable datasets. In contrast, the results of single products for each variable deviate from observations in all climates. The combined approach, which involves integrating multiple reanalysis products, demonstrates a higher level of accuracy and lower uncertainty when compared to observations. As shown in Fig. 4 a-g, the superior single product for P and T variables was ERA5 in HA, AR, and HU climates, and CRU in the SA climate. SSP for PET (Fig. 4i–l) was CRU in the HA, AR, and SA climates and GLEAM in the HU climate. SSP for SM (Fig. 4p) was GLEAM in HA, AR, and SA (Fig. 4m–o) and ERA5 in HU. According to the violin plot in Fig. 4, the soil moisture data series in the reanalysis product exhibited a higher number of anomalies compared to other variables. This finding highlights the complexity of modeling soil moisture, particularly in the root zone depth, which is a crucial variable for predicting agricultural drought.

Drought indices

The drought indices used in this research were divided into three categories including standardized indices (SIs), SCIs, as well as CCIs. SIs include SPEI, SPI, and SSI, which were calculated using ground-based data, as target indices for agricultural drought prediction. SCIs were assumed as features (inputs or predictors), which are calculated based on the most accurate combined products (SMT procedure) to model agricultural drought. These included the PCI, TCI, PETCI, SMCI, and VCI. To assess the accuracy of predicting agricultural indices, several widely used combination indices such as VHI, SDCI, IDMI, and MIDI were taken into account. These three types of indices are explained in the following sections.

Single condition indices (SCIs)

Many studies have used single products to calculate SCIs, such as calculating PCI using global precipitation climatology center (GPCC) (Zhang et al. 2017), climate hazards group infrared precipitation with stations (CHIRPS) (Kumar et al. 2021), global precipitation measurement (GPM) (Alkaraki and Hazaymeh 2023b), and tropical rainfall measuring mission (TRMM) (Cai et al. 2023; Zhang et al. 2022a, b), TCI using MODIS (Arun Kumar et al. 2021; Zhang et al. 2022b; Alkaraki and Hazaymeh 2023b; Cai et al. 2023), PETCI using MODIS (Alkaraki and Hazaymeh 2023b), SMCI using GLDAS (Zhang et al. 2017), ESA-CCI (Arun Kumar et al. 2021), and SMAP(Alkaraki and Hazaymeh 2023b), and VCI using AVHRR (Zhang et al. 2017) and MODIS (Arun Kumar et al. 2021; Cai et al. 2023; Zhang et al. 2022a, b). In this study, the selected five SCIs including PCI, TCI, PETCI, SMCI, and VCI were calculated using the SMT datasets. The formulas for obtaining SCIs are listed in Table 4.

Table 4 Descriptions of single condition indices used in the study

All these SCIs range from zero to one, with smaller value indicating a more severe drought (Zhang et al. 2022a). For all five SCIs, multiple timescales including 1, 3, 6, 9, and 12 months, were calculated in all climates. In this way, by calculating five SCIs in five-time scales, a matrix with 25 columns (features) by 72 rows (monthly datasets from Jun-14 to Dec-19 based on SM ground-based data) was prepared as an input matrix for the prediction model in each climate.

Standardized indices (SIs)

So far, several studies have used standardized indices at multiple timescales, especially SPI (Zhang et al. 2017; Alkaraki and Hazaymeh 2023b; Yin and Zhang 2023) and SPEI (Tian et al. 2020; Ali et al. 2022; Yang et al. 2023), to evaluate the newly developed indices. To avoid probability distribution function fitting, which may not always be the best selection of a distribution function, a nonparametric approach was proposed for deriving SIs (Farahmand and AghaKouchak 2015; Fooladi et al. 2021). The SIs including SPI, SPEI, and SSI were assumed for target selection. To achieve this aim, first, SPIn, SPEIn, and SSIn were calculated at multiple timescales (i.e., n = 1, 3, 6, 9, and 12 months) based on ground measurements. Then, the correlation between SCIs and SIs was examined at various timescales to determine which SI timescale is more correlated with SCIs and select optimal SIs as agricultural target indices.

Between SPI, PSEI, and SSI at multiple timescales, SPEI-3 and SPEI-6 were selected as target indices for agricultural drought prediction. Some studies have selected SPEI-3 (Guo et al. 2019) or SPEI-6 (Tian et al. 2020) as agricultural target indices. The SPI index does not account for the influence of global warming and climate change (Kazemzadeh et al. 2022). SPEI is considered more useful for assessing drought events as it takes into account both precipitation and evapotranspiration, providing a comprehensive perspective on drought conditions. This study employed SPEI-3 with a seasonal accumulation period and SPI-6 with a semi-yearly accumulation period to characterize seasonal drought and semi-annual agricultural drought events.

Combined condition indices (CCIs)

Table 5 presents four CCIs derived from weighted linear combinations of SCIs. The CCIs include the MIDI, SDCI, VHI, and IDMI. The formulas for calculating these CCIs, along with their respective references, are provided in Table 5.

Table 5 Characteristics of combined condition indices used in the study

The linear combination method, similar to the SCIs illustrated in Table 5, is the simplest method for generating new multivariate drought indices (Guo et al. 2019). However, calculating the weights of different variables due to various accumulation periods can be challenging (Hao and Singh 2015). This study's aims were accomplished by considering the research background on linear weighted CCIs by (a) adding PETCI as one of the most important variables for developing an integrated agricultural condition drought index to IDMI, (b) considering accumulation period of SCIs for accurate agricultural drought prediction using multi-timescale SCIs as predictors, and (c) clustering the datasets based on different climates to consider the effects of climates on the relations between key variables.

Feature selection strategy

The FS strategy plays a crucial role in drought prediction by identifying the most relevant and informative features or variables that contribute to accurate prediction of drought events (Feng et al. 2019; Rostami et al. 2021). Feature selection methods have several advantages such as reducing variance, increasing the model’s accuracy, and shortening training time (Lap et al. 2023). The feature selection methods in this research were applied for two reasons; (i) selecting the more efficient and relevant SCIs (i.e., PCI, TCI, PETCI, SMCI, VCI) or predictors for agricultural drought prediction in various climates and (ii) increasing the accuracy of the machine learning prediction model by eliminating redundant features.

Feature selection methods can be classified into three primary categories based on the level of supervision available: unsupervised, semi-supervised, and supervised. Additionally, these methods are categorized into five groups based on the selection strategy employed: Filter, Embedded, Wrapper, Hybrid, and Ensemble (Alhenawi et al. 2022). In this study, five feature selection methods including PCA, ReliefF, minimal redundancy maximal relevance (MRMR), neighborhood components analysis (NCA), and Osprey algorithm optimization (OOA) were used as appropriate predictors in agricultural drought prediction. Among the five selected FS methods, PCA, ReleifF, and MRMR belong to the Filter group, NCA to the Embedded group, and OOA to the Wrapper group. These FS methods are explained briefly in the following sections.

Filter methods

Within filter methods, features are chosen based on the statistical characteristics of the data, without employing any learning models. In this particular model, features are assessed and prioritized using information-theoretic measures, and subsequently, the features with the highest ranks are chosen (Labani et al. 2018). In this study, three methods including PCA, ReliefF, and MRMR from filter approaches were used for selecting the best inputs in agricultural drought modeling.

PCA is a mathematical method utilized to decrease the dimensionality of a dataset (Jackson 1983). Due to its simplicity and nonparametric nature, PCA has been extensively employed in various types of analyses as a means of extracting pertinent information from intricate datasets (Wold et al. 1987; Arun Kumar et al. 2021). In PCA, as a feature selection method, large absolute factor loadings indicate that the corresponding variables have a greater impact on the factor compared to other variables (Guo et al. 2002).

The ReliefF algorithm (Kira and Rendell 1992), initially developed for binary classification problems, has been improved to address multiclass problems through the ReliefF algorithm (Kononenko 1994). The ReliefF algorithm was developed by adapting ReliefF to address continuous class or regression problems (Robnik-Šikonja and Kononenko 1997). In this research, the SVR model was constructed using the first k-PC factors, and for each factor, the top five SCIs with the highest loadings were chosen as inputs for the PCA.

This study employed ReliefF and MRMR, widely recognized univariate filters (Alhenawi et al. 2022). The ReliefF method chooses random instances and looks for a specified number of nearest neighbors with the same class, as well as K-nearest neighbors with different classes. After that, the process is repeated for each feature by computing the average of all classes over a specific number of iterations (Zhang et al. 2003; Alhenawi et al. 2022). The MRMR method chooses features that exhibit higher relevance (demonstrating a strong correlation with the target class) and lower redundancy (displaying a weak correlation with other features) (Mandal and Mukhopadhyay 2013).

Embedded strategy

In the embedded model, a learning algorithm is employed to explore and find the optimal set of features (Zhang et al. 2015). As an embedded strategy, the NCA algorithm (Goldberger et al. 2005) was incorporated in this study as one of the learning methods. NCA does not rely on any parametric assumptions regarding the data distribution and can naturally handle multiclass problems (Yang et al. 2012). Here, NCA was used for selecting SCIs with more relevance to agricultural drought index targets (i.e., SPEI-3 and SPEI-6).

Wrapper strategy

In the wrapper approach, a search method is employed to discover the best feature subset. At each step of the search strategy, a subset of features is generated and evaluated using a classifier or another learning model. Ultimately, the optimal generated feature subset is chosen as the final feature set (Chandrashekar and Sahin 2014; Rostami et al. 2021). The OOA, developed by Dehghani and Trojovský (2023), is inspired by the hunting behavior of Osprey. The algorithm mimics the hunting strategies of ospreys, showcasing an exceptional ability to optimize and swiftly converge. Drawing upon the inborn capability of ospreys, the algorithm presented in this study demonstrates a remarkable ability to efficiently identify optimal solutions (Hu et al. 2023). In this study, OOA was applied as a search strategy for selecting more relevant SCIs in each climate or clustering region (Fig. 1).

Developed indices

SVR, functioning as a supervised learning mechanism, employs the concept of structural risk minimization to improve the handling of multi-dimensional problems. The SVR approach utilizes linear equations within the simulation algorithm, resulting in improved performance by effectively employing a kernel function. In the SVR method, a nonlinear mapping of ϕ is computed in the trait space, where Xt represents the input data in Rm and Y(Xt) represents the output data in R (Drucker et al. 1996; Vapnik 1999):

$$ Y(X_{t} ) = w^{T} \times \phi (X_{t} ) + b $$
(10)

Here, the weights (w) and biases (b) represent the values of the regression function, which are determined by minimizing the following function.

$$ \begin{gathered} \mathop {\min }\limits_{{w.b.e_{i} }} j(w.e) = \frac{1}{2}w^{T} w + \frac{\gamma }{2}\sum\nolimits_{t = 1}^{n} {e_{t}^{2} } \hfill \\ S.T:w^{T} \times \phi (X_{t} ) + b\begin{array}{*{20}c} {} & {t = 1,2,...,n} \\ \end{array} \hfill \\ \end{gathered} $$
(11)

In Eq. (11), e represents the errors, while gamma (γ) corresponds to the regularization parameters utilized in the model to regulate the smoothness of the approximation function. The optimal figures for these parameters are defined by the users. A review of the literature shows that SVR is commonly used for agricultural drought prediction (Tian et al. 2018; Prodhan et al. 2022).

Here, SVR was performed with six input datasets including all 25 features, i.e., five SCIs (PCI, TCI, PETCI, SMCI, and VCI), by five multi-timescales (1, 3, 6, 9, and 12 months) as five SCIs selected using PCA, RReliefF, MRMR, NCA, and OOA FS methods. The predicted agricultural drought indices with each input dataset were compared with SPEI-3 and SPEI-6 as target agricultural indices in different climates using statistical criteria.

To prioritize the selection of pertinent inputs in the study, the cross-correlation coefficient approach is employed to assess the time delay between SPEI3 and SPEI6, and each input variable. The analysis reveals that the lag times for each index (PCI, TCI, PETCI, SMCI, and VCI) in relation to SPEI3 and SPEI6 mostly fall within the same month (zero lag time) across the majority of the four climates. Furthermore, the correlation coefficient between the chosen input indices for the preceding 0–12 months and the current SPEI3 and SPEI6 generally diminishes as the lag time increases, particularly beyond 3 months, in most regions examined. Consequently, the input indices are considered to be contemporaneous with SPEI3 and SPEI6 in the modeling process, without any lag time.

Drought characteristics

The run theory (Herbst et al. 1966) is commonly used for drought event identification. Typically, a threshold of -1 is employed to identify drought events. Droughts generally occur when there is a persistence of below-average precipitation for a duration exceeding three consecutive months (Liu et al. 2016; Haile et al. 2020). A drought event was defined here as a period lasting for at least three consecutive months, during which the drought index values remained below − 1. Three classes of drought including moderate, severe, and extreme are distinguished for different indices based on Table 6.

Table 6 Drought classification based on SIs and CCIs used in this study

Drought characteristics such as the number of events (N), duration (D), frequency (F), and intensity (I) were calculated for each drought event using the categories outlined in Table 6. The drought duration of a drought is computed using Eq. (12) (Xu et al. 2019):

$$ D = \frac{{\sum\nolimits_{i = 1}^{n} {d_{i} } }}{n} $$
(12)

The D is determined by summing the durations of individual drought events (di) for each station, where n represents the total drought events. The drought frequency is calculated through Eq. (13) (Spinoni et al. 2014; Wang et al. 2018):

$$ F = \frac{{n_{{\text{m}}} }}{{N_{{\text{m}}} }} \times 100\% $$
(13)

F is calculated using drought months (nm) divided by total months (Nm). The drought intensity (I) is defined by Eq. (14) (Wang et al. 2018; Haile et al. 2020).

$$ I = \left| {\frac{1}{n}\sum\nolimits_{i = 1}^{n} {DI_{i} } } \right| $$
(14)

I is determined by summing the accumulated drought index values below the threshold (− 1) for each drought event (i), where n represents the total number of drought events. In this study, drought characteristics including N, D, F, and I were calculated for target indices (i.e., SPEI-3 and SPEI-6), CCIs (i.e., VHI, SDCI, MIDI, and IDMI) and six modeled indices using SVR (i.e., all features as well as PCA, ReliefF, MRMR, NCA, and OOA methods) in various climates from Jan-14 to Dec-19.

Evaluation criteria

Various performance criteria have been employed thus far to assess the quality of reanalysis datasets and evaluate the accuracy of prediction results. In this study, the accuracy of the reanalysis data and each model's predictions was evaluated using six statistical measures: the Pearson correlation coefficient (R), coefficient of determination (R2), mean bias error (MBE), root mean square error (RMSE), mean absolute error (MAE), and normalized root mean square error (NRMSE). The explanations for each of the statistical measures are provided in the following section.

$$ R = \frac{{\sum\nolimits_{t = 1}^{N} {\left( {M_{t} - \overline{M} } \right)\left( {E_{t} - \overline{E} } \right)} }}{{\sqrt {\left[ {\sum\nolimits_{t = 1}^{N} {\left( {M_{t} - \overline{M} } \right)^{2} } } \right]\left[ {\sum\nolimits_{t = 1}^{N} {\left( {E_{t} - \overline{E} } \right)^{2} } } \right]} }} $$
(15)
$$ R^{2} = \left[ {\frac{{\sum\nolimits_{t = 1}^{N} {\left( {M_{t} - \overline{M} } \right)\left( {E_{t} - \overline{E} } \right)} }}{{\sqrt {\left[ {\sum\nolimits_{t = 1}^{N} {\left( {M_{t} - \overline{M} } \right)^{2} } } \right]\left[ {\sum\nolimits_{t = 1}^{N} {\left( {E_{t} - \overline{E} } \right)^{2} } } \right]} }}} \right]^{2} $$
(16)
$$ {\text{MBE}} = \frac{1}{N}\sum\nolimits_{t = 1}^{N} {\left( {E_{t} - M_{t} } \right)} $$
(17)
$$ {\text{MAE}} = \frac{1}{N}\sum\nolimits_{t = 1}^{N} {\left| {E_{t} - M_{t} } \right|} $$
(18)
$$ {\text{RMSE}} = \sqrt {\frac{1}{N}\sum\nolimits_{i = 1}^{N} {\left( {E_{t} - M_{t} } \right)^{2} } } $$
(19)
$$ {\text{NRMSE}} = \frac{{{\text{RMSE}}}}{{\overline{M} }} $$
(20)

In Eqs. (15) to (20), \(M_{t}\) and \(E_{t}\) are the variables based on measured and modeled data, \(\overline{M}\) and \(\overline{E}\) are the mean values of measured and modeled variables, respectively, and N represents the number of datasets.

Results and discussions

Accuracy evaluation of products

A single reanalysis product has uncertainties in temporal and spatial spaces, which can be reduced by combining multiple products. The results in Table 7 indicated that SMT can significantly improve the accuracy of variables in all climates.

Table 7 Accuracy of single and combined time series using reanalysis products

The accuracy of SMT based on MAE for the P variable showed an increase of 49%, 51%, 62%, and 26% in comparison with superior single products (SSP_ERA5) in HA, AR, SA, and HU climates, respectively. Overall, the accuracy ranking with regard to MAE and R2 of the five P and T products was SMT > SMM > SSP(ERA5) > CRU > GLDAS > TERRA > MERRA.

The CRU product was selected as the SSP for PET in all climates. The mean absolute error was reduced by 10%, 15%, 18%, and 6%, regarding SMT in comparison with SSP for PET in HA, AR, SA, and HU climates, respectively. The single products, even for SSP, showed a low correlation with ground-based soil moisture measurements, especially in the HA climate. In contrast, SMT indicated high accuracy with a maximum MAE of 7% for soil moisture volume in HA climates.

Taylor diagrams, originally introduced by Taylor (2001), were utilized here to provide a succinct summary and overview of the performance of the reanalysis products. The Taylor diagram (Fig. 5) presents three statistical indices, namely root mean square deviation (RMSD), standard deviation (SD), and R. Figure 5 illustrates the Taylor diagram for each individual and combined dataset, categorized by different climates in Iran.

Fig. 5
figure 5

Taylor diagram of single and combined reanalysis products for each climate

As shown in Fig. 5a1–a4, the Taylor diagrams provide a visual representation of the accuracy of precipitation data for each individual reanalysis product dataset as well as two combined datasets (SMM and SMT) compared to ground-based measurements across all climates. In this regard, it can be observed that both combined datasets (SMM and SMT) exhibit higher accuracy compared to single precipitation products. Furthermore, the SMT procedure demonstrated superior performance compared to SMM in terms of precipitation data across all climates.

As observed from the Taylor diagrams, ERA5 and CRU products exhibited better agreement with ground-based measurements of P, T, and PET. This is evident from their higher R and lower RMSD when compared to the ground-based data series. The SD and RMSD are crucial indicators for evaluating dataset accuracy. In this context, both combined datasets exhibit reduced RMSD, while maintaining a similar SD for all variables. This highlights the effectiveness of the combined datasets in improving the accuracy of the data. In general, Fig. 5 demonstrates that the SMT generated using single reanalysis products achieves a notable enhancement in the accuracy of variable time series when compared to ground-based measurements.

Feature selection results

Figure 6a, b demonstrates the average percentage of SCIs in each FS method, in which a specific feature is selected for SPEI-3 and SPI-6, respectively. The results for SPEI-3 (Fig. 6a) and SPEI-6 (Fig. 6b) show similarity in the average percentage of selection by SCIs in each FS method. As can be seen among all SCIs, PCI, PETCI, and SMCI were the most important and selected 40, 18, and 17 percent of the features through the five feature selection methods. PCI has been reported by other researchers as the most important index in agricultural drought prediction (Arun Kumar et al. 2021; Zhang et al. 2022b; Cai et al. 2023). Out of the five SCIs, TCI and VCI were selected less than other SCIs for agricultural drought prediction in SPEI-3 and SPEI-6.

Fig. 6
figure 6

Average percentage of selection of SCIs in each FS method for SPEI-3 and SPI-6

In Table 8, the average percentage of relative importance for SCIs is provided separately for each climate. These results show that PCI, in all climates except HA, has the highest relative importance among SCIs, which is marked with a double underline.

Table 8 Average percentage of the relative importance of SCIs for each target and climate

The highest relative importance of SCIs in the HA climate belongs to PETCI with 40 and 36 percent for SPEI-3 and SPEI-6, respectively. This shows that PET plays a crucial role in agricultural drought prediction in hyperarid regions because of very low precipitation. The relative importance of SMCI increases from HA to HU climate, which shows the greater effect of soil moisture in humid regions on agricultural drought. The correlation coefficients as scatter plots of modeled and target indices SPEI-3 and SPEI-6 for all CCIs are illustrated in Fig. 7a1–d1 and a2–d2.

Fig. 7
figure 7figure 7

Scatter plots of modeled and target indices SPEI-3 (a1–d1) and SPEI-6 (a2–d2) for all CCIs in different climates

The highest and lowest R in Fig. 7 belongs to SVR with all features as inputs (SVR-all f.). Furthermore, three CCIs (i.e., SDCI, MIDI, and IDMI) showed a similar range of R with an average correlation coefficient of 0.44. Besides, R had higher values in the SA climate (see Fig. 7c1 and c2), indicating the higher accuracy of agricultural drought indices in semiarid regions compared to other climates.

Drought characteristics

The intensity for three drought classes, including extreme drought (dark red color), severe drought (red color), and moderate color (orange color), is analyzed based on the drought classes of Table 6. These drought events were shown for all combined condition indices (SVR models and CCIs) compared to SPEI-3 and SPEI-6 in all agrometeorological stations from Jan. 14 to Dec. 19. The most severe agricultural droughts occurred during Jun. 17 and May 18 (for SPEI-3) and Sept. 17 and May 18 (for SPEI-6).

The highest similarity between SPEI-3 and SPEI-6 as target agricultural drought indices and other indices belonged to SVR-all f. Moreover, SVR with inputs from the ReliefF method demonstrated the same drought events as target indices. Among all indices, VHI, IDMI, and PCA showed more disagreement with SPEI-3&6 in determining drought classes, especially in the period 2017–2018. The variations in the drought frequency based on studied indices are illustrated in Fig. 8 separately for each climate.

Fig. 8
figure 8

Box plots of drought frequency for studied indices in different climates

The multiplication signs (×) in box plots of Figure 8 show the average drought frequency in all studied drought indices including targets (i.e., SPEI-3&6), CCIs (i.e., VHI, SDCI, MIDI, and IDMI), and SVR results with different inputs (i.e., all features, PCA, ReliefF, MRMR, NCA, OOA). The average frequency of these agricultural drought indices for HA, AR, SA, and HU climates was 47, 50, 59, and 42 and 35, 33, 54, and 39 percent in 3- and 6-month timescales (Fig. 8a-b). The mean duration of drought events based on SVR and CCIs indices versus targets is shown in Table 9 for each climate.

Table 9 Duration of agricultural drought (month) based on CCIs and SVR indices in different climates

By increasing the time scales of SPEI from 3 to 6 months, drought duration has increased significantly, especially in humid climates from 12.0 to 21.0 months. So far, many studies (Mesbahzadeh 2020; Kazemzadeh et al. 2022; Naderi and Moghaddasi 2022; Sharafi and Ghaleni 2022) indicate a decrease in the number of events and drought frequency and an increase in severity and drought duration by increasing the timescale of drought indices.

Results of Table 9 indicated that SVR with all features as input had the nearest duration compared to the target indices. Meanwhile, SVR had the nearest duration to target indices in SPEI-3 for HA, AR, and SA climates and in SPEI-6 for AR climate with input features from ReliefF, OOA, NCA, and ReliefF.

Disscusion

In our study, we evaluated various feature selection techniques to determine their effectiveness in accurately monitoring drought across different climate regions in Iran. The results of our analysis indicate that ReliefF surpassed other methods in agricultural drought modeling due to its capacity to handle multiclass issues and its robustness in managing incomplete and noisy data (Bolón-Canedo et al. 2013). The best SVR results using ReliefF methods beside target indices are presented in Fig. 9.

Fig. 9
figure 9

Drought intensity in SVR ReliefF-based versus SPEI-3 and SPEI-6 indices in different climates

As shown in Fig. 9a1, a2, an increasing trend of drought intensity is observed in the hyperarid climate, especially for SPEI-6 (Fig. 9a2). The temporal drought intensity in semiarid and humid climates was severely low in comparison with other climates, especially in 2019.

Also, the findings underscore the significant role of potential evapotranspiration (PET) as a key predictor for agricultural drought modeling compared to other climatic variables especially in hyper-arid climates. PET is impacted by atmospheric water content, meaning that in cases of adequate water supply, potential evapotranspiration primarily mirrors atmospheric conditions (Abrar Faiz et al. 2022), a scenario commonly observed in humid climates. One reason for the heightened importance of PET in hyper-arid climates is the limited water supply in these regions and the crucial role of PET in the water balance within hyper-arid environments.

Before PET, P is considered the most critical feature for agricultural drought modeling across various climates. This is because precipitation is a primary determinant of water availability for agricultural activities and plant growth. In most climates, the amount, frequency, and distribution of precipitation play a crucial role in assessing drought conditions and their impact on agricultural productivity (Guo et al. 2019).

Conclusion

Considering to spatial and temporal uncertainties of global gridded datasets based on single products, the combined datasets from six popular products significantly improved the accuracy of datasets compared to situ measurements. Moreover, the accuracy evaluation demonstrated the varying performance of single products under different temporal (month) and spatial (climate) conditions.

Drought prediction and monitoring plans often rely on information obtained based on different key variables used in drought indices. In the present study, agricultural drought indices were developed through SVR with different feature selection methods to determine the most relevant SCIs in different climates. The relative importance of variables based on reanalysis datasets for five single condition indices (i.e., PCI, TCI, PETCI, SMCI, and VCI) was analyzed and compared with ground-based measurements of SIs. The performance of the SVR feature-based methods was assessed by comparing their results with four combined condition indices, specifically SPEI-3 and SPEI-6. The findings revealed that SVR-ReliefF results are consistent with SPEI and the model outperforms other feature selection (FS) methods. The results showed the higher relative importance of PETCI in hyperarid climates compared to other SCIs. These results highlight the key role of the PET variable as a suitable predictor of agricultural drought in regions with very low precipitation. However, in AR, SA, and HU climates P was the most important variable in agricultural drought prediction.

Future works are recommended to evaluate two-stage feature selection methods based on spatial and temporal variations for selecting more relevant variables in agricultural drought prediction. Moreover, it is recommended to consider the lag time in different variables for assessing drought propagation from meteorological to agricultural.