Skip to main content

Advertisement

Log in

Combination of data-driven models and best subset regression for predicting the standardized precipitation index (SPI) at the Upper Godavari Basin in India

  • Original Paper
  • Published:
Theoretical and Applied Climatology Aims and scope Submit manuscript

Abstract

Standardized precipitation index prediction and monitoring are essential to mitigating the effect of drought actions on precision farming, environments, climate-smart agriculture, and the water cycle. In this study, four data-driven models, additive regression, random subspace, M5Pruned (M5P), and bagging tree models, were adopted to predict the standardized precipitation index (SPI) at the Upper Godavari Basin for various periods (3 months, 6 months, and 12 months). The data-driven models’ input data was pre-processed with machine learning models to increase quality and the model’s performance a priori. These four models predicted the SPI-3, SPI-6, and SPI-12 months based on three metrological station data. Based on the statistical performance metrics such as correlation coefficient (R), root mean square error (RMSE), mean absolute error (MAE), relative absolute error (RAE), and root relative squared error (RRSE), our findings showed that the bagging was the best model for predicting SPI-3 and SPI-6 while the M5P the best for SPI-12 estimation in station 1, while in stations 2 and 3, M5P was superlative in predicting the SPI-3 and SPI-12 months, and the bagging was best in SPI-6. All the best models had acceptable mid-term drought forecasting based on the SPI-3, SPI-6, and SPI-12 months for three stations in the Upper Godavari Basin in India. The machine learning models created in this study produced satisfactory results in short-term and mid-term drought forecasting, and it will be a new strategy for water developers and planners to use for future management and scheduling.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

Data availability

The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.

References

  • Adarsh S, Janga Reddy M (2019) Evaluation of trends and predictability of short-term droughts in three meteorological subdivisions of India using multivariate EMD-based hybrid modelling. Hydrol Process 33:130–143

    Google Scholar 

  • Aghelpour P, Bahrami-Pichaghchi H, Kisi O (2020) Comparison of three different bio-inspired algorithms to improve ability of neuro fuzzy approach in prediction of agricultural drought, based on three different indexes. Comput Electron Agric 170:105279

    Google Scholar 

  • Ali M, Deo RC, Downs NJ, Maraseni T (2018) An ensemble-ANFIS based uncertainty assessment model for forecasting multi-scalar standardized precipitation index. Atmospheric Res 207:155–180

    Google Scholar 

  • Aragão LE, Anderson LO, Fonseca MG, Rosan TM, Vedovato LB, Wagner FH, Silva CV, Junior CHS, Arai E, Aguiar AP (2018) 21st century drought-related fires counteract the decline of Amazon deforestation carbon emissions. Nat Commun 9:1–12

    Google Scholar 

  • Bahrami M, Bazrkar S, Zarei AR (2019) Modeling, prediction and trend assessment of drought in Iran using standardized precipitation index. J Water Clim Change 10:181–196

    Google Scholar 

  • Barzkar A, Najafzadeh M, Homaei F (2022) Evaluation of drought events in various climatic conditions using data-driven models and a reliability-based probabilistic model. Nat Hazards 110:1931–1952

    Google Scholar 

  • Belayneh A, Adamowski J (2012) Standard precipitation index drought forecasting using neural networks, wavelet neural networks, and support vector regression. Appl Comput Intell Soft Comput Article ID 794061:13. https://doi.org/10.1155/2012/794061

  • Belayneh A, Adamowski J, Khalil B (2016) Short-term SPI drought forecasting in the Awash River Basin in Ethiopia using wavelet transforms and machine learning methods. Sustain Water Resour Manag 2:87–101. https://doi.org/10.1007/s40899-015-0040-5

    Article  Google Scholar 

  • Breiman L (1996) Bagging predictors. Mach Learn 24:123–140

    Google Scholar 

  • Carrão H, Russo S, Sepulcre-Canto G, Barbosa P (2016) An empirical standardized soil moisture index for agricultural drought assessment from remotely sensed data. Int J Appl Earth Obs Geoinformation 48:74–84

    Google Scholar 

  • Choubin B, Malekian A, Golshan M (2016) Application of several data-driven techniques to predict a standardized precipitation index. Atmósfera 29:121–128

    Google Scholar 

  • Dai F, Lee C, Ngai YY (2002) Landslide risk assessment and management: an overview. Eng Geol 64:65–87

    Google Scholar 

  • Deo RC, Kisi O, Singh VP (2017) Drought forecasting in eastern Australia using multivariate adaptive regression spline, least square support vector machine and M5Tree model. Atmos Res 184:149–175. https://doi.org/10.1016/j.atmosres.2016.10.004

    Article  Google Scholar 

  • Dice J, Rodziewicz D (2020) Drought risk to the agriculture sector, federal reserve Bank of Kansas City, Econ Rev 105(2):61–86

  • Ditthakit P, Pinthong S, Salaeh N et al (2021) Using machine learning methods for supporting GR2M model in runoff estimation in an ungauged basin. Sci Rep 11:19955. https://doi.org/10.1038/s41598-021-99164-5

    Article  Google Scholar 

  • Domenikiotis C, Spiliotopoulos M, Tsiros E, Dalezios N (2004) Early cotton production assessment in Greece based on a combination of the drought Vegetation Condition Index (VCI) and the Bhalme and Mooley Drought Index (BMDI). Int J Remote Sens 25:5373–5388

    Google Scholar 

  • Duan K, Sun G, Caldwell PV, McNulty SG, Zhang Y (2018) Implications of upstream flow availability for watershed surface water supply across the conterminous United States. JAWRA J Am Water Resour Assoc 54:694–707

    Google Scholar 

  • Ebrahimpour M, Rahimi J, Nikkhah A, Bazrafshan J (2015) Monitoring agricultural drought using the standardized effective precipitation index. J Irrig Drain Eng 141:04014044

    Google Scholar 

  • Elbeltagi A, Kumar M, Kushwaha NL et al (2023a) Drought indicator analysis and forecasting using data driven models: case study in Jaisalmer. India Stoch Environ Res Risk Assess 37:113–131. https://doi.org/10.1007/s00477-022-02277-0

    Article  Google Scholar 

  • Elbeltagi A, Pande CB, Kumar M et al (2023b) Prediction of meteorological drought and standardized precipitation index based on the random forest (RF), random tree (RT), and Gaussian process regression (GPR) models. Environ Sci Pollut Res. https://doi.org/10.1007/s11356-023-25221-3

    Article  Google Scholar 

  • Finn JA, Suter M, Haughey E, Hofer D, Lüscher A (2018) Greater gains in annual yields from increased plant diversity than losses from experimental drought in two temperate grasslands. Agric Ecosyst Environ 258:149–153

    Google Scholar 

  • Hänsel S, Schucknecht A, Matschullat J (2016) The Modified Rainfall Anomaly Index (mRAI)—is this an alternative to the Standardised Precipitation Index (SPI) in evaluating future extreme precipitation characteristics? Theor Appl Climatol 123:827–844

    Google Scholar 

  • Heddam S, Kisi O (2018) Modelling daily dissolved oxygen concentration using least square support vector machine, multivariate adaptive regression splines and M5 model tree. J Hydrol 559:499–509

    Google Scholar 

  • Ho TK (1998) The random subspace method for constructing decision forests. IEEE Trans Pattern Anal Mach Intell 20:832–844

    Google Scholar 

  • Hubbard KG, Wu H (2005) Modification of a crop-specific drought index for simulating corn yield in wet years. Agron J 97:1478–1484

    Google Scholar 

  • Ibrahimi A, Baali A (2018) Application of several artificial intelligence models for forecasting meteorological drought using the standardized precipitation index in the Saiss Plain (Northern Morocco). Int J Intell Eng Syst 11:267–275

    Google Scholar 

  • Jang SH, Lee J-K, Oh JH, Jo JW, Cho Y (2017) The probabilistic drought forecast based on the ensemble technique using the Korean surface water supply index. Nat Hazards Earth Syst Sci Discuss 1–51

  • Juhasz T, Kornfield J (1978) The Crop Moisture Index: unnatural response to changes in temperature. J Appl Meteorol 17:1864–1866

    Google Scholar 

  • Khosravi K, Cooper JR, Daggupati P, Pham BT, Bui DT (2020) Bedload transport rate prediction: Application of novel hybrid data mining techniques. J Hydrol 585:124774

    Google Scholar 

  • Komasi M, Sharghi S, Safavi HR (2018) Wavelet and cuckoo search-support vector machine conjugation for drought forecasting using Standardized Precipitation Index (case study: Urmia Lake, Iran). J Hydroinformatics 20:975–988

    Google Scholar 

  • Kumar Gautam V, Pande CB, Kothari M et al (2022) Exploration of groundwater potential zones mapping for hard rock region in the Jakham river basin using geospatial techniques and aquifer parameters. Adv Space Res. https://doi.org/10.1016/j.asr.2022.11.022

    Article  Google Scholar 

  • Liu C, Yang C, Yang Q, Wang J (2021) Spatiotemporal drought analysis by the standardized precipitation index (SPI) and standardized precipitation evapotranspiration index (SPEI) in Sichuan Province. China Sci Rep 11:1–14

    Google Scholar 

  • Lopez-Nicolas A, Pulido-Velazquez M, Macian-Sorribes H (2017) Economic risk assessment of drought impacts on irrigated agriculture. J Hydrol 550:580–589

    Google Scholar 

  • Malik A, Tikhamarine Y, Sammen SS et al (2021a) Prediction of meteorological drought by using hybrid support vector regression optimized with HHO versus PSO algorithms. Environ Sci Pollut Res 28:39139–39158. https://doi.org/10.1007/s11356-021-13445-0

    Article  Google Scholar 

  • Malik A, Tikhamarine Y, Souag-Gamane D et al (2021b) Support vector regression integrated with novel meta-heuristic algorithms for meteorological drought prediction. Meteorol Atmos Phys 133:891–909. https://doi.org/10.1007/s00703-021-00787-0

    Article  Google Scholar 

  • Mehdizadeh S, Ahmadi F, DanandehMehr A, Safari MJS (2020) Drought modeling using classic time series and hybrid wavelet-gene expression programming models. J Hydrol 587:125017

    Google Scholar 

  • Meyer SJ, Hubbard KG, Wilhite DA (1993) A crop-specific drought index for corn: I Model development and validation. Agron J 85:388–395

    Google Scholar 

  • Mohamadi S, Sammen SS, Panahi F et al (2020) Zoning map for drought prediction using integrated machine learning models with a nomadic people optimization algorithm. Nat Hazards 104:537–579. https://doi.org/10.1007/s11069-020-04180-9

    Article  Google Scholar 

  • Mokhtarzad M, Eskandari F, Vanjani NJ, Arabasadi A (2017) Drought forecasting by ANN, ANFIS, and SVM and comparison of the models. Environ Earth Sci 76:1–10

    Google Scholar 

  • Moron V (1994) Guinean and Sahelian rainfall anomaly indices at annual and monthly scales (1933–1990). Int J Climatol 14:325–341

    Google Scholar 

  • Nguyen LB, Li QF, Ngoc TA, Hiramatsu K (2015) Adaptive neuro-fuzzy inference system for drought forecasting in the cai river basin in Vietnam. J Fac Agric Kyushu Univ 60:405–415

    Google Scholar 

  • Ntale HK, Gan TY (2003) Drought indices and their application to East Africa. Int J Climatol J R Meteorol Soc 23:1335–1357

    Google Scholar 

  • Orimoloye IR, Olusola AO, Belle JA et al (2022) Drought disaster monitoring and land use dynamics: identification of drought drivers using regression-based algorithms. Nat Hazards 112:1085–1106. https://doi.org/10.1007/s11069-022-05219-9

    Article  Google Scholar 

  • Pande CB, Al-Ansari N, Kushwaha NL, Srivastava A, Noor R, Kumar M, Moharir KN, Elbeltagi A (2022) Forecasting of SPI and meteorological drought based on the artificial neural network and M5P model tree land. 11(11):2040. https://doi.org/10.3390/land11112040

  • Pande CB, Kushwaha NL, Orimoloye IR et al (2023a) Comparative assessment of improved SVM method under different kernel functions for predicting multi-scale drought index. Water Resour Manage 37:1367–1399. https://doi.org/10.1007/s11269-023-03440-0

  • Pande CB, Kadam SA, Rajesh J, Gorantiwar SD, Shinde MG (2023b) Predication of sugarcane yield in the semi-arid region based on the sentinel-2 data using vegetation’s indices and mathematical modeling. In: Pande CB, Moharir KN, Singh SK, Pham QB, Elbeltagi A (eds). Climate change impacts on natural resources, ecosystems and agricultural systems. Springer Climate. Springer, Cham. https://doi.org/10.1007/978-3-031-19059-9_12

  • Pande CB, Moharir KN (2023c) Application of hyperspectral remote sensing role in precision farming and sustainable agriculture under climate change: A review. In: Pande CB, Moharir KN, Singh SK, Pham QB, Elbeltagi A (eds). Climate change impacts on natural resources, ecosystems and agricultural systems. Springer Climate. Springer, Cham. https://doi.org/10.1007/978-3-031-19059-9_21

  • Pande CB, Moharir KN, Varade A (2023d) Water conservation structure as an unconventional method for improving sustainable use of irrigation water for soybean crop under rainfed climate condition. In: Pande CB, Moharir KN, Singh SK, Pham QB, Elbeltagi A (eds). Climate change impacts on natural resources, ecosystems and agricultural systems. springer climate. Springer, Cham. https://doi.org/10.1007/978-3-031-19059-9_28

  • Peng-cheng Q, Min L, Lan L (2016) Application of effective precipitation index in rainstorm flood disaster monitoring and assessment. Chin J Agrometeorol 37:84

    Google Scholar 

  • Pham BT, Phong TV, Nguyen-Thoi T, Parial K, Singh SK, Ly H-B, Nguyen KT, Ho LS, Le HV, Prakash I (2020) Ensemble modeling of landslide susceptibility using random subspace learner and different decision tree classifiers. Geocarto Int 37(3):735–757. https://doi.org/10.1080/10106049.2020.1737972

  • Poornima S, Pushpalatha M (2019) Drought prediction based on SPI and SPEI with varying timescales using LSTM recurrent neural network. Soft Comput 23:8399–8412

    Google Scholar 

  • Pramudya Y, Komariah, Dewi WS, Sumani, Mujiyo, Sukoco T A and Rozaki Z (2016) Remote sensing for estimating agricultural land use change as the impact of climate change (Proc of SPIE) 9877:987720–1.

  • Roodposhti MS, Safarrad T, Shahabi H (2017) Drought sensitivity mapping using two one-class support vector machine algorithms. Atmos Res 193:73–82

    Google Scholar 

  • Sattari MT, Sureh FS (2019) Drought prediction based on standardized precipitation- evapotranspiration index by using M5 tree model. Int Civil Eng Archit Conf 1–14

  • Shamshirband S, Hashemi S, Salimi H, Samadianfard S, Asadi E, Shadkani S, Kargar K, Mosavi A, Nabipour N, Chau K-W (2020) Predicting standardized streamflow index for hydrological drought using machine learning models. Eng Appl Comput Fluid Mech 14:339–350

    Google Scholar 

  • Shelar RS et al (2022) Sub-watershed prioritization of Koyna river basin in India using multi criteria analytical hierarchical process, remote sensing and GIS techniques. Phys Chem Earth 128:103219. https://doi.org/10.1016/j.pce.2022.103219

    Article  Google Scholar 

  • Soh Y, Koo C, Huang Y, Fung K (2018) Application of artificial intelligence models for the prediction of standardized precipitation evapotranspiration index (SPEI) at Langat River Basin. Malaysia Comput Electron Agric 144:164–173

    Google Scholar 

  • Sohrabi MM, Ryu JH, Abatzoglou J, Tracy J (2015) Development of soil moisture drought index to characterize droughts. J Hydrol Eng 20:04015025

    Google Scholar 

  • Spennemann PC, Rivera JA, Saulo AC, Penalba OC (2015) A comparison of GLDAS soil moisture anomalies against standardized precipitation index and multisatellite estimations over South America. J Hydrometeorol 16:158–171

    Google Scholar 

  • Stone CJ (1985) Additive regression and other nonparametric models. Ann Stat 13:689–705

  • Tan CP, Yang JP, Li M (2015) Temporal-spatial variation of drought indicated by SPI and SPEI in Ningxia Hui autonomous region. China Atmos 6(10):1399–1421

    Google Scholar 

  • Tong S, Lai Q, Zhang J, Bao Y, Lusi A, Ma Q, Li X, Zhang F (2018) Spatiotemporal drought variability on the Mongolian Plateau from 1980–2014 based on the SPEI-PM, intensity analysis and Hurst exponent. Sci Total Environ 615:1557–1565

    Google Scholar 

  • Vicente-Serrano SM, Beguería S, López-Moreno JI (2010) A multiscalar drought index sensitive to global warming: the standardized precipitation evapotranspiration index. J Climate 23:1696–1718

    Google Scholar 

  • Wang X, Jiang D, Lang X (2017) Future extreme climate changes linked to global warming intensity. Sci Bull 62:1673–1680

    Google Scholar 

  • Webber H, Ewert F, Olesen JE, Müller C, Fronzek S, Ruane AC, Bourgault M, Martre P, Ababaei B, Bindi M (2018) Diverging importance of drought stress for maize and winter wheat in Europe. Nat Commun 9:1–10

    Google Scholar 

  • Wu H, Hayes MJ, Wilhite DA, Svoboda MD (2005) The effect of the length of record on the standardized precipitation index calculation. Int J Climatol J r Meteorol Soc 25:505–520

    Google Scholar 

  • Xu B, Lin B (2015) Factors affecting carbon dioxide (CO2) emissions in China’s transport sector: a dynamic nonparametric additive regression model. J Clean Prod 101:311–322

    Google Scholar 

  • Xu L, Abbaszadeh P, Moradkhani H, Chen N, Zhang X (2020) Continental drought monitoring using satellite soil moisture, data assimilation and an integrated drought index. Remote Sens Environ 250:112028

    Google Scholar 

  • Yang Y, Zhang S, Roderick ML, McVicar TR, Yang D, Liu W, Li X (2020) Comparing Palmer Drought Severity Index drought assessments using the traditional offline approach with direct climate model outputs. Hydrol Earth Syst Sci 24:2921–2930

    Google Scholar 

  • Yariyan P, Janizadeh S, Phong TV, Nguyen HD, Costache R, Le HV, Pham BT, Pradhan B, Tiefenbacher JP (2020) Improvement of best first decision trees using bagging and dagging ensembles for flood-risk mapping. Water Resour Manag. https://doi.org/10.1007/s11269-020-02603-7

    Article  Google Scholar 

  • Yaseen ZM, Ali M, Sharafati A, Al-Ansari N, Shahid S (2021) Forecasting standardized precipitation index using data intelligence models: regional investigation of Bangladesh. Sci Rep 11:1–25

    Google Scholar 

  • Yu H, Zhang Q, Xu C-Y, Du J, Sun P, Hu P (2019) Modified palmer drought severity index: model improvement and application. Environ Int 130:104951

    Google Scholar 

  • Zhan C, Gan A, Hadi M (2011) Prediction of lane clearance time of freeway incidents using the M5P tree algorithm. IEEE Trans Intell Transp Syst 12:1549–1557

    Google Scholar 

Download references

Acknowledgements

Thanks to the NASA POWER, Prediction of Worldwide Energy Resources (https://power.larc.nasa.gov/), for providing the data needed in this research

Author information

Authors and Affiliations

Authors

Contributions

Ahmed Elbeltagi and Chaitanya B. Pande had the original idea of the research. Chaitanya B. Pande: Conceptualization, Development of Methodology, Formal analysis, Original writing and drafting, Writing—review and editing. Ahmed Elbeltagi: Conceptualization, Formal analysis, Software, Writing—review and editing. Romulus Costache, Saad Sh. Sammen, Rabeea Noor: Original writing and drafting, Writing—review and editing. All authors approved the final version for submission.

Corresponding author

Correspondence to Ahmed Elbeltagi.

Ethics declarations

Ethics approval

The authors confirm that this article is original research and has not been published or presented previously in any journal or conference in any language (in whole or in part).

Consent to participate and consent to publish

The authors declare that they have consent to participate and consent to publish.

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Pande, C.B., Costache, R., Sammen, S.S. et al. Combination of data-driven models and best subset regression for predicting the standardized precipitation index (SPI) at the Upper Godavari Basin in India. Theor Appl Climatol 152, 535–558 (2023). https://doi.org/10.1007/s00704-023-04426-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00704-023-04426-z

Navigation