Abstract
Conventional hydrological modeling is highly tedious for a data-sparse tropical river basin heavily loaded by hydraulic structures. The application of promising machine learning algorithms in this context is not explored much. This study investigates the suitability of two machine learning approaches—random forest regression model (RF) and gradient boosting regression model (GBM) in predicting the hydrological drought indices—standardized streamflow index (SSI) and streamflow drought index (SDI) based on the predicted meteorological drought indices—standardized precipitation index (SPI) and standardized precipitation evapotranspiration index under the future climate Representative Concentration Pathway (RCP) scenarios RCP 4.5 and RCP 8.5. The study predicts long-term hydrological drought indices SSI and SDI from 2006 to 2099 based on global climate models projections. The future drought years were extracted using the run theory. The study reveals that the river basin was prone to hydrological drought occurrences 50–65% of the time during the historic period (1988–2005) and 32–70% during the future period (2006–2099). It is evident from the results that both RF and GBM models follow a similar pattern of dry events in the future, which implies the prediction to be accurate. The study recommends the RF model for the study area, considering the model performances.
Similar content being viewed by others
Availability of data and materials
All data generated or analyzed during this study are included in this published article.
References
Adede C, Oboko R, Wagacha P. W, Atzberger C (2019) A mixed model approach to vegetation condition prediction using artificial neural networks (ANN): case of Kenya’s operational drought monitoring. Remote Sensing 11(9):1099. ]https://doi.org/10.3390/rs11091099
Breiman L (2001) Random forests. Mach Learn 45:5–32. https://doi.org/10.1023/A:1010933404324
Chen J, Li M,Wang W (2012) Statistical uncertainty estimation using random forests and its application to drought forecast. Math Problems Eng 2012
da Silva JJC, Medeiros V, Garrozi C, Montenegro A, Gonçalves GE (2019) Random Forest techniques for spatial interpolation of evapotranspiration data from Brazilian’s Northeast. Comput Electron Agric 166:105017. https://doi.org/10.1016/j.compag.2019.105017
Dong-Dong Z, Wei H, Peng-Cheng Y, Tai-Chen F (2014) Research on drought in southwest China based on the theory of run and two-dimensional joint distribution theory. Acta Physica Sinica 63(23). https://doi.org/10.7498/aps.63.230204.
Drisya J, Kumar DS, Roshni T (2021) Hydrological drought assessment through streamflow forecasting using wavelet enabled artificial neural networks. Environ Dev Sustain 23(3):3653–3672
Edossa DC, Woyessa YE, Welderufael WA (2014) Analysis of droughts in the central region of South Africa and their association with SST anomalies. Int J Atmos Sci. https://doi.org/10.1155/2014/508953
Farahmand A, AghaKouchak A (2015) A generalized framework for deriving nonparametric standardized drought indicators. Adv Water Resour 76:140–145. https://doi.org/10.1016/j.advwatres.2014.11.012
Ferreira LB, da Cunha FF (2020) Multi-step ahead forecasting of daily reference evapotranspiration using deep learning. Comput Electron Agric 178:105728. https://doi.org/10.1016/j.compag.2020.105728
Fleig AK, Tallaksen LM, Hisdal H, Demuth S (2005) A global evaluation of streamflow drought characteristics. Hydrol Earth Syst Sci Discussions 2(6):2427–2464
Friedman JH (2002) Stochastic gradient boosting. Comput Stat Data Anal 38(4):367–378. https://doi.org/10.1016/S0167-9473(01)00065-2
Guerrero-Salazar PLA, Yevjevich VM (1975) Analysis of drought characteristics by the theory of runs. Doctoral dissertation, Colorado State University. Libraries
Heim RR Jr (2002) A review of twentieth-century drought indices used in the United States. Bull Am Meteor Soc 83(8):1149–1166. https://doi.org/10.1175/1520-0477-83.8.1149
Herbst PH, Bredenkamp D, Barker HMG (1966) A technique for the evaluation of drought from rainfall data. J Hydrol 4:264–272. https://doi.org/10.1016/0022-1694(66)90084-9
Hesami Afshar M, Sorman AU, Yilmaz MT (2016) Conditional copula-based spatial-temporal drought characteristics analysis—a case study over Turkey. Water 8(10):426. https://doi.org/10.3390/w8100426
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780. https://doi.org/10.1162/neco.1997.9.8.1735
Huang H, Liang Z, Li B, Wang D, Hu Y, Li Y (2019) Combination of multiple data-driven models for long-term monthly runoff predictions based on Bayesian model averaging. Water Resour Manage 33(9):3321–3338. https://doi.org/10.1007/s11269-019-02305-9
Jagadeesh P, Anupama C (2014) Statistical and trend analyses of rainfall: a case study of Bharathapuzha river basin, Kerala. India ISH J Hydraul Eng 20(2):119–132. https://doi.org/10.1080/09715010.2013.843280
Karin Kruup. Clearing the buzzwords in machine learning, May 2018
Kermen Ç, Gül O. (2018, September) Comparing two streamflow-based Drought Indices. In Water resources and wetlands. Eds. P. Gastescu, P. Bretcan. 4th International Conference (pp. 05–09).
Li X, Li Y, Chen A, Gao M, Slette IJ, Piao S (2019) The impact of the 2009/2010 drought on vegetation growth and terrestrial carbon balance in Southwest China. Agric for Meteorol 269:239–248. https://doi.org/10.1016/j.agrformet.2019.01.036
Liu XF, Wang SX, Zhou Y, Wang FT, Yang G, Liu WL (2016) Spatial analysis of meteorological drought return periods in China using copulas. Nat Hazards 80(1):367–388. https://doi.org/10.1007/s11069-015-1972-7
Lu J, Yan J, Li Y (2018) The temporal variation characteristics of drought in Yunnan-Guizhou area during 1960 to 2014 based on SPEI and run-length theory. J Zhejiang Univ (sci Ed) 45(3):363–372
Mardani M, Gong E, Cheng JY, Vasanawala SS, Zaharchuk G, Xing L, Pauly JM (2018) Deep generative adversarial neural networks for compressive sensing MRI. IEEE Trans Med Imaging 38(1):167–179. https://doi.org/10.1109/TMI.2018.2858752
McKee TB, Doesken NJ, Kleist J (1993) The relationship of drought frequency and duration to time scales. In: Proceedings of the 8th conference on applied climatology, vol 17, No. 22, pp 179–183
Mehr AD (2021) Drought classification using gradient boosting decision tree. Acta Geophys. https://doi.org/10.1007/s11600-021-00584-8
Merghadi A, Abderrahmane B, Tien Bui D (2018) Landslide susceptibility assessment at Mila Basin (Algeria): a comparative assessment of prediction capability of advanced machine learning methods. ISPRS Int J Geo Inf 7(7):268. https://doi.org/10.3390/ijgi7070268
Mishra AK, Singh VP (2011) Drought modeling–A review. J Hydrol 403(1–2):157–175. https://doi.org/10.1016/j.jhydrol.2011.03.049
Modarres R (2007) Streamflow drought time series forecasting. Stoch Env Res Risk Assess 21(3):223–233. https://doi.org/10.1007/s00477-006-0058-1
Naghibi SA, Ahmadi K, Daneshi A (2017) Application of support vector machine, random forest, and genetic algorithm optimized random forest models in groundwater potential mapping. Water Resour Manage 31(9):2761–2775. https://doi.org/10.1007/s11269-017-1660-3
Nalbantis I, Tsakiris G (2009) Assessment of hydrological drought revisited. Water Resour Manag 23(5):881–897. https://doi.org/10.1007/s11269-008-9305-1
Nay J, Burchfield E, Gilligan J (2018) A machine-learning approach to forecasting remotely sensed vegetation health. Int J Remote Sens 39(6):1800–1816. https://doi.org/10.1080/01431161.2017.1410296
Ozkaya A, Zerberg Y (2019) A 40-year analysis of the Hydrological Drought Index for the Tigris Basin. Turkey Water 11(4):657. https://doi.org/10.3390/w11040657
Rose J, Chithra NR (2020) Evaluation of temporal drought variation and projection in a tropical river basin of Kerala. J Water Clim Change 11(S1):115–132. https://doi.org/10.2166/wcc.2020.240
Salimi H, Asadi E, Darbandi S (2021) Meteorological and hydrological drought monitoring using several drought indices. Appl Water Sci 11(2):1–10
Shamshirband S, Hashemi S, Salimi H, Samadianfard S, Asadi E, Shadkani S, Chau KW (2020) Predicting standardized streamflow index for hydrological drought using machine learning models. Eng Appl Comput Fluid Mech 14(1):339–350. https://doi.org/10.1080/19942060.2020.1715844
Solomatine DP (2006) Data-driven modeling and computational intelligence methods in hydrology. Encyclop Hydrol Sci. https://doi.org/10.1002/0470848944.hsa021
Sundararajan K, Garg L, Srinivasan K, Bashir AK, Kaliappan J, Ganapathy GP, Meena T (2021) A contemporary review on drought modeling using machine learning approaches. CMES-Comput Model Eng Sci 128(2):447–487
Szalai S, Szinell CS, Zoboki J (2000) Drought monitoring in Hungary. Early Warn Syst Drought Prepared Drought Manag 57:182–199
Thompson NC, Greenewald K, Lee K, Manso GF (2020) The computational limits of deep learning. arXiv preprint arXiv:2007.05558.
Tyralis H, Papacharalampous G (2017) Variable selection in time series forecasting using random forests. Algorithms 10(4):114. https://doi.org/10.3390/a10040114
Varughese A, Hajilal MS, George B (2017) Analysis of historical climate change trends in Bharathapuzha River Basin, Kerala. India Nat Environ Pollut Technol 16(1):237–242
Vicente-Serrano SM, Beguería S, López-Moreno JI (2010) A multiscalar drought index sensitive to global warming: the standardized precipitation evapotranspiration index. J Clim 23(7):1696–1718. https://doi.org/10.1175/2009JCLI2909.1
Vicente-Serrano SM, Van der Schrier G, Beguería S, Azorin-Molina C, Lopez-Moreno JI (2015) Contribution of precipitation and reference evapotranspiration to drought indices under different climates. J Hydrol 526:42–54. https://doi.org/10.1016/j.jhydrol.2014.11.025
Wable PS, Jha MK, Shekhar A (2019) Comparison of drought indices in a semi-arid river basin of India. Water Resour Manage 33(1):75–102. https://doi.org/10.1007/s11269-018-2089-z
Wang W, Zhu Y, Xu R, Liu J (2015) Drought severity change in China during 1961–2012 indicated by SPI and SPEI. Nat Hazards 75(3):2437–2451. https://doi.org/10.1007/s11069-014-1436-5
Were K, Bui DT, Dick B, Singh BR (2015) A comparative assessment of support vector regression, artificial neural networks, and random forests for predicting and mapping soil organic carbon stocks across an Afromontane landscape. Ecol Ind 52:394–403. https://doi.org/10.1016/j.ecolind.2014.12.028
Wilhite DA (2000) Drought as a natural hazard: concepts and definitions. In: Wilhite DA (ed) Hazards disasters ser. Routledge
Wilhite DA, Glantz MH (1985) Understanding: the drought phenomenon: the role of definitions. Water International 10(3):111–120. https://doi.org/10.1080/02508068508686328
Wu J, Long J, Liu M (2015) Evolving RBF neural networks for rainfall prediction using hybrid particle swarm optimization and genetic algorithm. Neurocomputing 148:136–142. https://doi.org/10.1016/j.neucom.2012.10.043
Yevjevich VM (1967) Objective approach to definitions and investigations of continental hydrologic droughts, An (Doctoral dissertation. Colorado State University, Libraries)
Yisehak B, Zenebe A (2021) Modeling multivariate standardized drought index based on the drought information from precipitation and runoff: a case study of Hare watershed of Southern Ethiopian Rift Valley Basin. Modeling Earth Systems and Environment 7(2):1005–1017. https://doi.org/10.1007/s42452-020-3097-1
Zhang DD, Yan DH, Lu F, Wang YC, Feng J (2015) Copula-based risk assessment of drought in Yunnan province. China Natural Hazards 75(3):2199–2220. https://doi.org/10.1007/s11069-014-1419-6
Zhu S, Xu Z, Luo X, Liu X, Wang R, Zhang M, Huo Z (2021) Internal and external coupling of Gaussian mixture model and deep recurrent network for probabilistic drought forecasting. Int J Environ Sci Technol 18(5):1221–1236
Zucchini W, Adamson P. T. (1984). The occurrence and severity of droughts in South Africa. Report 91/1/84 and Appendix. Water Research Commission, Pretoria
Acknowledgements
The authors would like to express gratitude to the anonymous reviewers, editor, and associate editor of the journal. The first author would like to thank the Ministry of Education, India, for supporting the work in the form of a Ph.D. fellowship. We also thank the Indian Meteorological Department (IMD), Pune, for providing rainfall and temperature data, and Central Water Commission of India for providing the streamflow data.
Author information
Authors and Affiliations
Contributions
M.A.J.R. was involved in the data collection, investigation, methodology, formal analysis, model development, software, and writing—original draft; C.N.R. contributed to the investigation, supervision, and writing—review and editing, data curation, and validation.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Ethical approval
This article does not contain any studies with human participants or animals performed by any of the authors.
Consent for publication
The manuscript is original. It has not been published previously by any of the authors and is even not under the consideration in any other journal at the time of submission.
Additional information
Editorial responsibility: Samareh Mirkia.
Rights and permissions
About this article
Cite this article
Rose, M.A.J., Chithra, N.R. Tree-based ensemble model prediction for hydrological drought in a tropical river basin of India. Int. J. Environ. Sci. Technol. 20, 4973–4990 (2023). https://doi.org/10.1007/s13762-022-04208-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13762-022-04208-6