Skip to main content

Advertisement

Log in

Prediction of meteorological drought and standardized precipitation index based on the random forest (RF), random tree (RT), and Gaussian process regression (GPR) models

  • Research Article
  • Published:
Environmental Science and Pollution Research Aims and scope Submit manuscript

Abstract

Agriculture, meteorological, and hydrological drought is a natural hazard which affects ecosystems in the central India of Maharashtra state. Due to limited historical data for drought monitoring and forecasting available in the central India of Maharashtra state, implementing machine learning (ML) algorithms could allow for the prediction of future drought events. In this paper, we have focused on the prediction accuracy of meteorological drought in the semi-arid region based on the standardized precipitation index (SPI) using the random forest (RF), random tree (RT), and Gaussian process regression (GPR-PUK kernel) models. A different combination of machine learning models and variables has been performed for the forecasting of metrological drought based on the SPI-6 and 12 months. Models were developed using monthly rainfall data for the period of 2000–2019 at two meteorological stations, namely, Karanjali and Gangawdi, each representing a geographical region of Upper Godavari river basin area in the central India of Maharashtra state which frequently experiences droughts. Historical data from the SPI from 2000 to 2013 was processed to train the model into machine learning model, and the rest of the 2014 to 2019-year data were used for testing to forecast the SPI and metrological drought. The mean square error (MSE), root mean square error (RMSE), adjusted R2, Mallows’ (Cp), Akaike’s (AIC), Schwarz’s (SBC), and Amemiya’s PC were used to identify the best combination input model and best subregression analysis for both stations of SPI-6 and 12. The correlation coefficient (\(r\)), mean absolute error (MAE), root mean square error (RMSE), relative absolute error (RAE), and root relative squared error (RRSE) were used to perform evaluation for SPI-6 and 12 months of both stations with RF, RT, and GPR-PUK kernel models during the training and testing scenarios. The results during testing phase revealed that the RF was found as the best model in forecasting droughts with values of \(r\), MAE, RMSE, RAE (%), and RRSE (%) being 0.856, 0.551, 0.718, 74.778, and 54.019, respectively, for SPI-6 while 0.961, 0.361, 0.538, 34.926, and 28.262, respectively, for SPI-12 scales at Gangawdi station. Further, the respective values of evaluators at Karanjali station were 0.913 and 0.966, 0.541 and 0.386, 0.604 and 0.589, 52.592 and 36.959, and 42.315 and 31.394 for PUK kernel and RT models, respectively, during SPI-6 and SPI-12. Machine learning models are potential drought warning techniques because they take less time, have fewer inputs, and are less sophisticated than dynamic or scientific models.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Data availability and materials

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Abbreviations

RF:

Random forest

RT:

Random tree

GPR:

Gaussian process regression

SPI:

Standardized precipitation index

MSE:

Mean square error

RMSE:

Root mean square error

RAE:

Relative absolute error

RRSE:

Root relative squared error

IPCC:

Intergovernmental Panel on Climate Change

EDI:

Effective drought index

SPEI:

Standardized precipitation evapotranspiration index

SPI:

Standardized precipitation index

PDSI:

Palmer drought severity index

PDN:

Percent departure from normal

VCI:

Vegetation condition index

RDI:

Reconnaissance drought index

ML:

Machine learning

DML:

Deep machine learning

ANNs:

Artificial neural networks

ANFIS:

Adaptive neuro-fuzzy inference system

WANN:

Wavelet-based artificial neural system

SVM:

Support vector machine

M5P:

M5 pruning tree

MARS:

Multivariate adaptive regression splines

REPTree:

Error pruning tree

RSS:

Random subspace

MLP:

Multilayer perceptron

GEP:

Gene expression programming

ELM:

Extreme learning machine

ARIMA-ANN:

Autoregressive integrated moving average-artificial neural network

WANFIS:

Wavelet-based artificial-fuzzy inference system

GIS:

Geographic information system

R 2 :

Coefficient of determination

References

Download references

Acknowledgements

Not applicable.

Funding

Not applicable.

Author information

Authors and Affiliations

Authors

Contributions

Ahmed Elbeltagi: development of ML models, formal analysis, and writing review and editing. Chaitanya B. Pande: original draft writing, discussion section, formal analysis, methodology, supervision, data collection and analysis for modeling purpose, processing of data, revision of paper, major work completed, response to all reviewer’s comments, main contribution in revision of paper, writing review and editing, investigation. Manish Kumar: writing results and discussion section and creating the Taylor diagrams and analysis. Abebe Debele Tolche: original draft writing, writing review, and editing.

Sudhir Kumar Singh, Akshay Kumar, and Dinesh Kumar Vishwakarma: writing review and editing.

Corresponding author

Correspondence to Dinesh Kumar Vishwakarma.

Ethics declarations

Ethics approval

Not applicable.

Consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Responsible Editor: Philippe Garrigues

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Elbeltagi, A., Pande, C.B., Kumar, M. et al. Prediction of meteorological drought and standardized precipitation index based on the random forest (RF), random tree (RT), and Gaussian process regression (GPR) models. Environ Sci Pollut Res 30, 43183–43202 (2023). https://doi.org/10.1007/s11356-023-25221-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11356-023-25221-3

Keywords

Navigation