Skip to main content

Advertisement

Log in

Machine Learning-Based Rainfall Forecasting with Multiple Non-Linear Feature Selection Algorithms

  • Published:
Water Resources Management Aims and scope Submit manuscript

Abstract

The present research examined the potential of two important feature selection methods, Bayesian Networks (BN) and Recursive Feature Elimination (RFE), in identifying the optimum predictors for forecasting rainfall in the Godavari Basin in India. Initially, a set of ‘probable hydro-climatological variables’ is chosen based on previous studies. Following a correlation analysis between these probable predictors and monthly rainfall, the most correlated contiguous zones were chosen as ‘potential predictors' which were subsequently used as inputs to the two feature selection algorithms, BN and RFE for selecting the ‘optimum predictors’. The optimum predictions were further utilised to develop seven state-of-the-art Machine Learning (ML) models, including Support Vector Regression (SVR), Gaussian Process Regression (GPR), Multivariate Adaptive Regression Splines (MARS), Random Forest (RF), Parallel Multi-Population Genetic Programming (PMPGP (5 demes (Case 1), 9 demes (Case 2), and 17 demes (Case 3)). The result of the correlation analysis revealed that the domain for Relative Humidity, Total Precipitable Water should be considered over the study area, and for U-Wind, V-Wind and Surface Pressure, the zones around Indian Ocean, Arabian Sea and Persian Gulf should be considered respectively. BN, as a feature selection technique for choosing the optimum predictors, was found to be more effective than RFE. In terms of prediction models, GPR and PMPGP models outperformed others, both when used alone and in conjunction with feature selection methods. The R2 values for GPR models vary from 0.82–0.41, whereas the same varies from 0.81–0.31 for PMPGP models.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Availability of Data and Materials

The Copernicus Climate Change Service (C3S) Climate Data Store (CDS) provided the ERA 5 Reanalysis datasets. The Indian Meteorological Department (IMD), Pune, provided the gridded rainfall data. All codes that support the findings of this study are available from the corresponding author upon reasonable request.

References

Download references

Acknowledgements

The authors would like to extend their gratitude to the Department of Science and Technology (DST). The authors would also like to thank the European Centre for Medium-Range Weather Forecasts (ECMWF) and the Indian Metrological Department (IMD) for making the necessary datasets available publicly for academic research. The authors would also like to thank the anonymous reviewers for their constructive comments in improving the quality of the paper.

Funding

This work was funded by the Department of Science and Technology (DST), India through project number ECR/2017/001880.

Author information

Authors and Affiliations

Authors

Contributions

Prabal Das: Data Collection, Formal analysis, Investigation, Writing—original draft. D. A. Sachindra: Conceptualisation, Formal analysis, Investigation, Writing – Review and editing. Kironmala Chanda: Conceptualisation, Supervision, Investigation, Funding acquisition, Writing – Review and editing.

Corresponding author

Correspondence to Kironmala Chanda.

Ethics declarations

Ethical Approval

The authors ensure that this article has not been published elsewhere and that there has been no plagiarism.

Consent of Participate

The authors agree to participate in the journal.

Consent to Publish

The authors have agreed to publish in this journal.

Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper. India for funding this research.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 520 KB)

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Das, P., Sachindra, D.A. & Chanda, K. Machine Learning-Based Rainfall Forecasting with Multiple Non-Linear Feature Selection Algorithms. Water Resour Manage 36, 6043–6071 (2022). https://doi.org/10.1007/s11269-022-03341-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11269-022-03341-8

Keywords

Navigation