Skip to main content
Log in

Predictive modeling of water quality index (WQI) classes in Indian rivers: Insights from the application of multiple Machine Learning (ML) models on a decennial dataset

  • ORIGINAL PAPER
  • Published:
Stochastic Environmental Research and Risk Assessment Aims and scope Submit manuscript

Abstract

Despite high pollution levels in Indian rivers, a comprehensive study on the water quality index (WQI) remains elusive. WQI values were computed, and their classes were determined using six water quality parameters from an available decennial dataset (n = 3595) on Indian Rivers. This study aims to assess the spatial distribution of WQI values and their classes across Indian River systems while exploring the application of machine learning (ML) based models in predicting WQI classes using a reduced number of input parameters.

Modeling experiments were designed on five models- Decision Tree (DT), Random Forest (RF), Gradient Boosted Trees (GBT), Artificial Neural Network (ANN), and Support Vector Machine (SVM) for predicting WQI classes. Each model was trained with input parameters and WQI classes with 2990 datasets. Testing of WQI classes by each model was made on 605 datasets under different framework sets. Models’ performance metrics were evaluated by accuracy, weighted mean recall and precision, and F-score.

Our study demonstrates that the two largest systems, Ganga and Brahmaputra, lie on the extremes of the WQI (mean) spectrum, reflecting the impact of contrasting population density, industrial activities, change in land-use-land-cover pattern, and agricultural use on the riverine WQI. Our modeling experiments underscore that with only three input parameters, GBT can predict WQI classes with > 80% of performance metrics. With only two input parameters, GBT, RF, and ANN, all can provide reliable estimates. Our study highlights that ML models can serve as decision-supporting tools for water resource policymakers and managers in making effective pollution control and water resource management decisions.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig.3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Data availability

All data were shown in the main manuscript and supplementary information.

References

Download references

Acknowledgements

We acknowledge Pandit Deendayal Energy University for the extended support provided to carry out the research in a smooth manner. We thank Yuvraj Singh Jadon (Lloyds Banking Group, Cardiff, UK) for his timely support in understanding ML-related works to a better extent. The authors thank the two anonymous reviewers for their constructive comments, which helped in improving the clarity of the manuscript.

Funding

Not applicable.

Author information

Authors and Affiliations

Authors

Contributions

Study conception and design by AD and SS. Machine learning algorithms were performed by SS and PS. Modeling experiments were performed by SS. Manuscript written by SS and AD.

Corresponding author

Correspondence to Anirban Das.

Ethics declarations

Consent to participate

Not applicable.

Consent to publish

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 1295 KB)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Singh, S., Das, A. & Sharma, P. Predictive modeling of water quality index (WQI) classes in Indian rivers: Insights from the application of multiple Machine Learning (ML) models on a decennial dataset. Stoch Environ Res Risk Assess (2024). https://doi.org/10.1007/s00477-024-02741-z

Download citation

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s00477-024-02741-z

Keywords

Navigation