Skip to main content
Log in

Harnessing machine learning tools for water quality assessment in the Kebili shallow aquifers, Southwestern Tunisia

  • Original Article
  • Published:
Acta Geochimica Aims and scope Submit manuscript

Abstract

An integrated method that implements multivariate statistical analysis and ML methods to evaluate groundwater quality of the shallow aquifers of the Djerid and Kebili district, Southern Tunisia, was adopted. An evaluation of their suitability for irrigation and/or drinking purposes is necessary. A comprehensive hydrochemical assessment of 52 samples with entropy weighted water quality index (EWQI) was also proposed. Eleven water parameters were calculated to ascertain the potential use of those resources in irrigation and drinking. Multivariate analysis showed two main components with Dim1 (variance = 62.3%) and Dim.2 (variance = 22%), due to the bicarbonate, dissolution, and evaporation and the intrusion of drainage water. The evaluation of water quality has been carried out using EWQI model. The calculated EWQI for the Djerid and Kebili waters (i.e., 52 samples) varied between 7.5 and 152.62, indicating a range of 145.12. A mean of 79.12 was lower than the median (88.47). From the calculation of EWQI, only 14 samples are not suitable for irrigation because of their poor to extremely poor quality (26.92%). The bivariate plot showed high correlation for EWQI ~ TH (r = 0.93), EWQI ~ SAR(r = 0.87), indicating that water quality depended on those parameters. Different ML algorithms were successfully applied for the water quality classification. Our results indicated high prediction accuracy (SVM > LDA > ANN > kNN) and perfect classification for kNN, LDA and Naive Bayes. For the purposes of developing the prediction models, the dataset was divided into two groups: training (80%) and testing (20%). To evaluate the models’ performance, RMSE, MSE, MAE and R2 metrics were used. kNN (R2 = 0.9359, MAE = 6.49, MSE = 79.00) and LDA (accuracy = 97.56%; kappa = 96.21%) achieved high accuracy. Moreover, linear regression indicated high correlation for both training (R2 = 0.9727) and testing data (0.9890). This well confirmed the validity of LDA algorithm in predicting water quality. Cross validation showed a high accuracy (92.31%), high sensitivity (89.47%) and high specificity (95%). These findings are fundamentally important for an integrated water resource management in a larger context of sustainable development of the Kebili district.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

Data availability

Data are available upon request.

References

Download references

Acknowledgements

The authors extend their appreciation to Professor Md Galal Uddin from Civil Engineering, College of Science and Engineering, University of Galway, Ireland, for his careful and active contribution to the reviewing of the manuscript. His thoughtful comments and suggestions have undoubtedly improved the document submitted for evaluation.

Funding

This research received no external funding.

Author information

Authors and Affiliations

Authors

Contributions

ZK: Formal analysis, Writing—original draft, Writing—review and editing. Conceptualization, Methodology, Supervision, Data Curation, Visualization, Validation; KZ, RT: Conceptualization, Methodology, Funding acquisition, Validation.

Corresponding author

Correspondence to Zohra Kraiem.

Ethics declarations

Conflict of Interest

The authors declare no conflict of interest.

Informed consent

Not applicable.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kraiem, Z., Zouari, K. & Trabelsi, R. Harnessing machine learning tools for water quality assessment in the Kebili shallow aquifers, Southwestern Tunisia. Acta Geochim (2024). https://doi.org/10.1007/s11631-024-00689-z

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11631-024-00689-z

Keywords

Navigation