Skip to main content
Log in

Prediction of Irrigation Water Quality Indices Using Random Committee, Discretization Regression, REPTree, and Additive Regression

  • Published:
Water Resources Management Aims and scope Submit manuscript

Abstract

This study aims to evaluate the performance of four ensemble machine learning methods, i.e., Random Committee, Discretization Regression, Reduced Error Pruning Tree, and Additive Regression, to estimate water quality parameters of Biochemical Oxygen Demand BOD and Dissolved Oxygen DO. Data from Anbar City on the Euphrates River in western Iraq was employed for the model's training and validation. The best subset regression analysis and correlation analysis were used to determine the best input combinations and to ascertain variable correlation, respectively. Besides, sensitivity analysis was employed to determine the standardized coefficient for BOD and DO predictions, hence knowing the significance of the relevant physical and chemical parameters. Results revealed that temperature, turbidity, electrical conductivity, Ca++, and chemical oxygen demand were identified as the best input combinations for BOD prediction. In contrast, the variable combination of temperature, turbidity, chemical oxygen demand, SO4−1, and total suspended solids was identified as the best input combination for DO prediction. It was also demonstrated that the random committee model was superior for predictions of BOD and DO, followed by the discretization regression model. For predicting BOD (DO), the correlation coefficient and root mean square error were 0.8176 (0.7833) and 0.3291 (0.3544), respectively, during the testing stage. The present investigation provided approaches for addressing difficulties in irrigation water quality prediction through artificial intelligence techniques and thence serve as a tool to overcome the obstacles towards better water management.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Data Availability

The data presented in this study are available at a reasonable request from the corresponding author.

Abbreviations

Temp:

Temperature

COD:

Chemical Oxygen Demand

Turb:

Turbidity

EC:

Electrical Conductivity

Ca:

Calcium

TSS:

Total Suspended Solids

SO4:

Sulphate

TDS:

Total Dissolved Solids

Alk:

Alkaline

REPTree:

Reduced Error Pruning Tree

RC:

Random Committee

AR:

Additive Regression

RD:

Regression by Discretization

CC:

Coefficient of Correlation

MAE:

Mean Absolute Error

RMSE:

Root Mean Square Error

RAE:

Root Absolute Error

RRSE:

Root Relative Standard Error

References

  • Ahmed AN, Othman FB, Afan HA, Ibrahim RK, Fai CM, Hossain MS, Ehteram M, Elshafie A (2019) Machine learning methods for better water quality prediction. J Hydrol 578(May)

  • Antanasijević D, Pocajt V, Povrenović D et al (2013) Modelling dissolved oxygen content using artificial neural networks: Danube River, North Serbia, case study. Environ Sci Pollut Res 20:9006–9013

    Article  Google Scholar 

  • Asadollahfardi G, Taklify A, Ghanbari A (2021) Application of artificial neural network to predict TDS in Talkheh Rud River. J Irrig Drain Eng 138(4):363–370

    Article  Google Scholar 

  • Al-Sulttani AO, Al-Mukhtar M, Roomi AB, Farooque AA, Khedher KM, Yaseen ZM (2021) Proposition of new ensemble data-intelligence models for surface water quality prediction. IEEE Access 9:108527–108541

    Article  Google Scholar 

  • Biesbroek R, Wright SJ, Eguren SK, Bonotto A, Athanasiadis IN (2022) Policy attention to climate change impacts, adaptation and vulnerability: a global assessment of National Communications (1994–2019). Climate Policy 22(1):97–111

    Article  Google Scholar 

  • Chen K, Chen H, Zhou C, Huang Y, Qi X, Shen R, Liu F et al (2020) Comparative analysis of surface water quality prediction performance and identification of key water parameters using different machine learning models based on big data. Water Res 171:115454

    Article  Google Scholar 

  • Chen W, Hong H, Li S, Shahabi H, Wang Y, Wang X, Ahmad BB (2019) Flood susceptibility modelling using novel hybrid approach of reduced-error pruning trees with bagging and random subspace ensembles. J Hydrol 575:864–873

    Article  Google Scholar 

  • Chipman HA, George EI, McCulloch RE (2010) BART: Bayesian additive regression trees. Ann Appl Stat 4(1):266–298

    Article  Google Scholar 

  • Devasena CL (2014) Comparative analysis of random forest, REP tree and J48 classifiers for credit risk prediction. Int J Comput Appl 975(8887):30–36

    Google Scholar 

  • El Bilali A, Taleb A (2020) Prediction of irrigation water quality parameters using machine learning models in a semi-arid environment. J Saudi Soc Agric Sci 19(7):439–451

    Google Scholar 

  • Elbeltagi A, Raza A, Hu Y, Al-Ansari N, Kushwaha NL et al (2022) Data intelligence and hybrid metaheuristic algorithms-based estimation of reference evapotranspiration. Appl Water Sci 12(7):1–18

    Article  Google Scholar 

  • Elbeltagi A, Srivastava A, Kushwaha NL, Juhász C, Tamás J, Nagy A (2023) Meteorological data fusion approach for modeling crop water productivity based on ensemble machine learning. Water 15(1):30

    Article  Google Scholar 

  • Elkiran G, Nourani V, Abba SI (2019) Multi-step ahead modelling of river water quality parameters using ensemble artificial intelligence-based approach. J Hydrol 577:123962

    Article  Google Scholar 

  • Fuller R, Landrigan PJ, Balakrishnan K, Bathan G, Stephan Bose-O’Reilly, Michael Brauer, Jack Caravanos, et al (2022) Pollution and health: A progress update. Lancet Planet Health 6(6):e535–e547

    Article  Google Scholar 

  • Gad M, Saleh AH, Hussein H, Elsayed S, Farouk M (2023) Water quality evaluation and prediction using irrigation indices, artificial neural networks, and partial least square regression models for the Nile River, Egypt. Water 15(12):2244

    Article  Google Scholar 

  • Gao C, Hao M, Chen J, Gu C (2021) Simulation and design of joint distribution of rainfall and tide level in Wuchengxiyu Region. China. Urban Clim 40:101005. https://doi.org/10.1016/j.uclim.2021.101005

    Article  Google Scholar 

  • Hussan WU, Khurram Shahzad M, Seidel F, Nestmann F (2020) Application of soft computing models with input vectors of snow cover area in addition to hydro-climatic data to predict the sediment loads. Water (Switzerland) 12(5)

  • IPCC (2021) Technical Summary. Climate change 2021: The physical science basis. Contribution of Working Group I to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change. Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA, pp 33–144

    Google Scholar 

  • Islam ARM, Talukdar S, Akhter S, Eibek KU, Rahman M, Pal S et al (2022) Assessing the impact of the Farakka Barrage on hydrological alteration in the Padma River with future insight. Sustainability 14(9):5233

    Article  Google Scholar 

  • Jiao Y, Zhu G, Meng G, Lu S, Qiu D, Lin X, ... Sun N (2023) Estimating non-productive water loss in irrigated farmland in arid oasis regions: Based on stable isotope data. Agric Water Manag 289:108515

    Article  Google Scholar 

  • Kadkhodazadeh M, Farzin S (2021) A novel LSSVM model integrated with GBO algorithm to assessment of water quality parameters. Water Resour Manag 35(12):3939–3968

    Article  Google Scholar 

  • Khadke L, Pattnaik S (2021) Impact of initial conditions and cloud parameterization on the heavy rainfall event of Kerala (2018). Model Earth Syst Environ 7(4):2809–2822

    Article  Google Scholar 

  • Khosravi K, Golkarian A, Melesse AM, Deo RC (2022) Suspended sediment load modeling using advanced hybrid rotation forest based elastic network approach. J Hydrol 127963

  • Khosravi K, Mao L, Kisi O, Yaseen ZM, Shahid S (2018) Quantifying hourly suspended sediment load using data mining models: Case study of a glacierized andean catchment in Chile. J Hydrol 567:165–179

    Article  Google Scholar 

  • Khosravi K, Miraki S, Saco PM, Farmani R (2021) Short-term river streamflow modeling using ensemble-based additive learner approach. J Hydro-Environ Res 39:81–91

    Article  Google Scholar 

  • Kim SE, Seo IW (2015) Artificial neural network ensemble modeling with conjunctive data clustering for water quality prediction in rivers. J Hydro-Environ Res 9(3):325–339

    Article  Google Scholar 

  • Li Q, Lu L, Zhao Q, Hu S (2023) Impact of inorganic solutes’ Release in groundwater during oil shale in situ exploitation. Water 15(1):172

    Article  Google Scholar 

  • Liu Z, Xu J, Liu M, Yin Z, Liu X, Yin L, ... Zheng W (2023) Remote sensing and geostatistics in urban water-resource monitoring: a review. Mar Freshw Res. https://doi.org/10.1071/MF22167

    Article  Google Scholar 

  • Luo J, Niu F, Lin Z, Liu M, Yin G, ... Gao Z (2022) Abrupt increase in thermokarst lakes on the central Tibetan Plateau over the last 50 years. CATENA 217:106497. https://doi.org/10.1016/j.catena.2022.106497

    Article  Google Scholar 

  • Lu H, Ma X (2020) Hybrid decision tree-based machine learning models for short-term water quality prediction. Chemosphere 249:126169

    Article  Google Scholar 

  • Mahdi N, Amirhossein A, Mohammad G, Benyamin C, Mostafa HK, Kourosh B (2023) A smart sustainable decision support system for water management of power plants in water stress regions. Expert Syst Appl 230:120752

    Article  Google Scholar 

  • Nguyen DH, Le XH, Anh DT, Kim SH, Bae DH (2022) Hourly streamflow forecasting using a Bayesian additive regression tree model hybridized with a genetic algorithm. J Hydrol 606:127445. https://doi.org/10.1016/j.jhydrol.2022.127445

    Article  Google Scholar 

  • Niranjan A, Nutan DH, Nitish A, Shenoy PD, Venugopal KR (2018, April) ERCR TV: Ensemble of random committee and random tree for efficient anomaly classification using voting. Int Conf Converg Technol (I2CT) 1–5. IEEE

  • Niranjan A, Prakash A, Veena N, Geetha M, Shenoy PD, Venugopal KR (2017, December) EBJRV: An ensemble of Bagging, J48 and random committee by voting for efficient classification of intrusions. Int WIE Conf Electr Comput Eng (WIECON-ECE) 51–54

  • Piraei R, Afzali SH, Niazkar M (2023) Assessment of XGBoost to estimate total sediment loads in rivers. Water Resour Manag 0123456789

  • Qiu D, Zhu G, Lin X, Jiao Y, Lu S, Liu J et al (2023) Dissipation and movement of soil water in artificial forest in arid oasis areas: Cognition based on stable isotopes. CATENA 228:107178

    Article  Google Scholar 

  • Rajaee T, Khani S, Ravansalar M (2020) Artificial intelligence-based single and hybrid models for prediction of water quality in rivers: a review. Chemom Intell Lab Syst 200(February):103978

    Article  Google Scholar 

  • Rui S, Zhou Z, Jostad HP, Wang L, Guo Z (2023) Numerical prediction of potential 3-dimensional seabed trench profiles considering complex motions of mooring line. Appl Ocean Res 139:103704

    Article  Google Scholar 

  • Saha TK, Pal S, Sarda R (2022) Impact of river flow modification on wetland hydrological and morphological characters. Environ Scie Pollut Res 1–21

  • Sasan Z, Fatemeh GJ, Jiří JK, Awais B, Mostafa HK (2023) Sustainable and optimized values for municipal wastewater: The removal of biological oxygen demand and chemical oxygen demand by various levels of geranular activated carbon- and genetic algorithm-based simulation. J Clean Prod 417:137932

    Article  Google Scholar 

  • Shahdad M, Saber B (2022) Drought forecasting using new advanced ensemble-based models of reduced error pruning tree. Acta Geophys 70(2):697–712

    Article  Google Scholar 

  • Shamshirband S, Nodoushan EJ, Adolf JE, Manaf AA, Mosavi A, Chau KW (2019) Ensemble models with uncertainty analysis for multi-day ahead forecasting of chlorophyll a concentration in coastal waters. Eng App Comput Fluid Mech 13(1):91–101

    Google Scholar 

  • Singha S, Pasupuleti S, Singha SS, Singh R, Kumar S (2021) Prediction of groundwater quality using efficient machine learning technique. Chemosphere 276:130265

    Article  Google Scholar 

  • Tao H, Al-Khafaji ZS, Qi C, Zounemat-Kermani M, Kisi O, Tiyasha T, Chau KW et al (2021) Artificial intelligence models for suspended river sediment prediction: State-of-the art, modeling framework appraisal, and proposed future research directions. Eng App Computa Fluid Mech 15(1):1585–1612

    Google Scholar 

  • Tiyasha, Tung TM, Yaseen ZM (2020) A survey on river water quality modelling using artificial intelligence models: 2000–2020. J Hydrology 585–124670. https://doi.org/10.1016/j.jhydrol.2020.124670

  • Wang P, Yao J, Wang G, Hao F, Shrestha S, Xue B, Xie G, Peng Y (2019) Exploring the application of artificial intelligence technology for identification of water pollution characteristics and tracing the source of water quality pollutants. Sci Total Environ 693:133440

    Article  Google Scholar 

  • Wu X, Feng X, Wang Z, Chen Y, Deng Z (2023) Multi-source precipitation products assessment on drought monitoring across global major river basins. Atmos Res 295:106982

    Article  Google Scholar 

  • Yin L, Wang L, Li T, Lu S, Yin Z, Liu X, ... Zheng W (2023a) U-Net-STN: A novel end-to-end lake boundary prediction model. Land 12(8):1602. https://doi.org/10.3390/land12081602

    Article  Google Scholar 

  • Yin L, Wang L, Keim BD, Konsoer K, Yin Z, Liu M, ... Zheng W (2023b) Spatial and wavelet analysis of precipitation and river discharge during operation of the Three Gorges Dam, China. Ecol Indic 154:110837

    Article  Google Scholar 

  • Yin L, Wang L, Li T, Lu S, Tian J, Yin Y, ... Zheng W (2023c) U-Net-LSTM: Time series-enhanced lake boundary prediction modeL. Land 12(10):1859

    Article  Google Scholar 

  • Zhou G, Yang Z (2023) Analysis for 3-D morphology structural changes for underwater topographical in Culebrita Island. Int J Remote Sens 44(7):2458–2479

    Article  Google Scholar 

  • Zhu G, Liu Y, Shi P, Jia W, Zhou J, Liu Y, ... Zhao K (2022a) Stable water isotope monitoring network of different water bodies in Shiyang River basin, a typical arid river in China. Earth Syst Sci Data 14(8):3773–3789

    Article  Google Scholar 

  • Zhu X, Xu Z, Liu Z, Liu M, Yin Z, Yin L, ... Zheng W (2022) Impact of dam construction on precipitation: a regional perspective. Marine and Freshwater Research. https://doi.org/10.1071/MF22135

Download references

Funding

There is no external and internal fund for this work.

Author information

Authors and Affiliations

Authors

Contributions

Ahmed Elbeltagi had the main idea of the paper; Mustafa Al-Mukhtar prepared the datasets; Ahmed Elbeltagi analyzed datasets by using a multi-collinearity statistical method, sensitivity method, best subset regression method, developed and implemented the ML models, supervision, Conceptualization, Funding Acquisition; Aman Srivastava and Leena Khadke: conducted analysis, developed plots, and drafted content for model description, results, and discussions; Tariq Al-Musawi writing, review and editing; Mustafa Al-Mukhtar completed abstract, introduction, study area description and conclusion; Aman Srivastava and Ahmed Elbeltagi improved and reviewed the manuscript sections. Mustafa Al-Mukhtar and Aman Srivastava have contributed equally to this work and shared the first authorship. All authors read and approved the final version of the paper.

Corresponding author

Correspondence to Ahmed Elbeltagi.

Ethics declarations

Ethics Approval

The authors confirm that this article is original research and has not been published or presented previously in any journal or conference in any language (in whole or in part).

Consent to Participate and Consent to Publish

The authors declared that they approved on the submission of the final manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Al-Mukhtar, M., Srivastava, A., Khadke, L. et al. Prediction of Irrigation Water Quality Indices Using Random Committee, Discretization Regression, REPTree, and Additive Regression. Water Resour Manage 38, 343–368 (2024). https://doi.org/10.1007/s11269-023-03674-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11269-023-03674-y

Keywords

Navigation