Abstract
This study aims to evaluate the performance of four ensemble machine learning methods, i.e., Random Committee, Discretization Regression, Reduced Error Pruning Tree, and Additive Regression, to estimate water quality parameters of Biochemical Oxygen Demand BOD and Dissolved Oxygen DO. Data from Anbar City on the Euphrates River in western Iraq was employed for the model's training and validation. The best subset regression analysis and correlation analysis were used to determine the best input combinations and to ascertain variable correlation, respectively. Besides, sensitivity analysis was employed to determine the standardized coefficient for BOD and DO predictions, hence knowing the significance of the relevant physical and chemical parameters. Results revealed that temperature, turbidity, electrical conductivity, Ca++, and chemical oxygen demand were identified as the best input combinations for BOD prediction. In contrast, the variable combination of temperature, turbidity, chemical oxygen demand, SO4−1, and total suspended solids was identified as the best input combination for DO prediction. It was also demonstrated that the random committee model was superior for predictions of BOD and DO, followed by the discretization regression model. For predicting BOD (DO), the correlation coefficient and root mean square error were 0.8176 (0.7833) and 0.3291 (0.3544), respectively, during the testing stage. The present investigation provided approaches for addressing difficulties in irrigation water quality prediction through artificial intelligence techniques and thence serve as a tool to overcome the obstacles towards better water management.
Similar content being viewed by others
Data Availability
The data presented in this study are available at a reasonable request from the corresponding author.
Abbreviations
- Temp:
-
Temperature
- COD:
-
Chemical Oxygen Demand
- Turb:
-
Turbidity
- EC:
-
Electrical Conductivity
- Ca:
-
Calcium
- TSS:
-
Total Suspended Solids
- SO4:
-
Sulphate
- TDS:
-
Total Dissolved Solids
- Alk:
-
Alkaline
- REPTree:
-
Reduced Error Pruning Tree
- RC:
-
Random Committee
- AR:
-
Additive Regression
- RD:
-
Regression by Discretization
- CC:
-
Coefficient of Correlation
- MAE:
-
Mean Absolute Error
- RMSE:
-
Root Mean Square Error
- RAE:
-
Root Absolute Error
- RRSE:
-
Root Relative Standard Error
References
Ahmed AN, Othman FB, Afan HA, Ibrahim RK, Fai CM, Hossain MS, Ehteram M, Elshafie A (2019) Machine learning methods for better water quality prediction. J Hydrol 578(May)
Antanasijević D, Pocajt V, Povrenović D et al (2013) Modelling dissolved oxygen content using artificial neural networks: Danube River, North Serbia, case study. Environ Sci Pollut Res 20:9006–9013
Asadollahfardi G, Taklify A, Ghanbari A (2021) Application of artificial neural network to predict TDS in Talkheh Rud River. J Irrig Drain Eng 138(4):363–370
Al-Sulttani AO, Al-Mukhtar M, Roomi AB, Farooque AA, Khedher KM, Yaseen ZM (2021) Proposition of new ensemble data-intelligence models for surface water quality prediction. IEEE Access 9:108527–108541
Biesbroek R, Wright SJ, Eguren SK, Bonotto A, Athanasiadis IN (2022) Policy attention to climate change impacts, adaptation and vulnerability: a global assessment of National Communications (1994–2019). Climate Policy 22(1):97–111
Chen K, Chen H, Zhou C, Huang Y, Qi X, Shen R, Liu F et al (2020) Comparative analysis of surface water quality prediction performance and identification of key water parameters using different machine learning models based on big data. Water Res 171:115454
Chen W, Hong H, Li S, Shahabi H, Wang Y, Wang X, Ahmad BB (2019) Flood susceptibility modelling using novel hybrid approach of reduced-error pruning trees with bagging and random subspace ensembles. J Hydrol 575:864–873
Chipman HA, George EI, McCulloch RE (2010) BART: Bayesian additive regression trees. Ann Appl Stat 4(1):266–298
Devasena CL (2014) Comparative analysis of random forest, REP tree and J48 classifiers for credit risk prediction. Int J Comput Appl 975(8887):30–36
El Bilali A, Taleb A (2020) Prediction of irrigation water quality parameters using machine learning models in a semi-arid environment. J Saudi Soc Agric Sci 19(7):439–451
Elbeltagi A, Raza A, Hu Y, Al-Ansari N, Kushwaha NL et al (2022) Data intelligence and hybrid metaheuristic algorithms-based estimation of reference evapotranspiration. Appl Water Sci 12(7):1–18
Elbeltagi A, Srivastava A, Kushwaha NL, Juhász C, Tamás J, Nagy A (2023) Meteorological data fusion approach for modeling crop water productivity based on ensemble machine learning. Water 15(1):30
Elkiran G, Nourani V, Abba SI (2019) Multi-step ahead modelling of river water quality parameters using ensemble artificial intelligence-based approach. J Hydrol 577:123962
Fuller R, Landrigan PJ, Balakrishnan K, Bathan G, Stephan Bose-O’Reilly, Michael Brauer, Jack Caravanos, et al (2022) Pollution and health: A progress update. Lancet Planet Health 6(6):e535–e547
Gad M, Saleh AH, Hussein H, Elsayed S, Farouk M (2023) Water quality evaluation and prediction using irrigation indices, artificial neural networks, and partial least square regression models for the Nile River, Egypt. Water 15(12):2244
Gao C, Hao M, Chen J, Gu C (2021) Simulation and design of joint distribution of rainfall and tide level in Wuchengxiyu Region. China. Urban Clim 40:101005. https://doi.org/10.1016/j.uclim.2021.101005
Hussan WU, Khurram Shahzad M, Seidel F, Nestmann F (2020) Application of soft computing models with input vectors of snow cover area in addition to hydro-climatic data to predict the sediment loads. Water (Switzerland) 12(5)
IPCC (2021) Technical Summary. Climate change 2021: The physical science basis. Contribution of Working Group I to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change. Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA, pp 33–144
Islam ARM, Talukdar S, Akhter S, Eibek KU, Rahman M, Pal S et al (2022) Assessing the impact of the Farakka Barrage on hydrological alteration in the Padma River with future insight. Sustainability 14(9):5233
Jiao Y, Zhu G, Meng G, Lu S, Qiu D, Lin X, ... Sun N (2023) Estimating non-productive water loss in irrigated farmland in arid oasis regions: Based on stable isotope data. Agric Water Manag 289:108515
Kadkhodazadeh M, Farzin S (2021) A novel LSSVM model integrated with GBO algorithm to assessment of water quality parameters. Water Resour Manag 35(12):3939–3968
Khadke L, Pattnaik S (2021) Impact of initial conditions and cloud parameterization on the heavy rainfall event of Kerala (2018). Model Earth Syst Environ 7(4):2809–2822
Khosravi K, Golkarian A, Melesse AM, Deo RC (2022) Suspended sediment load modeling using advanced hybrid rotation forest based elastic network approach. J Hydrol 127963
Khosravi K, Mao L, Kisi O, Yaseen ZM, Shahid S (2018) Quantifying hourly suspended sediment load using data mining models: Case study of a glacierized andean catchment in Chile. J Hydrol 567:165–179
Khosravi K, Miraki S, Saco PM, Farmani R (2021) Short-term river streamflow modeling using ensemble-based additive learner approach. J Hydro-Environ Res 39:81–91
Kim SE, Seo IW (2015) Artificial neural network ensemble modeling with conjunctive data clustering for water quality prediction in rivers. J Hydro-Environ Res 9(3):325–339
Li Q, Lu L, Zhao Q, Hu S (2023) Impact of inorganic solutes’ Release in groundwater during oil shale in situ exploitation. Water 15(1):172
Liu Z, Xu J, Liu M, Yin Z, Liu X, Yin L, ... Zheng W (2023) Remote sensing and geostatistics in urban water-resource monitoring: a review. Mar Freshw Res. https://doi.org/10.1071/MF22167
Luo J, Niu F, Lin Z, Liu M, Yin G, ... Gao Z (2022) Abrupt increase in thermokarst lakes on the central Tibetan Plateau over the last 50 years. CATENA 217:106497. https://doi.org/10.1016/j.catena.2022.106497
Lu H, Ma X (2020) Hybrid decision tree-based machine learning models for short-term water quality prediction. Chemosphere 249:126169
Mahdi N, Amirhossein A, Mohammad G, Benyamin C, Mostafa HK, Kourosh B (2023) A smart sustainable decision support system for water management of power plants in water stress regions. Expert Syst Appl 230:120752
Nguyen DH, Le XH, Anh DT, Kim SH, Bae DH (2022) Hourly streamflow forecasting using a Bayesian additive regression tree model hybridized with a genetic algorithm. J Hydrol 606:127445. https://doi.org/10.1016/j.jhydrol.2022.127445
Niranjan A, Nutan DH, Nitish A, Shenoy PD, Venugopal KR (2018, April) ERCR TV: Ensemble of random committee and random tree for efficient anomaly classification using voting. Int Conf Converg Technol (I2CT) 1–5. IEEE
Niranjan A, Prakash A, Veena N, Geetha M, Shenoy PD, Venugopal KR (2017, December) EBJRV: An ensemble of Bagging, J48 and random committee by voting for efficient classification of intrusions. Int WIE Conf Electr Comput Eng (WIECON-ECE) 51–54
Piraei R, Afzali SH, Niazkar M (2023) Assessment of XGBoost to estimate total sediment loads in rivers. Water Resour Manag 0123456789
Qiu D, Zhu G, Lin X, Jiao Y, Lu S, Liu J et al (2023) Dissipation and movement of soil water in artificial forest in arid oasis areas: Cognition based on stable isotopes. CATENA 228:107178
Rajaee T, Khani S, Ravansalar M (2020) Artificial intelligence-based single and hybrid models for prediction of water quality in rivers: a review. Chemom Intell Lab Syst 200(February):103978
Rui S, Zhou Z, Jostad HP, Wang L, Guo Z (2023) Numerical prediction of potential 3-dimensional seabed trench profiles considering complex motions of mooring line. Appl Ocean Res 139:103704
Saha TK, Pal S, Sarda R (2022) Impact of river flow modification on wetland hydrological and morphological characters. Environ Scie Pollut Res 1–21
Sasan Z, Fatemeh GJ, Jiří JK, Awais B, Mostafa HK (2023) Sustainable and optimized values for municipal wastewater: The removal of biological oxygen demand and chemical oxygen demand by various levels of geranular activated carbon- and genetic algorithm-based simulation. J Clean Prod 417:137932
Shahdad M, Saber B (2022) Drought forecasting using new advanced ensemble-based models of reduced error pruning tree. Acta Geophys 70(2):697–712
Shamshirband S, Nodoushan EJ, Adolf JE, Manaf AA, Mosavi A, Chau KW (2019) Ensemble models with uncertainty analysis for multi-day ahead forecasting of chlorophyll a concentration in coastal waters. Eng App Comput Fluid Mech 13(1):91–101
Singha S, Pasupuleti S, Singha SS, Singh R, Kumar S (2021) Prediction of groundwater quality using efficient machine learning technique. Chemosphere 276:130265
Tao H, Al-Khafaji ZS, Qi C, Zounemat-Kermani M, Kisi O, Tiyasha T, Chau KW et al (2021) Artificial intelligence models for suspended river sediment prediction: State-of-the art, modeling framework appraisal, and proposed future research directions. Eng App Computa Fluid Mech 15(1):1585–1612
Tiyasha, Tung TM, Yaseen ZM (2020) A survey on river water quality modelling using artificial intelligence models: 2000–2020. J Hydrology 585–124670. https://doi.org/10.1016/j.jhydrol.2020.124670
Wang P, Yao J, Wang G, Hao F, Shrestha S, Xue B, Xie G, Peng Y (2019) Exploring the application of artificial intelligence technology for identification of water pollution characteristics and tracing the source of water quality pollutants. Sci Total Environ 693:133440
Wu X, Feng X, Wang Z, Chen Y, Deng Z (2023) Multi-source precipitation products assessment on drought monitoring across global major river basins. Atmos Res 295:106982
Yin L, Wang L, Li T, Lu S, Yin Z, Liu X, ... Zheng W (2023a) U-Net-STN: A novel end-to-end lake boundary prediction model. Land 12(8):1602. https://doi.org/10.3390/land12081602
Yin L, Wang L, Keim BD, Konsoer K, Yin Z, Liu M, ... Zheng W (2023b) Spatial and wavelet analysis of precipitation and river discharge during operation of the Three Gorges Dam, China. Ecol Indic 154:110837
Yin L, Wang L, Li T, Lu S, Tian J, Yin Y, ... Zheng W (2023c) U-Net-LSTM: Time series-enhanced lake boundary prediction modeL. Land 12(10):1859
Zhou G, Yang Z (2023) Analysis for 3-D morphology structural changes for underwater topographical in Culebrita Island. Int J Remote Sens 44(7):2458–2479
Zhu G, Liu Y, Shi P, Jia W, Zhou J, Liu Y, ... Zhao K (2022a) Stable water isotope monitoring network of different water bodies in Shiyang River basin, a typical arid river in China. Earth Syst Sci Data 14(8):3773–3789
Zhu X, Xu Z, Liu Z, Liu M, Yin Z, Yin L, ... Zheng W (2022) Impact of dam construction on precipitation: a regional perspective. Marine and Freshwater Research. https://doi.org/10.1071/MF22135
Funding
There is no external and internal fund for this work.
Author information
Authors and Affiliations
Contributions
Ahmed Elbeltagi had the main idea of the paper; Mustafa Al-Mukhtar prepared the datasets; Ahmed Elbeltagi analyzed datasets by using a multi-collinearity statistical method, sensitivity method, best subset regression method, developed and implemented the ML models, supervision, Conceptualization, Funding Acquisition; Aman Srivastava and Leena Khadke: conducted analysis, developed plots, and drafted content for model description, results, and discussions; Tariq Al-Musawi writing, review and editing; Mustafa Al-Mukhtar completed abstract, introduction, study area description and conclusion; Aman Srivastava and Ahmed Elbeltagi improved and reviewed the manuscript sections. Mustafa Al-Mukhtar and Aman Srivastava have contributed equally to this work and shared the first authorship. All authors read and approved the final version of the paper.
Corresponding author
Ethics declarations
Ethics Approval
The authors confirm that this article is original research and has not been published or presented previously in any journal or conference in any language (in whole or in part).
Consent to Participate and Consent to Publish
The authors declared that they approved on the submission of the final manuscript.
Conflicts of Interest
The authors declare no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Al-Mukhtar, M., Srivastava, A., Khadke, L. et al. Prediction of Irrigation Water Quality Indices Using Random Committee, Discretization Regression, REPTree, and Additive Regression. Water Resour Manage 38, 343–368 (2024). https://doi.org/10.1007/s11269-023-03674-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11269-023-03674-y