Skip to main content

Advertisement

Log in

Dissolved oxygen concentration predictions for running waters with using hybrid machine learning techniques

  • Original Article
  • Published:
Modeling Earth Systems and Environment Aims and scope Submit manuscript

Abstract

Water is one of the most essential elements in nature that forms the basis of human life and contributes to the economic growth and development of societies. Safe water is closely related to environmental health and activities. The lives of all the animals on our planet depend on water and oxygen. Moreover, sufficient dissolved oxygen (DO) is crucial for the survival of aquatic animals. In the present research, temperature (T) and flow (Q) variables were used to predict DO. The time series were monthly and data were related to the Cumberland River in the southern United States from 2008 to 2018. Support vector regression (SVR) was employed for prediction of the model in both standalone and hybrid forms. The employed hybrid models consisted in SVR combined with metaheuristic algorithms of chicken swarm optimization (CSO), social ski-driver (SSD) optimization, Black widow optimization (BWO), and the Algorithm of the innovative gunner (AIG). Pearson correlation coefficient was utilized to select the best input combination. Box plots and Taylor diagrams were employed in the interpretation of the results. It was observed that all the four hybrid models achieved better results. Also according to the evaluation criteria among the models used the following were found: SVR–AIG with the coefficient of determination (R2 = 0.963), the root mean square error (RMSE = 0.644 mg/l), the mean absolute value of error (MAE = 0.568 mg/l), the Nash–Sutcliffe coefficient (NS = 0.864), and bias percentage (BIAS = 0.001). Overall the research showed that hybrid models increased the accuracy of the single SVR model by 6.52–1.75%.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

References

  • Adhaileh MH, Alsaade FW (2021) Modelling and prediction of water quality by using artificial intelligence. Sustainability 13(4):42–59. https://doi.org/10.3390/su13084259

    Article  Google Scholar 

  • Afan HA, El-Shafie A, Yaseen ZM, Hameed MM, Mohtar HW, Hussain A (2015) ANN based sediment prediction model utilizing different input scenarios. Water Resour Manage 29(4):1231–1245

    Google Scholar 

  • Ahmed MH, Lin LS (2021) Dissolved oxygen concentration predictions for running waters with different land use land cover using a quantile regression forest machine learning technique. J Hydrol 597:324–341. https://doi.org/10.1016/j.jhydrol.2021.126213

    Article  Google Scholar 

  • Ahmed AAM, Shah SMA (2017) Application of adaptive neuro-fuzzy inference system (ANFIS) to estimate the biochemical oxygen demand (BOD) of Surma River. J King Saud Univ Eng Sci 29(3):237–243

    Google Scholar 

  • Alizadeh MJ, Kavianpour MR (2015) Development of wavelet-ANN models to predict water quality parameters in Hilo Bay, Pacific Ocean. Mar Pollut Bull 98(1–2):171–182

    Google Scholar 

  • Asadollah SB, Sharafati A, Motta D, Yaseen ZM (2021) River water quality index prediction and uncertainty analysis: a comparative study of machine learning models. J Environ Chem Eng 9(4):228–245. https://doi.org/10.1016/j.jece.2020.104599

    Article  Google Scholar 

  • ASCE (1993) Criteria for evaluation of watershed models. J Irrig Drain Eng 119(3):429–442

    Google Scholar 

  • Basak D, Pal S, Patranabis DC (2007) Support vector regression. Neural Inf Process 11:203–225

    Google Scholar 

  • Chai T, Draxler RR (2014) Root mean square error (RMSE) or mean absolute error (MAE)? -Arguments against avoiding RMSE in the literature. Geosci Model Dev 7:1247–1250

    Google Scholar 

  • Chapman D (1992) Water Quality Assessments. Chapman and Hall Ltd, London, pp 88–104

    Google Scholar 

  • Dabanlı İ, Şen Z (2018) Precipitation projections under GCMs perspective and Turkish Water Foundation (TWF) statistical downscaling model procedures. Theoret Appl Climatol 132:153–166

    Google Scholar 

  • Dehghani R, Torabipoudeh H, Younesi H, Shahinejad B, (2020) Daily streamflow prediction using support vector machine-artificial flora (SVM-AF) hybrid model. Acta Geophys 68:1763–1778

    Google Scholar 

  • Diaz RJ, Rosenberg R (2008) Spreading dead zones and consequences for marine ecosystems. Science 321(4):926–929

    Google Scholar 

  • Dizaji AR, Hosseini SA, Rezaverdinejad V, Sharafati A (2020) Groundwater contamination vulnerability assessment using DRASTIC method, GSA, and uncertainty analysis. Arab J Geosci 13(4):1–15

    Google Scholar 

  • Dogan E, Lent Sengorur B, Koklu R (2009) Modeling biological oxygen demand of the Melen River in Turkey using an artificial neural network technique. J Environ Manage 90(5):19–35

    Google Scholar 

  • Duie Tien B, Khosravi K, Tiefenbacher J, Nguyen H, Kazakis N (2020) Improving prediction of water quality indices using novel hybrid machine-learning algorithms. J Sci Total Environ 721(4):136–154

    Google Scholar 

  • Forstinus NO, Ikechukwu NE, Emenike MP, Christiana AO (2016) Water and waterborne diseases: a review. Int J Trop Dis Health 12(4):1–14

    Google Scholar 

  • Gocić M, Motamedi S, Shamshirband S, Petković D, Ch S, Hashim R, Arif M (2015) Soft computing approaches for forecasting reference evapotranspiration. Comput Electron Agric 113(4):164–173. https://doi.org/10.1016/j.compag.2015.02.010

    Article  Google Scholar 

  • Guo H, Hung JJ, Zhu X, Wang B, Tiang S, Xu W, Mai Y (2021) A generalized machine learning approach for dissolved oxygen estimation at multiple spatiotemporal scales using remote sensing. Environ Pollut 288(4):58–69. https://doi.org/10.1016/j.envpol.2021.117734

    Article  Google Scholar 

  • Gupta HV, Sorooshian S, Yapo PO (1999) Status of automatic calibration for hydrologic models: comparison with multilevel expert calibration. J Hydrol Eng 4(2):135–143

    Google Scholar 

  • Hamel L (2009) Knowledge discovery with support vector machines. Wiley, Hoboken

    Google Scholar 

  • Hayyolalam V, Kazem AAP (2020) Black widow optimization algorithm: A novel meta-heuristic approach for solving engineering optimization problems. Eng Appl Artif Intell 87:103249

    Google Scholar 

  • Huang J, Liu S, Gua Hassan S, Xu L, Hunag C (2021) A hybrid model for short-term dissolved oxygen content prediction. Comput Electron Agric 186(4):325–339. https://doi.org/10.1016/j.compag.2021.106216

    Article  Google Scholar 

  • Ighalo JO, Adeniyi AG, Adeniran JA, Ogunniyi S (2020) A systematic literature analysis of the nature and regional distribution of water pollution sources in Nigeria. J Clean Prod 124(3):566–576

    Google Scholar 

  • Kesgin E, Agaccioglu H, Dogan A (2020) Experimental and numerical investigation of drainage mechanisms at sports fields under simulated rainfall. J Hydrol 580:124251

    Google Scholar 

  • Khalil B, Adamowski J, Abdin A, Elsaadi A (2019) A statistical approach for the estimation of water quality characteristics of ungauged streams/watersheds under stationary conditions. J Hydrol 569(4):106–116

    Google Scholar 

  • Khosravi K, Nohani E, Maroufinia E, Pourghasemi HRA (2016) GIS-based flood susceptibility assessment and its mapping in Iran: a comparison between frequency ratio and weights of evidence bivariate statistical models with multi-criteria method. Nat Hazards 83(2):1–41

    Google Scholar 

  • Kisi O, Karahan M, Sen Z (2006) River suspended sediment modeling using fuzzy logic approach. Hydrol Process 20:4351–4362

    Google Scholar 

  • Kisi O, Ay M (2012) Comparison of ANN and ANFIS techniques in modeling dissolved oxygen. In: Sixteenth International Water Technology Conference, IWTC-16. Istanbul, Turkey, pp 1–10

  • Krishna RS, Mishra J, Ighalo JO (2020) Rising Demand for Rain Water Harvesting System in the World: A Case Study of Joda Town, India. World Sci News 146(4):47–59

    Google Scholar 

  • Legates DR, McCabe GJ (1999) Evaluating the use of “goodness-of-fit” measures in hydrologic and hydroclimatic model validation. Water Recourse Res 35:233–241

    Google Scholar 

  • Li W, Wu H, Zhu N, Jiang Y, Tan J, Guo Y (2020a) Prediction of dissolved oxygen in a fishery pond based on gated recurrent unit (GRU). Inf Process Agric 8(1):185–193

    Google Scholar 

  • Li W, Fang H, Qin G, Tan X, Huang Z, Zeng F, Du H, Li S (2020b) Concentration estimation of dissolved oxygen in Pearl River Basin using input variable selection and machine learning techniques. Sci Total Environ 731:128–139. https://doi.org/10.1016/j.scitotenv.2020.139099

    Article  Google Scholar 

  • Lin JY, Cheng CT, Chau KW (2006) Using support vector machines for long-term discharge prediction. Hydrol Sci J 51(3):599–612

    Google Scholar 

  • Liu H, Yang R, Duan Z, Wu H (2021) A hybrid neural network model for marine dissolved oxygen concentrations time-series forecasting based on multi-factor analysis and a multi-model ensemble. Engineering 19(4):112–128. https://doi.org/10.1016/j.eng.2020.10.023

    Article  Google Scholar 

  • Lo Conti F, Hsu KL, Noto LV, Sorooshian S (2014) Evaluation and comparison of satellite precipitation estimates with reference to a local area in the Mediterranean Sea. Atmos Res 138:189–204

    Google Scholar 

  • Lu H, Ma X (2020) Hybrid decision tree-based machine learning models for short-term water quality prediction. Chemosphere 249:126169

    Google Scholar 

  • Meng X, Liu Y, Gao X, Zhang H (2014) A new bio-inspired algorithm: chicken swarm optimization. In: International conference in swarm intelligence, vol 10. Springer, Cham, pp 86–94

  • Misra D, Oommen T, Agarwal A, Mishra SK, Thompson AM (2009) Application and analysis of support vector machine based simulation for runoff and sediment yield. Biosyst Eng 103(3):527–535

    Google Scholar 

  • Moriasi DN, Arnold JG, Van Liew MW, Bingner RL, Harmel RD, Veith TL (2007) Model evaluation guidelines for systematic quantification of accuracy in watershed simulations. Am Soc Agric Biol Eng 50(3):885–900

    Google Scholar 

  • Musie M, Sen S, Srivastava P (2019) Comparison and evaluation of gridded precipitation datasets for streamflow simulation in data scarce watersheds of Ethiopia. J Hydrol 579:124168

  • Nagelkerke NJD (1991) A Note on a General Definition of the Coefficient of Determination. Biometrika 78(3):691–692. https://doi.org/10.1093/biomet/78.3.691

    Article  Google Scholar 

  • Nagy H, Watanabe K, Hirano M (2002) Prediction of sediment load concentration in rivers using artificial neural network model. J Hydraul Eng 128:558–559

    Google Scholar 

  • Nash JE, Sutcliffe JV (1970) River flow forecasting through conceptual models: part 1. A discussion of principles. J Hydrol 10(3):282–290

    Google Scholar 

  • Niu WJ, Feng ZK, Zeng M, Feng BF, Min YW, Cheng CT, Zhou JZ (2019) Forecasting reservoir monthly runoff via ensemble empirical mode decomposition and extreme learning machine optimized by an improved gravitational search algorithm. Appl Soft Comput J 82(4):589–598

    Google Scholar 

  • Nourani V (2017) An emotional ANN (EANN) approach to modeling rainfall-runoff process. J Hydrol 544(3):267–277

    Google Scholar 

  • Pengxin D, Zhang M, Bing J, Jia J, Zhang D (2019) Evaluation of the GSMaP_Gauge products using rain gauge observations and SWAT model in the Upper Hanjiang River Basin. Atmos Res 2191:153–165

    Google Scholar 

  • Pijarski P, Kacejko P (2019) A new metaheuristic optimization method: the algorithm of the innovative gunner (AIG). Eng Optim 51(12):2049–2068

    Google Scholar 

  • Pincus R, Batstone CP, Hofmann RJP, Taylor KE, Glecker PJ (2008) Evaluating the present-day simulation of clouds, precipitation, and radiation in climate models. J Geophys Res Atmos 113(D14):1–10

    Google Scholar 

  • Poli R, Kennedy J, Blackwell T (2007) Particle Swarm Optimization. Swarm Intell 1(1):33–57

    Google Scholar 

  • Priyadarshi N, Azam F, Solanki SS, Sharma AK, Bhoi AK, Almakhles D (2017) A bio-inspired chicken swarm optimization-based fuel cell system for electric vehicle applications. Bio-inspired neurocomputing 10. Springer, Singapore, pp 297–308

    Google Scholar 

  • Radwan M, Willems P, El-Sadek A, Berlamont J (2003) Modelling of dissolved oxygen and biochemical oxygen demand in river water using a detailed and simplified model. Int J River Basin Manage 1(4):97–103

    Google Scholar 

  • Rajaee T, Khani S, Ravansalar M (2020) Artificial intelligence-based single and hybrid models for prediction of water quality in rivers: a review. Chemom Intell Lab Syst 200(4):186–197. https://doi.org/10.1016/j.chemolab.2020.103978

    Article  Google Scholar 

  • Ross AC, Stock AC (2019) An assessment of the predictability of column minimum dissolved oxygen concentrations in Chesapeake Bay using a machine learning model. Estuarine”. Coastal Shelf Sci 221:53–65. https://doi.org/10.1016/j.ecss.2019.03.007

    Article  Google Scholar 

  • Salcedo-Sanz S, Deo RC, Carro-Calvo L, Saavedra-Moreno B (2016) Monthly prediction of air temperature in Australia and New Zealand with machine learning algorithms”. Theoret Appl Climatol 125(4):13–25

    Google Scholar 

  • Satari B, Karimi K, Taherzadeh M, Zamani A (2016) Co-production of fungal biomass derived constituents and ethanol from citrus wastes free sugars without auxiliary nutrients in airlift bioreactor. Int J Mol Sci 17(3):302. https://doi.org/10.3390/ijms17030302

    Article  Google Scholar 

  • Sebastian PA, Peter KV (2009) Spiders of India. Universities press, Hyderabad

    Google Scholar 

  • Sevat E, Dezetter A (1991) Selection of calibration objective functions in the context of rainfall-runoff modeling in a Sudanese savannah area. Hydrol Sci J 36(4):307–330

    Google Scholar 

  • Shi P, Li G, Yuan Y, Huang G, Kuang L (2019) Prediction of dissolved oxygen content in aquaculture using Clustering-based Softplus Extreme Learning Machine. Comput Electron Agric 157(4):329–338. https://doi.org/10.1016/j.compag.2019.01.004

    Article  Google Scholar 

  • Sigaroudi AE, Nayeri ND, Peyrovi H (2013) Antecedents of elderly home residency in cognitive healthy elders: a qualitative study. Global J Health Sci 5:200–2007

    Google Scholar 

  • Taylor KE (2001) Summarizing multiple aspects of model performance in a single diagram. J Geophys Res Atmos 106:7183–7192

    Google Scholar 

  • Tharwat A, Gabe T (2019) Parameters optimization of support vector machines for imbalanceddata using social ski driver algorithm. Neural Comput Appl 32(3):514–527

    Google Scholar 

  • Tharwat A, Darwishb A, Hassanien A (2020) Rough sets and social ski-driver optimization for drug toxicity analysis. Comput Methods Programs Biomed 197:1–11

    Google Scholar 

  • Tiyasha T, Tung TM, Bhagat SK, Tan ML, Jawad AH, Mohtar W, Yassen ZM (2021) Functionalization of remote sensing and on-site data for simulating surface water dissolved oxygen: Development of hybrid tree-based artificial intelligence models. Mar Pollut Bull 170:412–431. https://doi.org/10.1016/j.marpolbul.2021.112639

    Article  Google Scholar 

  • Vapnik VN (1995) The nature of statistical learning theory, vol 10. Springer, New York, pp 250–320

    Google Scholar 

  • Vapnik VN (1998) Statistical learning theory, vol 12. Wiley, New York, pp 250–320

    Google Scholar 

  • Vapnik V, Chervonenkis A (1991) The necessary and sufficient conditions for consistency in the empirical risk minimization method. Pattern Recognit Image Anal 1(3):283–305

    Google Scholar 

  • Wehner MF (2013) Very extreme seasonal precipitation in the NARCCAP ensemble: model performance and projections. Clim Dyn 40(1–2):59–80

    Google Scholar 

  • Xu C, Chen X, Zhang L (2021) Predicting river dissolved oxygen time series based on stand-alone models and hybrid wavelet-based models. J Environ Manage 295(4):166–178. https://doi.org/10.1016/j.jenvman.2021.113085

    Article  Google Scholar 

  • Yaseen ZM, Ehteram M, Sharafati A, Shahid S, Al-Ansari N, El-Shafie A (2018) The integration of nature-inspired algorithms with least square support vector regression models: application to modeling river dissolved oxygen concentration. Water 10(3):11–24

    Google Scholar 

  • Yoon H, Jun SC, Hyun Y, Bae GO, Lee KK (2011) A comparative study of artificial neural networks and support vector machines for predicting groundwater levels in a coastal aquifer. J Hydrol 396(4):128–138

    Google Scholar 

  • Zhu N, Ji X, Tan J, Jiang Y, Gou Y (2021) Prediction of dissolved oxygen concentration in aquatic systems based on transfer learning. Comput Electron Agric 180:385–399. https://doi.org/10.1016/j.compag.2020.105888

    Article  Google Scholar 

  • Zouache D, Arby YO, Nouioua F, Abdelaziz FB (2019) Multi-objective chicken swarm optimization: A novel algorithm for solving multi-objective optimization problems. Comput Ind Eng 129:377–391

    Google Scholar 

Download references

Acknowledgements

The authors are grateful to the Geological Survey of the United States for contributing to the collection of data needed to get the job done.

Funding

The University of Lorestan Khorramabad Iran supported our research work (Grant No. 1).

Author information

Authors and Affiliations

Authors

Contributions

The authors include Dr. Reza Dehghani and Dr.Hassan Torabi Poudeh consistently participated in the preparation of this article.

Corresponding author

Correspondence to Hassan Torabi Poudeh.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Availability of data and material

The data and material availability will be made available to researchers after receiving the email.

Code availability

The code availability will be made available to researchers after receiving the email.

Consent to participate

The study is exempt from review by the Ethics U.S. Geological centre, based on the basis that this type of study is non-human subject research and waived the need for informed consent.

Consent for publication

The author agrees to publish the article in the above publication.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Dehghani, R., Torabi Poudeh, H. & Izadi, Z. Dissolved oxygen concentration predictions for running waters with using hybrid machine learning techniques. Model. Earth Syst. Environ. 8, 2599–2613 (2022). https://doi.org/10.1007/s40808-021-01253-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s40808-021-01253-x

Keyword

Navigation