Skip to main content
Log in

Multilevel split of high-dimensional water quality data using artificial neural networks for the prediction of dissolved oxygen in the Danube River

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

In this study, a self-organizing network-based monitoring location similarity index (LSI) was coupled with Ward neural networks (WNNs) with the aim to create a more accurate, but less complex, multiple sites model for the prediction of dissolved oxygen (DO) content. This multilevel splitting approach comprises the LSI-based grouping of monitoring locations according to their similarity, and virtual splitting of processed data based on their features using WNN. The values of 18 water quality parameters monitored for 12 years at 17 sites on the Danube River flow thought Serbia were used. The optimal input combinations were selected using partial mutual information algorithm with termination based on the Akaike information criterion. LSI-based splitting has yielded two groups of monitoring sites that were modeled with separate WNN models. The number and types of selected inputs differed between those two groups of sites, which was in agreement with possible pollution sources. Multiple performance metrics have revealed that the WNN models perform similar or better than multisite DO prediction models published in the literature, while using two to four times less inputs and data patterns.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  1. Hao RX, Li SM, Li JB, Zhang QK, Liu F (2013) Water quality assessment for wastewater reclamation using principal component analysis. J Environ Inform 21:45–54. https://doi.org/10.3808/jei.201300231

    Article  Google Scholar 

  2. Purkait B, Kadam SS, Das SK (2008) Application of artificial neural network model to study arsenic contamination in groundwater of Malda district, Eastern India. J Environ Inform 12:140–149. https://doi.org/10.3808/jei.200800132

    Article  Google Scholar 

  3. Yeon IS, Kim JH, Jun KW (2008) Application of artificial intelligence models in water quality forecasting. Environ Technol 29:625–631. https://doi.org/10.1080/09593330801984456

    Article  Google Scholar 

  4. Wetzel RG, Likens GE (2000) Dissolved oxygen. Limnological analyses. Springer, New York, pp 73–84. https://doi.org/10.1007/978-1-4757-3250-4_6

    Book  Google Scholar 

  5. Heddam S, Kisi O (2017) Extreme learning machines: a new approach for modeling dissolved oxygen (DO) concentration with and without water quality variables as predictors. Environ Sci Pollut Res 24:16702–16724. https://doi.org/10.1007/s11356-017-9283-z

    Article  Google Scholar 

  6. Mishra AK, Kumar B, Dutta J (2016) Prediction of hydraulic conductivity of soil bentonite mixture using hybrid-ANN approach. J Environ Inform 27:98–105. https://doi.org/10.3808/jei.201500292

    Article  Google Scholar 

  7. Olyaie E, Zare Abyaneh H, Danandeh Mehr A (2017) A comparative analysis among computational intelligence techniques for dissolved oxygen prediction in Delaware River. Geosci Front 8:517–527. https://doi.org/10.1016/j.gsf.2016.04.007

    Article  Google Scholar 

  8. Heddam S (2016) Use of optimally pruned extreme learning machine (OP-ELM) in forecasting dissolved oxygen concentration (DO) several hours in advance: a case study from the Klamath River, Oregon, USA. Environ Process 3:909–937. https://doi.org/10.1007/s40710-016-0172-0

    Article  Google Scholar 

  9. Raheli B, Aalami MT, El-Shafie A, Ghorbani MA, Deo RC (2017) Uncertainty assessment of the multilayer perceptron (MLP) neural network model with implementation of the novel hybrid MLP-FFA method for prediction of biochemical oxygen demand and dissolved oxygen: a case study of Langat River. Environ Earth Sci 76:1–16. https://doi.org/10.1007/s12665-017-6842-z

    Article  Google Scholar 

  10. Ay M, Kişi Ö (2017) Estimation of dissolved oxygen by using neural networks and neuro fuzzy computing techniques. KSCE J Civ Eng 21:1631–1639. https://doi.org/10.1007/s12205-016-0728-6

    Article  Google Scholar 

  11. Keshtegar B, Heddam S (2017) Modeling daily dissolved oxygen concentration using modified response surface method and artificial neural network: a comparative study. Neural Comput Appl. https://doi.org/10.1007/s00521-017-2917-8

    Article  Google Scholar 

  12. Csábrági A, Molnár S, Tanos P, Kovács J (2017) Application of artificial neural networks to the forecasting of dissolved oxygen content in the Hungarian section of the river Danube. Ecol Eng 100:63–72. https://doi.org/10.1016/j.ecoleng.2016.12.027

    Article  Google Scholar 

  13. Dragoi EN, Kovács Zs, Juzsakova T, Curteanu S, Cretescu I (2018) Environmental assessment of surface waters based on monitoring data and neuro-evolutive modelling. Process Saf Environ 120:136–145

    Article  Google Scholar 

  14. Li X, Cheng Z, Yu Q, Bai Y, Li C (2017) Water-quality prediction using multimodal support vector regression: case study of Jialing River, China. J Environ Eng 143:97–107. https://doi.org/10.1061/(asce)ee.1943-7870.0001272

    Article  Google Scholar 

  15. Kisi O, Sanatipour M, Hashemi A, Teimourzadeh K, Shiri J (2013) Modeling of dissolved oxygen in river water using artificial intelligence techniques. J Environ Inform 22:92–101. https://doi.org/10.3808/jei.201300248

    Article  Google Scholar 

  16. Keshtegar B, Heddam S (2018) Modeling daily dissolved oxygen concentration using modified response surface method and artificial neural network: a comparative study. Neural Comput Appl 30:2995–3006

    Article  Google Scholar 

  17. Cao W, Huan J, Liu C, Qin Y, Wu F (2018) A combined model of dissolved oxygen prediction in the pond based on multiple-factor analysis and multi-scale feature extraction. Aquac Eng 84:50–59

    Article  Google Scholar 

  18. Fijani E, Barzegar R, Deo R, Tziritis E, Skordas K (2019) Design and implementation of a hybrid model based on two-layer decomposition method coupled with extreme learning machines to support real-time environmental monitoring of water quality parameters. Sci Total Environ 648:839–853

    Article  Google Scholar 

  19. Singh KP, Basant A, Malik A, Jain G (2009) Artificial neural network modeling of the river water quality—a case study. Ecol Model 220:888–895. https://doi.org/10.1016/j.ecolmodel.2009.01.004

    Article  Google Scholar 

  20. Emamgholizadeh S, Kashi H, Marofpoor I, Zalaghi E (2014) Prediction of water quality parameters of Karoon River (Iran) by artificial intelligence-based models. Int J Environ Sci Technol 11:645–656. https://doi.org/10.1007/s13762-013-0378-x

    Article  Google Scholar 

  21. Ji X, Shang X, Dahlgren RA, Zhang M (2017) Prediction of dissolved oxygen concentration in hypoxic river systems using support vector machine: a case study of Wen-Rui Tang River, China. Environ Sci Pollut Res 24:16062–16076. https://doi.org/10.1007/s11356-017-9243-7

    Article  Google Scholar 

  22. Antanasijević D, Pocajt V, Perić-Grujić A, Ristić M (2014) Modelling of dissolved oxygen in the Danube River using artificial neural networks and Monte Carlo Simulation uncertainty analysis. J Hydrol 519:1895–1907. https://doi.org/10.1016/j.jhydrol.2014.10.009

    Article  Google Scholar 

  23. Šiljić Tomić A, Antanasijević D, Ristić M, Perić-Grujić A, Pocajt V (2018) A linear and non-linear polynomial neural network modeling of dissolved oxygen content in surface water: inter- and extrapolation performance with inputs’ significance analysis. Sci Total Environ 610–611:1038–1046. https://doi.org/10.1016/j.scitotenv.2017.08.192

    Article  Google Scholar 

  24. Kohonen T (2013) Essentials of the self-organizing map. Neural Netw 37:52–65. https://doi.org/10.1016/j.neunet.2012.09.018

    Article  Google Scholar 

  25. Kalteh AM, Hjorth P, Berndtsson R (2008) Review of the self-organizing map (SOM) approach in water resources: analysis, modelling and application. Environ Model Softw 23:835–845. https://doi.org/10.1016/j.envsoft.2007.10.001

    Article  Google Scholar 

  26. Crnković DM, Antanasijević DZ, Pocajt VV, Perić-Grujić AA, Antonović D, Ristić MĐ (2016) Unsupervised classification and multi-criteria decision analysis as chemometric tools for the assessment of sediment quality: a case study of the Danube and Sava River. CATENA 144:11–22. https://doi.org/10.1016/j.catena.2016.04.025

    Article  Google Scholar 

  27. Kohonen T (2001) Self-organizing maps. Springer, Berlin

    Book  Google Scholar 

  28. Huan J, Cao WJ, Liu XQ (2017) A dissolved oxygen prediction method based on K-means clustering and the ELM neural network: a case study of the Changdang Lake, China. Appl Eng Agric 33:461–469. https://doi.org/10.13031/aea.11786

    Article  Google Scholar 

  29. Antanasijević D, Pocajt V, Antanasijević J, Perić-Grujić A, Ristić M (2018) A novel SON2-based similarity index and its application for the rationalization of river water quality monitoring network. River Res Appl 34:144–152. https://doi.org/10.1002/rra.3231

    Article  Google Scholar 

  30. Milošković A, Dojčinović B, Kovačević S, Radojković N, Radenković M, Milošević D, Simić V (2016) Spatial monitoring of heavy metals in the inland waters of Serbia: a multispecies approach based on commercial fish. Environ Sci Pollut Res 23:9918–9933. https://doi.org/10.1007/s11356-016-6207-2

    Article  Google Scholar 

  31. Morina A, Morina F, Djikanović V, Spasić S, Krpo-Ćetković J, Lenhardt M (2016) Seasonal variation in element concentrations in surface sediments of three rivers with different pollution input in Serbia. J Soils Sediments 16:255–265. https://doi.org/10.1007/s11368-015-1211-6

    Article  Google Scholar 

  32. Voza D, Vukovic M, Takic L, Nikolic D, Mladenovic-Ranisavljevic I (2015) Application of multivariate statistical techniques in the water quality assessment of Danube River, Serbia. Arch Environ Prot 41:96–103. https://doi.org/10.1515/aep-2015-0044

    Article  Google Scholar 

  33. Šiljić Tomić AN, Antanasijević DZ, Ristić MĐ, Perić-Grujić AA, Pocajt VV (2016) Modeling the BOD of Danube River in Serbia using spatial, temporal, and input variables optimized artificial neural network models. Environ Monit Assess 188:300. https://doi.org/10.1007/s10661-016-5308-1

    Article  Google Scholar 

  34. Thwin MMT, Quah T-S (2005) Application of neural networks for software quality prediction using object-oriented metrics. J Syst Softw 76:147–156. https://doi.org/10.1016/j.jss.2004.05.001

    Article  Google Scholar 

  35. Smith BA, Mcclendon RW, Hoogenboom G (2007) Improving air temperature prediction with artificial neural networks. Int J Comput Inf Eng 1:3159–3166. https://doi.org/10.1999/1307-6892/10353

    Article  Google Scholar 

  36. Smith BA, Mcclendon RW, Hoogenboom G (2007) An enhanced artificial neural network for air temperature prediction. Int J Comput Inf Eng 1:2166–2177. https://doi.org/10.1999/1307-6892/3911

    Article  Google Scholar 

  37. Mustafa MR, Rezaur RB, Saiedi S, Isa MH (2012) River suspended sediment prediction using various multilayer perceptron neural network training algorithms—a case study in Malaysia. Water Resour Manag 26:1879–1897. https://doi.org/10.1007/s11269-012-9992-5

    Article  Google Scholar 

  38. Johnson SR, Jurs PC (1999) Prediction of the clearing temperatures of a series of liquid crystals from molecular structure. Chem Mater 11:1007–1023. https://doi.org/10.1021/cm980674x

    Article  Google Scholar 

  39. Šindelář R, Babuška R (2004) Input selection for nonlinear regression models. IEEE Trans Fuzzy Syst 12:688–696. https://doi.org/10.1109/TFUZZ.2004.834810

    Article  Google Scholar 

  40. May RJ, Maier HR, Dandy GC, Fernando TMKG (2008) Non-linear variable selection for artificial neural networks using partial mutual information. Environ Model Softw 23:1312–1326. https://doi.org/10.1016/j.envsoft.2008.03.007

    Article  Google Scholar 

  41. Sharma A (2000) Seasonal to interannual rainfall probabilistic forecasts for improved water supply management: part 1—a strategy for system predictor identification. J Hydrol 239:232–239. https://doi.org/10.1016/S0022-1694(00)00346-2

    Article  Google Scholar 

  42. May RJ, Dandy GC, Maier HR, Nixon JB (2008) Application of partial mutual information variable selection to ANN forecasting of water quality in water distribution systems. Environ Model Softw 23:1289–1299. https://doi.org/10.1016/j.envsoft.2008.03.008

    Article  Google Scholar 

  43. Fernando TMKG, Maier HR, Dandy GC (2009) Selection of input variables for data driven models: an average shifted histogram partial mutual information estimator approach. J Hydrol 367:165–176. https://doi.org/10.1016/j.jhydrol.2008.10.019

    Article  Google Scholar 

  44. Li X, Maier HR, Zecchin AC (2015) Improved PMI-based input variable selection approach for artificial neural network and other data driven environmental and water resource models. Environ Model Softw 65:15–29. https://doi.org/10.1016/j.envsoft.2014.11.028

    Article  Google Scholar 

  45. Ward Systems Group Inc. (2008) Neuroshell 2 v4.2

Download references

Acknowledgements

The authors are grateful to the Ministry of Education, Science and Technological Development of the Republic of Serbia for financial support (Project Number 172007).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Davor Antanasijević.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (DOCX 472 kb)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Antanasijević, D., Pocajt, V., Perić-Grujić, A. et al. Multilevel split of high-dimensional water quality data using artificial neural networks for the prediction of dissolved oxygen in the Danube River. Neural Comput & Applic 32, 3957–3966 (2020). https://doi.org/10.1007/s00521-019-04079-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-019-04079-y

Keywords

Navigation