Abstract
In this study, a self-organizing network-based monitoring location similarity index (LSI) was coupled with Ward neural networks (WNNs) with the aim to create a more accurate, but less complex, multiple sites model for the prediction of dissolved oxygen (DO) content. This multilevel splitting approach comprises the LSI-based grouping of monitoring locations according to their similarity, and virtual splitting of processed data based on their features using WNN. The values of 18 water quality parameters monitored for 12 years at 17 sites on the Danube River flow thought Serbia were used. The optimal input combinations were selected using partial mutual information algorithm with termination based on the Akaike information criterion. LSI-based splitting has yielded two groups of monitoring sites that were modeled with separate WNN models. The number and types of selected inputs differed between those two groups of sites, which was in agreement with possible pollution sources. Multiple performance metrics have revealed that the WNN models perform similar or better than multisite DO prediction models published in the literature, while using two to four times less inputs and data patterns.
Similar content being viewed by others
References
Hao RX, Li SM, Li JB, Zhang QK, Liu F (2013) Water quality assessment for wastewater reclamation using principal component analysis. J Environ Inform 21:45–54. https://doi.org/10.3808/jei.201300231
Purkait B, Kadam SS, Das SK (2008) Application of artificial neural network model to study arsenic contamination in groundwater of Malda district, Eastern India. J Environ Inform 12:140–149. https://doi.org/10.3808/jei.200800132
Yeon IS, Kim JH, Jun KW (2008) Application of artificial intelligence models in water quality forecasting. Environ Technol 29:625–631. https://doi.org/10.1080/09593330801984456
Wetzel RG, Likens GE (2000) Dissolved oxygen. Limnological analyses. Springer, New York, pp 73–84. https://doi.org/10.1007/978-1-4757-3250-4_6
Heddam S, Kisi O (2017) Extreme learning machines: a new approach for modeling dissolved oxygen (DO) concentration with and without water quality variables as predictors. Environ Sci Pollut Res 24:16702–16724. https://doi.org/10.1007/s11356-017-9283-z
Mishra AK, Kumar B, Dutta J (2016) Prediction of hydraulic conductivity of soil bentonite mixture using hybrid-ANN approach. J Environ Inform 27:98–105. https://doi.org/10.3808/jei.201500292
Olyaie E, Zare Abyaneh H, Danandeh Mehr A (2017) A comparative analysis among computational intelligence techniques for dissolved oxygen prediction in Delaware River. Geosci Front 8:517–527. https://doi.org/10.1016/j.gsf.2016.04.007
Heddam S (2016) Use of optimally pruned extreme learning machine (OP-ELM) in forecasting dissolved oxygen concentration (DO) several hours in advance: a case study from the Klamath River, Oregon, USA. Environ Process 3:909–937. https://doi.org/10.1007/s40710-016-0172-0
Raheli B, Aalami MT, El-Shafie A, Ghorbani MA, Deo RC (2017) Uncertainty assessment of the multilayer perceptron (MLP) neural network model with implementation of the novel hybrid MLP-FFA method for prediction of biochemical oxygen demand and dissolved oxygen: a case study of Langat River. Environ Earth Sci 76:1–16. https://doi.org/10.1007/s12665-017-6842-z
Ay M, Kişi Ö (2017) Estimation of dissolved oxygen by using neural networks and neuro fuzzy computing techniques. KSCE J Civ Eng 21:1631–1639. https://doi.org/10.1007/s12205-016-0728-6
Keshtegar B, Heddam S (2017) Modeling daily dissolved oxygen concentration using modified response surface method and artificial neural network: a comparative study. Neural Comput Appl. https://doi.org/10.1007/s00521-017-2917-8
Csábrági A, Molnár S, Tanos P, Kovács J (2017) Application of artificial neural networks to the forecasting of dissolved oxygen content in the Hungarian section of the river Danube. Ecol Eng 100:63–72. https://doi.org/10.1016/j.ecoleng.2016.12.027
Dragoi EN, Kovács Zs, Juzsakova T, Curteanu S, Cretescu I (2018) Environmental assessment of surface waters based on monitoring data and neuro-evolutive modelling. Process Saf Environ 120:136–145
Li X, Cheng Z, Yu Q, Bai Y, Li C (2017) Water-quality prediction using multimodal support vector regression: case study of Jialing River, China. J Environ Eng 143:97–107. https://doi.org/10.1061/(asce)ee.1943-7870.0001272
Kisi O, Sanatipour M, Hashemi A, Teimourzadeh K, Shiri J (2013) Modeling of dissolved oxygen in river water using artificial intelligence techniques. J Environ Inform 22:92–101. https://doi.org/10.3808/jei.201300248
Keshtegar B, Heddam S (2018) Modeling daily dissolved oxygen concentration using modified response surface method and artificial neural network: a comparative study. Neural Comput Appl 30:2995–3006
Cao W, Huan J, Liu C, Qin Y, Wu F (2018) A combined model of dissolved oxygen prediction in the pond based on multiple-factor analysis and multi-scale feature extraction. Aquac Eng 84:50–59
Fijani E, Barzegar R, Deo R, Tziritis E, Skordas K (2019) Design and implementation of a hybrid model based on two-layer decomposition method coupled with extreme learning machines to support real-time environmental monitoring of water quality parameters. Sci Total Environ 648:839–853
Singh KP, Basant A, Malik A, Jain G (2009) Artificial neural network modeling of the river water quality—a case study. Ecol Model 220:888–895. https://doi.org/10.1016/j.ecolmodel.2009.01.004
Emamgholizadeh S, Kashi H, Marofpoor I, Zalaghi E (2014) Prediction of water quality parameters of Karoon River (Iran) by artificial intelligence-based models. Int J Environ Sci Technol 11:645–656. https://doi.org/10.1007/s13762-013-0378-x
Ji X, Shang X, Dahlgren RA, Zhang M (2017) Prediction of dissolved oxygen concentration in hypoxic river systems using support vector machine: a case study of Wen-Rui Tang River, China. Environ Sci Pollut Res 24:16062–16076. https://doi.org/10.1007/s11356-017-9243-7
Antanasijević D, Pocajt V, Perić-Grujić A, Ristić M (2014) Modelling of dissolved oxygen in the Danube River using artificial neural networks and Monte Carlo Simulation uncertainty analysis. J Hydrol 519:1895–1907. https://doi.org/10.1016/j.jhydrol.2014.10.009
Šiljić Tomić A, Antanasijević D, Ristić M, Perić-Grujić A, Pocajt V (2018) A linear and non-linear polynomial neural network modeling of dissolved oxygen content in surface water: inter- and extrapolation performance with inputs’ significance analysis. Sci Total Environ 610–611:1038–1046. https://doi.org/10.1016/j.scitotenv.2017.08.192
Kohonen T (2013) Essentials of the self-organizing map. Neural Netw 37:52–65. https://doi.org/10.1016/j.neunet.2012.09.018
Kalteh AM, Hjorth P, Berndtsson R (2008) Review of the self-organizing map (SOM) approach in water resources: analysis, modelling and application. Environ Model Softw 23:835–845. https://doi.org/10.1016/j.envsoft.2007.10.001
Crnković DM, Antanasijević DZ, Pocajt VV, Perić-Grujić AA, Antonović D, Ristić MĐ (2016) Unsupervised classification and multi-criteria decision analysis as chemometric tools for the assessment of sediment quality: a case study of the Danube and Sava River. CATENA 144:11–22. https://doi.org/10.1016/j.catena.2016.04.025
Kohonen T (2001) Self-organizing maps. Springer, Berlin
Huan J, Cao WJ, Liu XQ (2017) A dissolved oxygen prediction method based on K-means clustering and the ELM neural network: a case study of the Changdang Lake, China. Appl Eng Agric 33:461–469. https://doi.org/10.13031/aea.11786
Antanasijević D, Pocajt V, Antanasijević J, Perić-Grujić A, Ristić M (2018) A novel SON2-based similarity index and its application for the rationalization of river water quality monitoring network. River Res Appl 34:144–152. https://doi.org/10.1002/rra.3231
Milošković A, Dojčinović B, Kovačević S, Radojković N, Radenković M, Milošević D, Simić V (2016) Spatial monitoring of heavy metals in the inland waters of Serbia: a multispecies approach based on commercial fish. Environ Sci Pollut Res 23:9918–9933. https://doi.org/10.1007/s11356-016-6207-2
Morina A, Morina F, Djikanović V, Spasić S, Krpo-Ćetković J, Lenhardt M (2016) Seasonal variation in element concentrations in surface sediments of three rivers with different pollution input in Serbia. J Soils Sediments 16:255–265. https://doi.org/10.1007/s11368-015-1211-6
Voza D, Vukovic M, Takic L, Nikolic D, Mladenovic-Ranisavljevic I (2015) Application of multivariate statistical techniques in the water quality assessment of Danube River, Serbia. Arch Environ Prot 41:96–103. https://doi.org/10.1515/aep-2015-0044
Šiljić Tomić AN, Antanasijević DZ, Ristić MĐ, Perić-Grujić AA, Pocajt VV (2016) Modeling the BOD of Danube River in Serbia using spatial, temporal, and input variables optimized artificial neural network models. Environ Monit Assess 188:300. https://doi.org/10.1007/s10661-016-5308-1
Thwin MMT, Quah T-S (2005) Application of neural networks for software quality prediction using object-oriented metrics. J Syst Softw 76:147–156. https://doi.org/10.1016/j.jss.2004.05.001
Smith BA, Mcclendon RW, Hoogenboom G (2007) Improving air temperature prediction with artificial neural networks. Int J Comput Inf Eng 1:3159–3166. https://doi.org/10.1999/1307-6892/10353
Smith BA, Mcclendon RW, Hoogenboom G (2007) An enhanced artificial neural network for air temperature prediction. Int J Comput Inf Eng 1:2166–2177. https://doi.org/10.1999/1307-6892/3911
Mustafa MR, Rezaur RB, Saiedi S, Isa MH (2012) River suspended sediment prediction using various multilayer perceptron neural network training algorithms—a case study in Malaysia. Water Resour Manag 26:1879–1897. https://doi.org/10.1007/s11269-012-9992-5
Johnson SR, Jurs PC (1999) Prediction of the clearing temperatures of a series of liquid crystals from molecular structure. Chem Mater 11:1007–1023. https://doi.org/10.1021/cm980674x
Šindelář R, Babuška R (2004) Input selection for nonlinear regression models. IEEE Trans Fuzzy Syst 12:688–696. https://doi.org/10.1109/TFUZZ.2004.834810
May RJ, Maier HR, Dandy GC, Fernando TMKG (2008) Non-linear variable selection for artificial neural networks using partial mutual information. Environ Model Softw 23:1312–1326. https://doi.org/10.1016/j.envsoft.2008.03.007
Sharma A (2000) Seasonal to interannual rainfall probabilistic forecasts for improved water supply management: part 1—a strategy for system predictor identification. J Hydrol 239:232–239. https://doi.org/10.1016/S0022-1694(00)00346-2
May RJ, Dandy GC, Maier HR, Nixon JB (2008) Application of partial mutual information variable selection to ANN forecasting of water quality in water distribution systems. Environ Model Softw 23:1289–1299. https://doi.org/10.1016/j.envsoft.2008.03.008
Fernando TMKG, Maier HR, Dandy GC (2009) Selection of input variables for data driven models: an average shifted histogram partial mutual information estimator approach. J Hydrol 367:165–176. https://doi.org/10.1016/j.jhydrol.2008.10.019
Li X, Maier HR, Zecchin AC (2015) Improved PMI-based input variable selection approach for artificial neural network and other data driven environmental and water resource models. Environ Model Softw 65:15–29. https://doi.org/10.1016/j.envsoft.2014.11.028
Ward Systems Group Inc. (2008) Neuroshell 2 v4.2
Acknowledgements
The authors are grateful to the Ministry of Education, Science and Technological Development of the Republic of Serbia for financial support (Project Number 172007).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Antanasijević, D., Pocajt, V., Perić-Grujić, A. et al. Multilevel split of high-dimensional water quality data using artificial neural networks for the prediction of dissolved oxygen in the Danube River. Neural Comput & Applic 32, 3957–3966 (2020). https://doi.org/10.1007/s00521-019-04079-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-019-04079-y