Abstract
In this study, a novel event detection model based on data-driven estimation and support vector machine (SVM) classification was developed and assessed. The developed model takes advantage of the data-driven model - namely artificial neural networks (ANNs) - to predict the complicated behavior of water quality parameters without relevant physical and chemical knowledge. In addition, SVM presents high classification performance when dealing with high-dimensional data and has a better generalization ability than ANNs so that SVM can complement ANN predictions. Key parameters of SVM were optimized by genetic algorithm. After calculation of ANN prediction error and outlier classification by SVM, the event probability was estimated by Bayesian sequence analysis. The performance of the proposed model was evaluated using data from a real water distribution system with randomly simulated events. The results illustrated that the proposed model exhibited a great detection ability compared with two models with analogous structures, a pure SVM classification model and a conventional ANN-threshold classification model, demonstrating the superiority of the hybrid data-driven – SVM classification model.
Similar content being viewed by others
References
Abokifa AA, Haddad K, Lo C, Biswas P (2019) Real-time identification of cyber-physical attacks on water distribution systems via machine learning-based anomaly detection techniques. J Water Resour Plan Manag 145(1)
Arad J, Housh M, Perelman L, Ostfeld A (2013) A dynamic thresholds scheme for contaminant event detection in water distribution systems. Water Res 47(5):1899–1908
Basheer IA, Hajmeer M (2000) Artificial neural networks: fundamentals, computing, design, and application. J Microbiol Methods 43(1):3–31
Bazzani A, Bevilacqua A, Bollini D, Brancaccio R, Campanini R, Lanconelli N, Riccardi A, Romani D (2001) An SVM classifier to separate false signals from microcalcifications in digital mammograms. Phys Med Biol 46(6):1651–1663
Boser BE, Guyon IM, Vapnik VN (1992) Training algorithm for optimal margin classifiers. Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, 144–152
Burchard-Levine A, Liu S, Vince F, Li M, Ostfeld A (2014) A hybrid evolutionary data driven model for river water quality early warning. J Environ Manag 143:8–16
Burges CJC (1998) A tutorial on support vector machines for pattern recognition. Data Min Knowl Disc 2(2):121–167
Byvatov E, Fechner U, Sadowski J, Schneider G (2003) Comparison of support vector machine and artificial neural network Systems for Drug/nondrug classification. J Chem Inf Comput Sci 43(6):1882–1889
Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3):273–297
Hall J, Zaffiro AD, Marx RB, Kefauver PC, Radha Krishnan E, Haught RC, Herrmann JG (2007) On-line water quality parameters as indicators of distribution system contamination. J Am Water Works Assoc 99(1):66–77
Hart D, McKenna SA, Klise K, Cruz V, Wilson M (2007) CANARY: A water quality event detection algorithm development tool. Restoring Our Natural Habitat - Proceedings of the 2007 World Environmental and Water Resources Congress 1–9
Hart WE, Murray R (2010) Review of sensor placement strategies for contamination warning systems in drinking water distribution systems. J Water Resour Plan Manag 136(6):611–619
Hou D, Song X, Zhang G, Zhang H, Loaiciga H (2013a) An early warning and control system for urban, drinking water quality protection: China's experience. Environ Sci Pollut Res 20(7):4496–4508
Hou D, He H, Huang P, Zhang G, Loaiciga H (2013b) Detection of water-quality contamination events based on multi-sensor fusion using an extented Dempster-Shafer method. Meas Sci Technol 24(5)
Hou D-B, Chen Y, Zhao H-F, Huang P-J, Zhang G-X (2013c) Water quality anomaly detection method based on RBF neural network and wavelet analysis. Transducer Microsyst Technol 32(2):138–141
Housh M, Ostfeld A (2015) An integrated logit model for contamination event detection in water distribution systems. Water Res 75:210–223
Khorshidi MS, Nikoo MR, Ebrahimi E, Sadegh M (2019) A robust decision support leader-follower framework for design of contamination warning system in water distribution network. J Clean Prod 214:666–673
Liu S, Che H, Smith K, Chang T (2015a) A real time method of contaminant classification using conventional water quality sensors. J Environ Manag 154:13–21
Liu S, Che H, Smith K, Lei M, Li R (2015b) Performance evaluation for three pollution detection methods using data from a real contamination accident. J Environ Manag 161:385–391
Liu S, Smith K, Che H (2015c) A multivariate based event detection method and performance comparison with two baseline methods. Water Res 80:109–118
Maier HR, Dandy GC (2000) Neural networks for the prediction and forecasting of water resources variables: a review of modelling issues and applications. Environ Model Softw 15(1):101–124
MATLAB (2014a) MATLAB Version R2014a. The MathWorks Inc.
McKenna SA, Wilson M, Klise KA (2008) Detecting changes in water quality data. J Am Water Works Assoc 100(1):74–85
Oliker N, Ostfeld A (2014) A coupled classification – evolutionary optimization model for contamination event detection in water distribution systems. Water Res 51:234–245
Perelman L, Arad J, Housh M, Ostfeld A (2012) Event detection in water distribution systems from multivariate water quality time series. Environ Sci Technol 46(15):8212–8219
Rodriguez MJ, Sérodes JB (1998) Assessing empirical linear and non-linear modelling of residual chlorine in urban drinking water systems. Environ Model Softw 14(1):93–102
Samanta B (2004) Gear fault detection using artificial neural networks and support vector machines with genetic algorithms. Mech Syst Signal Process 18(3):625–644
Swets JA (1988) Measuring the accuracy of diagnostic systems. Science 240(4857):1285–1293
Taormina R, Galelli S (2018) Deep-learning approach to the detection and localization of cyber-physical attacks on water distribution systems. J Water Resour Plan Manag 144(10)
Wang W, Xu Z, Lu W, Zhang X (2003) Determination of the spread parameter in the Gaussian kernel for classification and regression. Neurocomputing 55(3-4):643–663
Acknowledgements
This study was supported in part by the Natural Science Foundation of China (Nos. 51978483, 51778444 and 51808222), National Major Science and Technology Project of China (No. 2017ZX07207004), the Fundamental Research Funds for the Central Universities (No. 22120180123), Shanghai Sailing Program (No. 18YF1406000) and the Ministry of the Science and Technology in Taiwan (MOST-107-2221-E-992-008-MY3).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
ESM 1
(PDF 371 kb)
Rights and permissions
About this article
Cite this article
Zou, XY., Lin, YL., Xu, B. et al. A Novel Event Detection Model for Water Distribution Systems Based on Data-Driven Estimation and Support Vector Machine Classification. Water Resour Manage 33, 4569–4581 (2019). https://doi.org/10.1007/s11269-019-02317-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11269-019-02317-5