Abstract
With the increasing volume of environmental monitoring data, extracting valuable insights from multivariate time series sensor data can facilitate comprehensive information utilization and support informed decision-making in environmental management. However, there is a dearth of comprehensive research on multivariate data analysis for process monitoring in typical polluting enterprises. In this study, an artificial neural network model based on back-propagation algorithm (BP-ANN) was developed to predict the wastewater and exhaust gas emissions using IoT data obtained from process monitoring of a typical polluting enterprise located in Taizhou, Zhejiang Province, China. The results indicate that the model constructed has a high predictive coefficient of determination (R2) with values of 0.8510, 0.9565, 0.9561, 0.9677, and 0.9061 for chemical oxygen demand (COD), potential of hydrogen (pH), electrical conductivity (EC), flue gas emission (FGE), and non-methane hydrocarbon concentration (NMHC) respectively. For the first time, the variable importance measure (VIM)–assisted BP-ANN was employed to investigate the internal and external correlations between wastewater and exhaust gas treatment, thereby enhancing the interpretability of mapping features in the BP-ANN model. The predicted errors for pH and FGE have been demonstrated to fall within the range of − 0.62 ~ 0.30 and − 0.21 ~ 0.15 m3/s, respectively, with average relative errors of 1.05% and 9.60%, which is advantageous in detecting anomalous data and forecasting pollution indicator values. Our approach successfully addresses the challenge of segregating data analysis for wastewater disposal and exhaust gas disposal in the process monitoring of polluting enterprises, while also unearthing potential variables that significantly contribute to the BP-ANN model, thereby facilitating the selection and extraction of characteristic variables.
Similar content being viewed by others
Data availability
The datasets generated during and/or analyzed during the current study are available from the corresponding author upon reasonable request.
References
Aldaghi, T., & Javanmard, S. (2021). The evaluation of wastewater treatment plant performance: A data mining approach. Journal of Engineering, Design and Technology Ahead-of-Print. https://doi.org/10.1108/JEDT-07-2021-0394
Bahramian, M., Dereli R. K., Zhao, W., Giberti, M., & Casey, E. (2023). Data to intelligence: The role of data-driven models in wastewater treatment. Expert Systems with Applications, 217, 119453. https://doi.org/10.1016/j.eswa.2022.119453
Barcellos, D. d. S., & Souza, F. T. d. (2022). Optimization of water quality monitoring programs by data mining. Water Research, 221, 118805. https://doi.org/10.1016/j.watres.2022.118805
Bekkari, N., & Zeddouri, A. (2019). Using artificial neural network for predicting and controlling the effluent chemical oxygen demand in wastewater treatment plant. Management of Environmental Quality: An International Journal, 30, 593–608. https://doi.org/10.1108/MEQ-04-2018-0084
Cabaneros, S. M., Calautit, J. K., & Hughes, B. R. (2019). A review of artificial neural network models for ambient air pollution prediction. Environmental Modelling & Software, 119, 285–304. https://doi.org/10.1016/j.envsoft.2019.06.014
Castrillo, M., & García, Á. L. (2020). Estimation of high frequency nutrient concentrations from water quality surrogates using machine learning methods. Water Research, 172, 115490. https://doi.org/10.1016/j.watres.2020.115490
Dikshit, A., Pradhan, B., & Santosh, M. (2022). Artificial neural networks in drought prediction in the 21st century–A scientometric analysis. Applied Soft Computing, 114, 108080. https://doi.org/10.1016/j.asoc.2021.108080
Han, K., & Wang, Y. (2021). A review of artificial neural network techniques for environmental issues prediction. Journal of Thermal Analysis and Calorimetry, 145, 2191–2207. https://doi.org/10.1007/s10973-021-10748-9
Haykin, S. S. (2010). Neural networks and learning machines. Pearson Press.
Jadhav, A. R., Pathak, P. D., & Raut, R. Y. (2023). Water and wastewater quality prediction: Current trends and challenges in the implementation of artificial neural network. Environmental Monitoring and Assessment, 195, 321. https://doi.org/10.1007/s10661-022-10904-0
Krupnova, T. G., Rakova, O. V., Bondarenko, K. A., & Tretyakova, V. D. (2022). Environmental justice and the use of artificial intelligence in urban air pollution monitoring. Big Data and Cognitive Computing, 6(3), 75. https://doi.org/10.3390/bdcc6030075
Li, B., Lu, C., Zhao, J., Tian, J., Sun, J., & Hu, C. (2023). Operational parameter prediction of electrocoagulation system in a rural decentralized water treatment plant by interpretable machine learning model. Journal of Environmental Management, 333, 117416. https://doi.org/10.1016/j.jenvman.2023.117416
Li, M., Fu, H., Du, Y., Huang, X., Zhang, T., Tang, H., & Li, H. (2022). Laser induced breakdown spectroscopy combined with hybrid variable selection for the prediction of the environmental risk Nemerow index of heavy metals in oily sludge. Journal of Analytical Atomic Spectrometry, 37, 1099–1108. https://doi.org/10.1039/D2JA00048B
Liukkonen, M., Laakso, I., & Hiltunen, Y. (2013). Advanced monitoring platform for industrial wastewater treatment: Multivariable approach using the self-organizing map. Environmental Modelling & Software, 48, 193–201. https://doi.org/10.1016/j.envsoft.2013.07.005
Masood, A., & Ahmad, K. (2021). A review on emerging artificial intelligence (AI) techniques for air pollution forecasting: Fundamentals, application and performance. Journal of Cleaner Production, 322, 129072. https://doi.org/10.1016/j.jclepro.2021.129072
Nafouanti, M. B., Li, J., Nyakilla, E. E., Mwakipunda, G. C., & Mulashani, A. (2023). A novel hybrid random forest linear model approach for forecasting groundwater fluoride contamination. Environmental Science and Pollution Research, 30, 50661–50674. https://doi.org/10.1007/s11356-023-25886-w
Nair, A., Hykkerud, A., & Ratnaweera, H. (2022). Estimating phosphorus and COD concentrations using a hybrid soft sensor: A case study in a norwegian municipal wastewater treatment plant. Water, 14(3), 332. https://doi.org/10.3390/w14030332
Nisbet, R., Miner, G., & Yale, K. (2018). Chapter 3 - The data mining and predictive analytic process, Handbook of statistical analysis and data mining applications (2nd ed.). Academic Press.
Norouzi, H., Moghaddam, A. A., Celico, F., & Shiri, J. (2021). Assessment of groundwater vulnerability using genetic algorithm and random forest methods (case study: Miandoab plain, NW of Iran). Environmental Science and Pollution Research, 28, 39598–39613. https://doi.org/10.1007/s11356-021-12714-2
Peng, C., Kai, W., Kun, Z., & Fanchao, M. (2022). Monitoring of wastewater treatment process based on multi-stage variational autoencoder. Expert Systems with Applications, 207, 117919. https://doi.org/10.1016/j.eswa.2022.117919
Qiu, R., Wang, Y., Wang, D., Qiu, W., Wu, J., & Tao, Y. (2020). Water temperature forecasting based on modified artificial neural network methods: Two cases of the Yangtze River. Science of the Total Environment, 737, 139729. https://doi.org/10.1016/j.scitotenv.2020.139729
Sanchez-Fernández, A., Fuente, M. J., & Sainz-Palmero, G. I. (2015). Fault detection in wastewater treatment plants using distributed PCA methods. 2015 IEEE 20th Conference on Emerging Technologies & Factory Automation (ETFA), 1–7.
Velasco, L. C., Bongat, J. F., Castillon, C., Laurente, J., & Tabanao, E. (2023). Days-ahead water level forecasting using artificial neural networks for watersheds. Mathematical Biosciences and Engineering, 20, 758–774. https://doi.org/10.3934/mbe.2023035
Vousoughi, F. D. (2023). Wavelet-based de-noising in groundwater quality and quantity prediction by an artificial neural network. Water Supply, 23, 1333–1348. https://doi.org/10.2166/ws.2023.021
Wang, D., Thunéll, S., Lindberg, U., Jiang, L., Trygg, J., Tysklind, M., & Souihi, N. (2021). A machine learning framework to improve effluent quality control in wastewater treatment plants. Science of the Total Environment, 784, 147138. https://doi.org/10.1016/j.scitotenv.2021.147138
Wei, P., Lu, Z., & Song, J. (2015). A comprehensive comparison of two variable importance analysis techniques in high dimensions: Application to an environmental multi-indicators system. Environmental Modelling & Software, 70, 178–190. https://doi.org/10.1016/j.envsoft.2015.04.015
Wei, W., Ramalho, O., Malingre, L., Sivanantham, S., Little, J. C., & Mandin, C. (2019). Machine learning and statistical models for predicting indoor air quality. Indoor Air, 29, 704–726. https://doi.org/10.1111/ina.12580
Xiao, H., Huang, D., Pan, Y., Liu, Y., & Song, K. (2017). Fault diagnosis and prognosis of wastewater processes with incomplete data by the auto-associative neural networks and ARMA model. Chemometrics and Intelligent Laboratory Systems, 161, 96–107. https://doi.org/10.1016/j.chemolab.2016.12.009
Xu, A., Chang, H., Xu, Y., Li, R., Li, X., & Zhao, Y. (2021). Applying artificial neural networks (ANNs) to solve solid waste-related issues: A critical review. Waste Management, 124, 385–402. https://doi.org/10.1016/j.wasman.2021.02.029
Yang, S.-S., Yu, X.-L., Ding, M.-Q., He, L., Cao, G.-L., Zhao, L., Tao, Y., Pang, J.-W., Bai, S.-W., Ding, J., & Ren, N.-Q. (2021). Simulating a combined lysis-cryptic and biological nitrogen removal system treating domestic wastewater at low C/N ratios using artificial neural network. Water Research, 189, 116576. https://doi.org/10.1016/j.watres.2020.116576
Zhai, W. (2019). Analysis technology of environmental monitoring data based on internet of things environment and improved neural network algorithm. Applied Ecology and Environmental Research, 17, 14505–14516. https://doi.org/10.15666/aeer/1706_1450514516
Zhong, S., Zhang, K., Bagheri, M., Burken, J. G., Gu, A., Li, B., Ma, X., Marrone, B. L., Ren, Z. J., Schrier, J., Shi, W., Tan, H., Wang, T., Wang, X., Wong, B. M., Xiao, X., Yu, X., Zhu, J.-J., & Zhang, H. (2021). Machine learning: New ideas and tools in environmental science and engineering. Environmental Science & Technology, 55, 12741–12754. https://doi.org/10.1021/acs.est.1c01339
Funding
This work was supported by the Science and Technology Program of Zhejiang Province (No. 2021C03178), the Ecological Environment Research and Achievement Extension Project of Zhejiang Province (No. 2022HT0010), and the Science and Technology Plan Project of Taizhou (No. 22gyb37).
Author information
Authors and Affiliations
Contributions
All authors contributed to the study conception and design. Wenya Zhao: methodology; investigation; writing—original draft. Peili Zhang: project supervision; research design; data collection; writing—editing and modification. Da Chen: data processing; writing—editing and modification. Hao Wang: funding support, data collection. Binghua Gu: artificial intelligence technology support. Jue Zhang: writing—editing and modification.
Corresponding author
Ethics declarations
Ethics approval
Not applicable.
Competing interests
The authors declare no competing interests.
Disclaimer
All authors have read, understood, and have complied as applicable with the statement on “Ethical responsibilities of Authors” as found in the instructions for authors and are aware that with minor exceptions, no changes can be made to authorship once the paper is submitted.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zhao, W., Zhang, P., Chen, D. et al. Data mining from process monitoring of typical polluting enterprise. Environ Monit Assess 195, 1109 (2023). https://doi.org/10.1007/s10661-023-11733-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10661-023-11733-5