Skip to main content

Advertisement

Log in

Data mining from process monitoring of typical polluting enterprise

  • Research
  • Published:
Environmental Monitoring and Assessment Aims and scope Submit manuscript

Abstract

With the increasing volume of environmental monitoring data, extracting valuable insights from multivariate time series sensor data can facilitate comprehensive information utilization and support informed decision-making in environmental management. However, there is a dearth of comprehensive research on multivariate data analysis for process monitoring in typical polluting enterprises. In this study, an artificial neural network model based on back-propagation algorithm (BP-ANN) was developed to predict the wastewater and exhaust gas emissions using IoT data obtained from process monitoring of a typical polluting enterprise located in Taizhou, Zhejiang Province, China. The results indicate that the model constructed has a high predictive coefficient of determination (R2) with values of 0.8510, 0.9565, 0.9561, 0.9677, and 0.9061 for chemical oxygen demand (COD), potential of hydrogen (pH), electrical conductivity (EC), flue gas emission (FGE), and non-methane hydrocarbon concentration (NMHC) respectively. For the first time, the variable importance measure (VIM)–assisted BP-ANN was employed to investigate the internal and external correlations between wastewater and exhaust gas treatment, thereby enhancing the interpretability of mapping features in the BP-ANN model. The predicted errors for pH and FGE have been demonstrated to fall within the range of − 0.62 ~ 0.30 and − 0.21 ~ 0.15 m3/s, respectively, with average relative errors of 1.05% and 9.60%, which is advantageous in detecting anomalous data and forecasting pollution indicator values. Our approach successfully addresses the challenge of segregating data analysis for wastewater disposal and exhaust gas disposal in the process monitoring of polluting enterprises, while also unearthing potential variables that significantly contribute to the BP-ANN model, thereby facilitating the selection and extraction of characteristic variables.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Data availability

The datasets generated during and/or analyzed during the current study are available from the corresponding author upon reasonable request.

References

  • Aldaghi, T., & Javanmard, S. (2021). The evaluation of wastewater treatment plant performance: A data mining approach. Journal of Engineering, Design and Technology Ahead-of-Print. https://doi.org/10.1108/JEDT-07-2021-0394

    Article  Google Scholar 

  • Bahramian, M., Dereli R. K., Zhao, W., Giberti, M., & Casey, E. (2023). Data to intelligence: The role of data-driven models in wastewater treatment. Expert Systems with Applications, 217, 119453. https://doi.org/10.1016/j.eswa.2022.119453

  • Barcellos, D. d. S., & Souza, F. T. d. (2022). Optimization of water quality monitoring programs by data mining. Water Research, 221, 118805. https://doi.org/10.1016/j.watres.2022.118805

  • Bekkari, N., & Zeddouri, A. (2019). Using artificial neural network for predicting and controlling the effluent chemical oxygen demand in wastewater treatment plant. Management of Environmental Quality: An International Journal, 30, 593–608. https://doi.org/10.1108/MEQ-04-2018-0084

    Article  Google Scholar 

  • Cabaneros, S. M., Calautit, J. K., & Hughes, B. R. (2019). A review of artificial neural network models for ambient air pollution prediction. Environmental Modelling & Software, 119, 285–304. https://doi.org/10.1016/j.envsoft.2019.06.014

    Article  Google Scholar 

  • Castrillo, M., & García, Á. L. (2020). Estimation of high frequency nutrient concentrations from water quality surrogates using machine learning methods. Water Research, 172, 115490. https://doi.org/10.1016/j.watres.2020.115490

  • Dikshit, A., Pradhan, B., & Santosh, M. (2022). Artificial neural networks in drought prediction in the 21st century–A scientometric analysis. Applied Soft Computing, 114, 108080. https://doi.org/10.1016/j.asoc.2021.108080

  • Han, K., & Wang, Y. (2021). A review of artificial neural network techniques for environmental issues prediction. Journal of Thermal Analysis and Calorimetry, 145, 2191–2207. https://doi.org/10.1007/s10973-021-10748-9

    Article  CAS  Google Scholar 

  • Haykin, S. S. (2010). Neural networks and learning machines. Pearson Press.

    Google Scholar 

  • Jadhav, A. R., Pathak, P. D., & Raut, R. Y. (2023). Water and wastewater quality prediction: Current trends and challenges in the implementation of artificial neural network. Environmental Monitoring and Assessment, 195, 321. https://doi.org/10.1007/s10661-022-10904-0

  • Krupnova, T. G., Rakova, O. V., Bondarenko, K. A., & Tretyakova, V. D. (2022). Environmental justice and the use of artificial intelligence in urban air pollution monitoring. Big Data and Cognitive Computing, 6(3), 75. https://doi.org/10.3390/bdcc6030075

    Article  Google Scholar 

  • Li, B., Lu, C., Zhao, J., Tian, J., Sun, J., & Hu, C. (2023). Operational parameter prediction of electrocoagulation system in a rural decentralized water treatment plant by interpretable machine learning model. Journal of Environmental Management, 333, 117416. https://doi.org/10.1016/j.jenvman.2023.117416

  • Li, M., Fu, H., Du, Y., Huang, X., Zhang, T., Tang, H., & Li, H. (2022). Laser induced breakdown spectroscopy combined with hybrid variable selection for the prediction of the environmental risk Nemerow index of heavy metals in oily sludge. Journal of Analytical Atomic Spectrometry, 37, 1099–1108. https://doi.org/10.1039/D2JA00048B

    Article  Google Scholar 

  • Liukkonen, M., Laakso, I., & Hiltunen, Y. (2013). Advanced monitoring platform for industrial wastewater treatment: Multivariable approach using the self-organizing map. Environmental Modelling & Software, 48, 193–201. https://doi.org/10.1016/j.envsoft.2013.07.005

    Article  Google Scholar 

  • Masood, A., & Ahmad, K. (2021). A review on emerging artificial intelligence (AI) techniques for air pollution forecasting: Fundamentals, application and performance. Journal of Cleaner Production, 322, 129072. https://doi.org/10.1016/j.jclepro.2021.129072

  • Nafouanti, M. B., Li, J., Nyakilla, E. E., Mwakipunda, G. C., & Mulashani, A. (2023). A novel hybrid random forest linear model approach for forecasting groundwater fluoride contamination. Environmental Science and Pollution Research, 30, 50661–50674. https://doi.org/10.1007/s11356-023-25886-w

    Article  CAS  Google Scholar 

  • Nair, A., Hykkerud, A., & Ratnaweera, H. (2022). Estimating phosphorus and COD concentrations using a hybrid soft sensor: A case study in a norwegian municipal wastewater treatment plant. Water, 14(3), 332. https://doi.org/10.3390/w14030332

    Article  CAS  Google Scholar 

  • Nisbet, R., Miner, G., & Yale, K. (2018). Chapter 3 - The data mining and predictive analytic process, Handbook of statistical analysis and data mining applications (2nd ed.). Academic Press.

    Google Scholar 

  • Norouzi, H., Moghaddam, A. A., Celico, F., & Shiri, J. (2021). Assessment of groundwater vulnerability using genetic algorithm and random forest methods (case study: Miandoab plain, NW of Iran). Environmental Science and Pollution Research, 28, 39598–39613. https://doi.org/10.1007/s11356-021-12714-2

    Article  Google Scholar 

  • Peng, C., Kai, W., Kun, Z., & Fanchao, M. (2022). Monitoring of wastewater treatment process based on multi-stage variational autoencoder. Expert Systems with Applications, 207, 117919. https://doi.org/10.1016/j.eswa.2022.117919

  • Qiu, R., Wang, Y., Wang, D., Qiu, W., Wu, J., & Tao, Y. (2020). Water temperature forecasting based on modified artificial neural network methods: Two cases of the Yangtze River. Science of the Total Environment, 737, 139729. https://doi.org/10.1016/j.scitotenv.2020.139729

  • Sanchez-Fernández, A., Fuente, M. J., & Sainz-Palmero, G. I. (2015). Fault detection in wastewater treatment plants using distributed PCA methods. 2015 IEEE 20th Conference on Emerging Technologies & Factory Automation (ETFA), 1–7.

  • Velasco, L. C., Bongat, J. F., Castillon, C., Laurente, J., & Tabanao, E. (2023). Days-ahead water level forecasting using artificial neural networks for watersheds. Mathematical Biosciences and Engineering, 20, 758–774. https://doi.org/10.3934/mbe.2023035

  • Vousoughi, F. D. (2023). Wavelet-based de-noising in groundwater quality and quantity prediction by an artificial neural network. Water Supply, 23, 1333–1348. https://doi.org/10.2166/ws.2023.021

    Article  CAS  Google Scholar 

  • Wang, D., Thunéll, S., Lindberg, U., Jiang, L., Trygg, J., Tysklind, M., & Souihi, N. (2021). A machine learning framework to improve effluent quality control in wastewater treatment plants. Science of the Total Environment, 784, 147138. https://doi.org/10.1016/j.scitotenv.2021.147138

  • Wei, P., Lu, Z., & Song, J. (2015). A comprehensive comparison of two variable importance analysis techniques in high dimensions: Application to an environmental multi-indicators system. Environmental Modelling & Software, 70, 178–190. https://doi.org/10.1016/j.envsoft.2015.04.015

    Article  Google Scholar 

  • Wei, W., Ramalho, O., Malingre, L., Sivanantham, S., Little, J. C., & Mandin, C. (2019). Machine learning and statistical models for predicting indoor air quality. Indoor Air, 29, 704–726. https://doi.org/10.1111/ina.12580

    Article  CAS  Google Scholar 

  • Xiao, H., Huang, D., Pan, Y., Liu, Y., & Song, K. (2017). Fault diagnosis and prognosis of wastewater processes with incomplete data by the auto-associative neural networks and ARMA model. Chemometrics and Intelligent Laboratory Systems, 161, 96–107. https://doi.org/10.1016/j.chemolab.2016.12.009

    Article  CAS  Google Scholar 

  • Xu, A., Chang, H., Xu, Y., Li, R., Li, X., & Zhao, Y. (2021). Applying artificial neural networks (ANNs) to solve solid waste-related issues: A critical review. Waste Management, 124, 385–402. https://doi.org/10.1016/j.wasman.2021.02.029

    Article  Google Scholar 

  • Yang, S.-S., Yu, X.-L., Ding, M.-Q., He, L., Cao, G.-L., Zhao, L., Tao, Y., Pang, J.-W., Bai, S.-W., Ding, J., & Ren, N.-Q. (2021). Simulating a combined lysis-cryptic and biological nitrogen removal system treating domestic wastewater at low C/N ratios using artificial neural network. Water Research, 189, 116576. https://doi.org/10.1016/j.watres.2020.116576

  • Zhai, W. (2019). Analysis technology of environmental monitoring data based on internet of things environment and improved neural network algorithm. Applied Ecology and Environmental Research, 17, 14505–14516. https://doi.org/10.15666/aeer/1706_1450514516

  • Zhong, S., Zhang, K., Bagheri, M., Burken, J. G., Gu, A., Li, B., Ma, X., Marrone, B. L., Ren, Z. J., Schrier, J., Shi, W., Tan, H., Wang, T., Wang, X., Wong, B. M., Xiao, X., Yu, X., Zhu, J.-J., & Zhang, H. (2021). Machine learning: New ideas and tools in environmental science and engineering. Environmental Science & Technology, 55, 12741–12754. https://doi.org/10.1021/acs.est.1c01339

    Article  CAS  Google Scholar 

Download references

Funding

This work was supported by the Science and Technology Program of Zhejiang Province (No. 2021C03178), the Ecological Environment Research and Achievement Extension Project of Zhejiang Province (No. 2022HT0010), and the Science and Technology Plan Project of Taizhou (No. 22gyb37).

Author information

Authors and Affiliations

Authors

Contributions

All authors contributed to the study conception and design. Wenya Zhao: methodology; investigation; writing—original draft. Peili Zhang: project supervision; research design; data collection; writing—editing and modification. Da Chen: data processing; writing—editing and modification. Hao Wang: funding support, data collection. Binghua Gu: artificial intelligence technology support. Jue Zhang: writing—editing and modification.

Corresponding author

Correspondence to Peili Zhang.

Ethics declarations

Ethics approval

Not applicable.

Competing interests

The authors declare no competing interests.

Disclaimer

All authors have read, understood, and have complied as applicable with the statement on “Ethical responsibilities of Authors” as found in the instructions for authors and are aware that with minor exceptions, no changes can be made to authorship once the paper is submitted.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 396 KB)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhao, W., Zhang, P., Chen, D. et al. Data mining from process monitoring of typical polluting enterprise. Environ Monit Assess 195, 1109 (2023). https://doi.org/10.1007/s10661-023-11733-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10661-023-11733-5

Keywords

Navigation