Skip to main content
Log in

Water quality soft-sensor prediction in anaerobic process using deep neural network optimized by Tree-structured Parzen Estimator

  • Research Article
  • Published:
Frontiers of Environmental Science & Engineering Aims and scope Submit manuscript

Abstract

Anaerobic process is regarded as a green and sustainable process due to low carbon emission and minimal energy consumption in wastewater treatment plants (WWTPs). However, some water quality metrics are not measurable in real time, thus influencing the judgment of the operators and may increase energy consumption and carbon emission. One of the solutions is using a soft-sensor prediction technique. This article introduces a water quality soft-sensor prediction method based on Bidirectional Gated Recurrent Unit (BiGRU) combined with Gaussian Progress Regression (GPR) optimized by Tree-structured Parzen Estimator (TPE). TPE automatically optimizes the hyperparameters of BiGRU, and BiGRU is trained to obtain the point prediction with GPR for the interval prediction. Then, a case study applying this prediction method for an actual anaerobic process (2500 m3/d) is carried out. Results show that TPE effectively optimizes the hyperparameters of BiGRU. For point prediction of CODeff and biogas yield, R2 values of BiGRU, which are 0.973 and 0.939, respectively, are increased by 1.03%–7.61% and 1.28%–10.33%, compared with those of other models, and the valid prediction interval can be obtained. Besides, the proposed model is assessed as a reliable model for anaerobic process through the probability prediction and reliable evaluation. It is expected to provide high accuracy and reliable water quality prediction to offer basis for operators in WWTPs to control the reactor and minimize carbon emission and energy consumption.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Data Accessibility Statement Data not available due to commercial restrictions.

Abbreviations

WWTPs:

Wastewater Treatment Plants

SVM:

Support Vector Machine

RF:

Random Forest

GPR:

Gaussian Process Regression

ANN:

Artificial Neural Network

RNN:

Recurrent Neural Network

LSTM:

Long Short-Time Memory

GRU:

Gated Recurrent Unit

BiGRU:

Bidirectional Gated Recurrent Unit

TPE:

Tree-structured Parzen Estimator

RS:

Random Search

GS:

Grid Search

RBF:

Radial Basis Function

SE:

Square Exponential Covariance function

RQ:

Rational Quadratic

ALK:

Alkalinity

OLR:

Organic Loading Rate

HRT:

Hydraulic Retention Time

COD:

Chemical Oxygen Demand

MSE:

Mean Square Error

RMSE:

Root Mean Square Error

MAPE:

Mean Absolute Percentage Error

CP:

Coverage Percentage

MWP:

Mean Width Percentage

CRPS:

Continuous Ranked Probability Score

CDF:

Cumulative Distribution Function

PDF:

Probability Density Function

PIT:

Probability Integral Transform

References

  • Chen Q L, Chai W, Qiao J F, IEEE (2010). Modeling of Wastewater Treatment Process Using Recurrent Neural Network. Jinan: IEEE, 5872–5876

    Google Scholar 

  • Ching P M L, So R H Y, Morck T (2021). Advances in soft sensors for wastewater treatment plants: a systematic review. Journal of Water Process Engineering, 44: 102367

    Article  Google Scholar 

  • Darvishi H, Ciuonzo D, Eide E R, Rossi P S (2021). Sensor-fault detection, isolation and accommodation for digital twins via modular data-driven architecture. IEEE Sensors Journal, 21(4): 4827–4838

    Article  CAS  Google Scholar 

  • Di Maria F, Micale C (2015). The contribution to energy production of the aerobic bioconversion of organic waste by an organic Rankine cycle in an integrated anaerobic-aerobic facility. Renewable Energy, 81: 770–778

    Article  CAS  Google Scholar 

  • Ferro C A T (2014). Fair scores for ensemble forecasts. Quarterly Journal of the Royal Meteorological Society, 140(683): 1917–1923

    Article  Google Scholar 

  • Han H G, Zhang J C, Du S L, Sun H Y, Qiao J F (2021). Robust optimal control for anaerobic-anoxic-oxic reactors. Science China. Technological Sciences, 64(7): 1485–1499

    Article  CAS  Google Scholar 

  • Hauck M, Maalcke-Luesken F A, Jetten M S M, Huijbregts M A J (2016). Removing nitrogen from wastewater with side stream anammox: What are the trade-offs between environmental impacts? Resources, Conservation and Recycling, 107: 212–219

    Article  Google Scholar 

  • Heydari B, Sharghi E A, Rafiee S, Mohtasebi S S (2021). Use of artificial neural network and adaptive neuro-fuzzy inference system for prediction of biogas production from spearmint essential oil wastewater treatment in up-flow anaerobic sludge blanket reactor. Fuel, 306: 121734

    Article  CAS  Google Scholar 

  • Hochreiter S, Schmidhuber J (1997). Long short-term memory. Neural Computation, 9(8): 1735–1780

    Article  CAS  Google Scholar 

  • Jiang Y, Yin S, Dong J, Kaynak O (2021). A review on soft sensors for monitoring, control, and optimization of industrial processes, control, and optimization of industrial processes. IEEE Sensors Journal, 21(11): 12868–12881

    Article  Google Scholar 

  • Jupp P E, Kume A (2020). Measures of goodness of fit obtained by almost-canonical transformations on Riemannian manifolds. Journal of Multivariate Analysis, 176: 104579

    Article  Google Scholar 

  • Kadlec P, Gabrys B, Strandt S (2009). Data-driven Soft Sensors in the process industry. Computers & Chemical Engineering, 33(4): 795–814

    Article  CAS  Google Scholar 

  • Kang L, Chen R S, Xiong N, Chen Y C, Hu Y X, Chen C M (2019). Selecting hyper-parameters of gaussian process regression based on non-inertial particle swarm optimization in Internet of things. IEEE Access: Practical Innovations, Open Solutions, 7: 59504–59513

    Article  Google Scholar 

  • Kim M, Yang Y N, Morikawa-Sakura M S, Wang Q H, Lee M V, Lee D Y, Feng C P, Zhou Y L, Zhang Z Y (2012). Hydrogen production by anaerobic co-digestion of rice straw and sewage sludge. International Journal of Hydrogen Energy, 37(4): 3142–3149

    Article  CAS  Google Scholar 

  • Laio F, Tamea S (2007). Verification tools for probabilistic forecasts of continuous hydrological variables. Hydrology and Earth System Sciences, 11(4): 1267–1277

    Article  Google Scholar 

  • Li X Y, Yi X H, Liu Z H, Liu H B, Chen T, Niu G Q, Yan B, Chen C, Huang M Z, Ying G G (2021). Application of novel hybrid deep leaning model for cleaner production in a paper industrial wastewater treatment system. Journal of Cleaner Production, 294: 126343

    Article  CAS  Google Scholar 

  • Newhart K B, Holloway R W, Hering A S, Cath T Y (2019). Data-driven performance analyses of wastewater treatment plants: a review. Water Research, 157: 498–513

    Article  CAS  Google Scholar 

  • Nguyen H P, Liu J, Zio E (2020). A long-term prediction approach based on long short-term memory neural networks with automatic parameter optimization by Tree-structured Parzen Estimator and applied to time-series data of NPP steam generators. Applied Soft Computing, 89: 106116

    Article  Google Scholar 

  • Ozcan G, Pajovic M, Sahinoglu Z, Wang Y B, Orlik P V, Wada T, IEEE (2016). Online State of Charge Estimation for Lithium-Ion Batteries Using Gaussian Process Regression. Florence: IEEE, 998–1003

    Google Scholar 

  • Pham V, Bluche T, Kermorvant C, Louradour J (2014). Dropout Improves Recurrent Neural Networks for Handwriting Recognition. Hersonissos, Greece: IEEE, 285–290

    Google Scholar 

  • Putatunda S, Rama K, Acm (2018). A Comparative Analysis of Hyperopt as Against Other Approaches for Hyper-Parameter Optimization of XGBoost. Shanghai: ACM

    Book  Google Scholar 

  • Qiao S, Wang Q, Zhang J, Pei Z (2020). Detection and classification of early decay on blueberry based on improved deep residual 3D convolutional neural network in hyperspectral images. Scientific Programming, 2020: 1–12

    Article  Google Scholar 

  • Safari M a M, Masseran N, Majid M H A (2020). Robust reliability estimation for lindley distribution: a probability integral transform statistical approach. Mathematics, 8(9): 1634

    Article  Google Scholar 

  • Samuelsson O, Björk A, Zambrano J, Carlsson B (2017). Gaussian process regression for monitoring and fault detection of wastewater treatment processes. Water Science and Technology, 75(12): 2952–2963

    Article  CAS  Google Scholar 

  • Şenol H (2021). Methane yield prediction of ultrasonic pretreated sewage sludge by means of an artificial neural network. Energy, 215: 119173

    Article  Google Scholar 

  • Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R (2014). Dropout: A simple way to prevent neural networks from overfitting. Journal of Machine Learning Research, 15: 1929–1958

    Google Scholar 

  • Szelag B, Gawdzik A, Gawdzik A (2017). Application of selected methods of black box for modelling the settleability process in wastewater treatment plant. Ecological Chemistry and Engineering S-Chemia I Inzynieria Ekologiczna S, 24(1): 119–127

    Article  Google Scholar 

  • Wang H T, Yang Y, Keller A A, Li X, Feng S J, Dong Y N, Li F T (2016). Comparative analysis of energy intensity and carbon emissions in wastewater treatment in USA, Germany, China and South Africa. Applied Energy, 184: 873–881

    Article  CAS  Google Scholar 

  • Wang J, Cui Q, Sun X (2021). A novel framework for carbon price prediction using comprehensive feature screening, bidirectional gate recurrent unit and Gaussian process regression. Journal of Cleaner Production, 314: 128024

    Article  Google Scholar 

  • Wei J P, Liang G F, Alex J, Zhang T C, Ma C B (2020). Research progress of energy utilization of agricultural waste in China: Bibliometric analysis by citespace. Sustainability (Basel), 12(3): 812

    Article  Google Scholar 

  • Wu X, Wang Y, Wang C, Wang W, Dong F (2021). Moving average convergence and divergence indexes based online intelligent expert diagnosis system for anaerobic wastewater treatment process. Bioresource Technology, 324: 124662

    Article  CAS  Google Scholar 

  • Xu Y, Gao W, Qian F, Li Y (2021). Potential analysis of the attention-based LSTM model in ultra-short-term forecasting of building HVAC energy consumption. Frontiers in Energy Research, 9: 730640

    Article  Google Scholar 

  • Yaginuma K, Tanabe S, Kano M (2022). Gray-box soft sensor for water content monitoring in fluidized bed granulation. Chemical & Pharmaceutical Bulletin, 70(1): 74–81

    Article  CAS  Google Scholar 

  • Zeng G M, Li X D, Jiang R, Li J B, Huang G H (2006). Fault diagnosis of WWTP based on improved support vector machine. Environmental Engineering Science, 23(6): 1044–1054

    Article  CAS  Google Scholar 

  • Zhang C, Wei H, Zhao X, Liu T, Zhang K (2016). A Gaussian process regression based hybrid approach for short-term wind speed prediction. Energy Conversion and Management, 126: 1084–1092

    Article  Google Scholar 

  • Zhang Z, Ye L, Qin H, Liu Y, Wang C, Yu X, Yin X, Li J (2019). Wind speed prediction method using shared weight long short-term memory network and gaussian process regression. Applied Energy, 247: 270–284

    Article  Google Scholar 

Download references

Acknowledgements

This research was supported by the National Natural Science Foundation of China (Nos. 41977300 and 41907297), the Science and Technology Program of Guangzhou (China) (No. 202002020055) and the Fujian Provincial Natural Science Foundation (China) (No. 2020I1001).

Author information

Authors and Affiliations

Authors

Contributions

Junlang Li: Algorithm design, Writing - Original draft. Zhenguo Chen: Code debugging, Writing - Review & Editing, Investigation. Xiaoyong Li: Algorithm optimization, Writing - Review & Editing. Xiaohui Yi: Conceptualization. Yingzhong Zhao: Experimental data visualization. Xinzhong He: Dataset resources. Zehua Huang: Formal analysis. Mohamed A. Hassaan: Writing - Review & Editing. Ahmed El Nemr: Writing - Review & Editing. Mingzhi Huang: Project administration, Funding acquisition.

Corresponding authors

Correspondence to Zhenguo Chen or Mingzhi Huang.

Additional information

Highlights

• Hybrid deep-learning model is proposed for water quality prediction.

• Tree-structured Parzen Estimator is employed to optimize the neural network.

• Developed model performs well in accuracy and uncertainty.

• Usage of the proposed model can reduce carbon emission and energy consumption.

Special Issue—Artificial Intelligence/Machine Learning on Environmental Science & Engineering (Responsible Editors: Yongsheng Chen, Xiaonan Wang, Joe F. Bozeman III & Shouliang Yi)

Supplementary material

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, J., Chen, Z., Li, X. et al. Water quality soft-sensor prediction in anaerobic process using deep neural network optimized by Tree-structured Parzen Estimator. Front. Environ. Sci. Eng. 17, 67 (2023). https://doi.org/10.1007/s11783-023-1667-3

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11783-023-1667-3

Keywords

Navigation