A hybrid regression model for water quality prediction


In this work, we propose a hybrid regression model to solve a specific problem faced by a modern paper manufacturing company. Boiler inlet water quality is a major concern for the paper machine. If water treatment plant can not produce water of desired quality, then it results in poor health of the boiler water tube and consequently affects the quality of the paper. This variation is due to several crucial process parameters. We build a hybrid regression model based on regression tree and support vector regression for boiler water quality prediction and show its excellent performance as compared to other state-of-the-art.

This is a preview of subscription content, log in to check access.

Fig. 1
Fig. 2
Fig. 3


  1. 1.

    Antanasijević, D., Pocajt, V., Perić-Grujić, A., Ristić, M.: Multilevel split of high-dimensional water quality data using artificial neural networks for the prediction of dissolved oxygen in the Danube river. In: Neural Computing and Applications, pp. 1–10 (2019)

  2. 2.

    Avila, R., Horn, B., Moriarty, E., Hodson, R., Moltchanova, E.: Evaluating statistical model performance in water quality prediction. J. Environ. Manag. 206, 910–919 (2018)

    Article  Google Scholar 

  3. 3.

    Batchelder, G.W.: Process for the demineralization of water. US Patent 3,171,799 (1965)

  4. 4.

    Bennett, K.P., Blue, J.: A support vector machine approach to decision trees. In: 1998 IEEE International Joint Conference on Neural Networks Proceedings. IEEE World Congress on Computational Intelligence (Cat. No. 98CH36227), vol. 3, pp. 2396–2401. IEEE (1998)

  5. 5.

    Bhattacharya, B., Solomatine, D.P.: Neural networks and m5 model trees in modelling water level-discharge relationship. Neurocomputing 63, 381–396 (2005)

    Article  Google Scholar 

  6. 6.

    Breiman, L.: Classification and Regression Trees. Routledge, London (2017)

    Google Scholar 

  7. 7.

    Chakraborty, T., Chakraborty, A.K., Chattopadhyay, S.: A novel distribution-free hybrid regression model for manufacturing process efficiency improvement. J. Comput. Appl. Math. (2019) (To appear)

  8. 8.

    Chakraborty, T., Chakraborty, A.K., Murthy, C.: A nonparametric ensemble binary classifier and its statistical properties. Stat. Probab. Lett. 149, 16–23 (2019)

    Article  Google Scholar 

  9. 9.

    Chakraborty, T., Chattopadhyay, S., Chakraborty, A.K.: A novel hybridization of classification trees and artificial neural networks for selection of students in a business school. Opsearch 55(2), 434–446 (2018)

    Article  Google Scholar 

  10. 10.

    Chang, F., Liu, C.C.: Decision tree as an accelerator for support vector machines. In: Ding, X. (ed.) Advances in Character Recognition. IntechOpen, Rijeka (2012)

    Google Scholar 

  11. 11.

    Cherkassky, V., Ma, Y.: Practical selection of SVM parameters and noise estimation for SVM regression. Neural Netw. 17(1), 113–126 (2004)

    Article  Google Scholar 

  12. 12.

    Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)

    Google Scholar 

  13. 13.

    Gmar, S., Helali, N., Boubakri, A., Sayadi, I.B.S., Tlili, M., Amor, M.B.: Electrodialytic desalination of brackish water: determination of optimal experimental parameters using full factorial design. Appl. Water Sci. 7(8), 4563–4572 (2017)

    Article  Google Scholar 

  14. 14.

    Kumar, M.A., Gopal, M.: A hybrid svm based decision tree. Pattern Recogn. 43(12), 3977–3987 (2010)

    Article  Google Scholar 

  15. 15.

    Lhassani, A., Rumeau, M., Benjelloun, D., Pontie, M.: Selective demineralization of water by nanofiltration application to the defluorination of brackish water. Water Res. 35(13), 3260–3264 (2001)

    Article  Google Scholar 

  16. 16.

    Loh, W.Y.: Classification and regression trees. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 1(1), 14–23 (2011)

    Article  Google Scholar 

  17. 17.

    Mahuli, S., Rhinehart, R., Riggs, J.: ph control using a statistical technique for continuous on-line model adaptation. Comput. Chem. Eng. 17(4), 309–317 (1993)

    Article  Google Scholar 

  18. 18.

    Ouyang, Y., Nkedi-Kizza, P., Wu, Q., Shinde, D., Huang, C.: Assessment of seasonal variations in surface water quality. Water Res. 40(20), 3800–3810 (2006)

    Article  Google Scholar 

  19. 19.

    Schölkopf, B., Smola, A.J., Bach, F., et al.: Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond. MIT Press, Cambridge (2002)

    Google Scholar 

  20. 20.

    Singh, K.P., Basant, A., Malik, A., Jain, G.: Artificial neural network modeling of the river water quality—a case study. Ecol. Model. 220(6), 888–895 (2009)

    Article  Google Scholar 

  21. 21.

    Sugumaran, V., Muralidharan, V., Ramachandran, K.: Feature selection using decision tree and classification through proximal support vector machine for fault diagnostics of roller bearing. Mech. Syst. Signal Process. 21(2), 930–942 (2007)

    Article  Google Scholar 

  22. 22.

    Vedelago, R., Millar, G.J.: Process evaluation of treatment options for high alkalinity coal seam gas associated water. J. Water Process Eng. 23, 195–206 (2018)

    Article  Google Scholar 

  23. 23.

    Wang, X., Zhou, Y., Zhao, Z., Wang, L., Xu, J., Yu, J.: A novel water quality mechanism modeling and eutrophication risk assessment method of lakes and reservoirs. Nonlinear Dyn 96(2), 1037–1053 (2019). https://doi.org/10.1007/s11071-019-04837-6

    Article  Google Scholar 

  24. 24.

    Zhang, J.: Hc-dt/svm: a tightly coupled hybrid decision tree and support vector machines algorithm with application to land cover change detections. In: Proceedings of the 1st ACM SIGSPATIAL International Workshop on Data Mining for Geoinformatics, pp. 9–18. ACM (2010)

  25. 25.

    Zhang, J., Li, Y., Zeng, X., Huang, G., Li, Y., Zhu, Y., Kong, F., Xi, M., Liu, J.: Effluent trading planning and its application in water quality management: a factor-interaction perspective. Environ. Res. 168, 286–305 (2019)

    Article  Google Scholar 

Download references


The authors acknowledge the concerned editor and reviewers for their constructive comments.

Author information



Corresponding author

Correspondence to Tanujit Chakraborty.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Chakraborty, T., Chakraborty, A.K. & Mansoor, Z. A hybrid regression model for water quality prediction. OPSEARCH 56, 1167–1178 (2019). https://doi.org/10.1007/s12597-019-00386-z

Download citation


  • Water quality
  • Decision tree
  • Support vector regression
  • Hybrid model