Skip to main content

Advertisement

Log in

Trihalomethane prediction model for water supply system based on machine learning and Log-linear regression

  • Original Paper
  • Published:
Environmental Geochemistry and Health Aims and scope Submit manuscript

Abstract

Laboratory determination of trihalomethanes (THMs) is a very time-consuming task. Therefore, establishing a THMs model using easily obtainable water quality parameters would be very helpful. This study explored the modeling methods of the random forest regression (RFR) model, support vector regression (SVR) model, and Log-linear regression model to predict the concentration of total-trihalomethanes (T-THMs), bromodichloromethane (BDCM), and dibromochloromethane (DBCM), using nine water quality parameters as input variables. The models were developed and tested using a dataset of 175 samples collected from a water treatment plant. The results showed that the RFR model, with the optimal parameter combination, outperformed the Log-linear regression model in predicting the concentration of T-THMs (N25 = 82–88%, rp = 0.70–0.80), while the SVR model performed slightly better than the RFR model in predicting the concentration of BDCM (N25 = 85–98%, rp = 0.70–0.97). The RFR model exhibited superior performance compared to the other two models in predicting the concentration of T-THMs and DBCM. The study concludes that the RFR model is superior overall to the SVR model and Log-linear regression models and could be used to monitor THMs concentration in water supply systems.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Data availability

Data will be made available on request.

References

Download references

Acknowledgements

Financial support was received from the Key Program of the Shanghai Science and Technology Commission (19DZ1204401).

Funding

This work was supported by the Key Program of the Shanghai Science and Technology Commission (19DZ1204401).

Author information

Authors and Affiliations

Authors

Contributions

H.L.: Acquisition and analysis of data; Methodology; Model testing; Writing. Y.C.: Revising it critically for important intellectual content. Y.Z.: Methodology; Polish; Final approval of the version to be submitted. X.H.: Polish. S.S.: Investigation; Polish.

Corresponding author

Correspondence to Shihu Shu.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 60 kb)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, H., Chu, Y., Zhu, Y. et al. Trihalomethane prediction model for water supply system based on machine learning and Log-linear regression. Environ Geochem Health 46, 31 (2024). https://doi.org/10.1007/s10653-023-01778-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10653-023-01778-3

Keywords

Navigation