A review of data-driven modelling in drinking water treatment

  • Review paper
Reviews in Environmental Science and Bio/Technology

Abstract

There are significant opportunities to optimize drinking water treatment and water resource management using data-driven models. Modelling can help characterize the behaviour of complex systems, such as water quality and environmental systems, and give insight into expected outcomes under changing conditions. Many water treatment models have been developed, for example to predict treated water quality from coagulant addition or disinfection by-product formation from chlorination, and these can better inform operators of the optimal treatment parameters to limit risk and reduce cost. Data-driven models, in particular, present an opportunity to learn relationships from patterns in historical data without the need to pre-define mechanisms or variable interactions. Furthermore, models built on currently monitored data are likely easier to implement since they use water quality measures that are already in place. However, data-driven approaches face significant challenges, including greater uncertainty in model validity, difficulty interpreting model behaviour and decision logic, and an increased likelihood of incorporating biases from training data. This article reviews data-driven model applications in drinking water treatment to highlight opportunities related to protecting source water quality, optimizing treatment processes, and interpreting sensor data. There is a focus on identifying the approaches and algorithms best suited to specific applications and on the interpretability of trained models. Successful implementation of data-driven models in critical systems, such as water treatment, requires that models be validated and that a model’s decision-making logic can be identified and scrutinized.

Data availability

Canada’s National Long-term Water Quality Monitoring database (open data).

Funding

Natural Sciences and Engineering Research Council (NSERC) Discovery Grant.

Author information

Corresponding author

Correspondence to Nicolas M. Peleato.

Ethics declarations

Conflict of interest

None.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Aliashrafi, A., Zhang, Y., Groenewegen, H. et al. A review of data-driven modelling in drinking water treatment. Rev Environ Sci Biotechnol 20, 985–1009 (2021). https://doi.org/10.1007/s11157-021-09592-y

  • DOI: https://doi.org/10.1007/s11157-021-09592-y
