Skip to main content
Log in

Cluster Analysis and Predictive Modeling of Urban Water Distribution System Leaks with Socioeconomic and Engineering Factors

  • Published:
Water Resources Management Aims and scope Submit manuscript

Abstract

Water distribution network (WDN) failures can disrupt operations and cause economic damage. Although leakage has been widely discussed, few studies have integrated spatial clusters with engineering, environmental, and socioeconomic factors simultaneously. This study proposes an approach to explore the role of socioeconomic factors in understanding leak risks. Using a unique data set of more than 4,000 reported leak events within the City of Los Angeles (2010–2013), the analysis (1) assesses the effectiveness of including socioeconomic factors with engineering factors in explaining observed leaks, (2) identifies spatial clusters of leaks, and (3) develops a predictive model with machine learning to identify spatial areas with high risks of failure. Results indicate that distinct clusters of leaks are evident, accounting for 20–30% of all leaks in the study area in a given year. Multivariate regression modeling showed that geography, socioeconomic, and engineering factors are statistically significant in predicting leaks. A predictive model with machine learning was developed, identifying key factors. The model had accuracy rates of 93.29% and 92.45% for interpolation and extrapolation prediction scenarios, respectively. The approach demonstrates the potential value of incorporating socioeconomic indicators into the models for WDN rehabilitation. Moreover, the approach demonstrates how municipal leak loss mitigation programs can consider a broad set of predictive factors to optimize investments.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Data Availability

The data is available in an online repository: https://github.com/erikporse/artes. Data used in the study was compiled from multiple sources, including: (CEC 2021); (County’s Enterprise GIS (eGIS) Steering Committee 2018); (Krishnakumar and Poston 2016); (OEHHA 2022); (Pincetl and LA Energy Atlas Development Team 2023); (Poston and De Groot 2014); (Poston and Stevens 2015).

Code Availability

Code generated or used during the study are available in an online repository: https://github.com/erikporse/artes.

References

Download references

Acknowledgements

Many thanks to Eric Fournier, Dan Cheng, Claire Hirashiki, and Stephanie Pincetl at the California Center for Sustainable Communities within UCLA’s Institute of the Environment and Sustainability for guidance and technical assistance.

Funding

Support for this research was provided by the Beijing Social Sciences Foundation (grant number 18GLC070), the Fundamental Research Funds for the Central Universities (grant number 2019JBW007), and China Scholarship Council (grant number 201707095080). The Office of Water Programs at California State University, Sacramento also supported the research.Beijing Social Sciences Foundation,18GLC070,Qing Shuang,Fundamental Research Funds for the Central Universities,2019JBW007,Qing Shuang,China Scholarship Council,201707095080,Qing Shuang.

Author information

Authors and Affiliations

Authors

Contributions

Shuang Q. and Porse E. designed the research, collected the data, and programed the code; Zhao R.T. tested the model; Shuang Q. and Zhao R.T. wrote the original draft preparation; Porse E. reviewed and edited the manuscript. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Qing Shuang.

Ethics declarations

Ethical Approval

Not applicable.

Consent to Participate

Not applicable.

Consent to Publish

Not applicable.

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 291 kb)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Shuang, Q., Zhao, R.T. & Porse, E. Cluster Analysis and Predictive Modeling of Urban Water Distribution System Leaks with Socioeconomic and Engineering Factors. Water Resour Manage 38, 385–400 (2024). https://doi.org/10.1007/s11269-023-03676-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11269-023-03676-w

Keywords

Navigation