Abstract
Water distribution network (WDN) failures can disrupt operations and cause economic damage. Although leakage has been widely discussed, few studies have integrated spatial clusters with engineering, environmental, and socioeconomic factors simultaneously. This study proposes an approach to explore the role of socioeconomic factors in understanding leak risks. Using a unique data set of more than 4,000 reported leak events within the City of Los Angeles (2010–2013), the analysis (1) assesses the effectiveness of including socioeconomic factors with engineering factors in explaining observed leaks, (2) identifies spatial clusters of leaks, and (3) develops a predictive model with machine learning to identify spatial areas with high risks of failure. Results indicate that distinct clusters of leaks are evident, accounting for 20–30% of all leaks in the study area in a given year. Multivariate regression modeling showed that geography, socioeconomic, and engineering factors are statistically significant in predicting leaks. A predictive model with machine learning was developed, identifying key factors. The model had accuracy rates of 93.29% and 92.45% for interpolation and extrapolation prediction scenarios, respectively. The approach demonstrates the potential value of incorporating socioeconomic indicators into the models for WDN rehabilitation. Moreover, the approach demonstrates how municipal leak loss mitigation programs can consider a broad set of predictive factors to optimize investments.
Similar content being viewed by others
Data Availability
The data is available in an online repository: https://github.com/erikporse/artes. Data used in the study was compiled from multiple sources, including: (CEC 2021); (County’s Enterprise GIS (eGIS) Steering Committee 2018); (Krishnakumar and Poston 2016); (OEHHA 2022); (Pincetl and LA Energy Atlas Development Team 2023); (Poston and De Groot 2014); (Poston and Stevens 2015).
Code Availability
Code generated or used during the study are available in an online repository: https://github.com/erikporse/artes.
References
Abokifa AA, Sela L (2023) Integrating spatial clustering with predictive modeling of pipe failures in water distribution systems. URBAN WATER J 20:465–476. https://doi.org/10.1080/1573062X.2023.2180393
American Society of Civil Engineers (2017) Infrastructure report card: drinking Water. American Society of Civil Engineers (ASCE), Reston, VA (Reprinted)
Bailony MR, Hararah MK, Salhab AR, Ghannam I, Abdeen Z, Ghannam J (2011) Cancer registration and healthcare access in West Bank, Palestine: a GIS analysis of childhood cancer, 1998–2007. INT J CANCER 129:1180–1189. https://doi.org/10.1002/ijc.25732
Barton NA, Farewell TS, Hallett SH, Acland TF (2019) Improving pipe failure predictions: factors affecting pipe failure in drinking water networks. WATER RES 164:114926. https://doi.org/10.1016/j.watres.2019.114926
CEC (2021) California Building Climate Zones. https://cecgis-caenergy.opendata.arcgis.com/datasets/CAEnergy::california-building-climate-zones/explore
County’s Enterprise GIS (eGIS) Steering Committee (2018) Los Angeles County GIS Data Portal. https://egis3.lacounty.gov/dataportal/. Accessed 12 Jan 2020
Da Silveira APP, Mata-Lima H (2021) Assessing energy efficiency in water utilities using long-term data analysis. Water Resour Manag 35:2763–2779. https://doi.org/10.1007/s11269-021-02866-8
Diamantopoulou MJ (2023) Machine learning in environmental modeling: A case study with groundtruth data from Seich–Sou suburban forest, Greece. Paper presented at the 12th World Congress on Water Resources and Environment (EWRA 2023), Thessaloniki, Greece
Fan X, Wang X, Zhang X, Yu XB (2022) Machine learning based water pipe failure prediction: the effects of engineering, geology, climate and socio-economic factors. Reliab Eng Syst Safe 219:108185. https://doi.org/10.1016/j.ress.2021.108185
Guyon I, Weston J, Barnhill S, Vapnik V (2002) Gene Selection for Cancer classification using support Vector machines. MACH LEARN 46:389–422. https://doi.org/10.1023/A:1012487302797
Hani A, Nechem D, Hani S, Bougherira N, Toumi F, Djabri L, Chaffai H (2023) Multi-criteria analysis and characterization of the integrated water resources management model in the Annaba region. Paper presented at the 12th World Congress on Water Resources and Environment (EWRA 2023), Thessaloniki, Greece
Hunaidi O (2006) (Condition assessment of water pipes) Proceedings of the EPA Workshop on Innovation and Research for Water Infrastructure in the 21st Century, EPA Workshop, Arlington, VA, USA
Krishnakumar P, Poston B (2016) Los Angeles water main leaks since 2010, Los Angeles Times, http://graphics.latimes.com/los-angeles-pipe-leaks/
Kulldorff M (2022) SaTScanTM User Guide. http://www.satscan.org/techdoc.html. Accessed: 14 Nov 2023
LADWP (2023) 2022-23 Water infrastructure plan. Los Angeles. Reprinted
LA County (2023) Los Angeles County GIS Data Portal. https://egis3.lacounty.gov/dataportal/
Li Z, Wang J, Yan H, Li S, Tao T, Xin K (2022) Fast Detection and Localization of Multiple Leaks in Water Distribution Network jointly driven by Simulation and Machine Learning. J Water Res Plan Man 148:5022005. https://doi.org/10.1061/(ASCE)WR.1943-5452.0001574
Mazumder RK, Salman AM, Li Y (2021) Failure risk analysis of pipelines using data-driven machine learning algorithms. Struct Saf 89:102047. https://doi.org/10.1016/j.strusafe.2020.102047
Misiūnas D (2008) Failure monitoring and asset condition asssessment in water supply systems. Vilniaus Gedimino technikos universitetas
Mondal S, Singh D, Kumar R (2022) Crime hotspot detection using statistical and geospatial methods: a case study of Pune City, Maharashtra, India. GeoJournal 87:5287–5303. https://doi.org/10.1007/s10708-022-10573-z
Morrison R (2021) The Aging Pipes Dilemma. https://www.contractormag.com/piping/article/21153216/the-aging-pipes-dilemma. Accessed: 2 Nov 2023
OEHHA (2022) CalEnviroScreen v 2.0. Sacramento, CA: California Office of Environmental Health Hazard Assessment. https://oehha.ca.gov/calenviroscreen. Accessed: 1 Nov 2023
Pincetl S, LA Energy Atlas Development Team (2023) LA Energy Atlas. California Center for Sustainable Communities. UCLA, Los Angeles, CA. (Reprinted)
Poston B, De Groot L (2014) Water pipe leaks in Los Angeles, Los Angeles Times, http://www.latimes.com/visuals/graphics/la-me-g-water-leaks-20141107-htmlstory.html
Poston B, Stevens M (2015) LA’s Aging Water Pipes; a $1-Billion Dilemma, Los Angeles Times, Accessed 13 Jan 2020. http://graphics.latimes.com/la-aging-water-infrastructure/
Rahimi-Golkhandan A, Aslani B, Mohebbi S (2022) Predictive resilience of interdependent water and transportation infrastructures: a sociotechnical approach. Socio-Econ Plan Sci 80:101166. https://doi.org/10.1016/j.seps.2021.101166
Reyes-Velarde A (2018) Century-old water main breaks in South Los Angeles, submerging streets and cars and spurring evacuations. https://www.latimes.com/local/lanow/la-me-ln-south-la-water-main-break-20181221-story.html. Accessed 2 Nov 2023
Robertson C, Nelson TA (2010) Review of software for space-time Disease surveillance. Int J health Geogr 9:16. https://doi.org/10.1186/1476-072X-9-16
Robles-Velasco A, Cortés P, Muñuzuri J, De Baets B (2023) Prediction of pipe failures in water supply networks for longer time periods through multi-label classification. Expert Syst Appl 213:119050. https://doi.org/10.1016/j.eswa.2022.119050
Roushangar K, Ghasempour R, Nourani V (2022) Spatiotemporal analysis of droughts over different climate regions using hybrid clustering method. Water Resour Manag 36:473–488. https://doi.org/10.1007/s11269-021-02974-5
Stimers M, Lenagala S, Haddock B, Paul BK, Mohler R (2022) Space-time clustering with the space-time permutation model in SaTScan™ Applied to Building Permit Data following the 2011 Joplin, Missouri Tornado. Int J Disast Risk Sc 13:962–973. https://doi.org/10.1007/s13753-022-00456-9
Swilling M (2011) Reconceptualising urbanism, ecology and networked infrastructures. Soc Dyn 37:78–95. https://doi.org/10.1080/02533952.2011.569997
Tripathi V, Mohanty MP, Singh H (2023) Fidelity of machine learning models in capturing flood inundation through geomorphic descriptors over Ganga sub-basin, India. Paper presented at the 12th World Congress on Water Resources and Environment (EWRA 2023), Thessaloniki, Greece
U.S. Census Bureau (2023) American Community Survey. https://www.census.gov/data.html. Accessed: 1 Nov 2023
Wadhwa A, Thakur MK (2022) Rapid surveillance of COVID-19 by timely detection of geographically robust, alive and emerging hotspots using particle Swarm Optimizer. Appl Geogr 144:102719
Acknowledgements
Many thanks to Eric Fournier, Dan Cheng, Claire Hirashiki, and Stephanie Pincetl at the California Center for Sustainable Communities within UCLA’s Institute of the Environment and Sustainability for guidance and technical assistance.
Funding
Support for this research was provided by the Beijing Social Sciences Foundation (grant number 18GLC070), the Fundamental Research Funds for the Central Universities (grant number 2019JBW007), and China Scholarship Council (grant number 201707095080). The Office of Water Programs at California State University, Sacramento also supported the research.Beijing Social Sciences Foundation,18GLC070,Qing Shuang,Fundamental Research Funds for the Central Universities,2019JBW007,Qing Shuang,China Scholarship Council,201707095080,Qing Shuang.
Author information
Authors and Affiliations
Contributions
Shuang Q. and Porse E. designed the research, collected the data, and programed the code; Zhao R.T. tested the model; Shuang Q. and Zhao R.T. wrote the original draft preparation; Porse E. reviewed and edited the manuscript. All authors have read and agreed to the published version of the manuscript.
Corresponding author
Ethics declarations
Ethical Approval
Not applicable.
Consent to Participate
Not applicable.
Consent to Publish
Not applicable.
Conflicts of Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Shuang, Q., Zhao, R.T. & Porse, E. Cluster Analysis and Predictive Modeling of Urban Water Distribution System Leaks with Socioeconomic and Engineering Factors. Water Resour Manage 38, 385–400 (2024). https://doi.org/10.1007/s11269-023-03676-w
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11269-023-03676-w