Abstract
The stability prediction of landslide dams is the basis for reducing and even eliminating the damage caused by landslide dam failures. However, the stability of landslide dams with missing data cannot be predicted by the existing rapid evaluation methods. A landslide dam database containing 2783 historical cases from 49 countries worldwide is created. Considering 8 variables and the missing data, the XGBoost model for predicting the stability of landslide dams is established, and the impact of missing data on the prediction accuracy of the model is also discussed. The XGBoost prediction model can not only predict the landslide dam cases with missing data, but the accuracy of the XGBoost model has also been improved compared with that of the widely used rapid evaluation methods. The characteristic analysis framework of landslide dam stability prediction is proposed based on SHapley Additive exPlanations attribution theory, and the results show the importance of each characteristic and its influence on landslide dam stability from both overall and individual perspectives. The Meilonggou, Usoi, and Mount St. Helens landslide dams are introduced to demonstrate the applicability and accuracy of the XGBoost prediction model proposed in this study.
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10346-022-01947-y/MediaObjects/10346_2022_1947_Fig1_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10346-022-01947-y/MediaObjects/10346_2022_1947_Fig2_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10346-022-01947-y/MediaObjects/10346_2022_1947_Fig3_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10346-022-01947-y/MediaObjects/10346_2022_1947_Fig4_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10346-022-01947-y/MediaObjects/10346_2022_1947_Fig5_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10346-022-01947-y/MediaObjects/10346_2022_1947_Fig6_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10346-022-01947-y/MediaObjects/10346_2022_1947_Fig7_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10346-022-01947-y/MediaObjects/10346_2022_1947_Fig8_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10346-022-01947-y/MediaObjects/10346_2022_1947_Fig9_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10346-022-01947-y/MediaObjects/10346_2022_1947_Fig10_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10346-022-01947-y/MediaObjects/10346_2022_1947_Fig11_HTML.png)
![](http://media.springernature.com/m312/springer-static/image/art%3A10.1007%2Fs10346-022-01947-y/MediaObjects/10346_2022_1947_Fig12_HTML.png)
Similar content being viewed by others
Data availability
The electronic supplementary file containing 2783 landslide dam cases is available to authorized users.
References
Canuti P, Casagli N, Ermini L (1998) Inventory of landslide dams in the Northern Apennine as a model for induced flood hazard forecasting. Managing Hydro-Geological Disasters in a Vulnerable Environment. CNR-GNDCI and UNESCO IHP, Perugia, 189–202
Casagli N, Ermini L (1999) Geomorphic analysis of landslide dams in the Northern Apennine. Trans Jpn Geomorphol 20(3):219–249
Chen TQ, Guestrin C (2016) XGBoost: a scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York, USA. ACM, pp 785–794
Costa JE, Schuster RL (1988) The formation and failure of natural dam. Geol Soc Am Bull 100:1054–1068
Costa JE, Schuster RL (1991) Documented historical landslide dams from around the world. U.S. Geological Survey open-file report 91–239, Washington, USA
Dong JJ, Li YS, Kuo CY, Sung RT, Li MH, Lee CT, Chen CC, Lee WR (2011a) The formation and breach of a short-lived landslide dam at Hsiaolin Village, Taiwan-part I: post-event reconstruction of dam geometry. Eng Geol 123(1–2):40–59
Dong JJ, Tung YH, Chen CC, Liao JJ, Pan YW (2011b) Logistic regression model for predicting the failure probability of a landslide dam. Eng Geol 117(1–2):52–61
Ermini L, Casagli N (2003) Prediction of the behavior of landslide dams using a geomorphological dimensionless index. Earth Surf Process Landf 28(1):31–47
Fan XM, Dufresne A, Subramanian SS et al (2020) The formation and impact of landslide dams-state of the art. Earth Sci Rev. https://doi.org/10.1016/j.earscirev.2020.103116
Fielding AH, Bell JF (1997) A review of methods for the assessment of prediction errors in conservation presence/absence models. Environ Conserv 24(1):38–49. https://doi.org/10.1017/s0376892997000088
Hu KH, Zhang XP, Luo H, Liu BT, Chen HY (2020) Investigation on “6.17” debris flow disaster chain in Meilonggou, Danba County. J Mt Sci 38(06):945–951 (in Chinese)
Hussain MA, Chen ZL, Kalsoom I, Asghar A, Shoaib M (2022) Landslide susceptibility mapping using machine learning algorithm: a case study along Karakoram Highway (KKH). J Indian Soc Remote Sens, Pakistan. https://doi.org/10.1007/s12524-021-01451-1
Iqbal J, Dai FC, Xu L, Tu XB (2013) Characteristics of large-sized landslide dams around the world. EGU General Assembly Conference Abstracts, p 15
Korup O (2004) Geomorphometric characteristics of New Zealand landslide dams. Eng Geol 73(1–2):13–35
Lehre AK, Collins B, Dunne T (1982) Preliminary post-eruption sediment budget for the North Fork Toutle River drainage, June 1980-May 1981. Mount St Helens, Washington
Liang WZ, Luo SZ, Zhao GY, Wu H (2020) Predicting hard rock pillar stability using GBDT, XGBoost, and LightGBM algorithms. Mathematics 8(5):765. https://doi.org/10.3390/math8050765
Liu N, Cheng ZL, Cui P, Chen NS (2013) Dammed lake and risk management. Science Press, Beijing (in Chinese)
Liu N, Yang QG, Chen ZY (2016) Hazard mitigation for barrier lakes. Changjiang River Publication, China (in Chinese)
Lundberg SM, Lee SI (2017) A unified approach to interpreting model predictions. Conference on Neural Information Processing Systems, pp 4765–4774
Marmion M, Hjort J, Thuiller W, Luoto M (2008) A comparison of predictive methods in modelling the distribution of periglacial landforms in Finnish Lapland. Earth Surf Process Landf 33(14):2241–2254
Mcdonald JA, Pedersen J (1984) Alternative strategies for a permanent outlet for spirit lake near Mount St. Helens, Washington
Peng M, Zhang LM (2012a) Analysis of human risks due to dam break floods-part 2: application to Tangjiashan landslide dam failure. Nat Hazards 64(2):1899–1923
Peng M, Zhang LM (2012b) Breaching parameters of landslide dams. Landslides 9:13–31
Schuster RL (2004) Usoi landslide dam and lake Sarez, Pamir mountains, Tajikistan. Environ Eng Geosci 10(2):151–168
Shan YB, Chen SS, Zhong QM (2020) Rapid prediction of landslide dam stability using the logistic regression method. Landslides. https://doi.org/10.1007/s10346-020-01414-6
Shen DY, Shi ZM, Peng M, Zhang LM, Jiang MZ (2020) Longevity analysis of landslide dams. Landslides 17(8):1797–1821. https://doi.org/10.1007/s10346-020-01386-7
Shi ZM, Ma XL, Peng M, Zhang LM (2014) Statistical analysis and efficient dam burst modelling of landslide dams based on a large-scale database. Chin J Rock Mech Eng 33(9):1780–1790 (in Chinese)
Swanson FJ, Graham RL, Grant GE (1985) Some effects of slope movements on river channel. Proceedings of the International Symposium on Erosion, Debris Flow and Disaster Prevention, Tsukuba, Japan, pp 273–27
Swanson FJ, Oyagi N, Tominaga M (1986) Landslide dams in Japan. Landslide dams: processes, risk and mitigation. Geotech Special Publication (ASCE) 3:131–145
Swets J (1988) Measuring the accuracy of diagnostic systems. Science 240(4857):1285–1293
Tacconi Stefanelli C, Catani F, Casagli N (2015) Geomorphological investigations on landslide dams. Geoenviron Disasters 2:21
Tacconi Stefanelli C, Segoni S, Casagli N, Catani F (2016) Geomorphic indexing of landslide dams evolution. Eng Geol 208:1–10
Tong YX (2008) Quantitative analysis for stability of landslide dam. Master Thesis, National Central University, Taiwan (in Chinese)
Ullah B, Kamran M, Rui YC (2022) Predictive Modeling of short-term rockburst for the stability of subsurface structures using machine learning approaches: t-SNE, K-means clustering, and XGBoost. Mathematics 10(3):449. https://doi.org/10.3390/math10030449
Zheng HC, Shi ZM, Shen DY, Peng M, Hanley KJ, Ma CY, Zhang LM (2021) Recent advances in stability and failure mechanisms of landslide dams. Front Earth Sci 9:201. https://doi.org/10.3389/feart.2021.659935
Zhong QM, Chen SS, Mei SA, Cao W (2018) Numerical simulation of landslide dam breaching due to overtopping. Landslides 16(6):1183–1192
Funding
This work is supported by the National Science Fund for Distinguished Young Scholars (grant number 52125904), the National Natural Science Foundation of China (grant number 51979224), and the Program 2022TD-01 for Shaanxi Provincial Innovative Research Team (grant number 2022TD-01).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Shi, N., Li, Y., Wen, L. et al. Rapid prediction of landslide dam stability considering the missing data using XGBoost algorithm. Landslides 19, 2951–2963 (2022). https://doi.org/10.1007/s10346-022-01947-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10346-022-01947-y