Abstract
An accurate assessment of debris-flow susceptibility is of great importance to the prevention and control of debris-flow disasters in mountainous areas. In this study, by applying the recursive feature elimination-random forest (RFE-RF) and the stacking ensemble learning including multiple heterogeneous learners, the high accuracy of the debris-flow susceptibility is assessed. The study area is determined in the Dongchuan District, Kunming City, Yunnan Province, China, where the debris-flows are prone to occur. By taking the grid unit as the assessment unit, 22 debris-flow hazard factors are preliminarily selected from multiple data sources, such as geology, topography, and precipitation, in accordance with the interpretation of debris-flow points. Next, total 16 factors are selected to construct the hazard factor system with the RFE-RF method, contribution rate, and Pearson correlation analysis for the primary factors. Finally, the base learners of the ensemble model are selected using accuracy and diversity metrics. In addition, the debris-flow susceptibility assessment of stacking ensemble learning, that multiplies the advantages and differences of different learners, is constructed, aiming at quantitatively analyzing the susceptibility of debris-flow in the study area. The natural breakpoint model is selected to classify the five levels for each grid unit. As for the prediction performance of the stacking ensemble learning including multiple heterogeneous learners, comparisons are conducted with the four base learner methods of support vector machine , back propagation neural network , extreme gradient boosting tree, and random forest (RF), as well as the four ensemble strategies of simple average , weighted average , weighted vote , and blending, respectively. As indicated by the results, the very low and low susceptibility zones of debris-flow are mainly concentrated in the eastern and western parts in the study area. The very high and high susceptibility zones are mainly distributed on the two banks of Xiaojiang River Valley and the south bank of Jinsha River, where there is fragile geological environment and high risk, in the study area. The medium susceptibility zone is mainly distributed around the very high and high susceptibility zones. There are excellent accuracy and stability in the stacking ensemble learning model of debris-flow susceptibility in the mountainous areas, when combining with the RFE-RF model and the diversity measurement. As for the stacking ensemble learning therein, the area under curve value of the receiver-operating characteristic, the accuracy value, and F1 score are the maximum, reaching 95.6%, 88.6%, and 88.9%, respectively. Besides, the root mean square error value is the minimum, namely 0.287, which indicates that stacking ensemble learning including multiple heterogeneous learners is a high-performance model for debris-flow susceptibility assessment. The findings can provide a scientific basis for the disaster prevention and mitigation in the mountainous areas.
Similar content being viewed by others
References
Althuwaynee OF, Pradhan B, Park HJ et al (2014) A novel ensemble decision tree-based CHi-squared Automatic Interaction Detection (CHAID) and multivariate logistic regression models in landslide susceptibility mapping. Landslides 11:1063–1078. https://doi.org/10.1007/s10346-014-0466-0
Angillieri MYE (2020) Debris flow susceptibility mapping using frequency ratio and seed cells, in a portion of a mountain international route Dry Central Andes of Argentina. CATENA 189:104504. https://doi.org/10.1016/j.catena.2020.104504
Anowar F, Sadaoui S, Selim B (2021) Conceptual and empirical comparison of dimensionality reduction algorithms (pca, kpca, lda, mds, svd, lle, isomap, le, ica, t-sne). Comput. Sci. Rev. 40:100378. https://doi.org/10.1016/j.cosrev.2021.100378
Bregoli F, Medina V, Chevalier G et al (2015) Debris-flow susceptibility assessment at regional scale: Validation on an alpine environment. Landslides 12:437–454. https://doi.org/10.1007/s10346-014-0493-x
Cama M, Conoscenti C, Lombardo L et al (2016) Exploring relationships between grid cell size and accuracy for debris-flow susceptibility models: a test in the Giampilieri catchment (Sicily, Italy). Environ Earth Sci 75:1–21. https://doi.org/10.1007/s12665-015-5047-6
Cao J, Zhang Z, Du J et al (2020) Multi-geohazards susceptibility mapping based on machine learning—A case study in Jiuzhaigou. China Nat Hazard 102:851–871. https://doi.org/10.1007/s11069-020-03927-8
Chao Z, Kunlong Y, Ying C (2020) Landslide susceptibility assessment by applying the coupling method of radial basis neural network and adaboost: A case study from the Three Gorges Reservoir area. Earth Sci 45:1865–1876. https://doi.org/10.3390/rs14092257
Chen H, Zhang LM (2015) EDDA 1.0: integrated simulation of debris flow erosion, deposition and property changes. Geosci Model Dev 8:829–844. https://doi.org/10.5194/gmd-8-829-2015
Chen J, Li Y, Zhou W et al (2017) Debris-flow susceptibility assessment model and its application in semiarid mountainous areas of the Southeastern Tibetan Plateau. Nat Hazard Rev 18:05016005. https://doi.org/10.1061/(ASCE)NH.1527-6996.0000229
Chen Z, Liang S, Ke Y et al (2020) Landslide susceptibility assessment using different slope units based on the evidential belief function model. Geocarto Int 35:1641–2166. https://doi.org/10.1080/10106049.2019.1582716
Chen T, GUESTRIN C et al (2016) Xgboost: A scalable tree boosting system; proceedings of the Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining. https://doi.org/10.1145/2939672.2939785
Chevalier GG, Medina V, Hürlimann M et al (2013) Debris-flow susceptibility analysis using fluvio-morphological parameters and data mining: application to the Central-Eastern Pyrenees. Nat Hazard 67:213–238. https://doi.org/10.1007/s11069-013-0568-3
Dash RK, Falae PO, Kanungo DP (2022) Debris flow susceptibility zonation using statistical models in parts of Northwest Indian Himalayas—implementation, validation, and comparative evaluation. Nat Hazard 111:2011–2058. https://doi.org/10.1007/s11069-021-05128-3
Du G, Zhang Y, Yang Z et al (2019) Landslide susceptibility mapping in the region of eastern Himalayan syntaxis, Tibetan Plateau, China: a comparison between analytical hierarchy process information value and logistic regression-information value methods. Bull Eng Geol Environ 78:4201–4215
Fang Z, Wang Y, Peng L et al (2021) A comparative study of heterogeneous ensemble-learning techniques for landslide susceptibility mapping. Int J Geog Inf Sci 35:321–347. https://doi.org/10.1080/13658816.2020.1808897
Gao R, Wang C, Han S et al (2022) A Research on cross-regional debris flow susceptibility mapping based on transfer learning. Remote Sens 14:4829. https://doi.org/10.3390/rs14194829
Giano SI, Pescatore E, Siervo V (2021) Morphometry and debris-flow susceptibility map in mountain drainage basins of the Vallo di Diano. Southern Italy Remote Sens 13:3254. https://doi.org/10.3390/rs13163254
Gorsevski PV, Brown MK, Panter K et al (2016) Landslide detection and susceptibility mapping using LiDAR and an artificial neural network approach: a case study in the Cuyahoga Valley National Park. Ohio Landslides 13:467–484. https://doi.org/10.1007/s10346-015-0587-0
Gu T, Li J, Wang M et al (2021) Landslide susceptibility assessment in Zhenxiong County of China based on geographically weighted logistic regression model. Geocarto Int. https://doi.org/10.1080/10106049.2021.1903571
Hong H, Liu J, Bui DT et al (2018) Landslide susceptibility mapping using J48 decision tree with AdaBoost, bagging and rotation forest ensembles in the Guangchang area (China). CATENA 163:399–413. https://doi.org/10.1016/j.catena.2018.01.005
Hu X, Zhang H, Mei H et al (2020) Landslide susceptibility mapping using the stacking ensemble machine learning method in Lushui. Southwest China Appl Sci 10:4016. https://doi.org/10.3390/app10114016
Hu X, Mei H, Zhang H et al (2021) Performance evaluation of ensemble learning techniques for landslide susceptibility mapping at the Jinping county, Southwest China. Nat Hazard 105:1663–1689. https://doi.org/10.1007/s11069-020-04371-4
Huang F, Yao C, Liu W et al (2018) Landslide susceptibility assessment in the Nantian area of China: a comparison of frequency ratio model and support vector machine. Geomatics Nat Hazards Risk 9:919–938. https://doi.org/10.1080/19475705.2018.1482963
Huang F, Zhang J, Zhou C et al (2020) A deep learning algorithm using a fully connected sparse autoencoder neural network for landslide susceptibility prediction. Landslides 17:217–229. https://doi.org/10.1007/s10346-019-01274-9
Huang F, Pan L, Fan X et al (2022a) The uncertainty of landslide susceptibility prediction modeling: suitability of linear conditioning factors. Bull Eng Geol Environ 81:1–19. https://doi.org/10.1007/s10064-022-02672-5
Huang H, wang Y, Li Y et al (2022b) Debris-flow susceptibility assessment in China: a comparison between traditional statistical and machine learning methods. Remote Sens 14:4475. https://doi.org/10.3390/rs14184475
Li Z, Chen J, Tan C et al (2021) Debris flow susceptibility assessment based on topo-hydrological factors at different unit scales: a case study of Mentougou district. Beijing Environ Earth Sci 80:1–19. https://doi.org/10.1007/s12665-021-09665-9
Li W, Fang Z, Wang Y (2022) Stacking ensemble of deep learning methods for landslide susceptibility mapping in the Three Gorges Reservoir area. China Stochastic Environ Res Risk Assess 36:2207–2228. https://doi.org/10.1007/s00477-021-02032-x
Liang Z, Wang CM, Zhang ZM et al (2020) A comparison of statistical and machine learning methods for debris flow susceptibility mapping. Stochastic Environ Res Risk Assess 34:1887–1907. https://doi.org/10.1007/s00477-020-01851-8
Liu YY, Di BF, Yu Z (2018) Debris flows susceptibility assessment in Wenchuan earthquake areas based on random forest algorithm model. Mt Res 36:765–773. https://doi.org/10.16089/j.cnki.1008-2786.000372
Liu X, Miao C, Guo L (2019) Acceptability of debris-flow disasters: comparison of two case studies in China. Int J Disaster Risk Reduct 34:45–54. https://doi.org/10.1016/j.ijdrr.2018.11.004
Marino P, Subramanian SS, Fan X et al (2022) Changes in debris-flow susceptibility after the Wenchuan earthquake revealed by meteorological hydro-meteorological thresholds. CATENA 210:105929. https://doi.org/10.1016/j.catena.2021.105929
Ministry of Natural Resources (2021) National Geological Disaster Bulletin [EB/OL]. https://www.mnr.gov.cn/
Pandey VK, Sharma KK, Pourghasemi HR et al (2019) Sedimentological characteristics and application of machine learning techniques for landslide susceptibility modelling along the highway corridor Nahan to Rajgarh (Himachal Pradesh) India. CATENA 182:104150. https://doi.org/10.1016/j.catena.2019.104150
Polat A (2021) An innovative, fast method for landslide susceptibility mapping using GIS-based LSAT toolbox. Environ Earth Sci 80:1–18. https://doi.org/10.1007/s12665-021-09511-y
Qiao S, Qin S, Sun J et al (2021)Development of a region-partitioning method for debris flow susceptibility mapping. J Mountain Sci 18(5):1177–1191. https://doi.org/10.1007/s11629-020-6497-1
Qin SW, Qiao SS, Yao JY et al (2022) Establishing a GIS-based evaluation method considering spatial heterogeneity for debris flow susceptibility mapping at the regional scale. Nat Hazard. https://doi.org/10.1007/s11069-022-05487-5
Qing F, Zhao Y, Meng X et al (2020) Application of machine learning to debris flow susceptibility mapping along the China-Pakistan Karakoram highway. Remote Sens 12:2933. https://doi.org/10.3390/rs12182933
Qiu C, Su L, Zou Q et al (2022) A hybrid machine-learning model to map glacier-related debris flow susceptibility along Gyirong Zangbo watershed under the changing climate. Sci Total Environ 818:151752. https://doi.org/10.1016/j.scitotenv.2021.151752
Qiu ML, Huang L, Tang BH (2023) Bridge detection method for HSRRSIs based on YOLOv5 with a decoupled head. Int J Digital Earth 16(1):113–129. https://doi.org/10.1080/17538947.2022.2163514
Rahmati O, Tahmasebipour N, Haghizadeh A et al (2017) Evaluation of different machine learning models for predicting and mapping the susceptibility of gully erosion. Geomorphology 298:118–137. https://doi.org/10.1016/j.geomorph.2017.09.006
Wang S, Dong P, Tian Y (2017) A novel method of statistical line loss estimation for distribution feeders based on feeder cluster and modified XGBoost. Energies 10:2067. https://doi.org/10.3390/en10122067
Wang Y, Duan H, Hong H (2019) A comparative study of composite kernels for landslide susceptibility mapping: a case study in Yongxin County China. CATENA 183:104217. https://doi.org/10.1016/j.catena.2019.104217
Wang Y, Feng L, Li S et al (2020) A hybrid model considering spatial heterogeneity for landslide susceptibility mapping in Zhejiang Province China. CATENA 188:104425. https://doi.org/10.1016/j.catena.2019.104425
Wang C, Wang X, Zhang H et al (2022) Assessment of environmental geological disaster susceptibility under a multimodel comparison to aid in the sustainable development of the regional economy. Environ Sci Pollut Res 30(3):1–19. https://doi.org/10.1007/s11356-022-22649-x
Wolpert DH (1992) Stacked Generalization Neural Networks 5:241–259. https://doi.org/10.1016/S0893-6080(05)80023-1
Wu B, Qiu W, Jia J et al (2020a) Landslide susceptibility modeling using bagging-based positive-unlabeled learning. IEEE Geosci Remote Sens Lett 18:766–770. https://doi.org/10.1109/LGRS.2020.2989497
Wu Y, Ke Y, Chen Z et al (2020b) Application of alternating decision tree with AdaBoost and bagging ensembles for landslide susceptibility mapping. CATENA 187:104396. https://doi.org/10.1016/j.catena.2019.104396
Xiong K, Adhikari BR, Stamatopoulos CA et al (2020) Comparison of different machine learning methods for debris flow susceptibility mapping: a case study in the Sichuan Province. China Remote Sens 12:295. https://doi.org/10.3390/rs12020295
Xu K, Guo Q, Li Z et al (2015) Landslide susceptibility evaluation based on BPNN and GIS: a case of Guojiaba in the Three Gorges Reservoir Area. Int J Geog Inf Sci 29:1111–1124. https://doi.org/10.1080/13658816.2014.992436
Xu J, Cheng X, Huang Q et al (2017) Susceptibility evaluation of debris flow based on experience weight method combined with “3S” technology: a case study from Dongchuan in Yunnan Province, China. In Proceedings of the IOP Conference Series: Earth and Environmental Science. p. 022051. https://doi.org/10.1088/1755-1315/95/2/022051
Youssef AM, Pourghasemi HR, Pourtaghi ZS et al (2016) Landslide susceptibility mapping using random forest, boosted regression tree, classification and regression tree, and general linear models and comparison of their performance at Wadi Tayyah Basin, Asir Region, Saudi Arabia. Landslides 13:839–856. https://doi.org/10.1007/s10346-015-0614-1
Zêzere J, Pereira S, Melo R et al (2017) Mapping landslide susceptibility using data-driven methods. Sci Total Environ 589:250–267. https://doi.org/10.1016/j.scitotenv.2017.02.188
Zhang S, Wu G (2019) Debris flow susceptibility and its reliability based on random forest and GIS. Earth Sci 44:3115–3134. https://doi.org/10.3799/dqkx.2019.081
Zhang Y, Ge T, Tian W et al (2019) Debris flow susceptibility mapping using machine-learning techniques in Shigatse area. China Remote Sens 11:2801. https://doi.org/10.3390/rs11232801
Zhou Y, Yue D, Liang G et al (2022) Risk assessment of debris flow in a mountain-basin area, western China. Remote Sens 14:2942. https://doi.org/10.3390/rs14122942
Funding
This research was funded by the National Natural Science Foundation of China (41761081) and Basic Research Program of Yunnan Province (202201AU070112).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Li, K., Zhao, J. & Lin, Y. Debris-flow susceptibility assessment in Dongchuan using stacking ensemble learning including multiple heterogeneous learners with RFE for factor optimization. Nat Hazards 118, 2477–2511 (2023). https://doi.org/10.1007/s11069-023-06099-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11069-023-06099-3