Skip to main content

Advertisement

Log in

Discussion on the tree-based machine learning model in the study of landslide susceptibility

  • Original Paper
  • Published:
Natural Hazards Aims and scope Submit manuscript

Abstract

This study reported an application of the tree-based models to landslide susceptibility. The landslide inventory and ten conditioning factors were first constructed, based on data availability and climate. Subsequently, three tree-based models, decision tree (DT), DT-Boosting, and random forest (RF), were established and compared with the support vector machine (SVM) to analyze the difference in model prediction. Finally, the effect and causes of tree-based algorithms on prediction results were explored based on the working mechanism of the susceptibility model. Results show that there is no multicollinearity among the conditioning factors. The predicted results produced by the tree-based model display the discontinuous distribution compared with the SVM, not only presented in the point-based prediction but the surface-based heterogeneity. Moreover, heterogeneity on the susceptibility map relates to the tree-based algorithm and factor grading, especially the classification of important factors. Besides, DT-Boosting appears the highest numerical features, with large values of AUC (0.981), specificity (0.960), sensitivity (0.956) and accuracy (0.958) in the training phase, and high prediction of AUC (0.862), specificity (0.759), sensitivity (0.843) and accuracy (0.801) in the validation phase. In terms of fluctuation, the RF is smaller than that of DT-Boosting. Further, the susceptibility map generated by RF, with the largest D-value of 7.81, can well capture the difference in landslide susceptibility. This study provides a deep understanding for the application of tree-based machine learning models to landslide susceptibility.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16

Similar content being viewed by others

References

  • Achour Y, Pourghasemi HR (2020) How do machine learning techniques help in increasing accuracy of landslide susceptibility maps? Geosci Front 11:871–883

    Article  Google Scholar 

  • Akinci H, Zeybek M (2021) Comparing classical statistic and machine learning models in landslide susceptibility mapping in Ardanuc (Artvin) Turkey. Nat Hazard 108(2):1515–1543

    Article  Google Scholar 

  • Barella CF, Sobreira FG, Zêzere JL (2018) A comparative analysis of statistical landslide susceptibility mapping in the southeast region of Minas Gerais state, Brazil. Bull Eng Geol Env 78:3205–3221

    Article  Google Scholar 

  • Bui DT, Tsangaratos P, Nguyen V-T, Liem NV, Trinh PT (2020) Comparing the prediction performance of a Deep Learning Neural Network model with conventional machine learning models in landslide susceptibility assessment. CATENA 188:104426

    Article  Google Scholar 

  • Cama M, Conoscenti C, Lombardo L, Rotigliano E (2016) Exploring relationships between grid cell size and accuracy for debris-flow susceptibility models: a test in the Giampilieri catchment (Sicily, Italy). Environ Earth Sci 75(3):1–21

    Article  Google Scholar 

  • Catani F, Lagomarsino D, Segoni S, Tofani V (2013) Landslide susceptibility estimation by random forests technique: sensitivity and scaling issues. Nat Hazard 13:2815–2831

    Article  Google Scholar 

  • Chen X, Chen W (2021) GIS-based landslide susceptibility assessment using optimized hybrid machine learning methods. CATENA 196:104833

    Article  Google Scholar 

  • Chen W, Shirzadi A, Shahabi H, Ahmad BB, Zhang S, Hong H, Zhang N (2017) A novel hybrid artificial intelligence approach based on the rotation forest ensemble and naïve Bayes tree classifiers for a landslide susceptibility assessment in Langao County, China. Geomat Nat Haz Risk 8:1955–1977

    Article  Google Scholar 

  • Chen W, Shahabi H, Shirzadi A, Hong H, Akgun A, Tian Y, Liu J, Zhu AX, Li S (2018a) Novel hybrid artificial intelligence approach of bivariate statistical-methods-based kernel logistic regression classifier for landslide susceptibility modeling. Bull Eng Geol Env 78:4397–4419

    Article  Google Scholar 

  • Chen W, Yan X, Zhao Z, Hong H, Bui DT, Pradhan B (2018b) Spatial prediction of landslide susceptibility using data mining-based kernel logistic regression, naive Bayes and RBFNetwork models for the Long County area (China). Stoch Environ Res Risk Assess 78(1):247–266

    Google Scholar 

  • Chowdhuri I, Pal SC, Chakrabortty R, Malik S, Das B, Roy P (2021) Torrential rainfall-induced landslide susceptibility assessment using machine learning and statistical methods of eastern Himalaya. Nat Hazard 107:697–722

    Article  Google Scholar 

  • Costanzo D, Rotigliano E, Irigaray C, Jiménez-Perálvarez JD, Chacón J (2012) Factors selection in landslide susceptibility modelling on large scale following the gis matrix method: application to the river Beiro basin (Spain). Nat Hazard 12:327–340

    Article  Google Scholar 

  • Deprez M, De Kock T, De Schutter G, Cnudde V (2020) A review on freeze-thaw action and weathering of rocks. Earth-Sci Rev 203:103143

    Article  Google Scholar 

  • Dou J, Tien Bui D, Yunus AP, Jia K, Song X, Revhaug I, Xia H, Zhu Z (2015) Optimization of causative factors for landslide susceptibility evaluation using remote sensing and GIS data in parts of Niigata. Japan. Plos One 10:e0133262

    Article  Google Scholar 

  • Dou J, Yunus AP, Bui DT, Merghadi A, Sahana M, Zhu Z, Chen C-W, Han Z, Pham BT (2019a) Improved landslide assessment using support vector machine with bagging, boosting, and stacking ensemble machine learning framework in a mountainous watershed, Japan. Landslides 17:641–658

    Article  Google Scholar 

  • Dou J, Yunus AP, Tien Bui D, Merghadi A, Sahana M, Zhu Z, Chen CW, Khosravi K, Yang Y, Pham BT (2019b) Assessment of advanced random forest and decision tree algorithms for modeling rainfall-induced landslide susceptibility in the Izu-Oshima Volcanic Island, Japan. Sci Total Environ 662:332–346

    Article  Google Scholar 

  • Dou J, Yunus AP, Merghadi A, Shirzadi A, Nguyen H, Hussain Y, Avtar R, Chen Y, Pham BT, Yamagishi H (2020) Different sampling strategies for predicting landslide susceptibilities are deemed less consequential with deep learning. Sci Total Environ 720:137320

    Article  Google Scholar 

  • Fell R, Corominas J, Bonnard C, Cascini L, Leroi E, Savage WZ (2008) Guidelines for landslide susceptibility, hazard and risk zoning for land-use planning. Eng Geol 102:99–111

    Article  Google Scholar 

  • Gao J, Liu Y (2011) Climate warming and land use change in Heilongjiang Province, Northeast China. Appl Geogr 31:476–482

    Article  Google Scholar 

  • Goetz JN, Brenning A, Petschko H, Leopold P (2015) Evaluating machine learning and statistical prediction techniques for landslide susceptibility modeling. Comput Geosci 81:1–11

    Article  Google Scholar 

  • Gudiyangada Nachappa T, Tavakkoli Piralilou S, Gholamnia K, Ghorbanzadeh O, Rahmati O, Blaschke T (2020) Flood susceptibility mapping with machine learning, multi-criteria decision analysis and ensemble using Dempster Shafer Theory. J Hydrol 590:125275

    Article  Google Scholar 

  • Hong H, Pradhan B, Sameen MI, Chen W, Xu C (2017) Spatial prediction of rotational landslide using geographically weighted regression, logistic regression, and support vector machine models in Xing Guo area (China). Geomat Nat Haz Risk 8:1997–2022

    Article  Google Scholar 

  • Hong H, Miao Y, Liu J, Zhu AX (2019) Exploring the effects of the design and quantity of absence data on the performance of random forest-based landslide susceptibility mapping. CATENA 176:45–64

    Google Scholar 

  • Huang Y, Zhao L (2018) Review on landslide susceptibility mapping using support vector machines. CATENA 165:520–529

    Article  Google Scholar 

  • Huang F, Cao Z, Guo J, Jiang S-H, Li S, Guo Z (2020) Comparisons of heuristic, general statistical and machine learning models for landslide susceptibility prediction and mapping. CATENA 191:104580

    Article  Google Scholar 

  • Huggel C, Salzmann N, Allen S, Caplan-Auerbach J, Fischer L, Haeberli W, Larsen C, Schneider D, Wessels R (2010) Recent and future warm extreme events and high-mountain slope stability. Philos Trans A Math Phys Eng Sci 368:2435–2459

    Google Scholar 

  • Islam ARMT, Saha A, Ghose B, Pal SC, Chowdhuri I, Mallick J (2021) Landslide susceptibility modeling in a complex mountainous region of Sikkim Himalaya using new hybrid data mining approach. Geocarto Int 1–26

  • Kalantar B, Pradhan B, Naghibi SA, Motevalli A, Mansor S (2017) Assessment of the effects of training data selection on the landslide susceptibility mapping: a comparison between support vector machine (SVM), logistic regression (LR) and artificial neural networks (ANN). Geomat Nat Haz Risk 9:49–69

    Article  Google Scholar 

  • Li J, Wang W, Han Z, Li Y, Chen G (2020) Exploring the impact of multitemporal DEM data on the susceptibility mapping of landslides. Appl Sci 10(7):2518

    Article  Google Scholar 

  • Liu Q, Huang D, Tang A, Han X (2021a) Model performance analysis for landslide susceptibility in cold regions using accuracy rate and fluctuation characteristics. Nat Hazard 108(1):1047–1067

    Article  Google Scholar 

  • Liu Z, Gilbert G, Cepeda JM, Lysdahl AOK, Piciullo L, Hefre H, Lacasse S (2021b) Modelling of shallow landslides with machine learning algorithms. Geosci Front 12:385–393

    Article  Google Scholar 

  • Lombardo L, Opitz T, Ardizzone F, Guzzetti F, Huser R (2020) Space-time landslide predictive modelling. Earth-Sci Rev 209:103318

    Article  Google Scholar 

  • Ma Z, Mei G, Piccialli F (2020) Machine learning for landslides prevention: a survey. Neural Comput Appl 33(17):10881–10907

    Article  Google Scholar 

  • Merghadi A, Yunus AP, Dou J, Whiteley J, ThaiPham B, Bui DT, Avtar R, Abderrahmane B (2020) Machine learning methods for landslide susceptibility studies: A comparative overview of algorithm performance. Earth-Sci Rev 207:103225

    Article  Google Scholar 

  • Moayedi H, Mehrabi M, Mosallanezhad M, Rashid ASA, Pradhan B (2018) Modification of landslide susceptibility mapping using optimized PSO-ANN technique. Eng Comput 35:967–984

    Article  Google Scholar 

  • Nachappa TG, Ghorbanzadeh O, Gholamnia K, Blaschke T (2020) Multi-hazard exposure mapping using machine learning for the state of Salzburg, Austria. Remote Sens 12(17):2757

    Article  Google Scholar 

  • Nhu V-H, Hoang N-D, Nguyen H, Ngo PTT, Thanh Bui T, Hoa PV, Samui P, Tien Bui D (2020) Effectiveness assessment of Keras based deep learning with different robust optimization algorithms for shallow landslide susceptibility mapping at tropical area. CATENA 188:104458

    Article  Google Scholar 

  • Pal SC, Das B, Malik S (2019) Potential landslide vulnerability zonation using integrated analytic hierarchy process and GIS technique of upper rangit catchment area, West Sikkim, India. J Indian Soc Remote Sens 47:1643–1655

    Article  Google Scholar 

  • Palamakumbure D, Flentje P, Stirling D (2015) Consideration of optimal pixel resolution in deriving landslide susceptibility zoning within the Sydney Basin, New South Wales, Australia. Comput Geosci 82:13–22

    Article  Google Scholar 

  • Pham BT, Prakash I (2017) A novel hybrid model of bagging-based naïve bayes trees for landslide susceptibility assessment. Bull Eng Geol Env 78:1911–1925

    Article  Google Scholar 

  • Pham BT, Pradhan B, Tien Bui D, Prakash I, Dholakia MB (2016) A comparative study of different machine learning methods for landslide susceptibility assessment: A case study of Uttarakhand area (India). Environ Model Softw 84:240–250

    Article  Google Scholar 

  • Pourghasemi HR, Rahmati O (2018) Prediction of the landslide susceptibility: Which algorithm, which precision? CATENA 162:177–192

    Article  Google Scholar 

  • Pourghasemi HR, Kornejady A, Kerle N, Shabani F (2020) Investigating the effects of different landslide positioning techniques, landslide partitioning approaches, and presence-absence balances on landslide susceptibility mapping. CATENA 187:104364

    Article  Google Scholar 

  • Pradhan B, Lee S (2010) Landslide susceptibility assessment and factor effect analysis: backpropagation artificial neural networks and their comparison with frequency ratio and bivariate logistic regression modelling. Environ Model Softw 25:747–759

    Article  Google Scholar 

  • Reichenbach P, Rossi M, Malamud BD, Mihir M, Guzzetti F (2018) A review of statistically-based landslide susceptibility models. Earth Sci Rev 180:60–91

    Article  Google Scholar 

  • Saha A, Pal SC, Santosh M, Janizadeh S, Chowdhuri I, Norouzi A, Roy P, Chakrabortty R (2021) Modelling multi-hazard threats to cultural heritage sites and environmental sustainability: The present and future scenarios. J Clean Prod 320:128713

    Article  Google Scholar 

  • Saleem N, Huq ME, Twumasi NYD, Javed A, Sajjad A (2019) Parameters derived from and/or used with digital elevation models (DEMs) for landslide susceptibility mapping and landslide risk assessment: a review. ISPRS Int J Geo-Inf 8(12):545

    Article  Google Scholar 

  • Sameen MI, Pradhan B, Bui DT, Alamri AM (2020) Systematic sample subdividing strategy for training landslide susceptibility models. CATENA 187:104358

    Article  Google Scholar 

  • Soma AS, Kubota T, Mizuno H (2019) Optimization of causative factors using logistic regression and artificial neural network models for landslide susceptibility assessment in Ujung Loe Watershed, South Sulawesi Indonesia. J Mt Sci 16:383–401

    Article  Google Scholar 

  • Steger S, Brenning A, Bell R, Petschko H, Glade T (2016) Exploring discrepancies between quantitative validation results and the geomorphic plausibility of statistical landslide susceptibility maps. Geomorphology 262:8–23

    Article  Google Scholar 

  • Tehrany MS, Shabani F, Jebur MN, Hong H, Chen W, Xie X (2017) GIS-based spatial prediction of flood prone areas using standalone frequency ratio, logistic regression, weight of evidence and their ensemble techniques. Geomatics, Nat Hazard Risk 8(2):1538–1561

    Article  Google Scholar 

  • Thai Pham B, Shirzadi A, Tien Bui D, Prakash I, Dholakia MB (2018) A hybrid machine learning ensemble approach based on a Radial Basis Function neural network and Rotation Forest for landslide susceptibility modeling: a case study in the Himalayan area India. Int J Sedim Res 33(2):157–170

    Article  Google Scholar 

  • Thi Ngo PT, Panahi M, Khosravi K, Ghorbanzadeh O, Kariminejad N, Cerda A, Lee S (2021) Evaluation of deep learning algorithms for national scale landslide susceptibility mapping of Iran. Geosci Front 12:505–519

    Article  Google Scholar 

  • Tien Bui D, Tuan TA, Klempe H, Pradhan B, Revhaug I (2016) Spatial prediction models for shallow landslide hazards: a comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree. Landslides 13:361–378

    Article  Google Scholar 

  • Tien Bui D, Shirzadi A, Shahabi H, Geertsema M, Omidvar E, Clague J, Thai Pham B, Dou J, Talebpour Asl D, Bin Ahmad B, Lee S (2019) New ensemble models for shallow landslide susceptibility modeling in a semi-arid watershed. Forests 10(9):743

    Article  Google Scholar 

  • Van Natijne AL, Lindenbergh RC, Bogaard TA (2020) Machine learning: new potential for local and regional deep-seated landslide nowcasting. Sensors 20(5):1425

    Article  Google Scholar 

  • Wang G, Lei X, Chen W, Shahabi H, Shirzadi A (2020) Hybrid computational intelligence methods for landslide susceptibility mapping. Symmetry 12(3):325

    Article  Google Scholar 

  • Xiao T, Yin K, Yao T, Liu S (2019) Spatial prediction of landslide susceptibility using GIS-based statistical and machine learning models in Wanzhou County, Three Gorges Reservoir, China. Acta Geochimica 38:654–669

    Article  Google Scholar 

  • Yao T, Xue Y, Chen D, Chen F, Thompson L, Cui P, Koike T, Lau WKM, Lettenmaier D, Mosbrugger V, Zhang R, Xu B, Dozier J, Gillespie T, Gu Y, Kang S, Piao S, Sugimoto S, Ueno K, Wang L, Wang W, Zhang F, Sheng Y, Guo W, Ailikun Yang X, Ma Y, Shen SSP, Su Z, Chen F, Liang S, Liu Y, Singh VP, Yang K, Yang D, Zhao X, Qian Y, Zhang Y, Li Q (2019) Recent third pole’s rapid warming accompanies cryospheric melt and water cycle intensification and interactions between monsoon and environment: multidisciplinary approach with observations, modeling, and analysis. Bull Am Meteorol Soc 100:423–444

    Article  Google Scholar 

  • Yin G, Luo J, Niu F, Lin Z, Liu M (2021) Machine learning-based thermokarst landslide susceptibility modeling across the permafrost region on the Qinghai-Tibet Plateau. Landslides 18(7):2639–2649

    Article  Google Scholar 

  • Zhang L, Li Y, Zhang F, Chen L, Pan T, Wang B, Ren C (2020) Changes of winter extreme precipitation in Heilongjiang province and the diagnostic analysis of its circulation features. Atmosp Res 245:105094

    Article  Google Scholar 

  • Zhao D-M, Jiao Y-M, Wang J-L, Ding Y-P, Liu Z-L, Liu C-J, Qiu Y-M, Zhang J, Xu Q-E, Wu C-R (2020) Comparative performance assessment of landslide susceptibility models with presence-only, presence-absence, and pseudo-absence data. J Mt Sci 17(12):2961–2981

    Article  Google Scholar 

Download references

Acknowledgements

This research is supported and funded by the National Key Research and Development Program of China (2016YFE0202400 and 2018YFC0809605). The authors are grateful for this support, and would like to thank the professors for providing the dates used in this study as well. Data related to this study have been uploaded, and the authors confirm that the data supporting the findings of this study are available within the supplementary materials.

Funding

Funding was provided by “The National Key Research and Development Program of China” (Grant No. 2016YFE0202400 and 2018YFC0809605).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Aiping Tang.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (XLSX 63 kb)

Supplementary file2 (PDF 1346 kb)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liu, Q., Tang, A., Huang, Z. et al. Discussion on the tree-based machine learning model in the study of landslide susceptibility. Nat Hazards 113, 887–911 (2022). https://doi.org/10.1007/s11069-022-05329-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11069-022-05329-4

Keywords

Navigation