Skip to main content

Advertisement

Log in

Application and interpretability of ensemble learning for landslide susceptibility mapping along the Three Gorges Reservoir area, China

  • Original Paper
  • Published:
Natural Hazards Aims and scope Submit manuscript

Abstract

Landslides pose a significant threat to China’s Three Gorges Reservoir area. Many ensemble learning models have been applied to landslide susceptibility mapping (LSM) in this region, as it forms the foundation of landslide risk management. However, most landslide susceptibility models lack interpretability, hindering the explanation of the relative importance and interactive mechanisms among landslide conditioning factors. This study evaluates and interprets three tree-based ensemble learning models—XGBoost, Random Forest (RF), and Light GBM—for LSM in the Yichang section of the Three Gorges Reservoir area, employing SHAP (SHapley Additive exPlanations) analysis. Among these models, XGBoost and RF exhibit similar the area under the receiver operating characteristic curve (AUROC) values of 0.96 and 0.95, slightly outperforming Light GBM with an AUROC of 0.93. We identify four crucial landslide conditioning factors from a dataset of 714 landslide samples by individual interpretation, shedding light on specific elements that drive higher susceptibility and recommending suitable mitigation measures for different landslide. Global interpretation via SHAP reveals that elevation, Normalized Difference Vegetation Index, distance from river, distance from road, slope, and lithology are the primary factors influencing landslide susceptibility. We delve deeply into the relationships among these factors, their values, and the mechanisms triggering landslides. In addition, to enhance the credibility and reliability of SHAP interpretation results, we cross-referenced these results with relevant literature on the formation mechanism of landslides in the Three Gorges Reservoir area. This study contributes to a better understanding of landslide risk management and bridges the gap between advanced machine learning models and interpretable results by introducing SHAP. Furthermore, we augment the SHAP analysis results with domain-specific expertise in the field of landslides, helping to bridge the potential shortcomings of SHAP as a data-driven-based approach.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

Data availability

Some codes generated during the study are available in the following repository. Contact: e-mail: liubo_lb68@163.com. Software required: Python3.7. The partial source codes are available for downloading at the link: https://github.com/liubo-lb68/LSM/blob/main/SHAP

References

Download references

Acknowledgements

We would like to express our gratitude to the Geological Environmental Center of Hubei Province for providing crucial spatial data, including the coordinates of 714 landslide samples, which greatly contributed to this research. Their cooperation and support were instrumental in the success of this study.

Furthermore, we extend our appreciation to the editorial team of the journal and the anonymous reviewers for their valuable feedback and suggestions during the review process. Their input significantly improved the quality and completeness of this paper.

Funding

This research is supported by the National Natural Science Foundation of China [Grant Numbers 72074198, 71874165]; the National Social Science Fund of China [grant numbers 23AZD072]; Key Research and Development Program of Hubei Province [Grant Number 2021BCA219]; Young Talents Foundation of The Central Propaganda Department [Grant Number 2020084007]; the Fundamental Research Funds for the Central Universities, China University of Geosciences (Wuhan) [Grant Number CUG2642022006].

Author information

Authors and Affiliations

Authors

Contributions

BL involved in conceptualization, methodology, software, writing—original draft, and investigation. HG involved in validation, resources, and writing—review and editing. JL involved in resources and writing—review and editing. XK involved in writing—review and editing. XH involved in software.

Corresponding author

Correspondence to Haixiang Guo.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary file1 (XLSX 91 kb)

Appendices

Appendix 1: The Pearson correlation coefficient between pairs of 15 landslide conditioning factors

 

Elevation

Aspect

Slope

Plan curvature

Profile curvature

TRI

Surface roughness

TWI

SPI

Distance from river

Land use

Distance from road

Distance from fault

Lithology

NDVI

Elevation

1.00

0.01

0.20

0.00

− 0.10

0.22

0.15

− 0.12

0.03

0.48

0.36

0.43

− 0.05

0.18

0.48

Aspect

 

1.00

0.01

0.00

0.00

0.00

0.00

0.03

0.03

− 0.02

0.07

− 0.01

0.01

0.01

− 0.03

Slope

  

1.00

0.02

− 0.01

0.75

0.89

− 0.32

0.35

− 0.05

0.35

0.10

− 0.07

0.01

0.27

Plan Curvature

   

1.00

− 0.35

0.01

0.03

− 0.41

− 0.46

− 0.06

0.00

− 0.02

− 0.01

− 0.01

− 0.03

Profile Curvature

    

1.00

− 0.03

− 0.03

0.24

0.25

− 0.03

− 0.05

− 0.02

− 0.02

0.03

− 0.02

TRI

     

1.00

0.71

− 0.16

0.33

− 0.04

0.34

0.11

− 0.06

− 0.03

0.28

Surface Roughness

      

1.00

− 0.25

0.27

− 0.04

0.27

0.10

− 0.06

0.00

0.23

TWI

       

1.00

0.72

− 0.01

− 0.11

− 0.04

0.01

− 0.06

− 0.07

SPI

        

1.00

− 0.03

0.12

0.01

− 0.06

− 0.03

0.10

Distance from River

         

1.00

0.08

0.29

− 0.02

0.12

0.33

Land Use

          

1.00

0.20

− 0.03

0.15

0.44

Distance from Road

           

1.00

0.08

0.09

0.47

Distance from Fault

            

1.00

− 0.04

− 0.04

Lithology

             

1.00

0.29

NDVI

              

1.00

Appendix 2: Landslide conditioning factors

See Figs. 13, 14, and 15.

Fig. 13
figure 13

Factor layers of the study area (a elevation, b slope, c aspect, d profile curvature, e plan curvature, f topographic relief)

Fig. 14
figure 14

Factor layers of the study area (a surface roughness, b topographic wetness index, c land use, d stream power index, e NDVI, f distance from road)

Fig. 15
figure 15

Factor layers of the study area (a distance from fault, b lithology, c distance from river)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liu, B., Guo, H., Li, J. et al. Application and interpretability of ensemble learning for landslide susceptibility mapping along the Three Gorges Reservoir area, China. Nat Hazards 120, 4601–4632 (2024). https://doi.org/10.1007/s11069-023-06374-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11069-023-06374-3

Keywords

Navigation