Skip to main content

Advertisement

Log in

A proposal for an approach to mapping susceptibility to landslides using natural language processing and machine learning

  • Original paper
  • Published:
Landslides Aims and scope Submit manuscript

Abstract

Compiling an inventory is a fundamental step for carrying out assessments of landslide hazards. However, data in sufficient quantity and quality are not always available. Thus, this study puts forward an approach for drawing up a landslide inventory using textual data from telephone records, and for mapping hazards of landslides in an urban area. Forty thousand seven hundred ninety-two textual records and the naive Bayes algorithm were used to classify them, and these form the landslide inventory. After creating the inventory, the random forest algorithm with 12 conditioning variables was used to map landslide hazards. The text classification model obtained an accuracy of 0.8671 and a Kappa index of 0.8038. The hazard mapping model obtained accuracy of 0.9503 and an AUC (area under the curve)-ROC (receiver operating characteristics) of 0.9870. The results produced by the model were also compared with real landslides reported in news reports and were shown to be close to what had happened, thus demonstrating the ability of the proposed approach to predict landslides. Finally, the proposed approach can be used in simulation environments, thereby supporting strategic decision-making associated with hazard analysis.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

Download references

Funding

This study was financed in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001. This study is also financially supported by the National Council for Scientific and Technological Development (CNPq)–process numbers 305792/2017-2, 305119/2017-6, and 421851/2018-0.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Maisa Mendonça Silva.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Rodrigues, S.G., Silva, M.M. & Alencar, M.H. A proposal for an approach to mapping susceptibility to landslides using natural language processing and machine learning. Landslides 18, 2515–2529 (2021). https://doi.org/10.1007/s10346-021-01643-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10346-021-01643-3

Keywords

Navigation