Skip to main content
Log in

Flood susceptibility mapping in an arid region of Pakistan through ensemble machine learning model

  • Original Paper
  • Published:
Stochastic Environmental Research and Risk Assessment Aims and scope Submit manuscript

Abstract

Floods are among the most destructive natural hazards. Therefore, their prediction is pivotal for flood management and public safety. Factors contributing to flood are different for every watershed as they depend upon the characteristics of each watershed. Therefore, this study evaluated the factors contributing to flood and the precise location of high and very high flood susceptibility regions in Karachi. A new ensemble model (LR-SVM-MLP) is introduced to develop the susceptibility map and evaluate influencing factors. This ensemble model was formed by employing a stacking ensemble on Logistic Regression (LR), Support Vector Machine (SVM), and Multi-Layer Perceptron (MLP). A spatial database was generated for the Karachi watershed, which included; twelve conditioning factors as independent variables, 652 flood points and the same number of non-flood points as dependent variables. This data was then randomly divided into 70% and 30% to train and validate models, respectively. To analyse the collinearity among factors and to scrutinize each variable's predictive power, multicollinearity test and Information Gain Ratio were applied, respectively. After training, the models were evaluated on various statistical measures and compared with benchmark models. Results revealed that the proposed ensemble model outperformed Logistic Regression (LR), Support Vector Machine (SVM), and Multi-Layer Perceptron (MLP) and produced a precise and accurate map. Results of ensemble model showed 99% accuracy in training and 98% accuracy in testing datasets. This ensemble model can be used by flood management authorities and the government to contribute to future research studies.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Availability of data and material

Not Applicable.

Code availability

Not Applicable.

References

Download references

Funding

This work was funded by the National Key Research and Development Program (2018YFC1506506), the Frontier Project of the Applied Foundation of Wuhan (2019020701011502), the Key Research and Development Program of Jiangxi Province (20201BBG71002), and the LIESMARS Special Research Funding.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jianzhong Lu.

Ethics declarations

Conflict of interest

The authors have not disclosed any competing interests.

Ethical approval

Not Applicable.

Consent to participate

Not Applicable.

Consent for publication

Not Applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yaseen, A., Lu, J. & Chen, X. Flood susceptibility mapping in an arid region of Pakistan through ensemble machine learning model. Stoch Environ Res Risk Assess 36, 3041–3061 (2022). https://doi.org/10.1007/s00477-022-02179-1

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00477-022-02179-1

Keywords

Navigation