Skip to main content

A novel hybrid intelligent model of support vector machines and the MultiBoost ensemble for landslide susceptibility modeling

Abstract

The main aim of this study is to propose a novel hybrid intelligent model named MBSVM which is an integration of the MultiBoost ensemble and a support vector machine (SVM) for modeling of susceptibility of landslides in the Uttarakhand State, Northern India. Firstly, a geospatial database for the study area was prepared, which includes 391 historical landslides and 16 landslide-affecting factors. Then, the sensitivity of different combinations of these factors for modeling was validated using the forward elimination technique. The MBSVM landslide model was built using the datasets generated from the best selected factors and validated utilizing the area under the receiver operating characteristic (ROC) curve (AUC), statistical indexes, and the Wilcoxon signed-rank test. Results show that this novel hybrid model has good performance both in terms of goodness of fit with the training dataset (AUC = 0.972) and the capability to predict landslides with the testing dataset (AUC = 0.966). The efficiency of the proposed model was then validated by comparison with logistic regression (LR), a single SVM, and another hybrid model of the AdaBoost ensemble and an SVM (ABSVM). Comparison results show that the MBSVM outperforms the LR, single SVM, and hybrid ABSVM models. Thus, the proposed model is a promising and good alternative tool for landslide hazard assessment in landslide-prone areas.

This is a preview of subscription content, access via your institution.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

References

  • Abdulwahid WM, Pradhan B (2016) Landslide vulnerability and risk assessment for multi-hazard scenarios using airborne laser scanning data (LiDAR) Landslides:1–20

  • Althuwaynee OF, Pradhan B, Park H-J, Lee JH (2014) A novel ensemble bivariate statistical evidential belief function with knowledge-based analytical hierarchy process and multivariate statistical logistic regression for landslide susceptibility mapping. Catena 114:21–36

    Article  Google Scholar 

  • Antronico L, Borrelli L, Coscarelli R, Gullà G (2015) Time evolution of landslide damages to buildings: the case study of Lungro (Calabria, southern Italy). Bull Eng Geol Environ 74:47–59

    Article  Google Scholar 

  • Antronico L, Borrelli L, Coscarelli R, Pasqua A, Petrucci O, Gullà G (2013) Slope movements induced by rainfalls damaging an urban area: the Catanzaro case study (Calabria, southern Italy). Landslides 10:801–814

    Article  Google Scholar 

  • Araújo MB, New M (2007) Ensemble forecasting of species distributions trends in ecology & evolution 22:42-47

  • Breiman L (1996) Bagging predictors. Mach Learn 24:123–140

    Google Scholar 

  • Breiman L (2001) Random forests. Mach Learn 45:5–32

    Article  Google Scholar 

  • Budimir M, Atkinson P, Lewis H (2015) A systematic review of landslide probability mapping using logistic regression. Landslides 12:419–436

    Article  Google Scholar 

  • Chen H, Lee C (2003) A dynamic model for rainfall-induced landslides on natural slopes. Geomorphology 51:269–288

    Article  Google Scholar 

  • Chen W, Pourghasemi HR, Kornejady A, Zhang N (2017a) Landslide spatial modeling: introducing new ensembles of ANN, MaxEnt, and SVM machine learning techniques. Geoderma 305:314–327

    Article  Google Scholar 

  • Chen W, Pourghasemi HR, Naghibi SA (2017b) A comparative study of landslide susceptibility maps produced using support vector machine with different kernel functions and entropy data mining models in China. Bull Eng Geol Environ:1–18

  • Chung C-JF, Fabbri AG (1999) Probabilistic prediction models for landslide hazard mapping. Photogrammetric Eng Remote Sens 65:1389–1399

    Google Scholar 

  • Chung C-JF, Fabbri AG (2003) Validation of spatial prediction models for landslide hazard mapping. Natural Hazards 30:451–472

    Article  Google Scholar 

  • Dash M, Liu H (1997) Feature selection for classification. Intell Data Anal 1:131–156

    Article  Google Scholar 

  • Dickson ME, Perry GL (2016) Identifying the controls on coastal cliff landslides using machine-learning approaches. Environ Modell Soft 76:117–127

    Article  Google Scholar 

  • Dietterich TG (1998) Approximate statistical tests for comparing supervised classification learning algorithms. Neural Comput 10:1895–1923

    Article  Google Scholar 

  • Ding Q, Chen W, Hong H (2017) Application of frequency ratio, weights of evidence and evidential belief function models in landslide susceptibility mapping. Geocarto Int 32:619–639

    Google Scholar 

  • Erdal HI, Karakurt O (2013) Advancing monthly streamflow prediction accuracy of CART models using ensemble learning paradigms. J Hydrol 477:119–128

    Article  Google Scholar 

  • Fell R, Corominas J, Bonnard C, Cascini L, Leroi E, Savage W, on behalf of the JTC-1 Joint Technical Committee on Landslides and Engineered Slopes (2008) Guidelines for landslide susceptibility, hazard and risk zoning for land use planning. Eng Geol 102:85–98

    Article  Google Scholar 

  • Freund Y, Schapire RE (1995) A desicion-theoretic generalization of on-line learning and an application to boosting. In: European conference on computational learning theory. Springer, pp 23–37

  • Freund Y, Schapire RE (1997) A decision-theoretic generalization of on-line learning and an application to boosting. J Comput Syst Sci 55:119–139

    Article  Google Scholar 

  • Frye C (2007) About the Geometrical Interval Classification Method. http://blogs.esri.com/esri/arcgis

  • Glade T, Anderson MG, Crozier MJ (2006) Landslide hazard and risk. John Wiley & Sons

  • Goh AT, Zhang Y, Zhang R, Zhang W, Xiao Y (2017) Evaluating stability of underground entry-type excavations using multivariate adaptive regression splines and logistic regression. Tunn Underground Space Technol 70:148–154

    Article  Google Scholar 

  • Gorsevski PV, Brown MK, Panter K, Onasch CM, Simic A, Snyder J (2016) Landslide detection and susceptibility mapping using LiDAR and an artificial neural network approach: a case study in the Cuyahoga Valley National Park. Ohio Landslides 13:467–484

    Article  Google Scholar 

  • Gullà G, Peduto D, Borrelli L, Antronico L, Fornaro G (2017) Geometric and kinematic characterization of landslides affecting urban areas: the Lungro case study (Calabria, southern Italy). Landslides 14:171–188

    Article  Google Scholar 

  • Guzzetti F, Reichenbach P, Ardizzone F, Cardinali M, Galli M (2006) Estimating the quality of landslide susceptibility models. Geomorphology 81:166–184

    Article  Google Scholar 

  • Hájek P, Olej V (2015) Word Categorization of Corporate Annual Reports for Bankruptcy Prediction by Machine Learning Methods. In: International Conference on Text, Speech, and Dialogue. Springer, pp 122–130

  • Hájek P, Olej V, Myšková R (2015) Predicting financial distress of banks using random subspace ensembles of support vector machines. In: Artificial Intelligence Perspectives and Applications. Springer, pp 131–140

  • Hattermann F et al (2017) Cross-scale intercomparison of climate change impacts simulated by regional and global hydrological models in eleven large river basins. Clim Change 141:561–576

    Article  Google Scholar 

  • Hoang N-D, Tien Bui D (2018) Predicting earthquake-induced soil liquefaction based on a hybridization of kernel Fisher discriminant analysis and a least squares support vector machine: a multi-dataset study. Bull Eng Geol Environ 77:191–204

    Article  Google Scholar 

  • Hoehler FK (2000) Bias and prevalence effects on kappa viewed in terms of sensitivity and specificity. J Clin Epidemiol 53:499–503

    Article  Google Scholar 

  • Hong H et al (2018) Landslide susceptibility mapping using J48 decision tree with AdaBoost, Bagging and Rotation Forest ensembles in the Guangchang area (China). CATENA 163:399–413

    Article  Google Scholar 

  • Hong H et al (2017) A novel hybrid integration model using support vector machines and random subspace for weather-triggered landslide susceptibility assessment in the Wuning area (China). Environ Earth Sci 76:652

    Article  Google Scholar 

  • Hosmer DW, Lemeshow S (2000) Interpretation of the fitted logistic regression model Applied Logistic Regression, Second Edition:47–90

  • Huang J, Wu P, Zhao X (2013) Effects of rainfall intensity, underlying surface and slope gradient on soil infiltration under simulated rainfall experiments. Catena 104:93–102

    Article  Google Scholar 

  • Hyndman RJ, Koehler AB (2006) Another look at measures of forecast accuracy. Int J Forecast 22:679–688

    Article  Google Scholar 

  • Ilia I, Tsangaratos P (2016) Applying weight of evidence method and sensitivity analysis to produce a landslide susceptibility map. Landslides 13:379–397

    Article  Google Scholar 

  • Jaafari A, Najafi A, Pourghasemi H, Rezaeian J, Sattarian A (2014) GIS-based frequency ratio and index of entropy models for landslide susceptibility assessment in the Caspian forest, northern Iran. Int J Environ Sci Technol 11:909–926

    Article  Google Scholar 

  • Jaafari A, Najafi A, Rezaeian J, Sattarian A, Ghajar I (2015) Planning road networks in landslide-prone areas: a case study from the northern forests of Iran. Land Use Policy 47:198–208

    Article  Google Scholar 

  • Jaafari A, Rezaeian J, Omrani MSO (2017) Spatial prediction of slope failures in support of forestry operations safety. Croatian J Forest Eng 38:107–118

    Google Scholar 

  • Jelinek H, Abawajy J, Kelarev A, Chowdhury M, Stranieri A (2014) Decision trees and multi-level ensemble classifiers for neurological diagnostics. Austral J Med Sci 1:1–12

    Google Scholar 

  • Kelarev A, Dazeley R, Stranieri A, Yearwood J, Jelinek H (2012) Detection of CAN by ensemble classifiers based on ripple down rules. In: Pacific Rim Knowledge Acquisition Workshop. Springer, pp 147–159

  • Khosravi K et al (2018) A comparative assessment of decision trees algorithms for flash flood susceptibility modeling at Haraz watershed, northern Iran. Sci Total Environ 627:744–755

    Article  Google Scholar 

  • Komori O, Eguchi S, Ikeda S, Okamura H, Ichinokawa M, Nakayama S (2016) An asymmetric logistic regression model for ecological data. Methods Ecol Evol 7:249–260

    Article  Google Scholar 

  • Kotti M, Benetos E, Kotropoulos C, Pitas I (2007) A neural network approach to audio-assisted movie dialogue detection. Neurocomputing 71:157–166

    Article  Google Scholar 

  • Kumar* A, Mittal H, Kumar R, Ahluwalia RS (2017) Empirical Attenuation relationship for Peak Ground Horizontal Acceleration for North-East Himalaya. Vietnam J Earth Sci 39(1):47-57

  • Landis JR, Koch GG (1977) The measurement of observer agreement for categorical data biometrics:159–174

  • Lau NN (2018) Determination of ground displacement of 25 April 2015 Nepal earthquake by GNSS precise point positioning Vietnam J Earth Sci 40:17–25

  • Margottini C, Canuti P, Sassa K (2013) Landslide science and practice vol 1. Springer

  • Marjanović M, Kovačević M, Bajat B, Voženílek V (2011) Landslide susceptibility assessment using SVM machine learning algorithm. Eng Geol 123:225–234

    Article  Google Scholar 

  • Meinhardt M, Fink M, Tünschel H (2015) Landslide susceptibility analysis in central Vietnam based on an incomplete landslide inventory: Comparison of a new method to calculate weighting factors by means of bivariate statistics) Geomorphology 234:80–97

    Article  Google Scholar 

  • Moosavi V, Niazi Y (2016) Development of hybrid wavelet packet-statistical models (WP-SM) for landslide susceptibility mapping. Landslides 13:97–114

    Article  Google Scholar 

  • Neuhäuser B, Terhorst B (2007) Landslide susceptibility assessment using “weights-of-evidence” applied to a study area at the Jurassic escarpment (SW-Germany). Geomorphology 86:12–24

    Article  Google Scholar 

  • Parhami B (2006) Introduction to parallel processing: algorithms and architectures. Springer Science & Business Media

  • Pham BT, Bui DT, Prakash I (2017a) Landslide Susceptibility Assessment Using Bagging Ensemble Based Alternating Decision Trees, Logistic Regression and J48 Decision Trees Methods: A Comparative Study Geotechnical and Geological Engineering:1–15

  • Pham BT, Khosravi K, Prakash I (2017b) Application and comparison of decision tree-based machine learning methods in landside susceptibility assessment at Pauri Garhwal area, Uttarakhand, India Environmental Processes:1–20

  • Pham BT, Nguyen V-T, Ngo V-L, Trinh PT, Ngo HTT, Bui DT (2017c) A Novel Hybrid Model of Rotation Forest Based Functional Trees for Landslide Susceptibility Mapping: A Case Study at Kon Tum Province, Vietnam. In: Tien Bui D. NDA, Bui HB., Hoang ND (ed) International Conference on Geo-Spatial Technologies and Earth Resources. Springer, pp 186–201

  • Pham BT, Pradhan B, Tien Bui D, Prakash I, Dholakia MB (2016a) A comparative study of different machine learning methods for landslide susceptibility assessment: a case study of Uttarakhand area (India). Environ Model Software 84:240–250. https://doi.org/10.1016/j.envsoft.2016.07.005

    Article  Google Scholar 

  • Pham BT, Prakash I (2017a) Evaluation and Comparison of LogitBoost Ensemble, Fisher’s Linear Discriminant Analysis, Logistic Regression, and Support Vector Machines Methods for Landslide Susceptibility Mapping Geocarto International:1–32

  • Pham BT, Prakash I (2017b) A Novel Hybrid Intelligent Approach of Random Subspace Ensemble and Reduced Error Pruning Trees for Landslide Susceptibility Modeling: A Case Study at Mu Cang Chai District, Yen Bai Province, Viet Nam. In: Tien Bui D. NDA, Bui HB., Hoang ND (ed) International Conference on Geo-Spatial Technologies and Earth Resources. Springer, pp 255–269

  • Pham BT, Prakash I (2017c) A novel hybrid model of Bagging-based Naïve Bayes Trees for landslide susceptibility assessment. Bulletin of Engineering Geology and the Environment:1–15

  • Pham BT, Prakash I (2017d) Spatial Prediction of Rainfall Induced Shallow Landslides Using Adaptive-Network-Based Fuzzy Inference System and Particle Swarm Optimization: A Case Study at the Uttarakhand Area, India. In: Tien Bui D. NDA, Bui HB., Hoang ND (ed) International Conference on Geo-Spatial Technologies and Earth Resources. Springer, pp 224–238

  • Pham BT, Prakash I, Bui DT (2017d) Spatial prediction of landslides using hybrid machine learning approach based on Random Subspace and Classification and Regression Trees Geomorphology:1–15

  • Pham BT, Shirzadi A, Bui DT, Prakash I, Dholakia M (2017e) A hybrid machine learning ensemble approach based on a radial basis function neural network and rotation Forest for landslide susceptibility modeling: a case study in the Himalayan area, India International Journal of Sediment Research

  • Pham BT, Son LH, Hoang T-A, Nguyen D-M, Tien Bui D (2018a) Prediction of shear strength of soft soil using machine learning methods. Catena 166:181–191

  • Pham BT, Tien Bui D, Pham HV, Le HQ, Prakash I, Dholakia MB (2016b) Landslide hazard assessment using random SubSpace fuzzy rules based classifier ensemble and probability analysis of rainfall data: a case study at mu Cang Chai District, Yen Bai Province (Viet Nam) journal of the Indian Society of Remote Sensing:1-11 https://doi.org/10.1007/s12524-016-0620-3

  • Pham BT, Tien Bui D, Prakash I (2018b) Bagging based Support Vector Machines for spatial prediction of landslides. Enviro Earth Sci 77:146

  • Pham BT, Tien Bui D, Prakash I, Dholakia MB (2017f) Hybrid integration of multilayer perceptron neural networks and machine learning ensembles for landslide susceptibility assessment at Himalayan area (India) using GIS. CATENA 149(Part 1):52–63. https://doi.org/10.1016/j.catena.2016.09.007

    Article  Google Scholar 

  • Pingale SM, Chandra H, Sharma H, Mishra SS (2012) Morphometric analysis of Maun watershed in Tehri-Garhwal district of Uttarakhand using GIS. Int J Geomat Geosci 3:373–387

    Article  Google Scholar 

  • Pourghasemi H, Moradi H, Aghda SF (2013) Landslide susceptibility mapping by binary logistic regression, analytical hierarchy process, and statistical index models and assessment of their performances. Nat Hazards 69:749–779

    Article  Google Scholar 

  • Pradhan B (2013) A comparative study on the predictive ability of the decision tree, support vector machine and neuro-fuzzy models in landslide susceptibility mapping using GIS. Comput Geosci 51:350–365

    Article  Google Scholar 

  • Ray RL, Jacobs JM (2007) Relationships among remotely sensed soil moisture, precipitation and landslide events. Nat Hazards 43:211–222

    Article  Google Scholar 

  • Rodriguez JJ, Kuncheva LI, Alonso CJ (2006) Rotation forest: a new classifier ensemble method. IEEE Trans Patt Anal Mach Intell 28:1619–1630

    Article  Google Scholar 

  • Samui P (2008) Slope stability analysis: a support vector machine approach. Environ Geol 56:255

    Article  Google Scholar 

  • Shibata R (1981) An optimal selection of regression variables Biometrika:45–54

  • Shirzadi A et al (2017) Shallow landslide susceptibility assessment using a novel hybrid intelligence approach. Environ Earth Sci 76:60

    Article  Google Scholar 

  • Tien Bui D, Ho T-C, Pradhan B, Pham B-T, Nhu V-H, Revhaug I (2016a) GIS-based modeling of rainfall-induced landslides using data mining-based functional trees classifier with AdaBoost, Bagging, and MultiBoost ensemble frameworks. Environ Earth Sci 75:1–22. https://doi.org/10.1007/s12665-016-5919-4

    Article  Google Scholar 

  • Tien Bui D, Hoang N-D (2017) A Bayesian framework based on a Gaussian mixture model and radial-basis-function Fisher discriminant analysis (BayGmmKda V1. 1) for spatial prediction of floods. Geosci Model Dev 10:3391

    Article  Google Scholar 

  • Tien Bui D, Nguyen QP, Hoang N-D, Klempe H (2016b) A novel fuzzy k-nearest neighbor inference model with differential evolution for spatial prediction of rainfall-induced shallow landslides in a tropical hilly area using gis Landslides:1–17

  • Tien Bui D, Pham BT, Nguyen QP, Hoang N-D (2016c) Spatial prediction of rainfall-induced shallow landslides using hybrid integration approach of Least-Squares Support Vector Machines and differential evolution optimization: a case study in Central Vietnam. Int J Digit Earth 9:1–21. https://doi.org/10.1080/17538947.2016.1169561

    Article  Google Scholar 

  • Tien Bui D, Pradhan B, Lofman O, Revhaug I, Dick OB (2012) Landslide susceptibility mapping at Hoa Binh province (Vietnam) using an adaptive neuro-fuzzy inference system and GIS. Comput Geosci 45:199–211

    Article  Google Scholar 

  • Tien Bui D, Tuan TA, Klempe H, Pradhan B, Revhaug I (2016d) Spatial prediction models for shallow landslide hazards: a comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree. Landslides 13:361–378. https://doi.org/10.1007/s10346-015-0557-6

    Article  Google Scholar 

  • Tsangaratos P, Ilia I (2016a) Comparison of a logistic regression and Naïve Bayes classifier in landslide susceptibility assessments: The influence of models complexity and training dataset size. CATENA 145:164–179

    Article  Google Scholar 

  • Tsangaratos P, Ilia I (2016b) Landslide susceptibility mapping using a modified decision tree classifier in the Xanthi perfection, Greece. Landslides 13:305–320

    Article  Google Scholar 

  • Vapnik V (1995) The nature of statistical learning theory. Ed. by Springer. Springer-Verlag, Verlag

    Book  Google Scholar 

  • Varnes DJ (1984) Landslide hazard zonation: a review of principles and practice, vol 3. UNESCO, Paris

    Google Scholar 

  • Webb GI (2000) Multiboosting: a technique for combining boosting and wagging. Mach Learn 40:159–196

    Article  Google Scholar 

  • Zhang C, Ma Y (2012) Ensemble machine learning. Springer

  • Zhang W, Goh AT (2016a) Evaluating seismic liquefaction potential using multivariate adaptive regression splines and logistic regression. Geomech Eng 10:269–280

    Article  Google Scholar 

  • Zhang W, Goh AT (2016b) Multivariate adaptive regression splines and neural network models for prediction of pile drivability. Geosci Front 7:45–52

    Article  Google Scholar 

  • Zhang W, Goh ATC (2013) Multivariate adaptive regression splines for analysis of geotechnical engineering systems. Comput Geotech 48:82–95

    Article  Google Scholar 

Download references

Acknowledgements

We thank to the Director of BISAG, Government of Gujarat, Gandhinagar, Gujarat, India, for helping and supporting this study.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Binh Thai Pham.

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Pham, B.T., Jaafari, A., Prakash, I. et al. A novel hybrid intelligent model of support vector machines and the MultiBoost ensemble for landslide susceptibility modeling. Bull Eng Geol Environ 78, 2865–2886 (2019). https://doi.org/10.1007/s10064-018-1281-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10064-018-1281-y

Keywords

  • Landslide susceptibility mapping
  • Machine learning
  • Ensembles
  • MultiBoost
  • SVM
  • GIS
  • India