Abstract
The main aim of this study is to propose a novel hybrid intelligent model named MBSVM which is an integration of the MultiBoost ensemble and a support vector machine (SVM) for modeling of susceptibility of landslides in the Uttarakhand State, Northern India. Firstly, a geospatial database for the study area was prepared, which includes 391 historical landslides and 16 landslide-affecting factors. Then, the sensitivity of different combinations of these factors for modeling was validated using the forward elimination technique. The MBSVM landslide model was built using the datasets generated from the best selected factors and validated utilizing the area under the receiver operating characteristic (ROC) curve (AUC), statistical indexes, and the Wilcoxon signed-rank test. Results show that this novel hybrid model has good performance both in terms of goodness of fit with the training dataset (AUC = 0.972) and the capability to predict landslides with the testing dataset (AUC = 0.966). The efficiency of the proposed model was then validated by comparison with logistic regression (LR), a single SVM, and another hybrid model of the AdaBoost ensemble and an SVM (ABSVM). Comparison results show that the MBSVM outperforms the LR, single SVM, and hybrid ABSVM models. Thus, the proposed model is a promising and good alternative tool for landslide hazard assessment in landslide-prone areas.
Similar content being viewed by others
References
Abdulwahid WM, Pradhan B (2016) Landslide vulnerability and risk assessment for multi-hazard scenarios using airborne laser scanning data (LiDAR) Landslides:1–20
Althuwaynee OF, Pradhan B, Park H-J, Lee JH (2014) A novel ensemble bivariate statistical evidential belief function with knowledge-based analytical hierarchy process and multivariate statistical logistic regression for landslide susceptibility mapping. Catena 114:21–36
Antronico L, Borrelli L, Coscarelli R, Gullà G (2015) Time evolution of landslide damages to buildings: the case study of Lungro (Calabria, southern Italy). Bull Eng Geol Environ 74:47–59
Antronico L, Borrelli L, Coscarelli R, Pasqua A, Petrucci O, Gullà G (2013) Slope movements induced by rainfalls damaging an urban area: the Catanzaro case study (Calabria, southern Italy). Landslides 10:801–814
Araújo MB, New M (2007) Ensemble forecasting of species distributions trends in ecology & evolution 22:42-47
Breiman L (1996) Bagging predictors. Mach Learn 24:123–140
Breiman L (2001) Random forests. Mach Learn 45:5–32
Budimir M, Atkinson P, Lewis H (2015) A systematic review of landslide probability mapping using logistic regression. Landslides 12:419–436
Chen H, Lee C (2003) A dynamic model for rainfall-induced landslides on natural slopes. Geomorphology 51:269–288
Chen W, Pourghasemi HR, Kornejady A, Zhang N (2017a) Landslide spatial modeling: introducing new ensembles of ANN, MaxEnt, and SVM machine learning techniques. Geoderma 305:314–327
Chen W, Pourghasemi HR, Naghibi SA (2017b) A comparative study of landslide susceptibility maps produced using support vector machine with different kernel functions and entropy data mining models in China. Bull Eng Geol Environ:1–18
Chung C-JF, Fabbri AG (1999) Probabilistic prediction models for landslide hazard mapping. Photogrammetric Eng Remote Sens 65:1389–1399
Chung C-JF, Fabbri AG (2003) Validation of spatial prediction models for landslide hazard mapping. Natural Hazards 30:451–472
Dash M, Liu H (1997) Feature selection for classification. Intell Data Anal 1:131–156
Dickson ME, Perry GL (2016) Identifying the controls on coastal cliff landslides using machine-learning approaches. Environ Modell Soft 76:117–127
Dietterich TG (1998) Approximate statistical tests for comparing supervised classification learning algorithms. Neural Comput 10:1895–1923
Ding Q, Chen W, Hong H (2017) Application of frequency ratio, weights of evidence and evidential belief function models in landslide susceptibility mapping. Geocarto Int 32:619–639
Erdal HI, Karakurt O (2013) Advancing monthly streamflow prediction accuracy of CART models using ensemble learning paradigms. J Hydrol 477:119–128
Fell R, Corominas J, Bonnard C, Cascini L, Leroi E, Savage W, on behalf of the JTC-1 Joint Technical Committee on Landslides and Engineered Slopes (2008) Guidelines for landslide susceptibility, hazard and risk zoning for land use planning. Eng Geol 102:85–98
Freund Y, Schapire RE (1995) A desicion-theoretic generalization of on-line learning and an application to boosting. In: European conference on computational learning theory. Springer, pp 23–37
Freund Y, Schapire RE (1997) A decision-theoretic generalization of on-line learning and an application to boosting. J Comput Syst Sci 55:119–139
Frye C (2007) About the Geometrical Interval Classification Method. http://blogs.esri.com/esri/arcgis
Glade T, Anderson MG, Crozier MJ (2006) Landslide hazard and risk. John Wiley & Sons
Goh AT, Zhang Y, Zhang R, Zhang W, Xiao Y (2017) Evaluating stability of underground entry-type excavations using multivariate adaptive regression splines and logistic regression. Tunn Underground Space Technol 70:148–154
Gorsevski PV, Brown MK, Panter K, Onasch CM, Simic A, Snyder J (2016) Landslide detection and susceptibility mapping using LiDAR and an artificial neural network approach: a case study in the Cuyahoga Valley National Park. Ohio Landslides 13:467–484
Gullà G, Peduto D, Borrelli L, Antronico L, Fornaro G (2017) Geometric and kinematic characterization of landslides affecting urban areas: the Lungro case study (Calabria, southern Italy). Landslides 14:171–188
Guzzetti F, Reichenbach P, Ardizzone F, Cardinali M, Galli M (2006) Estimating the quality of landslide susceptibility models. Geomorphology 81:166–184
Hájek P, Olej V (2015) Word Categorization of Corporate Annual Reports for Bankruptcy Prediction by Machine Learning Methods. In: International Conference on Text, Speech, and Dialogue. Springer, pp 122–130
Hájek P, Olej V, Myšková R (2015) Predicting financial distress of banks using random subspace ensembles of support vector machines. In: Artificial Intelligence Perspectives and Applications. Springer, pp 131–140
Hattermann F et al (2017) Cross-scale intercomparison of climate change impacts simulated by regional and global hydrological models in eleven large river basins. Clim Change 141:561–576
Hoang N-D, Tien Bui D (2018) Predicting earthquake-induced soil liquefaction based on a hybridization of kernel Fisher discriminant analysis and a least squares support vector machine: a multi-dataset study. Bull Eng Geol Environ 77:191–204
Hoehler FK (2000) Bias and prevalence effects on kappa viewed in terms of sensitivity and specificity. J Clin Epidemiol 53:499–503
Hong H et al (2018) Landslide susceptibility mapping using J48 decision tree with AdaBoost, Bagging and Rotation Forest ensembles in the Guangchang area (China). CATENA 163:399–413
Hong H et al (2017) A novel hybrid integration model using support vector machines and random subspace for weather-triggered landslide susceptibility assessment in the Wuning area (China). Environ Earth Sci 76:652
Hosmer DW, Lemeshow S (2000) Interpretation of the fitted logistic regression model Applied Logistic Regression, Second Edition:47–90
Huang J, Wu P, Zhao X (2013) Effects of rainfall intensity, underlying surface and slope gradient on soil infiltration under simulated rainfall experiments. Catena 104:93–102
Hyndman RJ, Koehler AB (2006) Another look at measures of forecast accuracy. Int J Forecast 22:679–688
Ilia I, Tsangaratos P (2016) Applying weight of evidence method and sensitivity analysis to produce a landslide susceptibility map. Landslides 13:379–397
Jaafari A, Najafi A, Pourghasemi H, Rezaeian J, Sattarian A (2014) GIS-based frequency ratio and index of entropy models for landslide susceptibility assessment in the Caspian forest, northern Iran. Int J Environ Sci Technol 11:909–926
Jaafari A, Najafi A, Rezaeian J, Sattarian A, Ghajar I (2015) Planning road networks in landslide-prone areas: a case study from the northern forests of Iran. Land Use Policy 47:198–208
Jaafari A, Rezaeian J, Omrani MSO (2017) Spatial prediction of slope failures in support of forestry operations safety. Croatian J Forest Eng 38:107–118
Jelinek H, Abawajy J, Kelarev A, Chowdhury M, Stranieri A (2014) Decision trees and multi-level ensemble classifiers for neurological diagnostics. Austral J Med Sci 1:1–12
Kelarev A, Dazeley R, Stranieri A, Yearwood J, Jelinek H (2012) Detection of CAN by ensemble classifiers based on ripple down rules. In: Pacific Rim Knowledge Acquisition Workshop. Springer, pp 147–159
Khosravi K et al (2018) A comparative assessment of decision trees algorithms for flash flood susceptibility modeling at Haraz watershed, northern Iran. Sci Total Environ 627:744–755
Komori O, Eguchi S, Ikeda S, Okamura H, Ichinokawa M, Nakayama S (2016) An asymmetric logistic regression model for ecological data. Methods Ecol Evol 7:249–260
Kotti M, Benetos E, Kotropoulos C, Pitas I (2007) A neural network approach to audio-assisted movie dialogue detection. Neurocomputing 71:157–166
Kumar* A, Mittal H, Kumar R, Ahluwalia RS (2017) Empirical Attenuation relationship for Peak Ground Horizontal Acceleration for North-East Himalaya. Vietnam J Earth Sci 39(1):47-57
Landis JR, Koch GG (1977) The measurement of observer agreement for categorical data biometrics:159–174
Lau NN (2018) Determination of ground displacement of 25 April 2015 Nepal earthquake by GNSS precise point positioning Vietnam J Earth Sci 40:17–25
Margottini C, Canuti P, Sassa K (2013) Landslide science and practice vol 1. Springer
Marjanović M, Kovačević M, Bajat B, Voženílek V (2011) Landslide susceptibility assessment using SVM machine learning algorithm. Eng Geol 123:225–234
Meinhardt M, Fink M, Tünschel H (2015) Landslide susceptibility analysis in central Vietnam based on an incomplete landslide inventory: Comparison of a new method to calculate weighting factors by means of bivariate statistics) Geomorphology 234:80–97
Moosavi V, Niazi Y (2016) Development of hybrid wavelet packet-statistical models (WP-SM) for landslide susceptibility mapping. Landslides 13:97–114
Neuhäuser B, Terhorst B (2007) Landslide susceptibility assessment using “weights-of-evidence” applied to a study area at the Jurassic escarpment (SW-Germany). Geomorphology 86:12–24
Parhami B (2006) Introduction to parallel processing: algorithms and architectures. Springer Science & Business Media
Pham BT, Bui DT, Prakash I (2017a) Landslide Susceptibility Assessment Using Bagging Ensemble Based Alternating Decision Trees, Logistic Regression and J48 Decision Trees Methods: A Comparative Study Geotechnical and Geological Engineering:1–15
Pham BT, Khosravi K, Prakash I (2017b) Application and comparison of decision tree-based machine learning methods in landside susceptibility assessment at Pauri Garhwal area, Uttarakhand, India Environmental Processes:1–20
Pham BT, Nguyen V-T, Ngo V-L, Trinh PT, Ngo HTT, Bui DT (2017c) A Novel Hybrid Model of Rotation Forest Based Functional Trees for Landslide Susceptibility Mapping: A Case Study at Kon Tum Province, Vietnam. In: Tien Bui D. NDA, Bui HB., Hoang ND (ed) International Conference on Geo-Spatial Technologies and Earth Resources. Springer, pp 186–201
Pham BT, Pradhan B, Tien Bui D, Prakash I, Dholakia MB (2016a) A comparative study of different machine learning methods for landslide susceptibility assessment: a case study of Uttarakhand area (India). Environ Model Software 84:240–250. https://doi.org/10.1016/j.envsoft.2016.07.005
Pham BT, Prakash I (2017a) Evaluation and Comparison of LogitBoost Ensemble, Fisher’s Linear Discriminant Analysis, Logistic Regression, and Support Vector Machines Methods for Landslide Susceptibility Mapping Geocarto International:1–32
Pham BT, Prakash I (2017b) A Novel Hybrid Intelligent Approach of Random Subspace Ensemble and Reduced Error Pruning Trees for Landslide Susceptibility Modeling: A Case Study at Mu Cang Chai District, Yen Bai Province, Viet Nam. In: Tien Bui D. NDA, Bui HB., Hoang ND (ed) International Conference on Geo-Spatial Technologies and Earth Resources. Springer, pp 255–269
Pham BT, Prakash I (2017c) A novel hybrid model of Bagging-based Naïve Bayes Trees for landslide susceptibility assessment. Bulletin of Engineering Geology and the Environment:1–15
Pham BT, Prakash I (2017d) Spatial Prediction of Rainfall Induced Shallow Landslides Using Adaptive-Network-Based Fuzzy Inference System and Particle Swarm Optimization: A Case Study at the Uttarakhand Area, India. In: Tien Bui D. NDA, Bui HB., Hoang ND (ed) International Conference on Geo-Spatial Technologies and Earth Resources. Springer, pp 224–238
Pham BT, Prakash I, Bui DT (2017d) Spatial prediction of landslides using hybrid machine learning approach based on Random Subspace and Classification and Regression Trees Geomorphology:1–15
Pham BT, Shirzadi A, Bui DT, Prakash I, Dholakia M (2017e) A hybrid machine learning ensemble approach based on a radial basis function neural network and rotation Forest for landslide susceptibility modeling: a case study in the Himalayan area, India International Journal of Sediment Research
Pham BT, Son LH, Hoang T-A, Nguyen D-M, Tien Bui D (2018a) Prediction of shear strength of soft soil using machine learning methods. Catena 166:181–191
Pham BT, Tien Bui D, Pham HV, Le HQ, Prakash I, Dholakia MB (2016b) Landslide hazard assessment using random SubSpace fuzzy rules based classifier ensemble and probability analysis of rainfall data: a case study at mu Cang Chai District, Yen Bai Province (Viet Nam) journal of the Indian Society of Remote Sensing:1-11 https://doi.org/10.1007/s12524-016-0620-3
Pham BT, Tien Bui D, Prakash I (2018b) Bagging based Support Vector Machines for spatial prediction of landslides. Enviro Earth Sci 77:146
Pham BT, Tien Bui D, Prakash I, Dholakia MB (2017f) Hybrid integration of multilayer perceptron neural networks and machine learning ensembles for landslide susceptibility assessment at Himalayan area (India) using GIS. CATENA 149(Part 1):52–63. https://doi.org/10.1016/j.catena.2016.09.007
Pingale SM, Chandra H, Sharma H, Mishra SS (2012) Morphometric analysis of Maun watershed in Tehri-Garhwal district of Uttarakhand using GIS. Int J Geomat Geosci 3:373–387
Pourghasemi H, Moradi H, Aghda SF (2013) Landslide susceptibility mapping by binary logistic regression, analytical hierarchy process, and statistical index models and assessment of their performances. Nat Hazards 69:749–779
Pradhan B (2013) A comparative study on the predictive ability of the decision tree, support vector machine and neuro-fuzzy models in landslide susceptibility mapping using GIS. Comput Geosci 51:350–365
Ray RL, Jacobs JM (2007) Relationships among remotely sensed soil moisture, precipitation and landslide events. Nat Hazards 43:211–222
Rodriguez JJ, Kuncheva LI, Alonso CJ (2006) Rotation forest: a new classifier ensemble method. IEEE Trans Patt Anal Mach Intell 28:1619–1630
Samui P (2008) Slope stability analysis: a support vector machine approach. Environ Geol 56:255
Shibata R (1981) An optimal selection of regression variables Biometrika:45–54
Shirzadi A et al (2017) Shallow landslide susceptibility assessment using a novel hybrid intelligence approach. Environ Earth Sci 76:60
Tien Bui D, Ho T-C, Pradhan B, Pham B-T, Nhu V-H, Revhaug I (2016a) GIS-based modeling of rainfall-induced landslides using data mining-based functional trees classifier with AdaBoost, Bagging, and MultiBoost ensemble frameworks. Environ Earth Sci 75:1–22. https://doi.org/10.1007/s12665-016-5919-4
Tien Bui D, Hoang N-D (2017) A Bayesian framework based on a Gaussian mixture model and radial-basis-function Fisher discriminant analysis (BayGmmKda V1. 1) for spatial prediction of floods. Geosci Model Dev 10:3391
Tien Bui D, Nguyen QP, Hoang N-D, Klempe H (2016b) A novel fuzzy k-nearest neighbor inference model with differential evolution for spatial prediction of rainfall-induced shallow landslides in a tropical hilly area using gis Landslides:1–17
Tien Bui D, Pham BT, Nguyen QP, Hoang N-D (2016c) Spatial prediction of rainfall-induced shallow landslides using hybrid integration approach of Least-Squares Support Vector Machines and differential evolution optimization: a case study in Central Vietnam. Int J Digit Earth 9:1–21. https://doi.org/10.1080/17538947.2016.1169561
Tien Bui D, Pradhan B, Lofman O, Revhaug I, Dick OB (2012) Landslide susceptibility mapping at Hoa Binh province (Vietnam) using an adaptive neuro-fuzzy inference system and GIS. Comput Geosci 45:199–211
Tien Bui D, Tuan TA, Klempe H, Pradhan B, Revhaug I (2016d) Spatial prediction models for shallow landslide hazards: a comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree. Landslides 13:361–378. https://doi.org/10.1007/s10346-015-0557-6
Tsangaratos P, Ilia I (2016a) Comparison of a logistic regression and Naïve Bayes classifier in landslide susceptibility assessments: The influence of models complexity and training dataset size. CATENA 145:164–179
Tsangaratos P, Ilia I (2016b) Landslide susceptibility mapping using a modified decision tree classifier in the Xanthi perfection, Greece. Landslides 13:305–320
Vapnik V (1995) The nature of statistical learning theory. Ed. by Springer. Springer-Verlag, Verlag
Varnes DJ (1984) Landslide hazard zonation: a review of principles and practice, vol 3. UNESCO, Paris
Webb GI (2000) Multiboosting: a technique for combining boosting and wagging. Mach Learn 40:159–196
Zhang C, Ma Y (2012) Ensemble machine learning. Springer
Zhang W, Goh AT (2016a) Evaluating seismic liquefaction potential using multivariate adaptive regression splines and logistic regression. Geomech Eng 10:269–280
Zhang W, Goh AT (2016b) Multivariate adaptive regression splines and neural network models for prediction of pile drivability. Geosci Front 7:45–52
Zhang W, Goh ATC (2013) Multivariate adaptive regression splines for analysis of geotechnical engineering systems. Comput Geotech 48:82–95
Acknowledgements
We thank to the Director of BISAG, Government of Gujarat, Gandhinagar, Gujarat, India, for helping and supporting this study.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Pham, B.T., Jaafari, A., Prakash, I. et al. A novel hybrid intelligent model of support vector machines and the MultiBoost ensemble for landslide susceptibility modeling. Bull Eng Geol Environ 78, 2865–2886 (2019). https://doi.org/10.1007/s10064-018-1281-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10064-018-1281-y