Abstract
A landslide susceptibility analysis has been made in the Río Aguas catchment (Almeria, Southeast Spain), using two statistical models, Maximum Entropy (MaxEnt) and Geographically Weighted Logistic Regression (GWLR). For this purpose, a previous landslide inventory has been used and re-elaborated, reaching a total incidence of 2.58% of the whole area. Different types of movements have been distinguished, being rock falls, slides and complex movements the predominant. From the inventory, the centroid of the rupture zone has been extracted to represent the landslides introduced in the models. A previous factor analysis has been made, using 12 predictors related to morphometry, hydrography, geology and land cover, with 5 m grid spacing, allowing the selection of factors to be used in the analysis and discarding those showing correlation between them. Then, MaxEnt and GWLR models are applied using different distributions of training and testing samples from the landslide inventory. For the validation, the Area Under the Curve of the Receiver Operating Characteristic (AUC-ROC) has been used but additionally, the degree of fit (DF) has allowed to validate the rupture zones themselves, not only the centroids. Results show an excellent prediction with both metrics in all the methods and samples, but the better results are obtained in the GWLR method for AUC and in the MaxEnt for the degree of fit. Therefore, a consensus model of both methods has been obtained, that improves even more the results reaching an AUC value of 0.99 and a degree of fit of 90%.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Susceptibility to landslides is defined as the probability that a risk phenomenon will occur in a specific area and an undetermined future based on the correlation of conditioning factors with the distribution of past events (Brabb 1984; Fell et al. 2008). Thus, in studies of landslides susceptibilty, only the spatial variable of the probability of occurrence of landslides is taken into account without the involvement of the temporal variable. Both variables, together with the magnitude of the landslide phenomena, constitute the hazard (Guzzetti et al. 2005) and should be the starting point for the risk assessment analyses (Brabb 1991; Fell et al. 2008; Remondo et al. 2008; van Westen et al. 2008). The evaluation and mapping of susceptibility through the spatial zoning of areas prone to movements, that is, areas where landslides are likely, even without their temporal variable, are very useful for the prevention and forecast of risks (Einstein 1988).
Since the first attempts to assess and determine susceptibility in the 1960s (Macau 1963; Milliès-Lacroix 1968) through to the present, a multitude of methods have been used depending on the extension of the study areas and the approaches adopted (Boussouf 1995; Aleotti & Chowdhury 1999; Van Westen 2000; Chacón et al. 2006; Reichenbach et al. 2018) with the later providing a detailed overview and review of approaches frequently adopted. These methods can be grouped into two large groups: (1) qualitative methods and (2) quantitative methods.
The qualitative methods (Carrara & Merenda 1974; Stevenson 1977) are the direct result of analysing the landslide inventory or the geomorphological and geoenvironmental conditions of the study region. The susceptibility zoning depends on experience and the direct appreciation by the author of the analysis, which can hinder the reproducibility of the results from one author to another, and even the comparisons of these results. In short, while these methods are based on understanding the geological and geomorphological ground conditions of a project area and how they relate to landslide activity, and perhaps less on the use of sophisticated statistical methods, the landslide susceptibility results can sometimes be highly subjective.
Quantitative methods make it possible to determine areas of landslide susceptibility using statistical/probabilistic models or deterministic methods (Aleotti & Chowdhury 1999; Van Westen 2000), which increases the objectivity of the methods compared to qualitative methods, limiting, although not completely eliminating the degree of freedom left to the author of the study. This can allow for the identification, through statistical analysis, of relationships between different factors and processes (or combinations of factors and processes) that might be influencing landslide behaviour. However, the results should always be checked and challenged to ensure that any statistical relationships being identified in the data fit with an understanding of the ground conditions and slope processes acting across a chosen study area. This also includes checking the quality of the data being used in the analysis––both the factor mapping and the landslide inventory data––as well as how it is being used.
Statistical models of landslide susceptibility are based on cross-correlation analysis and zonal statistics between the dependent variable (landslides) and the independent variables represented by the conditioning factors of instability (geoenvironmental variables) to determine the areas prone to landslides instability. There is a wealth of statistical methods for assessing and mapping susceptibility to landslides. According to Reichenbach et al. 2018, statistical methods can be grouped into classic statistical methods such as linear regression, logistic regression or discriminant analysis (Carrara 1988; Chung et al. 1995; Baeza & Corominas 2001; Guzzetti et al. 1999, 2005); methods based on indices such as the weight of evidence or matrices (DeGraff & Romesburg 1980; Bonham-Carter et al. 1990; Irigaray et al. 1999, 2007; Fernández et al. 2003); multi-criteria evaluation (Irigaray et al. 1996; Castellanos Abella & Van Westen 2008; Gorsevski & Jankowski 2010); machine learning methods such as decision trees, random forest or support vector machines (Ballabio & Sterlacchini 2012; Goetz et al. 2015; Hong et al. 2015); deep learning methods such as neural networks (Lee et al. 2004; Melchiorre et al. 2008; Pradhan & Lee 2010; Yilmaz 2010; Zare et al. 2013) and other methods.
In relation to these methods, there is an evolution in the use of classification methods from simpler classical linear models towards models that reduce their limitations, such as generalized linear models (GLM) and generalized additive models (GAM). This leads to advanced models that use machine learning (ML) spatial statistics, such as geostatistics or global spatial regression models and geographically weighted regression models (GWR) (Florez García & Pérez Castillo 2019). The search to capture nonlinear relationships and local variations between the dependent variable and the independent variables with the availability of statistical data analysis software and geographic information systems (GIS) have contributed to this evolution (Van Westen et al. 2000).
In southern Spain (Andalusia), the areas of greatest landslide susceptibility are located in the whole of the Betic Cordilleras (Pita et al. 1999; Chacón et al. 2006; Irigaray et al. 2007). This is due to a landscape dominated by many of conditioning factors widely attributed to landslide activity: steep slopes, a geology prone to landslide activity, a regime dominated by very dry periods punctuated by extremely high-intensity rainstorms and a scarce to very scarce vegetation cover as a consequence of regional climate and also deforestation. More specifically, in southeastern Spain, several research studies on landslides have been carried out (Fernández et al. 1994, 2003; Alcantara-Ayala & Thornes 1996; El Hamdouni et al. 2008; Jiménez-Perálvarez et al. 2011). In the province of Almería, however, although many studies have looked at the causes and distribution of landslide in the area (e.g. Ferre 1997; Griffiths et al. 2002; Hart 2004; Geach et al. 2017), there are currently no published examples of landslide susceptibility mapping using statistical methods despite Almería being one of the provinces most affected by landslide activity in the Andalusia region (Pita et al. 1999).
As an objective of the research presented here, the results of a statistically based approach to landslide susceptibility mapping of the Río Aguas catchment within the province of Almería are presented. This includes using statistical models based on maximum entropy (MaxEnt) and geographically weighted logistic regression (GWLR). This last model is based on logistic regression (LR) but also allows the authors to capture the local spatial variations between the variables. Previous studies showed the good performance of the MaxEnt and LR models in comparison with other models to perform landslide susceptibility mapping (Felicísimo et al. 2013; Hong et al. 2016). Likewise, a comparison of the results of both models is established and landslide susceptibility mapping of the hydrographic basin is proposed, thus contributing to the knowledge of the processes of slope instability in the area.
2 Materials and methods
2.1 The study area
The study area is located in the southeast of Spain and belongs to the province of Almería (Fig. 1), it covers the Río Aguas catchment with a surface area of 541.32 km2. The climate is arid with a mean annual precipitation of less than 210 mm (Esteban‐Parra et al. 1998), torrential and highly irregular; the average temperature is 17° according data from the Environmental Information Network of Andalusia of the Andalusian Regional Government (REDIAM 2022).
The hydrographic basin of the Río Aguas drains the Neogene intramountain depressions of Sorbas and a small part of Vera. It has an average altitude and slope of 454 m and 14°, respectively, and is bounded by the Sierra de Los Filabres to the north, where it reaches its maximum altitude (1353 m), the Sierras Alhamilla and Cabrera to the south, the dividing line with the Tabernas basin to the west and the Mediterranean Sea to the east, where the Río Aguas flows into.
The Sorbas Neogene Basin is a sector of the Tabernas–Sorbas Basin (Ott d’Estevou and Montenat 1990) that is arranged on a metamorphic basement of the Alpujárride and Nevado–Filábride complexes of the Internal Zones of the Eastern Betic Cordillera (Fig. 2). It is an elongated depression about 40 km whose morphology is conditioned by folding with kilometric axes of ENE-WSW orientation (Weijermars et al. 1985); from north to south the antiform of Sierra de Los Filabres, the synclines of Tabernas-Sorbas and the antiform of Sierra Alhamilla and Sierra Cabrera can be distinguished.
The post-orogenic filling of the Sorbas Basin is made up of sedimentary formations from the Middle Miocene to the Quaternary, which rest in unconformity on the metamorphic Paleozoic and carbonate Mesozoic rocks of the Betic substratum that make up the mountains (Sierras) that delimit the basin in its southern and northern part. These basin deposits show great variability both in surface extension and in thickness, giving rise to several stratigraphic units separated by unconformities (Dabrio & Polo 1995; Martín & Braga 1994). It is worth highlighting the presence of levels of Messinian gypsum that reach up to 40 m thick and are intensely karstified in the Sorbas outcrop -The Karst of Sorbas- (Pulido-Bosch 1986; Calaforra & Pulido-Bosch 1997).
With continued tectonic uplift and sea level fluctuations linked to Quaternary climate changes, sedimentation across the area moved from marine to terrestrial conditions, leading to the formation of a proto-drainage system. Mapping of river terraces across the region has shown that the Sorbas Basin initially drained southwards through a low between the Sierras Cabrera and Alhamilla, before being beheaded by a rapidly headward eroding, eastward-flowing Vera Basin drainage approximately 70 ka (Harvey & Wells 1987; Candy et al. 2005). The landscape response to this river capture event, which increased the catchment area by 50% and resulted in a 90 m base-level difference at the capture site (Harvey & Wells 1987), was the propagation of a wave of fluvial incision-erosion upstream into the captured network (Mather 2000; Mather et al. 2002; Stokes et al. 2002). At present, the upstream limit of this capture-related wave of incision decays approximately 20 km from the capture site (Mather et al. 2002; Stokes et al. 2002) with the rate and style of landscape response being controlled by the bedrock geology into which the fluvial erosion and incision are occurring (Griffiths & Stokes 2008). A key part of the landscape response and rapid incision of the drainage network within this part of the Rio Aguas catchment area (predominantly upstream but also downstream of the capture site) has been the formation of oversteepened and unstable slopes, resulting in landslides (Griffiths, et al. 2002; Hart 2004). Landslides are also seen within the steep mountainous terrain of the area, again, a response to the relatively rapid tectonic uplift of the region and incision of the drainage network.
2.2 Landslide inventory
To Hart’s inventory of 2004, which reached 315 movements, another 254 have been added, which amounts to 569 movements (Fig. 3). The area affected by the landslides is 13.96 km2, which represents 2.58% of the surface of the Río Aguas catchment/basin. For the inventory, photointerpretation with the stereoscopic vision of multitemporal and multiscale orthophotos from Google Earth and with support from the Iberpix 4 stereo web of the National Geographic Institute (IGN) was applied. The use of this approach with images available on web map servers for the elaboration of landslide inventories has already been addressed in various studies (Costanzo et al. 2012; Eeckhaut et al. 2012; Zhang et al. 2015; Lombardo et al. 2016a). Based on this procedure and on the table of coordinates of the movements by Hart (2004), the corresponding polygons have been digitized.
61.86% of movements are classified as rock fall type (representing 44.22% of the area), 29.70% slides (29.44% of the area), 6.50% complex movements (24.91% of the area) and 1.93% debris flow (1.43% of the area). Apart from the recorded debris flows, there is a great shortage of flows in the study area, which has already been verified in previous studies in areas with similar characteristics (Jiménez-Perálvarez et al. 2011). This fact could be related to the ephemeral nature and reduced size of these movements, which is consistent with semi-arid environments in which the movements originate after torrential rains with a return period of several years. After a certain period of time, the traces of the ephemeral flows, especially the smaller ones, disappear from the landscape.
The mean surface of the movement is 2.45 ha, the minimum is 0.03 ha and the maximum is 49.56 ha. 74.34% of these movements (423) have a surface less than or equal to the average, with 44.64% of the total being less than one ha. The distribution of movements and affected areas, as well as the average surface according to the landslide type, is shown in Table 1.
In addition to the four main sectors in which the movements described by Griffiths et al. (2005) are located, near the towns of Sorbas, Cariatiz (Los Alías), Góchar and along a section of the Jauto River, the new inventory shows a fifth zone of landsliding to the south of Sorbas (between Sorbas and Los Risas) (Fig. 4) and a sixth zone in the Sierra Cabrera to the southwest of La Alcantarilla, perhaps reflecting the higher resolution of the imagery being used in the landslide mapping. In other locations, landslides concentrate along with normal contacts between competent and underlying incompetent weaker materials (e.g. gypsum overlying calcareous mudstones).
The movements in the vicinity of Sorbas and Góchar are the smallest (an average of 0.88 ha), those of the Jauto River and Sierra Cabrera reach an average of 2.8 ha and those with the largest surface are of the complex type, with an average of 4.6 ha, which are located mainly in the contacts between competent and incompetent materials.
The distribution of movements by lithologies is shown in Fig. 5. Approximately 37% of the total surface affected by movements is located in the metamorphic materials of the Alpujárride and Nevado-Filábride complexes of the Betic Internal Zones and the remaining 63% is located in the post-orogenic deposits of the Sorbas Basin. Among the former, carbonate rocks and phyllites stand out, which in absolute terms reach 18.65 and 7.77% of the landslides area, respectively, and in relative terms 7.55 and 6.25% of the area of each lithological unit. Within the post-orogenic materials, the conglomerates, sands and marl, limestone, marl and gypsum stand out in absolute terms (between 11 and 13%) and in relative terms the conglomerates, sands and marl, and gypsum (percentages close to 10%).
2.3 Methods
Two statistical methods have been used for the evaluation of susceptibility, the maximum entropy (MaxEnt) and the geographically weighted logistic regression (GWLR). Both methods are used for the study of binomial dependent variables (presence/pseudo-absence and presence/absence, respectively) and allow the management of continuous and categorical independent variables or predictors. The modelling has been carried out with software applications, MaxEnt 3.4.4 (Phillips et al. [internet]) and SAGA 7.8.2 for the GWLR (Conrad et al. 2015), also using QGIS 3.10.12, ArcGIS 10.5 and SPSS2.3.1.
2.3.1 Maximum entropy model (MaxEnt)
The first method used is a statistical-probabilistic modelling approach of machine learning based on maximum entropy. For this, the MaxEnt software has been used (Phillips et al. 2006), developed by Steven J. Phillips of AT&T Labs together with Miroslav Dudik and Robert E. Schapire of Princeton University, for the Center for Biodiversity and Conservation of the American Museum of Natural History (AMNH). This approach has been widely used in ecology (distribution of species and modelling of ecological niches); in fact, in just seven years, its use exceeded 1000 publications to predict species distribution (Merow et al. 2013). However, its application for landslide susceptibility models had never been used before 2012, but since then, it has been applied with great success for susceptibility mapping by several authors (Vorpahl et al. 2012; Felicísimo et al. 2013; Pourghasemi et al. 2012a, b; Convertino et al. 2013; O’Banion & Olsen 2014; Kim et al. 2015; Park 2015; Davis & Blesius 2015; Lombardo et al. 2016b; Kornejady et al. 2017; Chen et al. 2017; Kerekes et al. 2018; Jiao et al. 2019; Pandey et al. 2020). MaxEnt is based on a presence-only methodology and can generate correlations between the points of occurrence or presence and the environmental predictor variables. Therefore, it is ideal for analysing the variety of geospatial and geological variables that contribute to landslide risks (O’Banion & Olsen 2014).
Entropy is an indicator that has its origin in information theory (Shannon 1948) and allows us to measure the degree of disorder in a data set. It can be interpreted as the expected value of the information contained in the data (Jaynes 1982; Kleidon et al. 2010) that is needed to describe a pattern or a process. MaxEnt estimates the probability of occurrence of landslides by finding the probability distribution of the maximum entropy, that is, the closest to a uniform distribution throughout the study area and therefore converts it into a less arbitrary distribution to represent a incomplete information, from points of presence and a set of associated geoenvironmental data. Maximizing entropy (Jaynes 1957), MaxEnt detects the variables with the highest information value to predict the patterns, in our case, the landslide distribution pattern. This method has also been reinterpreted as a type of logistic regression based on presence/pseudo-absence (Renner & Warton 2013).
To achieve the adjustment of a distribution function, MaxEnt uses an algorithm iteratively; the adjustment process involves a random-walk in the parameter space (the coefficients for each predictor) with the assignment of a parameter value. Per iteration, the software proposes new parameter values trying to increase the gain up to its asymptote, or up to the number of iterations defined by the user in the advanced parameters tab of MaxEnt. The software seeks to maximize the profit to obtain the best model. The higher the gain, the more discriminant the distribution. The random-walk starts from a uniform probability of occurrence in a geographic space and moves away from this distribution due to data restrictions and continues in successive iterations until the increase in gain falls below a convergence threshold, set by default in MaxEnt but which can be defined by the user (Phillips et al. [Internet]).
2.3.2 Geographically weighted logistic regression model (GWLR)
In addition, a spatial statistical model, the geographically weighted logistic regression (GWLR), has been applied. Local models GWR were first developed in the late twentieth century by Brunsdon et al. (1996) and Fotheringham et al. (1996, 1997, 2002). Unlike global models such as neural networks or logistic regression (LR), GWRs do take into account spatial variations between variables for susceptibility assessments, since some spatial variables show certain trends and not spatial stationarity. The theoretical basis for the GWR is Tobler’s observation that “everything is related to everything else, but near things are more related than distant things” (Tobler 1970).
The GWLR, instead of establishing a single global model for the prediction, is based on moving windows in which the coefficients of each of the predictors vary from one spatial unit to another, using the surrounding samples, giving greater weight when the sample is closer. Its principle, quite simple, consists of estimating local models by least squares, weighting each observation by a decreasing function according to its distance from the estimation point. The distance from which the weight of the observations is considered zero is known as bandwidth. The choice of the bandwidth value has a great influence on the results. The higher the bandwidth value, the greater the number of observations that the kernel will give a nonzero weight. The combination of local models allows the construction of a global model with specific properties (Ardilly et al. 2018).
2.4 Models application
MaxEnt uses as input a list of specific locations with only the presence of species, in our case the set of points corresponding to the centroid of each of the surfaces of the rupture zone or equivalent (Boussouf et al. 1994) of the inventoried landslides. The environmental predictors or conditioning factors used for modelling in raster format have been selected from a broad set of geoenvironmental factors.
Unlike MaxEnt, the GWLR model requires presence and absence data for the dependent variable; in this case, the same number of absence and presence data has been added. The input data, as with MaxEnt, are made up of a point data file of locations (landslide centroids) and a set of predictor layers. Table 2 summarizes the formats of the input and output variables of the two models.
For the GWLR, a bandwidth must also be established. From the model input data, the SAGA software suggests a bandwidth based on the size and resolution of the grid system, but these data can be changed. To set the bandwidth, the Akaike Information Criterion (AICc) (Fotheringham et al. 2002) was also applied in ArcGIS and the highest bandwidth value was chosen, this being the one proposed by default by SAGA.
2.4.1 Predictor factors selection
For the susceptibility mapping of the study area, in addition to the dependent variable, 12 conditioning or predictive factors have been taken into account, which act as independent variables (Fig. 6). These are (1) Altitude; (2) Aspect; (3) Curvature;(4) Distance to faults; (5) The density of first-order streams; (6) Land cover; (7) Lithology; (8) Slope; (9) Distance to the streams; (10) Distance to geological boundaries between competent and incompetent materials; (11) Topographic Position Index (TPI); (12) Terrain Ruggedness Index (TRI).
The morphometric parameters were obtained from the Digital Terrain Model 1st Coverage, with 5 m grid spacing available at the download centre of the National Geographic Institute (IGN), as well as the drainage network. For the other factors, the Geological Map at 1/50,000 scale has been used, MAGNA, acquired from the Portal of the Geological and Mining Institute of Spain (IGME) and the land use was obtained from The CORINE LandCover (CLC) 2018 of European Environment Agency. Once the data are downloaded, the corresponding geoprocesses are applied in ArcGIS to obtain derived raster maps with the same resolution (5 m) as the morphometric parameters and their classifications. In the case of the drainage network, contacts and fractures distance operators are applied around the linear elements; meanwhile for lithology, rasterization is applied.
In reference to the conditioning factors, from a traditional statistical model point of view (Renner & Warton 2013), the use of the whole highly correlated predictors is not recommended for MaxEnt and they should be preselected. Meanwhile, based on machine learning, it is suggested to include all reasonable predictors in the model and let the algorithm decide which ones are important, through regularization.
Nevertheless, a statistical approach was adopted instead of varying the regularization as occurs in machine learning. Thus, to use only one variable from each pair of highly correlated predictors, the multicollinearity test between the independent variables allows selecting the variables to be taken into account for modelling with MaxEnt. This same multicollinearity analysis should also be applied to choose the predictors to use with GWLR since the existence of multicollinearity between predictors reduces the degree of efficiency of the GWLR model.
To determine the degree of correlation between predictors, the Pearson coefficient has been used. For this reason, categorical data (lithology and landcover) have been hierarchized to convert them into scalar data; an approximate classification based on the work of Nicholson & Hencher (1997) was applied to lithology and a ranking based on the relationship between land use as a function of the slope has been used for landcover. Table 3 shows the correlations between the independent variables that are all below the critical level of 0.7 (Clark & Hosking 1986) except for the Slope and TRI variables that present a bivariate correlation (Pearson’s r) of 0.98. Thus, 11 of the 12 predictors have been selected for modelling, excluding the TRI variable.
Regarding the conditions in which landslides occur, lithology has already been described, the more susceptible materials being the carbonate rocks and phyllites of Alpujarride Complex, as well as the post-orogenic conglomerates, sands and marl, limestone, marl and gypsum. This ocurrs particularly where stronger lithologies are ovelrying weaker ones. Among the other factors, landslides show an exponential trend that increases with the slope. Moreover, the movements are distributed in a wide range of height and aspect, although they present a higher concentration at heights close to 400 m and N orientation. In terms of the hydrographic network, the distances to channels of between 50 and 150 m are the most prone for the terrain instability. Meanwhile, the maximum concentration of movements occurs at distances of up to 200–500 m from mapped geological faulting and up to 10 m for geological contacts between competent and underlying incompetent weaker materials. Areas of open space with little or no vegetation were seen as those mostly affected by the mapped landslide distribution, possibly reflecting the nature of land use in the area, as well as the semi-arid to arid climate.
2.4.2 Susceptibility modelling
From the selected conditioning factors and the inventory data, three models were carried out with each method, a first model using Hart’s inventory, 2004 (315 movements, approx. 55%) as training data and our inventory as test or validation data (254 movements, approx. 45%). A second model was developed with the same data, exchanging the training data for the test data and vice versa. Both models have been developed in order to study the influence of the inventory on the modelling. A third model was elaborated as the result of the average of ten models obtained by bootstrapping, joining the two inventories and using randomly 60% of movements as training and the remaining 40% as tests in each of the ten executions. The same parameters were applied for both MaxEnt and GWLR. In the case of MaxEnt, it should be noted that its application allows producing three types of results in its predictions: without format (raw), cumulative and logistic. The susceptibility maps generated in this study by MaxEnt are of the cumulative type.
3 Results
3.1 Susceptibility maps
The cumulative map obtained with the MaxEnt method has been classified to delimit five classes of susceptibility from very low (green) to very high (dark red); the same classification was applied to the GWLR result. The spatial distribution of the differentiated susceptibility classes is shown in the maps in Figs. 7, 8 and 9, for the three models (samples) described in the previous section. In them, it is found that the exchange of the training and test data between models 1 and 2 (Figs. 7 and 8) produces different results of susceptibility zoning. At the same time, the results of modelling 3 (Fig. 9) are closer to those of modelling 2 (Fig. 8).
In general, there is a good fit between the isopleth map in Fig. 4 and the areas most affected by landslides, described in Sect. 2.2, and the established models.
3.2 Results verification
There are several metrics that can be used to evaluate and validate a susceptibility model. In this work, we refer to evaluation as the quantification of the fit between the susceptibility classification model and the distribution of landslides used to calibrate or train the model. However, validation is the estimation of the fit of the model’s predictive capacity, using landslides not used for the model’s calibration or training.
A useful statistical tool for measuring the discriminant ability of the model and for comparing the relative validity between models is the Area Under the Curve (AUC) of the Receiver Operating Characteristic (ROC) that is provided between the different MaxEnt output plots. The ROC curve is a statistical tool used to determine the discrimination capacity of a binomial test or model; it is a graph resulting from representing, for each threshold value, the measures of sensitivity and specificity and in which the sensitivity evaluates the “commission” while the complementary of the specificity (1- specificity) evaluates the “omission”.
The goodness of the models is estimated using the AUC, whose values can be between 0.5 and 1; the higher the AUC, the better the prediction. Ideally, the AUC values should be above 0.5; an AUC equal to or less than this threshold indicates a random prediction or model without discriminant ability, and a value of 1 indicates that the model has a perfect fit.
There are several conventional scales of interpretation of the AUC that allow classifying the results according to their degree of accuracy or goodness. The Swets (1988) criterion establishes that an area below 0.7 has a low discriminant capacity and that one below 0.9 can be useful for some purposes, cataloguing those greater than 0.9 with high accuracy. The AUC-ROC values > 0.90 are considered excellent, according to Metz (1978), between 0.80 and 0.89 good; 0.70 and 0.79 regular and < 0.70 bad.
For both MaxEnt and GWLR, we have applied two evaluation and validation techniques with the ROC tool, the cross-validation for models 1 and 2, and bootstrapping for model 3, using the proportions indicated in Sect. 2.4.2. The AUC values are shown in Figs. 10 and 11, and in Table 4, where it is observed that both algorithms offer good discriminating capacity. According to the Metz criterion, the AUCs obtained allow the models to be rated as excellent both in classification (training sample) and in prediction (testing or validation sample), except for the 1-MaxEnt model, which presents an excellent classification but only good prediction.
However, for the evaluation of the performance of the models, apart from the AUC-ROC, we have used the index of the degree of fit (Goodchild 1986; Baeza 1994) that allows us to calculate the percentage of the mobilized surface in each degree or class of susceptibility and thus determine the fit of the movements to the established susceptibility, using the following formula:
where (mi) is the mobilized surface in class (i) of susceptibility and (ti) total surface of class (i) of susceptibility.
In this way, the spatial extension of both landslides and susceptibility zoning is taken into account, which is not done when using performance metrics such as the AUC-ROC approach. Since the final objective of the research is to establish the spatial distribution of the susceptibility projected on a map and that the landslides themselves correspond more to polygons––not points-, it seems important to carry out the evaluation with data of the degree of fit (DF), which is based on the zonal nature of the phenomena and factors.
The distribution by classes of the susceptibility of the study area, as well as the degree of fit of the movements (considering all of them, that is, the sum of the test sample and the validation sample), for each model, is collected at Fig. 12. The results obtained suggest that there is a good degree of fit for all the models, with the MaxEnt results being better than those of the GWLR, as the first ones present lower DF values in the low susceptibility classes and higher in levels of high susceptibility.
4 Discussion
First, regarding the inventory, the Río Aguas catchment has an incidence of landslides of 2.58% of its total area in the same order of magnitude than those found in other areas of similar characteristics in the Betic Cordilleras (Fernández et al. 2003; Irigaray et al. 2007; Jiménez-Perálvarez et al. 2011). By landslide type, rock fall is abundant, followed by slides and complex movements, with debris flows being the scarcest group. The scarcity of flows that have already been observed in other semi-arid zones of similar characteristics stands out (Jiménez-Perálvarez et al. 2011) and could be related to the ephemeral nature and the small size of these movements, generated in torrential rainstorms across the area.
Factor analysis reveals that approximately 37% of the total surface of the drainage basin affected by movements is located in metamorphic materials of the Alpujárride and Nevado–Filábride complexes, and the remaining 63% is located in the post-orogenic fills of the Sorbas Basin. Among the former, the most susceptible lithologies are carbonate rocks related to rock falls and slides, and phyllites related mainly to slides, the complex movements being related to the geological boundaries between competent and underlying incompetent weaker materials, which all agree with previous works (Fernández et al. 1994; 2003). Among the latter, conglomerates, sands and marls; gypsum; limestone and gravels, sands and silts are the more prone materials to instability. Meanwhile, landslides show a increasing distribution with the slope-angle is typically expected (Brabb et al. 1972; DeGraff & Romesburg 1980), but a concentration at around 400 m elevation and on mainly north-facing slopes possibly indicates some form of structural, as well as micro-climatic, control. Moreover, the landslides are predominantly located within close proximity of the drainage network (consistent with observations made by Hart (2004) and Geach et al. (2017)), faulting and geological boundaries between incompetent and competent underlying materials due to the weakening that these features produce in rocks.
This observation fits with the geological and geomorphological evolution of the Rio Aguas catchment area, and in particular, the ongoing impact of the Río Feos-Río Aguas river capture event described above, as well as the effects of tectonic uplift and incision of the drainage network across the wider region. As noted by Geach et al (2017), many of these drainage lines are still actively responding to a combination of the river capture, tectonic uplift and base-level change, with this being reflected in the landslide distribution mapped by Griffiths et al. (2002) and Hart (2004), as well the landslide susceptibility mapping presented here (Fig. 9).
The results obtained when using AUC values to evaluate the performance of the methods and models allow qualifying these as excellent (AUC > 0.9). No great differences are observed between these statistical adjustment values, although model 2 presents higher values than model 1, as well as the GWLR method values higher than the MaxEnt, especially if we consider the validation samples. However, the greatest differences are observed in the susceptibility distribution and the degree of adjustment (Fig. 12). In general, the maps obtained with the MaxEnt method are less conservative (without a tendency to overpredicting), that is, they present a smaller extension of the areas of high-very high susceptibility regarding to low-very low and present a better degree of adjustment than those obtained with GWLR. At the same time, in the MaxEnt method, the zones of medium susceptibility predominate over the extreme ones, unlike in the GWLR method, where extreme susceptibilities predominate. Furthermore, in the MaxEnt method, the distribution of classes is quite similar in the three models used, while in the GWLR, there is a certain shift towards classes with higher susceptibility in model 3 (it is more conservative/tendency to overpredicting).
Regarding the predictive capacity of the models, for which the validation samples have been considered, model 2 allows to obtain better AUC values compared to model 1 (Fig. 10). These results are also reflected in the distribution of the degree of adjustment by susceptibility classes (DF), with the adjustment results at the very high susceptibility class being higher in the case of model 2 than in model 1 (Fig. 13). It must be taken into account that the models are trained from data obtained by different authors, the first data set of 315 movements is from Hart (2004) and the second of 254 movements has been obtained for this work, which could be significant for the study of the influence of the inventory on the modelling. By repeating the same effect of improvement of model 2 in the AUC and the DF, both in the prediction with MaxEnt as with the GWLR, we believe that there is a clear influence from used inventory data. Modelling 2 (254 movements for training) improves the prediction over modelling 1 (315 movements for training). This could be explained by a certain bias in the 2004 inventory, due to the space–time scale used, since that the 2004 inventory was based on aerial photographs at a scale of 1:30,000 of flights between September 1984 and April 1985 that covered the majority of the área, in addition to 1:13,333 photographs taken in 1996 that only covered a small part of the área, and field verification mapping (Griffiths et al. 2002; Hart 2004). Meanwhile, the new inventory has been compiled using multiscale and multitemporal photointerpretation as indicated above.
For model 3, the mean AUC values for the training and test data set, obtained with the GWLR (0.994 and 0.972), are better than those of MaxEnt (0.93 and 0.927); however, the MaxEnt models are somewhat less conservative and present a better degree of fit (DF) than the GWLR (Fig. 12), as happened in models 1 and 2. Furthermore, if we compare the DF values with their respective distributions of susceptibility, MaxEnt modelling has a clear advantage, since in this case, the surfaces classified as high to very high susceptibility represent only 17.24% of the study area, but they concentrate more than 92% of the areas affected by landslides. However, for GWLR, those same susceptibility levels, whose value is more than double that in MaxEnt (approximately 41% of the total area), are affected by approximately 86% of the landslides in the area, which implies a better capacity to susceptibility discrimination of the MaxEnt model. Therefore, it can be concluded that both methods allow obtaining models with excellent AUC values, even with differences in the AUC themselves and in the DFs.
In this way, to obtain the susceptibility distribution of the area, we have resorted to a consensus model of both methods, used in other disciplines (Araújo & New 2007) and also proposed for mapping susceptibility to landslides (Felicísimo et al. 2013). This model is the arithmetic mean of the obtained models and is established by simple GIS geoprocessing with map algebra. The new resulting susceptibility distribution has been subjected to a new evaluation and validation using the same data sets used for the GWRL bootstrapping (used to obtain model 3) obtaining average AUC values of 0.990 for performance and 0.991 for validation. We have also calculated the susceptibility distribution (Table 5) and the degrees of fit (Fig. 14).
This model proposed as the final susceptibility map of the study area (Fig. 15) allows to obtain an average training AUC equivalent to the best AUC of the two performed (GWLR) and even improves the one corresponding to the validation. Likewise, an improvement is observed in the DF of the GWLR, which were below the DF of MaxEnt. Compared to the GWLR, the adopted model improves performance by lowering the DF values of the susceptibility levels, very low, low and moderate, maintaining the high grade and increasing the DF of the very high level.
5 Conclusions
The Río Aguas catchment has an incidence of landslides of 2.58% of its total area. By type and area affected, rock fall is abundant, followed by slides and complex movements, with debris flow being the scarcest group. Approximately 37% of the total surface of the drainage basin affected by landslides is located in metamorphic materials of the Alpujárride and Nevado–Filábride complexes, and the remaining 63% is located in the post-orogenic fills of the Sorbas Basin. Of the former, the most susceptible lithologies are carbonate rocks and phyllites and of the latter, conglomerates, sands and marls; gypsum; limestone and gravels, sands and silts.
Two statistical models have been used for the landslide susceptibility modelling, MaxEnt (maximum entropy) and GWLR (geographically weighted logistic regression) where each landslide is represented as a point location corresponding to the centroid of the rupture zone and the factor layers being converted into a raster format. This study has used 11 predictors related to morphometry, hydrography, geology and land cover, with 5 m grid spacing. This resolution and the representation of landslides as point location make it necessary to have inventories that capture the full range of landslide activity as a gridded format. This study has been able to verify that the inventory obtained from Google Earth images can be recommended for this type of modelling by offering multiscale and multitemporal data, in addition to its advantage of free access to data, as have already used in previous works (Costanzo et al. 2012; Eeckhaut et al. 2012; Zhang et al. 2015; Lombardo et al. 2016a).
For the proposal of a susceptibility model of the Río Aguas catchment, the performance and predictive capacities of both methods were determined (MaxEnt and GWLR) by using the evaluation and validation statistics of the ROC curves, considering different configurations of the training and validation samples. Thus, excellent values of the area index parameter under the ROC curve (AUC), higher than 0.9, have been obtained. However, it was considered important to combine the statistical metrics used here with the fit of the polygonal data to the models, since the final object of the research is to establish the spatial distribution of susceptibility projected on a map and the mapped polygonal landslides data, not with the point locations used in the analysis. This study therefore also evaluated the degree of fit (DF) of the results, thus measuring the fit of the landslides polygons to the susceptibility areas. This also showed good results with the two models.
The evaluation and validation of the results of the MaxEnt and GWLR models have not allowed us to rule out one model or another for the mapping of susceptibility by producing comparable yields, so a consensus model (the arithmetic mean of both) (Araújo & New 2007; Felicísimo et al. 2013) was used. This simple model combination technique has been shown to further improve the fit data for both performance and prediction of the final proposed model (AUC = 0.99). In this way, around 36% of the surface of the Río Aguas basin has been classified as having high to very high susceptibility, both zones presenting a DF of 90%, which reports the excellent fit of the model.
The authors are very aware that similar studies are frequently published, seeking to demonstrate how different statistical methods and techniques have been used and tested to map landslide susceptibility across different project study areas. As noted by Hearn & Hart (2019), many of these papers aspire to develop methodologies that could be used by engineering or planning practitioners, but often appear to ignore the geological and geomorphological evolution of those study areas (and therefore, the context of why those landslides are occurring) and thus ignoring those factors that are of relevance to those same practitioners. As well as focussing on assessing different statistical models and how they might predict the mapped landslide datasets used in this study, this study has also sought to use a landslide dataset that was based on both detailed mapping of the Rio Aguas catchment area and an understanding of those overarching factors that were potentially driving that landslide activity (e.g. tectonic uplift, base-level change and drainage evolution through river capture).
References
Alcantara-Ayala, I., & Thornes, J. B. (1996) The environmental dimensions of mass failure in semi-arid Spain. In Landslides, 1837–1842.
Aleotti P, Chowdhury R (1999) Landslide hazard assessment: summary review and new perspectives. Bull Eng Geol Env 58(1):21–44
Araújo MB, New M (2007) Ensemble forecasting of species distributions. Trends Ecol Evol 22(1):42–47. https://doi.org/10.1016/j.tree.2006.09.010
Ardilly P, Audric S, de Bellefon M-P, Buron M-L, Durieux E, Eusebio P, Favre-Martinoz C, Floch J-M, Fontaine M, Genebes L, et al. (2018) Manuel d’analyse spatiale
Baeza Adell, C. (1994) Evaluación de las condiciones de rotura y la movilidad de los deslizamientos superficiales mediante el uso de técnicas de anáisis multivariante. In TDX (Tesis Doctorals en Xarxa). https://upcommons.upc.edu/handle/2117/93582#.XJkD1FITUQk.mendeley
Baeza C, Corominas J (2001) Assessment of shallow landslide susceptibility by means of multivariate statistical techniques. Earth Surf Proc Land 26(12):1251–1263. https://doi.org/10.1002/esp.263
Ballabio C, Sterlacchini S (2012) Support vector machines for landslide susceptibility mapping: the Staffora river basin case study. Italy Math Geosci 44(1):47–70. https://doi.org/10.1007/s11004-011-9379-9
Bonham-Carter GF, Agterberg FP, Wright DF (1990) Integration of geological datasets for gold exploration in Nova Scotia. Intro Read Geogr Inf Syst. https://doi.org/10.1029/sc010p0015
Boussouf S, Irigaray C, Chacon J (1994) Cartographie de l’aléa des mouvements de versants du bord nord-occidental de la dépression de Grenade (Espagne). In RAA Balkema (Ed.), International congress International Association of Engineering Geology pp 2223–2231
Boussouf S (1995) Cartographie de l’aléa des mouvements de versants du bord nord-occidental de la dépression de Grenade (Espagne). Contribution à l’étude des risques géologiques. Université Abdelmalek Essaâdi. https://doi.org/10.13140/RG.2.2.21226.57281
Brabb EE (1984) Innovative approaches to landslide hazard and risk mapping. Iandslides Glissements De Terrain IV Int Symp Landslides Toronto, Canada 1:307–323
Brabb EE (1991) The world landslide problem. Epis J Int Geosci 14(1):52–61
Brabb, E. E., Pampeyan, E. H., & Bonilla, M. G. (1972) Landslide susceptibility in San Mateo County, California.
Brunsden C, Fotheringham AS, Charlton ME (1996) Geographically weighted regression: a method for exploring spatial nonstationarity. Geogr Anal. https://doi.org/10.1111/j.1538-4632.1996.tb00936.x
Calaforra JM, Pulido-Bosch A (1997) Peculiar landforms in the gypsum karst of Sorbas (southeastern Spain). Carbonates Evaporites 12(1):110
Candy I, Black S, Sellwood BW (2005) U-series isochron dating of immature and mature calcretes as a basis for constructing Quaternary landform chronologies for the Sorbas basin, southeast Spain. Quatern Res 64(1):100–111. https://doi.org/10.1016/j.yqres.2005.05.002
Carrara A, Merenda L (1974) Methodology for an inventory of slope instability events in Calabria (southern Italy). Geologia Applicata e Idrogeologica 9:237–255
Carrara A (1988) Multivariate models for landslide hazard evaluation. A “black box” approach. Workshop on Natural Disasters in European Mediterranean Countries, Perugia, Italy, 205–224.
Castellanos Abella EA, Van Westen CJ (2008) Qualitative landslide susceptibility assessment by multicriteria analysis: a case study from San Antonio del Sur, Guantánamo. Cuba Geomorphology 94(3–4):453–466. https://doi.org/10.1016/j.geomorph.2006.10.038
Chacón J, Irigaray C, Fernández T, El Hamdouni R (2006) Engineering geology maps: Landslides and geographical information systems. Bull Eng Geol Env 65(4):341–411. https://doi.org/10.1007/s10064-006-0064-z
Chen W, Pourghasemi HR, Kornejady A, Zhang N (2017) Landslide spatial modeling: introducing new ensembles of ANN, MaxEnt, and SVM machine learning techniques. Geoderma. https://doi.org/10.1016/j.geoderma.2017.06.020
Chung C-JF, Fabbri AG, Westen Van CJ (1995) Multivariate regression analysis for landslide hazard zonation. Geographical information systems in assessing natural hazards. Springer, 107–133
Clark WAV, Hosking PL (1986) Statistical methods for geographers (Issue 310 C5)
Conrad O, Bechtel B, Bock M, Dietrich H, Fischer E, Gerlitz L, Wehberg J, Wichmann V, Böhner J (2015) System for automated geoscientific analyses (SAGA) v. 2.1.4. Geoscientific Model Development 8(7):1991–2007
Consejería de Política Industrial y Energía, 2011 Mapa Geológico-Minero de Andalucía a escala 1:400.000, Junta de Andalucía. Available online: https://www.juntadeandalucia.es/portalandaluzdelamineria/Inicio.action?nameSpace=%2F.
Convertino M, Troccoli A, Catani F (2013) Detecting fingerprints of landslide drivers: a MaxEnt model. J Geophys Res Earth Surf. https://doi.org/10.1002/jgrf.20099
Costanzo D, Cappadonia C, Conoscenti C, Rotigliano E (2012) Exporting a google earth™ aided earth-flow susceptibility model: a test in central Sicily. Nat Hazards 61(1):103–114. https://doi.org/10.1007/s11069-011-9870-0
Dabrio CJ, Polo MD (1995) Oscilaciones eustáticas de alta frecuencia en el Neógeno superior de Sorbas (Almería, sureste de España). Geogaceta 18:75–78
Davis J, Blesius L (2015) A hybrid physical and maximum-entropy landslide susceptibility model. Entropy. https://doi.org/10.3390/e17064271
DeGraff JV, Romesburg, C. (1980) Regional landslide susceptibility assessment for wildland management: a matrix approach.
Einstein NH (1988) Special lecture: landslide risk assessment procedure. Int Symp Landslides 5:1075–1090
El Hamdouni R, Irigaray C, Fernández T, Chacón J, Keller EA (2008) Assessment of relative active tectonics, southwest border of the Sierra Nevada (southern Spain). Geomorphology 96(1–2):150–173. https://doi.org/10.1016/j.geomorph.2007.08.004
Esteban-Parra MJ, Rodrigo FS, Castro-Diez Y (1998) Spatial and temporal patterns of precipitation in Spain for the period 1880–1992. Int J Climatol 18(14):1557–1574. https://doi.org/10.1002/(sici)1097-0088(19981130)18:14%3c1557::aid-joc328%3e3.3.co;2-a
Felicísimo ÁM, Cuartero A, Remondo J, Quirós E (2013) Mapping landslide susceptibility with logistic regression, multiple adaptive regression splines, classification and regression trees, and maximum entropy methods: a comparative study. Landslides. https://doi.org/10.1007/s10346-012-0320-1
Fell R, Corominas J, Bonnard C, Cascini L, Leroi E, Savage WZ (2008) Guidelines for landslide susceptibility, hazard and risk zoning for land use planning. Eng Geol 102(3):85–98. https://doi.org/10.1016/j.enggeo.2008.03.022
Fernández T, Irigaray C, El Hamdouni R, Chacón J (2003) Methodology for landslide susceptibility mapping by means of a GIS. Application to the contraviesa area (Granada, Spain). Nat Hazards 30(3):297–308. https://doi.org/10.1023/B:NHAZ.0000007092.51910.3f
Fernández T, Irigaray C, Chacón J (1994) Large scale analysis and mapping of determinant factors of landsliding affecting rock massifs in the eastern Costa del Sol (Granada, Spain) in a GIS. In: International Congress International Association of Engineering Geology, 4649–4658
Ferre E (1997) Unidades de diagnóstico para la evaluación de la peligrosidad geomorfológica en el valle del Andrax (Prov. de Almería). In Baética: Estudios de arte, geografía e historia (Issue 19, pp. 111–134). https://dialnet.unirioja.es/servlet/articulo?codigo=95371&info=resumen&idioma=SPA
Florez García AC, Pérez Castillo JN (2019) Técnicas para la predicción espacial de zonas susceptibles a deslizamientos. Avances Investigación En Ingeniería. https://doi.org/10.18041/1794-4953/avances.1.5188
Fotheringham AS, Charlton M, Brunsdon C (1996) The geography of parameter space: an investigation of spatial non-stationarity. Int J Geogr Inf Syst 10(5):605–627. https://doi.org/10.1080/026937996137909
Fotheringham AS, Charlton M, Brunsdon C (1997) Two techniques for exploring non-stationarity in geographical data. Geograph Syst 4(1):59–82
Fotheringham S, Charlton M, Brunsdon C (2002) Chp 2: geographically weighted regression: the basics. In geographically weighted regression: the analysis of spatially varying relationships
Geach MR, Stokes M, Hart A (2017) The application of geomorphic indices in terrain analysis for ground engineering practice. Eng Geol 217:122–140. https://doi.org/10.1016/j.enggeo.2016.12.019
Goetz JN, Brenning A, Petschko H, Leopold P (2015) Evaluating machine learning and statistical prediction techniques for landslide susceptibility modeling. Comput Geosci 81:1–11
Goodchild MF (1986) Spatial autocorrelation (Vol. 47). Geo Books
Gorsevski PV, Jankowski P (2010) An optimized solution of multi-criteria evaluation analysis of landslide susceptibility using fuzzy sets and Kalman filter. Comput Geosci 36(8):1005–1020. https://doi.org/10.1016/j.cageo.2010.03.001
Griffiths JS, Stokes M (2008) Engineering geomorphological input to ground models: an approach based on earth systems. Q J Eng GeolHydrogeol 41(1):73–91. https://doi.org/10.1144/1470-9236/07-010
Griffiths JS, Mather AE, Hart AB (2002) Landslide susceptibility in the Río Aguas catchment, SE Spain. Q J Eng GeolHydrogeol 35(1):9–17. https://doi.org/10.1144/qjegh.35.1.9
Griffiths JS, Hart AB, Mather AE, Stokes M (2005) Assessment of some spatial and temporal issues in landslide initiation within the Río Aguas catchment. South East Spain Landslides 2(3):183–192. https://doi.org/10.1007/s10346-005-0004-1
Guzzetti F, Carrara A, Cardinali M, Reichenbach P (1999) Landslide hazard evaluation: a review of current techniques and their application in a multi-scale study. Cent Italy Geomorphol 31(1–4):181–216. https://doi.org/10.1016/S0169-555X(99)00078-1
Guzzetti F, Reichenbach P, Cardinali M, Galli M, Ardizzone F (2005) Probabilistic landslide hazard assessment at the basin scale. Geomorphology 72(1–4):272–299. https://doi.org/10.1016/j.geomorph.2005.06.002
Hart AB (2004) Landslide investigation in the Río Aguas catchment. Southeast Spain. University of Plymouth. http://hdl.handle.net/10026.1/2097
Harvey AM, Wells SG (1987) Response of quaternary fluvial systems to differential epeirogenic uplift: Aguas and Feos river systems, southeast Spain. Geology 15(8):689–693. https://doi.org/10.1130/0091-7613(1987)15%3c689:ROQFST%3e2.0.CO;2
Hearn GJ, Hart AB (2019) Landslide susceptibility mapping: a practitioner’s view. Bull Eng Geol Env 78(8):5811–5826. https://doi.org/10.1007/s10064-019-01506-1
Hong H, Pradhan B, Xu C, Bui DT (2015) Spatial prediction of landslide hazard at the Yihuang area (China) using two-class kernel logistic regression, alternating decision tree and support vector machines. CATENA 133:266–281
Hong H, Naghibi SA, Pourghasemi HR, Pradhan B (2016) GIS-based landslide spatial modeling in Ganzhou City. China Arab J Geosci 9(2):1–26. https://doi.org/10.1007/s12517-015-2094-y
Irigaray C, Fernández T, El Hamdouni R, Chacón J (1999) Verification of landslide susceptibility mapping: a case study. Earth Surf Process Landf J British Geomorphol Res Group 24(6):537–544
Irigaray C, Fernández T, El Hamdouni R, Chacón J (2007) Evaluation and validation of landslide-susceptibility maps obtained by a GIS matrix method: examples from the Betic Cordillera (southern Spain). Nat Hazards 41(1):61–79. https://doi.org/10.1007/s11069-006-9027-8
Irigaray C, Chacón J, Fernández T (1996) Methodology for the analysis of landslide determinant factors by means of a GIS: application to the Colmenar area (Malaga, Spain). Landslides. Balkema, Rotterdam, 163–172
Jaynes ET (1957) Information theory and statistical mechanics. Phys Rev 106(4):620
Jaynes ET (1982) On the rationale of maximum-entropy methods. Proc IEEE. https://doi.org/10.1109/PROC.1982.12425
Jiao Y, Zhao D, Ding Y, Liu Y, Xu Q, Qiu Y, Liu C, Liu Z, Zha Z, Li R (2019) Performance evaluation for four GIS-based models purposed to predict and map landslide susceptibility: a case study at a world heritage site in southwest China. Catena. https://doi.org/10.1016/j.catena.2019.104221
Jiménez-Perálvarez JD, Irigaray C, Hamdouni RE, Chacón J (2011) Landslide-susceptibility mapping in a semi-arid mountain environment: an example from the southern slopes of Sierra Nevada (Granada, Spain). Bull Eng Geol Env 70(2):265–277. https://doi.org/10.1007/s10064-010-0332-9
Kerekes A, Poszet SL, Gal A (2018) Landslide susceptibility assessment using the maximum entropy model in a sector of the Cluj-Napoca Municipality Romania. Revista de Geomorfologie. https://doi.org/10.21094/rg.2018.039
Kim HG, Lee DK, Park C, Kil S, Son Y, Park JH (2015) Evaluating landslide hazards using RCP 4.5 and 8.5 scenarios. Environ Earth Sci 73(3):1385–1400. https://doi.org/10.1007/s12665-014-3775-7
Kleidon A, Malhi Y, Cox PM (2010) Maximum entropy production in environmental and ecological systems. Philos Trans R Soc b: Biol Sci. https://doi.org/10.1098/rstb.2010.0018
Kornejady A, Ownegh M, Bahremand A (2017) Landslide susceptibility assessment using maximum entropy model with two different data sampling methods. CATENA. https://doi.org/10.1016/j.catena.2017.01.010
Lee S, Ryu JH, Won JS, Park HJ (2004) Determination and application of the weights for landslide susceptibility mapping using an artificial neural network. Eng Geol 71(3–4):289–302. https://doi.org/10.1016/S0013-7952(03)00142-X
Lombardo L, Fubelli G, Amato G, Bonasera M (2016a) Presence-only approach to assess landslide triggering-thickness susceptibility: a test for the Mili catchment (north-eastern Sicily, Italy). Nat Hazards 84(1):565–588. https://doi.org/10.1007/s11069-016-2443-5
Lombardo L, Bachofer F, Cama M, Märker M, Rotigliano E (2016b) Exploiting maximum entropy method and ASTER data for assessing debris flow and debris slide susceptibility for the Giampilieri catchment (north-eastern Sicily’ Italy). Earth Surf Process Landf. https://doi.org/10.1002/esp.3998
Macau, F. (1963) Previsión de los movimientos del terreno. Informaciones y estudios. Servicio Geológico de Obras Públicas Del MOP. Boletin 16:83
Martín JM, Braga JC (1994) Messinian events in the Sorbas Basin in southeastern Spain and their implications in the recent history of the Mediterranean. Sed Geol. https://doi.org/10.1016/0037-0738(94)90042-6
Mather AE (2000) Adjustment of a drainage network to capture induced base-level change: an example from the Sorbas Basin SE Spain. Geomorphology 34(3):271–289. https://doi.org/10.1016/S0169-555X(00)00013-1
Mather AE, Stokes M, Griffiths JS (2002) Quaternary landscape evolution: a framework for understanding contemporary erosion southeast Spain. Land Degrad Dev 13(2):89–109. https://doi.org/10.1002/ldr.484
Melchiorre C, Matteucci M, Azzoni A, Zanchi A (2008) Artificial neural networks and cluster analysis in landslide susceptibility zonation. Geomorphology 94(3–4):379–400. https://doi.org/10.1016/j.geomorph.2006.10.035
Merow C, Smith MJ, Silander JA (2013) A practical guide to MaxEnt for modeling species’ distributions: what it does, and why inputs and settings matter–Merow–2013–Ecography–Wiley Online Library. Ecography.
Metz CE (1978) Basic principles of ROC analysis. Semin Nucl Med. https://doi.org/10.1016/S0001-2998(78)80014-2
Milliès-Lacroix A (1968) glissements de terrains. Présentation d’une carte prévisionnelle des mouvements de masse dans le Rif (Maroc septentrional). Mines Et Géologie 27:45–55
Nicholson DT, Hencher SR (1997) Assessing the potential for deterioration of engineered rockslopes. Proceedings of the IAEG Symposium, Athens, 911–917.
O’Banion MS, Olsen MJ (2014) Predictive seismically-induced landslide hazard mapping in oregon using a maximum entropy model (MaxEnt). NCEE 2014––10th U.S. national conference on earthquake engineering: frontiers of earthquake engineering. https://doi.org/10.4231/D3C24QN8T
Ott d’Estevou P, Montenat C (1990) Le Bassin Sobras–Tabernas. Documents Et Travaux De L’institut Géologique Albert De Lapparent 12–13:101–128
Pandey VK, Pourghasemi HR, Sharma MC (2020) Landslide susceptibility mapping using maximum entropy and support vector machine models along the highway corridor Garhwal Himalaya. Geocarto Int. https://doi.org/10.1080/10106049.2018.1510038
Park NW (2015) Using maximum entropy modeling for landslide susceptibility mapping with multiple geoenvironmental data sets. Environ Earth Sci. https://doi.org/10.1007/s12665-014-3442-z
Phillips SJ, Anderson RP, Schapire RE (2006) Maximum entropy modeling of species geographic distributions. Ecol Model. https://doi.org/10.1016/j.ecolmodel.2005.03.026
Phillips SJ, Dudík M, Schapire RE [Internet] Maxent software for modeling species niches and distributions (Version 3.4.1). Available from url: http://biodiversityinformatics.amnh.org/open_source/maxent/.
Pita MF, Caravaca I, Feria JM, Alcalá-Zamora A, Vallejo I (1999) riesgos catastróficos y ordenación del territorio en andalucía. Consejería de Obras Públicas y Transportes, Junta de Andalucía.
Pourghasemi HR, Mohammady M, Pradhan B (2012a) Landslide susceptibility mapping using index of entropy and conditional probability models in GIS: Safarood Basin Iran. Catena. https://doi.org/10.1016/j.catena.2012.05.005
Pourghasemi HR, Pradhan B, Gokceoglu C (2012b) Remote sensing data derived parameters and its use in landslide susceptibility assessment using shannon’s entropy and GIS. Appl Mech Mater 225:486–491. https://doi.org/10.4028/www.scientific.net/AMM.225.486
Pradhan B, Lee S (2010) Landslide susceptibility assessment and factor effect analysis: backpropagation artificial neural networks and their comparison with frequency ratio and bivariate logistic regression modelling. Environ Model Softw 25(6):747–759. https://doi.org/10.1016/j.envsoft.2009.10.016
Pulido-Bosch A (1986) Le karst dans les gypses de Sorbas (Almeria). Aspects Morphologiques Et Hydrogéologiques Karstologia Mémoires 1:27–35
REDIAM, 2022. Descargas de información ambiental, Red de información Ambiental de Andalucía, Consejería de Sostenibilidad, Medio Ambiente y Economía Azul, Junta de Andalucía. Available online: https://www.juntadeandalucia.es/medioambiente/portal/web/guest/acceso-rediam/descargas/descargas-de-informacion-ambiental.
Reichenbach P, Rossi M, Malamud BD, Mihir M, Guzzetti F (2018) A review of statistically-based landslide susceptibility models. Earth Sci Rev 180:60–91. https://doi.org/10.1016/j.earscirev.2018.03.001
Remondo J, Bonachea J, Cendrero A (2008) Quantitative landslide risk assessment and mapping on the basis of recent occurrences. Geomorphology 94(3–4):496–507. https://doi.org/10.1016/j.geomorph.2006.10.041
Renner IW, Warton DI (2013) Equivalence of MAXENT and poisson point process models for species distribution modeling in ecology. Biometrics. https://doi.org/10.1111/j.1541-0420.2012.01824.x
Sanz de Galdeano C, Shanov SS, Galindo-Zaldívar J, Radulov A, Nikolov G (2010) A new tectonic discontinuity in the Betic Cordillera deduced from active tectonics and seismicity in the Tabernas Basin. J Geodyn 50(2):57–66. https://doi.org/10.1016/j.jog.2010.02.005
Shannon CE (1948) A mathematical theory of communication. Bell Syst Techn J 27(3):379–423
Stevenson PC (1977) An empirical method for the evaluation of relative landslip risk. Bull Int Assoc Eng Geol Bull de l Assoc int de Géol de l’ingénieur 16(1):69–72
Stokes M, Mather AE, Harvey AM (2002) Quantification of river-capture-induced base-level changes and landscape development, Sorbas Basin, SE Spain. Geol Soc Lond Spec Publ. 191(1):23–35. https://doi.org/10.1144/GSL.SP.2002.191.01.03
Swets J (1988) Measuring the accuracy of diagnostic systems. Science 240(4857):1285–1293. https://doi.org/10.1126/science.3287615
Tobler WR (1970) A computer movie simulating urban growth in the Detroit region. Econ Geogr. https://doi.org/10.2307/143141
Van Den Eeckhaut M, Hervás J, Jaedicke C, Malet JP, Montanarella L, Nadim F (2012) Statistical modelling of Europe-wide landslide susceptibility using limited landslide inventory data. Landslides 9(3):357–369. https://doi.org/10.1007/s10346-011-0299-z
van Westen CJ, Soeters R, Sijmons K (2000) Digital geomorphological landslide hazard mapping of the Alpago area, Italy. Int J Appl Earth Obs Geoinf 2(1):51–60
van Westen CJ, Castellanos E, Kuriakose SL (2008) Spatial data for landslide susceptibility, hazard, and vulnerability assessment: an overview. Eng Geol 102(3–4):112–131. https://doi.org/10.1016/j.enggeo.2008.03.010
van Westen CJ (2000) The modelling of landslide hazards using GIS. Surv Geophys 21(2–3):241–255
Vorpahl P, Elsenbeer H, Märker M, Schröder B (2012) How can statistical models help to determine driving factors of landslides? Ecol Model 239:27–39. https://doi.org/10.1016/j.ecolmodel.2011.12.007
Weijermars R, Roep TB, Van Den Eeckhout V, Postma G, Kleverlaan K (1985) Uplift history of a Betic fold nappe inferred from Neogene-Quaternary sedimentation and tectonics (in the Sierra Alhamilla and Almeria, Sorbas and Tabernas basins of the Betic Cordilleras SE Spain. Geologie En Mijnbouw 64(4):397–411
Yilmaz I (2010) Comparison of landslide susceptibility mapping methodologies for Koyulhisar, Turkey: conditional probability, logistic regression, artificial neural networks, and support vector machine. Environ Earth Sci 61(4):821–836
Zare M, Pourghasemi HR, Vafakhah M, Pradhan B (2013) Landslide susceptibility mapping at Vaz Watershed (Iran) using an artificial neural network model: a comparison between multilayer perceptron (MLP) and radial basic function (RBF) algorithms. Arab J Geosci 6(8):2873–2888. https://doi.org/10.1007/s12517-012-0610-x
Zhang J, Gurung DR, Liu R, Murthy MSR, Su F (2015) Abe Barek landslide and landslide susceptibility assessment in Badakhshan Province. Afghanistan Landslides 12(3):597–609. https://doi.org/10.1007/s10346-015-0558-5
Acknowledgements
The authors acknowledge the help of the group TEP213 (SFT) of Andalusian Plan of Research, Development and Innovation (PAIDI); the Centre for Advanced Studies in Earth Sciences, Energy and Environment of the University of Jaén; and Atkins Ltd.
Funding
Funding for open access publishing: Universidad de Jaén/CBUA. The authors have not disclosed any funding.
Author information
Authors and Affiliations
Contributions
SB and TF contributed to conceptualization; SB contributed to methodology; SB and AH contributed to field work; SB contributed to data curation; SB and TF contributed to writing; TF and AH contributed to supervision. All authors have read and agreed to the published version of the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors have not disclosed any competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Boussouf, S., Fernández, T. & Hart, A.B. Landslide susceptibility mapping using maximum entropy (MaxEnt) and geographically weighted logistic regression (GWLR) models in the Río Aguas catchment (Almería, SE Spain). Nat Hazards 117, 207–235 (2023). https://doi.org/10.1007/s11069-023-05857-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11069-023-05857-7