1 Introduction

Shallow landslides and debris flows are dangerous phenomena responsible for a large number of casualties and economic losses (Schuster and Fleming 1986; Crosta 1998; Catani et al. 2005; Montrasio et al. 2009). Many efforts have been made to develop process-based models in order to assess slope stability at basin scale (Guzzetti et al. 1999, 2008). However, when trying to predict or model shallow landslides in a distributed way over large areas, one of the main problems is not the stability model itself, but the uncertainty in the spatial variability of the input parameters (Khazai and Sitar 2000; Savage et al. 2004; Guzzetti et al. 2007). There is large agreement on considering soil thickness as one of the most important factors controlling shallow landsliding (Johnson and Sitar 1990; Wu and Sidle 1995; Van Asch et al. 1999; Dietrich et al. 2008). Therefore, attention should be paid in choosing the right criterion to enter this parameter in basin scale slope stability models. To enter soil thickness in basin scale models, many authors rely on straightforward and simplistic solutions such as considering a spatially constant value in the whole studied area (Khazai and Sitar 2000; Savage et al. 2004), using soil thickness classes (Revellino et al. 2008), assigning a constant value for each geological formation encountered in the analysed site (Savage et al. 2004) or deriving a distributed soil thickness map from a single known topographic attribute such as the elevation (Saulnier et al. 1997) or the slope gradient (De Rose 1996; Salciarini et al. 2006). More complex methods that make use of multivariate statistical analyses (Gessler et al. 2000; Tesfa et al. 2009) or that employ process-based models (Casadei et al. 2003; Pelletier and Rasmussen 2009) are less frequently used as input data in large scale slope stability analyses because they are very site-specific and require some effort to be correctly applied and calibrated over large areas.

The objective of this paper is to compare five methods of producing spatially variable soil thickness maps in order to: (1) understand the sensitivity of slope stability analysis to soil thickness; (2) discuss how error estimation in soil thickness models influences slope stability assessment; (3) evaluate which soil thickness model is more effective when used in conjunction with basin scale slope stability models. To fulfil these objectives, a slope stability model will be applied on a pixel-by-pixel basis using a variety of soil thickness maps each obtained using a distinct method. This experiment will be repeated in two test sites characterised by very different geological and geomorphological settings.

2 Methodologies

2.1 Slope stability model

The model used in our analysis takes inspiration from Iverson’s model for landslides triggered by rain infiltration and couples a hydrological model based on a reduced form of Richards’ equation with an infinite slope stability analysis (Iverson 2000). Due to the assumption needed to find the simplified solution for the Richards’ equation, Iverson’s model neglects the effect of soil suction on the overall effective cohesion. In this modified version, the additional cohesion due to suction in unsaturated soils has been included within the slope stability model. Moreover, the slope stability simulator used during this study allows us to find a solution not only for a single rainfall event but also for complex storms characterised by different and alternating intensities in both space and time. The stability model is implemented in a computer code designed to be applied in operational forecasting systems: The analyses are performed in near real time and the computations can be continued without restating the analysis from the beginning, whenever new rainfall data are available. In addition, the code can also be used for back analysis, simulating a historical rainfall event as in the present work. The computer code used is not yet available to the public, since still under final development status in a PhD thesis (Rossi in preparation). In this work a preliminary version was used (Leoni 2009). As every other model based on infinite slope theory, it is designed to be applied wherever soil coverage, compared to the underlying bedrock, exhibits a marked change in hydraulic properties.

The model can be divided in two independent parts: the hydrological tool that calculates the spatial pressure head distribution ψ (Z, t) as a function of time t and depth Z, and the stability analysis tool that supplies the distributed value of the factor of safety as outcome.

In the simplified solution of Richards’ equations, pressure head ψ (Z, t) is subdivided in two cases: during rainfall (Eq. 1) and after rainfall (Eq. 2) (Iverson 2000).

$$ \psi (Z) = Z\beta \left( {1 - \frac{d}{Z}} \right) + Z{\frac{I}{{K_{Z} }}}[R(t)]\quad t < T $$
(1)
$$ \psi (Z) = Z\beta \left( {1 - \frac{d}{Z}} \right) + Z{\frac{I}{{K_{Z} }}}[R(t) - R(T)]\quad t \ge T $$
(2)

where β is the initial steady state pressure head, d is the water table depth, I is the intensity of rainfall, K Z is the hydraulic diffusivity, T is the rainfall end time and R(t) is the response function (Iverson 2000). In this way, we are able to obtain the pressure head ψ, which is an input of the stability model.

The stability analysis is based on the limit equilibrium method for an infinite slope. The adopted solution for the infinite slope model is that proposed by Skempton and DeLory (1957), in the form suggested by Iverson (2000). In this equation, the factor of safety is described as a function of depth and time, and it is divided into a time-varying component FS′(Z, t) and a steady background component FS 0 (Z):

$$ FS\left( {Z,t} \right) = \, FS_{0} \left( Z \right) \, + \, FS\prime \left( {Z,t} \right) $$
(3)

the safety factor can be written:

$$ FS(Z,\,t) = {\frac{\tan \varphi }{\tan \alpha }} + {\frac{c}{{\gamma_{s} Z\sin \alpha \cos \alpha }}} + {\frac{{\psi (Z,t)\gamma_{w} \tan \varphi }}{{\gamma_{s} Z\sin \alpha \cos \alpha }}} $$
(4)

where α is the slope angle, φ is the soil friction angle, γ s is soil unit weight, γ w is the unit weight of groundwater, c is the cohesion, and Z is depth.

For every pixel, the model calculates the factor of safety at increasing depths and allocates the lower FS value found. The computation is automatically repeated at different time steps. Depending on the depth at which the model is working, the sign of pressure head shows if the soil is saturated or unsaturated. If the pressure head is positive, the soil is saturated and the factor of safety can be calculated using directly Eq. 4. Instead, if the pressure head is negative, the soil is unsaturated, and it is necessary to take into account the increase in shear strength due to soil suction. In this case, c is the effective cohesion and can be written in this form (Leoni 2009):

$$ c \, = \, c_{0} + \, c\prime \left( \psi \right) $$
(5)

where c′(ψ) is the apparent cohesion function of the pressure head ψ.

In our analysis, the model will be applied in two test sites on a pixel-by-pixel basis at hourly time steps. At every time step, a new FS map is computed and finally, a summarising FS map of the event is produced assigning to every pixel the lowest FS value assumed in the various time steps (landslides are expected in the pixels which become unstable, even if only for a single time step). A validation is carried out comparing the summarising maps with a landslide inventory in order to quantify correct predictions and errors committed by each simulation. The threshold value of FS = 1 represents the physical limit for which a slope is ideally considered stable (limit equilibrium). Higher and lower values encountered in the simulations will be interpreted as stable and unstable (respectively) portions of the hill slope. To better highlight the differences in FS distribution due to the use of different soil thickness models, the summarising maps are displayed with an intermediate class of FS values (1 < FS < 1.3), which is not considered in the validation. During the validation, sensitivity, specificity and likelihood ratio of the simulations were calculated and used for a quantitative comparison (Beguería 2005; Alatorre and Beguería 2009). Sensitivity is defined as the ratio between true positives (the area correctly classified as unstable) and the actually unstable area; specificity is the ratio between false positives (the area erroneously classified as unstable) and the actually stable area; the likelihood ratio is defined as sensitivity/(1–specificity).

In all the simulations performed on each test site, all the input parameters are held constant, excepted soil thickness. The latter influences both the hydrological tool and the stability tool, as it determinates the lower extent to which the calculation of ψ and FS is performed (see Eqs. 1, 2, and 4).

2.2 Soil thickness models

To feed in the slope stability model with respect to soil thickness Z, different soil thickness maps, obtained using the following methods, will be used.

2.2.1 GIST model (and its simplified version)

The geomorphologically indexed soil thickness (GIST) is an empirical model that combines morphometric attributes with geomorphological and geological factors (Segoni 2008; Segoni and Catani 2008; Catani et al. 2010). This model can be applied at basin scale on a pixel-by-pixel basis using commercial GIS systems.

The model is based on three factors (C, P and S), whose value ranges from 0 to 1: A higher value indicates a tendency to have a thicker soil. The values assumed at every pixel by each factor depend on the local value of a corresponding morphometric attribute (curvature for factor C; position along the hill slope profile for factor P; slope gradient for factor S), but the relationship linking the morphometric attribute with the corresponding factor is not constant over the entire test area. Factor C values are basically assumed inversely dependent on curvature values, except where geomorphologic surveys highlights a direct proportionality (e.g. large accumulations of loose material at the footslopes); the area is divided according to the toposequences characterising the hillslopes, and for each subdivision the relationship linking factor P and hillslope position is independently calibrated; factor S cuts off soil thickness values when the local gradient exceeds a threshold which is different for every lithological class encountered in the area. The product of the three factors is then translated into a soil thickness measure by means of calibration functions, differentiated on a lithological basis and defined by means of a few in situ soil thickness measurements. For further details, see Catani et al. (2010).

In the following applications, also a simplified version of the GIST model will be used. In the simplified GIST model (sGIST henceforth), soil thickness is derived by means of a combination of the three morphometric attributes, without any geomorphological indexing (Catani et al. 2010).

2.2.2 Topographic models

The other soil thickness maps that will be used in the simulations are derived by the application of very simple methods, based on the correlation between soil thickness and a single topographic attribute. We refer to these as ‘topographic models’. Such methods are widely used (Saulnier et al. 1997; Salciarini et al. 2006; Blesius et al. 2009; De Rose 1996; Godt et al. 2008) because they can be easily and quickly applied at large areas in order to obtain spatially variable soil thickness maps without extensive field surveys.

In our case, the following methods have been used:

  • Z model: Soil thickness values are inversely proportional to the elevation, according to a linear law derived from a few calibration measures (Saulnier et al. 1997). This model relies on the assumption that in higher locations erosion usually prevails over deposition, producing shallower soils; vice versa at lower elevations, the predominance of deposition usually determines thicker colluvial and alluvial soils.

  • S model: Soil thickness values are inversely proportional to the slope gradient, according to a linear law derived from a few calibration measures. Similar approaches are among the most used (Saulnier et al. 1997; Blesius et al. 2009). This model relies on the assumption that on steeper slopes erosive processes are more intense, thus soils are shallower; in contrast, erosion is weaker on flat surfaces so that deposition prevails and thicker colluvial and alluvial soils can be usually found.

  • Sexp model: This model is very similar to the previous one, but the relationship between slope gradient and soil thickness is established by an exponential law (thus soil thickness is assumed to decrease more rapidly with increasing slope gradient values). A similar approach has been used by De Rose (1996), Salciarini et al. (2006) and Godt et al. (2008).

The aforementioned methods were applied to the test sites on a pixel-by-pixel basis by means of a GIS system to obtain spatially variable soil thickness maps.

3 Test site 1: Terzona basin

3.1 Test site description

The first test site, corresponding to the Terzona Creek basin, is 24 km2 wide and is located in the Chianti, Tuscany, Central Italy (Fig. 1). The site is mainly constituted by gentle hills made up of Pliocenic and Quaternary terrains and in the eastern part of the basin a rocky bedrock (Paleocene and Eocene turbiditic flysch) gives rise to higher (up to 512 m a.s.l.) and rougher reliefs. The hydraulic network has a marked erosive behaviour (also because of the still active tectonic uplift). The area is sparsely urbanised, and it is mainly covered by vineyards, olive groves and, to a lesser extent, small woods.

Fig. 1
figure 1

Location, morphology and lithology of the Terzona basin

3.2 Model application

In the Terzona case, the slope stability is modelled in response to a 24-h-long rainfall event that occurred in the November 2008. The same amount of rain recorded at 1-h intervals by a rain gauge located 6 km away from the Terzona basin (in an almost identical geographic and climatic setting) is assumed to uniformly fall over the whole basin. The simulation is subdivided into 24 time steps (one for each recording of the rain gauge). Slope angle was derived from a 10-m grid digital elevation model. A different set of values of mechanical parameters were considered for each geological formation. The values were partly determined by means of geotechnical analyses and partly using literature data related to case studies located near the Terzona basin and pertaining to the same kind of materials (Bianchi et al. 2001; Focardi & Garzonio 1983; Canuti et al. 1982). To enter soil thickness in the stability model, four different maps were used in distinct simulations. These maps were obtained by means of the methods called S, Z, sGIST and GIST (see Sect. 2.2). For further details on the application of these models to the Terzona test site, see also Catani et al. 2010. Figure 2 shows the very different soil thickness patterns obtained with the aforementioned models.

Fig. 2
figure 2

The four soil thickness maps computed in the Terzona basin

S-model returns the shallowest soils in the steepest flanks of the valleys more deeply incised by the hydraulic network, while very deep soils are expected in the alluvial plain and in large sectors of the interfluves occupied by pliocenic terrains (where a flat morphology is often encountered). Z model returns a very shallow soil at the highest altitudes (rocky relief) which progressively thickens moving downhill and at lower elevations. In the map derived from the simplified version of the GIST model, the dependence from the position along the hill slope is the more clearly noticeable, while using the full geomorphologically indexed soil thickness model the spatial organisation of soil thickness is clearly dependent on the geological characteristics of the area (compare Fig. 2 and Fig. 1).

These patterns were validated by means of 162 measures performed in situ. As it can be clearly seen in Fig. 3, the GIST model-derived soil thickness map had the best agreement: Compared to the other models, the mean absolute error was remarkably lower (11 cm versus 27, 53 and 94 cm), large residuals were less recurrent, and the frequency distribution was more balanced (while in general Z, S and sGIST models tended to overestimate the soil thickness).

Fig. 3
figure 3

Frequency distribution of residuals (expressed as the difference between predicted and observed soil thicknesses). Bin size is 8 cm. Modified after Catani et al. 2010

In the next section, we will evaluate the extent to which these errors in the soil thickness maps influenced the slope stability assessment.

3.3 Results and validation

By varying the soil thickness pattern, the slope stability model returned very different outputs (Fig. 4). When topographic methods were used to input soil thickness in the stability model, the latter depicted a very widespread instability in the part of the basin dominated by conglomerates. In addition, the S-model also returned FS values near to the limit equilibrium in large sectors of the rocky relief. Using the sGIST model, the instability was returned almost only in the clayey portions of the hillslopes, while 1 < FS < 1.3 values characterised the portion of the basin dominated by conglomerates. The FS map generated using Z derived from the GIST model strongly differed from the previous ones, as it was dominated by highly stable slopes and instability was limited to the footslopes and midslopes of the steepest granular soil hillslopes.

Fig. 4
figure 4

Factor of safety maps obtained using the soil thickness patterns shown in Fig. 2

A complete validation could not be performed because a field survey highlighted that the rainfall event did not trigger any shallow landslides. However, since the primary object of this work is the assessment of the soil thickness model influence in the factor of safety computation, the number of unstable pixels could be used to compare the reliability of the four models: The best one should help to archive the fewest unstable pixels (Table 1).

Table 1 Validation of the instability maps computed using different soil thickness models

The scenario depicted with the GIST model is the most consistent: 98% of the area was correctly classified as stable, while using the other soil thickness maps the percentage drops to 97% (sGIST) and 60% (Z and S). The difference between GIST and sGIST models becomes clearer when the pixels near the limit equilibrium are taken into account (Fig. 4).

4 Test site 2: Armea basin

4.1 Test site description

The second test site corresponds to the mid-portion and high portion of the Armea Creek catchment (Fig. 5), a small mountain basin located in the Western Alps (Liguria, Northern Italy). The basin covers 37 km2, and it is characterised by a high energy of relief, as the hydraulic network flows in deeply incised and narrow valleys from high mountains (up to 1,298 m a.s.l.) to the sea in a very short distance. From a geological point of view, the study area is dominated by Cretaceous Flysch (limestone and sandstone) with a rather complex structural setting; faults, thrusts, a tectonic window, recumbent folds and polyphase folds give evidence of the great deformation brought by the Alpine orogenesis (Sagri 1980; Sagri 1984; Menardi-Noguera 1988; Merizzi and Seno 1991). The steep slopes and the high energy of relief make the area very prone to erosive processes, and among these, shallow landsliding is one of the most important active shaping factors in the Armea valley. The study area is sparsely urbanised: For the most part, it is occupied by forests, small olive groves are present in the western side of the valley. The population concentrates in the small village of Ceriana (situated at the centre of the area) and in sparse houses scattered over the valley.

Fig. 5
figure 5

Location, morphology and lithology of the Armea basin

4.2 Model application

The Armea model run is related to a severe rainfall event that occurred in December 2006, which triggered several shallow landslides. The evolution of the slope stability is computed through 24 time steps (1 h each) making use of distributed rainfall maps (1.75 km spatial resolution) obtained by means of radar measurements and downscaling techniques (Segoni et al. 2009).

Slope angle was derived from a 5-m resolution digital elevation model.

As in the Terzona case study, a certain spatial variability has been granted to geotechnical parameters of the soil and a distinct value was used for each geological formation. Such values were obtained with laboratory and in situ geotechnical analyses and from data sets already available at the Earth Science Department of Firenze.

As for soil thickness maps, the Z, S, Sexp and GIST models (see Sect. 2.2) were applied to the Armea basin obtaining very different results (Fig. 6). As in the previous case study, slope- and elevation-dependent soil thickness maps exhibit a spatial variability identical to that of the attribute used for its derivation. Despite that, the two gradient-based models returned very different results as the exponential relationships (Sexp model) tended to depict shallower soils than the linear one (S model). The GIST model-derived soil thickness pattern, in turn, is evidently influenced by the lithology of the bedrock and by the relative position along the hillslope profile.

Fig. 6
figure 6

Soil thickness maps computed in the Armea basin by means of four different models

A number of direct soil thickness measures varying from 91 to 154 (depending on the number of measures required for the calibration of each model) were used to validate these maps (Fig. 7). As in the Terzona basin, Z and S models had the worst performance, with high mean absolute error, high frequency of large residuals and a pronounced tendency to overestimate soil thickness. Sexp model returned quite good results (45 cm mean absolute error), but the frequency distribution of the residuals showed a proneness to underestimation. The soil thickness map obtained with the GIST model seemed more reliable than the others: Mean absolute error was lower (23 cm), very large residuals were less frequent and no marked tendency to overestimation or underestimation was revealed by the frequency distribution of the residuals.

Fig. 7
figure 7

Frequency distribution of residuals (expressed as the difference between predicted and observed soil thicknesses). Note that y-axis scale varies from a histogram to another

4.3 Results and validation

To assess the impact that different soil thickness patterns can have on slope stability evaluation, the factor of safety calculations were repeated four times using the different models to input soil thickness. As can be seen from Fig. 8, it led to very different results.

Fig. 8
figure 8

Factor of safety maps obtained using the soil thickness patterns shown in Fig. 6

While using Z and S models, a generalised instability affected the largest part of the studied area, Sexp model depicts a diametrically opposite scenario, in which the Armea basin is stable almost everywhere. The result obtained using the GIST model is intermediate between these two very extreme conditions: Instability affected small sectors of the basin predominantly located in hollows with limestone bedrock. These maps were validated with an inventory of the shallow landslides triggered by the rainfall event used in the simulations. The inventory includes 141 shallow movements (mainly debris flows and soil slips) that damaged properties, roads and assets (in addition to causing injuries to one person) (Segoni et al. 2009). As can be seen also from Fig. 9, the landslide population is characterised by events with very small sliding area (one half covers areas smaller than 100 m2) and only a few phenomena with relevant magnitude.

Fig. 9
figure 9

Frequency distribution of the areal extent of landslides (sliding area) that occurred on December 2006 in the Armea basin

Focusing on a small but representative portion of the Armea basin (Fig. 10), we can better understand how the different soil thickness maps used in the simulation influence the modelling of the most destructive debris flows that occurred in the whole basin. Using the S and Z models, all debris flows were correctly identified, but a very high number of false positives were also computed. The use of Sexp model, on the contrary, led to an unrealistic portrayal of a completely stable hillslope. The soil depth map obtained with the GIST model produced a more realistic instability map: Unstable pixels were limited to the area of initiation of the debris flows, and false positives were limited to the surrounding areas. The missed landslide of Fig. 9 is the only large landslide which the GIST model failed to detect. Field survey evidenced that its triggering can be connected mainly to a set of external factors not accounted for in the stability modelling but of great importance in influencing shallow landsliding: The location in an area affected by recent wildfire (Cannon et al. 2001), the exceptionally fractured bedrock (Sas and Eaton 2008) and the presence of a road cut (Collins 2008).

Fig. 10
figure 10

Inventoried landslides (blue lines) compared with the results of the simulation in a zoom over a particularly representative area. Red indicates highly unstable areas, orange refers to moderately unstable areas. Stable areas are in green

The behaviour observed above at a localised scale was confirmed over the entire basin (Tables 2 and 3). Using S and Z models to input soil thickness in the simulation, a great percentage of landslides were correctly associated with the highest instability class (Table 2) but at the same time a high number of false positives were returned (Table 3). In these two cases, the validation showed a highly sensitive yet low specificity model. In fact, only 60% (S model) and 61% (Z model) of the area that did not experience landsliding was correctly identified as stable. The use of Sexp model returned opposite results: False positives were very low (97.3% of the stable area correctly classified as such), but very few landslides were identified (only 4 of 141, corresponding to the 5% of the total area that resulted affected by landslides). Between these two extremes, GIST model results were more balanced: The 40% of the area affected by landslides is classified as unstable and false positives accounted for only 9% of the unaffected area.

Table 2 True positives and false negatives in the Armea basin simulation
Table 3 False positives and true negatives committed in the Armea simulation

The likelihood ratio (Lr), defined as sensitivity/(1–specificity), can be used as an index of the overall performance of the models (Beguería 2005), to assess which one obtained the best performance. In this light, the use of GIST model gave the best outcome (Lr = 4.44), followed by Sexp (2.43), S (2.20) and Z (2.09).

5 Discussion

Considering different soil thickness models in the simulations, remarkably different results were obtained at both test sites, confirming that slope stability models based on the infinite slope theory are very sensitive to soil thickness (Johnson and Sitar 1990; Wu and Sidle 1995; Van Asch et al. 1999; Segoni et al. 2009). Infinite slope-based approaches are often believed to overestimate instability related to shallow landsliding; our simulations show that, at least in our case studies, an important contribution to overestimation can be provided by the errors made by feeding the slope stability model with inaccurate soil thickness maps.

The analyses showed a typical result: Increasing the sensitivity of the model (i.e. its ability to correctly predict the positive cases), its specificity was negatively affected (i.e. a higher rate of false positives was obtained).The errors generated in modelling the spatial organisation of soil thickness seem correlated with the errors of the derived instability maps, as in both test sites they appear proportional. In fact when soil thickness maps are affected by large overestimation, the derived instability map is overestimated accordingly. In both test sites, the S model returned the highest overestimation of soil thickness and the highest underestimation of the factor of safety. The same evidence repeats, at a slightly lesser extent, with the Z model. The sGIST model returned a soil thickness map affected by low error, but overestimation, even if small in magnitude, was predominant. That led to a FS map affected by systematic instability in the clayey terrains and by the predominance of moderately unstable areas in conglomerates, highlighting that the systematicity of soil thickness errors may be more important than their mean magnitude. This is further confirmed by the behaviour of Sexp model, which produced an evident (and very unsafe) overestimation of factor of safety as a consequence of its tendency to underestimate soil thickness, despite a rather low mean absolute error.

The use of GIST model leads to accurate soil thickness predictions in the Terzona basin, providing very consistent slope stability assessment, but validation could be performed considering only the occurrence of false positives. In the Armea basin, the results are more useful: The soil thickness pattern is more precise than the others methods employed, and the results of the slope stability model are more accurate (higher likelihood ratio). Even from a qualitative point of view the GIST-derived instability map proved to be the more consistent: The true positives rate (40%) is not so high but it includes all the large landslides except one (as discussed in Sect. 4.3), while the missed events are very small (105 m2 mean area). In addition, the other simulations were affected by systematic errors: Sexp-derived FS map depicted an almost uniformly stable scenario, while Z and S models identified a high percentage of landslides only because they predicted an extraordinarily broad instability over the whole area.

6 Conclusions

In this paper, we made a comparison between various methods to enter soil thickness as a spatial variable in basin scale slope stability model. We used a slope stability model that is a combination of a simplified solution of Richards’ infiltration equation for the hydrological model, and an infinite slope model with soil suction effect for the mechanical stability. Soil thickness was entered in the stability model using spatially variable maps obtained with different methods: (a) linear correlation with elevation; (b) linear correlation with slope gradient; (c) exponential correlation with slope gradient; (d) combination of curvature, position along the hillslope profile and slope gradient; (e) a more complex model that uses the same topographic attributes of the previous one in conjunction with geological and geomorphological factors.

Results confirmed that the factor of safety is very sensitive to soil thickness and that in general a more reliable soil thickness map, combined with infinite slope based models, improves spatial distribution of FS values at basin scale. We verified that in a deterministic approach the uncertainty in the FS calculation can be reduced by applying more precise input data (i.e. soil thickness), and we demonstrated that the same stability model can become more sensitive or more specific depending on the input data.

Despite slope-based methods are the most used in literature to derive soil thickness (De Rose 1996, Salciarini et al. 2006; Godt et al. 2008), in both the tested sites they returned poor results. Moreover, we verified that the use of an exponential (rather than a linear) relationship does not improve the results; on the contrary in the Armea basin, they were worsened. The use of a combination of three morphometric attributes (sGIST model, Catani et al. 2010) depicted more consistent soil thickness maps and consequently improved the performance of the stability model, but a further substantial improvement was obtained when using soil thickness patterns derived from a model that encompasses geomorphological criteria (GIST model, Catani et al. 2010).