1 Introduction

The availability of complete event-based landslide inventories associated to different rainfall preparatory and triggering conditions is uncommon for a single region. The scarcity of landslide inventory maps associated to different rainfall triggering conditions is frequently highlighted in the literature, even when two independent inventory maps are available for the same region, allowing comparison (Mondini et al. 2014).

Determining the temporal occurrence of landslides is assumed as one of the most critical issues which limits “the performance of a quantitative determination of the probability of slope failure or landslide reactivation within a defined time span” (Corominas et al. 2014). Therefore, assessing the temporal occurrence of landslides is recognized as an important step in evaluating and managing landslide hazard and conducting risk analysis (Corominas et al. 2014). Some of the most important landslide triggering mechanisms are earthquakes (e.g., Harp et al. 2011; Vaz and Zêzere 2016; Chen et al. 2020) and rainfall (e.g., Su et al. 2015; Zêzere et al. 2015; Pereira et al. 2017; Vaz et al. 2018; Gariano et al. 2021). The landslide inventory map provides information on the location of landslides and their main characteristics over a territory (Pašek 1975; Guzzetti et al. 2000). Different sources are commonly used for landslide mapping: field surveying (Ardizzone et al. 2007; Zêzere et al. 2017), interpretation of aerial photographs (Guzzetti et al. 2000; Santangelo et al. 2015; Oliveira et al. 2017), Light Detection and Ranging (LIDAR) derivate (Ardizzone et al. 2007; Van Den Eeckhaut et al. 2013), or very-high resolution (VHR) monoscopic and stereoscopic satellite images (Ardizzone et al. 2013; Murillo-García et al. 2015; Stumpf et al. 2017).

Frequently, two strategies are used to represent landslides in a map (Sterlacchini et al. 2011; Oliveira et al. 2015a; Zêzere et al. 2017): (i) a single point per landslide, usually the centroid of the landslide area or the centroid of the landslide depletion zone; (ii) a polygon drawing the boundary of each landslide or each landslide depletion zone. The construction of a landslide inventory is data source dependent and time-consuming. According to the data source, landslide inventory maps have been differentiated between landslide archives, which document the location of past landslides from literature or other historical sources, and geomorphological inventory maps (Guzzetti et al. 2000).

In addition, landslide inventories are subdivided into historical, multi-temporal, seasonal and event-based landslide inventory maps (Guzzetti et al. 2012). The main criteria that define these subclasses of landslide inventories are: (i) the landslides association to a single or to multiple landslide events, (ii) the time through which the cumulative effects of landslide events are registered over a region, and (iii) the landslide age (absolute or relative). Historical landslide inventory maps include landslide records spanning from tens to thousands of years, with the age of landslides often not differentiated or expressed in relative terms (e.g., recent, old or very old) (Guzzetti et al. 2012). Multitemporal landslide inventory maps are derived from landslides that have been triggered by multiple events occurring over extended periods (e.g., years to decades) (Galli et al. 2008). Seasonal landslide inventory maps cover landslides also triggered by multiple events but during a singular season (Fiorucci et al. 2011). Finally, the event-based landslide inventory map reports the landslides associated to a unique natural trigger (Guzzetti et al. 2012). This type of inventory is prepared just after the event to portray the landslide occurrences linked to critical triggering conditions (Ghosh et al. 2012). However, the use of high-resolution/high-quality aerial photographs and satellite images captured shortly after the landslide event, when the evidence of landslides is fresh and easily recognizable (Guzzetti et al. 2012), enables to obtain reliable event-based landslide inventories even a decade or more after the event has occurred (Oliveira 2012; Zieher et al. 2016).

Focusing on rainfall-triggered landslides, it is expected that different landslide types occur associated to different rainfall critical conditions (e.g., Zêzere et al. 1999b, 2015; Zêzere and Trigo 2011; Martelloni et al. 2012). Although it has been widely acknowledged that landslides can be triggered both by short intense rainfall events and long-lasting rainfall events (Guzzetti et al. 2007; Zêzere et al. 2015), most landslide susceptibility models, based on rainfall event-based landslide inventories found in literature, are related to intense/short duration rainfall critical conditions (a few hours to a few days of rainfall). These conditions typically include intense rainstorms/summer storms (Dai and Lee 2003; Mondini et al. 2011, 2014; Persichillo et al. 2017), typhons (B Lee et al. 2008; Bhandary et al. 2013; Mondini and Chang 2014; Chang et al. 2014; Lee and Chung 2017;) or heavy rainfall events during the monsoon season (Ghosh et al. 2012).

The landslide inventory maps are used to evaluate the extent of landslide phenomena in a region, such as their distribution, the dominant type, the deformation patterns, the recurrence and statistical characteristics of slope failures (Guzzetti et al. 2012). Moreover, landslide inventory maps allow for the assessment of the spatial probability of landslide initiation and runout, as well as the probability of potential landslide sizes for a given return period (Corominas et al. 2014). However, only a limited number of studies have evaluated the direct effects of using different landslide inventories to model landslide susceptibility for the same area (Bornaetxea et al. 2023). Furthermore, the direct effects of using event-based landslide inventories, according to landslide type, for susceptibility assessment have only been investigated in a few cases (e.g. Lee et al. 2008; Chang et al. 2014; Knevels et al. 2020).

To the best of our knowledge, there has been little discussion on the critical rainfall conditions and characteristics of rainfall event-based landslide inventories used to assess landslide susceptibility in this context. Therefore, the aim of this work is to investigate potential sources of uncertainty in landslide susceptibility assessments resulting from the application of rainfall-triggered event-based landslide inventories. This study is designed to address three possible sources of uncertainty: (i) How representative is an event-based landslide inventory map of the total landslide activity and distribution in a study area?; (ii) How reliable can be an event-based landslide susceptibility map?; (iii) How appropriate can be an event-based landslide inventory map to independently validate a landslide susceptibility map?

2 Regional slope instability—data and processes description

2.1 Geology and geomorphology of the study area

The Grande da Pipa River (GPR) basin is located in the north of Lisbon region, Portugal (Fig. 1A). The basin covers an area of 110 km2 and the elevation ranges from 5 to 440 m. The lithology mainly consists of sedimentary formations dating from the upper Jurassic (LU4–LU10, Fig. 1B covering 95.7% of the study area), with predominance of limestones, marls, mudstones and sandstones intercalations (Kullberg et al. 2006; Zbyszewski and Assunção 1965). Rocks are deformed by a large anticline whose axis is in the central part of the basin (Ribeiro et al. 1979; Zbyszewski and Assunção 1965). Therefore, the bedding planes dip outwards the basin center, creating cuesta-type landforms. The older rocks (LU9—marls, mudstones and sandstone) outcrop in the center of the basin and have been subjected to intense differential erosion during the Quaternary (Ferreira 1984). Above the older rocks there is an open to southeast horse shoe belt of limestone rock walls which are 10–20 m high and produce the steepest slopes in the basin. However, moderate and high slopes (> 15°) are only present in 12.5% of the study area (Fig. 2A).

Fig. 1
figure 1

Grande da Pipa River (GPR) basin location (A), Lithological units (LU), 1983 and 2010 rainfall event-based and historical landslide inventories (B). Lithological units based uniquely on age criteria: LU1—Alluvium; LU2—Limestone intercalations; LU3—Conglomerates, sandstones, and mudstones; LU4—Sandstones, marls, and limestones; LU5—Limestones and marls; LU6—Mudstones, sandstones, marls, and limestones; LU7—Coralic limestones; LU8—Limestones; LU9—Marls, mudstones, and sandstones; LU10—Conglomerates; LU11—Dykes and magmatic intrusions

Fig. 2
figure 2

Landslide predisposing factors: A slope angle, B slope aspect, C slope curvature, D slope over area ratio (SOAR), E soil type, and F land use. Lithology is presented on Fig. 1B and landslides predisposing factors classes are described in Table 3

2.2 Rainfall patterns and landslide occurrence

Most landslides that occurred in the north of Lisbon region since the late 1950s show a clear climatic signal and are triggered by rainfall (Zêzere et al. 1999a, b). In the study area the mean annual precipitation (MAP) is around 760 mm, with the highest concentration of rainfall observed from October to March (Zêzere et al. 2015). Until 2010, 25 rainfall-triggered landslide events were identified, mainly in rainy years with annual precipitation above the MAP (Zêzere et al. 2015; Zêzere and Trigo 2011). Similarly to other examples worldwide (Van Asch et al. 1999; Gostelow 1991; Iverson 2000), two main rainfall patterns (Fig. 3), associated to different rainfall critical conditions, have been recognized in the study area by using daily rainfall data as a proxy of the triggering mechanisms responsible for shallow and deep-seated landslides occurrence on natural slopes (Ferreira 1984; Zêzere et al. 1999b, 2005, 2015): (i) intense, short-duration rainfall periods (1–15 days); and (ii) long-lasting rainfall periods (30–90 days).

Fig. 3
figure 3

Relationship between regional rainfall-triggering critical conditions and event-based landslide inventory maps

In the present work a landslide event is any date for which at least five individual landslides are known to have occurred on natural slopes. (Zêzere et al. 2015; Zêzere and Trigo 2011). The landslide events associated with the critical rainfall conditions described in (i) primarily occur during autumn, especially in November, as a result of polar and tropical interferences, which predominantly trigger shallow landslides. The landslide events triggered by the rainfall critical conditions described in (ii) are related to abundant and persistent winter rain, sometimes following a well-watered autumn. These conditions are frequently responsible for the triggering of larger and deeper landslides. The last landslides event type is often preceded by precipitation clustering over long temporal windows of 23–90 days (Bevacqua et al. 2021), this is, the antecedent rainfall prior to the landslide event. The long-lasting rainfall sequences, observed in the north of Lisbon region are typically of polar origin either in zonal circulation or in blocked circulation (Ferreira 1984) and are statistically associated with the 3-month negative average values of the North Atlantic Oscillation (NAO) index (Zêzere et al. 2005).

2.2.1 The 1983 intense rainfall-triggered landslide event

On 18th November 1983 (Fig. 4A), was recorded the heaviest rainfall episode of the twentieth century in the study area (Ramos et al. 2014). In the São Julião do Tojal (SJT) rain gauge (Fig. 1A), which is considered representative for the study area (Oliveira et al. 2014), a maximum absolute daily (24-h) rainfall of 164 mm was registered (Zêzere et al. 1999a, b), corresponding to a 194-year return period (RP) (Zêzere and Trigo 2011). The antecedent rainfall was not relevant for this landslide event (Zêzere et al. 1999a). The occurrence of the heavy rainfall episode was caused by an extended deep low-pressure system (990 hPa) with a frontal system extending across the Atlantic, approaching the mainland (Lisbon region) on a SW-NE trajectory (Godinho 1984). In the study area, both on the steeper natural slopes and on the anthropic cuts and fills, the rapid water infiltration triggered several shallow translational slides, due to the development of positive pore water pressure and the decrease in the material’s apparent cohesion (Zêzere et al. 2015). The characteristics of the event provide a straightforward temporal identification of the rainfall event that can be related to the landslide occurrences.

Fig. 4
figure 4

Accumulated rainfall for different durations observed in SJT (Fig. 1A) related to the 1983 (A) and 2010 (B) rainfall-triggered landslide events. The dashed red line indicates the dates of the landslide events according to (Zêzere and Trigo 2011)

2.2.2 The 2010 long lasting rainfall-triggered landslide event

In the 2009–2010 climatological year (September to August) two rainfall-triggered landslide events (Figs. 3, 4B) were documented in the north of Lisbon region (Zêzere and Trigo 2011). On 14th January, the first landslide event was recorded, which was linked to a critical rainfall of 332 mm/30 days (6-year return period). The second landslide event occurred almost 2 months later on 9th March and was associated with a critical rainfall of 632 mm/90 days (8-year return period) (Zêzere et al. 2015). During the winter period of December 2009 to March 2010, the NAO index reached its most extreme negative value since 1823. This was particularly evident from 10th December 2009 to 14th January 2010 and from 24th January 2010 to 31st March 2010 (Vicente-Serrano et al. 2011), which coincided with the dates of the landslide events. For instance, during the 4-month period, the Lisbon rain gauge (located 30 km south of the study area) recorded 958.6 mm of rainfall, which is the highest amount of rainfall ever recorded from 1865 to 2010 (Vicente-Serrano et al. 2011). The landslides triggered by this rainfall were mainly deep-seated rotational and translational slides, as reported by Oliveira (2012), Oliveira et al. (2015b), and Zêzere et al. (2015).

In the study area, the landslides were recorded from end-November to end-April, following a progressive temporal and spatial pattern in response to the daily rainfall distribution, the antecedent rainfall accumulation and the geologic/geomorphologic context (Oliveira 2012). These factors were responsible for the regional rise of the water table. In the study area landslides seem to occur during the phase of water table rise and during its high-tand (the upper soil, which is mostly unsaturated, becomes progressively saturated causing the cancellation of the negative pore pressures in a thicker portion of the soil, which contributes to destabilise the shallowest part of the slopes), and reduce their frequency when water table despite high started to lowering. Negative pore pressures are again generated in the unsaturated zone, which are favourable conditions to stability of the upper layers of the soil (Jesus et al. 2018).

According to the literature, this leads to a seasonal landslide inventory map that encompasses landslides triggered by multiple events within a singular season (Fiorucci et al. 2011). Even so, these landslides were primarily triggered by long-lasting rainfall periods (30–90 days), which clearly distinguishes them from those triggered during the 1983 rainfall event. Making an analogy with the slope progressive failure concept (Palmer and Rice 1973), we aggregated all the 2010 seasonal landslides in a single “landslide event” for modeling proposes. This assumption takes into account the progressive temporal and spatial instability that occurred in the study area during the aforementioned 5-month period of the defined 2010 landslide event, as a consequence of the progressive regional water table rise (Jesus et al. 2014, 2018). Spatially, the instability started in the SW sector of the GPR basin, mostly as shallow landslides affecting LU8 and LU7. Subsequently, in March, reached the older rocks (LU9) rich in clay minerals (almost no smectite and the expansive minerals present are chlorite and gypsum, Alonso et al. 2010) which are located in the center of the basin and lately, in the rainy period, triggered the most deep-seated landslides as a consequence of the regional groundwater table rise to position close to the topographic surface (Jesus et al. 2018; Oliveira 2012).

2.3 Historical and event-based landslide inventories

The 1983 and 2010 landslide inventory maps used in this work were based on different landslide mapping approaches that combined different techniques: detailed field mapping; high resolution aerial photo and orthophotomap interpretation; and analysis of shadow relief models. The landslide type was defined following the classification of Cruden and Varnes (1996) and the separation between shallow and deep landslides was made considering a rupture surface depth below or above 3 m, respectively. In addition, to eliminate the potential bias in landslide susceptibility modeling due to the reduced number of debris flows and mudflows in the study area, the present work only considers landslides of the slide type that occurred on natural slopes.

2.3.1 Historical landslide inventory

The historical landslide inventory map (Fig. 1B) was based on a systematic field surveying carried out between 2006 and 2009. Landslides were mapped over orthophotomaps at the 1:2000 scale, using the drainage network and the contour lines with 5 m equidistance as ancillary information. The total number of landslides, as well as their age (absolute or relative), geometry and geolocation in the final inventory map were validated and updated by interpretation of aerial photographs and orthophotomaps, dated from 1989 to 2007, and by the identification of landslide morphometric features in a shadow relief model derived from a 1: 10,000 topographic maps. This historical landslide inventory map includes 640 landslides of the slide type occurred on natural slopes (332 deep-seated rotational slides; 210 shallow rotational slides, and 98 shallow translational slides). The landslide area (La) ranges from 12.4 to 262,194 m2 (mean landslide area around 9000 m2) and the total unstable area is 5,735,400 m2, which corresponds to 5.2% of the GPR basin area.

2.3.2 1983 and 2010 event-based landslide inventories

Two rainfall event-based landslide inventory maps are available for the GPR basin, corresponding to the events described in Sects. 2.2.1 and 2.2.2. Both event-based landslide inventory maps were created by mapping fresh landslide features, considering the time of information acquisition.

The 1983 event-based landslide inventory map was built based on photointerpretation of black and white vertical aerial-photographs in a 1:15,000 scale, taken less than a month after the landslide event (Fig. 1B, Table 1). All landslides are temporally referenced to the 18th of November 1983. The aerial photographs georeferencing process and landslide mapping procedures were described in detail in Oliveira (2012). Visual interpretation of aerial photographs proved to be an effective technique for the detection and mapping of recent landslides, including slope failures that did not leave distinct and immediately recognizable morphological or land cover signatures (Fiorucci et al. 2011). Nowadays, recognizing the evidence of landslides that occurred over 30 years ago is virtually impossible to perform through field-based geomorphological recognition. In the study area, the landslide features tend to disappear in 10–20 years, even faster if there is an intensive agricultural activity (e.g., vineyards or cereals) or the development of activities affecting the landscape (e.g., urbanization) (Oliveira et al. 2015b). In the 1983 landslide event (Table 1), 173 slides were mapped over an area of 130,750 m2 (0.12% of the study area). Shallow rotational slides were the dominant type (98 cases), affecting 66,411 m2 (50.8% of the total unstable area in this event). Deep-seated rotational slides and shallow translational slides presented a similar number of cases (37 and 38, respectively), but the unstable area differed significantly. Deep-seated rotational slides were associated to 42.1% of the total unstable area during the event, while the area affected by shallow translational slides did not surpass 7%.

Table 1 Number and affected area of slides included in the 1983 and 2010 rainfall event-based landslide inventories and the historical landslide inventory in the GPR basin

The inventory map of the 2010 landslide event was based on systematic field surveying performed each week from November 2009 to the end of April 2010 (Fig. 1B, Table 1). All landslides were mapped over orthophotomaps at the 1:2000 scale with contour lines (equidistance 5 m) and drainage network as ancillary information. For this event, few landslides of the slide type were mapped on natural slopes (162 cases) but the total unstable area (461,309 m2, 0.42% of the study area) surpasses 3.5 times the unstable area associated to the 1983 landslide event. This can be explained by the occurrence of 55 deep-seated rotational slides (34% of the total number of landslides in the event) that represent 72% of the total unstable area (331,988 m2). As in 1983, the shallow rotational slides were the most representative landslide type with 74 cases, representing 45.7% of the total number of landslides and 24.8% of the unstable area of the event. Shallow translational slides were the less represented landslide type with 20.4% (33 cases) of the total number of landslides and 3.3% of the total unstable area.

Despite the contrasting rainfall triggering conditions, both landslide events presented a similar landslide density (1983 = 1.6 landslides/km2; 2010 = 1.5 landslides/km2).

3 Methods

3.1 Analysis of landslides morphometric characteristics

The morphometric and typological attributes of landslides are typically heavily influenced by the geological and geomorphological conditions of the areas where slope failure occurs (Carrara et al. 1977). The morphometric characteristics of landslides were analyzed in each landslide event-based inventory were analyzed using 10 landslide parameters and 7 indices (Table 2). These characteristics relate to the morphometry (size and shape) of landslides and their spatial position along the slope, which is considered a proxy for landslide mobility (Garcia 2002; Zêzere 1997). Landslide parameters and indices were analyzed according to the landslide type, which includes: (i) simple descriptive statistical parameters as the mean, maximum and minimum values and standard deviation (SD); and (ii) the analysis of variance (One-way ANOVA). The morphometric parameters were measured using horizontal planimetric equivalents of real distances, considering the predominance of gentle to moderate slopes in the study area (less than 15° in 87.5% of the study area). The F-test (analysis of variance) and the level of statistical significance were computed for each landslide parameter and indices. The level of statistical significance was obtained after logarithmic normalization of data.

Table 2 Landslide morphometry, spatial position, mobility parameters and indices (adapted from Zêzere 1997; Garcia 2002)

3.2 Landslide probability-density relationship

The relationship between the frequency of landslides and the landslide area (LA) was evaluated using the probability-density function (pdf) described in Eq. (1) (Van Den Eeckhaut et al. 2007; Malamud et al. 2004).

$$pdf({L}_{A})=\frac{1}{{L}_{N}}\times \frac{\delta {L}_{N}}{\delta {L}_{A}}$$
(1)

where LA is the landslide area; LN is the total number of landslides; and δLN is the number of landslides as a function of a given class amplitude δLA, for which a class amplitude increments of 1.5 has been adopted.

3.3 Landslide susceptibility assessment and validation

A dataset of seven variables was selected as independent landslide predisposing factors (Figs. 1, 2 and Table 3): lithology, slope angle, slope aspect, slope curvature, slope over area ratio (SOAR), soil type and land use. These variables had successfully explained the relationship between landslides and the most favorable terrain conditions for landslide occurrence in the study area through the application of data-driven methods (Oliveira et al. 2015b, 2017; Zêzere et al. 2017).

Table 3 Information Value (IV) scores according to the dataset of landslide predisposing factors, landslide types and rainfall event-based landslide inventories

The landslide susceptibility was assessed using the Information Value (IV) method (Yin and Yan 1988). The method allows to rank the susceptibility of each terrain mapped unit (pixel 5 × 5 m) of the study area. The IV susceptibility score obtained for each variable class, assumed as landslide predisposing factor, is based on the logarithmic normalization of the ratio between the conditional probability to find a landslide in a certain class of a predisposing factor and the a priori probability to find a landslide in the study area. The final susceptibility score for each terrain mapped unit is the sum of each partial IV score calculated for each class of each landslide predisposing factor. IV scores equal to zero indicate that the relationship between landslide occurrence and the independent variable used to discriminate stable and unstable areas is unclear. Negative IV scores indicate a favorable condition for slope stability and positive IV scores indicate a favorable condition for slope instability in the terrain unit, which will be higher as the IV score increases (Zêzere et al. 2017).

In order to evaluate the reliability of using an event-based landslide inventory map to assess landslide susceptibility at the basin scale, two modeling groups were created. Modeling Group I include eight landslide susceptibility models that compare landslide susceptibility based on the landslide type associated with the rainfall-triggered landslide events of 1983 and 2010 (Table 4). Models 1a and 1b use the complete event-based landslide inventory map, encompassing all considered landslide types; models 2a–2b, 3a–3b, and 4a–4b use, respectively, the event-based deep-seated rotational slides, shallow rotational slides and shallow translational slides (Table 4). Modeling Group II aims to identify the similarities/differences between the historical landslide susceptibility model and the event-based landslide susceptibility models. The Modeling Group II includes four landslide susceptibility models (models 5a–5d) computed to assess the predictive capacity of the historical landslide susceptibility map when validated with independent event-based landslide inventories (Table 4) and vice versa.

Table 4 Landslide susceptibility modelling strategy: training and validation groups

The landslide susceptibility model’s adjustment and independent predictive capability were assessed by computing the respective Receiver Operating Characteristic curves (ROC) and by calculating the Area Under the Curve of the Receiver Operating Characteristic curve (AUC-ROC) values according to the training and validation scheme defined in Table 4. The AUC-ROC is a commonly used measure of the discriminative capacity of a classification model and uses both the landslide and non-landslide terrain units to evaluate model performance. A model is considered to discriminate better than chance if the AUC is greater than 0.5, where a value of 1 is associated with a model with perfect discrimination (Jiménez-Valverde 2012; Zêzere et al. 2017).

3.4 Evaluation of the spatial agreement between landslide susceptibility maps

The study assessed the spatial agreement between landslide susceptibility maps of Modeling Group I. Four pairs of landslide susceptibility maps were grouped by landslide type for both event-based landslide inventory maps (e.g., model 1a X model 1b, Table 4). Modeling Group II produced only one historical landslide susceptibility map and no spatial comparison was made. Firstly, the nine landslide susceptibility maps listed in Table 4 (under the Training Group column) were classified into 10 susceptibility classes using the quantile classification method. This classification ensures that each susceptibility class in all maps contains the same 10% of grid cells within the study area, allowing for a relative comparison (Oliveira et al. 2017). The classification procedure was solely used to assess the spatial agreement between the pair of maps. This was done using the Map Comparison Kit tool (Visser and Nijs 2006) on a cell-by-cell basis. The resulting contingency tables, Kappa statistics (Kappa Index, Kappa histogram, and Kappa location) and the fraction of terrain units equally classified in each pair of landslide susceptibility maps were used to evaluate susceptibility classification dispersion and spatial agreement among pairs of equivalent landslide susceptibility maps.

4 Results

4.1 Statistical properties of the 1983 and 2010 landslide events inventories

The main differences between the 1983 and the 2010 landslide events, regarding the number and type of landslides were described in Sect. 2.3.2. Despite the similar number of landslide cases and the predominance of shallow rotational slides in both landslide events, a 12.6% increase in the number of the deep-seated rotational slides was verified in the 2010 event, contrasting with a 10.9% increment in the number of shallow rotational slides in the 1983 event (Fig. 5A). These results are consistent with the predominant landslide types expected to occur in the region in response to critical conditions of prolonged or short-lived intense rainfall (Zêzere et al. 2015). They are further supported by the statistical analysis of the landslide morphometric attributes for the 1983 and 2010 landslide events (Table 5). Although we are aware that for some landslide types, the number of landslides is too low, it should be noted that for many morphometric attributes and indices there are no significant differences between landslides included in the two event-based inventories, as it is showed by the F-test values and the statistical significance levels below 95% (ns—not significant) (Table 5). Shallow translational slides have virtually equivalent morphometric characteristics in both event-based landslide inventories.

Fig. 5
figure 5

Mean statistical morphometric attributes considering landslide types within the 1983 and 2010 rainfall event-based landslide inventories: A relative landslide number; B mean landslide area (LA); C landslide maximum width (LW) and landslide maximum length (LL); and D maximum elevation in the landslide main scarp top (LmaxE, mean values) and minimum elevation in the landslide tip (LminE, mean values)

Table 5 Statistical parameters and analysis of variance (one way ANOVA) of landslide morphometric attributes and indexes considering landslide types within the 1983 and 2010 rainfall event-based landslide inventories. F ‐ variance F-tests results and corresponding significance levels; p < 0,01—significance level higher than 99%; p < 0,05—significance level higher than 95%; n.s.—not statistically significant

The main differences between landslide inventories are related to the specific landslide size parameters (LL, LW and LA) and the maximum and minimum elevations of landslide types DRS and SRS (Fig. 5B–D), where the variance reaches significance levels of 0.01 (99%). The deep-seated and shallow rotational slides triggered during the 2010 landslide event present a larger width and length, which was reflected in a wider unstable area. In the 2010 landslide event, larger DRS occurred mostly at lower elevations compared to the corresponding DRS occurred during the 1983 landslide event, which are typically smaller in size.

4.2 Landslide probability-density relationship

The landslide distribution as a result of a probability-density function (Fig. 6A) can be used to better characterize the landslide hazard in the study area, by extracting the probability of landslide occurrence given a certain magnitude scenario (landslide size) (Fig. 6B). Based on the historical landslide inventory, 70% of the new slides that are expected to occur in the GPR basin will have an area under 6000–7000 m2 (LA½ ± 77.5 m to 83.7 m, respectively) (Fig. 6B). In an undetermined future time span, 20% of the larger slides will have an area above 10,000 m2, i.e., LA½ ± 100 m. The probability of landslide occurrence tends to decrease significantly for slides larger than 10,000–20,000 m2 and landslides larger than 100,000 m2 (LA½ ± 316.2 m) have a very low probability of occurrence (less than 1%).

Fig. 6
figure 6

Landslide probability density distribution on landslide area (LA): A and probability of landslide magnitude (LA); B for 1983 and 2010 rainfall event-based landslide inventories and for the historical landslide inventory. Ev-B = Event-Based

In addition, landslides smaller than 100 m2 have a slightly higher probability of occurrence in the historical inventory when compared to the 1983 and 2010 event-based landslide inventories. Moreover, it is noticed the progressive detachment to the right side of the historical landslide inventory probability distribution curve in Fig. 6B. This is evident for landslide magnitudes greater than LA > 100–200 m2, indicating the potential for the occurrence of landslide events more severe than those registered in 1983 and 2010. The huge differences in the dependence relationship between landslide frequency and landslide area, when event-based landslide inventory maps are compared (assuming the completeness of the event-based landslide maps) are related to the short-intense or long-lasting rainfall critical conditions. Landslides from the 2010 landslide event increase in size and frequency in comparison with the 1983 landslide event.

4.3 Event-based landslide susceptibility models

The results of the eight landslide susceptibility models performed for each landslide type and for each landslide event (1983 and 2010) (cf. Sect. 3.2, models 1a–4b, Table 4) are shown in Fig. 7.

Fig. 7
figure 7

Landslide susceptibility maps modelled with the 1983 and 2010 rainfall event-based landslide inventories considering different landslide types: A 1983 event-based landslide inventory (Model 1a, 1983 Ev-B); B 2010 event-based landslide inventory (Model 1b, 2010 Ev-B); C 1983 deep rotational slides (Model 2a, 1983 DRS); D 2010 deep rotational slides (Model 2b, 2010 DRS); E 1983 shallow rotational slides (Model 3a, 1983 SRS); F 2010 shallow rotational slides (Model 3b, 2010 SRS); G 1983 shallow translational slides (Model 4a, 1983 STS); and H 2010 shallow translational slides (Model 4b, 2010 STS)

Different spatial susceptibility patterns are recognized for rotational slides (Fig. 7C–E), reflecting the spatial distribution of shallow and deep-seated rotational slides within the 1983 and 2010 landslide events. Concerning the 2010 event-based inventory maps (Fig. 7D and F), the most susceptible areas for shallow and deep-seated rotational slides occurrence are mainly concentrated where LU 9 (marls, mudstones and sandstones) outcrops, i.e., along the central and eastern part of the study area. On contrary, areas classified as more susceptible to shallow and deep rotational slides in models built with the 1983 event-based landslide inventory are more scattered in the LU9 and are concentrated in the southwestern sector of the study area where the LU5 outcrops (limestones and marls).

Regarding the shallow translational slides, the landslide susceptibility patterns are more similar (Fig. 7G and H), which is in line with the morphometric characteristics found for this type of landslides within the 1983 and 2010 landslide inventories (cf. Sect. 4.1). The most susceptible areas correspond to LU7, LU6 and LU5, which are heterogeneous in nature (limestone, marls, mudstones, sandstones and conglomerates). These lithological units comprise steep slopes and are less used for agriculture. In these locations, landslide boundaries are preserved for a longer time compared to shallow slides that occurred in agricultural areas like those occupied by LU9.

The different spatial susceptibility pattern associated to the 1983 and 2010 landslide events is also evident in models built with the overall number of landslides (Fig. 7A and B), which reflects the dominance of rotational slides, in number and size, in both event-based inventories.

Table 3 summarizes the predisposing factors classes and the Information Value scores obtained for the landslide susceptibility models built with different landslide types within the 1983 and 2010 event-based landslide inventories. The most favorable terrain conditions for the occurrence of deep-seated and shallow rotational slides are different in the study area, depending on the hydrological conditions associated to each landslide event. The most favorable conditions for the 1983 landslide event were the following: N and NE facing slopes; slopes ranging from 15° to 45°; SOAR higher than 0.1; the presence of LU5 (limestone and marls) and LU11 (dykes and magmatic intrusions); the presence of rock outcrops or colluvium as soil types; and the presence of cultivated areas, forest or shrubs and herbaceous vegetation. For the 2010 landslide event, the most favorable conditions were the following: N, NE and E facing slopes; slopes ranging from 10° to 20°; SOAR ranging from 0.0001 to 0.01; the presence of LU7 (coralline limestone), LU9 (marls, mudstones and sandstones) and LU11 (dykes and magmatic intrusions); the presence of cambissols and vertissols; the presence of vineyards or shrubs and herbaceous vegetation. Considering the shallow translational slides, prone areas are similar for both susceptibility models built with the 1983 and 2010 landslide events: slopes higher than 20° (which corresponds to areas with less agricultural activities); areas with little water accumulation (i.e., with high values of SOAR); the presence of forest or shrubs and herbaceous vegetation; the presence of rock outcrops and colluvium deposits.

Table 6 summarizes the spatial agreement registered in each pair of landslide susceptibility maps, considering the landslide type. The results highlight the low spatial agreement between the 1983 and the 2010 event-based landslide susceptibility maps, even when comparing each landslide type individually. The spatial disagreement between models is also clear in Table 7, which summarizes the Kappa statistic values resulting from the comparison of classified landslide susceptibility maps built with the 1983 and 2010 event-based landslide inventories. The spatial agreement is higher for shallow translational slides susceptibility maps (Kappa index = 0.127; fraction equally classified = 21%), potentially confirming the significant similarity in landslide predisposing conditions belonging to this category in both event-based landslide inventories.

Table 6 Contingence tables extracted from the overlay between the 1983 and 2010 rainfall event-based landslide susceptibility maps considering the landslide types
Table 7 Kappa values resulting from comparison of classified landslide susceptibility maps built with 1983 and 2010 event-based landslide inventories

The validation of the eight landslide susceptibility models (Fig. 7A–H) is assessed using the ROC curves and the AUC-ROC values, summarized in Fig. 8A (for models based on the 1983 rainfall event-based landslide inventory) and Fig. 8B (for models based on the 2010 rainfall event-based landslide inventory). In both cases, there is a good adjustment of the models when the landslide susceptibility map is cross tabulated with the same landslides used to construct the model (filled lines in Fig. 8A and B). The AUC-ROC values of the model adjustment built with the 1983 (Fig. 8A) and the 2010 (Fig. 8B) event-based landslide inventory ranges between 0.884–0.922 and 0.811–0.959, respectively, showing a high goodness of fit of landslide susceptibility models.

Fig. 8
figure 8

ROC curves and AUC-ROC values of landslide susceptibility models based on different rainfall event-based landslide inventories: A Models built with the 1983 rainfall event-based landslide inventory; B Models built with the 2010 rainfall event-based landslide inventory. Filled lines represent the model adjustment and dashed lines the model independent validation. Ev-B, Event-Based; DRS, Deep-seated rotational slides; SRS, Shallow rotational slides; STS, Shallow translational slides

On the contrary, the capacity of susceptibility models to correctly predict landslides pertaining to a landslide inventory different from the one used to build the model is typically modest, as it is attested by AUC-ROC of dashed curves in Fig. 8A and B. These AUC-ROC values for the total set of susceptibility models built with rotational slides (models 2a, 2b, 3a, 3b), as well as for susceptibility models built with the total landslide inventories (models 1a and 1b) do not surpass 0.635. The exception is found in susceptibility models built with shallow translational slides, since the model based on landslides occurred in 1983 (model 4a) is highly effective in predicting shallow translational slides that occurred in 2010 (AUC-ROC = 0.920). However, the capacity of the model based on the 2010 event (model 4b) to predict the location of shallow translational slides occurred in 1983 is significantly lower (AUC-ROC = 0.713). The results indicate that relying solely on event-based landslide inventories to model landslide susceptibility is inadequate. This is due in part to the fact that landslides of the same type can be triggered by different critical rainfall conditions, which are in turn associated with different terrain predisposing conditions.

4.4 Landslide susceptibility model based on the historical landslide inventory

The next step was to evaluate to what extent a landslide susceptibility model based on a historical landslide inventory can be validated using these event-based landslide inventory maps. A new landslide susceptibility model was constructed using the historical landslide inventory for training purposes (models 5a and 5b). The landslide susceptibility map based on the historical inventory is shown in Fig. 9. Additionally, the susceptibility model based on the historical landslide inventory map was independently validated with the event-based landslide inventories not used for training, i.e., the inventory 1983 Ev-B (model 5a) and the inventory 2010 Ev-B (model 5b). The inverse modeling approach was also performed, which means using each of the event-based landslide inventories for training and the historical landslide inventory for validation (models 5c and 5d). Figure 10 presents the ROC curves and AUC-ROC values for these models. The historical landslide susceptibility model is not effective in predicting the 1983 landslide event (AUC-ROC = 0.57), but it performs well in predicting the 2010 landslide event (AUC-ROC = 0.73).

Fig. 9
figure 9

Landslide susceptibility map based on the historical landslide inventory

Fig. 10
figure 10

ROC curves and AUCROC values of landslide susceptibility models based on the historical landslide inventory and rainfall event-based landslide inventories. Filled lines represent the model adjustment and dashed lines the model independent validation. Ev-B = Event-Based

These results are consistent with the AUC-ROC obtained for the inverse models, which show the higher capacity of susceptibility model built with the 2010 landslide event to correctly predict the spatial location of landslides included in the historical inventory (AUC-ROC = 0.72 against 0.59 for model built with the 1983 landslide event). Moreover, the AUC-ROC values obtained for models 5b and 5d are not far from the goodness of fit of the landslide susceptibility model based on the historical inventory (red curve in Fig. 10, AUC = 0.79). Therefore, the 2010 event-based landslide inventory proved to be the most similar to the historical landslide inventory.

5 Discussion: advantages and limitations

5.1 How representative is an event-based landslide inventory map of the total landslide activity and distribution in a study area?

Two main differences were observed in the geometry and spatial distribution of landslides within the GPR basin in relation to the event-based landslide inventory maps: (i) despite a similar number of landslides in both rainfall-triggering events, the total unstable area registered during the 2010 event was nearly four times higher than that in 1983 event. We also acknowledge that the use of a seasonal inventory for 2010 introduces some bias in this respect.

The hydrological settings for the occurrence of deep-seated slow-moving landslides are normally related to pore water pressure accretion, requiring cumulative rainfall events to trigger a slope failure (Sidle and Bogaard 2016). In the study area, and in relation to the 2010 landslide event, evidences were found linking the first occurrence or successive reactivations of deep-seated and larger rotational slides (DRS) in LU9 (Fig. 1) to the regional rise of the water table to positions closer to or above the topographic surface (Jesus et al. 2018). In addition, significant suction changes resulting from drying after heavy rainfall periods are one of the main causes of rock physical disaggregation in evolutive materials such as LU9 marls. This process promotes the degradation of the marls mechanical properties, such as strength, cohesion and stiffness, leading to intensified water infiltration, weathering and a reduction of the sliding shear strength (Alonso et al. 2010). In the study area, the thickness of the weathered zone ranges from 0.5 to 5 m, with the materials in situ being saturated independently of the depth and degree of weathering (Alonso et al. 2010). These values agree with the deeper positions of the rupture surfaces of the deep-seated and larger rotational slides.

For the 2010 landslide event, a good regional adjustment was also found between shallow landslide occurrence and fluctuations of water table levels in LU9. This relation was found to be less sensitive in LU5, LU6 and LU7 (Jesus et al. 2018). Finally, Jesus et al. (2018) pointed out, as a possible cause, the stratigraphic sequence which led to differences in vertical permeability and to the development of perched water tables and/or confinement of local aquifers beds, being both not captured by the regional hydrological model used to relate water table position over those lithological units.

Rotational slides registered during the 1983 landslide event are less in number and small in area and occurred at higher slopes (from 15° to 45°). The hydrological mechanism that triggered these landslides was certainly associated with the sudden infiltration of water into the soil, thus developing a wetting front. The minimal influence of water table fluctuations on slope instability agrees with the low cumulative antecedent rainfall registered prior to the event day. Therefore, this event depended almost exclusively on the 24-h rainfall, whose distribution was not uniform along the study area (Fig. 11). The rainfall amount was higher in the west part of the RGP basin, which partially explains the observed concentration of landslides on that part of the basin.

Fig. 11
figure 11

Spatial interpolation (cokriging) of rainfall records during the 18 November 1983 landslide event in the Lisbon region

In the GPR basin there are significant differences regarding the dependence relationship between landslide frequency and landslide area, whether the event-based landslide inventory maps related to short-intense (1983) or long-lasting rainfall critical conditions (2010) are compared. Landslides increase in size and frequency when triggered by long-lasting rainfall critical conditions. We acknowledge that having only one inventory per precipitation pattern may not be representative of all critical precipitation conditions that have triggered landslide events in the past. However, these event-based landslide inventory maps are related to the some of the most rainfall critical conditions occurred in the study area. In the 1983 event are related to the heaviest rainfall event over the twentieth century (Ramos et al. 2014) and in the 2010 seasonal “landslide event” to the maximum rainfall value recorded from 1865 to 2010 (Vicente-Serrano et al. 2011). Therefore, we assume that they are to some extent representative of the contrasting critical rainfall conditions that generate landslides in the study area.

The analysis of the landslide probability-density distribution showed that landslides smaller than 100 m2 have a slightly higher probability of occurrence when associated to the historical inventory compared to their association with the 1983 and 2010 event-based landslide inventories. This result is unexpected given our prior knowledge that in the north of Lisbon region, small landslides are typically lost soon after their occurrence due to erosion or human intervention, especially in areas dominated by weak rocks or exposed to more intensive agricultural practices (Oliveira et al. 2017). A possible explanation for this result is related to the timing of the detailed geomorphological fieldwork, which is the basis of the historical landslide inventory map for the GPR basin. Landslide mapping started mid-2006 and in this year three new rainfall-triggered landslide events were recorded in the Lisbon region (20 March, 25–27 October, and 28 November): the first two related to critical rainfall sequences of 4–10 days (i.e., short-duration intense rainfall conditions), and the latter to a long-lasting rainfall 40-day period (Zêzere et al. 2008). These events regionally generated a large number of shallow slides that were mapped during the fieldwork and included in the historical landslide inventory map of the study area, thus increasing the number of landslides smaller than 100 m2. We did not use this information to investigate the short duration events that occurred in the period from March to November 2006 because, at that time, it was not possible to associate all the landslides to a specific rainfall-triggered landslide event.

The landslide probability-density distribution also showed the potential for occurrence of landslide events triggered by rainfall critical conditions more severe than the ones associated to the 1983 and 2010 landslide event at the GPR basin, which is confirmed by the progressive detachment to the right side of the historical landslide inventory probability distribution curve in Fig. 6B, namely for landslide magnitudes greater than LA > 1000–2000 m2. This could be explained if we consider that the geomorphological historical landslide inventories may be the result of the cumulative effects of many landslide events over a period of tens, hundreds, or thousands of years (Guzzetti et al. 2012) and by the above mentioned fact that smaller landslides are typically lost shortly after their occurrence, which could introduce some possible bias related to the weight of the deep-seated landslides in the historical landslide inventory.

The larger landslides included in the historical landslide inventory were identified just below the contact between LU9 (older clay and marl complex gentle slopes) with the upper slope LU7 (compact coralline limestone rock walls). A particular hydrological context favors the development of large landslides in such an upslope position and it may be related to the development of a perched water table in the limestone (UL7) after heavy rainfall over the impermeable clay and marls lithological unit (Coelho 1979). This promotes the retention of water, which is later absorbed by less permeable formations (Ferreira and Zêzere 1997).

Lastly, we cannot neglect the possibility that larger and older landslides than those occurred during the 1983 and 2010 landslide events may have been triggered by earthquakes rather than rainfall. Nevertheless, neither during fieldwork nor in previous works (Vaz and Zêzere 2016) any earthquake-triggered landslide was reported for this study area. In addition, in the north of Lisbon region, the larger complex deep-seated landslide documented as rainfall-triggered (around 180,000 m2) occurred in 1979 in Calhandriz, near the study area, and in a similar geomorphological context (Ferreira et al. 1987; Zêzere et al. 1999a). The size of this landslide is in line with larger landslides registered in the GPR basin historical landslide inventory. Only two deep-seated rotational slides mapped in the GPR basin surpass that value, but not exceeding 262,194 m2 and not corresponding to the unique landslide cases with the main landslide scarp at higher slope positions closer to the top. Therefore, we admit that in GPR basin landslides included in the historical landslide inventory map were triggered by rainfall.

5.2 How reliable can be an event-based landslide susceptibility map?

Independently of the landslide type used for training and validation, the landslide susceptibility models built with the event-based inventories showed to be ineffective in validating other event-based landslide inventories, as well as the historical landslide inventory. This was found to be true for the specific geological/geomorphological context of the GPR basin, where contrasting rainfall critical conditions and hydrological mechanisms can trigger the same type of landslides but varying in size and spatially constrained by different landslide predisposing conditions, particularly lithology and soil type.

Although we did not remove the triggering effect that may control the spatial distribution of landslides, the results for our study area do not support the idea that a common pattern of landslide susceptibility exists for a certain region, regardless of which landslide event is used to build the susceptibility model (Lee and Chung 2017). This is contrary to what is known for other study areas and landslide types (e.g., Lombardo et al. 2018; Knevels et al. 2020). Although we recognize that the spatial distribution of inventoried landslides may be strongly influenced by the spatial distribution of the rainfall event, which is particularly relevant for spatial analyses of event-based inventories, the small size of the study area results in less relevant differences in the spatial patterns of regional rainfall trigger events, particularly those associated with long-lasting rainfall periods. The studied landslide events were triggered by contrasting rainfall critical conditions, which regulate different hydrological mechanisms for landslide initiation and generates distinct associations with landslide predisposing factors. The later was inferred by the contrasting information values of slope and lithological unit for both event-based susceptibility models (Table 3), revealing, as before (e.g., Zêzere et al. 2015), the strong influence of regional geologic and geomorphologic conditions on the slope instability system.

The use of a single event-based landslide inventory to assess landslide susceptibility has been previously recognized as a potential source of bias in landslide susceptibility modeling (Bhandary et al. 2013). This bias may be encompassed by considering a multitemporal approach, which captures more information about the geoenvironmental predisposing conditions that lead to slope instability in a certain study area over time (Bhandary et al. 2013; Mondini and Chang 2014; Persichillo et al. 2017; Reichenbach et al. 2018).

5.3 How appropriate can be an event-based landslide inventory map to independently validate a landslide susceptibility map?

The predictive results of the landslide susceptibility model, based on the historical landslide inventory database (models 5a and 5b) and validated with independent event-based landslide inventories, differs substantially in the study area. The model with the worst prediction is 5a, which was validated with the 1983 event-based inventory. This poor result points out the exceptionality of the spatial distribution of the landslides triggered in 1983, which is not captured by the statistical relationship between the set of landslide predisposing conditions and the landslides historically registered. Furthermore, the results reveal two complementary sources of uncertainty: the first is related to the use of incomplete historical landslide inventories to assess landslide susceptibility; and the second is associated with the misleading predictive capacity of landslide susceptibility models based on historical landslide inventories when validated with in”dependent” event-based landslide inventory maps (e.g. model 5b validated with the 2010 event-based landslide inventory map). Indeed, the 2010 event-based inventory is relatively similar to the historical inventory, which may generate optimistic results.

Regarding these possible causes of uncertainty, limitations in the landslide inventory have been recognized as a major source of error in landslide susceptibility maps (Fell et al. 2008; Stanley and Kirschbaum 2018; Steger et al. 2017). This is due to the challenge of ensuring the complete identification of historical landslide deposits, especially older ones, which are frequently buried or partially dismantled by subsequent landslide events, partially or totally covered by forest, and partially or totally removed by erosion or human activities such as agriculture and forest practices (Corominas et al. 2014; Guzzetti et al. 2012). This potential source of error is further transferred to data-driven landslide susceptibility predictions, which are completely dependent on the statistical relationships established between the set of geoenvironmental factors assumed as predisposing conditions to slope instability and the available landslide inventory. Although recognized that any event-based landslide inventory map may be suitable to evaluate the predictive capability of landslide susceptibility models, it should be used with caution. The reliability of such evaluations depends on both the pattern, extent and magnitude of the rainfall triggering conditions and the pattern of the geoenvironmental factors that control landslide susceptibility (Reichenbach et al. 2018).

6 Conclusions

In this work, we evaluated and discussed potential sources of uncertainty that may arise from using rainfall-triggered event-based landslide inventories to assess landslide susceptibility in regions where landslides can be triggered by contrasting critical rainfall conditions, including short-duration intense rainfall events or long-lasting rainfall periods. In the study area, the same type of landslides can be triggered by contrasting rainfall critical conditions, but with varying sizes. Landslides increase in size when triggered by long-lasting rainfall critical conditions and are spatially constrained by different landslide predisposing conditions, particularly slope, lithology, and soil type. The study area’s event-based landslide susceptibility maps showed that the landslide susceptibility models were not fully validated with the contrasting event-based landslide inventory maps that were not used for training. This suggests that these landslide susceptibility maps may not be entirely reliable. In addition, the results of the landslide susceptibility models based on the historical landslide inventory, which were validated with independent event-based landslide inventory maps, are significantly different and indicate the presence of additional sources of uncertainty. These may arise from the possible incompleteness of the historical landside inventory and an overestimation of the predictive capacity of the landslide susceptibility model due to a validation using a not entirely independent landslide inventory.

In addition to the importance of having a robust historical landslide inventory map, this work aims to contribute to a better knowledge of regional landslide susceptibility patterns based on contrasting rainfall-trigger conditions, providing valuable information to minimize exposure, which is fundamental for risk assessment; to design regional landslide early warning systems for specific rainfall-triggered landslide events, working as a risk mitigation tool (keeping people away from the dangerous areas in case of expected landslides occurrence); and to improve the response and preparedness of civil protection services/agents (evacuation actions to protect local population and non-stationary assets).