Live fuel moisture content time series in Catalonia since 1998

We present a structured and curated database covering 21 years of LFMC measurements in the Catalan region, along with an associated R package to manage updates and facilitate quality processing and visualisation. The data set provides valuable information to study plant responses to drought and improve fire danger prediction. Dataset access is at https://doi.org/10.5281/zenodo.4675335, and associated metadata are available at https://metadata-afs.nancy.inra.fr/geonetwork/srv/fre/catalog.search#/metadata/583fdbae-3200-4fa7-877c-54df0e6c5542.


Background
Live fuel moisture content (LFMC), the ratio of water mass to the dry mass of living shoots, is a critical parameter related to flammability and wildfire behaviour (Chandler et al. 1983; Chuvieco et al. 2009; Fares et al. 2017; Resco de Dios 2020). In 1994, the Catalan Forest Fire Prevention Service (SPIF), in collaboration with Catalan Forest Rangers, initiated an LFMC monitoring program to provide operational fire danger evaluation with ground information regarding plant water status. Only four sites were monitored during 1994-1996, following Countryman and Dean (1979) and Norum and Miller (1984). To increase the size and representativeness of LFMC samples, in 1997 researchers of the Ecological and Forestry Applications Research Centre (CREAF) were asked to suggest a broader set of sampling areas and species representative of Mediterranean shrub habitats, and to standardize field and laboratory protocols (Piñol and Ogaya 1997). With this information in hand, in 1998 SPIF initiated the systematic monitoring of LFMC.

Table 1 Climate, habitat, geological characteristics, and year of last fire (if any) of the nine localities included in the LFMC monitoring (see Fig. 1). The coordinates of the sampling sites and sampling periods are given in Table 2. MAP: mean annual precipitation (mm·year⁻¹); MSP: mean summer precipitation (mm·year⁻¹) (source: Digital Climatic Atlas of Catalonia 1961-1990)

Site description
Sampling sites are distributed in nine localities within the Mediterranean climate area of Catalonia, five of them between 0 and 300 m a.s.l. and four between 500 and 700 m a.s.l. Across sampling localities, mean annual temperature ranges from 13 to 16°C and mean annual precipitation from 500 to 750 mm (Table 1).
Sampling sites are located in places with less than 30% slope, a southern aspect, tree canopy cover of less than 10%, homogeneous vegetation age (four of them in previously burnt areas), and sufficient abundance of the target species to sample. The representative area ranges from 2 to 7 ha across sampling sites. During the 25 years of LFMC monitoring, some sampling sites have been relocated due to wildfires, fuel treatments, or access difficulties. This explains why some localities include different sampling sites, as shown in Table 2.

Species description
The five sampled species (Arbutus unedo L., Cistus monspeliensis L., Pinus halepensis Mill., Quercus coccifera L., and Salvia rosmarinus (L.) Schleid) are characteristic of Mediterranean shrublands and widely distributed in the Mediterranean basin. Although all five species are well adapted to summer drought, they present different morphological traits to cope with drought intensity and duration. A. unedo and Q. coccifera are evergreen broad-leaved shrubs or small trees that resprout after fire from belowground organs. P. halepensis is an evergreen needle-leaved tree that usually regenerates densely after fire from seeds stored in serotinous cones. Among the five species, A. unedo has the largest leaf size, specific leaf area, and mean diameter of xylem vessels, and the lowest wood density, suggesting a lower tolerance to severe drought (Castro-Díez 1996). Low specific leaf area and mean vessel diameter in Q. coccifera and P. halepensis suggest a higher tolerance to drought in both species. C. monspeliensis and S. rosmarinus regenerate from the seed bank after fire. Despite their high mean vessel diameter, the drought tolerance of these two species relies on their low specific leaf area and marcescent leaf phenology: some leaves fall during severe summer drought and the rest rehydrate after rain.

Vegetation sampling and LFMC estimation
Vegetation sampling and laboratory protocols follow Piñol and Ogaya (1997). LFMC samples are currently collected in the field by Catalan Forest Rangers at 12:00 UT every 2 weeks all year round (Gabriel et al. 2021). Two or three species are sampled in each locality (Table 2). For each species sampled at each site, 20 shoots of 5-mm-diameter live branches, exposed to the sun and belonging to different individuals, are selected, clipped, and placed together in a 5-L hermetic plastic container. Soil and temperature data are also recorded in three localities (Begues, El Bruc, and Camarasa) using time-domain reflectometry (TDR) sensors.
Once at the laboratory, samples are weighed fresh (Fw), oven dried at 100°C for 48 h, and weighed dry (Dw) with a balance (0.1 g precision). Fuel moisture content, as a percentage on a dry mass basis, is then calculated as

$$\mathrm{LFMC} = \frac{F_w - D_w}{D_w} \times 100$$

After weighing the dry samples, leaf and stem fractions are separated to obtain the dry weight of leaves (Lw) and stems (Sw), from which the leaf-to-stem percent ratio (LSR) is obtained:

$$\mathrm{LSR} = \frac{L_w}{S_w} \times 100$$

LSR is measured and stored to inform about the dynamics of fuel load or the level of branch defoliation. The mean and the 5% and 95% quantiles of the LFMC series per plant species and sampling site within the nine localities are shown in Table 2.
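As an illustration of the two laboratory ratios, the following minimal Python sketch computes LFMC and LSR from sample weights. It assumes that LFMC is the water mass (Fw minus Dw) divided by Dw, and that LSR is Lw divided by Sw, both expressed as percentages; the function names and the input checks are hypothetical and are not part of the LFMC R package.

```python
def lfmc_percent(fresh_weight_g: float, dry_weight_g: float) -> float:
    """LFMC as a percentage of dry mass: (Fw - Dw) / Dw * 100."""
    if dry_weight_g <= 0:
        raise ValueError("dry weight must be positive")
    if fresh_weight_g < dry_weight_g:
        raise ValueError("fresh weight cannot be lower than dry weight")
    return (fresh_weight_g - dry_weight_g) / dry_weight_g * 100.0


def leaf_to_stem_percent(leaf_dry_g: float, stem_dry_g: float) -> float:
    """Leaf-to-stem ratio as a percentage (assumed form: Lw / Sw * 100)."""
    if stem_dry_g <= 0:
        raise ValueError("stem dry weight must be positive")
    return leaf_dry_g / stem_dry_g * 100.0


if __name__ == "__main__":
    # Made-up weights in grams, for illustration only.
    print(lfmc_percent(15.0, 10.0))
    print(leaf_to_stem_percent(6.0, 4.0))
```

The input checks reject physically impossible weighings (for example a dry weight above the fresh weight), which is the kind of processing error the quality control steps are meant to catch.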

Manual filtering
LFMC raw data tables were manually processed to detect inconsistencies and anomalous values related to sample processing, wrong species, or site coding. Missing database records were filled in when physical paper backups were available; otherwise, they were excluded. LFMC values were flagged as anomalous if they fell outside a species-specific range.
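The range check described above could be sketched as follows. This is only an illustration: the numeric limits are hypothetical placeholders, since the species-specific ranges actually used are not listed here, and the function and variable names are our own.

```python
# Hypothetical LFMC limits (%), for illustration only; the actual
# species-specific ranges used for the database are not given in the text.
ASSUMED_RANGES = {
    "Quercus coccifera": (40.0, 180.0),
    "Salvia rosmarinus": (30.0, 250.0),
}


def flag_anomalies(records, ranges=ASSUMED_RANGES):
    """Return (species, lfmc, reason) for records needing manual review."""
    flagged = []
    for species, lfmc in records:
        if species not in ranges:
            # Catches wrong or misspelled species codes.
            flagged.append((species, lfmc, "unknown species code"))
            continue
        low, high = ranges[species]
        if not low <= lfmc <= high:
            flagged.append((species, lfmc, "outside species range"))
    return flagged


if __name__ == "__main__":
    sample = [
        ("Quercus coccifera", 75.0),
        ("Quercus coccifera", 310.0),   # implausibly high value
        ("Quercus cocifera", 80.0),     # misspelled species name
    ]
    for row in flag_anomalies(sample):
        print(row)
```

Flagged records are only candidates for review; in the workflow described here the final decision was made manually.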

Automated outlier detection
Data quality for each species at each site was assessed using univariate time series analyses. These analyses require complete series; therefore, an imputation step was carried out first. For each series, the unsampled fortnights were identified as missing LFMC values and replaced by a linearly weighted moving average with a four-value window size. For automatic outlier detection, we used an approach based on fitting an autoregressive integrated moving average (ARIMA) model to each time series. We only considered series with more than 15 years of data. ARIMA model selection was carried out using the auto.arima function from the forecast package (Hyndman et al. 2020). The order of non-seasonal differencing was set to zero for all series, after evaluating stationarity using augmented Dickey-Fuller t-statistic tests. Parameter values of the selected model for each series are available as an ancillary dataset in the LFMC package. Two types of outliers were determined: (1) Additive Outliers (AO), single anomalous observations that do not affect subsequent observations in the series, and (2) Temporary Changes (TC), anomalous events with an exponentially decreasing effect. We did not consider a third type, Level Shifts (LS), because an abrupt change in LFMC values is not expected to permanently change the average of an LFMC time series. The automatic procedure to detect outliers was implemented using the 'tso' function from the tsoutliers package in R (López-de-Lacalle 2019). Outliers were iteratively detected in the ARIMA model residuals by calculating two different test statistics, one for each outlier type. All detected outliers were manually verified by species.
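The imputation step can be illustrated with a minimal Python sketch of a linearly weighted moving average. Two assumptions are made and should be checked against the actual processing: the window is taken as up to k = 4 positions on each side of the gap, and a neighbour at distance d receives weight k + 1 - d. The function name is hypothetical and this is not the code used to build the database.

```python
from typing import List, Optional


def impute_lwma(series: List[Optional[float]], k: int = 4) -> List[Optional[float]]:
    """Fill None gaps with a linearly weighted mean of observed neighbours.

    Neighbours up to k positions away on each side are used; a neighbour at
    distance d gets weight (k + 1 - d), so closer fortnights count more.
    Gaps with no observed neighbour inside the window are left as None.
    """
    filled = list(series)
    n = len(series)
    for i, value in enumerate(series):
        if value is not None:
            continue
        weighted_sum = 0.0
        weight_total = 0.0
        for d in range(1, k + 1):
            for j in (i - d, i + d):
                # Only original observations are used, never imputed values.
                if 0 <= j < n and series[j] is not None:
                    weight = k + 1 - d
                    weighted_sum += weight * series[j]
                    weight_total += weight
        if weight_total > 0:
            filled[i] = weighted_sum / weight_total
    return filled


if __name__ == "__main__":
    # Made-up fortnightly LFMC values (%), with one unsampled fortnight.
    print(impute_lwma([100.0, None, 80.0]))
```

Because only original observations enter the average, the result does not depend on the order in which gaps are filled.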

Access to the data and metadata description

Database structure and design
A relational database was designed to store LFMC data in a format that ensures long-term integrity. This approach also allows flexible access to the data while keeping the database in a consistent state. The relational model for the LFMC database, which includes seven tables, is shown in Fig. 2.

Database management
The LFMC database was implemented using the SQLite database management system. An associated R package was written to facilitate database updates and maintenance, as well as data processing and visualization. The main functions included in the package are shown in Table 3.
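To give a sense of how a relational SQLite design keeps LFMC records consistent, the sketch below builds a small in-memory database with Python's standard sqlite3 module. The two tables, their column names, and the inserted values are hypothetical and made up for illustration; they do not reproduce the actual seven-table schema of the LFMC database (Fig. 2) or the interface of the R package.

```python
import sqlite3


def build_demo_db() -> sqlite3.Connection:
    """Create a hypothetical two-table LFMC schema with made-up rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute("PRAGMA foreign_keys = ON")  # enforce referential integrity
    conn.executescript(
        """
        CREATE TABLE site (
            site_id  INTEGER PRIMARY KEY,
            locality TEXT NOT NULL
        );
        CREATE TABLE lfmc_sample (
            sample_id   INTEGER PRIMARY KEY,
            site_id     INTEGER NOT NULL REFERENCES site(site_id),
            species     TEXT NOT NULL,
            sample_date TEXT NOT NULL,
            lfmc        REAL NOT NULL CHECK (lfmc >= 0)
        );
        """
    )
    conn.execute("INSERT INTO site (site_id, locality) VALUES (1, 'Camarasa')")
    conn.executemany(
        "INSERT INTO lfmc_sample (site_id, species, sample_date, lfmc) "
        "VALUES (?, ?, ?, ?)",
        [
            (1, "Quercus coccifera", "2019-07-01", 70.0),
            (1, "Quercus coccifera", "2019-07-15", 60.0),
            (1, "Salvia rosmarinus", "2019-07-01", 50.0),
        ],
    )
    conn.commit()
    return conn


def mean_lfmc_by_species(conn: sqlite3.Connection):
    """Return (species, mean LFMC) rows, sorted by species name."""
    query = (
        "SELECT species, AVG(lfmc) FROM lfmc_sample "
        "GROUP BY species ORDER BY species"
    )
    return conn.execute(query).fetchall()


if __name__ == "__main__":
    print(mean_lfmc_by_species(build_demo_db()))
```

With foreign keys enabled, a sample that refers to a site missing from the site table is rejected at insertion time, which is one way a relational design guards against site coding errors.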

Technical validation
A total of 94 Additive Outliers and 49 Temporary Changes were automatically detected for LFMC values (Table 4). Both types of outliers were most often found in the LFMC series of Q. coccifera. For this species, setting the delta parameter that determines the exponential decay to δ = 0.5 improved the AO estimations. For the remaining species, δ = 0.5 neither increased the number of TC found nor improved the AO estimations, so the default value (δ = 0.7) was kept. The high incidence of TC values in Tivissa might be explained by the fact that the locality includes different sampling sites. For all LFMC series, AO values did not show a seasonal tendency, whereas most of the TC found occurred during spring. Figure 3 shows two examples of LFMC series in the database, corresponding to S. rosmarinus and Q. coccifera at the same sampling site (Camarasa). The AO and TC detected by the time series analysis are indicated, as well as the long-term trend obtained from the same analysis. To assess the correspondence between LFMC trends and weather indices, we used Standardized Precipitation Index (SPI) time series (McKee et al. 1993) computed from weather data of nearby automated stations of the Catalan Meteorological Service. Time series of the SPI for 3-month and 12-month accumulation periods are also shown in Fig. 3.
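The role of δ can be illustrated with a short Python sketch of the effect that each outlier type adds to a series. It assumes the usual formulation in which an AO affects only the observation where it occurs, whereas a TC of initial size ω contributes ω·δ^k to the observation k steps later. The function name and the magnitude used are hypothetical, and this is not the tsoutliers implementation.

```python
from typing import List


def outlier_effect(kind: str, magnitude: float,
                   delta: float = 0.7, steps: int = 5) -> List[float]:
    """Effect added to the series at 0, 1, ..., steps - 1 after the outlier.

    "AO": a single pulse that does not affect later observations.
    "TC": an initial shock that decays geometrically at rate delta.
    """
    if not 0.0 < delta < 1.0:
        raise ValueError("delta must lie strictly between 0 and 1")
    if kind == "AO":
        return [magnitude] + [0.0] * (steps - 1)
    if kind == "TC":
        return [magnitude * delta ** k for k in range(steps)]
    raise ValueError("kind must be 'AO' or 'TC'")


if __name__ == "__main__":
    # Made-up initial shock of 20 LFMC percentage points.
    print(outlier_effect("AO", 20.0, steps=4))
    print(outlier_effect("TC", 20.0, delta=0.5, steps=4))
    print(outlier_effect("TC", 20.0, delta=0.7, steps=4))
```

With δ = 0.5 the effect halves at every fortnightly step, so a temporary change fades faster than with the default δ = 0.7.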
The trend component series of both Salvia rosmarinus and Quercus coccifera are broadly related to the SPI series, with the lowest SPI values coinciding with the lowest trend values, although the trend for Salvia rosmarinus seems to be more sensitive to drought periods than that of Quercus coccifera. The TC and AO values found for Q. coccifera, and the corresponding increase in the LFMC trend, occurred in 2002-2003 and 2009-2010, periods that were relatively moist compared to the dry years between 2005 and 2008.

Reuse potential and limits
We expect the LFMC database to be useful for research on LFMC behaviour and prediction, and on how LFMC relates to meteorological, physiological, or remote sensing data (e.g. Ruffault et al. 2018a). In particular, we expect it to be useful for research related to the evaluation of wildfire risk, such as the study of the relationships between drought or climate drivers and the LFMC of different species (Viegas et al. 2001; Castro et al. 2003; Pellizaro et al. 2007), the calibration and validation of remote sensing products (Yebra et al. 2013; Marino et al. 2018), the study and prediction of plant flammability (Saura-Mas et al. 2010; Madrigal et al. 2013; Fares et al. 2017) and fire spread rate (Rossa et al. 2016; Pimont et al. 2019), or the study of the role of LFMC in wildfire events and regimes (Ruffault et al. 2018b). In addition, the database can be used to study the ecophysiological traits and processes driving LFMC dynamics (De Cáceres et al. 2015; Nolan et al. 2018; Pivovaroff et al. 2019). Importantly, pooling this LFMC database with the French Reseau Hydrique (Martin-StPaul et al. 2018; Duché et al. 2017) would yield a robust, long-term LFMC dataset covering the north-western Mediterranean area for more than 20 years. The presented database also contributes to increasing the amount of LFMC data available worldwide (Yebra et al. 2019).