Introduction

Karst systems cover 10–15% of the Earth’s surface and 35% of the European continental area (Goldscheider et al. 2020), representing a major worldwide source of freshwater providing drinking water to 10–25% of the world´s population (Ford and Williams 2007; Stevanović 2019). Hydrological models support the understanding of the system’s functioning and are fundamental to ensuring the sustainable water management of karst water resources (Hartmann et al. 2014). However, modelling of karst systems is still a difficult task due to the complex interaction between the matrix and conduit domains (Hartmann et al. 2013) and the difficulty in observing and measuring flow and transport processes in the subsurface (Berthelin and Hartmann 2020). Sivelle et al. (2022b) investigated the relevance of excess air and, thus, hydrostatic pressure for the assessment of matrix-conduit exchange. As a result, large uncertainty characterizes the internal dynamics of karst systems, i.e., fast discharge through the conduit, infiltration into the matrix and water contributions from different geological formations inside the recharge area (Chang et al. 2021).

Hence, hydrological models need to be calibrated due to the lack of knowledge in model parameter values and process understanding (Le Moine et al. 2008). Moreover, model validation is necessary to assess model reliability and utility for further applications (Andréassian 2023; Klemeš 1986). Typically, model calibration and validation are done using measured discharge time series at a monitoring site and computing performance metrics. Among the several existing metrics (Bennett et al. 2013; Ferreira et al. 2020; Moriasi et al. 2007), the Kling-Gupta Efficiency (KGE) is considered suitable for capturing the entire flow regime (Gupta et al. 2009). However, a single metric is generally unable to properly evaluate all model characteristics and solely relying on optimal values of an objective function to assess a model has been criticized (Gupta et al. 2008; Leins et al. 2023). Indeed, acceptable values of a model performance indicator do not necessarily mean that the hydrological model is reliable due to the problem of equifinality (Cinkus et al. 2022), i.e., the existence of multiple optimal parameter sets that reproduce the observed values (Chiogna et al. 2024). Different models, characterized by different structures or parameter values, can result in satisfactory simulated spring discharge (Mudarra et al. 2019). Previous works have shown that models validated with a single spring monitoring site may not capture the internal functioning of karst system and result in large equifinality (Hartmann et al. 2017).

The use of hydrochemical data allows for a better understanding of the internal dynamics of karst systems. Besides the use of conservative tracers such as stable water isotopes (Wang et al. 2021; Winston and Criss 2004), major ions and electrical conductivity (EC) are also used to investigate the spatial and temporal variability in the hydrological response of karts systems (Barbieri et al. 2005; Chang et al. 2021; Gil-Márquez et al. 2017; Hartmann et al. 2013). However, to take full advantage of the benefits of hydrochemical data to better capture the variability of the internal response of a system, the hydrochemical data should be at a resolution at least as detailed as the resolution of discharge observations, in particular for event-based models. Nevertheless, hydrochemical data are typically collected at weekly or coarser temporal resolutions due to high analysis costs and time constraints.

The aim of this study is to further contribute to ongoing research into the coupling of hydrological (e.g., spring discharge) and hydrochemical (e.g., high-resolution ion concentration time series) information for the verification of conceptual event-based models. The method proposed by Richieri et al. (2023) is used to retrieve high-resolution hydrochemical information from continuous measurements of electrical conductivity (EC). Moreover, the semidistributed LuKARS 2.0 (land use change modelling in KARSt systems) model, based on the original model from Bittner et al. (2018), is further developed by using it for the first time at an hourly timescale. LuKARS represents the discharge observed at the spring as the sum of water contributions draining different geological areas (hydrotopes) characterized by different response times. In this work, this approach is validated considering that different parts of the catchment can be characterized by different chemical signatures. The new version of the model represents the different compartments of a karst system as buckets, i.e., epikarst, matrix and conduits, and aims to provide an adaptable representation of binary karst systems with a perennial spring and intermittent overflow. In particular, the model provides an updated conceptualization of the interaction between the matrix and conduit in comparison to Bittner et al. (2018) to allow a more flexible and consistent conceptualization of the subsurface system following the approach proposed in KarstMod (Mazzilli et al. 2019; Sivelle et al. 2023). To constrain model parameters, the KGE performance criteria are combined with the conceptual information derived from high-resolution hydrochemical data, i.e., major ion concentrations. The hypothesis of the present work is that high-temporal-resolution hydrochemical data can support the verification of the conceptual model, whereas model structures simulating the spring discharge without consideration of hydrochemistry may feature a comparable KGE but may not match the internal dynamics of a system.

The model was tested on the Baget system, an intensively studied karst watershed in France (Labat et al. 1999; Sivelle et al. 2022a) which is characterized by an absence of interbasin groundwater flow (Mangin 1975; AL Khoury et al. 2023). Baget shows a heterogeneous geology combining limestone bedrock and an extensive outcrop formation of black flysch (karst and non-karst areas) (Ulloa-Cedamanos et al. 2020). Time series at hourly resolution of the contributions of HCO3 and SO42− to the total EC, called weight factors and derived by Richieri et al. (2023), contribute to the description of the internal dynamics of the Baget system, i.e., the response of water contributions from different geological areas in the catchment to different flow conditions.

The manuscript is structured as follows. First, the study area and the available dataset are presented. Then the new features of LuKARST 2.0 are described in detail, whereas the components of the model that remain unchanged are provided in the electronic supplementary material (ESM). Finally, the Morris method is used for the sensitivity analysis of the model parameters and an envelope of behavioral simulations is defined based on KGE. Finally, this study shows that only a limited subset of behavioral simulations can explain the observed hydrochemical dynamics.

Materials and methods

This section provides information about the Baget karst watershed and the hydrochemical dataset used to select an appropriate conceptual model and parameter values. The hydrochemical data were collected at the Las Hountas spring for the period 30/03/2022–7/04/2022. This section also describes the previous investigations into the internal dynamics of the Baget karst system based on hydrochemical data at high temporal resolution (Richieri et al. 2023).

Study area 

The Baget karst system is a binary karst system characterized by a perennial spring and intermittent overflow, located 10 km southwest of the city of Saint-Girons, in the Ariege administrative department, France (Fig. 1a,b). It has a recharge area of ~13 km2 and does not receive water from adjacent catchments (Mangin 1975; AL Khoury et al. 2023). The vegetation of the recharge area is dominated by a fir-beech forest, with only a small agricultural plot covering 3% of the recharge area (Sivelle et al. 2022a). Vegetation is particularly dense along the north-facing slope, whereas grassland occupies some of the south-facing slope. The Baget system is under the influence of the Atlantic oceanic climate with a mean annual air temperature of 12.3 °C, an average annual rainfall close to 1,700 mm, and no influence of snow melt processes (Padilla et al. 1994). The annual precipitation distribution is bimodal, with peaks occurring in December and February (Ulloa-Cedamanos et al. 2020).

Fig. 1
figure 1

Overview of the study site. a The location of the study area in Europe. b The location of the Antichan rain gauging station in relation to the study area. c Geological map of the Baget catchment with the location of the Las Hountas spring (modified after the BD Charm-50 geology map from the French Geological Survey). d Zoomed in area of the geological map showing the locations of the Las Hountas spring, at which the water samples were collected, and the outlet of the recharge area. e Geological subsurface cross-section in the location indicated with a black line (d). Elevation is provided in meters above mean sea level (m AMSL) (modified from Debroas 2009)

The catchment features heterogeneous geology, dominated by calcareous lithologies originating from the Jurassic and Lower Cretaceous. In addition to the limestone bedrock, a large outcrop of relatively impermeable black flysch partially covers the south-facing slope (Fig. 1c–e). The extent of the black flysch was obtained using the BD Charm-50 geology map (French Geological Survey) by measuring the corresponding covered area. Its estimation of ~30% of the catchment is comparable to the 25.2% estimated by Ulloa-Cedamanos et al. (2020). Spring flow contributions derived from different geological areas of the catchment have undergone different dissolution processes. Calcium carbonate (CaCO3) associated with the limestone is dissolved by carbonic acid (H2CO3) resulting in water typified by elevated Ca2+ and HCO3. The black flysch contains pyrite, where oxidation processes release sulfuric acid (H2SO4), which in turn dissolves CaCO3 with the products SO42−, Ca2+ and HCO3. Consequently, the water contributions from the limestone bedrock and black flysch provide different components to the chemical signature at the spring (Richieri et al. 2023). A more detailed overview of the different sources affecting the chemical composition at the spring is given in section ‘Prior investigation into hydrochemical signals’.

The karstic watershed is characterized by one perennial spring, called Las Hountas. Las Hountas is situated 110 m away from the outlet of the catchment (Fig. 1d) and is representative of a part of the total response of the system. During precipitation events, the downstream part of the Baget catchment recharges the Las Hountas spring. Only during high flood events the karst conduits in the upper part of the catchment start to discharge water to the surface, actively contributing to the streamflow generation in the Lachein stream, which is usually dry (Sivelle et al. 2020). Groundwater discharge to streamflow in the upper part of the surface channel occurs ~50 days/year (Mangin 1975). The surface water of the Lachein stream bypasses the Las Hountas spring and directly reaches the outlet of the catchment (Ulloa-Cedamanos et al. 2021). This is evident by comparing the measured discharge at the outlet and at Las Hountas spring: the discharge at the spring shows a plateau at ~0.6 m3/s (Fig. S1 in the ESM).

Data collection

The hourly precipitation data used as an input for the model were recorded at the meteorological station of Antichan, ~8 km away from the spring (Fig. 1b). The observed discharge time series at the Las Hountas spring was derived by recording water level with an Aqua TROLL 200 device (In-Situ Inc., United States) and applying the rating curve from Mangin (1975), which was adjusted to represent the current state of the cross section during additional field measurements performed in 2021 and 2022. Evapotranspiration was not considered because this study focuses on the simulation of peak flows in response to intense precipitation events during which evapotranspiration effects are negligible.

Table 1 contains information about the temporal resolution, number of samples and statistics of the specific EC (μS/cm), water temperature, pH and major solute species total concentration (i.e., each present both as free ion and as part of complexes as described in Richieri et al. 2023), i.e., Ca, Mg, HCO3, SO4, NO3, Cl, Na and K (mg/L), which were measured at the Las Hountas spring during the event-based sampling campaign from 30/03/2022 to 7/04/2022. The specific EC was measured with a time interval of 15 min by means of the In-Situ Aqua TROLL 200 device and reported at the standard temperature of 25 °C. The water samples were collected by a 6712 ISCO sampler (Teledyne ISCO, United States), which was connected to the EC probe to automatically start sampling above a water level threshold (30 cm). The sampling frequency was hourly during the rising curve of the hydrograph, every 2 h during the recession curve, and then every 3 h near baseflow conditions. The ISCO sampler was installed inside a shelter that always provided shade. The samples were collected every day in plastic vials with zero headspace, filtered through a 0.22-μm membrane and stored in a refrigerator at 4 °C for ~1 week prior to analysis. For each sample, an aliquot for cation analysis was acidified with nitric acid (HNO3) to prevent complexation and precipitation (Weiss 2020; Ulloa-Cedamanos et al. 2020). The solute concentrations, provided at the standard temperature of 25 °C, were analyzed by the laboratory Geosciences Environment Toulouse, France. The ICP-OES (inductively coupled plasma optical emission spectrometry) was used to quantify Ca, Mg, Na and K; ion chromatography was used to quantify NO3, SO4 and Cl; titration analysis was used to quantify HCO3.

Table 1 Statistics of the used datasets, including temporal resolution (TR), number of samples (n), mean, standard deviation (SD), minimum (MIN) and maximum (MAX) values of the specific electrical conductivity EC (μS/cm), major solute species (mg/L), pH and water temperature T (°C) measured at the Las Hountas spring for the events 30/03/2022–7/04/2022

To verify the accuracy of chemical quantification, the charge balance of each sample was computed using the software PHREEQC. The charge mass balance error ranged from 0.8 to 4.9% with a median of 2.4%. Since all the samples show a charge error lower than ±5%, the results of the laboratory analysis were considered reliable (Parkhurst and Appelo 2013). Figure S2 in the ESM shows the time series of the computed charge balance (%) together with the sum of cations (TC) and anions (TA) (mEq/L).

Prior investigation into hydrochemical signals

Richieri et al. (2023) previously investigated the hydrological functioning of the Baget karst system by comparing continuous water level recordings and EC measurements with high-temporal resolution major solute concentrations for multiple precipitation events, occurring in October 2021, November 2021 and November 2022. EC dynamics confirmed the complex hydrological response behavior of the Las Hountas spring, which is characterized by a simultaneous increase in water level and EC (flushing and piston effects) during precipitation events that followed dry periods, and by dilution processes during peak spring discharge periods. Richieri et al. (2023) also used the individual contributions of individual major ions i to the total observed EC, called weight factors fi, to identify the varying contributions of water derived from different areas of the watershed, i.e., limestone and black flysch, at different flow conditions. The individual contributions of each ion to the total EC were computed for each sample with PHREEQC, by considering ion molar conductivity, molar concentration and the electrochemical activity coefficient as well as pH and temperature of the sample (Table 1). A complete description of the equations used to compute the weight factors is provided in Richieri et al. (2023). Equation (1) describes the dissolution process characterizing the limestone formation. The dissolution of one mole of calcium carbonate (CaCO3) by carbonic acid (H2CO3) produces one mole of Ca2+ and two moles of HCO3. Equation (2) describes the dissolution process characterizing the black flysch containing pyrite, whose oxidation releases strong acids, i.e., sulfuric acid (H2SO4). The reaction between one mole of CaCO3 and half a mole of H2SO4 produces one mole of Ca2+, one mole of HCO3 and half a mole of SO42−. Thus, the dissolution of CaCO3 by both H2CO3 and H2SO4 leads to a lower alkalinity (HCO3) than what would be observed in the case of only dissolution by H2CO3. Consequently, it was shown that the increase and decrease in the weight factors of SO42− and HCO3 (\({f}_{{\text{SO}}_{4}^{2-}}\) and \({f}_{{\text{HCO}}_{3}^{-}})\), respectively, indicate a relative increase in the water draining the black flysch, which was observed to be simultaneous to flushing and piston effects. On the contrary, the increase in \({f}_{{\text{HCO}}_{3}^{-}}\) indicates a larger water contribution from the limestone bedrocks during dilution processes and baseflow conditions (Richieri et al. 2023).

$${\text{CaCO}}_{3}+ {\text{H}}_{2}{\text{CO}}_{3}\to {\text{Ca}}^{2+}+ {2\text{HCO}}_{3}^{-}$$
(1)
$${\text{CaCO}}_{3}+ {0.5\text{H}}_{2}{\text{SO}}_{4}\to {\text{Ca}}^{2+}+ {\text{HCO}}_{3}^{-}+ {0.5\text{SO}}_{4}^{2-}$$
(2)

It is pertinent to note, that despite the black flysch containing Na silicate minerals (Na2SiO3) (Ulloa-Cedamanos et al. 2021), the origin of HCO3 is considered to be dominated by CaCO3 dissolution due to the low Na+ content of water samples (Table 1). The variability in silicate weathering input could be further investigated by means of isotopes (Hagedorn and Whittier 2015; Spence and Telmer 2005), which were not available for the present study. The role of Na silicate weathering by H2CO3 (Ulloa-Cedamanos et al. 2021) is therefore considered negligible in the Baget catchment. In addition, a minor contribution of SO42− might be related to a gypsum formation within the catchment. However, as the area of gypsum bedrock only represents 0.2% of the recharge area (Ulloa-Cedamanos et al. 2020), the present study considers the mass flux of SO42− to be controlled by the discharge from the black flysch formation, which covers 30% of the catchment.

Despite being often considered a conservative tracer, Cl is not used in this study for the investigation of system dynamics for two reasons. Firstly, the computation of the weight factors fi at high temporal resolution is affected by high uncertainty in the case of ions with low concentration such as Cl (Table 1; Richieri et al. 2023). Second, this work focuses on the investigation of water draining different geological formations in the catchment, which does not contain any halite deposit or other geogenic sources of Cl (Ulloa-Cedamanos et al. 2021).

The Baget catchment is relatively unpolluted with an absence of SO42−, NO3 and Cl originating from anthropogenic activities. NO3 was observed to be associated with organic decomposition and to increase in autumn. Indeed, the seasonal increase in precipitation causes the leaching of NO3 from the soil (Ulloa-Cedamanos et al. 2020), whereas the observed SO42− and Cl are entirely derived from geogenic sources and precipitation, respectively (Ulloa-Cedamanos et al. 2021).

Model development

This section describes the LuKARS 2.0 model concept, together with the modifications done within this study with respect to the original model from Bittner et al. (2018). Subsequently, the Morris screening sensitivity analysis and the calibration procedure are presented. Finally, the use of high-resolution hydrochemical data for the selection of the model structure and parameters is described. The model was tested for the Las Hountas spring and calibrated and validated for the periods 1/03/2022–29/03/2022 and 30/03/2022–30/04/2022, respectively.

Description of LuKARS 2.0 model concept

LuKARS is a semidistributed model developed by Bittner et al. (2018). The model divides the catchment into hydrotopes, which are defined as independent units exhibiting similar hydrological behavior and soil characteristics. The original model from Bittner et al. (2018) was later modified by Sivelle et al. (2022a) to be coupled with KarstMod (Mazzilli et al. 2019). In this study, LuKARS 2.0 was developed and applied to the Baget catchment starting from the original model (Bittner et al. 2018) considering two hydrotopes in the upper compartment, which represent the main geological formations within the studied catchment, i.e., limestone and black flysch. LuKARS 2.0 adds three new features to LuKARS: (1) the model was modified from a daily to an hourly time step and the parameters adjusted accordingly; (2) a transfer between the matrix and conduit was implemented in the lower compartment of the model; and (3) a drainage from the conduit was implemented to represent water bypassing the spring at high flow conditions (Fig. S1 in the ESM). Figure 2 and Table 2 show the model concept and provide a description of the model parameters, respectively. The model concept was developed based on the understanding of system functioning, more precisely on the dynamics of the water contributions draining the different geological formations (Richieri et al. 2023). Since the hydrological response of the contribution from the black flysch was observed to be fast with no apparent influence on the baseflow, hydrotope 2 was defined as such to provide only fast flow to the conduit but no infiltration to the matrix. On the other hand, hydrotope 1 represents the limestone karst formation, which contributes to both the discharge to the conduit and infiltration in the matrix. The fast flow Qhyd (m3/s) from a certain hydrotope gets active when the water level in that hydrotope reaches the upper storage threshold Emax (mm) and continues until the water level goes down to the lower storage threshold Emin (mm) (hysteresis). Nevertheless, the infiltration Qis (m3/s) is always active and linearly correlated with the water level in the hydrotope. Finally, the water in the conduit will be transferred to the spring with the linear function from Mazzilli et al. (2019) (ESM, Eq. S6). The implemented modifications to LuKARS are described here, whereas the model equations from the original model (Bittner et al. 2018) are reported in the ESM.

Fig. 2
figure 2

LuKARS 2.0 model concept including the implemented transfer between the matrix bucket and the conduit bucket and the drainage from the conduit bucket for the case of the Las Hountas spring. The two hydrotopes are defined based on the main geological formations present in the recharge area, i.e., karst bedrock and black flysch, and their response to rainfall

Table 2 Overview of the model parameters of LuKARS 2.0, including symbol, unit, description, and range used for the Morris analysis

Transfer between the matrix and conduit. A transfer between the matrix and conduit QMC (m3/s) was implemented in the lower compartment of the model to represent the dual behavior typical of karst systems. The transfer was defined based on the approach of Mazzilli et al. (2023) as a function of the recharge area Ra (km2) discharge coefficient kMC (mm/h), exponent aMC (-) and dimensionless water levels in the matrix M and conduit C (-). QMC (m3/s) was implemented as shown in Eq. (3), with t indicating the current time step, abs the absolute value and sgn the sign of the subtraction of the dimensionless water level in the conduit C from that in the matrix M (-). Positive values of QMC (m3/s) means that the current direction of flow is from the matrix to the conduit. To avoid numerical instabilities, Eq. (3) is solved using the analytical solution for the inter-compartment coupling (Mazzilli et al. 2023).

$${Q}_{\text{MC}, t}={\text{Ra}}\times{\text{k}}_{\text{MC}}\times \text{sgn}\left({M}_{t}-{C}_{t}\right)\times {\text{abs}\left({M}_{t}-{C}_{t}\right)}^{{a}_{\text{MC}}}$$
(3)

Discharge from the conduit. A drainage QCloss (m3/s) from the conduit was used to represent the plateau in discharge and the water bypassing the spring (Fig. S1 of the ESM), which results from the natural drainage through the conduits at high flow conditions. The drainage was implemented to start when the water level in the conduit Ct (where the subscript t indicates the time step at which a quantity is computed) gets higher than the storage threshold Closs (mm) and to remain active as long as the water level does not drop below it. Equation (4) shows the implementation of the drainage QCloss,t (m3/s), with Ra (km2) the recharge area and dt (h) the hourly time step. The water level is checked at each time step t and the excess water exits immediately from the system without reaching the spring.

$$\begin{array}{c}\mathrm{if}\;C_t>C_{\mathrm{loss}}:Q_{\mathrm{Closs},t}=\left(C_t-C_{\mathrm{loss}}\right)\ast\frac{\mathrm{Ra}}{dt}\;\mathrm{and}\;C_t=C_{\mathrm{loss}},\\\mathrm{else}{:Q}_{Closs,t}=0\end{array}$$
(4)

Sensitivity analysis (Morris screening)

The identification of the most sensitive model parameters and potential nonlinear interactions was done with the elementary effects method, also known as Morris screening (Campolongo et al. 2007; Morris 1991). Morris screening is a one-at-a-time (OAT) method to perform global sensitivity analysis which is widely applied in literature and it is often used in the case of models characterized by a large number of input parameters for which variance-based sensitivity analysis would be computationally demanding (Campolongo et al. 2007; Jaxa-Rozen and Kwakkel 2018; Merchán-Rivera et al. 2022; Smith 2013). The Morris method considers that each model parameter set q = [q1,…,qn] varies across a discrete number l of values, called levels, forming an n-dimensional l-level grid Γl. The method relies on the average of the elementary effects over parameter space to provide a measure of global sensitivity. The elementary effect di(q) of the ith input parameter quantifies the approximate local sensitivity at the point q and is defined as in Eq. (5) (Smith 2013)

$${d}_{i}\left(q\right)=\frac{f\left({q}_{1},\dots , { q}_{i-1}, { q}_{i, } {q}_{i+1},\dots ,{q}_{n}\right)-f(q)}{\Delta }= \frac{f\left(q+ \Delta {e}_{i}\right)-f(q)}{\Delta }$$
(5)

where ei is a vector of zeros with one on the ith components and \(\Delta\) is the stepsize, which is chosen from the set shown in Eq. (6) (Smith 2013)

$$\Delta \in \left\{\frac{1}{l-1}, \dots , 1- \frac{1}{l-1}\right\}$$
(6)

To obtain a global sensitivity measure, Campolongo et al. (2007) and Morris (1991) proposed the mean µ* and variance σ, respectively, of the finite-dimensional distribution Gi associated with the absolute value of the elementary effect di(q), which is derived from randomly sampling q within Γl. Considering r sampling points, these metrics associated with the ith parameter and the jth sample can be expressed as shown in Eqs. (7) and (8).

$${\mu }_{i}^{*}= \frac{1}{r} \sum_{j=1}^{r}\left|{\text{d}}_{i}^{j}(q)\right|$$
(7)
$${\sigma }_{i}^{2}= \frac{1}{r-1 } \sum_{j=1}^{r}{\left({\text{d}}_{i}^{j}\left(q\right)- {\upmu }_{i}\right)}^{2} with\;{\mu }_{i}= \frac{1}{r} \sum_{j=1}^{r}{\text{d}}_{i}^{j}\left(q\right)$$
(8)

where \({d}_{i}^{j}\) is the elementary effect associated with the ith parameter and jth sample.

High values of σ indicate possible interactions between the model parameters and/or of parameter nonlinearity. The parameters with σ/μ* smaller than 0.1 or between 0.1 and 1 are almost linear or monotonic, respectively. The parameters with σ/μ* larger than 1 are characterized by marked nonmonotonic nonlinearities or interactions with other parameters (Sanchez et al. 2014). µ* and σ are constructed by considering m trajectories of n + 1 points in the parameter space. The total number of realizations accounted for in the metrics calculations is defined by (n + 1)*m. For a detailed explanation of the method, one can refer to the works of Morris (1991) and Campolongo et al. (2007) as well as to that of Smith (2013).

In this study, Morris was run for the calibration period (1/03/2022–30/03/2022) by using the sensitivity analysis library in Python (SALib) and considering the 17 model parameters reported in Table 2, 500 trajectories and 100 grid levels, for a total of 9,000 realizations. The length of the calibration period was selected considering the aim of building an event-based model. The number of realizations was considered sufficient since the method with 1,000 trajectories and 200 grid levels showed convergence of the results. Table 2 shows the parameter ranges used for the Morris analysis. Since it was the first application of LuKARS at an hourly scale, it was required to initially investigate the parameter ranges by means of an independent set of 10,000 Monte Carlo simulations. In addition, the parameter ranges for the two hydrotopes were selected to be consistent with the observations of the water contribution from the two different geological formations. A lower upper storage threshold was set for the black flysch, i.e., hydrotope 2, to represent its faster activation and, thus, the relative increase in the water contribution at the beginning of heavy precipitation events (Richieri et al. 2023). The extents of hydrotope 1 and hydrotope 2 were taken fixed and equal to 70 and 30% of the entire catchment, respectively. Each realization performed for the Morris method was evaluated by computing the KGE. Equation (9) shows the equation for the computation of KGE, with rc the linear correlation between the observations and simulations, σsim the standard deviation in simulations, σobs the standard deviation in observations, μsim the simulation mean and μobs the observation mean (Knoben et al. 2019).

$$\text{KGE}=1- \sqrt{{\left({r}_{\text{c}}-1\right)}^{2}+{\left(\frac{{\sigma }_{\text{sim}}}{{\sigma }_{\text{obs}}}-1\right)}^{2}+ {\left(\frac{{\mu }_{\text{sim}}}{{\mu }_{\text{obs}}}-1\right)}^{2}}$$
(9)

Among the 9,000 model realizations used for the sensitivity analysis, the realization with the highest KGE was considered as the selected one, while the subset of realizations with a KGE larger than 0.5 was defined as behavioral (e.g., Beven and Freer 2001). To assess the uncertainty of the model, each flow component—Qhyd_1, Qhyd_2, Qis_1, QMC, QCloss and QCS (m3/s)—derived by using the selected parameters values was compared to the distribution of the behavioral simulations, i.e., interquartile and 10–90 percentile envelopes.

Molel selection considering hydrochemical constraints

Once the selected and behavioral parameter sets were found, they were used to validate the model for the period 30/03/2022–30/04/2022. Thereafter, it was necessary to check whether the selected simulation represented not only the discharge at the spring but also the internal dynamics of the system, i.e., water contributions from the limestone and black flysch. For this purpose, the flows from hydrotope 1 (limestone) and 2 (black flysch) to the conduit together with the transfer between the matrix and conduit were compared to the observed hydrochemical data recorded during the increase in the water level—occurred in the 2 days 30/03/2022 and 1/04/2022—for the event 30/03/2022–7/04/2022. Therefore, three additional criteria were defined to constrain, among the multiple Morris realizations with KGE larger than 0.5, those also matching the hydrochemical information. The criteria were identified based on the weight factors (Richieri et al. 2023) computed for the time series of HCO3 and SO42− observed during the event in April 2022 (Table 1) and are described in detail within ‘Model selection using hydrochemical constraints’. It is important to note that the weight factors were computed using PHREEQC by considering for each water sample the measured pH (Table 1). In addition, despite HCO3 not being a conservative species, the alkalinity in the case of pH between 7.2 and 8.3 (Table 1) is typically considered to derive from HCO3 alone (Boyd 2020). The hydrochemical constraints were defined based on HCO3 and SO42− as these ions were previously used for the investigation of the internal dynamics of the Baget system (Richieri et al. 2023). In addition, both HCO3 and SO42− were proved to be present at Las Hountas spring at sufficiently high concentration to be retrieved at high temporal resolution by applying the EC decomposition method from Richieri et al. (2023). Indeed, the EC decomposition method (Richieri et al. 2023) allows for the retrieval of accurate concentration time series at high temporal resolution for those solutes present with a sufficiently high concentration, thus significantly contributing (~>6%) to the total EC, whereas the reconstructed time series of solutes with low concentration, e.g., Cl (Table 1), are subject to large uncertainty.

Results

This section presents the results of the investigation. Firstly, the results of the Morris analysis are presented in terms of sensitivity and nonlinear interaction among the model parameters. Secondly, the results of the model calibration and validation considering the KGE of the spring discharge are shown together with the respective uncertainty bands. Finally, it is shown how the hydrochemical criteria were used to further constrain the model by considering not only the discharge at the spring but also the expected internal flow dynamics.

Sensitivity analysis

The results of the sensitivity analysis based on the Morris screening method are evaluated by comparing the mean µ* and standard deviation σ of the distribution function of each parameter. Figure 3a shows the value of µ* for each parameter in descending order, while Table 3 contains the corresponding sensitivity ranking. Among the model parameters, the discharge coefficient for the flow from the conduit to the spring kCS is the most sensitive, followed in order by the upper storage threshold for hydrotope 1 Emax_1 and by the storage threshold for the conduit Closs. On the contrary, the discharge coefficient kMC and the exponent aMC, which are both related to the transfer between the matrix and conduit (QMC), are the least sensitive. In between the most and least sensitive parameters, the parameters related to the flow components from the hydrotope 1, i.e., limestone, are more sensitive than those from the hydrotope 2, i.e., black flysch.

Fig. 3
figure 3

Results of the Morris screening sensitivity analysis. a Mean of the absolute values of the elementary effects (μ*) for each model parameter as the index of their sensitivity. b Mean of the distribution of the absolute values (μ*) against the standard deviation of the distribution (σ) for each model parameter; the areas of the graph corresponding to monotonic and/or linear interactions and nonlinear interactions (Sanchez et al. 2014) are shown with a dashed gray line

Table 3 Overview of the results of the Morris analysis, including for each parameter the sensitivity ranking, the selected parameter value and the median and standard deviation for the behavioral simulations based on KGE

Figure 3b shows the scatter plot of the computed σ and µ*, delineating the areas of the graph corresponding to monotonic and/or linear interactions and nonlinear nonmonotonic interactions (Sanchez et al. 2014). Most of the parameters fall above the bisector, indicating strong nonlinear nonmonotonic interaction among most of the model parameters.

The sensitivity of the parameters was investigated by considering the distribution of the performance of the Morris model realizations over the parameter ranges (Table 2; Fig. 4). Figure 4 shows the distribution of occurrence of the metric KGE of the model realizations for the most and least sensitive parameters, which are the discharge coefficient of the flow from the conduit to the spring kCS and the exponent of the transfer between the matrix and conduit aMC, respectively (Fig. 3a; Table 3). The distribution of the realizations with KGE lower than zero, larger than zero and larger than 0.5 confirms the sensitivity rank obtained with the Morris analysis (Table 3). Indeed, for the exponent aMC, all the three distributions of KGE are uniform over the parameter range (Fig. 4b). On the contrary, for the discharge coefficient kCS, the realizations with KGE larger than 0 and larger than 0.5 show a probability of occurrence skewed towards the right side of the parameter range (Fig. 4a).

Fig. 4
figure 4

Distribution of occurrence of the Kling-Gupta Efficiency (KGE) of the model realizations obtained by means of the Morris screening sensitivity analysis for a the most sensitive parameter, i.e., the recession coefficient of the flow from the conduit to the spring (kCS), and b for the least sensitive parameter, i.e., the exponent of the flow transfer between the matric and the conduit (aMC)

Model calibration and validation

Among the 9,000 Morris realizations, the parameter sample corresponding to the highest KGE was chosen as the selected parameter set and the corresponding realization was chosen as the selected discharge. The selected parameters, calibrated for the period 1/03/2022–30/03/2022, are shown in Table 3 and lead to a KGE of 0.89. The model was then validated with the same selected parameter values for the period 30/03/2022–30/04/2022 leading to a KGE of 0.8. Figure 5a shows the calibrated and validated discharge time series together with the observed discharge at the spring. Overall, the simulated discharge matches the rising and falling limbs of the observed hydrograph, with the exception of the recession phase between April 4 and April 18, which is underestimated by 0.08 m3/s on average. For a better visualization, Fig. S3a in the ESM shows the time series of the observed discharge, the selected discharge of the Morris realizations based on KGE and the difference between the observed and selected discharges at the Las Hountas spring.

Fig. 5
figure 5

Simulated (red lines) flow components (Qspring, Qhyd_1, Qhyd_2, Qis_1, QMC and QCloss) of the selected simulation together with their uncertainty bands computed from the distribution of the behavioral simulations with a KGE larger than 0.5 (gray bands). The dark grey and grey areas of the bands represent the interquartile and the 10–90% percentile ranges of the behavioral simulations, respectively, while the black line is the median of the distribution. The blue line is the discharge observed at the spring

To account for the uncertainty in the model results, the distribution of the Morris behavioral simulations with a KGE greater than 0.5 was considered for both the calibration and the validation periods. Figure 5 shows, for each flow component, i.e., Qspring, Qhyd_1, Qhyd_2, Qis_1, QMC, and QCloss, the interquartile and 10–90% percentile envelopes as well as the median of the 448 selected simulations. For each flow component, the selected simulated discharge falls inside the interquartile range. The simulated discharge at the spring Qspring and the flow from the hydrotope 1 to the conduit Qhyd_1 show the lowest relative width of the interquartile uncertainty band (Fig. 5a,b; Table 4). On the contrary, the flow components characterized by larger uncertainty are the infiltration from hydrotope 1 to the matrix Qis_1 and the transfer between the matrix and conduit QMC (Fig. 5d,e; Table 4). The relative width of the 10–90% percentile ranges follows the same behavior as the interquartile ranges: Qspring and Qhyd_1 show the narrowest bands (Fig. 5a,b; Table 4); whereas Qis_1, QMC and QCloss are characterized by the largest uncertainty (Fig. 5d–f; Table 4). The drainage from the conduit QCloss is active only during high peak discharges, whereas it is null at low flow conditions (Fig. 5f).

Table 4 Relative width of the uncertainty bands computed from the distribution of the behavioral simulations (KGE > 0.5) and of the constrained simulations (hydrochemical constraints) for the flow components Qspring, Qhyd_1, Qhyd_2, Qis_1, QMC and QCloss. Both the relative width of the interquartile and 10–90% percentile ranges were normalized considering the mean of the behavioral and constrained simulations

Model selection using hydrochemical constraints

After validating the model by only considering the KGE metric, it was investigated as to whether the internal dynamics of the system, i.e., water contributions from the limestone and black flysch, were represented or not. The observed temporal dynamics of the system show high temporal variability, justifying the high-temporal resolution of hydrochemical data collection. Here, daily sampling frequencies or averaged daily values would be unsuitable to represent the variability of the water contributions. For this purpose, the high-temporal resolution hydrochemistry data collected in April 2022 (Table 1) were used together with the understanding of the system functioning. Richieri et al. (2023) found out that it is possible to distinguish the water contributions from the two main geological formations present in the Baget catchment by looking at the contribution to the total EC, called weight factor \({f}_{i}\), of HCO3 and SO42−. More precisely, a simultaneous increase in \({f}_{{\text{SO}}_{4}^{2-}}\) and decrease in \({f}_{{\text{HCO}}_{3}^{-}}\) indicated an increase in the relative contribution from the black flysch (Richieri et al. 2023). Figure 6 shows water level (wl) (m), EC (μcm/S) and weight factors \({f}_{{\text{HCO}}_{3}^{-}}\)(-) and \({f}_{{\text{SO}}_{4}^{2-}}\) (-) time series observed at the spring during the increase in water level, which occurred between 30/03/2022 and 1/04/2022, for the event 30/03/2022–7/04/2022. Three peaks in \({f}_{{\text{SO}}_{4}^{2-}}\), simultaneous to a low point in \({f}_{{\text{HCO}}_{3}^{-}}\) were observed on May 30 at 8 pm, May 31 at 9.30 am and May 31 at 10 pm. However, the simulation selected among the 9,000 Morris realizations (highest KGE) does not represent the observed varying relative water contributions. Indeed, the flow from the limestone to the conduit Qhyd_1 and the flow from the black flysch to the conduit Qhyd_2 are almost parallel lines with a null transfer between the matrix and conduit QMC (Fig. 7a).

Fig. 6
figure 6

Water level (wl) (m), electrical conductivity EC (μcm/S) and weight factors \({f}_{{HCO}_{3}^{-}}\) (-) and \({f}_{{SO}_{4}^{2-}}\) (-) time series observed at the Las Hountas spring during the increase in water level—occurred in the 2 days 30/03/2022 and 1/04/2022—for the event 30/03/2022–7/04/2022. The weight factors are computed as the contribution of the individual free ion to the total EC (Richieri et al. 2023). The three green panels indicate the times at which there is a simultaneous increase in \({f}_{{SO}_{4}^{2-}}\) (-) and decrease in \({f}_{{HCO}_{3}^{-}}\) (-)

Fig. 7
figure 7

Simulated discharge time series for the transfer between the matrix and conduit QMC, the flow from hydrotope 1 (limestone) to conduit Qhyd_1 and the flow from hydrotope 2 (black flysch) to the conduit Qhyd_2 during the increase in water level—occurred in the 2 days 30/03/2022 and 1/04/2022—for the event 30/03/2022–7/04/2022. a Discharge time series from the Morris simulation selected based on the KGE of the spring discharge. b Median and interquartile range of the behavioral Morris simulations selected with the three hydrochemical constraints. The three green arrows indicate the times at which the increase in the contribution from the black flysch Qhyd_2 is larger than that from the limestone Qhyd_1

Based on the observed hydrochemical data (Fig. 6), three hydrochemical constraints were used to identify among the Morris realizations with KGE larger than 0.5 those also respecting the internal dynamics of the system. The first criterion defines that the standard deviation of the transfer between the matrix and conduit QMC needs to be significant. Here, the presented conceptual model requires that an exchange between the matrix and conduit exists. To define a threshold value to consider the significance of the exchange, it is considered a standard deviation computed over the calibration period, which has to be larger than the mean standard deviation of the behavioral Morris simulations (0.004). In fact, many behavioral simulations lead to zero or minimal exchange between the two compartments. Other thresholds for the minimum accepted standard deviation were also tested: smaller thresholds (e.g., 0.002) had limited impact on the selected simulations respecting all three criteria, whereas larger thresholds (e.g., 0.01) led to no simulation respecting all three criteria. The second criterion states that the increase in the flow from the black flysch to the conduit (Qhyd_2) during the rising limb of the hydrograph must be larger than the increase in the flow from the limestone to the conduit (Qhyd_1) in the same period. Finally, the third criterion defines that, on April 1, the flow from the black flysch to the conduit (Qhyd_2) must be lower than the flow from the limestone to the conduit (Qhyd_1). Figure 7b shows the median and interquartile ranges of Qhy_1, Qhyd_2 and QMC for the seven Morris realizations respecting the three defined hydrochemical constraints. The median of the flow from the limestone (Qhyd_1) and black flysch (Qhyd_2) to the conduit captures the relative increases in water contribution from the black flysch observed on May 30 at 8 pm, May 31 at 9.30 am and May 31 at 10 pm, with the first peak anticipated at ~12 h (Figs. 7b and 6). The median of the transfer between the matrix and conduit (Fig. 7b) is positive during the entire rising limb of the hydrograph, indicating flow direction from the matrix to the conduit.

Figure 8 shows the distribution of the constrained Morris simulations, i.e., the simulations respecting to both the constraint on the spring discharge (acceptable simulations with KGE larger than 0.5) and the three hydrochemical constraints. The relative interquartile ranges of the constrained simulations (Table 4) are reduced in comparison to the distribution of the simulations considered behavioral based on the discharge KGE (Fig. 5) for all the flow components except QCloss. As observed in Fig. 5, Qspring (Fig. 8a) and Qhyd_1 (Fig. 8b) are characterized by the lowest relative width of the interquartile band, whereas Qis_1 (Fig. 8d), QMC (Fig. 8e) and QCloss (Fig. 8f) by the largest (Table 4). The relative width of the 10–90% percentile ranges is less significantly reduced than the interquartile ranges (Figs. 5 and 8). Qspring (Fig. 8a) shows the lowest relative width of the percentile band, while QMC (Fig. 8e) and QCloss (Fig. 8f) show the largest (Table 4). Despite the reduction in the uncertainty bands, the median of the distribution of Qspring does not respect the plateau of 0.6 m3/s (Fig. S1 of the ESM). Qhyd_1 and QCloss are the only flow components displaying a larger relative width of the percentile band (Fig. 8b,f) in comparison to that observed without considering the hydrochemical constraints (Fig. 5; Table 4). Finally, the 10–90% percentile bands for Qhyd_2 and QCloss are characterized by a relative width that increases proportionally with the discharge.

Fig. 8
figure 8

Uncertainty bands computed from the distribution of the behavioral simulations with a KGE larger than 0.5 and respecting the three hydrochemical constraints for the flow components Qspring, Qhyd_1, Qhyd_2, Qis_1, QMC and QCloss (gray bands). The dark grey and grey areas of the bands represent the interquartile and the 10–90% percentile ranges of the behavioral simulations, respectively, while the black line is the median of the distribution. The blue line is the discharge observed at the spring

Even if the 10–90% percentile range for Qspring is overall decreased (Fig. 8a), the percentile band is reduced for the calibration and increased for the validation period (Figs. 5a and 8a). On the contrary of what was noted before applying the hydrochemical constraints (Fig. 5a), the observed discharge at the spring falls outside the 10–90% percentile band for the calibration and inside the percentile band for the validation period (Fig. 8a). This results from the fact that the 10–90% percentile range of Qspring, respecting the hydrochemical constraints (Fig. 8a), includes realizations which were excluded from the 10–90% percentile range of the realizations selected based only on KGE (Fig. 5a).

To compare the performance of the model in simulating the discharge at the spring before and after considering the hydrochemical constraints, Fig. S3b (ESM) shows the time series of the difference between observed discharge and constrained discharge. The constrained discharge corresponds to the realization with the highest KGE among the subset of realizations respecting the hydrochemical constraints.

Discussion

This section reports the novelties of the new LuKARS 2.0 conceptual model and the use of hydrochemical criteria to verify the model concept and parametrization.

New LuKARS conceptual model

LuKARS 2.0 was developed at an hourly scale starting from the original LuKARS model at a daily scale (Bittner et al. 2018). When comparing the values of the parameters which are in common to both models, the discharge coefficient of the fast flow khy and of the infiltration to the matrix kis are the parameters showing the largest variation between daily (reported in Bittner et al. 2018) and hourly simulations (this study). This difference in magnitude can be explained by the fact that the karst system in which Bittner et al. (2018) tested LuKARS, i.e., Kesrchbaum, does not show relevant concentrated recharge processes and thus is characterized by a more pronounced infiltration than the Baget system, which shows a natural subsurface drainage rapidly recharged by sinkholes along the Lachein stream (Mangin 1975; Sivelle and Labat 2019). Thus, the response of Kerschbaum can be described by means of lower khy and larger kis than the Baget system.

To represent the plateau in the discharge time series at the Las Hountas spring (Fig. S1 of the ESM), a drain is implemented from the conduit out of the catchment. The sensitivity analysis indicates that the parameter controlling the activation of the drainage (Closs) is the third most sensitive parameter (Table 3). The selected Morris simulation as well as the interquartile range of the behavioral simulations (Fig. 5) confirm the relevance of the drain to discharge out of the system the excess water (Fig. 5f) and thus respect the maximum threshold of 0.6 m3/s at the spring (Fig. S1 of the ESM). However, the median of the behavioral simulations coincides with the 25% quartile and is constantly null throughout the calibration and validation periods, meaning that QCloss does not play a role for 50% of the realizations (Fig. 5f). The behavioral simulation respecting the hydrochemical data shows the same behavior (Fig. 8f): the median of the QCloss is constantly null, while the interquartile and 10–90% percentile bands show the large impact that the drainage from the conduit has for a share of the realizations. This indicates, as discussed subsequently, that further data at high temporal resolution, such as solutes and isotopes time series, should be considered to constrain the model parameters. In addition, time series collected at different points of the catchment would give more insight into the role of the conduits as natural drains as well as into the dynamics of the water draining from the different recharge areas.

To represent the characteristic dual response behavior of karst systems, the original LuKARS model (Bittner et al. 2018) was further modified by implementing two buckets in the lower compartment, i.e., matrix and conduit, and a transfer QMC between these two. The sensitivity analysis indicates that the parameters controlling the transfer between the matrix and conduit QMC, i.e., the discharge coefficient kMC and exponent aMC, are the least sensitive parameters (Table 3; Fig. 3a). The selected Morris simulation and the median of the realizations with KGE larger than 0.5 are characterized by no and little influence of QMC on the simulated discharge at the spring, respectively (Fig. 5e). The role of the transfer between the matrix and conduit becomes more relevant after applying the hydrochemical constraints: the realizations respecting the hydrochemical data (Fig. 6) present an increase in the width of the interquartile range and a median larger than the mean standard deviation of the behavioral Morris simulations, i.e., 0.004 (Fig. 8e).

Overall, the selected Morris realization well represents the dynamics of the spring discharge Qspring for both the calibration and validation, with an exception for the period between April 4 and 18 in which the simulated discharge is lower than the observed discharge (Fig. 5a). The model cannot represent the increase in discharge at the spring on April 4 due to the input data, which do not show significant precipitation on April 4 that could cause the increase in simulated discharge (Fig. S1, ESM). The hourly precipitation data were recorded at the Antichan gauging station, ~8 km north-east from the Las Hountas spring, and used as spatially homogeneous input for the presented conceptual model. Due to the heterogeneity of the precipitation field, it might be that the data are not always accurate for the Baget catchment during this specific rainfall event. Consequently, the entire recession curve until April 18 systematically underestimates the observed discharge of 0.08 m3/s on average. The raster daily E-OBS precipitation data (Cornes et al. 2018) were used as a comparison with the Antichan gauging station and showed consistent observations. However, due to the daily temporal resolution and the coarse spatial resolution of the grid (0.25°) of E-OBS, that dataset could not be used for the present study.

Sensitive parameters

The results of the sensitivity analysis can be qualitatively compared to those obtained by Bittner et al. (2020) by applying the active subspaces on the daily LuKARS. In the present paper, the parameters related to hydrotope 1, i.e., limestone, are more sensitive than those from hydrotope 2, i.e., black flysch, probably since the limestone represents 70% of the entire catchment area and thus, assuming spatially homogeneous precipitation rate, contributes to a larger extent to the total recharge to the spring. This observation agrees with the results from Bittner et al. (2020), where the parameters related to the hydrotope with a larger area are generally more sensitive. On the contrary, the sensitivity of the other parameters differs between the present study and the previous investigation due to the different representations of the lower compartment in the model concept and the different study areas. Indeed, according to Bittner et al. (2020), the discharge coefficient regulating the infiltration to the matrix kis is the most sensitive parameter, whereas in the present study kis shows a sensitivity ranking of 9 out of 17 (Table 4).

The parameters regulating the transfer between the matrix and conduit, i.e., discharge coefficient kMC and exponent aMC, are the least sensitive parameters of the presented model concept (Table 3; Fig. 3a). This is also due to the model structure. In fact, the matrix-conduit interaction can be merged as a single compartment, as, for example, in the original version of LuKARS. While this choice can be satisfactory for reproducing discharge time series, it can lead to erroneous hydrochemical interpretation of the results. Therefore, since the sensitivity is a function of the used objective function, the results of the sensitivity analysis performed with respect to KGE, and only against the discharge at the spring, may not capture the importance of all the parameters, especially when nonlinearities are present (Fig. 3b). Indeed, the selection of the Morris realizations respecting the hydrochemical constraints show the relevance of the transfer between the matrix and conduit. Those realizations characterized by a transfer between the matrix and conduit QMC with a standard deviation larger than the mean standard deviation of the behavioral Morris simulations (Fig. 7b) well represent the observed hydrochemical dynamics (Fig. 6).

Impact of hydrochemical information on model selection

The model was calibrated and validated considering the discharge and the hydrochemical signal at the spring, respectively, over a time period of 9 days. The short period of time was chosen in line with the performed analyses so as to have an event-based model. In addition, despite the limitations related to calibration over a short period of time (Leins et al. 2023), the length of the calibration and validation periods were chosen based on the collected high-resolution hydrochemical data (Table 1). Indeed, the interpretation of the internal fluxes of a longer simulated period may require the consideration of reactive transport, i.e., chemical reaction with soil and aquifer materials, seasonal variation in background concentrations, and coexistence of water with different ages.

Based on the previous investigation into the system functioning of Richieri et al. (2023), the parameter ranges (Table 2) for hydrotopes 1 and 2, i.e., limestone and black flysch, respectively, were defined to enhance a faster reaction of the black flysch in case of a strong precipitation event. In fact, it was observed that when the water level at the spring rises simultaneously to the EC, the relative contribution to the EC of SO42−, which originated from the pyrite contained in the black flysch (Eq. 2), increases. The faster response of the black flysch was considered by defining the lower upper and lower thresholds for the activation of the fast flow to the conduit (Table 2). However, when investigating the internal flow component of the selected Morris realization, the increase in fast flow from hydrotope 2, i.e., black flysch, observed during the precipitation event from 30/03/2022 to 7/04/2022 (Fig. 6), is not captured (Fig. 7a). Hydrochemical constraints were thus implemented to constrain those realizations respecting the dynamics of \({f}_{{\text{SO}}_{4}^{2-}}\) and \({f}_{{\text{HCO}}_{3}^{-}}\) (Fig. 6). The constrained simulations (Fig. 7b) capture the larger relative increases in the contribution from the black flysch and the lower relative increases in the contribution from the limestone that occurred on May 30 at 8 pm, May 31 at 9.30 am and May 31 at 10 pm. The first relatively faster increase in black flysch on May 30 at 8 pm (Fig. 7b) was anticipated with respect to the observed hydrochemcial data (Fig. 6) of ~12 h, probably due to the initial water content in the soil before the rise in water level and the need of larger storage.

The verification of the model by considering hydrochemical constraints resulted in the overall reduction of the interquartile uncertainty bands with respect to the case in which the Morris realizations were evaluated based only on the KGE (Figs. 5 and 8). This is due to the application of the hydrochemical constraints, which reduce the selected simulations from 448 to 8. Therefore, the hydrochemical criteria may be too stringent for the validation of the parameter sets, but they can be used to verify whether the conceptual model applies to simulations and can, therefore, be used to test the plausibility of the conceptual model. Despite the reduction of the uncertainty bands, the median of the realizations respecting the hydrochemical constraints of Qspring underestimates the baseflow and exceeds the plateau of 0.6 m3/s (Fig. S1 of the ESM), which is respectively both the selected Morris realizations and the median of the Morris realizations with a KGE larger than 0.5. Finally, the relevance of the hydrochemical constraints is also seen by the fact that the percentile range for the Qspring of the realizations selected, based only on KGE (Fig. 5a), does not contain some realizations that are included in the percentiles of the realizations respecting the hydrochemical dynamics (Fig. 8a). Therefore, without considering the hydrochemical constraints, some parameter sets representing the hydrochemical dynamics would have been excluded.

An alternative to the approach demonstrated in this study, hydrochemical data could be used during the model calibration process. Here, the sensitive parameters of the simulations for the hydrochemical dynamics could be further investigated and constrained. Among the most sensitive parameters (Fig. 3; Table 3), patterns can be identified for α_1(-), khyd_1 (mm2/h), Emax_1 (mm), kis_1(1/h) and khyd_2 (mm2/h), whose values fall in ranges narrower than those initially defined for the Morris analysis (Table 2).

Conclusion

This study presents the development of LuKARS 2.0 on an hourly scale for the Baget karst catchment. The model concept was modified in comparison to the original daily LuKARS from Bittner et al. (2018) to represent the plateau of 0.6 m3/s, which characterizes the discharge at the Las Hountas spring. Thus, a drain from the conduit was implemented to discharge the excess water out of the system as soon as the water level in the conduit rose above a defined threshold. The interaction between the matrix and conduit is here updated in comparison to Bittner et al. (2018) to allow a more flexible conceptualization of karst systems. The sensitivity analysis performed using the Morris method with a total of 9,000 realizations was applied to investigate the parameter sensitivity and calibrate the model. The model was calibrated and validated for the periods 1/03/2022–29/03/2022 and 30/03/2022–30/04/2022, respectively, considering KGE as a performance metric. Among the Morris realizations, the behavioral simulations were first defined based on the KGE value of the simulated discharge at the spring. Then, three additional hydrochemical constraints were considered to represent the internal dynamics of the systems, i.e., water contributions from the limestone and black flysch present in the catchment. The hydrochemical constraints were defined based on SO42− and HCO3 time series at high resolution and the previous work of Richieri et al. (2023).

The results of the present investigation show that the selected Morris realizations can represent the dynamics of the spring discharge but do not sufficiently account for the temporal variation of the contribution from the limestone and black flysch, which was observed during the rising limb of the event 30/03/2022–7/04/2022. The implementation of hydrochemical constraints leads to the representation of the relative increase in black flysch, while still providing a good rendering of the spring discharge. The uncertainty quantification performed on those realizations respecting the hydrochemical data show a reduction in the relative width of the interquartile bands in comparison to before the implementation of the hydrochemical constraints. The selected simulation and the interquartile range of the behavioral simulations show the relevance of the implemented drainage from the conduit to represent a plateau in the spring discharge. However, the 10–90% percentile range of the spring discharge exceeds the defined threshold, indicating that further constraints in the parameter ranges would be required. In addition, further validation using other datasets would allow for investigating to what extent the information contained in hydrochemical data time series leads to a better representation of the system internal dynamics.

In conclusion, this study highlights the importance of considering the internal dynamics of a system when selecting hydrological models. Indeed, even if different model realizations simulated the spring discharge with a comparable KGE, not all of them were consistent with the dynamics of the measured SO42− and HCO3 time series. Hence, hydrochemical data at high temporal resolution need to be collected to conceptualize and select hydrological models as well as to properly constrain the model parameter ranges.