Assessing the reliability of a physical-based model and a convolutional neural network in an ungauged watershed for daily streamflow calculation: a case study in southern Portugal

Oliveira, Ana R.; Ramos, Tiago B.; Simionesei, Lucian; Neves, Ramiro

doi:10.1007/s12665-024-11498-1

Assessing the reliability of a physical-based model and a convolutional neural network in an ungauged watershed for daily streamflow calculation: a case study in southern Portugal

Original Article
Open access
Published: 25 March 2024

Volume 83, article number 215, (2024)
Cite this article

Download PDF

You have full access to this open access article

Environmental Earth Sciences Aims and scope Submit manuscript

Assessing the reliability of a physical-based model and a convolutional neural network in an ungauged watershed for daily streamflow calculation: a case study in southern Portugal

Download PDF

Ana R. Oliveira¹,
Tiago B. Ramos¹,
Lucian Simionesei¹ &
…
Ramiro Neves¹

651 Accesses
1 Citation
2 Altmetric
Explore all metrics

Abstract

The main goal of this study was to estimate inflows to the Maranhão reservoir, southern Portugal, using two distinct modeling approaches: a one-dimensional convolutional neural network (1D-CNN) model and a physically based model. The 1D-CNN was previously trained, validated, and tested in a sub-basin of the study area where observed streamflow values were available. The trained model was here subject to an improvement and applied to the entire watershed by replacing the forcing variables (accumulated and delayed precipitation) to make them correspond to the values of the entire watershed. The same way, the physically based MOHID-Land model was calibrated and validated for the same sub-basin, and the calibrated parameters were then applied to the entire watershed. Inflow values estimated by both models were validated considering a mass balance at the reservoir. The 1D-CNN model demonstrated a better performance in simulating daily values, peak flows, and the wet period. The MOHID-Land model showed a better performance in estimating streamflow values during dry periods and for a monthly analysis. Hence, results show the adequateness of both modeling solutions for integrating a decision support system aimed at supporting decision-makers in the management of water availability in an area subjected to increasing scarcity.

Scale Effects of the Monthly Streamflow Prediction Using a State-of-the-art Deep Learning Model

Article 20 June 2022

Deep neural network-based discharge prediction for upstream hydrological stations: a comparative study

Article 21 August 2023

Multiple data-driven approaches for estimating daily streamflow in the Kone River basin, Vietnam

Article 01 July 2024

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

The IPCC 2022 report (Pörtner et al. 2022) projects an increase in the frequency and severity of low flows in Southern Europe, resulting from increasing drought and water scarcity conditions. Population exposed to at moderate water scarcity will grow by 18% and 54% for a raise of 1.5°C and 2°C in air temperature, respectively. The groundwater resources will be affected by an increase in abstraction rates and a decrease in recharge rates. Agriculture, which represents the main water use in the region, may be seriously limited by water availability. Thus, there is a need to improve water management at different scales to cope with the increasing scarcity. At the regional scale, this means the construction of dams and reservoirs to increase water storage, desalination, water reuse, and the adoption of water conservation measures. At the plot scale, that means reallocation to crops more resistant to drought conditions, the improvement of water use efficiency and performance of irrigation systems, and the implementation of soil water conservation practices (Jovanovic et al. 2020; Pereira et al. 2009).

Decision-support systems (DSSs) have been developed over the last few decades to improve water resource management at different spatial and temporal scales (Teodosiu et al. 2009). These tools commonly consist of interactive software-based systems where useful information from raw data sources, documents, simulation models, and other sources is aggregated to identify and solve problems and support decision-making. Considering the plot scale, Smart Irrigation Decision Support System (SIDSS, Navarro-Hellín et al. 2016) and IrrigaSys (Simionesei et al. 2020) are examples of DSSs for irrigation water management support. SIDSS estimates weekly irrigation needs based on data from soil sensors and/or weather stations using two machine learning techniques. IrrigaSys also estimates weekly irrigation needs using a physically based model fed by weather forecast and hindcast data. When considering larger scales, Zhang et al. (2015a) designed and developed a prototype of a DSS for watershed management by integrating open-source web-based geographical information systems, a modeling component, and a cloud computing platform. Ashrafi and Mahmoudi (2019) presented a DSS to assist decision-makers in examining the impacts of different operating policies at the basin scale. DSSs are also applied to reservoir flood control operations (Delaney et al. 2020) and to early warning and detection, follow-up, and early response to flood events and hazmat pollution occurrences in inland and transitional waters (HAZRUNOFF Project - Layman’s Report 2020).

As proposed by Miser and Quade (1985), one of the steps to design a DSS is the building of models to predict consequences. A good hydrological and/or hydraulic model with reliable results and proved forecast capacity is of paramount importance for water management DSSs. Their results can then feed other models in the DSS. For instance, modeled groundwater levels can be used to estimate irrigation needs, or the simulation of river flows can help in flood forecast. However, modeling results can also be directly used to support decision-making.

Concerning models’ classification, they can be divided into three main groups according to their complexity: (i) empirical models; (ii) conceptual models; and (iii) physical models (Sitterson et al. 2017). Empirical models are based on linear and non-linear equations that relate inputs and outputs ignoring the physical processes. These types of models are considered the simplest models. Conceptual models are based on simplified equations to describe the hydrological processes and are characterized by an intermediate level of complexity. Physical-based models, also known as process-based models, are the most complex and rely on physical principles, being suitable to provide insights into physical processes. Usually, physical models use finite difference equations and state variables that can be measured and are time- and space-dependent (Devia et al. 2015; Fatichi et al. 2016). However, their weakness relies on the large number of parameters required to describe the physical characteristics of the watershed, which leads to high complexity levels that make their correct implementation difficult and laborious calibration and validation processes (Devia et al. 2015; Abbott et al. 1986a, b; Ranatunga et al. 2016; Zhang et al. 2015b; Mehr et al. 2013).

The study presented here is included within the framework of a larger work aimed at developing a DSS for supporting water management in the Maranhão and Montargil reservoirs, in southern Portugal. These reservoirs store water that is used mainly for irrigation of the Sorraia Valley, which comprehended a cultivated area of 21,280 ha and an irrigated area of 18,754 ha (ARBVS 2023) in 2021. With a 52% increase in the irrigated area over the last 2 decades (ARBVS 2023) and facing predictions of river flow decrease between 54 and 94% due to climate change (Almeida et al. 2018), accurate forecast of streamflow is of extreme importance to improve the management of water availabilities in the region. Taking as example the Maranhão reservoir, the work presented here makes use of two different types of models to estimate the daily inflow to the reservoir and discusses the advantages and weaknesses of both approaches. The applied models were the physically based MOHID-Land model (Trancoso et al. 2009; Canuto et al. 2019; Oliveira et al. 2020) and a convolutional neural network (CNN) (Oliveira et al. 2023), i.e., a data-driven model. In both cases, the models were calibrated/trained and validated using data from a hydrometric station that corresponds to 30% of the Maranhão watershed. Because there are no stations monitoring the entire watershed despite the importance of this information for the sustainability of the irrigation district, this study also aims to analyze the capacity of both approaches to represent streamflow generation in the entire watershed. That analysis comprehended the expansion of models results from the referred sub-basin to the full basin scale through the extension of the calibrated parameters in MOHID-Land, or through the replacing of the forcing variables in the CNN model. The results were then validated with a monthly reservoir mass balance. Therefore, this study provides sophisticated modeling tools for streamflow calculation in the Maranhão watershed, which were developed using two distinct modeling approaches. The ultimate aim is their integration into the DSS for supporting water managers in the decision-making of water availabilities in the region.

Materials and methods

Description of the study area

The Maranhão dam is located at Ribeira da Seda, southern Portugal (39° 0′ 53.846″ N; 7° 58′ 33.149″ W). The corresponding reservoir has a total capacity of 205 hm³ and drains an area close to 2300 km². The minimum, average, and maximum altitudes are 122, 261, and 723 m, respectively (EU-DEM 2019) (Fig. 1).

The climate is classified as Mediterranean hot-summer (Csa) according to Köppen–Geiger climate classification (Agencia Estatal de Meteorología (España) 2011). The average annual precipitation is 608 mm. The minimum and maximum average monthly precipitation are 4 mm in July and August and 84 mm in December. The average monthly air temperature ranges from 24 °C in July and August, and 9°C in January, while the annual average is 16 °C. The main soil reference groups are Luvisols (67%), Regosols (18%), and Cambisols (11%) (Panagos et al. 2012). The main land uses are non-irrigated arable land and agro-forestry areas, both representing 28% of the watershed, broad-leaved forest, occupying 15%, and olive groves, with a representation of 11% (CLC 2012 2019).

The Maranhão watershed has four hydrometric stations (Fig. 1), with all measuring daily streamflow in natural regime. Table 1 presents a brief characterization of those stations.

Table 1 Characteristics of hydrometric stations: drained area, period of records and percentage of records within this period, minimum (Min), maximum (Max), median (Med), and average (Ave) streamflow values (source: SNIRH 2021)

Full size table

Figure 2 shows the monthly patterns considering the daily streamflow values at the four stations. In accordance with the meteorological characterization, streamflow patterns show higher values between November and April, while lower values occur between May and September, with August presenting the lowest value.

The water stored in the Maranhão reservoir is mainly for irrigation of the Sorraia Valley (ARBVS 2023). Other uses include energy production, industrial supply, and recreation. The stored volumes normally increase during the wet period and decrease in the dry period as expected in hydroagricultural reservoirs (Fig. 3).

Convolutional neural network model description

A one-dimensional convolutional neural network (1D-CNN) was used to estimate daily streamflow at Ponte Vila Formosa. This 1D-CNN model was created, developed, optimized, and tuned in Python language (version 3.8.10) using public and free tools (Keras, Chollet et al. 2015; TensorFlow, Abadi et al. 2016; KerasTuner, O’Malley et al. 2019; Pandas, McKinney 2010; Scikit-learn, Pedregosa et al. 2011). A detailed description about the development of the 1D-CNN model is presented in Oliveira et al. (2023). In that study, the authors carried out a set of experiments where three different neural network models were tested for streamflow estimation, as well as several combinations of precipitation and air temperature values. The models’ structures and hyper-parameters were optimized and tuned using six different training algorithms. Also, the batch size and the number of epochs were optimized. The best solution for streamflow estimation was obtained with a 1D-CNN model composed of one input 1D convolutional (1D-Conv) layer with 16 filters, a kernel size equal to 1, and an output dense layer activated by a linear function. Between them, two 1D-Conv layers, each having 32 filters and a kernel size of 8, were applied. After each 1D-Conv layer, a MaxPooling1D layer with pool_size set to 2 was placed. The Nadam optimizer was the training algorithm with the best performance combined with a learning rate of 1 × 10^–3 and a ε (constant used for numerical stability) of 1 × 10^–8. The batch size and the number of epochs were 20 and 200, respectively. Finally, the input variable was the daily precipitation values accumulated in 1, 2, 3, 4, 5, and 10 days and delayed in 1, 2, 3, 4, 5, 6, and 7 days.

The CNN model was tuned, trained, and validated considering the streamflow values available in Ponte Vila Formosa station (30% of the Maranhão watershed) for the period from 01/01/2001 to 01/01/2009. The model performance was considered good, reaching a Nash–Sutcliffe Efficiency (NSE) of 0.86, a coefficient of determination (R²) of 0.87, a percent bias (PBIAS) of 10.5%, and a root-mean-squared error (RMSE) of 4.2 m³ s⁻¹ for the test dataset. Thus, in this study, the same 1D-CNN model was used by considering the precipitation of the entire Maranhão watershed instead of the sub-basin’s data as in the original version.

Input variables for 1D-CNN model

The precipitation data used to train the 1D-CNN model were obtained from the ERA5-Reanalysis dataset (Hersbach et al. 2017). This is a gridded product with a resolution of 31 km and an hourly timestep, making it an appropriate option for the implementation of the physically based model, which requires sub-daily precipitation in small watersheds like Maranhão. Precipitation data were extracted from the dataset considering all the cells within the limits of the watershed. Precipitation hourly values were then averaged within the watershed area and accumulated each day from 01/01/2001 to 31/12/2009. The daily precipitation values in the watershed accumulated in 1, 2, 3, 4, 5, and 10 days and delayed in 1, 2, 3, 4, 5, 6, and 7 days were considered. The average annual precipitation for the period considered in this study was 575 mm, with July (3 mm) and August (8 mm) presenting the minimum monthly values, and October (104 mm) and November and December (both with 67 mm) the months when more precipitation was registered.

Estimation of Maranhão inflow with 1D-CNN

The Maranhão reservoir’s daily inflow was estimated considering the daily precipitation in the corresponding watershed and the trained 1D-CNN model. However, because of the intrinsic random behavior verified in randomly initialized neural networks (Duan et al. 2020; Alzubaidi et al. 2021), the 1D-CNN model was trained 100 times. Those 100 runs were performed using the same dataset and division into training, validation, and test datasets presented in Oliveira et al. (2023). After each run, the results were compared and evaluated considering the observed streamflow in Ponte Vila Formosa station. Based on the statistical evaluation, the model with the best performance was selected.

The selected 1D-CNN model was then exposed to Maranhão watershed daily precipitation, with results representing the daily surface flow generated in the watershed and flowing to the Maranhão reservoir. Those daily values were then aggregated by month and transformed into volume. The estimated monthly volume that reached Maranhão reservoir was incorporated into the reservoir mass balance to estimate the stored volume in the following month. The validation of inflow values was made through the comparison of estimated stored volumes and the corresponding observed values.

MOHID-Land model description

MOHID-Land is an open-source hydrological model, with the code available in an online repository (github.com/Mohid-Water-Modelling-System/Mohid). MOHID-Land (Trancoso et al. 2009; Canuto et al. 2019; Oliveira et al. 2020) is a fully distributed and physically based model. Considering the mass and momentum conservation equations and a finite volume approach, the model simulates the water movement between four main compartments: atmosphere, porous media, soil surface, and river network. To avoid instability problems and save computational time, the model time step is variable being higher during dry seasons and lower in wet periods when water fluxes increase.

According to his finite volume approach, the domains in MOHID-Land are discretized by a regular grid in the surface plane and by a Cartesian coordinate system in the vertical direction. The land surface considers a 2D domain to simulate the water movement, while the porous media is represented by a 3D domain, which includes the same surface grid and is complemented with the vertical grid with variable thickness layers. Additionally, a 1D domain representing the river network can be derived from a digital terrain model represented in the horizontal grid. The water lines of the river network are then delineated by linking surface cell centers (nodes).

The four compartments referred to before are all explicitly simulated, except the atmosphere which is only responsible for providing the data needed for imposing surface boundary conditions. The atmospheric data can be space and/or time variant, and include precipitation, air temperature, relative humidity, wind velocity, solar radiation, and/or cloud cover.

The amount of water precipitated in each cell is divided into surface and subsurface flow considering the infiltration process and according to the soil saturation state. In this study, the infiltration rate (i, LT⁻¹) was computed according to the Darcy’s law

$$i= {-K}_{sat}\left(\frac{\partial h}{\partial z}+1\right),$$

(1)

where K_sat is the saturated soil hydraulic conductivity (LT⁻¹), h is the soil pressure head (L), and z is the vertical space coordinate (L).

The movement of infiltrated water in porous media was simulated using the Richards’ equation, which is applied to the whole subsurface domain and simulates saturated and unsaturated flow using the same grid

$$\frac{\partial \theta }{\partial t}=\frac{\partial }{{\partial x}_{i}}\left[K(\theta )\left(\frac{\partial h}{{\partial x}_{i}}+\frac{\partial }{{\partial x}_{i}}\right)\right]-S\left(h\right),$$

(2)

where θ is the volumetric water content (L³L⁻³), x_i represents the xyz directions (–), K is the hydraulic conductivity (LT⁻¹), and S is the sink term representing root water uptake (L³L⁻³ T⁻¹). The soil hydraulic parameters were described using the van Genuchten–Mualem functional relationships (Mualem 1976; van Genuchten 1980). When a cell reaches saturation, i.e., when soil moisture in a cell is above a threshold value defined by the user, the model considers the saturated conductivity to compute flow and pressure becomes hydrostatic, corrected by friction. The ratio between the horizontal and vertical hydraulic conductivities is defined by a factor (f_h = K_hor/K_ver) that can also be tuned by the user.

The root water uptake was estimated considering the weather conditions and soil water contents. The reference evapotranspiration (ET_o) rates were computed following the FAO Penman–Monteith method (Allen et al. 1998). The crop evapotranspiration (ET_c) rates were then estimated by multiplying the ET_o first with a crop coefficient (K_c). The K_c values were made to vary as a function of the plant development stage, as follows:

$${K}_{c}=\left\{\begin{array}{c}{K}_{c,ini}, GFr<GFr1\\ {K}_{c,ini}+\frac{GFr-GFr1}{GFr2-GFr1}\left({K}_{c,mid}-{K}_{c,ini}\right), GFr1<GFr<GFr2\\ {K}_{c, mid}, GFr2<PS<{GFrLAI}_{Sen}\\ {K}_{c,mid}+\frac{GFr-GFr2}{1.0-GFr2}\left({K}_{c,end}-{K}_{c,mid}\right), GFr2<GFr<1.0\\ {K}_{c,end}, GFr>1.0\end{array},\right.$$

(3)

where GFr, GFr1, GFr2, and GFrLAI_Sen are the plant growth fractions in the simulated instant, the initial stage, the mid-season stage, and when the LAI senescence starts, respectively, and K_c,ini, K_c,mid, and K_c,end are the crop coefficients during the initial, mid-season and end-season stages, respectively. The plant growth stages are represented as a percentage of maturity heat units, and the values for GFr1, GFr2, and GFrLAI_Sen are defined in the plant growth database of MOHID-Land. ET_c values are then partitioned into potential soil evaporation (E_s) and crop transpiration (T_c) as a function of the simulated leaf area index (LAI), which is computed using a modified version of the EPIC model (Neitsch et al. 2011; Williams et al. 1989) and considering the heat units approach for the plant to reach maturity, the crop development stages, and crop stress (Ramos et al. 2017). Following the macroscopic approach proposed by Feddes et al. (1978), root water uptake reductions (i.e., actual crop transpiration rates, T_a) are computed by distributing water extractions along the root zone domain and are estimated considering the presence of depth-varying stressors, such as water stress (Šimůnek and Hopmans 2009; Skaggs et al. 2006). Finally, the actual soil evaporation is calculated from potential soil evaporation by imposing a pressure head threshold value (ASCE 1996).

The amount of water that is not able to infiltrate is transformed into surface flow which is computed by solving the Saint–Venant equation in its conservative form, accounting for advection, pressure, and friction forces

$$\frac{{\partial Q}_{u}}{\partial t}+{{v}}_{v}\frac{{\partial Q}_{u}}{{\partial x}_{u}}=-gA\left(\frac{\partial H}{{\partial x}_{u}}+\frac{\left|Q\right|{Q}_{u}{n}^{2}}{{A}_{v}^{2}{R}_{h}^{4/3}}\right),$$

(4)

where Q is the water flow (L³T⁻¹), A is the cross-sectional flow area (L²), g is the gravitational acceleration (LT⁻²), ν is the flow velocity (LT⁻¹), H is the hydraulic head (L), n is the Manning coefficient (TL^−1/3), R_h is the hydraulic radius (L), and subscripts u and v denote flow directions. The Saint–Venant equation is solved on a 2D domain considering the directions of the horizontal grid except for the river network, where it is solved considering the 1D domain comprehending the water lines. There, the cross-section for each node of the river network is defined by the user.

The water changes between the river network and the soil surface are estimated according to a kinematic approach, neglecting bottom friction, and using an implicit algorithm to avoid instabilities. The water fluxes between the river network and the porous media are driven by the pressure gradient in the interface of these two mediums.

Model set-up

The MOHID-Land model was implemented using a constant horizontal spaced grid with a resolution of 0.006º in longitudinal and latitudinal directions (⁓520 × 666 m). To cover the modeled domain, the grid had 140 columns and 110 rows, with its origin located at 38° 45′ 16.5" N and 8° 03′ 12.4" W.

Elevation data were interpolated to the MOHID-Land grid from the digital elevation model (DEM) provided by the European Environment Agency (EU-DEM 2019) and have a resolution of approximately 30 m (0.00028°). The watershed’s minimum and maximum elevations after the interpolation process were 107 m and 725 m, respectively (Fig. 4a). The delineation of the watershed and the river network was performed considering the cell where the dam of Maranhão reservoir is located as the outlet. The minimum area to consider the existence of a waterline (minimum threshold area) was 10 km². Additionally, a rectangular geometry was chosen to represent the river cross-sections with width and height defined according to Andreadis et al. (2013). The cross-section dimensions were related to the drained area and were assigned to the river network according to Table 2. For the nodes where the drained area relied between the values presented on the table, the cross-section dimensions were linearly interpolated based on the given information.

Table 2 Cross sections dimensions according to drained area

Full size table

The CORINE Land Cover 2012, with a resolution of 100 m (CLC 2012, 2019), was interpolated to the MOHID-Land’s grid and was used for representing land use in the watershed. Each land-use class was associated with: (i) a Manning coefficient, which was defined according to Pestana et al. (2013) (Fig. 4b), and (ii) a vegetation type class considering MOHID-Land’s database (Fig. 4c).

The K_c values were defined according to Allen et al. (1998) for agriculture (summer and winter crops), orchard, pasture, and brush, while pine, oak, and forest crop coefficients were defined based on the values proposed by Corbari et al. (2017) (Table 3).

Table 3 Crop coefficient values for initial stage (K_{c ini}), mid- (K_{c mid}) and late (K_{c end}) season for each type of vegetation

Full size table

The Mualem–van Genuchten hydraulic parameters were obtained from the European Soil Hydraulic Database (EU Soil Database, Tóth et al. 2017). Although the database provides information at 7 different depths, with a resolution of 250 m, the present application only considered data from 0.3, 1.0 and 2.0 m depths. The porous media was divided into 6 layers, with a thickness of 0.3, 0.3, 0.7, 0.7, 1.5, and 1.5 m from surface to bottom (vertical grid), with the maximum total soil depth of 5.0 m. These layers were organized according to 3 different horizons characterized by the soil hydraulic properties acquired from the selected depths of EU Soil Database. The 2 surface layers (0–0.6 m) were associated with the data at 0.3 m depth, the 2 middle layers (0.6–2.0 m) acquired the values at 1.0 m depth, and the information at 2.0 m depth was representative of the 2 bottom layers (2.0–5.0 m) (Table 4). The spatial variation of soil properties in the surface, middle, and bottom horizons are shown in Fig. 4.d, e and f, respectively, with each ID corresponding to a different combination of soil hydraulic data. The f_h parameter relating horizontal and vertical hydraulic conductivities was set to 10.

Table 4 Mualem–van Genuchten hydraulic parameters by soil horizon. θ_r, residual water content; θs, saturated water content; α and η, empirical shape parameters; K_s,vert, saturated hydraulic conductivity

Full size table

As for the input variables used in the neural network model, meteorological data were obtained from ERA5-Reanalysis dataset (Hersbach et al. 2017). For the implementation of MOHID-Land, the meteorological properties incorporated were the total precipitation, air temperature, and dew point temperature (at 2 m height), u and v components of wind velocity (at 10 m height), surface solar radiation downwards, and total cloud cover. Wind velocity was adjusted from 10 to 2 m height and relative humidity was estimated from air and dew point temperatures according to Allen et al. (1998).

Estimation of Maranhão inflow with MOHID-Land

MOHID-Land was directly implemented in the entire Maranhão watershed, but the lack of daily inflow data at the outlet only allowed model calibration and validation to be performed at Ponte Vila Formosa. There, the estimated daily streamflow data were compared with the observed data, and, when model results are similar to the observed values with the model having a good representation of the streamflow generation on that sub-basin, the calibrated parameters were assumed as representatives of the Maranhão watershed. Hence, the daily streamflow estimated by the model in the outlet section was considered to represent the Maranhão reservoir’s inflow and was transformed to monthly volume. The monthly volumes were then validated with a reservoir mass balance identical to the one presented for the validation of 1D-CNN model’s results.

Models’ evaluation

MOHID-Land and 1D-CNN were calibrated/trained using the average daily streamflow in Ponte Vila Formosa hydrometric station. Validation was performed with daily and monthly timesteps. The dataset was also divided into wet (October–March), and dry (April–September) periods and the results were validated, ignoring the division between calibrated/trained.

In the case of MOHID-Land, the calibration period was from 01/01/2002 to 31/01/2003 and the validation was from 01/01/2004 to 31/12/2009. For the 1D-CNN model, each of the 100 runs was evaluated considering the same test dataset presented by Oliveira et al. (2023). For both models, streamflow estimation performance was evaluated in Ponte Vila Formosa station. The analysis was made with four different statistical parameters, namely, the R², the PBIAS, the RMSE, and the NSE

$${R}^{2}={\left[\frac{{\sum }_{i=1}^{p}\left({X}_{i}^{obs}-{X}_{mean}^{obs}\right)\left({X}_{i}^{sim}-{X}_{mean}^{sim}\right)}{\sqrt{{\sum }_{i=1}^{p}{\left({X}_{i}^{obs}-{X}_{mean}^{obs}\right)}^{2}}\sqrt{{\sum }_{i=1}^{p}{\left({X}_{i}^{sim}-{X}_{mean}^{sim}\right)}^{2}}}\right]}^{2}$$

(5)

$$PBIAS=\frac{{\sum }_{i=1}^{p}\left({X}_{i}^{obs}-{X}_{i}^{sim}\right)}{{\sum }_{i=1}^{p}{X}_{i}^{obs}}\times 100$$

(6)

$$RMSE=\sqrt{\frac{1}{p}\sum_{i=1}^{p}{\left({X}_{i}^{obs}-{X}_{i}^{sim}\right)}^{2}}$$

(7)

$$NSE=1-\frac{{\sum }_{i=1}^{p}{\left({X}_{i}^{obs}-{X}_{i}^{sim}\right)}^{2}}{{\sum }_{i=1}^{p}{\left({X}_{i}^{obs}-{X}_{mean}^{obs}\right)}^{2}},$$

(8)

where X_i^obs and X_i^sim are the flow values observed and estimated by the model on day i, respectively. X_mean^obs and X_mean^sim are the average flow considering the observed and the modeled values in the analyzed period, and p is the total number of days/values in this period. According to Moriasi et al. (2007), a model is considered satisfactory when NSE > 0.5, PBIAS ± 25%, and R² > 0.5, while the RMSE represents the standard deviation of the residuals with lower values meaning a better model’s performance.

Maranhão reservoir’s inflow was evaluated with a monthly timestep, since this is the frequency of the data available in the reservoir. Since the models were already calibrated, the validation of the reservoir’s inflow was done for the period comprehended between 01/01/2002 and 31/12/2009.

For the validation process, the monthly water volume reaching the reservoir was incorporated into a mass balance where the observed stored volume from the previous month and the water volume that leaves the reservoir in the current month were also considered

$${V}_{i}^{sim}={V}_{i-1}^{obs}+{VI}_{i}^{sim}-{VO}_{i}^{obs},$$

(9)

where V_i^sim represents the estimated stored volume in month i, V_i-1^obs represents the observed stored volume in the previous month, VI_i^sim is the volume that enters the reservoir in month i resulting from the simulations, and VO_i^obs is the observed volume that leaves the reservoir. The stored volume estimated through the water balance was then compared to the observed stored volume of the corresponding month.

Performance assessment was made by a visual comparison, and it was complemented by the estimation of the R², NSE, PBIAS, RMSE, and the RMSE-observation standard deviation ratio (RSR)

$$RSR=\frac{RMSE}{{STDEV}_{obs}}=\frac{\sqrt{{\sum }_{i=1}^{p}{\left({X}_{i}^{obs}-{X}_{i}^{sim}\right)}^{2}}}{\sqrt{{\sum }_{i=1}^{p}{\left({X}_{i}^{obs}-{X}_{mean}^{obs}\right)}^{2}}},$$

(10)

where X_i^obs and X_i^sim are the stored volume values observed and estimated on month i, respectively, and X_mean^obs and X_mean^sim are the average stored volume in the analyzed period. It is important to note that the typical approach for inflow validation, which considers the direct calculation of inflow values from a massa balance performed in the resevoir, was also tested. However, about 30% of the inflow values estimated with that approach resulted in negative inflow. Because of that, the referred approach was not considered in the study.

Results

1D-CNN at Ponte Vila Formosa

Considering the set of 100 runs performed with the 1D-CNN model and the precipitation of Ponte Vila Formosa watershed, the four statistical parameters used to evaluate model’s performance were calculated for each run and considering the test dataset. Four sets of 100 values were obtained. For each of those sets, the minimum, maximum, average, standard deviation, median, and 1st and 3rd quartiles were estimated and are presented in Table 5.

Table 5 Goodness-of-fit indicators for 1D-CNN model in Ponte Vila Formosa hydrometric station

Full size table

A spread range of results were obtained for the statistical parameters, with RMSE ranging from 1.44 to 3.13 m³ s⁻¹, PBIAS from – 40 to 67%, R² from 0.59 to 0.90, and NSE from 0.42 to 0.88. Although some simulations did not reach the minimum requirements to be classified as satisfactory, most of them got acceptable values, with the 1st quartile presenting a NSE of 0.71 and a R² of 0.75. This means that 75% of the simulations had higher values for NSE and R². However, considering the PBIAS results, the table shows that the value of the 3rd quartile was 25%, which means that a quarter of the simulations present higher PBIAS. In turn, the 1st quartile of this statistical parameter was – 3.5% and the minimum value was – 40.3%, which indicates that from the 25 simulations that present lower PBIAS values, a significant part of them is still considered as having a satisfactory behavior.

The simulation considered as the best in fitting the observed streamflow in Ponte Vila Formosa station presented an NSE of 0.88, a R² of 0.88, a PBIAS of – 7.8%, and a RMSE of 1.44 m³ s⁻¹ (Table 5). Although the R² of this model was not the maximum presented in the table, the combined values of the four statistical parameters represented the best solution, since the simulation with the maximum R² presented a PBIAS of 25%, which relies in the limit of the range for a satisfactory performance.

For an easier comparison with MOHID-Land, the four statistical parameters were also estimated considering the entire dataset, neglecting the first year (2001). Streamflow results show that the model outputs included negative values for 1.5% of the dataset. Since these negative values occurred in isolated days, they were replaced by simply averaging the estimated streamflow from the previous and the next days. Table 6 presents those statistical parameters, while Fig. 5 allows a visual assessment of model’s performance. Table 6 also presents the goodness-of-fit indicators when the simulated interval was divided into wet and dry periods and considering the average monthly streamflow.

Table 6 Goodness-of-fit indicators for 1D-CNN model in Ponte Vila Formosa hydrometric station

Full size table

When considering daily results, the 1D-CNN model demonstrated a very good performance, with the NSE and R² reaching values of 0.65, the PBIAS being – 7.21% and the RMSE as 4.75 m³ s⁻¹. Results were better when average monthly streamflow were considered, with NSE, R², PBIAS, and RMSE of 0.87, 0.87, 2.23%, and 2.01 m³ s⁻¹, respectively. This is justified, because the estimation of the average monthly values smooths out the daily errors. Considering the dry and wet periods, the 1D-CNN model shows a much better performance for the wet period. With the NSE and R² having both values of 0.79 and a PBIAS of 8.62% for the wet period, the dry period obtained only an NSE value of 0.26, the R² decreased to 0.57, and the PBIAS presents a value of -53%.

MOHID-Land at Ponte Vila Formosa

MOHID-Land’s calibration focused on a large number of different parameters related to the porous media, river network, and plant development processes. Among them, the f_h factor and the soil hydraulic parameters were a calibration target. In the river network, the minimum threshold area, the cross-section dimensions, and the Manning coefficient were evaluated, and for the vegetation development, the K_c for different stages, and maximum root depth were also subjected to calibration.

The best solution obtained with MOHID-Land comprehended a river Manning coefficient of 0.035 s m^−1/3 and a minimum threshold area of 1 km². The calibrated cross-section dimensions are presented in Table 2, being clearly larger than those of the model set-up. In porous media, the f_h adopted the value 500, while the saturated water content of each soil type was increased by 10%. Finally, the maximum root depth was 25% to 60% lower than the default values of MOHID-Land’s growth database.

The comparison between the streamflow values registered in Ponte Vila Formosa station and those estimated by MOHID-Land is presented in Fig. 6, with the corresponding statistical parameters shown in Table 7. Table 7 also shows NSE, R², PBIAS, and RMSE for the average monthly streamflow and for the division of the analyzed period into wet and dry seasons.

Table 7 Goodness-of-fit indicators for MOHID-Land model in Ponte Vila Formosa hydrometric station

Full size table

MOHID-Land’s results show the satisfactory performance obtained with this model. It reached an NSE and an R² of 0.65 for the calibration period with a slight decrease in the validation period (0.62 for NSE and 0.63 for R²). PBIAS demonstrated an underestimation of streamflow in calibration and an overestimation during validation, while RMSE values were similar in both periods. When considering the monthly aggregation, the model reached a very good performance, with NSE and R² values above 0.85 in calibration and validation periods. The RMSE showed a decrease in both periods when compared with the daily values. Finally, PBIAS did not suffer significant changes. During the wet period, the performance of the model was better than in the dry period. Although R² showed a better value for the dry period, NSE and PBIAS demonstrated an accentuated decrease in model’s performance in that period, with the first going from 0.61 to 0.39 and the second indicating an overestimation of about 9% in wet period and an underestimation of about 30% in dry period.

Maranhão reservoir’s inflow

The characterization of Maranhão reservoir’s inflow obtained with MOHID-Land and 1D-CNN models from 01/01/2002 until 31/12/2009 is presented in Table 8. The respective flow duration curves are presented in Fig. 7.

Table 8 Maranhão reservoir's inflow characterization for 1D-CNN and MOHID-Land models

Full size table

Results from Table 8 showed a very similar behavior for both models apart from the maximum streamflow value. In that case, the 1D-CNN model presented a maximum streamflow more than twice the maximum streamflow estimated by MOHID-Land. However, MOHID-Land had a slightly higher streamflow average. It indicates that for the middle streamflow values, MOHID-Land tends to overestimate 1D-CNN model. It is also demonstrated in Fig. 7, where it is possible to confirm that for streamflow values with non-exceedance probability between 0 and 0.3, higher values are observed for MOHID-Land.

Regarding the validation of stored volumes considering the reservoir’s mass balance, NSE, R², PBIAS, RMSE, and RSR were estimated for the entire period, and the results are presented in Table 9. Figure 7 presents the graph with the comparison between the two models and the observed stored volumes.

Table 9 NSE, R², PBIAS, and RMSE values for MOHID-Land and 1D-CNN models for reservoir’s stored volume

Full size table

Results showed good agreement between both models and observed values. In fact, 1D-CNN and MOHID-Land presented very similar R² (1D-CNN: 0.84; MOHID-Land: 0.85) and RMSE (1D-CNN: 18.62 hm³; MOHID-Land: 18.61 hm³) values. NSE and RSR were equal in both cases, while PBIAS was the parameter in which some difference is observed. With a PBIAS of -0.55% for 1D-CNN model and -1.18% for MOHID-Land model, both models were slightly underestimating the reservoir’s inflow. MOHID-Land showed a higher tendency for that underestimation.

Discussion

1D-CNN model

The 1D-CNN model had already demonstrated its adequacy to predict streamflow in the sub-basin of Ponte Vila Formosa station as demonstrated in Oliveira et al. (2023). The approach presented here, where 100 simulations were performed with the same 1D-CNN structure, allowed to slightly improve the results obtained in that study. Thus, the best solution had an NSE and an R² of 0.88, a PBIAS of – 7.80%, and an RMSE of 1.44 m³ s⁻¹, considering the test dataset. Results also show that half of the 100 simulations obtained a NSE higher than 0.74 and/or a R² above 0.79. The same number of simulations got a PBIAS lower than 9.52%. It indicates the suitability of the developed structure for streamflow estimation.

The results of the 1D-CNN model are in accordance with the results of several authors. Barino et al. (2020) used two 1D-CNN models to predict multi-day ahead river flow in Madeira River, a tributary of the Amazon River, Brazil. One of those models considered only the river flow in previous days, while the other considered that same variable combined with the turbidity. Both models obtained NSE and R² values higher than 0.92, while mean absolute percentage error (MAPE) and normalized RMSE were lower than 25% and 0.20, respectively. Among the models analyzed by Huang et al. (2020), two CNN models were studied to forecast a day ahead streamflow. Considering the lagged streamflow values of the past 16 days in the site to be forecasted and in the neighborhood, a generic CNN model and a CNN model trained with a transfer learning procedure were tested. With four different locations in the United Kingdom being the studied, the generic CNN model obtained MAPE values between 14.36% and 41.95%, while the MAPE of the other CNN model laid between 12.29% and 32.17%. Duan et al. (2020) considered the watersheds within the Catchment Attributes for Large-Sample Studies (CAMELS dataset), in California, USA, to test a temporal CNN model. The model was developed for long-term streamflow projection and consisted of a one-dimensional network that used dilated causal convolutions. As input variables, authors elected precipitation, temperature, and solar radiation and tested different time window sizes to delay the values. After performing 15 runs for each watershed in the CAMELS dataset, the average NSE was 0.55, while the average NSE for the best run over all basins was 0.65. Finally, a CNN model was employed by Song (2020) to estimate daily streamflow in Heuk River watershed, in South Korea. Using rainfall, runoff, soil map, and land-use data, authors generated a hydrological image based on curve number method to feed the neural network and estimate streamflow in the watershed. Model evaluation resulted on a coefficient of correlation of 0.87 and a NSE of 0.60.

Usually, in machine learning methods, better results are verified when antecedent streamflow is considered as a forcing variable (Barino et al. 2020; Khosravi et al. 2022). However, when the model is used in the simulation of future scenarios or periods when no observed data are available, the antecedent streamflow values to feed the model are those already calculated by the model in the previous iterations. Consequently, the propagation and exacerbation of errors in the estimates can lead to a degradation of the results in the long-term. There are also other types of machine learning methods for streamflow estimation emerging in the last few years. For instance, Si et al. (2021) considered a graphical convolutional GRU model to predict the streamflow in the next 36 h hours, while Szczepanek (2022) used three different models, namely, XGBoost, LightGBM, and CatBoost, for daily streamflow forecast. Additionally, hybrid solutions considering different machine learning algorithms, such as Di Nunno et al. (2023) and Yu et al. (2023), are becoming widely used and with improved results.

MOHID-Land model

MOHID-Land daily results demonstrated to be satisfactory. With an NSE and an R² higher than 0.62 and 0.63, respectively, and a PBIAS between – 7% and 4%, and an average RMSE of 5.6 m³ s⁻¹, these results were substantially better than those presented by Almeida et al. (2018) for the same study area. Using Soil Water Assessment Tool (SWAT), the authors compared the daily streamflow also in Ponte Vila Formosa station. They obtained an NSE, an R², a bias, and an RMSE of – 3.05, 0.31, 2.93, and 12.61 m³ s⁻¹, respectively, for the calibration period. For the validation, the NSE was 0.11, the R² was 0.24, and the bias and RMSE were – 0.46 and 15.21 m³ s⁻¹, respectively. Almeida et al. (2018) also made a daily comparison in Moinho Novo hydrometric station, which is located in Montargil watershed and is very similar to Maranhão watershed sharing boundaries between them. For Moinho Novo station, the authors obtained for calibration and validation periods, respectively, an NSE of 0.22 and 0.39, an R² of 0.41 in both cases, a bias of 0.90 and – 1.07, and an RMSE of 13.1 and 16.6 m³ s⁻¹. Bessa Santos et al. (2019) estimated the daily streamflow in Sabor River watershed, placed in Northeast Portugal and with an area of 3170 km². Using SWAT model, they compared the modeled and observed river flow values and the results reached an NSE of 0.62 and 0.61 for calibration and validation periods, respectively, and a R² for those same periods of 0.63 and 0.80. The PBIAS was 2.7% for calibration and -24% for validation, while RSR for calibration and validation was 0.62 and 0.63, respectively. Considering Pracana watershed, located in Central Portugal, Demirel et al. (2009) also used SWAT model to predict daily streamflow. Authors classified the model as having a poor peak magnitude estimation.

Considering the monthly values, MOHID-Land’s performance increased substantially when compared with the daily values. The results reached an NSE of 0.85 and 0.92 and a R² of 0.86 and 0.95 for calibration and validation periods, respectively. PBIAS and RMSE also demonstrated the very good behavior of the model. Those parameters obtained very good results for the calibration and validation periods, with PBIAS indicating a slight underestimation during calibration (-6.59%) and an overestimation (4.15%) during validation, and the RMSE being about 2 m³ s⁻¹ for both periods. In line with this work, Brito et al. (2018) used SWAT for long-term forecasts of monthly Enxoé reservoir’s inflow. With that watershed located in South Portugal and draining an area of 60 km², authors reached an NSE of 0.78 and an R² of 0.77. Almeida et al. (2018) also presented a monthly analysis for Ponte Vila Formosa station, with SWAT obtaining an NSE of – 1.26 and 0.40 for calibration and validation periods. For calibration and validation, respectively, R² reached values of 0.58 and 0.54, the bias was 2.97 and – 0.42, and the RMSE 6.04 and 5.93 m³ s⁻¹. Ponte Vila Formosa streamflow was also modeled by van der Laan et al. (2023) with SWAT model. They obtained an NSE, an R², and a PBIAS for calibration period of 0.76, 0.77, and – 7.1%, respectively. For the validation period, the NSE was 0.89, the R² was 0.9, and PBIAS was 15%.

The comparisons presented above allowed to conclude that MOHID-Land’s performance is in line with the other studies carried out in Portuguese watersheds for daily streamflow estimation. The exception was the study performed by Almeida et al. (2018) where the simulation of the same sub-basin that was being modeled here obtained a much poorer performance there. When monthly streamflow was considered, MOHID-Land’s performance surpassed the results obtained with SWAT model for the same or identical sub-basins. The difference in the performance of the models is justified by the fact that SWAT is more empirically parametrized than MOHID-Land. For instance, MOHID-Land explicitly estimates the infiltration and porous media fluxes based on Darcy’s law and Richards equation, respectively, with the remaining water transformed into surface runoff where fluxes are estimated based on Saint–Venant equation. On the other hand, in SWAT, a baseflow factor, which is a direct index of groundwater flow response to changes in recharge, or a surface runoff lag coefficient to control the fraction of the total available water that will be allowed to enter the reach on 1 day, needs to be defined. The empirical parametrization of some processes prevents a more accurate representation of reality, leading to more errors in estimates and the degradation of the overall performance, especially beyond the period of calibration.

Nonetheless, MOHID-Land has its own limitations. In one hand, the implementation effort is significatively high, with several parameters needing to be defined, such as the six hydraulic parameters of all the soil types, the crop coefficients for each type of vegetation, the surface and the river Manning coefficients, and others. The high number of input data, parameters, and variables that the user should define conduces to an extremely high number of parameters that can be calibrated, which can be time-consuming. A consequence of this is reflected in the number of simulations performed to reach the best solution. In this study, more than 70 simulations were made to test the sensitivity of the MOHID-Land to other parameters than those studied by Oliveira et al. (2020), and to obtain the combination that allows a good fit between modeled and observed streamflow. On the other hand, the empirical representation of parts of the hydrological processes or the generalization of some parameters can make the representation of the modeled system difficult, leading to values of the calibrated parameters outside the normal ranges. That condition is here verified with the crop coefficients calibrated for the summer and winter crops, which are considered too low.

Models’ comparison

Overall, the 1D-CNN model demonstrated a better performance than MOHID-Land model for daily streamflow estimation in Ponte Vila Formosa station. However, when the results are aggregated by month, MOHID-Land’s performance surpassed the 1D-CNN results.

Focusing on wet and dry periods, it is interesting to verify that the results of both models complement each other. If on one hand, the 1D-CNN obtained a performance for the wet period better than that obtained by MOHID-Land, on the other hand, during the dry period, MOHID-Land demonstrated a better performance. Thus, in the first case, both models achieved satisfactory performances, but the 1D-CNN, with an NSE and R² of 0.79, was better than MOHID-Land, which obtained an NSE of 0.61 and an R² of 0.63. In the second case, the dry period, both models experienced a decrease in their performances, but MOHID-Land, with an NSE of 0.39 and an R² of 0.69, performed better than the 1D-CNN model, which obtained an NSE of 0.26 and an R² of 0.56. These results put in evidence the difficulty of MOHID-Land in estimating the peak flow events, but also a better ability to simulate the transitions between the wet and dry periods when compared to the 1D-CNN. It can also be verified in Figs. 5 and 6, where the results for MOHID-Land demonstrate a more natural behavior than those obtained for 1D-CNN model.

The more irregular behavior of 1D-CNN model is in part justified by the fact that these types of models have not a physical basis, which means that the streamflow estimation does not consider physical laws or limitations. This characteristic of neural network models also justifies the difficulty in avoiding the existence of negative streamflow values. Although other authors did not refer to this issue, it was verified in this study and should not be ignored, since it can limit the application of the model.

Models’ extension to Maranhão watershed

The streamflow estimated by the extension of 1D-CNN and MOHID-Land models to the entire Maranhão watershed was made by the adaptation of the trained and calibrated models to that watershed. Thus, the 1D-CNN presents a maximum inflow value substantially higher than the maximum predicted by MOHID-Land, which is related to the fact that MOHID-Land demonstrated some difficulty in reproducing peaks flow (Table 8). The remaining statistics are similar between both models, with the minimum streamflow near 0 m³ s⁻¹, the average is between 3.6 and 3.9 m³ s⁻¹, and the median is 1.9 and 1.6 m³ s⁻¹ for 1D-CNN and MOHID-Land.

The evaluation of the inflow values based on the mass balance at the reservoir scale showed a very good performance when using 1D-CNN and MOHID-Land (Table 9). Both models have NSE and RSR of 0.79 and 0.46, respectively. R² is 0.84 for 1D-CNN and 0.85 for MOHID-Land and the RMSE is 18.6 m³ s⁻¹ for both models. The higher difference in the statistical parameters is in the PBIAS with the 1D-CNN underestimating – 0.55% and the MOHID-Land also presenting an underestimation, but a little higher, of about – 1.18%. Visually, it is also possible to verify slight differences between the stored volume estimated with inflow from 1D-CNN model and from MOHID-Land model (Fig. 8), with the main differences occurring in the wet season (October–March).

In a similar approach but considering the continuous simulation of the stored water in two reservoirs included in the same modeled watershed, Rocha et al. (2020) found identical results. Using SWAT model to Monte Novo and Vigia reservoirs, in South Portugal, the authors validated the stored volume of both reservoirs with a monthly timestep, obtaining an NSE of 0.44 and a PBIAS of 6.3% for Monte Novo reservoir and an NSE of 0.70 and PBIAS of 10.1% for Vigia reservoirs.

In this case, models were extended to an ungauged watershed, which physical characteristics and the rainfall regime are similar to those verified in the sub-basin where the models were trained or calibrated. In that sense, the question that arises from this study is about the behavior of this expanding approach when larger watersheds, marked by diversified characteristics and rainfall regimes, are the target of the study. In those cases, the calibrated parameters cannot be representative or even represented in the expanded area, for the typical hydrological models, or the differences in the rainfall regime when considering the expanded area cannot be correctly related with the runoff values, which was already referred to by Parisouj et al. (2020).

Finally, it is important to note that several sources of uncertainty are involved in modeling Ponte Vila Formosa watershed, but also in expanding the optimized models to the entire watershed. Besides difficulties in correctly considering the differences between monitored and unmonitored areas, models also have their own uncertainty. On one hand, the limitations of model developers and users in correctly representing real systems through the structure of a hydrologic model and approximations made by numerical methods result in residual model errors and, therefore, in model output uncertainties (Loucks and van Beek 2017). However, the attempt of improve the representation of reality through the increase of model complexity results in adding the cost of data collection and may also introduce more parameters needing to be defined, which can then result in more potential sources of error in model output. On the other hand, Gal and Ghahramani (2016) focused on the high levels of uncertainty when using deep learning tools for regression and classification, even with simple modeling structures. In that sense, further investigation should be carried out concerning the expansion of both models and the involved uncertainty. For a better understanding, for example, several instances of the same model, with slight but coherent differences in the parametrization, can be taken into account, with the calculation of the streamflow resulting from the combination of those instances and considering the estimation of confidence intervals.

Conclusions

The proposed approach showed the adequateness of implementing a 1D-CNN model and a physically based model for estimating daily streamflow generation at the outlet of an ungauged watershed after prior calibration/validation of those models in a sub-basin of the same catchment. Considering the sub-basin modeling, the 1D-CNN model demonstrated a better performance than MOHID-Land when considering the daily values and the wet period. The MOHID-Land model showed a better performance in estimating streamflow values during dry periods and for a monthly analysis. When the validation of the reservoir mass balance was considered, the results showed an identical behavior for both models, with only a slight difference in the PBIAS. That difference indicates a smaller underestimation of inflow by the 1D-CNN than that estimated by MOHID-Land.

Although the results were considered from satisfactory to very good in all the steps taken during the validation process, the generation of negative values by the 1D-CNN is of concern. In that sense, the model presented here should be a target of improvement in future applications. In turn, MOHID-Land model revealed a lower performance for daily streamflow estimation, but its physical basis contributes to avoiding unpredictable and incomprehensible results.

Finally, it is worth noting that neural network models are developed and trained for present and/or past conditions, and their application to future scenarios can be limited. Also, the prediction of events that go beyond the observations can be problematic. This limitation is mainly related to its lack of capacity to absorb information about future conditions in cases where neural networks were not prepared to be forced by variables that include the impact of those future changes. Nonetheless, the changes in future conditions can be easily imposed in physically based models, with the main problems being: (i) the detail of the characterization of future conditions, that most of the time is too coarse for the detail adopted on physical models; and (ii) the high computational time needed to run long-term simulations, usually performed in analysis of future scenarios. Thus, hybrid solutions, combining different types of models or different models, can be used to incorporate the predicted changes in neural network models.

Data availability

Not applicable.

References

Abadi M, Barham P, Chen J, Chen Z, Davis A, Dean J, Devin M, Ghemawat S, Irving G, Isard M et al. (2016) Tensorflow: A system for large-scale machine learning. In: Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI ’16), Savannah, GA, USA, 265–283.
Abbott MB, Bathurst JC, Cunge JA, O’Connell PE, Rasmussen J (1986a) An Introduction to the European Hydrological System — Systeme Hydrologique Europeen, “SHE”, 1: History and Philosophy of a Physically-Based, Distributed Modelling System. J Hydrol 87:45–59. https://doi.org/10.1016/0022-1694(86)90114-9
Article Google Scholar
Abbott MB, Bathurst JC, Cunge JA, O’Connell PE (1986b) Rasmussen J (1986b) An Introduction to the European Hydrological System — Systeme Hydrologique Europeen, “SHE”, 2: Structure of a Physically-Based. Distrib Model Syst J Hydrol 87:61–77. https://doi.org/10.1016/0022-1694(86)90115-0
Article Google Scholar
Agencia Estatal de Meteorología (España) Atlas Climático Ibérico: Temperatura Del Aire y Precipitación (1971–2000)=Atlas Climático Ibérico: Temperatura Do Ar e Precipitação (1971–2000)=Iberian Climate Atlas : Air Temperature and Precipitation (1971–2000) (2011) Instituto Nacional de Meteorología: Madrid. ISBN 978–84–7837–079–5.
Allen RG, Pereira LS, Raes D, Smith M (1998) Crop Evapotranspiration - Guidelines for Computing Crop Water Requirements. FAO Irrigation and Drainage Paper 56:327
Google Scholar
Almeida C, Ramos T, Segurado P, Branco P, Neves R, Proença de Oliveira R (2018) Water Quantity and Quality under Future Climate and Societal Scenarios: A Basin-Wide Approach Applied to the Sorraia River. Portugal Water 10:1186. https://doi.org/10.3390/w10091186
Article Google Scholar
Alzubaid L, Zhang J, Humaidi AJ, Al-Dujaili A, Duan Y, Al-Shamma O, Santamaría J, Fadhel MA, Al-Amidie M, Farhan L (2021) Review of Deep Learning: Concepts, CNN Architectures, Challenges, Applications. Future Directions J Big Data 8:53. https://doi.org/10.1186/s40537-021-00444-8
Article Google Scholar
Andreadis KM, Schumann GJ-P, Pavelsky T (2013) A Simple Global River Bankfull Width and Depth Database: Data and Analysis Note. Water Resour Res 49:7164–7168. https://doi.org/10.1002/wrcr.20440
Article Google Scholar
ARBVS (2022) Área Regada. https://www.arbvs.pt/index.php/culturas/area-regada. Accessed 18 October 2022.
ASCE, Task Committee on Hydrology Handbook of Management Group D of ASCE (1996) Hydrology Handbook (Second Edition). American Society of Civil Engineers. https://doi.org/10.1061/9780784401385
Ashrafi SM, Mahmoudi M (2019) Developing a Semi-Distributed Decision Support System for Great Karun Water Resources System. J. Appl. Res. Water Wastewater 6(1):16–24. https://doi.org/10.22126/arww.2019.1042
Barino FO, Silva VNH, Lopez-Barbero AP, De Mello HL, Santos ABD (2020) Correlated Time-Series in Multi-Day-Ahead Streamflow Forecasting Using Convolutional Networks. IEEE Access 8:215748–215757. https://doi.org/10.1109/ACCESS.2020.3040942
Article Google Scholar
Bessa Santos RM, Sanches Fernandes LF, Vitor Cortes RM, Leal Pacheco FA (2019) Development of a Hydrologic and Water Allocation Model to Assess Water Availability in the Sabor River Basin (Portugal). Int J Environ Res Public Health 16:2419. https://doi.org/10.3390/ijerph16132419
Article Google Scholar
Brito D, Ramos TB, Gonçalves MC, Morais M, Neves R (2018) Integrated Modelling for Water Quality Management in a Eutrophic Reservoir in South-Eastern Portugal. Environ Earth Sci 77:40. https://doi.org/10.1007/s12665-017-7221-5
Article CAS Google Scholar
Canuto N, Ramos TB, Oliveira AR, Simionesei L, Basso M, Neves R (2019) Influence of Reservoir Management on Guadiana Streamflow Regime. Journal of Hydrology: Regional Studies 25:100628. https://doi.org/10.1016/j.ejrh.2019.100628
Article Google Scholar
Chollet F et al (2015) Keras https://github.com/fchollet/keras. Accessed 20 March 2021.
CLC 2012, Corine Land Cover 2012 2019 © European Union, Copernicus Land Monitoring Service 2018, European Environment Agency (EEA) https://land.copernicus.eu/pan-european/corine-land-cover. Accessed 22 June 2019.
Corbari C, Ravazzani G, Galvagno M, Cremonese E, Mancini M (2017) Assessing Crop Coefficients for Natural Vegetated Areas Using Satellite Data and Eddy Covariance Stations. Sensors 17:2664. https://doi.org/10.3390/s17112664
Article Google Scholar
Delaney CJ, Hartman RK, Mendoza J, Dettinger M, Delle Monache L, Jasperse J, Ralph FM, Talbot C, Brown J, Reynolds D, Evett S (2020) Forecast Informed Reservoir Operations Using Ensemble Streamflow Predictions for a Multipurpose Reservoir in Northern California. Water Resour. Res., 56. https://doi.org/10.1029/2019WR026604
Demirel MC, Venancio A, Kahya E (2009) Flow Forecast by SWAT Model and ANN in Pracana Basin. Portugal Adv Eng Softw 40:467–473. https://doi.org/10.1016/j.advengsoft.2008.08.002
Article Google Scholar
Devia GK, Ganasri BP, Dwarakish GS (2015) A Review on Hydrological Models. Aquat Procedia 4:1001–1007. https://doi.org/10.1016/j.aqpro.2015.02.126
Article Google Scholar
Di Nunno F, De Marinis G, Granata F (2023) Short-Term Forecasts of Streamflow in the UK Based on a Novel Hybrid Artificial Intelligence Algorithm. Sci Rep 13:7036. https://doi.org/10.1038/s41598-023-34316-3
Article CAS Google Scholar
Duan S, Ullrich P, Shu L (2020) Using Convolutional Neural Networks for Streamflow Projection in California. Front Water 2:28. https://doi.org/10.3389/frwa.2020.00028
Article Google Scholar
EU-DEM (European Digital Elevation Model) (2019) © European Union, Copernicus Land Monitoring Service 2019, European Environment Agency (EEA) https://land.copernicus.eu/pan-european/satellite-derived-products/eu-dem/eu-dem-v1.1/view. Accessed 15 May 2019.
Fatichi S, Vivoni ER, Ogden FL, Ivanov VY, Mirus B, Gochis D, Downer CW, Camporese M, Davison JH, Ebel B, Jones N, Kim J, Mascaro G, Niswonger R, Restrepo P, Rigon R, Shen C, Sulis M, Tarboton D (2016) An Overview of Current Applications, Challenges, and Future Trends in Distributed Process-Based Models in Hydrology. J Hydrol 537:45–60. https://doi.org/10.1016/j.jhydrol.2016.03.026
Article Google Scholar
Feddes RA, Kowalik PJ, Zaradny H (1978) Simulation of field water use and crop yield. Centre for Agricultural Publishing and Documentation, Wageningen
Google Scholar
Gal Y, Ghahramani Z (2016) Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning. Proceedings of the 33rd International Conference on Machine Learning, in Proceedings of Machine Learning Research, 48:1050–1059. https://proceedings.mlr.press/v48/gal16.html.
HAZRUNOFF Project - Layman’s Report (2020) HazRunoff: Integration of sensing and modelling technologies for early detection and follow-up of hazmat and flood hazards in transitional and coastal waters http://www.hazrunoff.eu/wp-content/uploads/2020/06/HAZRUNOFF_Layman-Report.pdf. Accessed 10 May 2023.
Hersbach H, Bell B, Berrisford P, Hirahara S, Horányi A, Muñoz‐Sabater J, Nicolas J, Peubey C, Radu R, Schepers D, Simmons A, Soci C, Abdalla S, Abellan X, Balsamo G, Bechtold P, Biavati G, Bidlot J, Bonavita M et al. (2017) Complete ERA5 from 1979: Fifth generation of ECMWF atmospheric reanalyses of the global climate. Copernicus Climate Change Service (C3S) Data Store (CDS).
Huang C, Zhang J, Cao L, Wang L, Luo X, Wang J-H, Bensoussan A (2020) Robust Forecasting of River-Flow Based on Convolutional Neural Network. IEEE Trans Sustain Comput 5:594–600. https://doi.org/10.1109/TSUSC.2020.2983097
Article Google Scholar
Jovanovic N, Pereira LS, Paredes P, Pôças I, Cantore V, Todorovic M (2020) A Review of Strategies, Methods and Technologies to Reduce Non-Beneficial Consumptive Water Use on Farms Considering the FAO56 Methods. Agric Water Manag 239:106267. https://doi.org/10.1016/j.agwat.2020.106267
Article Google Scholar
Khosravi K, Golkarian A, Tiefenbacher JP (2022) Using Optimized Deep Learning to Predict Daily Streamflow: A Comparison to Common Machine Learning Algorithms. Water Resour Manag 36:699–716. https://doi.org/10.1007/s11269-021-03051-7
Article Google Scholar
Loucks DP, Van Beek E (2017) Water Resource Systems Planning and Management. Springer International Publishing. https://doi.org/10.1007/978-3-319-44234-1
Article Google Scholar
McKinney W (2010) Data Structures for Statistical Computing in Python. In S. van der Walt & J. Millman (Eds.), Proceedings of the 9th Python in Science Conference. 56–61. https://doi.org/10.25080/Majora-92bf1922-00a
Mehr AD, Kahya E, Olyaie E (2013) Streamflow Prediction Using Linear Genetic Programming in Comparison with a Neuro-Wavelet Technique. J Hydrol 505:240–249. https://doi.org/10.1016/j.jhydrol.2013.10.003
Article Google Scholar
Miser HJ, Quade ES (1985) Handbook of Systems Analysis: Overview of Uses, Procedures, Applications, and Practice. Noth-Holland. ISBN 978–0444009180.
Moriasi DN, Arnold JG, van Liew MW, Bingner RL, Harmel RD, Veith TL (2007) Model Evaluation Guidelines for Systematic Quantification of Accuracy in Watershed Simulations. Trans Am Soc Agric Biol Eng 50:885–900
Google Scholar
Mualem Y (1976) A New Model for Predicting the Hydraulic Conductivity of Unsaturated Porous Media. Water Resour Res 12:513–522. https://doi.org/10.1029/WR012i003p00513
Article Google Scholar
Navarro-Hellín H, Martínez-del-Rincon J, Domingo-Miguel R, Soto-Valles F, Torres-Sánchez R (2016) A Decision Support System for Managing Irrigation in Agriculture. Comput Electron Agric 124:121–131. https://doi.org/10.1016/j.compag.2016.04.003
Article Google Scholar
Neitsch SL, Arnold JG, Kiniry JR, Williams JR (2011) Soil and Water Assessment Tool Theoretical Documentation Version 2009.
O’Malley T, Bursztein E, Long J, Chollet F, Jin H, Invernizzi L (2019) Keras Tuner https://github.com/keras-team/keras-tuner. Accessed 30 May 2021.
Oliveira AR, Ramos TB, Simionesei L, Pinto L, Neves R (2020) Sensitivity Analysis of the MOHID-Land Hydrological Model: A Case Study of the Ulla River Basin. Water 12:3258. https://doi.org/10.3390/w12113258
Article Google Scholar
Oliveira AR, Ramos TB, Simionesei L, Gonçalves MC, Neves R (2022) Modeling Streamflow at the Iberian Peninsula Scale Using MOHID-Land: Challenges from a Coarse Scale Approach. Water 14:1013. https://doi.org/10.3390/w14071013
Article Google Scholar
Oliveira AR, Ramos TB, Neves R (2023) Streamflow Estimation in a Mediterranean Watershed Using Neural Network Models: A Detailed Description of the Implementation and Optimization. Water 15(5):947. https://doi.org/10.3390/w15050947
Article Google Scholar
Panagos P, Van Liedekerke M, Jones A, Montanarella L (2012) European Soil Data Centre: Response to European Policy Support and Public Data Requirements. Land Use Policy 29:329–338. https://doi.org/10.1016/j.landusepol.2011.07.003
Article Google Scholar
Parisouj P, Mohebzadeh H, Lee T (2020) Employing Machine Learning Algorithms for Streamflow Prediction: A Case Study of Four River Basins with Different Climatic Zones in the United States. Water Resour Manag 34:4113–4131. https://doi.org/10.1007/s11269-020-02659-5
Article Google Scholar
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V et al. (2011) Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 12(Oct), 2825–2830.
Pereira L, Cordery I, Iacovides I (2009) Coping with Water Scarcity: Addressing the Challenges, ISBN 978–1–4020–9578–8.
Pestana R, Matias M, Canelas R, Araújo A, Roque D, Van Zeller E, Trigo-Teixeira A, Ferreira R, Oliveira R, Heleno S (2013) Calibration of 2D Hydraulic Inundation Models in the Floodplain Region of the Lower Tagus River. In Proceedings of the Proc. ESA Living Planet Symposium 2013; Edinburgh, UK.
Pörtner H-O, Roberts DC, Tignor M, Poloczanska ES, Mintenbeck K, Alegría A, Craig M, Langsdorf S, Löschke S, Möller V et al. (2022) IPCC, 2022: Climate Change 2022: Impacts, Adaptation and Vulnerability. Contribution of Working Group II to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change; Cambridge University Press: Cambridge University Press, Cambridge, UK and New York, NY, USA.
Ramos TB, Simionesei L, Jauch E, Almeida C, Neves R (2017) Modelling Soil Water and Maize Growth Dynamics Influenced by Shallow Groundwater Conditions in the Sorraia Valley Region. Portugal Agric Water Manag 185:27–42. https://doi.org/10.1016/j.agwat.2017.02.007
Article Google Scholar
Ranatunga T, Tong STY, Yang YJ (2016) An Approach to Measure Parameter Sensitivity in Watershed Hydrological Modelling. Hydrol. Sci. J. 1–17. https://doi.org/10.1080/02626667.2016.1174335
Rocha J, Carvalho-Santos C, Diogo P, Beça P, Keizer JJ, Nunes JP (2020) Impacts of Climate Change on Reservoir Water Availability, Quality and Irrigation Needs in a Water Scarce Mediterranean Region (Southern Portugal). Sci Total Environ 736:139477. https://doi.org/10.1016/j.scitotenv.2020.139477
Article CAS Google Scholar
Simionesei L, Ramos TB, Palma J, Oliveira AR, Neves R (2020) IrrigaSys: A Web-Based Irrigation Decision Support System Based on Open Source Data and Technology. Comput Electron Agric 178:105822. https://doi.org/10.1016/j.compag.2020.105822
Article Google Scholar
Šimůnek J, Hopmans JW (2009) Modeling Compensated Root Water and Nutrient Uptake. Ecol Modell 220:505–521. https://doi.org/10.1016/j.ecolmodel.2008.11.004
Article Google Scholar
Sitterson J, Knightes C, Parmar R, Wolfe K, Muche M, Avant B (2017) An Overview of Rainfall-Runoff Model Types. U.S. Environmental Protection Agency, Washington, DC, EPA/600/R-17/482.
Skaggs TH, van Genuchten MTh, Shouse PJ, Poss JA (2006) Macroscopic Approaches to Root Water Uptake as a Function of Water and Salinity Stress. Agric Water Manag 86:140–149. https://doi.org/10.1016/j.agwat.2006.06.005
Article Google Scholar
SNIRH, 2021 Sistema Nacional de Informação de Recursos Hídricos. Available online: https://snirh.apambiente.pt/index.php?idMain=. Accessed 7 February 2021.
Song CM (2020) Hydrological Image Building Using Curve Number and Prediction and Evaluation of Runoff through Convolution Neural Network. Water 12:2292. https://doi.org/10.3390/w12082292
Article CAS Google Scholar
Szczepanek R (2022) Daily Streamflow Forecasting in Mountainous Catchment Using XGBoost. LightGBM and CatBoost Hydrology 9:226. https://doi.org/10.3390/hydrology9120226
Article Google Scholar
Teodosiu C, Ardeleanu C, Lupu L (2009) An overview of decision support systems for integrated water resources management. Environ Eng Manag J 8:153–162. https://doi.org/10.30638/eemj.2009.023
Article Google Scholar
Tóth B, Weynants M, Pásztor L, Hengl T (2017) 3D Soil Hydraulic Database of Europe at 250 m Resolution. Hydrol Process 31:2662–2666. https://doi.org/10.1002/hyp.11203
Article Google Scholar
Trancoso AR, Braunschweig F, Chambel Leitão P, Obermann M, Neves R (2009) An advanced modelling tool for simulating complex river systems. Sci Total Environ 407:3004–3016. https://doi.org/10.1016/j.scitotenv.2009.01.015
Article CAS Google Scholar
van der Laan E, Nunes JP, Dias LF, Carvalho S, Dos Santos FM (2023) Climate Change Adaptability of Sustainable Land Management Practices Regarding Water Availability and Quality: A Case Study in the Sorraia Catchment, Portugal (preprint).
van Genuchten MTh (1980) A Closed-Form Equation for Predicting the Hydraulic Conductivity of Unsaturated Soils. Soil Sci Soc Am J 44:892–898. https://doi.org/10.2136/sssaj1980.03615995004400050002x
Article Google Scholar
Williams JR, Jones CA, Kiniry JR, Spanel DA (1989) The EPIC Crop Growth Model. Transactions of the ASAE 32:0497–0511. https://doi.org/10.13031/2013.31032
Yu Q, Jiang L, Wang Y, Liu J (2023) Enhancing Streamflow Simulation Using Hybridized Machine Learning Models in a Semi-Arid Basin of the Chinese Loess Plateau. J Hydrol 617:129115. https://doi.org/10.1016/j.jhydrol.2023.129115
Article Google Scholar
Zhang D, Chen X, Yao H (2015a) Development of a Prototype Web-Based Decision Support System for Watershed Management. Water 7:780–793. https://doi.org/10.3390/w7020780
Article Google Scholar
Zhang X, Peng Y, Zhang C, Wang B (2015b) Are Hybrid Models Integrated with Data Preprocessing Techniques Suitable for Monthly Streamflow Forecasting? Some Exp Evid J Hydrol 530:137–152. https://doi.org/10.1016/j.jhydrol.2015.09.047
Article Google Scholar

Download references

Funding

Open access funding provided by FCT|FCCN (b-on). This research was supported by FCT/MCTES (PIDDAC) through project LARSyS–FCT pluriannual funding 2020–2023 (UIDP/50009/2020). T. B. Ramos was supported by a CEEC-FCT Contract (CEECIND/01152/2017).

Author information

Authors and Affiliations

Centro de Ciência e Tecnologia do Ambiente e do Mar (MARETEC-LARSyS), Instituto Superior Técnico, Universidade de Lisboa, Av. Rovisco Pais, 1, 1049-001, Lisbon, Portugal
Ana R. Oliveira, Tiago B. Ramos, Lucian Simionesei & Ramiro Neves

Authors

Ana R. Oliveira
View author publications
You can also search for this author in PubMed Google Scholar
Tiago B. Ramos
View author publications
You can also search for this author in PubMed Google Scholar
Lucian Simionesei
View author publications
You can also search for this author in PubMed Google Scholar
Ramiro Neves
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.R.O. was responsible for the conceptualization, software, formal analysis, and writing of the original draft. The methodology was elaborated by A.R.O., L.S., and T.B.R. T.B.R., L.S., and R.N. revised the manuscript.

Corresponding author

Correspondence to Ana R. Oliveira.

Ethics declarations

Conflict of interest

The authors have no relevant financial or non-financial interests to disclose.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Oliveira, A.R., Ramos, T.B., Simionesei, L. et al. Assessing the reliability of a physical-based model and a convolutional neural network in an ungauged watershed for daily streamflow calculation: a case study in southern Portugal. Environ Earth Sci 83, 215 (2024). https://doi.org/10.1007/s12665-024-11498-1

Download citation

Received: 06 June 2023
Accepted: 07 February 2024
Published: 25 March 2024
DOI: https://doi.org/10.1007/s12665-024-11498-1

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Assessing the reliability of a physical-based model and a convolutional neural network in an ungauged watershed for daily streamflow calculation: a case study in southern Portugal

Abstract

Similar content being viewed by others

Scale Effects of the Monthly Streamflow Prediction Using a State-of-the-art Deep Learning Model

Deep neural network-based discharge prediction for upstream hydrological stations: a comparative study

Multiple data-driven approaches for estimating daily streamflow in the Kone River basin, Vietnam

Introduction

Materials and methods

Description of the study area

Convolutional neural network model description

Input variables for 1D-CNN model

Estimation of Maranhão inflow with 1D-CNN

MOHID-Land model description

Model set-up

Estimation of Maranhão inflow with MOHID-Land

Models’ evaluation

Results

1D-CNN at Ponte Vila Formosa

MOHID-Land at Ponte Vila Formosa

Maranhão reservoir’s inflow

Discussion

1D-CNN model

MOHID-Land model

Models’ comparison

Models’ extension to Maranhão watershed

Conclusions

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation