Physically based vs. data-driven models for streamflow and reservoir volume prediction at a data-scarce semi-arid basin

Özdoğan-Sarıkoç, Gülhan; Dadaser-Celik, Filiz

doi:10.1007/s11356-024-33732-w

Physically based vs. data-driven models for streamflow and reservoir volume prediction at a data-scarce semi-arid basin

Research Article
Open access
Published: 29 May 2024

Volume 31, pages 39098–39119, (2024)
Cite this article

Download PDF

You have full access to this open access article

Environmental Science and Pollution Research Aims and scope Submit manuscript

Physically based vs. data-driven models for streamflow and reservoir volume prediction at a data-scarce semi-arid basin

Download PDF

476 Accesses
Explore all metrics

Abstract

Physically based or data-driven models can be used for understanding basinwide hydrological processes and creating predictions for future conditions. Physically based models use physical laws and principles to represent hydrological processes. In contrast, data-driven models focus on input–output relationships. Although both approaches have found applications in hydrology, studies that compare these approaches are still limited for data-scarce, semi-arid basins with altered hydrological regimes. This study aims to compare the performances of a physically based model (Soil and Water Assessment Tool (SWAT)) and a data-driven model (Nonlinear AutoRegressive eXogenous model (NARX)) for reservoir volume and streamflow prediction in a data-scarce semi-arid region. The study was conducted in the Tersakan Basin, a semi-arid agricultural basin in Türkiye, where the basin hydrology was significantly altered due to reservoirs (Ladik and Yedikir Reservoir) constructed for irrigation purposes. The models were calibrated and validated for streamflow and reservoir volumes. The results show that (1) NARX performed better in the prediction of water volumes of Ladik and Yedikir Reservoirs and streamflow at the basin outlet than SWAT (2). The SWAT and NARX models both provided the best performance when predicting water volumes at the Ladik reservoir. Both models provided the second best performance during the prediction of water volumes at the Yedikir reservoir. The model performances were the lowest for prediction of streamflow at the basin outlet (3). Comparison of physically based and data-driven models is challenging due to their different characteristics and input data requirements. In this study, the data-driven model provided higher performance than the physically based model. However, input data used for establishing the physically based model had several uncertainties, which may be responsible for the lower performance. Data-driven models can provide alternatives to physically-based models under data-scarce conditions.

Modeling streamflow in Sot river catchment of Uttar Pradesh, India

Article 16 September 2023

Prediction of daily reservoir inflow using atmospheric predictors

Article 28 May 2019

Physically-Based Streamflow Predictions in Ungauged Basin with Semi-Arid Climate

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Developing hydrological models at the basin scale is challenging due to the complexity of hydrological processes, spatial variability of soil, geology, and land use/cover characteristics, and spatial and temporal variability of climatic conditions. Hydrological models enable us to understand how physical or climatic changes could affect basinwide hydrological processes and predict basin response to natural or artificial changes (Gupta et al. 2005; Rajat 2021).

There are two classes of hydrological models: physically based or data-driven models. Physically based models consist of mathematical equations that are representation of conceptual models and physical laws such as conservation of mass and conservation of momentum (Chua 2012). These models have been used to predict various hydrological variables including streamflow (Ouyang et al. 2021) and reservoir volumes (Beharry et al. 2021), and explain rainfall-runoff relationships (Liu and Todini 2002) and assess hydrological impacts of global circulation patterns (Liang et al. 1994), and determine occurrence of floods (Borah 2011; Costabile et al. 2013; Costabile and Macchione 2015). The processing of hydrological parameters for the development of physically based models requires expertise, high-quality data, and detailed knowledge of the basin processes (Kim et al. 2015a). Additionally, some studies noted that site-specific constraints and the challenges in reaching input data contributed to some of the short-term prediction errors of physically based models (Costabile and Macchione 2015). Physically based models are capable of explaining the physical processes underlying hydrological events.

Data-driven models have also been used for prediction and forecasting in hydrological studies (Barzegar et al. 2021; Elshorbagy et al. 2010; Evora and Coulibaly 2009; Hu et al. 2021; Ouyang et al. 2021; Özdoğan-Sarıkoç et al. 2023; Tsai et al. 2015; Yaseen et al. 2015; Zhang et al. 2018). These models became more popular with the advances in computation techniques and capacities over the last decades. Data-driven models can be trained easily without knowledge about the physical processes in the basin, offering a valuable tool for modelling difficult or complex terrains with data limitations (Wunsch et al. 2018). These models can be more quickly developed with minimum inputs (Mosavi et al. 2018). However, they have been faced criticism for their inherent lack of transparency and difficulty in reproducing results (Elshorbagy et al. 2010). For example, some of the data-driven models, such as artificial neural networks (ANNs), adapt black-box approach, where inputs are related to outputs using various transfer functions without using knowledge about physical relationships (Kanungo et al. 2006).

Todini (2007) emphasized that an objective comparison is necessary to evaluate the uncertainties and advantages of physically based and data-based models. However, studies that compared these two approaches are still very limited. We provide a summary of previous studies, listed in the Web of Science index, that compared physically based and data-driven models in hydrological applications in Table 1. The studies listed in Table 1 were conducted in different locations, with different models, and with data sets having different characteristics. The annual precipitation in these watersheds ranged from 660 to 2715 mm and studies in arid and semi-arid landscapes were limited. Performance evaluation was done based on various hydrological variables including streamflow, flood events, and evapotranspiration. Still, model prediction performance for some hydrological variables such as reservoir volumes has not yet been investigated. Most of the studies (Ahmadi et al. 2019; Demirel et al. 2009; Kim et al. 2015b; Pradhan et al. 2020; Rabezanahary Tanteliniaina et al. 2021; Srivastava et al. 2006; Valeh et al. 2021; Zakizadeh et al. 2020) focused on the comparison of SWAT with classical ANN models (such as feed-forward networks) for streamflow forecasting. The performance of NARX, which is an ANN model, widely used in data-driven modeling of complex systems, has not been compared with SWAT for streamflow and reservoir volume forecasting. The studies that compared two approaches generally provided better performance with data-driven models (Ahmadi et al. 2019; Demirel et al. 2009; Fan et al. 2020; Hussain et al. 2021; Ji et al. 2021; Kim and Kim 2021; Kim et al. 2015b; Lee et al. 2020; Pradhan et al. 2020; Rabezanahary Tanteliniaina et al. 2021; Srivastava et al. 2006; Sungmin et al. 2020; Valeh et al. 2021; Zakizadeh et al. 2020). However, the models were compared in watersheds where hydrological system was mostly in its natural state and where high-quality data were rather accessible. The model performances have not been compared in watersheds where the hydrological processes were highly modified with reservoirs and irrigation activities and in basin where data availability poses major challenges. Additionally, to the best of our knowledge, there has been no performance comparison conducted for reservoir volume prediction. In this study, we aim to contribute to the available literature by using a physical-based model, SWAT (Arnold et al. 1998), and a data-driven model, NARX, for reservoir volume and streamflow prediction, in a semi-arid, data-scarce basin, where basin hydrology was altered through human interventions.

Table 1 Previous studies that compared physically-based and data-driven models for the prediction of different hydrological variables*

Full size table

SWAT, a lump-parameter, continuous time-scale model, is among the most-used physically based models to characterize basin-scale hydrological processes and to simulate streamflow, reservoir volumes, and reservoir operations (Kim and Parajuli 2012). SWAT has been used for investigating climate change impacts (Jha et al. 2006; Narsimlu et al. 2013; Sood et al. 2013), water quality characterization (Pisinaras et al. 2010; Pohlert et al. 2005), land use/cover change impact assessment (Du et al. 2013; Marhaento et al. 2017), and testing the effects of scenarios (Pisinaras et al. 2010) and best management practices (Kaini et al. 2012; Uniyal et al. 2020). A few studies are available in the literature that used SWAT for prediction of reservoir volumes or reservoir water levels (Beharry et al. 2021, Kim and Parajuli 2012, Jouma and Dadaser‐Celik 2022, Kim et al. 2021, Sedighkia and Abdoli 2022, Zhang et al. 2011, Zhang et al. 2022).

NARX is a relatively new and special type of recurrent neural network (RNN), characterized by its utilization of feedback connections, which typically provides higher performance than conventional RNNs (Lin et al. 1996). NARX has been successfully used for modeling nonlinear systems (Wunsch et al. 2018) with its capability to store information in memory much longer than other RNNs (Lin et al. 1996), which leads to faster convergence and better generalization (Lin et al. 1998). Most researchers used NARX for predicting groundwater levels (Guzman et al. 2017, 2019; Javadinejad et al. 2020; Nunno and Granata 2020; Wunsch et al. 2018), reservoir inflows (Ghazali et al. 2018; Yang et al. 2019), streamflow (Nunno et al. 2021), water temperatures (Kwak et al. 2017), and floods (Chang et al. 2014, 2022; Nanda et al. 2016; Rjeily et al. 2017). The number of studies that focused on reservoir volume and streamflow prediction is comparatively lower (Ghazali et al. 2018; Nunno et al. 2021; Yang et al. 2019). To the best of our knowledge, no studies used NARX for reservoir volume prediction.

This study was conducted at the Tersakan Basin in Türkiye. The climatic characteristics of the Tersakan Basin is semi-arid, with about 450-mm annual precipitation. Tersakan Basin has an altered hydrological regime due to construction of reservoirs on the stream network. Another difficulty for hydrological modeling is caused by the uncertainty in the amount of water used from reservoirs for irrigation purposes. The limited availability of input data such as land use/cover and soil data also caused challenges. This study aims to compare the performances of a physically based model (SWAT) and a data-driven model (NARX) in the Tersakan Basin. The weaknesses and strengths of these approaches were discussed. The analyses in this study could help evaluate the potential of different modelling approaches in predicting reservoir volumes and streamflow in challenging watersheds. This study can also create information related to the use of a new type of ANN model, NARX, in hydrological modelling studies, which is quite limited in the literature.

Materials and methods

SWAT and NARX were applied for predicting reservoir volumes and streamflow in the Tersakan Basin. Flowcharts representing the key stages of SWAT and NARX models are presented in Figs. 1 and 2, respectively. Below, we first provide background information about our study area. Then, we provide details for SWAT and NARX applications.

Study area

Tersakan Basin is located to the north-central Türkiye (Fig. 3). The Tersakan Stream starts from Ladik Reservoir located to the east of the basin. Tersakan Stream irrigates Merzifon (located to the north) and Suluova (located at the center) districts. Due to large agricultural areas covering about 88 km² in Suluova, the streamflow is significantly lower at the basin outlet (Anonymous 2019). The length of the Tersakan Stream is about 100 km and annual flow is 0.125 × 10⁹ m³. Maximum, minimum, and average flows are 317 m³/s, 0.021 m³/s, and 3.96 m³/s, respectively.

Tersakan Basin’s total area is 2206 km². Ladik Reservoir was created by State Hydraulic Works for irrigation purposes in 1973 by constructing a regulator at the outlet of Ladik Lake. Tersakan Stream starts from this location (Tübitak Marmara Research Center 2010). Ladik Reservoir’s volume and surface area are 4854 × 10⁴ m³ and 13.3 km², respectively. Yedikir Reservoir was built between 1982 and 1985 and provides irrigation service to approximately 74 km² of area. Yedikir Reservoir’s surface area is 5.93 km² and its volume is 5710 × 10⁴ m³.

Although there were multiple meteorological stations within the basin (Fig. 3), only a single meteorological station had regular data records for long time periods. Figure 4 shows precipitation and minimum, maximum, and average air temperatures measured at this station from 1975 to 2019. The annual average precipitation during the 1975–2019 period was 434 mm and the annual average air temperature was 12°C. The annual minimum and maximum average air temperatures were 7°C and 17°C, respectively. In general, the basin has cold, semi-arid (steppe) climate, as categorized by the Köppen-Geiger climate classification system (Peel et al. 2007). The elevation in the basin ranges from 375 to 2063 m (Fig. 5a).

Data used

SWAT necessitates a comprehensive dataset including topography, soils, land use/cover, and climatic variables such as minimum and maximum air temperature, precipitation, relative humidity, solar radiation, and wind speed (Fig. 5a, b, d). Additionally, information on land management practices is essential for accurately characterizing watershed processes. Digital elevation model (DEM), soil, and land use/cover data for the Tersakan Basin (Fig. 5) were acquired from global datasets due to unavailability of data from local sources (Table 2). The spatial resolution of these data was quite low (Table 2). Daily meteorological data was available from State Meteorology Service. Streamflow, and reservoir operation data, and data on local agricultural practices were obtained from State Hydraulic Works and local organizations (Table 2).

Table 2 Data characteristics and sources

Full size table

Land use/cover characteristics in the Tersakan Basin is explained based on CORINE 2018 data in Table 3. Corine 2018 dataset has been used to characterize land use/cover in many previous studies (Germeç and Ürker 2023; Llanos-Paez et al. 2023). Tersakan Basin has a high percentage of non-irrigated arable lands with 18.7% coverage and broad-leaved forests with 13.3% coverage. Permanently irrigated area cover 11.5% and land principally occupied by agriculture with significant areas of natural vegetation cover 10%. Natural grasslands and transitional woodland-shrub occupy 11.4% and 13%, respectively. There are three types of soil in the Tersakan Basin. These are calcic cambisols, haplic kastanozems, and calcic xerosols.

Table 3 Land use/cover in the Tersakan Basin

Full size table

For the NARX model, meteorological data consisting of maximum temperature, minimum temperature, precipitation, wind speed, relative humidity, and solar radiation were used as input. The NARX model outputs included water volumes of Ladik and Yedikir Reservoirs and streamflow the basin outlet.

SWAT model development

SWAT is a semi-distributed, time-continuous, ecohydrological model generally applied at the watershed scale (Arnold and Fohrer 2005) and designed to simulate water, nutrient, and sediment transport (Arnold et al. 1998, 2012; Neitsch et al. 2005). It works with hydrological response units (HRUs) which are areas with unique characteristics identified by land use, soil type, and slope. SWAT runs on a daily time step and can simulate plant growth, water quality, and reservoir operations in addition to sediment and nutrient movement and water balance (Arnold et al. 2012).

The ArcSWAT interface program with revision 664 version of SWAT2012 was used to set up the hydrological model. In the initial phase, the watershed was partitioned into sub-basins using the DEM-based option in SWAT, with an area threshold of 220 km² (approximately 10% of the watershed area) (Fig. 5e). Subsequently, certain sub-basins were merged to ensure that reservoirs are located within single sub-basins. Hydrologic response units (HRUs) were created by combining DEM, soil, land use, and slope maps (Fig. 5f) (Neitsch et al. 2005). A 5% threshold for land use, soil, and slope was applied during generation to ensure that variations in land use, soil, and slope were adequately represented in the model. This approach helped capture the spatial heterogeneity of the landscape while ensuring that computational burdens were minimized by excluding small areas (Arnold et al. 2012). We used the variable storage method for river channel routing method for and the Penman/Monteith method for potential evapotranspiration estimation.

Water volume data obtained from Ladik and Yedikir Reservoirs and streamflow data collected at the basin outlet were used to calibrate and validate the SWAT model. Model calibration aims to find a specific set of model parameters that can accurately capture the behavior of the system. Model calibration is an iterative process, where observed and simulated values are continuously compared using different parameters sets. The process continues until the parameter set that provides the most satisfactory results is determined. In this study, a software package called SWAT-CUP (Abbaspour 2015), which provides automatic model calibration, was used. Although several algoritms are available within the SWAT-CUP, we preferred to use the Sequential Uncertainty Fitting Version 2 (SUFI-2) algorithm, which is the most frequently used algorithm for SWAT model calibration and has been proved to be successful (Aibaidula et al. 2022; Mengistu et al. 2019). For calibration and validation, the procedure explained in Abbaspour et al. (2007) and Abbaspour et al. (2015) was used. Twenty-eight parameters that could affect streamflow, reservoir storage, and irrigation were used for calibration. The parameter set was determined based on the available literature where most used parameters for SWAT calibration were listed (Table 4). For each parameter, the lower and upper limit values were also selected from the literature. We included all 28 parameters in calibration. Manual calibration was conducted prior to automatic calibration with SWAT-CUP. A sensitivity analysis is applied to be able to understand the response of the basin to different parameters. In SWAT-CUP, the SUFI-2 algorithm was run 500 times in each iteration. Iterations were continued until the best fit between the simulated and observed values was reached (Abbaspour et al. 2015). At the end of each iteration, new parameter ranges produced by the algorithm were used. Reservoirs are important elements that greatly affect the hydrological dynamics in the SWAT model (Phiri et al. 2021). Irrigation activities also affect the water movement greatly. For this purpose, the parameters that can affect reservoir volumes and irrigation water use were determined and calibration/validation process was carried out for Ladik and Yedikir Reservoirs.

Table 4 Parameters selected for streamflow and reservoir volume calibration, calibration ranges, and type of calibration (v means that the existing parameter value is replaced by a given value and r means that the existing parameter value is multiplied by (1 + a given value (Abbaspour 2015))

Full size table

We ran the SWAT model for the 2000–2017 period, where the first 5 years were used for model warm-up. The warm-up period refers to the initial period of the simulation during which the model adjusts to initial conditions, for various variables such as soil moisture content and reservoir volumes, to reach a stable state before actual model simulations. The duration of the warm-up period can change depending on the characteristics and complexity of the watershed being modeled, and spatial and temporal resolution of the input data used (Prasad et al. 2015). Typically, a warm-up period of 2–10 years is recommended for most SWAT model simulations; however, much longer warmup periods has also been used (Prasad et al. 2015; Schuol et al. 2008; Wang and Kalin 2011; Wu et al. 2012). We used a warm-up period of 5 years, to let the model adjust initial conditions especially for reservoirs.

During model calibration, we followed a multi-variable and multi-site calibration approach. This approach involved adjusting multiple model parameters simultaneously and across multiple sites. This approach has been applied before in basins where spatial variations are high and proved to be more successful in simulating hydrological processes (Cao et al. 2006; Moussa et al. 2007; Shresthaa et al. 2016). In the SWAT model calibration, water volumes at the Ladik Reservoir were used to calibrate the SWAT model for sub-basins 1 and 2 and water volumes at the Yedikir Reservoir were used to calibrate the SWAT model for sub-basins 3, 4, 5, and 7. The streamflow at the basin outlet was used to calibrate the SWAT model for sub-basins 6 and 8. The parameters related to the basin (.bsn) were selected in the calibration based on Ladik Reservoir volumes only and the results found here were added to the SWAT model for all sub-basins. Water volumes at the Ladik Reservoir were available for the 2010–2017 period, where the data from 2010 to 2014 (60 months) were used for calibration and 2015–2017 (36 months) for validation. For Yedikir Reservoir, water volume data could be reached for the 2010–2016 period. Here, we used the 2010–2014 (60 months) period for calibration and the 2015–2016 (24 months) period for validation. Streamflow at the basin outlet was available only for the 2013–2017 period. The 2013–2015 (36 months) and 2016–2017 (24 months) periods were selected for calibration and validation, respectively.

NARX model development

NARX is a type of RNN that deals with forecasting time series data (Chang et al. 2013, 2022; Lin et al. 1998; Menezes and Barreto 2008; Wunsch et al. 2018). The NARX model has three different architectures (Shen and Chang 2013). A statistical neural network is the first type of architecture. In this type, the NARX models use the target as an input during model training and model testing. The second type has a serial, parallel configuration, in which the target is used as an input in during model training, and the output value is feedback as an input value during model testing. Usually, the NARX model performance was stronger during the training phase, but weaker during model testing. The last type is a parallel configuration. In this type, the output value is used as an input during the testing and training phases. This study used a parallel configuration because this type leads to a strong fault tolerance (Chang et al. 2022). Figure 6 shows the NARX model architecture used in this study. The NARX model consisted of input, hidden, and output layers. The outputs create new inputs and the inputs can delay for a certain time steps. Equation 1 can be used for forecasting N-step-ahead (N ≥ 1):

$$z \left(t+N\right)=f[z\left(t+N-1\right),\dots ,z\left(t+N-q\right); \;U(t)]$$

(1)

In Eq. (1), $f(.)$ is the nonlinear function. $z \left(t+N\right)$ and $U(t)$ output value and denote the input vector at the t time step, respectively. $q$ is the order of output memory. $z\left(t+N-q\right)$ and $U(t)$ are input regressors. The $z\left(t+N-i\right)$ regressor (i is 1 to q) acts as the autoregressive model in the time series, and another regressor $U(t)$ also acts as an implicit exogenous variable.

In this study, the NARX model was used to estimate the water volumes at the Ladik and Yedikir Reservoirs and streamflow at the basin outlet. The NARX model network was trained using the Scaled Conjugate Gradient algorithm (Møller 1993) and the transfer functions of layers were selected as the sigmoid type. In this algorithm, a feedforward ANN architecture is used, where connection weights of neurons are optimized at the same time (Chen and Chang 2009). In practice, the scaled conjugate algorithm was found to be more effective than the classical backpropagation algorithm (Chiang et al. 2004). Model construction and application were undertaken on the MATLAB 2016a software. We used the default learning rate (0.01) in all simulations.

The number of hidden neurons and delays were determined based on the least complex model structure that might produce adequate results based on the available literature (Alsumaiei 2020; Chiang et al. 2004; Wunsch et al. 2018; Yang et al. 2019). First, we tested the number of neurons and delay parameters for various values. This analysis revealed that the network structures with 1–10 hidden neurons and 1–10 delay numbers provided better model performance. Delay numbers of NARX model can help reduce the sensitivity of the network system (Li et al. 2017). For selection of the optimum parameter values and further improvement, the NARX model was run several times by selecting values from this range. The optimum parameter values were selected as those that provided the highest performance.

Monthly averages (for minimum and maximum temperature, relative humidity, and solar radiation) or totals (for precipitation) were calculated for climatic variables and used as input. All data were normalized to be between 0 and 1 (Eq. 2). The data were randomly divided into three sets: training, validation, and testing sets. Seventy percent of the data was used for training and 15% for validation and 15% for testing.

$$X=\frac{{X}_{ori}-{X}_{min}}{{X}_{max}-{X}_{min}}$$

(2)

In Eq. 2, ${X}_{\text{min}}$, ${X}_{\text{max}}$, ${X}_{\text{ori}}$, and $X$ were minimum, maximum, original, and normalized values, respectively.

Performance evaluation and comparison

We used five performance measures to evaluate the success in model calibration and validation. These were coefficient of determination (R²) (Krause et al. 2005), Nash–Sutcliffe Efficiency (NSE) (Nash and Sutcliffe 1970), root-mean-square error (RMSE), normalized root mean squared error (NRMSE) (Armstrong and Collopy 1992), and Kling-Gupta efficiency (KGE) (Gupta et al. 2009).

In Eqs. 3–6, ${E}_{\text{observed}}$shows the observed value, and ${E}_{\text{predicted}}$ shows the predicted one. ${\overline{E} }_{\text{predicted}/\text{observed}}$ is the mean of the predicted/observed values, ${E}_{\text{predicted}/\text{observed}}^{t}$ is the predicted/observed values at time t, and ${E}_{\text{predicted}\_\text{average}}$ is the average of the predicted values. N is the number of data.

R² (Eq. 3) range from 1 to 0, and when it is close to 1 there is a perfect relationship between the actual and predicted values (Krause et al. 2005; Yang et al. 2017).

$${R}^{2}={\left\{\frac{{\sum }_{i=1}^{N}\left({E}_{Observed}^{t}-{\overline{\text{E}} }_{\text{Observed}}\right)\left({\text{E}}_{predicted}^{t}-{\overline{E} }_{predicted}\right)}{\sqrt{{\sum }_{i=1}^{N}{\left({E}_{Observed}^{t}-{\overline{\text{E}} }_{\text{Observed}}\right)}^{2}}\sqrt{{\sum }_{i=1}^{N}{\left({\text{E}}_{predicted}^{t}-{\overline{E} }_{predicted}\right)}^{2}}}\right\}}^{2}$$

(3)

NSE (Eq. 4) changes between − ∞ and 1 and the values close to 1 denote better performance.

$$NSE=1-\frac{{\sum }_{i=1}^{N}{({E}_{observed}-{E}_{predicted})}^{2}}{{\sum }_{i=1}^{N}{({E}_{predicted}-{E}_{predicted\_average})}^{2}}$$

(4)

The RMSE (Eq. 5) presents the error between the simulated and observed values, and this value is a widely used error index statistic (Singh et al. 2005).

$$\text{RMSE}=\sqrt{\frac{\sum_{i=0}^{N}{({E}_{predicted}-{E}_{observed})}^{2}}{N}}$$

(5)

Model performances for SWAT and NARX were also evaluated based on NRMSE (Eq. 6), the relative form of RMSE, which provided a better comparison by normalizing the volumes of reservoirs with different capacities by the mean values.

$$NRMSE (\%)=\frac{\sqrt{\frac{\sum_{i=0}^{N}{({E}_{predicted}-{E}_{observed})}^{2}}{N}}}{{\overline{E} }_{observed}}\times 100$$

(6)

KGE metric (Eq. 7) ranges from − ∞ and 1 (Pham et al. 2021). The closer the model result is to 1, the more perfect the model performance.

$$KGE=1-\sqrt{{\left(r-1\right)}^{2}+{\left(\propto -1\right)}^{2}+{\left(\beta -1\right)}^{2}}$$

(7)

In Eq. 7, ∝ is a relative variability in the simulated and observed values. r is the Pearson coefficient and β represents the bias:

$$\propto =\frac{{\sigma }_{\widehat{y}}}{{\sigma }_{y}} \text{and }\beta =\frac{{\mu }_{\widehat{y}}}{{\mu }_{y}}$$

(8)

where ${\sigma }_{y}$ is the standard deviation of the simulating values, and ${\sigma }_{\widehat{y}}$ is the standard deviation of the observations. ${\mu }_{\widehat{y}}$ and ${\mu }_{y}$ are simulating and observation mean, respectively.

More than one metric should be taken into account when evaluating models as individual metrics may have weaknesses (Bennett et al. 2013). In this study, we used five metrics for comparison of model performance: R², RMSE, NRMSE, NSE, and KGE. RMSE and R² are among the metrics chosen for model performance due to their wide usage areas. NSE is a parameter that is sensitive to peaks and may provide a more reliable assessment. KGE is a relatively new metric developed based on NSE (Pham et al. 2021). It is very popular in hydrological models by addressing the shortcomings in NSE by incorporating bias and variance terms (Akbarian et al. 2023). Since two reservoirs of different capacities were compared in the study, it would be useful to express the results in relative terms scaled to mean values. Therefore, the NRMSE metric was used.

Results and discussion

Performance of the SWAT model

The SWAT model of the Tersakan Basin included 8 sub-basins (Fig. 5e) and 220 HRUs (Fig. 5f). Sub-basins are spatially related to each other and have a geographical location within the basin. The sub-basin boundary is obtained in such a way that the entire area within any sub-basin flows to the outlet of the other sub-basin (Arnold et al. 2012). The parameters used in model calibration, their descriptions, initially selected ranges, and calibration outputs are shown in Table 4. Table 5 and Fig. 7 present model calibration and validation results.

Table 5 Summary of SWAT Model statistical results for Ladik and Yedikir reservoirs volume and streamflow

Full size table

The sensitivity analysis showed that reservoir volumes were most sensitive to GW_Delay.gw (delay time for aquifer recharge), RCHRG_DP.gw (aquifer percolation coefficient), and CH_K(2).rte (effective hydraulic conductivity of channel), and streamflow was most sensitive to SURLAG.bsn (surface runoff lag coefficient), CH_K(2).rte (Effective hydraulic conductivity of channel), and GW_REVAP.gw (Revap coefficient).

The R², NSE, KGE, RMSE, and NRMSE values calculated between the observed and predicted water volumes at the Ladik Reservoir were 0.76, 0.69, 0.73, 8.3 × 10⁶ m³, and 33% respectively, for the calibration period and 0.67, 0.64, 0.79, 8.3 × 10⁶ m³, and 32% respectively for the validation period. Based on observed and predicted water volumes at the Yedikir Reservoir, R², NSE, KGE, RMSE, and NRMSE values were calculated as 0.69, 0.65, 0.83, 8.9 × 10⁶ m³, and 21% respectively, for the calibration period and 0.56, 0.41, 0.73, 10.7 × 10⁶ m³, and 26% respectively for the validation period. Moriasi et al. (2015) state that the SWAT model performance can be classified as “satisfactory” when 0.5 < NSE < 0.7 and 0.6 < R² < 0.75 for flow predictions at the daily, monthly, and annual scales. Model performance was proposed to be “good” when 0.7 < NSE < 0.80 and 0.75 < R² < 0.85 and “very good” when NSE > 0.80 and R² > 0.85. KGE model performance can be divided into three groups, “poor performance” (0.5 > KGE > 0), “intermediate” (0.75 > KGE > 0.5), and “good performance” (KGE > 0.75) (Moriasi et al. 2015). They did not identify criteria based on RMSE, as the values for RMSE could change based on the units of the variable. However, Yuzer and Bozkurt (2022) mentioned that the SWAT model performance is excellent when NRMSE < 10%. Based on these criteria, the model performance was found to be “satisfactory” based on NSE for the calibration and validation periods for predicting water volumes at the Ladik Reservoir. Based on R², it was “good” during model calibration and “satisfactory” during model validation. Based on KGE, it was “intermediate” during model calibration and “good” during model validation. For prediction of water volumes at the Yedikir Reservoir, model performance was found to be “satisfactory” based on NSE, and R² during model calibration, but “unsatisfactory” during validation. Based on KGE, it was “good” during model calibration and “intermediate” during model validation. The NRMSE values were higher than 10% in all cases. Here, we should mention that most of the SWAT model evaluation criteria were developed based on flow estimations. No specific criteria are available for models that simulate reservoir volumes. Predicting reservoir volumes is more challenging due to the high variability of water inflows and outflows. A few studies are available in the literature that evaluated model performance for water volume prediction in lakes and reservoirs. Beharry et al. (2021) showed that the NSE value between observed and predicted reservoir volumes were 0.67 during the calibration period and 0.70 during the validation period. Kim and Parajuli (2012) modeled the reservoir outflow option in SWAT. Considering the SWAT model performances, they found that the NSE value was 0.60 in the calibration period and 0.62 in the validation period. NSE and R² values calculated between predicted and observed reservoir volumes were found to be 0.36–0.60 and 0.54–0.75 during model calibration and 0.23–0.13 and 0.49–0.50 during model validation for two irrigation reservoirs in Türkiye, respectively (Jouma and Dadaser‐Celik 2022). These results suggest that the SWAT model performance obtained for predicting water volumes at the Ladik and Yedikir Reservoirs were compatible with the available literature.

Based on the observed and predicted streamflow at the basin outlet, the R², NSE, KGE, RMSE, and NRMSE values were calculated as 0.66, 0.63, 0.67, 2.5 m³/s, and 57%, respectively, for the calibration period and 0.79, 0.22, 0.29, 3.7 m³/s, and 71%, respectively, for the validation period. Calibration period results showed that model performance was “satisfactory” based on NSE and R² criteria proposed by Moriasi et al. (2015). For the validation period, R² result showed that model performance was “good,” but NSE result showed that it was “unsatisfactory.” This study showed that there is a strong relationship in calibration period between predicted and measured values. But during the validation of the SWAT model for streamflow, the degree of relationship between observed and predicted values were lower. Tan et al. (2019) provided a review of SWAT model performance in Southeast Asia for monthly and daily streamflow prediction based on R² and NSE values. More than 60% of the 217 studies performed “very good” for monthly streamflow. However, some other studies provided lower performance due to uncertainties in input data. They also reported that in general, the results of the calibration provided better results than validation.

As can be seen from Table 5, the SWAT model generally provided good to satisfactory results particularly based on NSE value during calibration, but it was sometimes lower during validation. The performance of the SWAT model could be affected by a variety of factors: (1) the SWAT model for the Tersakan Basin was developed with DEM, soil, and land use/cover data obtained from global datasets with low spatial resolutions. Also meteorological data were available only from a single station. Many previous studies showed that SWAT model performance could be affected by the characteristics of meteorological data and DEM, soil, and land use/cover datasets used (Bouslihim et al. 2019; Cuceoglu et al. 2021). In this study, data availability created major challenges (2). Tersakan Basin is a basin, where the hydrologic regime was modified significantly due to construction of reservoirs and irrigation water use in the basin. Desta and Lemma (2017) evaluated the SWAT model results for the Ziway Lake, which is under intense human influence, and stated that the human interventions negatively affected the hydrological results. Similarly, Jouma and Dadaser‐Celik (2022) showed that model performance was lower due to modification of the hydrologic regime in the Develi Basin (Türkiye) (3). The lack of information about irrigation practices in the Tersakan Basin posed another challenge during model calibration. Due to the unavailability of data, we estimated irrigation with the auto-irrigation tool available in SWAT, where irrigation water requirements could be predicted based on crop water stress. We identified the parameters used for estimating water stress during calibration. Considering the high spatial and temporal variability of cropping and irrigation practices across the basin, auto-irrigation tool only provided rough estimates of irrigation water use. Chen et al. (2018) and Chen et al. (2020) also mentioned that auto-irrigation tool available in SWAT could pose some uncertainties on model results.

Performance of the NARX model

NARX model was developed to predict reservoirs volumes at the Ladik and Yedikir Reservoirs and streamflow at the basin outlet. Delay numbers for the best models were determined to be 7, 9, and 2 for predicting water volumes at the Ladik Reservoir, water volumes at the Yedikir reservoir, and streamflow at the basin outlet, respectively, and the number of neurons for the same variables were 10, 9, and 6, respectively. Model results are shown in Fig. 7 and Table 6.

Table 6 Summary of NARX model statistical results for Ladik and Yedikir reservoirs volume and streamflow

Full size table

The R², NSE, KGE, RMSE, and NRMSE values calculated between the observed and predicted water volumes measured at the Ladik Reservoir were 0.96, 0.95, 0.96, 3.1 × 10⁶ m³, and 13%, respectively, for the training period; 0.94, 0.93, 0.90, 3.6 × 10⁶ m³, and 14% for the validation periods; and 0.96, 0.95, 0.92, 2.6 × 10⁶ m³, and 22%, for the testing period, respectively. For the Yedikir Reservoir, the R², NSE, KGE, RMSE, and NRMSE values between predicted and observed water volumes were 0.93, 0.93, 0.92, 4.1 × 10⁶ m³, and 9.7% for the training period; 0.96, 0.96, 0.98, 2.6 × 10⁶ m³, and 6% for the validation period; and 0.93, 0.90, 0.91, 3.5 × 10⁶ m³, and 16% for the testing period, respectively. Based on model performance criteria, the NARX model performance was found to be “very good” based on NSE and R² for all periods for simulating water volumes at the Ladik Reservoir. For prediction of water volumes at the Yedikir Reservoir, model performance was found to be “very good” based on NSE, and R² during model training, validation and testing periods. Based on KGE, it was “good” during model training, validation, and testing for Ladik and Yedikir reservoir. Moreover, as the NRMSE value is less than 10%, the NARX model performance is excellent for training and validation periods based on these criteria.

The R², NSE, KGE, RMSE, and NRMSE values calculated between predicted and observed streamflow at the basin outlet were 0.89, 0.89, 0.89, 1.5 m³/s, and 0.32% for the training period; 0.96, 0.95, 0.89, 0.9 m³/s, and 0.21% for the validation period; and 0.79, 0.71, 0.81, 0.9 m³/s, and 0.22% for the testing period, respectively. For streamflow, predicted model performance was found to be “very good” based on NSE, and R² during model training and validation periods.

For the testing period, the NARX model performance was “good.” Based on KGE it was “good” during model training, validation, and testing period.

There are only a few studies in the literature regarding reservoirs, and in those studies the reservoir inlet or outlet flow was predicted rather than reservoir volumes. Ghazali et al. (2018) studied model performance for simulating monthly reservoir inflow and the R² results ranged from 0.73 to 0.90. Yang et al. (2019) also examined the reservoir inflow prediction and they found the NSE result as 0.85. These results showed that in this study results found were similar and compatible with the literature. Results also showed that the NARX model is suitable for reservoir volume prediction.

The NARX model does not use information or data regarding the physical characteristics of the basin thus the NARX model does not represent the basin system. Moreover, the major criticism towards data-driven models, including NARX, is the physical meaningless and implicit features (Jimeno-Sáez et al. 2018). Data-driven models only make predictions for selected points (Srivastava et al. 2006). In addition, due to the sensitivity of these models to the value of outliers in the training process, it would be more appropriate to use them in the macro perspective (Zakizadeh et al. 2020).

Comparison of SWAT and NARX models

In this study, the results from the physically based and data-driven models were compared for predicting reservoir volumes and streamflow at the Tersakan Basin. We compared the training period results from the NARX model with the calibration period results from the SWAT model. Additionally, we compared the validation/training results from the NARX model with the validation period results from the SWAT model. In model comparison, we used five different metrics, R², RMSE, NRMSE, KGE, and NSE. According to the model performances provided in Tables 5 and 6, the NARX model provided better performance than the SWAT model for reservoir volume and streamflow prediction based on all five metrics.

There is no study in the literature comparing NARX and SWAT models. However, there are some studies for streamflow prediction with other data-driven models, such as ANNs, LSTM, and SWAT models, that were used together (Table 1). The performance of the ANN model was compared with SWAT in many studies (Ahmadi et al. 2019; Demirel et al. 2009; Fan et al. 2020; Jimeno-Sáez et al. 2018; Kim et al. 2015a; Makwana and Tiwari 2017; Pradhan et al. 2020; Rabezanahary Tanteliniaina et al. 2021; Srivastava et al. 2006; Valeh et al. 2021; Wagena et al. 2020; Zakizadeh et al. 2020). Some other studies compared the performance of the SWAT with SOM (Kim et al. 2015a), XGBoost (Ji et al. 2021), SVR (Ji et al. 2021), and LSTM (Giha et al. 2018; Ji et al. 2021; Kim and Kim 2021; Lee et al. 2020). In these studies, R² values calculated with SWAT ranged from 0.49 to 0.92, while those with the data-driven models were between 0.52 and 0.98. The NSE values calculated based on simulations with SWAT ranged from 0.47 and 0.90, and they were between 0.49 and 0.98 with the data-driven models. The results of this study were compatible with the previous studies for streamflow prediction. These studies showed that the performances of the data-driven models were better than the SWAT model in estimating streamflow. However, they also emphasized that the SWAT model provides the hydrological water balance better, while the ANN model produces the result without considering the hydrological outputs.

Results showed that with both models, the best performance was obtained when predicting water volumes at the Ladik reservoir. The second best performance was obtained during prediction of water volumes at the Yedikir reservoir. The model performances were the lowest for prediction of streamflow at the basin outlet Ladik Reservoir is located, where the Tersakan River starts in the basin. Yedikir reservoir is located close to the basin outlet (Fig. 3). Agricultural areas are denser towards the basin outlet and there is a lot of uncontrolled irrigation here. These results suggest that both model performances are more affected where human influences are intense (Özdoğan-Sarıkoç et al. 2023).

As SWAT model includes the physical conceptualization of the watershed and simulates processes that affect water movement, it can produce information about water balance, which is useful for understanding hydrological processes. In addition, the SWAT model’s ability to complete the missing data is also among its advantages over the NARX model (Makwana and Tiwari 2017). The weaknesses of the SWAT model are that it requires a lot of data for model development and is time consuming, and variable selection is difficult and requires expertise (Zakizadeh et al. 2020). On the other hand, NARX model is that it is easier to implement, as it does not require any physical properties in the basin. It requires less cost and data and is better and faster than SWAT. It is more suitable to use for basins where necessary data for establishing a physically based model are limited, such as the Tersakan Basin. However, it is one of the important criticisms that it is a black-box and produces physically meaningless results (Jimeno-Sáez et al. 2018). NARX model also does not represent a watershed system in the spatial dimension and therefore cannot make predictions at various points along the stream. It can perform prediction only at the point where it is informed (Srivastava et al. 2006). However, SWAT model can calculate parameters for each sub-basin and predict inflows and outflows for each sub-basin (Zakizadeh et al. 2020). Another shortcoming of NARX model is that they need to be retrained for data changes in the watershed. This indicates that they cannot be used to predict future conditions associated with the watershed.

Limitations and future work

The SWAT model is created based on the physical characteristics of the watershed area, which requires extensive information about topography, soils, and land use/cover. Unfortunately, the physical data about the Tersakan basin were quite limited; therefore, data from global datasets were to be used. If high-quality and high-resolution data could have been reached, the performance of the SWAT model could have been better (Ahmadi et al. 2019). Also this study used data from a single meteorological station as the stations available in the basin did not provide regular and long-term data. This might have prevented the inclusion of some local climatic events in the simulations (Thodsen et al. 2017). In the future, the availability of data from global datasets that offer higher spatial and temporal resolutions and meteorological data from multiple stations could improve the performance of the SWAT model. Due to presence of reservoirs in the basin, we used a watershed configuration consisting of 8 sub-basins and 220 HRUs. Different watershed configurations could have been created with different number of sub-basins/HRUs. In the future, the effects of number of sub-basins/HRUs on the model performance could be evaluated.

When two models are evaluated according to the same performance evaluation criteria, the NARX model performance was better than that of the SWAT model. However, here we should note that the performance criteria used in this study (i.e., Moriasi et al. (2015)) was developed for physically based models. Data-driven models usually yield higher performance. The suitability of performance criteria for data-driven models should be evaluated in the future.

Both physically based and data-driven models have advantages under different conditions. Combining data-driven models with physically based models can strengthen these advantages and lower the shortcomings and has the potential to produce a superior hydrological output. Some examples to the use of hybrid models is using the outputs from physically based model as inputs to the data-driven model (Noori and Kalin 2016; Wang et al. 2022) or using data-driven models for improving the quality of inputs to the data-driven model (Liang et al. 2017). In the future, we plan to examine alternative configurations using both approaches.

Conclusions

Physically based models are used to understand various hydrological processes and to predict the behavior of systems. These models are a function of various parameters used to describe watershed features and produce a set of equations that are used to predict reservoir volume and streamflow. On the other side, data-driven models provide good alternatives without prior knowledge about physical processes. In these models, the representation of model output is more important than representing watershed processes. Creating physically based hydrological models requires expertise, and the process is more complex and lengthy but data-driven models are created much easily and have strong learning capabilities.

The main results of this study can be summarized as follows:

(1)
Both models produced satisfactory results in the estimation of reservoir volume and streamflow. However, according to the performance evaluation criteria, the NARX model produced better results than the SWAT model.
(2)
The SWAT and NARX models were both provided the best performance when predicting water volumes at the Ladik reservoir. The second best performance with both models was obtained during prediction of water volumes at the Yedikir reservoir. The model performances were the lowest for prediction of streamflow at the basin outlet. We argue that the degree of human intervention on the hydrologic system increased from upstream to downstream in the basin and these interventions affected model performance.
(3)
The SWAT model is a physically based model that simulates hydrological processes in the basin. The NARX model, on the other hand, is a result-based model and it does not focus on processes. Therefore, the comparison of these models is challenging due to their different nature. In this study, the NARX model provided better results than the SWAT model. However, the SWAT model could have produced more reasonable results for predicting reservoir volume or streamflows in the future and could better adapt to changing physical or hydrological conditions as it represent the basin processes better.

In general, this study shows that there is no single better model for predicting reservoir volume and streamflow. Each approach has its own weaknesses and strengths. Hence, in future studies, we suggest hybrid models, which combines with physically based and data-driven models, can be used.

Data availability

The authors confirm that the data supporting the findings of this study are available within the article.

References

Abbaspour KC, Yang J, Maximov I, Siber R, Bogner K, Mieleitner J, Zobrist J, Srinivasan R (2007) Modelling hydrology and water quality in the pre-alpine/alpine Thur watershed using SWAT. J Hydrol 333:413–430
Article Google Scholar
Abbaspour KC, Rouholahnejad E, Vaghefi S, Srinivasan R, Yang H, Kløve B (2015) A continental-scale hydrology and water quality model for Europe: calibration and uncertainty of a high-resolution large-scale SWAT model. J Hydrol 524:733–752
Article Google Scholar
Abbaspour KC (2015) SWAT-CUP: SWAT Calibration and Uncertainty Programs - A User Manual, Swiss Federal Institute of Aquatic Science and Technology, Eawag106
Ahmadi M, Moeini A, Ahmadi H, Motamedvaziri B, Zehtabiyan GR (2019) Comparison of the performance of SWAT, IHACRES and artificial neural networks models in rainfall-runoff simulation (case study: Kan watershed, Iran). Phys Chem Earth, Parts A/B/C 111:65–77
Article Google Scholar
Aibaidula D, Ates N, Dadaser-Celik F (2022) Modelling climate change impacts at a drinking water reservoir in Turkey and implications for reservoir management in semi-arid regions. Environ Sci Pollut Res 30(5):13582–13604
Article Google Scholar
Akbarian M, Saghafian B, Golian S (2023) Monthly streamflow forecasting by machine learning methods using dynamic weather prediction model outputs over Iran. J Hydrol 620:129480
Article Google Scholar
Alsumaiei AA (2020) A nonlinear autoregressive modeling approach for forecasting groundwater level fluctuation in urban aquifers. Water 12:820
Article Google Scholar
Anonymous (2019) Samsun Governorship Provincial Directorate of Environment and Urbanization Samsun Province 2018 Environmental Status Report
Armstrong JS, Collopy F (1992) Error measures for generalizing about forecasting methods: empirical comparisons. Int J Forecast 8:69–80
Article Google Scholar
Arnold JG, Fohrer N (2005) SWAT2000: current capabilities and research opportunities in applied watershed modelling. Hydrol Process 19:563–572
Article Google Scholar
Arnold JG, Srinivasan R, Muttiah RS, Williams JR (1998) Large area hydrologic modeling and assessment – part 1: model development. J Am Water Resour Assoc 34:73–89
Article CAS Google Scholar
Arnold JG, Kiniry JR, Srinivasan R, Williams JR, Haney EB, Neitsch SL (2012) Soil and water assessment tool input/output documantation version 2012. In: Texas Water Resources Institute T-, College Station (Hrsg.)
Ashrafzadeh A, Salehpoor J, Lotfirad M (2024) Comparative analysis of data-driven and conceptual streamflow forecasting models with uncertainty assessment in a major basin in Iran. Int J Energy Water Resour In press
Ayzel G, Izhitskiy A (2018) Coupling physically based and data-driven models for assessing freshwater inflow into the Small Aral Sea. Proc Int Assoc Hydrol Sci 379:151–158
Google Scholar
Barzegar R, Aalami MT, Adamowski J (2021) Coupling a hybrid CNN-LSTM deep learning model with a boundary corrected maximal overlap discrete wavelet transform for multiscale Lake water level forecasting. J Hydrol 598:126196
Article Google Scholar
Beharry SL, Gabriels D, Lobo D, Ramsewak D, Clarke RM (2021) Use of the SWAT model for estimating reservoir volume in the Upper Navet watershed in Trinidad. SN Appl Sci 3(2):163
Article CAS Google Scholar
Bennett ND, Croke BFW, Guariso G, Guillaume JHA, Hamilton SH, Jakeman AJ, Marsili-Libelli S, Newham LTH, Norton JP, Perrin C, Pierce SA, Robson B, Seppelt R, Voinov AA, Fath BD, Andreassian V (2013) Characterising performance of environmental models. Environ Model Softw 40:1–20
Article Google Scholar
Borah DK (2011) Hydrologic procedures of storm event watershed models: a comprehensive review and comparison. Hydrol Process 25:3472–3489
Article Google Scholar
Bouslihim Y, Rochdi A, El Amrani PN, Liuzzo L (2019) Understanding the effects of soil data quality on SWAT model performance and hydrological processes in Tamedroust watershed (Morocco). J Afr Earth Sc 160:103616
Article Google Scholar
Cao W, Bowden WB, Davie T, Fenemor A (2006) Multi-variable and multi-site calibration and validation of SWAT in a large mountainous catchment with high spatial variability. Hydrol Process 20:1057–1073
Article Google Scholar
Chang F-J, Chen P-A, Liu C-W, Liao VH-C, Liao C-M (2013) Regional estimation of groundwater arsenic concentrations through systematical dynamic-neural modeling. J Hydrol 499:265–274
Article CAS Google Scholar
Chang F-J, Chen P-A, Lu Y-R, Huang E, Chang K-Y (2014) Real-time multi-step-ahead water level forecasting by recurrent neural networks for urban flood control. J Hydrol 517:836–846
Article Google Scholar
Chang L-C, Liou J-Y, Chang F-J (2022) Spatial-temporal flood inundation nowcasts by fusing machine learning methods and principal component analysis. J Hydrol 612:128086
Article Google Scholar
Chen Y-h, Chang F-J (2009) Evolutionary artificial neural networks for hydrological systems forecasting. J Hydrol 367:125–137
Article Google Scholar
Chen Y, Marek GW, Marek TH, Brauer DK, Srinivasan R (2018) Improving SWAT auto-irrigation functions for simulating agricultural irrigation management using long-term lysimeter field data. Environ Model Softw 99:25–38
Article Google Scholar
Chen Y, Marek GW, Marek TH, Porter DO, Moorhead JE, Heflin KR, Brauer DK, Srinivasan R (2020) Watershed scale evaluation of an improved SWAT auto-irrigation function. Environ Model Softw 131:104789
Article Google Scholar
Chiang Y-M, Chang L-C, Chang F-J (2004) Comparison of static-feedforward and dynamic-feedback neural networks for rainfall–runoff modeling. J Hydrol 290:297–311
Article Google Scholar
Chua LHC (2012) Considerations for data-driven and physically-based hydrological models in flow forecasting. IFAC Proc 45:1025–1030
Google Scholar
Costabile P, Macchione F (2015) Enhancing river model set-up for 2-D dynamic flood modelling. Environ Model Softw 67:89–107
Article Google Scholar
Costabile P, Costanzo C, Macchione F (2013) A storm event watershed model for surface runoff based on 2D fully dynamic wave equations. Hydrol Process 27:554–569
Article Google Scholar
Cuceoglu G, Seker DZ, Tanik A, İz O (2021) Analyzing effects of two different land use datasets on hydrological simulations by using SWAT model. Int J Environ Geoinformatics 8:172–185
Article Google Scholar
Demirel MC, Venancio A, Kahya E (2009) Flow forecast by SWAT model and ANN in Pracana basin, Portugal. Adv Eng Softw 40:467–473
Article Google Scholar
Desta H, Lemma B (2017) SWAT based hydrological assessment and characterization of Lake Ziway sub-watersheds, Ethiopia. J Hydrol: Reg Stud 13:122–137
Google Scholar
Du J, Rui H, Zuo T, Li Q, Zheng D, Chen A, Xu Y, Xu CY (2013) Hydrological simulation by SWAT model with fixed and varied parameterization approaches under land use change. Water Resour Manag 27:2823–2838
Article Google Scholar
Elshorbagy A, Corzo G, Srinivasulu S, Solomatine DP (2010) Experimental investigation of the predictive capabilities of data driven modeling techniques in hydrology - part 2: Application. Hydrol Earth Syst Sci 14:1943–1961
Article Google Scholar
Evora ND, Coulibaly P (2009) Recent advances in data-driven modeling of remote sensing applications in hydrology. J Hydroinf 11:194–201
Article Google Scholar
Fan H, Jiang M, Xu L, Zhu H, Cheng J, Jiang J (2020) Comparison of long short term memory networks and the hydrological model in runoff simulation. Water 12:175
Article Google Scholar
Germeç E, Ürker O (2023) Investigation of a SWAT model for environmental health management based on the water quality parameters of a stream system in central Anatolia (Türkiye). Sustainability 15:13850
Article Google Scholar
Ghazali M, Honar T, Nikoo MR (2018) A fusion-based neural network methodology for monthly reservoir inflow prediction using MODIS products. Hydrol Sci J 63:2076–2096
Article Google Scholar
Giha L, Sungho J, Daeeop L (2018) Comparison of physics-based and data-driven models for streamflow simulation of the Mekong river. J Korea Water Resour Assoc 51:503–514
Google Scholar
Gupta HV, Kling H, Yilmaz KK, Martinez GF (2009) Decomposition of the mean squared error and NSE performance criteria: implications for improving hydrological modelling. J Hydrol 377:80–91
Article Google Scholar
Gupta H, Beven KJ, Wagener T (2005): Model calibration and uncertainty estimation. In Encyclopedia of hydrological science, edited by: Anderson, M. G., John Wiley & Sons, Ltd
Guzman SM, Paz JO, Tagert MLM (2017) The use of NARX neural networks to forecast daily groundwater levels. Water Resour Manag 31:1591–1603
Article Google Scholar
Guzman SM, Paz JO, Tagert MLM, Mercer AE (2019) Evaluation of seasonally classified inputs for the prediction of daily groundwater levels: NARX networks vs support vector machines. Environ Model Assess 24:223–234
Article Google Scholar
Hu X, Shi L, Lin G, Lin L (2021) Comparison of physical-based, data-driven and hybrid modeling approaches for evapotranspiration estimation. J Hydrol 601:126592
Article Google Scholar
Hussain F, Wu R-S, Wang J-X (2021) Comparative study of very short-term flood forecasting using physics-based numerical model and data-driven prediction model. Nat Hazards 107:249–284
Article Google Scholar
Javadinejad S, Dara R, Jafary F (2020) How groundwater level can predict under the effect of climate change by using artificial neural networks of NARX. Resour Environ Inf Eng 2:90–99
Article Google Scholar
Jha M, Arnold JG, Gassman PW, Giorgi F, Gu RR (2006) Climate change sensitivity assesment on upper Mississippi River Basin stream flows using SWAT. J Am Water Resour Assoc 42:997–1015
Article Google Scholar
Ji H, Chen Y, Fang G, Li Z, Duan W, Zhang Q (2021) Adaptability of machine learning methods and hydrological models to discharge simulations in data-sparse glaciated watersheds. J Arid Land 13:549–567
Article Google Scholar
Jimeno-Sáez P, Senent-Aparicio J, Pérez-Sánchez J, Pulido-Velazquez D (2018) A comparison of SWAT and ANN Models for daily runoff simulation in different climatic zones of Peninsular Spain. Water 10:192
Article Google Scholar
Jouma N, Dadaser-Celik F (2022) Assessing hydrologic alterations due to reservoirs and intensified irrigation in a semi-arid agricultural river basin using SWAT. Irrig Drain 71:452–471
Article Google Scholar
Kaini P, Artita K, Nicklow JW (2012) Optimizing structural best management practices using SWAT and Genetic algorithm to improve water quality goals. Water Resour Manag 26:1827–1845
Article Google Scholar
Kanungo DP, Arora MK, Sarkar S, Gupta RP (2006) A comparative study of conventional, ANN black box, fuzzy and combined neural and fuzzy weighting procedures for landslide susceptibility zonation in Darjeeling Himalayas. Eng Geol 85:347–366
Article Google Scholar
Kim C, Kim C-S (2021) Comparison of the performance of a hydrologic model and a deep learning technique for rainfall- runoff analysis. Trop Cyclone Res Rev 10:215–222
Article Google Scholar
Kim B, Sanders BF, Famiglietti JS, Guinot V (2015a) Urban flood modeling with porous shallow-water equations: a case study of model errors in the presence of anisotropic porosity. J Hydrol 523:680–692
Article Google Scholar
Kim M, Baek S, Ligaray M, Pyo J, Park M, Cho K (2015b) Comparative studies of different imputation methods for recovering streamflow observation. Water 7:6847–6860
Article Google Scholar
Kim J, Lee J, Park J, Kim S, Kim S (2021) Improvement of downstream flow by modifying SWAT reservoir operation considering irrigation water and environmental flow from agricultural reservoirs in South Korea. Water 13:2543
Article Google Scholar
Kim H, Parajuli PB (2012) Impacts of reservoir operation in the SWAT model calibration. 2012 ASABE Annual International Meeting Dallas, Texas July 29 – August1, 2012
Krause P, Boyle DP, Base F (2005) Comparison of different efficiency criteria for hydrologic model assessment. Adv Geosci 5:89–97
Article Google Scholar
Kumar S, Pandey KK, Ahirwar A (2024) Comparison of the performance of SWAT and hybrid M5P tree models in rainfall–runoff simulation. J Water Health 22(4):639–651
Article Google Scholar
Kwak J, St-Hilaire A, Chebana F (2017) A comparative study for water temperature modelling in a small basin, the Fourchue River, Quebec, Canada. Hydrol Sci J 62:64–75
CAS Google Scholar
Lee D, Lee G, Kim S, Jung S (2020) Future runoff analysis in the Mekong River Basin under a climate change scenario using deep learning. Water 12:1556
Article Google Scholar
Li G, Kawan B, Wang H, Zhang H (2017) Neural-network-based modelling and analysis for time series prediction of ship motion. Ship Technol Res 64:30–39
Article Google Scholar
Liang X, Lettenmaier DP, Wood EF, Burges SJ (1994) A simple hydrologically based model of land surface water and energy fluxes for general circulation models. J Geophys Res Atmos 99:14415–14428
Article Google Scholar
Liang Z, Tang T, Li B, Liu T, Wang J, Hu Y (2017) Long-term streamflow forecasting using SWAT through the integration of the random forests precipitation generator: case study of Danjiangkou Reservoir. Hydrol Res 49:1513–1527
Article Google Scholar
Lin T, Horne BG, Giles CL (1998) How embedded memory in recurrent neural network architectures helps learning long-term temporal dependencies. Neural Netw 11:861–868
Article Google Scholar
Lin T, Horne BG, Tino P, Giles CL (1996) Learning long-term dependencies in NARX recurrent neural networks. IEEE Transactions on Neural Networks 7(6):1329–1338
Article CAS Google Scholar
Liu Z, Todini E (2002) Towards a comprehensive physically-based rainfall-runoff model. Hydrol Earth Syst Sci 65:859–881
Article Google Scholar
Llanos-Paez O, Estrada L, Pastén-Zapata E, Boithias L, Jorda-Capdevila D, Sabater S, Acuña V (2023) Spatial and temporal patterns of flow intermittency in a Mediterranean basin using the SWAT+ model. Hydrol Sci J 68:276–289
Article Google Scholar
Makwana JJ, Tiwari MK (2017) Hydrological stream flow modelling using soil and water assessment tool (SWAT) and neural networks (NNs) for the Limkheda watershed, Gujarat, India. Model Earth Syst Environ 3:635–645
Article Google Scholar
Marhaento H, Booij MJ, Rientjes THM, Hoekstra AY (2017) Attribution of changes in the water balance of a tropical catchment to land use change using the SWAT model. Hydrol Process 31:2029–2040
Article Google Scholar
Menezes JMP, Barreto GA (2008) Long-term time series prediction with the NARX network: an empirical evaluation. Neurocomputing 71:3335–3343
Article Google Scholar
Mengistu AG, van Rensburg LD, Woyessa YE (2019) Techniques for calibration and validation of SWAT model in data scarce arid and semi-arid catchments in South Africa. J Hydrol: Reg Stud 25:100621
Google Scholar
Møller MF (1993) A scaled conjugate gradient algorithm for fast supervised learning. Neural Netw 6:525–533
Article Google Scholar
Moriasi DN, Gitau MW, Pai N, Daggupati P (2015) Hydrologic and water quality models: performance measures and evaluation criteria. Trans ASABE 58:1763–1785
Article Google Scholar
Mosavi A, Ozturk P, Chau K-w (2018) Flood prediction using machine learning models: literature review. Water 10:1536
Article Google Scholar
Moussa R, Chahinian N, Bocquillon C (2007) Distributed hydrological modelling of a Mediterranean mountainous catchment – model construction and multi-site validation. J Hydrol 337:35–51
Article Google Scholar
Nanda T, Sahoo B, Beria H, Chatterjee C (2016) A wavelet-based non-linear autoregressive with exogenous inputs (WNARX) dynamic neural network model for real-time flood forecasting using satellite-based rainfall products. J Hydrol 539:57–73
Article Google Scholar
Narsimlu B, Gosain AK, Chahar BR (2013) Assessment of future climate change impacts on water resources of Upper Sind River Basin, India using SWAT model. Water Resour Manag 27:3647–3662
Article Google Scholar
Nash JE, Sutcliffe JV (1970) River flow forecasting through conceptual models part I - a discussion of principles. J Hydrol 10(3):282–290
Article Google Scholar
Neitsch SL, Arnold JG, Kiniry JR, Williams JR (2005) Soil and water assessment tool theoretical documentation, Texas Water Resources Institute. College Station, Texas, USA
Google Scholar
Noori N, Kalin L (2016) Coupling SWAT and ANN models for enhanced daily streamflow prediction. J Hydrol 533:141–151
Article Google Scholar
Nunno FD, Granata F, Gargano R, Marinis G (2021) Prediction of spring flows using nonlinear autoregressive exogenous (NARX) neural network models. Environ Monit Assess 193:1–17
Article Google Scholar
Nunno FD, Granata F (2020) Groundwater level prediction in Apulia region (Southern Italy) using NARX neural network. Environ Res 190:110062
Ouyang W, Lawson K, Feng D, Ye L, Zhang C, Shen C (2021) Continental-scale streamflow modeling of basins with reservoirs: towards a coherent deep-learning-based strategy. J Hydrol 599:126455
Article Google Scholar
Özdoğan-Sarıkoç G, Sarıkoç M, Celik M, Dadaser-Celik F (2023) Reservoir volume forecasting using artificial intelligence-based models: artificial neural networks, support vector regression, and long short-term memory. J Hydrol 616:128766
Article Google Scholar
Peel MC, Finlayson BL, McMahon TA (2007) Updated world map of the Köppen-Geiger climate classification. Hydrol Earth Syst Sci 11:1633–1644
Article Google Scholar
Pham LT, Luo L, Finley A (2021) Evaluation of random forests for short-term daily streamflow forecasting in rainfall- and snowmelt-driven watersheds. Hydrol Earth Syst Sci 25:2997–3015
Article CAS Google Scholar
Phiri WK, Vanzo D, Banda K, Nyirenda E, Nyambe IA (2021) A pseudo-reservoir concept in SWAT model for the simulation of an alluvial floodplain in a complex tropical river system. J Hydrol: Reg Stud 33:100770
Google Scholar
Pisinaras V, Petalas C, Gikas GD, Gemitzi A, Tsihrintzis VA (2010) Hydrological and water quality modeling in a medium-sized basin using the Soil and Water Assessment Tool (SWAT). Desalination 250:274–286
Article CAS Google Scholar
Pohlert T, Huisman JA, Breuer L, Frede HG (2005) Modelling of point and non-point source pollution of nitrate with SWAT in the river Dill, Germany. Adv Geosci 5:7–12
Article Google Scholar
Pradhan P, Tingsanchali T, Shrestha S (2020) Evaluation of soil and water assessment tool and artificial neural network models for hydrologic simulation in different climatic regions of Asia. Sci Total Environ 701:134308
Article CAS Google Scholar
Prasad D, Naresh P, Srinivasulu A, Kyle RD-M, Rebecca WZ, Jaehak J, Prem BP, Dharmendra S, Mohamed AY (2015) A recommended calibration and validation strategy for hydrologic and water quality models Transactions of the ASABE. 58(6):1705–1719
RabezanaharyTanteliniaina MF, Rahaman MH, Zhai J (2021) Assessment of the future impact of climate change on the hydrology of the Mangoky River, Madagascar Using ANN and SWAT. Water 13:1239
Article Google Scholar
Rajat PA (2021) Calibration of hydrological models considering process interdependence: a case study of SWAT model. Environ Model Softw 144:1–14
Article Google Scholar
Rjeily YA, Abbas O, Sadek M, Shahrour I, Chehade FH (2017) Flood forecasting within urban drainage systems using NARX neural network. Water Sci Technol 76:2401–2412
Article Google Scholar
Schuol J, Abbaspour KC, Srinivasan R, Yang H (2008) Estimation of freshwater availability in the West African sub-continent using the SWAT hydrologic model. J Hydrol 352:30–49
Article Google Scholar
Sedighkia M, Abdoli A (2022) Design of optimal environmental flow regime at downstream of multireservoir systems by a coupled SWAT-reservoir operation optimization method. Environ Dev Sustain 39:1–14
Google Scholar
Shen HY, Chang LC (2013) Online multistep-ahead inundation depth forecasts by recurrent NARX networks. Hydrol Earth Syst Sci 17:935–945
Article Google Scholar
Shrestha RR, Nestmann F (2009) Physically based and data-driven models and propagation of input uncertainties in river flood prediction. J Hydrogic Eng 14:1309–1319
Article Google Scholar
Shresthaa MK, Recknagel F, Frizenschaf J, Meyer W (2016) Assessing SWAT models based on single and multi-site calibration for the simulation of flow and nutrient loads in the semi-arid Onkaparinga catchment in South Australia. Agric Water Manag 175:61–71
Article Google Scholar
Singh J, Knapp HV, Arnold JG, Demissie M (2005) Hydrological modeling of the iroquois river watershed using HSPF and SWAt. J Am Water Resour Assoc 41:343–360
Article Google Scholar
Sood A, Muthuwatta L, McCartney M (2013) A SWAT evaluation of the effect of climate change on the hydrology of the Volta River basin. Water Int 38(3):297–311
Article Google Scholar
Srivastava P, McNair J, Johnson T (2006) Comparison of process-based and artificial neural network approaches for streamflow modeling in an agricultural watershed. JAWRA J Am Water Resour Assoc 42:545–563
Article Google Scholar
Sungmin O, Emanuel D, Rene O (2020) Robustness of process-based versus data-driven modeling in changing climatic conditions. J Hydrometeorol 21:1929–1944
Article Google Scholar
Tan ML, Gassman PW, Srinivasan R, Arnold JG, Yang X (2019) A review of SWAT studies in Southeast Asia: applications, challenges and future directions. Water 11:914
Article Google Scholar
Thodsen H, Farkas C, Chormanski J, Trolle D, Blicher-Mathiesen G, Grant R, Engebretsen A, Kardel I, Andersen HE (2017) Modelling nutrient load changes from fertilizer application scenarios in six catchments around the Baltic Sea. Agriculture 7:41
Article Google Scholar
Todini E (2007) Hydrological catchment modelling: past, present and future. Hydrol Earth Syst Sci 11:468–482
Article Google Scholar
Tsai W-P, Chang F-J, Chang L-C, Herricks EE (2015) AI techniques for optimizing multi-objective reservoir operation upon human and riverine ecosystem demands. J Hydrol 530:634–644
Article Google Scholar
Tübitak Marmara Research Center (2010) `Environment Institute, Preparation of Watershed Protection Action Plans-Yeşilırmak Basin (Havza Koruma Eylem Planlarının Hazırlanması-Yeşilırmak Havzası), Tubitak Marmara Research Center, Gebze
Uniyal B, Jha MK, Verma AK, Anebagilu PK (2020) Identification of critical areas and evaluation of best management practices using SWAT for sustainable watershed management. Sci Total Environ 744:140737
Article CAS Google Scholar
Valeh S, Motamedvairi B, Kiadaliri H, Ahmadi H (2021) Hydrological simulation of Ammameh basin by artificial neural network and SWAT models. Phys Chem Earth Parts A/B/C 123:103014
Article Google Scholar
Wagena MB, Goering D, Collick AS, Bock E, Fuka DR, Buda A, Easton ZM (2020) Comparison of short-term streamflow forecasting using stochastic time series, neural networks, process-based, and Bayesian models. Environ Model Softw 126:104669
Article Google Scholar
Wang H, Khayatnezhad M, Youssefi N (2022) Using an optimized soil and water assessment tool by deep belief networks to evaluate the impact of land use and climate change on water resources. Concurr Comput: Pract Experience 34:6807
Wang R, Kalin L (2011) Modelling effects of land use/cover changes under limited data. Ecohydrology 4:265–276
Article Google Scholar
Wu Y, Liu S, Abdul-Aziz OI (2012) Hydrological effects of the increased CO2 and climate change in the Upper Mississippi River Basin using a modified SWAT. Clim Chang 110:977–1003
Article Google Scholar
Wunsch A, Liesch T, Broda S (2018) Forecasting groundwater levels using nonlinear autoregressive networks with exogenous input (NARX). J Hydrol 567:743–758
Article Google Scholar
Yang T, Asanjan AA, Faridzad M, Hayatbini N, Gao X, Sorooshian S (2017) An enhanced artificial neural network with a shuffled complex evolutionary global optimization with principal component analysis. Inf Sci 418:302–316
Article Google Scholar
Yang S, Yang D, Chen J, Zhao B (2019) Real-time reservoir operation using recurrent neural networks and inflow forecast from a distributed hydrological model. J Hydrol 579:124229
Article Google Scholar
Yang H, Sun H, Jia C, Yang T, Yang X (2024) Future climatic projections and hydrological responses with a data driven method: a regional climate model perspective. Water Resour Manag 38:1693–1710
Article Google Scholar
Yaseen ZM, El-shafie A, Jaafar O, Afan HA, Sayl KN (2015) Artificial intelligence based models for stream-flow forecasting: 2000–2015. J Hydrol 530:829–844
Article Google Scholar
Yuzer EO, Bozkurt A (2022) Deep learning model for regional solar radiation estimation using satellite images. Ain Shams Eng J 14(8)
Zakizadeh H, Ahmadi H, Zehtabian G, Moeini A, Moghaddamnia A (2020) A novel study of SWAT and ANN models for runoff simulation with application on dataset of metrological stations. Phys Chem Earth Parts A/B/C 120:102899
Article Google Scholar
Zhang N, He HM, Zhang SF, Jiang XH, Xia ZQ, Huang F (2011) Influence of reservoir operation in the upper reaches of the Yangtze River (China) on the inflow and outflow regime of the TGR-based on the improved SWAT model. Water Resour Manag 26:691–705
Article Google Scholar
Zhang D, Lin J, Peng Q, Wang D, Yang T, Sorooshian S, Liu X, Zhuang J (2018) Modeling and simulating of reservoir operation using the artificial neural network, support vector regression, deep learning algorithm. J Hydrol 565:720–736
Article Google Scholar
Zhang Y, Qi J, Pan D, Marek GW, Zhang X, Feng P, Liu H, Li B, Ding B, Brauer DK, Srinivasan R, Chen Y (2022) Development and testing of a dynamic CO2 input method in SWAT for simulating long-term climate change impacts across various climatic locations. J Hydrol 614:128544
Article CAS Google Scholar

Download references

Funding

Open access funding provided by the Scientific and Technological Research Council of Türkiye (TÜBİTAK). We would like to thank Erciyes University Research Fund (FDK- 2020–10451) for the financial support.

Author information

Authors and Affiliations

Department of Vegetable and Animal Production, Suluova Vocational School, Amasya University, Amasya, Turkey
Gülhan Özdoğan-Sarıkoç
Department of Environmental Engineering, Erciyes University, Kayseri, Turkey
Filiz Dadaser-Celik

Authors

Gülhan Özdoğan-Sarıkoç
View author publications
You can also search for this author in PubMed Google Scholar
Filiz Dadaser-Celik
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Gülhan Özdoğan-Sarıkoç (GOS) and Filiz Dadaser-Celik (FDC) contributed to the study design. Data collection and analysis were performed by GOS. The first draft of the manuscript was written by GOS. FDC read and revised the manuscript. All authors (GOS, FDC) read and approved the final manuscript.

Corresponding author

Correspondence to Filiz Dadaser-Celik.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

All authors mutually agreed to publish the work in this journal.

Conflict of ınterest

The authors declare no competing interests.

Additional information

Responsible Editor: Marcus Schulz

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Highlights

• The data-driven model performed better than the physically based model.

• Comparing physically based and data-driven models is challenging due to their different nature.

• Data-driven models provide an alternative to physically based models under data-scarce conditions.

• The physically based model adapts better and represents basin processes.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Özdoğan-Sarıkoç, G., Dadaser-Celik, F. Physically based vs. data-driven models for streamflow and reservoir volume prediction at a data-scarce semi-arid basin. Environ Sci Pollut Res 31, 39098–39119 (2024). https://doi.org/10.1007/s11356-024-33732-w

Download citation

Received: 07 January 2024
Accepted: 16 May 2024
Published: 29 May 2024
Issue Date: June 2024
DOI: https://doi.org/10.1007/s11356-024-33732-w

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Physically based vs. data-driven models for streamflow and reservoir volume prediction at a data-scarce semi-arid basin

Abstract

Similar content being viewed by others

Modeling streamflow in Sot river catchment of Uttar Pradesh, India

Prediction of daily reservoir inflow using atmospheric predictors

Physically-Based Streamflow Predictions in Ungauged Basin with Semi-Arid Climate

Introduction