Pitfalls in hydrologic model calibration in a data scarce environment with a strong seasonality: experience from the Adyar catchment, India

Tigabu, Tibebe B.; Wagner, Paul D.; Narasimhan, Balaji; Fohrer, Nicola

doi:10.1007/s12665-023-11047-2

Pitfalls in hydrologic model calibration in a data scarce environment with a strong seasonality: experience from the Adyar catchment, India

Original Article
Open access
Published: 20 July 2023

Volume 82, article number 367, (2023)
Cite this article

Download PDF

You have full access to this open access article

Environmental Earth Sciences Aims and scope Submit manuscript

Pitfalls in hydrologic model calibration in a data scarce environment with a strong seasonality: experience from the Adyar catchment, India

Download PDF

Tibebe B. Tigabu¹,
Paul D. Wagner¹,
Balaji Narasimhan² &
…
Nicola Fohrer¹

We’re sorry, something doesn't seem to be working properly.

Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.

Abstract

Process-based hydrologic models can provide necessary information for water resources management. However, the reliability of hydrological models depends on the availability of appropriate input data and proper model calibration. In this study, we demonstrate that common calibration procedures that assume stationarity of hydrological processes can lead to unsatisfactory model performance in areas that experience a strong seasonal climate. Moreover, we develop a more robust calibration procedure for the Soil and Water Assessment Tool (SWAT) in the Adyar catchment of Chennai, India. Calibration was carried out based on seasonal decomposition and by successively shifting the calibration period. Daily and monthly streamflow records were used to investigate how these different calibration procedures influence model parameterization. Results show that SWAT model performance improved when calibrated after separating the streamflow into wet and dry seasons. The wet season calibration increased the Kling Gupta Efficiency coefficient and Nash–Sutcliffe Efficiency coefficient values from 0.56 to 0.68 and 0.19 to 0.51, respectively, compared to calibration based on wet and dry seasons together. In addition, when calibration time periods were shifted, resultant sets of model parameter values and performance metrics differed. Calibration based on the 2004–2009 period resulted in an overestimation of streamflow by 8.2%, whereas the overestimation was 12.1%, 18.3%, and 20.0% for the 2004–2010, 2004–2011, and 2004–2012 periods, respectively. This study underlines that both the availability of observed streamflow data and the way these data are applied to calibration have a strong impact on model parameterization and performance.

Hydrological modeling in the Upper Blue Nile basin using soil and water analysis tool (SWAT)

Article 19 January 2021

Modeling streamflow in Sot river catchment of Uttar Pradesh, India

Article 16 September 2023

Performance evaluation of hydrological model in simulating streamflow and water balance analysis: spatiotemporal calibration and validation in the upper Awash sub-basin in Ethiopia

Article 21 February 2023

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Process-based numerical hydrologic models are rapidly evolving to help understand and predict potential impacts of population growth, land use, and climate change on water resources (Athira et al. 2021; Koycegiz and Buyukyildiz 2019; Paniconi and Putti 2015; Arnold et al. 2012). These hydrologic models can be used to develop and evaluate management practices to mitigate the effects of climate change and select the best set of policies for implementation. Hydrologic models are also required to predict extreme events such as floods and droughts that are common in many parts of the world including India. However, the reliability and robustness of hydrological models depend on the availability of appropriate input data and proper model calibration (Chen et al. 2023; Bucak et al. 2017)). In many parts of the world, lack of appropriate input data is a major constraint for hydrologic modeling (Abbaspour et al. 2019). Scarcity of long-term daily climatic and hydrologic data is a challenge for the application of advanced hydrological modeling techniques in water resources management (Alipour and Kibler 2019). Furthermore, heterogeneity in the climatic and hydrologic data restricts the applicability of scientific tools to monitor changes relevant to the hydrologic cycle (WMO 2009). Therefore, reasonably calibrated hydrologic models are still very important tools to predict streamflow for ungauged catchments and to fill data gaps in streamflow records (Alipour and Kibler 2019; WMO 2009). The other challenge of hydrologic modeling is calibration of model parameters under a specific assumption and data scarce conditions (Nguyen et al. 2022; Abbaspour et al. 2019). This indicates that getting unconditional model parameters independent of calibration procedures and objective functions is difficult.

Many model calibration tools and guidelines have been developed for the hydrologic model SWAT (Soil and Water Assessment Tool). Nguyen et al. (2022) introduced an open-source R-based tool called R-SWAT for parameter calibration, sensitivity analysis, uncertainty analysis, and visualization of SWAT outputs. Likewise, Pfannerstill et al. (2014a, b) developed R-scripts for SWAT model parameter calibration, validation, and sensitivity analyses. Their scripts were designed to generate a set of parameter values based on Latin Hypercube Sampling from the R-package FME (Soetaert and Petzoldt 2010) and for each parameter set a model run is performed. In addition, Abbaspour (2005) introduced SWAT–CUP which aims to perform parameter calibration, sensitivity analysis, and uncertainty analysis with the SWAT model. However, SWAT–CUP is proprietary software, and freely available versions have some user restrictions, for example, parallel simulation options are restricted (Nguyen et al. 2022).

Though there are multiple options to calibrate hydrologic models, limiting model uncertainty is still a challenging task. In most cases model outputs for streamflow are compared to corresponding measured data with the assumption that all error variance is contained within the predicted values and that observed values are error free (Moriasi et al. 2007). However, observed data are never error free (Abbaspour et al. 2017). Another common assumption during hydrologic model calibration is the stationarity of processes, i.e., the means and variances of model input data and physical processes do not vary over time. Calibration, therefore, leads to a single set of calibrated model parameters. However, physical processes are dynamic in space and time, so the prediction of outputs with a single parameter set is questionable. The non-stationary behavior of hydrologic processes is often neglected which may result in uncertainties in model predictions based on a single set of parameters derived under the assumption of stationary condition (Mondal et al. 2016). According to Abbaspour et al. (2017), no calibration can lead to a single set of model parameters, because hydrological processes are modeled by a range of mathematical equations. A single set of model parameters could be achieved only for a given set of conditions depending on the calibration procedures and objective functions. Another source of model uncertainty could be inappropriate representation of hydrological model parameters. A single model parameter may influence one or several hydrological processes (Guse et al. 2020a; b), and model outputs could be associated with uncertainty if model parameters are not identified and adopted properly (Pluntke et al. 2014). Finally, hydrologic models that are evaluated against an aggregate measure of a single statistical value are problematic as it is difficult to identify why measured and modeled output variables differ. The use of multi-criteria performance metrics (e.g., with flow duration curve) can improve model evaluation.

A recent study by Lakshmi and Sudheer (2021) attempted to apply a new way of calibration for Cedar Creek and Riesel Y2 watersheds of the USA to address limitations of seasonal calibration based on the SWAT model. The authors partitioned the simulation period into different clusters and performed calibrations for each cluster group. Here, parameters sets were identified for each independent cluster group and the results revealed improved streamflow estimation. However, Lakshmi and Sudheer (2021) applied seasonal clustering on a single simulation period which might not sufficiently depict the effects of multi-year variability of hydrological processes on model parameter optimization.

Therefore, a calibration approach that leverages multiple objective functions, multiple variables, multiple parameter sets, multiple evaluation techniques, using data that includes wet, average, and dry years and many model simulation periods may reduce uncertainty and improve hydrologic model accuracy (Guse et al. 2020a, b; Lu et al. 2015; Ewen 2011; Moriasi et al. 2007). In this study, we intended to calibrate and validate the hydrological model SWAT for the Adyar catchment, India, using daily streamflow data by considering seasonal differences. Due to data scarcity and distinct seasonality the catchment is a challenging case for model calibration. Accordingly, the current study introduces new hydrologic model calibration procedures that address the effect of temporal variability (non-stationarity of hydrologic processes) and seasonal differences on model parameter selection. Specific objectives include: (i) to develop a more robust calibration procedure using different calibration periods and (ii) to investigate if distinct seasons lead to different model performances and model parameter set.

Materials and methods

Study area

The Adyar catchment is part of the Chennai basin and is located in the south of Chennai city, India (Fig. 1). Adyar river enters the Chennai District at Nandambakkam after flowing approximately 9 km through the metropolitan area in a total of its 15 km stretch in the suburbs of Chennai (Kumar et al. 2019). The Adyar river flows along the southern part of Chennai city and enters the Bay of Bengal (Fig. 1). It originates from the three tank groups, Kallampadu, Pillapakkam, and Kavanur. There are three major tanks called Chembarambakkam, Sriperumpudur, and Pillapakkam in the Adyar catchment. The total catchment area is about 828.8 km² and its land use is characterized by rapid urbanization (Nithila Devi et al. 2020). The upper Adyar catchment comprises about 75% of the catchment and is important with regard to flood control. Chembarambakkam tank has an annual average discharge volume of 89.4 million cubic meters at Kattipara (Nithila 2020), which is roughly 40% of streamflow in the catchment. The Adyar river is seasonal and reaches its peak streamflow for a few days during the northeast monsoon rainy season. Due to urbanization the Adyar River is heavily polluted.

The Adyar catchment has a tropical wet and dry climate feature (Kumar et al. 2019). The catchment receives rain from South–West and North–East monsoons (Sudheer et al. 2016). The October, November, and December months contribute most of the North–East monsoon rainfall, and the topographic depression of the Bay of Bengal causes rainfall to occur in the form of one or two annual cyclones (Kiram and Kumar 2017). Based on rainfall data analyses from three rain gauge stations (Korattur Anicut, Sriperumbudur, and Chembarambakkam) between the years 2000 and 2019, about 55% of annual average rainfall in the Adyar basin upstream of Chembarambakkam occurred during the North–East monsoon, which lasts from October to December. The annual rainfall for the North–East monsoon over the period 2000–2019 varies from 238 mm (2003) to 1895 mm (2015). During the same period, the annual rainfall for the winter and west monsoon seasons together (between January to September) varies from 281 to 921 mm. The long-term annual average rainfall of the modelling catchment (part of Adyar catchment upstream of Chembrabakkam) over the years is 1260 mm. In addition to rainfall variability, the mean maximum and minimum temperature differs from month to month. As evident from the average monthly maximum and minimum temperature, May was the warmest month over the period between 2000 and 2019. Its average maximum and minimum temperature values were 38.6 °C and 27.2 °C, respectively. The lowest average maximum temperature value occurred in December with a value of 29.4 °C and the lowest average minimum temperature occurred in January with a value of 19.8 °C. The annual average maximum and minimum temperatures values were 33.9 °C and 23.8 °C, respectively. Factors that control seasonal patterns of rainfall and temperature include the Madden–Julian Oscillation (MJO) and the inter-tropical convergence zone (Dey et al. 2022).

Database

Daily measured rainfall data at three weather stations (Korattur Anicut, Sriperumbudur, and Chembarambakkam) and gridded temperature (maximum and minimum) data with spatial resolutions of 0.5° by 0.5° from 2000 to 2019 were available from the India Meteorology Department (IMD 2019). Furthermore, from 2004 to 2020, daily streamflow of the Adyar River upstream of the Chembarambakkam reservoir were provided by the Tamil Nadu State Water Resources Department (TWRD 2020) for calibration and validation of the SWAT model simulation. Spatial data including soil, land use land cover maps, and topographic information (digital elevation model) were used to setup the hydrologic model (Fig. 2, Table 1). The shuttle radar topography mission (SRTM) 1 arc-second global digital elevation model (DEM) data with 30 m by 30 m resolution was available from the US Geological Survey website (USGS 2018). A soil map of the scale of 1:50,000 from Tamil Nadu Agricultural University (TNAU 2018) and land use classification for the year 2016 derived from Sentinel-1 and Sentinel-2 data (Fig. 2) (Steinhausen et al. 2018) were used. In general, the spatial, weather, and hydrologic data were used to setup and calibrate the SWAT model. All sources and their basic features are presented in Table 1.

Table 1 Spatial and hydrometeorological data sources and availability used this study

Full size table

SWAT model setup

The semi-distributed, time continuous eco-hydrologic model SWAT (Soil and Water Assessment Tool) was used for this study (Arnold et al. 1998; Arnold and Fohrer 2005). The SWAT model is one of the most popular process-based models widely used to simulate catchment hydrological processes globally including the Indian Subcontinent (e.g., Dash et al. (2023); Singh and Saravanan (2020); Wagner et al. (2019); Demirel et al. (2009)). SWAT can provide spatially distributed outputs for water balance components based on hydrological response units (HRUs). For this study, one SWAT model was setup for the Adyar catchment in which Chembarambakkam reservoir is located (Fig. 1). ArcSWAT 2012 was used to delineate the watershed and compile SWAT input files. SWAT2012 version 664 was used to run the simulation. The Food and Agricultural Organization (FAO) guideline was followed in classifying slope classes as 0–2% (foot slope), 2–5% (gentle sloping), 5–8% (sloping), 8–15% (strongly sloping), and > 15% (moderately steep to very steep) (Jahn et al. 2006). The catchment was divided into eleven sub-basins. Then, we used spatially distributed land use (thirteen different classes, Fig. 2D), ten soil (Fig. 2C), and five slope classes (Fig. 2A) to generate hydrologic response units (HRUs): each unique combination of land use, soil type, and slope class defined an HRU within every subbasin (Neitsch et al. 2011). All possible combinations of land use, soil and slope layers were used to setup the HRUs without applying any thresholds to derive a model setup with high spatial precision (Her et al. 2015). This approach resulted in 3066 HRUs.

Calibration and performance matrices

Three years (from 2001 to 2003) were considered a warm-up period to define appropriate initial conditions and to maintain equilibrium conditions for the model (Noori and Kalin 2016). Daily measured inflow discharge to Chembarambakkam Lake were available for the period from 2004 to 2020, and these data were applied using different calibration approaches to yield a reasonable hydrologic representation of the catchment. According to Moriasi et al. (2007), ultimate model calibration should be done with sufficient data length to include wet and dry years and encompass a range of hydrologic events. Thus, model calibration was attempted for different combinations of measured inflow discharge to Chembarambakkam:

1.
All inflow discharges considered for daily time scale
2.
Extreme inflow values excluded for daily time scale
3.
Daily time scale inflows only for wet season (Oct to Dec)
4.
Monthly inflows (wet season)
5.
Only wet season of wet years (annual rainfall > long-term mean) for daily time scale
6.
Dry years of wet season (annual rainfall < long-term mean) for daily time scale

Because the land use/cover input data used to setup the SWAT model was from 2015, the calibration period included years between 2015 and 2019 and validation was carried out for the period 2004–2008. Prior to calibration, key SWAT model input parameters that cause changes on the hydrograph of inflow to the reservoir were identified through parametric sensitivity runs. Here, we applied changes on the value of a parameter, ran the model, and evaluated the responses of the streamflow under different values. Based on this, ten parameters were considered for model calibration. Five thousand different parameter combinations were sampled using a Latin Hypercube sampling approach in R environment using package FME (Soetaert and Petzoldt 2010). Then, the default SWAT model input parameter values were altered based on values sampled using Latin Hypercube sampling for each calibration run inside the R environment (R Core Team 2013). Model performances for all 5000 model iterations were evaluated based on measured streamflow using the hydroGOF package in R (Zambrano-Bigiarini 2017). The numerical goodness-of-fit of the simulated versus measured streamflow were evaluated in different segments of the flow duration curve based on Pfannerstill et al. (2014a, b); and Haas et al. (2017). Kling Gupta Efficiency Coefficient (KGE) was the objective function used to identify the parameter combination that yielded the best fit between simulated and observed streamflow data. Furthermore, the values of Nash–Sutcliffe Efficiency (NSE), Percent Bias (PBIAS), and Standardized Root Mean Square Error (RSR) were also considered in evaluating the performance of the model. KGE was used as the main objective function instead of NSE for model calibration as it improves the bias, the over/under estimation of simulations, and the variability measures significantly (Gupta et al. 2009). For this reason, daily and monthly model performances were evaluated based on the optimum KGE values.

In most hydrologic models, selecting optimum statistical values for the given objective functions is based on the general assumption of time-invariant model parameters and catchment characteristics (Abebe et al. 2010). However, catchment characteristics and model parameters are time-variant, and uncertainty associated with model parameters are assumed to vary with time. Thus, model validation, land use change, and climate change impact studies using parameters that were determined during the calibration process may not be enough to produce reliable predictions. Antecedent and climatic conditions of the basin cause variability of hydrologic processes through time, and precipitation is one of the primary hydrologic model inputs that influences the parameter optimization processes (Lakshmi and Sudheer 2021). Moriasi et al. (2007) also reported that data used to simulate hydrologic processes affect calibration and validation of hydrologic models. Therefore, we applied successive calibration using different time periods (1-year time shift) of simulated and observed time series discharge to see how model parameters vary for different time periods. Moreover, the best calibrated model was also validated for different time periods by moving the validation period 1 year ahead to see the changes in model efficiency. Figure 3 summarizes all step-by-step modelling steps.

Results

Model parameters and performance for different calibration data sets

Due to seasonal variability and the influence of peak flows, the daily base calibration resulted in relatively poor performance (KGE = 0.56, NSE = 0.19, and PBIAS = 14.5%) compared to the wet monsoon and monthly values (Table 2, Figs. 4, 5). Particularly, the NSE value for the daily base calibration indicates a weak performance, but the positive NSE value indicates the model is at least better than using the mean value of all measurements (Moriasi et al. 2007). Similarly, KGE and PBIAS are in the acceptable range.

Table 2 SWAT model performances when calibration was carried out by comparing observed against simulated daily streamflow (extreme values included, extreme values excluded, monsoon season records, wet years, dry years, and monthly time steps are considered). The NSE and PBIAS values are acceptable based on Moriasi et al. (2007) except for the NSE value of Scen-6

Full size table

The model performance for the daily streamflow was improved when the calibration was carried out for North–East monsoon season (October to December). Both the (KGE = 0.67) and (NSE = 0.58) values were higher as compared to values from the daily base calibration. Likewise, the PBIAS and RSR values were improved for the seasonal calibration. In addition to statistical indices, the good fit between observed and simulated streamflow was also evaluated graphically based on Pfannerstill et al. (2014a); Pfannerstill et al. (2014b) and Haas et al. (2017) using the flow duration curves and hydrographs (Fig. 5).

Although modeled and observed streamflow matched well for the wet monsoon season during the calibration and validation periods, model efficiency tended to decrease for dry seasons and years (Table 2). Therefore, the performance of the SWAT model was separately evaluated for dry and wet years. Dry years are those years that received a total annual rainfall less than the long-term mean annual rainfall, and in wet years the total annual rainfall is greater than the long-term mean (Fig. 6). The performance of the model during dry years was worse than during wet years, NSE (dry years = − 0.59, wet years = 0.58). Figure 7 shows portions of model performances from the 5000 iterations for the 6 scenarios mentioned in Table 2. Simulations of streamflow for the north–east monsoon season, wet years, and monthly time steps better reproduced observed streamflow for both calibration and validation periods. It is not surprising that streamflow in dry years and dry seasons is more difficult to model due to water withdrawal and insufficient observations of streamflow and the data are in general relatively hard to handle. Aggregation of the data to monthly values resulted in a better performance, although there were differences for some months (Table 2).

Temporal variability of the SWAT model performance

One of the objectives for this study was to investigate the variability of hydrologic model parameters and the efficiency of the SWAT model under time-variant conditions. Model parameter selection and performance indices differ when different time periods were used for streamflow calibration. When the SWAT model was calibrated for the period 2004–2009, the simulated streamflow was overestimated by 8.2% compared to observed data, whereas the overestimations were 12.10%, 18.3%, and 20.0%, for the periods 2004–2010, 2004–2011, and 2004–2012, respectively. However, fitted model parameters for all time periods were the same. Likewise, the calibration periods of 2004–2013 and 2004–2014 had the same model parameters and simulated streamflow was overestimated by 20.7% and 22.1%, respectively. These results indicate that model uncertainties such as overestimation/underestimation could be attributed to the temporal dynamics of model input variables. We also found that the length of calibration period influenced model efficiency. The efficiency of the model was improved when the calibration period was increased from 6 to 8 years. On average, the KGE value was 0.27 for 6-year calibration period, whereas it was 0.4 for 8-year calibration. It is worth remarking that the variability of model performance reflected the time shifts of hydrologic response variables in the SWAT model, but these changes in model performance are not statistically significant.

Conventionally, hydrologic model calibration and validation are carried out under the assumption that the hydrologic parameters and the degree of uncertainties are consistent over time. However, the results of this study indicate that the degree of uncertainty and the model parameters could vary through time due to the temporal variability of the input parameters. When calibration was carried out for different periods with a 1-year time lag, but the same length of records, the hydrologic model parameters and evaluation criteria showed differences. Efficiencies of the SWAT model were decreasing when the time lag was increasing. For example, initially the calibration was based on the period of 2015–2019, and the optimum value of the objective function KGE and NSE were 0.68 and 0.59, respectively, and the simulated streamflow was overestimated. Even though statistical indices computed for different validation periods differ, the model performances are reasonable in most cases (Table 3). When a time lag of 1 year was applied for the validation periods, there were slight differences on the KGE, NSE, PBIAS and RSR values. The overall analyses indicated that model performance for different validation periods show increasing variability when there is a larger time lag between two different periods. For example, the KGE values were 0.68 and 0.60 when we validated the model for the period 2004 to 2007 and 2009 to 2012, respectively (Table 3). From this, we conclude that model uncertainty could not be stationary over time and this non-stationarity could be related to temporal differences of observed streamflow, precipitation, and air temperature between earlier and later periods.

Table 3 Performance of a calibrated SWAT model for monsoon season (October to December) streamflow when it is validated for different periods (model parameters remain uniform)

Full size table

Variability of streamflow for different validation periods

Due to the temporal differences of the model input variables, the quality of the streamflow simulation over the study catchment showed considerable differences. These differences were reflected by the statistical indices that were calculated when the model was validated for different periods. The calibrated SWAT model well-represented the recent monsoon season streamflow from the year 2012 onwards. Although reasonable model performances were achieved for the years before 2012 to reproduce the monsoon season streamflow, in most cases, the statistical indices were relatively lower when compared to the values calculated for the periods after the year 2012 (Table 3). The lower performance indices for the years before 2012 may be associated with the land use/cover data and the time of calibration. Static land use/cover data, generated in 2015 by Steinhausen et al. 2018, was used to construct the SWAT model. As hydrologic processes are highly sensitive to land use changes, streamflow simulated under such static land use/land cover conditions can show disparities from the observed one (Wagner et al. 2019). Güse et al. (2015) also concluded that dynamic modeling leads to a more realistic temporal development of land use change and its impacts on hydrologic processes. In addition, the SWAT model was calibrated for the period 2015 to 2019, which might not be sufficient to represent the temporal hydrologic variability that caused differences in the model performance for the periods before and after 2012. Our results indicated that the monsoon season streamflow was underestimated for the period 2005 to 2011, overestimated during 2011 to 2014, and again underestimated during 2015 to 2019.

General water balance

Modelling the water balance components of a catchment is of particular importance in catchments, where measured data are scarce. This study presented simulated water balance components for the upstream part of Adyar catchment. According to the simulated outputs of a calibrated SWAT model, the dominant hydrologic process is evapotranspiration. About 61% of the annual rainfall is lost to the atmosphere as actual evapotranspiration, while the annual catchment water yield accounts for 23% the annual rainfall. Due to the high evapotranspiration, the surface runoff share is only 11% of the total rainfall. Baseflow contributes about 40% to the streamflow. Our results revealed that water losses are more pronounced during the North− East monsoon season (October to December). About 56%, 85%, and 62% of the annual rainfall, surface runoff, and water yield, respectively, occurred in the monsoon season. Thus, availability of surface water flow during the other months of the year (January to September) is very limited.

Impacts of parameter variability on simulated water balance components

Water balance components including surface runoff, groundwater contribution to the streamflow, actual evapotranspiration, and water yield differences were analyzed for different parameter sets. Regardless of the influence of time period, the choice of model parameter influences the simulation of the water balance components. All the major water balance components responded differently for different optimum parameter sets. This is important, because these components were not measured, rather we inferred them from simulations. In this regard, water balance components for the first five best runs (Table 4) from a total of 5000 were analyzed to understand changes of annual water balances with different parameter combinations. Changes in the model parameter values influence the annual water balance components. For example, reducing the CN2 value from − 12 to − 14 decreases the surface runoff and water yield by 5.6% and 11%, respectively. Likewise, a 0.2 increase on the value of EPCO causes an increase of actual evapotranspiration by 4%, and an increase of gw_revap (groundwater revap coefficient) by 0.02 resulted a reduction of groundwater recharge by 7.5%. Groundwater discharge from the shallow aquifer to the streamflow was decreased when the gw_revap values were increased, whereas the actual evapotranspiration was increased. Thus, model parameter values that were optimized during calibration were largely responsible for the differences in the simulated water balance components. In addition, the seasonal variability on climate input variables also causes differences of water balance components. These different calibrations of model parameters are worthy to propose multiple water management planning options, because most of the hydrologic processes are not often measured directly but are instead simulated as part of the water balance components. Through these simulations it becomes clear that the water balance components are varying with parameter changes, even though the objective functions are on a similar level. Figure 8 illustrates the variability of annual mean water balance components for the five best runs from 5000 iterations.

Table 4 Parameter values for the first five best simulations (SIM1, SIM2, SIM3, SIM4, and SIM5) out of 5000 simulation and the corresponding model performances

Full size table

Figures 9 and 10 show the spatial variations of simulated ET and water yield under different parameters sets (Table 4). With reference to the simulation outputs under SIM1, changes in water yield showed high variability, while changes in ET were small, varying between − 8% and 10% (Fig. 10A). Changes in water yield varied between − 130% and 20%. In particular, SIM4 showed a strong deviation in water yield from the other simulations (Fig. 10B).

Moreover, the average annual simulated water balance components varied when model parameters were calibrated for different lengths of calibration periods. This was verified by comparing model outputs for the entire simulation period (2004–2019) after applying sequential calibrations for different periods. The differences were due to the impact of temporal variability of the model input and output variables in selecting model parameters values that resulted from the best optimum KGE. For example, there were 15% decreases in the annual average surface runoff and water yield, 13% and 122% increases on actual evapotranspiration and lateral flow, respectively, and 18% decrease on total aquifer recharge when we applied model parameter sets calibrated (1) for the 2004–2014 period and (2) for the 2004–2015 period. These results indicate that a 1-year difference of calibration period caused a large variation in the selection of best parameter combination, which resulted in considerable changes to major water balance components. Therefore, model calibration and validation periods should include as many years as possible to depict variability in the data set and model.

Discussion

The accuracy of hydrological models depends on accurate representation of model parameters and use of appropriate input data. In this study the influence of seasonality and temporal variability in hydrologic model calibration were analysed by testing different modelling periods and seasonal decomposition. Considering different modelling periods can help to understand how non-stationarity of hydrologic processes influences model calibration and parameters selection procedures, while seasonal decomposition often minimizes model uncertainties that could be masked by aggregating seasonal processes. Our results show that model parameter values and model efficiencies obtained under non-stationary conditions are considerably different compared to calibration under stationary conditions. It is worth remarking that the variability of model performance was reflected by modelled water balance components in the SWAT model, though changes in model performance are not statistically significant. These differences in the values of model parameters and model efficiency are attributed to temporal dynamics of model input variables. This implies that multiple calibration and validation options would be necessary to account for non-stationarity of hydrologic processes. Our results showed remarkable differences in model performance when a 1-year shift was applied to the calibration and validation periods. The changes of the model efficiency increased when the time shift increased between two validation periods (Table 4). Not only the model efficiency varied but also the spatial distribution of water fluxes varied (Figs. 9, 10). This indicates that the selection of calibration and validation periods influence the selection of model parameters as well as the performance and results of the model, which agrees with Myers et al. (2021).

Conventionally, hydrologic model calibration and validation are carried out under the assumption that hydrologic parameters and the degree of uncertainties are consistent over time. In this perspective, hydrologic models are calibrated and validated under fixed periods that end with a single set of parameters. However, single set of parameters could be valid to represent processes that are stationary in nature. Abbaspour et al. (2017) also argued that no calibration can lead to a single parameter set or a single model output as physical processes are modeled based on continuum equations and one can only measure a limited amount of data. Therefore, multiple sets of calibrated model parameters and evaluation are needed to simulate multiple processes (Moriasi et al. 2007). In this study, we also found that the length of calibration period influenced the model efficiency. The SWAT model efficiency was improved when the calibration period was increased from 6 to 8 years. On average, the KGE value was 0.27 for 6-year calibration period, whereas it was 0.4 for 8-year calibration.

When calibration was carried out for different periods with a 1-year time lag, but the same length of records, the hydrologic model parameters and evaluation criteria showed differences due to temporal effect. Efficiencies of the SWAT model decreased when the time lag increased. Similarly, there were slight differences in the KGE, NSE, PBIAS, and RSR values when a time lag of 1 year was applied for the validation periods (Table 4). The overall analyses indicated that the model performances for different validation periods show high variability when there is a larger time lag between two different periods.

We found improved model performances of the SWAT model simulations for the wet season (North–East monsoon) and wet years, while our model performances were poor for the wet and dry seasons and years together (Table 2). Furthermore, the SWAT model performance was pronouncedly improved when it was calibrated for wet years only. Our findings are, therefore, in agreement with Gao et al. (2018) who found that separating wet and dry years of monthly runoff strongly improved the efficiency of the SWAT model compared to their calibration results for the wet years and dry years. Similarly, Lakshmi and Sudheer (2021) applied discretization and clustering of simulation periods into different seasons to improve streamflow simulation with SWAT. Their findings showed that calibrating SWAT based on distinct seasons improved streamflow simulation. The authors applied clustering of the simulation periods into sub-annual periods based on the local climatic conditions and calibrated model parameters for the distinct clusters. Their results showed that the SWAT model performance based on NSE was improved from 0.68 to 0.72.

Conclusion

In this study, we examined a new methodological approach to calibrate and improve the efficiency of the SWAT model in a data scarce environment with a strong seasonality. We applied seasonal decomposition (wet and dry seasons) and successive time shifts of the simulation periods based on measured streamflow and performed multiple model efficiency evaluation of the SWAT model for the Adyar catchment, India. The main aim of this research was to investigate how time invariant and seasonal variability of hydrologic processes affect model parametrizations. Based on our findings, the following conclusions are drawn:

1.
Optimization of hydrologic model parameters for successively shifted periods of time resulted in different sets of model parameters. This indicates that non-stationary behaviors of hydrologic processes influence parameter selection during calibration and multiple sets of hydrologic model parameters might be necessary rather than using only a single set of parameter values that are calibrated under fixed calibration and validation periods. We recommend researchers test different calibration periods both in length and timing to assess whether timing has a strong effect on model performance in the studied catchment. At least a simple switch of calibration and validation period should be tested.
2.
The degree of uncertainty associated with model parameters can be minimized when a calibration or validation period includes more years.
3.
Seasonality of hydrologic processes has considerable effects on the SWAT model capability to represent hydrologic processes in the Adyar catchment. As streamflow in the Adyar catchment is dominated by winter monsoon rainfall, calibrating the SWAT model for wet and dry seasons together was not efficient. However, model calibration for winter monsoon and wet years resulted in a reasonable model performance. This is the season that dominates the hydrologic cycle and is most important to be accurately depicted in a model. Therefore, we conclude that understanding the seasonal behavior of a catchment is important prior to calibrating a hydrologic model.

Data availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

References

Abbaspour KC, Vaghefi SA, Srinivasan R (2017) A guideline for successful calibration and uncertainty analysis for soil and water assessment: a review of papers from the 2016 international SWAT conference. Water 10(1):6
Article Google Scholar
Abbaspour KC, Vaghefi SA, Yang H, Srinivasan R (2019) Global soil, landuse, evapotranspiration, historical and future weather databases for SWAT Applications. Sci Data 6(1):263
Article Google Scholar
Abbaspour KC (2005) Calibration of hydrologic models: when is a model calibrated. In: MODSIM 2005 international congress on modelling and simulation. modelling and simulation society of Australia and New Zealand, pp. 2449–12455
Abebe NA, Ogden FL, Pradhan NR (2010) Sensitivity and uncertainty analysis of the conceptual HBV rainfall–runoff model: implications for parameter estimation. J Hydrol 389(3–4):301–310
Article Google Scholar
Alipour MH, Kibler KM (2019) Streamflow prediction under extreme data scarcity: a step toward hydrologic process understanding within severely data-limited regions. Hydrol Sci J 64(9):1038–1055
Article Google Scholar
Anushiya J, Ramachandran A (2015) Assessment of water availability in Chennai basin under present and future climate scenarios. In: Environmental management of river basin ecosystems. Springer, Cham, pp. 397–415.
Arnold JG, Fohrer N (2005) SWAT2000: current capabilities and research opportunities in applied watershed modelling. Hydrol Process 19:563–572. https://doi.org/10.1002/hyp.5611
Article Google Scholar
Arnold JG, Srinivasan R, Muttiah RS, Williams JR (1998) Large area hydrologic modelling and assesment Part I: model development. JAWRA J Am Water Resour Assoc 34:73–89. https://doi.org/10.1111/j.1752-1688.1998.tb05961
Article Google Scholar
Arnold JG, Moriasi DN, Gassman PW, Abbaspour KC, White MJ, Srinivasan R, Santhi C, Harmel RD, Van Griensven A, Van Liew MW, Kannan N (2012) SWAT: model use, calibration, and validation. Trans ASABE 55(4):1491–1508
Article Google Scholar
Athira P, Sudheer KP (2021) Calibration of distributed hydrological models considering the heterogeneity of the parameters across the basin: a case study of SWAT model. Environ Earth Sci 80(4):1–18
Article Google Scholar
Balakrishnan T (2008) District groundwater brochure, Chennai District, Tamil Nadu, Ministry of Water Resources, Central Ground Water Board, South Eastern Coastal Region, Chennai. P15
Bucak T, Trolle D, Andersen HE, Thodsen H, Erdoğan Ş, Levi EE, Filiz N, Jeppesen E, Beklioğlu M (2017) Future water availability in the largest freshwater Mediterranean lake is at great risk as evidenced from simulations with the SWAT model. Sci Total Environ 581:413–425
Article Google Scholar
Chen Y, Faramarzi M, Gan TY, She Y (2023) Evaluation and uncertainty assessment of weather data and model calibration on daily streamflow simulation in a large-scale regulated and snow-dominated river basin. J Hydrol 617:129103
Demirel MC, Venancio A, Kahya E (2009) Flow forecast by SWAT model and ANN in Pracana basin, Portugal. Adv Eng Softw 40(7):467–473
Article Google Scholar
Dey A, Chattopadhyay R, Joseph S, Kaur M, Mandal R, Phani R, Sahai AK, Pattanaik DR (2022) The intraseasonal fluctuation of Indian summer monsoon rainfall and its relation with monsoon intraseasonal oscillation (MISO) and Madden Julian oscillation (MJO). Theoret Appl Climatol 148(1–2):819–831
Article Google Scholar
Ewen J (2011) Hydrograph matching method for measuring model performance. J Hydrol 408(1–2):178–187
Article Google Scholar
Gao X, Chen X, Biggs TW, Yao H (2018) Separating wet and dry years to improve calibration of SWAT in Barrett Watershed, Southern California. Water 10(3):274
Article Google Scholar
Gupta HV, Kling H, Yilmaz KK, Martinez GF (2009) Decomposition of the mean squared error and NSE performance criteria: implications for improving hydrological modelling. J Hydrol 377(1–2):80–91
Article Google Scholar
Guse B, Pfannerstill M, Fohrer N (2015) Dynamic modelling of land use change impacts on nitrate loads in rivers. Environ Process 2(4):575–592
Article Google Scholar
Guse B, Pfannerstill M, Gafurov A, Kiesel J, Lehr C, Fohrer N (2017) Identifying the connective strength between model parameters and performance criteria. Hydrol Earth Syst Sci 21(11):5663–5679
Article Google Scholar
Guse B, Pfannerstill M, Fohrer N, Gupta H (2020a) Improving information extraction from simulated discharge using sensitivity‐weighted performance criteria. Water Resour Res 56(9):e2019WR025605
Guse B, Pfannerstill M, Kiesel J, Strauch M, Volk M, Gupta H, Fohrer N (2020b) Simulated sensitivity time series and model performance in three german catchments
Haas MB, Guse B, Pfannerstill M, Fohrer N (2016) A joined multi-metric calibration of river discharge and nitrate loads with different performance measures. J Hydrol 536:534–545. https://doi.org/10.1016/j.jhydrol.2016.03.001
Article Google Scholar
Her Y, Frankenberger J, Chaubey I, Srinivasan R (2015) Threshold effects in HRU definition of the soil and water assessment tool. Trans ASABE 58(2):367e378
Huang Y (2014) Comparison of general circulation model outputs and ensemble assessment of climate change using a Bayesian approach. Global Planet Change 122:362–370
Article Google Scholar
IMD (Indian Meteorological Department) (2018) Meteorological Centre: Season’s Rainfall. IMD: Season’s Rainfall 1986–2010 Meteorological Centre, Tamil Nadu https://imdtv.
Kiran S, Kumar NV (2017) Flood mapping analysis of Chennai city using geomatics. IJRTER 03:369–377. https://doi.org/10.23883/IJRTER.2017.3368.NTRBV
Article Google Scholar
Koycegiz C, Buyukyildiz M (2019) Calibration of SWAT and two data-driven models for a data-scarce mountainous headwater in semi-arid Konya closed basin. Water 11(1):147
Article Google Scholar
Kumar P, Dasgupta R, Ramaiah M, Avtar R, Johnson BA, Mishra BK (2019) Hydrological simulation for predicting the future water quality of Adyar River, Chennai, India. Int J Environ Res Public Health 16(23):4597
Article Google Scholar
Lakshmi G, Sudheer KP (2021) Parameterization in hydrological models through clustering of the simulation time period and multi-objective optimization based calibration. Environ Model Softw 138:104981
Article Google Scholar
Lu Z, Zou S, Xiao H, Zheng C, Yin Z, Wang W (2015) Comprehensive hydrologic calibration of SWAT and water balance analysis in mountainous watersheds in northwest China. Phys Chem Earth Parts a/b/c 79:76–85
Article Google Scholar
Mondal A, Narasimhan B, Sekhar M, Mujumdar PP (2016) Hydrologic modelling. Proc Indian Natl Sci Acad Part A Phys Sci 82(3):817–832
Google Scholar
Moriasi DN, Arnold JG, Van Liew MW, Bingner RL, Harmel RD, Veith TL (2007) Model evaluation guidelines for systematic quantification of accuracy in watershed simulations. Trans ASABE 50(3):885–900
Article Google Scholar
Munoth P, Goyal R (2020) Impacts of land use land cover change on runoff and sediment yield of Upper Tapi River Sub-Basin, India. Int J River Basin Manag 18(2):177–189
Article Google Scholar
Myers DT, Ficklin DL, Robeson SM, Neupane RP, Botero-Acosta A, Avellaneda PM (2021) Choosing an arbitrary calibration period for hydrologic models: how much does it influence water balance simulations? Hydrol Process 35(2):e14045
Article Google Scholar
Neitsch SL, Arnold JG, Kiniry JR, Williams JR (2011) Soil and water assessment tool theoretical documentation version 2009. Texas Water Resources Institute
Nguyen TV, Dietrich J, Dang TD, Tran DA, Van Doan B, Sarrazin FJ, Abbaspour K, Srinivasan R (2022) An interactive graphical interface tool for parameter calibration, sensitivity analysis, uncertainty analysis, and visualization for the Soil and Water Assessment Tool. Environ Model Softw 156:105497
Article Google Scholar
Nithila Devi N, Sridharan B, Bindhu VM, Narasimhan B, Bhallamudi SM, Bhatt CM, Usha T, Vasan DT, Kuiry SN (2020) Investigation of role of retention storage in tanks (small water bodies) on future urban flooding: a case study of Chennai City, India. Water 12(10):2875
Article Google Scholar
Noori N, Kalin L (2016) Coupling SWAT and ANN models for enhanced daily streamflow prediction. J Hydrol 533:141–151
Article Google Scholar
Paniconi C, Putti M (2015) Physically based modeling in catchment hydrology at 50: Survey and outlook. Water Resour Res 51(9):7090–7129
Article Google Scholar
Pfannerstill M, Guse B, Fohrer N (2014a) A multi-storage groundwater concept for the SWAT model to emphasize nonlinear groundwater dynamics in lowland catchments. Hydrol Process 28:5599–5612. https://doi.org/10.1002/hyp.10062
Article Google Scholar
Pfannerstill M, Guse B, Fohrer N (2014b) Smart low flow signature metrics for an improved overall performance evaluation of hydrological models. J Hydrol 510:447–458. https://doi.org/10.1016/j.jhydrol.2013.12.044
Article Google Scholar
Pluntke T, Pavlik D, Bernhofer C (2014) Reducing uncertainty in hydrological modelling in a data sparse region. Environ Earth Sci 72(12):4801–4816
Article Google Scholar
Soetaert K, Petzoldt T (2010) Inverse modelling, sensitivity and Monte Carlo analysis in R using package FME. J Stat Softw 33:1–28
Article Google Scholar
Srinivasan V, Gorelick SM, Goulder L (2010) A hydrologic‐economic modeling approach for analysis of urban water supply dynamics in Chennai, India. Water Resour Res https://doi.org/10.1029/2009WR008693
Steinhausen MJ, Wagner PD, Narasimhan B, Waske B (2018) Combining Sentinel-1 and Sentinel-2 data for improved land use and land cover mapping of monsoon regions. Int J Appl Earth Obs Geoinf 73:595–604
Google Scholar
Stocker TF, Qin D, Plattner GK, Alexander LV, Allen SK, Bindoff NL, Bréon FM, Church JA, Cubasch U, Emori S, Forster P (2013) Technical summary. In Climate change 2013: the physical science basis. Contribution of Working Group I to the Fifth Assessment Report of the Intergovernmental Panel on Climate Change. Cambridge University Press, Cambridge, pp. 33–115
Sudheer KP, Narasimhan B, Nambi I, Steinburc F (2016) Sustainable water resources management of Chennai basin under changing climate and land use. IGCS Technical Report, IIT Madras, Chennai, India
Suriya S, Mudgal BV (2012) Impact of urbanization on flooding: the Thirusoolam sub watershed–A case study. J Hydrol 412:210–219
Article Google Scholar
Taylor KE, Stouffer RJ, Meehl GA (2012) An overview of CMIP5 and the experiment design. Bull Am Meteor Soc 93(4):485–498
Article Google Scholar
Team RDC (2013) R Foundation for Statistical Computing: Vienna. Austria2007
Shah HL, Mishra V (2016) Hydrologic changes in Indian subcontinental river basins (1901–2012). J Hydrometeorol 17(10):2667–2687
Article Google Scholar
Thrasher B, Maurer EP, McKellar C, Duffy PB (2012) Bias correcting climate model simulated daily temperature extremes with quantile mapping. Hydrol Earth Syst Sci 16(9):3309–3314
Article Google Scholar
TNAU (Tamil Nadu Agricultural University) (2018) Spatial data of soil at the scale of 1: 50,000. Coimbatore, Tamil Nadu, India
TNWRD (Tamil Nadu Water Resources Department) (2020) Surface and Ground Water Data Center, Tamil Nadu State
USGS (US Geological Survey) (2018) Shuttle Radar Topography Mission (SRTM) 1 arc-second global Digital Elevation Model (DEM), USGS EarthExplorer http://earthexplorer.usgs.gov. Accessed 03 Aug 2018
Wagner PD, Bhallamudi SM, Narasimhan B, Kumar S, Fohrer N, Fiener P (2019) Comparing the effects of dynamic versus static representations of land use change in hydrologic impact assessments. Environ Model Softw 122:103987
Article Google Scholar
WMO (2009) Guide to Hydrological Practices–Volume I: Hydrology–From Measurements to Hydrological Information (WMO-No. 168)
Zambrano-Bigiarini M (2017) Package ‘hydroGOF’. Goodness-of-fit functions for comparison of simulated and observed

Download references

Acknowledgements

The authors are thankful to the Indo-German center for sustainability (IGCS) and the DAAD for the financial support during the research period. We are also thankful to the department of Civil and Environmental engineering, Indian Institute of Technology Madras (IITM) for their support by providing relevant data to the study. We are very grateful to Yara Pasner, for proofreading the manuscript. The authors thank the editors and the two anonymous reviewers for their helpful comments. We also acknowledge the financial support received from Department of Science and Technology (DST), Government of India, Grant Number DST/CCP/CoE/141/2018(G).

Funding

Open Access funding enabled and organized by Projekt DEAL. This research received funding from the Department of Science and Technology (DST), Government of India, Grant Number DST/CCP/CoE/141/2018(G) and from the Indo-German-Center for Sustainability (IGCS).

Author information

Authors and Affiliations

Department of Hydrology and Water Resources Management, Institute of Natural Resource Conservation, CAU, 24118, Kiel, Germany
Tibebe B. Tigabu, Paul D. Wagner & Nicola Fohrer
Department of Civil Engineering, Indo-German Centre for Sustainability, Indian Institute of Technology Madras, Chennai, 600036, India
Balaji Narasimhan

Authors

Tibebe B. Tigabu
View author publications
You can also search for this author in PubMed Google Scholar
Paul D. Wagner
View author publications
You can also search for this author in PubMed Google Scholar
Balaji Narasimhan
View author publications
You can also search for this author in PubMed Google Scholar
Nicola Fohrer
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors contributed to the study conception and design. TT, PW, and BN contributed for material preparation and data collection. Data analyses were performed by TT. The first draft of the manuscript was written by TT and PW modified the first draft of the manuscript. NF and BN critically reviewed the modified manuscript and was completed by TT. NF and PW read and approved the final manuscript.

Corresponding author

Correspondence to Tibebe B. Tigabu.

Ethics declarations

Conflict of interest

The authors declare that they have no relevant financial or non-financial interests to disclose.

Ethical approval

The authors declare that all ethical responsibilities of authorship were applied.

Consent to publish

The authors have consented to the submission of the manuscript to the journal.

Consent to participate

Informed consent was obtained from all individual authors included in the study.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Tigabu, T.B., Wagner, P.D., Narasimhan, B. et al. Pitfalls in hydrologic model calibration in a data scarce environment with a strong seasonality: experience from the Adyar catchment, India. Environ Earth Sci 82, 367 (2023). https://doi.org/10.1007/s12665-023-11047-2

Download citation

Received: 02 September 2022
Accepted: 05 July 2023
Published: 20 July 2023
DOI: https://doi.org/10.1007/s12665-023-11047-2

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Pitfalls in hydrologic model calibration in a data scarce environment with a strong seasonality: experience from the Adyar catchment, India

Abstract

Similar content being viewed by others

Hydrological modeling in the Upper Blue Nile basin using soil and water analysis tool (SWAT)

Modeling streamflow in Sot river catchment of Uttar Pradesh, India

Performance evaluation of hydrological model in simulating streamflow and water balance analysis: spatiotemporal calibration and validation in the upper Awash sub-basin in Ethiopia

Introduction