Introduction

Precise measurement of precipitation at global and regional scale are acute for understanding the climate and hydrological cycle, simulating land surface hydrological processes, water resources management (Stage et al. 2017). As reported by Habib et al. (2014), satellite rainfall may be used as alternative database for hydrological modeling. Yet, authors also informed evaluations are subjected to random and systematic errors. Hence, these errors should be either removed or minimized before we used in hydrological model simulation.

Variations in precipitation often influenced by nonlinear relations between numerous factors like local differences of topography, the orientation of mountains and aspects (Haile et al. 2009) which clues directly to flexible discharge output (Biemans 2012) and fluctuations in water storage (Narjary and Kamra 2013). Finally, this leads to incorrect simulation results and sometimes even in wrong conclusions (Vrgut et al. 2005). Precipitation data can be collected directly through ground-based observations using rain gauges and weather radars (Zhang et al. 2014) and indirectly through satellites (Lopez et al. 2017). Still, ground rainfall stations in many parts of the world and most parts of Ethiopia are very sparse and unevenly distributed (Ayehu et al. 2017). Thus, precipitation estimation is unavoidably subject to error due to different factors depending on the type of measurement (Alemohammed et al. 2015). To overcome some of the above-mentioned restrictions of ground-based precipitation measurements, space-based (satellite) estimations of rainfalls provide a hopeful alternative source (Bigiarini et al. 2017). Satellite-based precipitation estimates may offer information with spatiotemporal (near-real-time) high-resolution over wide-ranging regions wherever conventional rainfall data are unusual or absent (Kidd and Huffman 2011). Since combined precipitation products mostly have more improved quality than individual data set (Xie and Xiong 2011), different emerged rainfall products (reanalysis gauge and satellite gauge) are gradually being developed (Duan et al. 2017). It is principally worth nothing that the recently released Climate Hazards Group Infra-Red Precipitation with Station data (CHIRPS_2) provides rainfall at the finest spatial resolution of 0.05° and 0.25° (Funk et al. 2015).

At diverse times many academics have been carried out a study to assess the performance of satellite precipitation estimates for hydrological modeling in different regions of the world including Ethiopia and indicated that high-resolution satellite rainfall products have prospective use for hydrological modeling based on their product types. Later, all the above evaluations did not include Dabus watershed of Ethiopia which is rugged in topography and have more sparsely in situ rain gauge networks. Moreover, most of the studies were limited to satellite precipitation products of early version and only event-based analysis for examples, in (Tesfaye et al. 2017). Besides, in Dabus watershed the influences of water withdrawal, land cover and climate change have not been quantified yet due to scarce precipitation data for rainfall-runoff modeling.

As a result, this study concentrates on evaluating satellite rainfall products of CHIRPS_2 and TMPA_3B42v7 estimate by using in situ rain gauge products based on their hydrological simulation result as well as identification of sensitive parameters of model outputs when calibrated and validated with input-specific satellite precipitation estimates. The HEC-HMS hydrological model was used to simulate the stream flow for both rain gauge and satellite rainfall products. Biases between satellite rainfall estimates and observed data are identified and then used to correct satellite rainfall products by using several methods, like precipitation threshold, scaling approach, power transformation, distribution transfer, precipitation model and empirical correction methods. In this study a power transformation method was applied for satellite rainfall products. For each case of bias correction steps, it was intended to match the most important statistics coefficient of variation and mean on a scale of months.

The main objective of this research is (1) to evaluate CHIRPS_2 and TMPA_3B42v7 satellite rainfall, (2) to evaluate the ability of satellite rainfall products to characterize rainfall patterns and capture the magnitude of rainfall, (3) to assess the skill of satellite rainfall products as an input into a hydrological model for stream flow simulation and (4) to evaluate the performance of HEC-HMS model using satellite and ground-based precipitation products.

Description of the study area

The Dabus River is a north-flowing tributary of the Abay River in southwestern Ethiopia. The river is known for its continuous flow even during the dry seasons, which attributes to the existence of 600–900 km2 area of a swamp and fed by several tributaries that originate from the southwestern and central parts of Wollega. The major tributaries of Dabus River are Dilla and Keshmando which covers drainage area of 21,032 km2, and it locates between 9°0′0″–11°50′0″ N latitude and 34°30′0″–35°58′40″ E longitude within the Upper Blue Nile Basin. Geographically the course defines not only part of the border between Benishangul-Gumuz and Oromia regions but also the entire shared boundary of the Assosa and Kamashe zones of the Benishangul-Gumuz region, 664 km from Addis Ababa, Ethiopia. The study area contains 22 Weredas with 206,000 populations which elevation is varied from 879 to 3149 m (Fig. 1).

Fig. 1
figure 1

Location map of Dabus watershed

Data set

Meteorological precipitation products

In this study we used in situ-based meteorological data at different timescales (precipitation, evapotranspiration, relative humidity, wind speed, temperature and solar radiation) collected from Ethiopian National Meteorological Agency (NMA), for a period of 16 years (2000–2015) from 12 station, and out of those stations, only nine stations have complete data. Dabus River stream flow data were collected from Ministry of Water and Energy (MoWE). Two satellite rainfall products CHIRPS-2 and TMPA-3B42v7 were extracted from (http://chg.geog.ucsb.edu/data/chirps/) and (https://pmm.nasa.gov/), respectively.

Satellite precipitation products

CHIRPS-2 and TMPA-3B42v7

The Climate Hazards Group Infra-Red Precipitation with Station data (CHIRPS version 2) is a global daily, pentadal (5), monthly precipitation product, and a third-generation rainfall procedures clearly designed for monitoring agricultural drought and global environmental change over land (Funk et al. 2015), which was developed by the US Geological Survey and the Climate Hazards Group at the University of California, Santa Barbara, in 2015. CHIRPS_2 rainfall products mainly organized in support of drought-related issue in Africa (Climate Hazard Group 2016), principally for Ethiopia, Afghanistan or Haiti (Funk et al. 2015). CHIRPS product which belongs to the ‘satellite-gauge’ precipitation source category combines remotely sensed precipitation of geosynchronous and polar-orbiting satellites, from five different most recommended satellite rainfall products, with more than 2000 station records to calibrate global Cold Cloud Duration (CCD) precipitation estimates (Funk et al. 2015). The product structures at a spatial resolution of 0.05° and 0.25° from 50° S to 50° N across all longitudes with more than 30-year final monthly rainfall records (1981–present). The daily CCD measurements and daily CFS measurements are finally used to disaggregate the 5-daily products to daily rainfall estimates using a simple redistribution method which data are available at: http://chg.geog.ucsb.edu/data/chirps/ with resolution of 0.05° and 0.25°.

The Tropical Rainfall Measuring Mission (TRMM) Multisatellite Precipitation Analysis (TMPA) is a joint US and Japan satellite mission, which was launched in 1997 to monitor tropical and subtropical precipitation data and to estimate its related latent heating covering the latitude band 50° N–50° S. It also intended to provide a best estimate of quasi-global precipitation data from a wide variety of modern satellite-born rainfall-related sensors. Its rainfall estimates are provided at relatively high spatial resolution (0.25° × 0.25°) and 3-hourly time stapes, in both real and postreal time to meet a wide range of research needs. The latest version of TMPA_3B42v7 rainfall product was released in June 2012, and recent studies indicated that 3B42v7 estimates improve upon 3B42v6 (Bigiarini et al. 2017; Chen et al. 2013). Nowadays, after more than 17 years of data collection, the instruments turned off on April 8, 2015, after the spacecraft exhausted its fuel reserves and replaced by Global Precipitation Measurement (GPM) mission. For this thesis work TMPA_3B42v7 daily data were used and can be easily accessed from (https://pmm.nasa.gov/).

Materials and methods

Rainfall

The Dabus watershed has three different seasons: a raining summer (June to September). For this study area about (70–90)% of total rainfall occurs during this season which is also characterized by minimum levels of sunshine and high relative humidity. A dray season lasts from October to February characterized by maximum sunshine and low relative humidity. Minor rainy season (Belg) lasts from March to May. In general, the study area falls within the climatic classification of Tropical Climate I according to Copen–Geiger–Pohl system with uni-modal rainfall distribution. The average monthly rainfall distribution in the study area is shown in Fig. 2.

Fig. 2
figure 2

Average monthly rainfall distribution of the study area (2000–2015)

Temperature

The maximum and minimum temperature in the watershed varies between 17–33 °C and 10–18 °C, respectively. The average monthly maximum and minimum temperature in the study area is shown in Figs. 3 and 4, respectively.

Fig. 3
figure 3

Average maximum monthly temperature of the study area (2000–2015)

Fig. 4
figure 4

Average minimum monthly temperature of the study area (2000–2015)

Stream flow

Dabus is one of the main rivers which contribute a significant amount of flow to Upper Blue Nile basin. Due to this reason, the Ethiopian Ministry of Water Resources installed gauging station at the downstream of the river and it has long-term daily observations since 1976. Based on the recorded data from 2000 to 2015 the watershed has an average annual daily flow of 161 m3/s. The monthly average discharge at Dabus watershed gauging station is shown in Fig. 5.

Fig. 5
figure 5

Monthly average discharge of Dabus gauging station (2000–2015)

Land use land cover

The land cover of the Dabus watershed is mainly characterized by woodlands, and relatively small part of the sub-basin is cultivated. Based on the National Land Use Land Cover Data (NLCD) classification, the watershed is covered by cultivated agricultural land, grass land, wet land, water bodies and forest land. The sources of land use land cover data are MoWIE and FAO land use data set 2013 with grid resolution 30 arc seconds (1 km × 1 km) which was extracted from http://www.fao.org/news/story/en/item/216144/icode/ or http://www.fao.org/geonetwork) (Fig. 6).

Fig. 6
figure 6

Land use land cover map of the study area

Soil

The Dabus watershed contains different soil classes; however, the most common soil types are Hablic Alisols, Hablic Nitisols, Rhodic Nitisols and Eutric Fluvisols. The sources of soil data are FAO land use data set 2013 with grid resolution 30 arc seconds (1 km × 1 km) which was extracted from FAO, HWSD (Harmonized World Soil data) data set (org/soils-portal/soil-survey/soil) and MoWIE (Fig. 7).

Fig. 7
figure 7

Soil map of the study area

Data set

In order to calibrate and validate the HEC-HMS hydrological model, different input data such as hydrological, meteorological and geospatial data including stream flow, rainfall, land use land cover, DEM and soil data are required. The existing observed meteorological and hydrological data were collected from various sources. The meteorological data set such as rainfall, maximum temperature, minimum temperature, wind speed and relative humidity was collected on a daily timescale from the Ethiopian National Meteorological Agency (NMA) for a period of 2000–2015. The precipitation data set used for simulation of the HEC-HMS hydrological model in the study area has been obtained from in situ rain gauge, Tropical Rainfall Measuring Mission (TRMM) and Climate Hazards Group Infra-Red Precipitation with Station data (CHIRPS).

Data analysis

After collecting the essential information, filling the missing data and checking the quality of the hydrometeorological data is needed. The percentage of missing data within the time series of each hydrological and meteorological variable was evaluated. Missing of different data may be results from lack of appropriate records, shifting of station location and processing. In this study, inverse distance weighting (IDW) was applied for filling the missing data. IDW estimation is more subjective by neighboring measurements than far-away measurements.

$$ P\left( x \right) = \mathop \sum \limits_{i = 1}^{n} \frac{{w_{i} p_{i} }}{{\mathop \sum \nolimits_{i} w_{i} }} $$

where wi = \( \frac{1}{{di^{n} }} \), n is the number of neighboring stations, d is the distance between station with missed data to that of nearby station having data, P is recorded data in the nearby, P(x) is station with missed data and i is station identification. Additionally, the quality of stream flow and rainfall data also was evaluated.

Checking homogeneity of meteorological station

Homogeneity analysis is used to identify a change in the statistical properties of time series data which is caused by either natural or man-made factors. For this study the optional recommended method to apply homogeneity has been tested with respect to neighboring stations. The none dimensionalizing of the month’s value is carried out as;

$$ P_{i} = \frac{{\overline{{P_{1} }} }}{{\bar{P}}} \times 100\% $$

where Pi = none-dimensional value of precipitation for month i, \( \bar{P}_{i} \) = over year averaged monthly precipitation at the station, i, and \( \bar{P} \) = the over year average yearly rainfall of station (Fig. 8).

Fig. 8
figure 8

Homogeneity test for the selected nine stations in the study area

As shown in the above figure the maximum precipitation occurs between May and September in all station which shows the homogeneity of the stations.

Checking consistency of meteorological stations

In the process of checking recording of the in situ rain gauge stations has undergone a significant change during the period of record with various reasons; inconsistency would arise in the precipitation data of the station. This inconsistency of the recorded data for this study was done by the double mass curve technique (Fig. 9).

Fig. 9
figure 9

Double mass curves for the selected meteorological stations

The graph of cumulative data of one variable verses the cumulative data of related variable is almost straight line so long as the relation between the variable is fixed ratio. Accordingly the nine meteorological stations selected for this study were consistent since the graph of the plot forms straight line without break.

Stream flow homogeneity test

Rainbow is a software tool designed to study agrometeorological and hydrological records by means of a frequency analysis and to test the homogeneity of the records. It offers a test of stream flow homogeneity, which is based on the cumulative deviations from the mean.

$$ {\text{Sk}} = \mathop \sum \limits_{i = 1}^{k} \left( {xi - \bar{x}} \right) $$

where xi are the records from the series x1, x2xn and \( \bar{x} \) is the mean of the records. To test the homogeneity (uniformity) of data set, the cumulative deviations are often rescaled. When the deviation cross one of the horizontal lines the homogeneity of the data set is rejected with, respectively, 90, 95 and 99% probability. As presented in Fig. 10 the rescaled cumulative deviations from the mean would not crossed one of the horizontal 90, 95 and 99% probabilities lines. Therefore, for these studies the range of cumulative deviation could not be rejected on 90%, 95% and 99% probability levels which show the homogeneity of the annual data series and assures that the observations are almost from the same population.

Fig. 10
figure 10

Rescaled cumulative deviations from the mean for the total annual stream flow of Dabus gauging station

Processing of satellite rainfall products

The satellite rainfall products used in this study were acquired on a Network Common Data Form (NetCDF) gridded format from January 1, 2000, to December 31, 2015. These files create a URL list which was saved to specific location on the local computer as ‘myfile.nc’ (NetCDF file extension). Then, in order to extract the rainfall data for the study area and to export the data into excel for each pixel, Panoply NetCDF data viewer was used. Panoply is a graphical visualization tool which is capable to produce two-dimensional plots of geographically referenced data. It can plot longitude–latitude arrays as global or regional maps, plot zonal average curves of longitude–latitude statistics, combine two sets of data by differencing, summing or averaging and use a variety of 40 color tables and over 75 map projections. Most applications within panoply deal Net-CDF file format. Statistical indices like mean, standard deviation and coefficient of variation were used for evaluating the performance of the selected satellite precipitation products besides in situ rain gauge measurements. This helps to get an overall impression of the performance of the selected satellite rainfall estimates in the study periods and area.

Bias correction of satellite rainfall products

The term bias correction describes the process of scaling climate model output in order to justification for systematic errors in the climate models. The basic principle is that biases between satellite rainfall estimates and observed data are identified and then used to correct satellite rainfall products by using several methods.

Like, precipitation threshold, scaling approach, power transformation, distribution transfer, precipitation model and empirical correction methods, in this study, a power transformation method was applied for satellite rainfall products. For each case of bias correction steps, it was intended to match the most important statistics (coefficient of variation and mean) on a scale of months.

Because the bias in precipitation was found to vary spatially, bias corrections were carried out for the whole study area based on areal data. Leander and Buishand (2007) were used a power transformation method, to correct the CV (coefficient of variation) and the mean of rainfall data. In this nonlinear correction method each daily precipitation amount P is transformed to a corrected P* using:

$$ P^{*} = \, a \, \times \, P^{b} $$

The determination of the ‘b’ parameter is done iteratively. It was determined such that the CV of the corrected daily satellite rainfall matches the CV of the observed daily rainfall. In this way, the CV is only a function of parameter b according to: CV(P) = function (b); then, the parameter ‘a’ is determined such that the mean of the transformed daily values matches with the mean of observed rainfall data. The resulting parameter ‘a’ depends on value of ‘b,’ whereas the parameter b depends only on the CV and is independent of the value of parameter ‘a.’ Finally in order to compare performance of satellite and gauge rainfall data both should be the same in terms of spatial resolution. To do this the point data of gauge rainfall products were changed into areal rainfall data by using Thiessen polygon method.

Models setup

HEC-GeoHMS input terrain preprocessing

HEC-GeoHMS is a set of ArcGIS tools specially designed to process geospatial data and create input file for HEC-HMS hydrological model. Input data to HEC-HMS can be preprocessed using HEC-GeoHMS under GIS environment. In this study, HEC-GeoHMS and Arc-Hydro were used to develop river network of the watershed, to delineate sub-basins, and other watershed features that co-operatively describe the drainage patterns from the digital elevation model (DEM) of the basin. Figure 11 shows the Dabus watershed which was prepared using HEC-GeoHMS. Thus the total number of Dabus watershed was divided into 5 (five) sub-basins, depending on gauging station and large river meeting place with their basin characteristics.

Fig. 11
figure 11

HEC-HMS basin model representation of Dabus watershed

HEC-HMS model components

HEC-HMS model has four model components: basin model, meteorological model, control specification and input data (time series data, paired data and gridded data) components. The basin model contains information relevant to the physical attributes of the model, such as basin areas and river reach connectivity. Correspondingly, the meteorological model holds information related to rainfall data. The control specifications section comprises information related to the timing of the model such as when a storm happened and what type of time interval (second, minute, hour or day), want to use in the model. Finally, the input data component contains parameters or boundary conditions for basin and meteorological models. The main input data used for this study were satellite and in situ precipitation, observed stream flow and different basin characteristics (curve number, soil, LULC) resulting from HEC-GeoHMS process.

HEC-HMS model calibration and validation

Calibration

Model calibration is a systematic process of changing model parameter values until model results match acceptably the observed data. Calibration can be achieving in terms of either manual or automated. Further, the calibration estimates some model parameters that cannot be estimated by observation or measurement or have no direct physical meaning. This study used combination of manual and automated calibration method.

Validation

Model validation is the process of testing model capability to simulate observed data other than used for the calibration, with acceptable accuracy. Throughout this process, calibrated model parameters are not subject to adjustment, and their values are kept constant. Generally, in this thesis the model was calibrated for 10 years (2001–2010) and one-third of out of total time series data from (2011–2015) were used for validation. Additionally, 1-year time series data (2000) were used to initialize (warming-up) the model.

Description of HEC-HMS model calibration and validation methods

In SCS unit hydrograph method 37.5% of the runoff volume occurs before the peak flow, and the time base of hydrograph is five times the lag time (HEC-HMS user manual 2008). The SCS developed a correlation between the time of concentration (Tc) and the lag time (Tlag). The time of concentration can be estimated based on magnitude of the event and sub-basin characteristics including topography and the length of the reach (Kirpich’s formula).

$$ T_{\text{lag}} = \, 0.6 \, \times \, T_{\text{c}} $$

Time of concentration is quasi-physically based parameter that can be estimated as,

$$ T_{\text{C}} = t_{\text{sheet}} + \, t_{\text{shallow}} + \, t_{\text{channel}} $$

where TC is time of concentration in hr, tsheet = sum of travel of time sheet flow segment, tshallow = sum of travel time in shallow segment and tchannel = sum of travel time in channel. In recession method, relationships of Qt, the base flow at any time t, to an initial value are related as:

$$ Q_{\text{t}} = \, Q_{\text{O}} \times K^{\text{t}} $$

where Qt is the base flow at time t; QO is initial base flow (at time zero); and K is an exponential decay constant, defined as the ratio of base flow 1 day earlier.

Muskingum simulation routes the water through the reaches by combining parameters of ‘k’ and ‘x.’ K is the travel time of a flood wave passing through the reach, x is a measure of the degree of storage varied from (0 to 0.5), in which x = 0 means a level-pool reservoir or maximum storage, x = 0.5 means a pure transmission reach in which there are no storing effects. The value of k can be estimated as:

$$ K = \frac{L}{V} $$

where L is length of reach (m) and V is mean velocity (m/s). The value of k was first fixed by assuming velocity. In this study, to model the runoff process in each sub-basin and reach it was required to establish initial values of nine parameters like initial discharge, ratio to peak, recession constant, initial abstraction, constant flow rate, constant fraction, Muskingum k, Muskingum x, Muskingum subreach, and some parameters were imported from the basin model like basin lag and curve number.

Sensitivity analysis

Sensitivity analysis is a method to determine which parameters of the hydrological model have the highest impact on the model result. It ranks model parameter based on their influence to the overall error in the model predictions. The most sensitive parameters correspond to greater change in output response which information is vital during model calibration. According to Hann (2002) sensitivity analysis can be local or global. In the local sensitivity analysis, the significance of each input parameter is determined separately by keeping other model parameters constant. In the global sensitivity analysis all model inputs are allowed to vary over their ranges at the same time.

Hence, in this study a local sensitivity analysis was selected for assessing the event model. The final set of the parameters of the calibrated model was considered as baseline/nominal parameter set. Formerly, the model was run repeatedly with the starting point value for each parameter multiplied, in turn, by 0.2, 0.4, 0.6, 0.8, 1.2, 1.4 and 1.6 while keeping all other parameters constant at their nominal initial values. In addition to this, the hydrographs resulting from scenarios of adjusted model parameters were then compared with the baseline model hydrograph.

Model performance evaluations

The performance of model should be evaluated for the extent of its correctness, consistency and adaptability (Goswami 2005). In this study the HEC-HMS model performance was evaluated through visual examination of the simulated and observed hydrographs and through a set of objective functions that measure the goodness-of-fit between simulated and observed hydrograph. In addition to this, the model was evaluated using statistical measures to determine the quality and reliability of predictions when compared to observed values. Efficiency criteria such as Nash and Sutcliffe simulation efficiency (NSE), coefficient of determination (R2), percent error peak flow (PEPF), percentage (%) error in the total runoff volume (RVE) and percentage bias (PBIAS) were used to evaluate the model performance.

NSE and R2 were used to evaluate the model ability to reproduce the pattern of the observed hydrographs. The percent error peak flow measures the agreement between the magnitudes of observed and simulated peak values. The Nash–Sutcliffe coefficient of efficiency (NSE) is estimated by

$$ {\text{NSE}} = 1{-}\frac{{\mathop \sum \nolimits_{i = 1}^{n} \left( {Q_{{{\text{obs}}, i}} - Q_{{{\text{sim}},i}} } \right) ^{2} }}{{\mathop \sum \nolimits_{i = 1 }^{n} (Q_{{{\text{obs}},i}} - \bar{Q}_{{{\text{obs}},i}} ) ^{2} }} $$

where NSE is Nash–Sutcliffe coefficient of efficiency, Qobs,i is the observed discharge at the time step i, \( \bar{Q}_{\text{obs}} \) is the mean of the observed discharge, Qsim,i is the simulation discharge at the time step i, and n is the number of observations. NSE ranges between − ∞ and 1 with NSE = 1.0 being the target value.

Values between 0.6 and 1.0 are generally viewed as acceptable level of performance, but values NSE = 0 indicate that the mean observed value is a better predictor than the simulated value of hydrological data, which indicates unacceptable performance. As all terms are defined previously, the coefficient of determination (R2) is estimated by;

$$ R^{2} = \frac{{\left[ { \mathop \sum \nolimits_{i = 1}^{n} \left( {Q_{{{\text{sim}}, i}} - \bar{Q}_{{{\text{sim}},i}} } \right) \left( {Q_{{{\text{obs}},i}} - \bar{Q}_{{{\text{obs}},i}} } \right)} \right] ^{2} }}{{ \mathop \sum \nolimits_{i = 1}^{n} \left[ {\left( {Q_{{{\text{sim}}, i}} - \bar{Q}_{{{\text{sim}},i}} } \right)} \right] ^{2} \mathop \sum \nolimits_{i = 1 }^{n} \left[ {(Q_{{{\text{obs}},i}} - \bar{Q}_{{{\text{obs}},i}} ) } \right]^{2} }} $$

The percentage error in total runoff volume (RVE) is estimated by;

$$ {\text{RVE}} = \frac{{ \mathop \sum \nolimits_{i = 1}^{n} \left( {Q_{{{\text{sim}}, i}} - Q_{{{\text{obs}},i}} } \right) }}{{ \mathop \sum \nolimits_{i = 1 }^{n} (Q_{{{\text{obs}},i}} )}} \times 100 $$

The percentage (%) error in total runoff volume (RVE) ranges between − ∞ and ∞. The model performance is very good for RVE between − 5 and 5%, but RVE between − 10 and − 5% and 5–10% suggests satisfactory performance.

The percentage error of peak flow (PEPF) is estimated by:

$$ {\text{PEPF }} = \, 100 \times \left| {\frac{{Q_{\text{obs}} \left( {\text{peak}} \right) {-} Q_{\text{sim}} \left( {\text{peak}} \right) }}{{Q_{\text{obs}} \left( {\text{peak}} \right) }}} \right| $$

where Qobs(peak) is the observed discharge, and Qsim(peak) is the peak simulated discharge. The evaluation range for PEPF is the same as to RVE.

The percentage of bias (PBIAS) measures the average tendency of the simulated data to be larger or smaller than their observed counterparts. The optimal value of PBIAS% is 0, with low-magnitude values indicating an accurate model simulation. Positive values show under estimation bias, and negative values indicate overestimation bias which can be calculated as,

$$ {\text{PBIAS}} = \frac{{\mathop \sum \nolimits_{i = 1 }^{n} \left( {Q_{o,i} - Q_{s,i} } \right)}}{{\mathop \sum \nolimits_{i = 1 }^{n} (Q_{o,i} )}} \times 100 $$

Results and discussion

In this unit the findings of the study were presented for each of the objectives planned in introduction part. The results were concisely discussed with reference to other previous scientific studies and range of model performance indicators.

Comparisons of gauged and satellite rainfall products

The precipitation data used in this study were obtained from in situ measurements and two satellite-based rainfall estimation products, CHIRPS_2 and TMPA_3B42v7, for a period of 16 years (2000–2015). Based on daily mean values, Begie station indicated a wider difference between in situ and CHIRPS_2 precipitation products when compared to other stations which is 1.074 mm/day followed by Abadie station with 0.866 mm/day. CHIRPS_2 satellite precipitation products also underestimate in Kamashe and overestimates in Begie station (Table 1).

Table 1 Summary statistics of daily in situ and satellite rainfalls products

Similarly based on mean values still, the difference is less than from CHIRPS_2 estimates, and Assosa station indicated a wider difference between in situ and TMPA_3B42v7 rainfall products when compared to other stations which is 0.312 mm/day followed by Kiltukara station with 0.079 mm/day. TMPA_3B42v7 satellite precipitation data also underestimate in Kamashe and overestimate in Assosa station. It is noted that the difference between in situ and satellite precipitation estimates showed elevation-dependent trends, TMPA_3B42v7 satellite precipitation products showed higher difference in low elevation areas. This result is consistent with findings of (Hessels 2015). Even if CHIRPS_2 satellite rainfall result was less affected by elevation variation, it showed less difference for mountainous areas which is consistent with the findings of (Ayehu et al. 2017).

Also, coefficient of variation (CV) shows the degree of variation between satellite rainfall estimates and in situ rain gauge data, whereas standard deviation shows the measure of the spread of the rainfall estimates from the mean and less variation was observed between in situ and satellite precipitation products, which indicated the less temporal variability of rainfall estimates.

Generally, TMPA_3B42v7 satellite rainfall products showed better performances in terms of capturing the volume of annual rainfall than the CHIRPS_2 satellite precipitation products, while CHIRPS_2 satellite rainfall products showed better performance in terms of capturing the patterns of annual precipitation than TMPA_3B42v7 satellite rainfall products. The mean annual precipitation of in situ gauge, CHIRPS_2 and TMPA_3B42v7 satellite rainfall products was estimated to be 1238.621 mm, 1339.328 mm and 1325.787 mm, respectively.

As Fig. 12 shows the inter-annual variation of rainfall data performs well for the inputs of in situ, TMPA_3B42v7 and CHIRPS_2 precipitation products for a period of 2000–2015. Figure 12 indicates CHIRPS_2 satellite rainfall product shows slightly overestimation and TMPA_3B42v7 satellite rainfall products show both overestimation and underestimation.

Fig. 12
figure 12

Mean annual rainfall of in situ and satellite rainfall products

Generally, TMPA_3B42v7 satellite rainfall products showed better performances in terms of capturing the volume of annual rainfall than the CHIRPS_2 satellite precipitation products, while CHIRPS_2 satellite rainfall products showed better performance in terms of capturing the patterns of annual precipitation than TMPA_3B42v7 satellite rainfall products.

The mean annual precipitation of in situ gauge, CHIRPS_2 and TMPA_3B42v7 satellite rainfall products was estimated to be 1238.621 mm, 1339.328 mm and 1325.787 mm, respectively.

HEC-HMS modeling result for observed rainfall products

Sensitivity analysis

In order to understand the effect of each model parameter, a sensitivity analysis was applied before calibration and validation of the model using HEC-HMS 4.2 graphical user interface (GUI). So, Muskingum (k), Muskingum (x), curve number (CN), recession constant (R.C), initial abstraction (I. ab), initial discharge (I.D), ratio to peak (R.P), flow rate (F.R), constant fraction (C.F), lag time and number of subreach were included in the sensitivity analysis.

Figure 13 expressed the Nash–Sutcliffe coefficient of efficiency (NSE) for specific change in parameter values of this study. Based on the analysis, lag time (L.T) was ranked as the most sensitive parameter on stream flow of this study as shown by steepest slope of ENS parameter change plot. The parameter I.ab and I.D were the second and third control effects on ENS, respectively. The remaining eight (8) parameters were not showed significant changes on value of ENS.

Fig. 13
figure 13

HEC-HMS model sensitivity analysis evaluated for Dabus station in terms of ENS

Calibration and validation

According to Santhi et al. (2001) model performance can be said to be very good if, ENS (0.75–1), R2 (0.75–1), PBIAS (< 10)%; good if, ENS (0.65–0.75), R2 (0.65–0.75), PBIAS (10–15)  %; satisfactory if, ENS (0.5–0.65), R2 (0.5–0.65), PBIAS (15–25)%; and unsatisfactory if, ENS (< 0.5), R2 (< 0.5), PBIAS (> 25)%.

The HEC-HMS hydrological model was calibrated in Dabus watershed for the period of (2001–2010) and validated for the period of (2011–2015) including 1 year (2000) for warming-up. Table 2 shows the fixed HEC-HMS model parameter value during automated and manual calibration of observed precipitation data.

Table 2 Model Parameters selected for calibration of observed rainfall

Figure 14 shows the observed and simulated stream flow hydrograph for calibration (2001–2010) and for validation (2011–2015) period. The results indicated a good agreement between the data sets with an ENS, of 0.843 and R2 0.954 for the calibration period and ENS of 0.791 and R2 of 0.952 for the validation period. The model captured well the daily time series of stream flow as well as the trend during calibration and validation periods. The results showed good performance in terms of capturing the observed stream flow volume with RVE of 5.557% during calibration and 6.634% during validation period. As the HEC-HMS model performance statistics, Table 3, shows for both calibration and validation periods (2001–2015) and on the standards set out by (Moriasi 2007), for evaluating hydrological model performance the HEC-HMS model was rated as ‘good’ for Dabus watershed of this study.

Fig. 14
figure 14

Observed and simulated daily discharge hydrograph of Dabus watershed using in situ rainfall products

Table 3 Daily time step calibration and validation result of CHIRPS_2

Furthermore, a comparison of the measure of the statistics for calibration and validation periods tells a good performance of the hydrological model during the calibration period as compared to the validation period. This thought agrees with findings of (Moriasi 2007), in which model performance during the calibration period performs best compared to the validation period. However, the model performance is still acceptable for the validation period, indicating that the HEC-HMS model can be applied to study rainfall-runoff relation in the catchments outside of the calibration period. In Fig. 14 calibration and validation results of in situ rainfall products are presented.

Although observed and simulated discharge matched well for calibration and validation periods, there was an average magnitude overestimation of the observed stream flow with PBIAS of − 14.648% for the calibration and − 19.638% for validation period. The agreement of peak magnitude between observed and simulated stream flow was very good with PEPF of 2.364% for calibration and − 6.346% for validation period as indicated in Figure 14.

HEC-HMS modeling with satellite precipitation products

The two selected satellite precipitation products (CHIRPS_2 and TMPA_3B42v7) were used as an input for model sensitivity analysis, calibration and validation independently for the same time window.

Sensitivity analysis

The results of the sensitivity analysis for CHIRPS_2 and TMPA_3B42v7 satellite rainfall products are shown in Fig. 15. Based on the analysis, Muskingum (x) was ranked as the most sensitive parameter on stream flow as shown by steepest slope of PBIAS parameter change plot although the parameter lag time (L.T.) also affects PBIAS (%) next to Muskingum x.

Fig. 15
figure 15

HEC-HMS model sensitivity analysis evaluated in terms of PBIAS (%)

Calibration and validation of satellite rainfall products

CHIRPS_2

A plot of daily observed to daily simulated stream flow indicated a satisfactory agreement with an ENS of 0.685 and R2 of 0.777 for the calibration period and with an ENS of 0.513 and R2 of 0.704 for CHIRPS_2 satellite rainfall products. The model captured well the daily time series of stream flow plus the trend during both calibration and validation periods.

Even if observed and simulated outputs matched acceptably for calibration and validation periods, there was an average magnitude overestimation of the observed stream flow with PBIAS (%) of − 18.425% for the calibration and − 21.467% for validation period. The agreement of magnitude between observed and simulated peak flow was good with PEPF of 4.042% for calibration and PEPF of − 6.075% for validation period, and the results showed satisfactory performance in terms of capturing the observed discharge volume with RVE of 7.893% during calibration and 8.562% during validation periods. Table 3 shows the calibration (2001–2010) and validation (2011–2015) results of CHIRPS_2 satellite precipitation estimates (Figs. 16, 17).

Fig. 16
figure 16

Observed rainfall and simulated hydrograph of calibration at Assosa station using CHIRPS_2 satellite rainfall products

Fig. 17
figure 17

Observed and simulated daily discharge hydrograph of CHIRPS_2 products

As shown in the above graphs the result of CHIRPS_2 satellite precipitation estimates for both calibration and validation period was better in capturing the patterns and peak of the rainfall. It is also clearly indicated that for high precipitation products its flow is also high. In overall, CHIRPS_2 satellite precipitation products show large error during low precipitation (dry) seasons and give best result for flat areas. This result agrees with the findings of Duan et al. (2016) and Bayissa et al. (2017), in which CHIRPS_2 satellite precipitation products show an overestimation of lower monthly precipitation products and can be used for flood monitoring and early warning system.

TMPA_3B42v7

Even if the performance of TMPA_3B42v7 precipitation result was better than CHIRPS_2, a plot of daily observed stream flow to daily simulated stream flow indicated a satisfactory agreement with an ENS of 0.755 and R2 of 0.786 for the calibration period and an ENS of 0.714 and R2 of 0.705 for validation period.

As the Table 4 and Fig. 18 show the model captured well the daily time series of stream flow as well as the trend during both in the calibration and validation periods. Even though observed and simulated outputs matched tolerably for calibration and validation periods, there was an average magnitude overestimation of the observed stream flow with PBIAS of − 11.325% for the calibration and − 17.357% for validation period. The agreement of magnitude between observed and simulated flow was satisfactory with PEPF of 7.335% for calibration and PEPF of − 5.075% for validation period, and the results showed a good performance in terms of capturing the observed stream flow volume with RVE of 9.893% during calibration and 10% during validation periods. Generally, TMPA_3B42v7 satellite rainfall product was showed good performance during wet seasons and at highlands. This thought agrees with the findings of Bigiarini et al. (2017) and Pineda et al. (2016), in which TMPA_3B42v7 satellite rainfall products show a better correlation in higher elevation areas than lower elevation areas.

Table 4 Daily time step calibration and validation results of TMPA_3B42v7 products
Fig. 18
figure 18

Observed and simulated daily discharge hydrograph of TMPA_3B42v7 rainfall products

Comparisons of HEC-HMS model performance for satellite rainfall products

As clearly indicated in the above the sensitivity analysis of the selected stream flow parameters was done independently, and its comparison results showed that eleven parameters were recognized as significantly sensitive in common for both CHIRPS_2 and TMPA_3B42v7 satellite rainfall products. But the sensitive percentage of objective function value was different in each case, which indicated that the sensitivity of flow parameters was dependent on the type of satellite rainfall products used. In general, as described in the Table 4, Fig. 18 and other performance evaluation criteria, the HEC-HMS model gives better outcome when it was calibrated and validated by TMPA_3B42v7 satellite precipitation products than CHIRPS_2 satellite rainfall products for Dabus watershed area. This result is consistent with the finding of Hessels (2015), which was conducted in Blue Nile basin, Ethiopia. The results also could be due to the topography of the study area which is characterized by rugged landscape with higher elevation areas, as TMPA_3B42v7 satellite rainfall products give slightly better results than CHIRPS_2 in high elevation areas (Table 5).

Table 5 Summary of model performance results of CHIRPS_2 and TMPA_3B42v7

Comparisons of in situ and satellite precipitation products

In this section, the comparison was made to evaluate the HEC-HMS model performance with precipitation inputs from in situ precipitation estimates and TMPA_3B42v7 satellite rainfall products which was showed relative better performance than CHIRPS_2 satellite rainfall estimates in the above section. Although the percentage of sensitivity value was different, all flow parameters were common for both in situ precipitation products and TMPA_3B42v7 satellite rainfall estimates. As showed in the above discussion, it is clear that in both model calibration and validation periods HEC-HMS model showed better performance than satellite precipitation estimates, based on the model efficiency evaluation criteria’s and the capacity of simulating the stream flow. This reduction of model performance statistics might be due to the satellite rainfall estimates contaminated with random and systematic errors (Ambaw 2016; Habib et al. 2014).

Effects of bias correction of satellite rainfall products

In order to clearly understand the effect of bias correction on satellite rainfall products both the bias-corrected and uncorrected satellite precipitation estimates were used independently as model input and provided the following outputs.

In order to clearly understand the effect of bias correction on satellite rainfall products both the bias-corrected and uncorrected satellite precipitation estimates were used independently as model input, and Fig. 19 indicates that the performance of bias-corrected satellite rainfall products is better than uncorrected satellite rainfall products. But the performance of in situ rainfall products is still better than the performance of satellite rainfall products.

Fig. 19
figure 19

Statistical comparison of simulation of daily stream flow of various rainfall inputs

Table 6 and Fig. 20 clearly indicate that the model performance statistics value has been increased when the model used bias-corrected satellite rainfall products. Thus, it can be concluded that bias-corrected satellite rainfall estimates showed better performance than uncorrected satellite rainfall estimates for stream flow simulation in the study area. However, the performance of bias-corrected satellite precipitation products was less than the performance of in situ rainfall products. These thoughts were similar to the findings of (Ambaw 2016; Habib et al. 2014; Liu et al. 2014; Yong et al. 2012), in which the performance of model increases when it calibrates and validates with in situ rainfall products.

Table 6 Daily time step calibration and validation periods model performance statistics using bias-corrected satellite precipitation estimates
Fig. 20
figure 20

Daily calibration results of in situ, Bias-corrected CHIRPS_2 and uncorrected CHIRPS_2 rainfall products

Conclusions

In this study, the HEC-HMS model with integration of GIS and HEC-GeoHMS was used and evaluated for stream flow simulation in Dabus watershed, Abbay basin Ethiopia. The hydrological and meteorological products were obtained from in situ gauging stations and satellite estimates for a period from 2000 to 2015.

Unlike other previous studies; this study was more comprehensive in using combined rain gauge and satellite rainfall estimates for the better accuracy of the result. Simulation of the study was done by using input data from satellite precipitation estimates (CHIRPS_2 and TMPA_3B42v7) and gauging station precipitation data independently. In addition to this, bias-corrected satellite rainfall estimates are also considered in the simulation to investigate the effect of bias correction.

According to the results from the sensitivity analysis using in situ and satellite rainfall data, nine major parameters were selected based on their percentage of sensitivity, including the parameter that was imported from the model, like curve number and lag time. From those used sensitive parameters lag time and Muskingum k, x, were the most sensitive parameters, whereas number of subreach was the least sensitive parameters for the study area.

Daily timescale calibration and validation results over the study area show good performance with ENS, R2, PBIAS (%), PEPF (%) and RVE (%), 0.843, 0.954, − 14.648%, 3.634%, 5.557%, respectively, for calibration and 0.791. 0.952, − 19.638%, − 6.346% and 6.634%, respectively, for validation. Hence, the calibrated and validated model parameter can be used for simulation of stream flow in the study area.

Calibration and validation results over the study area were satisfactory when CHIRPS_2 satellite precipitation estimates used with ENS, R2, PBIAS, PEPF and RVE, of 0.695, 0.777, − 18.425%, 4.042%, 7.893%, respectively, for calibration and 0.513, 0.704, − 21.467%, − 6.075% and 8.562%, respectively, for validation periods.

The results obtained during calibration and validation of TMPA_3B42v7 satellite rainfall estimates show better than results of CHIRPS_2 estimates. Calibration and validation of TMPA_3B42v7 is also satisfactory with ENS, R2, PBIAS, PEPF and RVE, of 0.755, 0.786, − 11.325%, 7.335%, 9.893%, respectively, for calibration and 0.714, 0.705, − 17.357%, − 5.075% and 10, respectively, for validation period.

Additionally, low HEC-HMS model performance was observed in the study area when it was calibrated and validated with CHIRPS_2 satellite precipitation products than TMPA-3B42v7 satellite precipitation products. This might be because the topography of the area is characterized with rugged landscapes having mostly higher elevation areas.

Subsequently, TMPA-3B42v7 satellite rainfall estimates have good capabilities in capturing precipitation values in high elevation areas than CHIRPS_2 satellite precipitation products which is slightly affected by elevation difference. Comparatively, low performance of HEC-HMS model was resulted for the study area when satellite precipitation estimates are used for calibration and validation instead of using in situ rainfall products.

CHIRPS_2 and TMPA_3B42v7 satellite rainfall data showed overestimation and underestimation of the simulated stream flow due to random and systematic errors though its simulation results were captured the shape of observed stream flow hydrograph. When the model calibrated and validated with bias-corrected satellite precipitation estimates its performance had significantly increased. However, the model performance of stream flow simulation using these bias-corrected satellite precipitation estimates were still lower than simulation result of in situ precipitation products.