Introduction

Suspended sediment load is a major challenge for water quality and quantity, hydroelectric equipment, fish habitat, reservoir sedimentation, channel morphology and consequence impacts on river flooding. Many studies conducted on erosion-sedimentation process modeling, have special respect to the suspended sediment concentration (SSC) because of its main role in representation of watershed sedimentation. Suspended sediments can account for 70–99 % of the quantity of sediment transported by streams (Knighton 1998). Knowing the spatial and temporal variation of annual suspended sediment concentration (SSC) is a crucial variable about which, dam managers have always concerned. Annual SSC variation is a key practice for planning and management of water resources and dam regulations such as the time of water storage into and water release from dam. Suspended sediment concentration is therefore an important abiotic variable used to quantify water quality and habitat availability for some species of fish and invertebrate (Tramblay et al. 2008). From the above discussion, SSC is an important abiotic variable to quantify water quality. So there is an essential need to estimate SSC magnitude occur within the rivers annually especially for the regions where reservoir dams are in use at the outlet. Many researchs have been focused on the development of empirical relationships between specific sediment discharge and catchment physic-climatic variables (Demmak 1984; Probst and Amiotte Suchet 1992; Meddi 1992; Terfous et al. 2001; Benkhaled and Remini 2003; Achite and Ouillon 2007; Yan et al. 2013; Montanher et al. 2014; Wang and Tian 2013; Aulenbach 2013). In recent years researchers tried to do frequency analysis on SSC time series in order to quantify and predict annual extreme events of SSC and to determine SSC quantiles. SSC frequency analysis is an essential task for risk analysis related to any hydraulic structures construction projects during its design life such as reservoir dams. In frequency analysis of SSC, we are looking for what is the probability of occurrence of a specific SSC or greater values? FA is a method that allows relating the frequency of occurrence of an extreme event to its magnitude through the use of probability distributions (e.g. Rao and Hamed 2001; Chebana and Ouarda 2011). This technique is increasingly used by hydrologists to predict floods (Jaiswal et al. 2002; Kumar et al. 2003; Lim and lye 2003; Neslihan 2009; Sarhadi et al. 2012; Farsadnia et al. 2014), rainfall (Eslamian and Feizi 2007; Yurekli et al. 2009; Modarres and Sarhadi 2011; Hailegeorgis et al. 2013), low flows (Pearson 1995; Kroll and Vogel 2002; Chen et al. 2006; Modarres 2008a, b; Dodangeh et al. 2014), wind speeds (Modarres 2008a, b) and recently for dust storms (Dodangeh et al. 2012). However few studies attempted to apply this technique to predict sediment quantiles. Tramblay et al. (2008) employed FA technique to predict annual extreme events of SSC. They tried to fit probability distributions to stations data recorded to estimate the return period for a specific concentration or the concentration for a given return period. They showed that 2-parameter distributions like exponential, lognormal, Weibull and Gamma performed well to fit SSC data time series. Benkhaled et al. (2014), performed frequency analysis on SSC data in upstream of Foum El Gherza dam in southeast Algeria. They showed that 2-parameter log normal distribution (LN2) fitted well the considered data. While this technique is increasingly being used for hydro-climatic variables such as rainfall and extreme stream flow time series (Floods and low flows), on one hand, this method has not been a popular method to model magnitude frequency analysis of sediment transported by the rivers. On the other hand, although Sefidroud basin occupied by the severely erodible formations and characterized by the short-time intensive rainfalls carrying a large amounts of sediments into the dam lake, as one sixth of the reservoir volume is considered reservoir dead storage and a remarkable amount of sediments continue to cause a decreasing volume rate of 4 % annually, but there have been no studies on the sediment concentration in this region. (Othman et al. 2013).

Sefidroud dam is a buttress- type concrete gravity structure one, and its height is 106 m (White 2001). The crest length is 425 m, and the reservoir capacity was 1800 MCM at the beginning of operation. Approximately 600 MCM of this capacity is allocated for dead storage. The drain facilities can drain around 6000 m3/s and the annual adjustable water volume is 1650 MCM. The under cultivation area is 188,000 ha, and the installed power is 87,500 m3/s. The maximum and minimum discharge during a statistic period are 2300 and 0.5 m3/s respectively. The average discharge is 119.20 m3/s and the normal storage is 1276 MCM, finally the crest storage is 1760 MCM (Othman et al. 2013).

The present work will contribute to fill such a knowledge gap in this region applying FA techniques to quantify probability occurrence of annual maximum SSC and vice versa i.e. to quantify potential extreme concentrations can occur with different return periods or probabilities at the location of gauging stations. The results are required for quantifying fish habitat availability in rivers. It also helps managers to determine priorities of sub-watersheds for rehabilitation by bio-mechanical activities. The contents of the paper are structured as fallows follows: “Materials and methods” deals with geographical location of the study area and methods are also discussed in this section. The results are described in “Results and discussions” and a brief conclusion is also presented in the last “Conclusions”.

Materials and methods

Study area and data set

Present study was conducted in Sefidroud basin which located in the arid and semi-arid regions of northwest Iran. Sefidroud basin that drains an area of approximately 56,700 square kilometer is known as one of the most important basins which provide water for 9 provinces of this country (Fig. 1). The average annual rainfall in this region varies between 400 mm in the west and 500 mm in the east. Ghezel Ozan River known as one of the major rivers of this area with a length of around 800 km originates from the upward Kurdistan Mountains and impounds into the Sefidroud reservoir which is the primary water source for many provinces especially for Guilan province. Guilan province located at downstream of the Sefidroud dam, is the main rice-production center in Iran. It is worth nothing that a total of 2400 km2 cultivated rice farms heavily depend upon water supplies from the Sefidroud reservoir (Othman et al. 2013). Under the effects of climate change, the hydrological response of the region has noticeably been changed such as a remarkable shift from snow to rainfall and the subsequent increases in quick-flow runoff in winter and early spring. This condition, accompanied by the human interference through the land use change, deforestation and disposal of rangelands in the vast space, resulted in accelerated erosion and soil leaching throughout the region. Under this condition, the capacity of the lake of Sefidroud dam has strongly been reduced due to the reservoir siltation. Each year a total of 32 million ton sediment deposited within the lake of the dam and now the volume of the reservoir reached to the half of it at the time of construction (The volume of the lake of 1.8 Milliard cubic meters at the time of construction reached to the 900 million cubic meters now) (Fig. 2).

Fig. 1
figure 1

Location map of the sediment gauging stations along the Ghezel Ozan River

Fig. 2
figure 2

Pictures of sediroud Dam, have been filling with suspended sediment

In this study, annual suspended sediment concentration (SSC) data of eight gauging stations located along the Ghezel Ozan River and its tributaries, were compiled from the Iran ministry of power. The location of these stations has been illustrated in Fig. 1. The data series of 12 stations, cover the time spans of different record lengths so that the shortest length of data belongs to Sohrein station (with the record length of 10 years) and the longest record belongs to Yengi Kand and Ghareguni stations (with the record length of 31 years). It is worth nothing that scare and short length data record of suspended sediment data is a common problem in the country, especially in arid and semi arid regions. The max annual suspended sediment concentrations and their related discharges of each selected station are presented in Table 1.

Table 1 The maximum annual suspended sediment concentrations and their related discharges of each selected station

Method

Frequency analysis is a statistical approach commonly used in hydrology to relate the magnitude of extreme events to a probability of occurrence (Rao and Hamed 2001).

The main objective of frequency analysis is to infer the probability of exceedence of all possible events, in this case suspended sediment concentrations, from observed values (a sample of the parent population). The process of frequency analysis includes two main steps: (1) preliminary data processing in spite of checking randomness, stationary and homogeneity which are basic assumptions for frequency analysis, must data time series meet to them and (2) carrying out the frequency analysis by fitting various probability distributions, selecting the best fitted one, parameter estimation of the best fit distribution and risk assessment based on quantiles or return periods. Using the fitted probability distribution, it is possible to predict the probability of exceedence for a specified magnitude (i.e. quantile) or the magnitude associated with a specific exceedence probability (Tramblay et al. 2008).

Preliminary tests of stationary, randomness and homogeneity

  1. 1.

    The Mann–Kendall test is used for determining the existence of statistically significant trends in climate and hydrologic time series (Burn 1994; Pasquini et al. 2006). This is a test for correlation between sequences of pairs of values. Using rank eliminates the sensitivity of the correlation test to the function linking the pairs of values. In particular, the standard correlation test is used to find linear relations between test pairs, but the rank correlation test is not restricted in this way.

  2. 2.

    Autocorrelation functions are commonly used for checking randomness in a data set. This randomness is ascertained by computing autocorrelations for data values at varying time lags. If random, such autocorrelations should be near zero for any and all time-lag separations. If non-random, then one or more autocorrelation values will be significantly non-zero. These functions were used to test the randomness of annual suspended sediment concentration time-series.

The randomness of the data set was tested by the autocorrelation function for the confidence bands (CB) given by:

$$CB = \frac{{Z_{i} = \alpha /2}}{\sqrt n }$$
(1)

where z is the percent point function of the normal distribution, n is the sample size and a is the significance level. Thus, the CB have fixed width that depends on the sample size.

  1. 3.

    Thom’s test (or run test) is usually used to assess homogeneity of time series by computing variations of the time series from a central value, usually median. Thom’s test is described in detail by Rodrigo et al. (1999).

Frequency analysis

Frequency analysis is done by fitting appropriate probability distributions to the stationary, homogeneous and independent data time series to predict the probability of exceedence for a specified magnitude (quantile) or the magnitude associated with a specific exceedence probability (Benkhaled et al. 2014). After selecting a good distribution, parameters need to be estimated. Several methods, including the method of moments and maximum likelihood (ML) are extensively being used for this purpose. In recent years, probability weighted moments (PWMs) has been developed for parameter estimation of the distributions fitted to the hydrological time series. Probability weighted moments was defined by Greenwood et al. (1979) as follows:

$$M_{P,r, s} = E\left[ {X^{p} F^{r} \left( {1 - F} \right)^{s} } \right] = \mathop \int \limits_{0}^{1} \left[ {X\left( F \right)} \right]^{p} F^{r} (1 - F)^{s} dF$$
(2)

The following two moments are usually considered:

$$M_{1,0, s} = \alpha_{s} = \mathop \int \limits_{0}^{1} x\left( F \right)(1 - F)^{s} dF$$
(3)
$$M_{1,r, 0} = \beta_{r} = \mathop \int \limits_{0}^{1} x\left( F \right)F^{r} dF$$
(4)

\(\alpha_{s}\) and \(\beta_{r}\) are related as follow:

$$\alpha_{s} = \mathop \sum \limits_{k = 0}^{s} \left( {\begin{array}{*{20}c} s \\ k \\ \end{array} } \right)( - 1)^{k} \beta_{k}$$
(5)
$$\beta_{r} = \mathop \sum \limits_{k = 0}^{r} \left( {\begin{array}{*{20}c} r \\ k \\ \end{array} } \right)( - 1)^{k} \alpha_{k}$$
(6)

To select an appropriate distribution which best fit the data time series a number of statistical tools can be considered including graphics, goodness of-fit-tests and selection criteria. One of the graphical tools discriminate the best fitted distribution among the others is plotting-position.

Plotting position estimators of sample PWMs are driven by the following equations:

$$a_{s} = \alpha_{s} = \frac{1}{n}\mathop \sum \limits_{i = 1}^{n} \left( {1 - p_{i:n} } \right)^{s} x_{i}$$
(7)
$$b_{r} = \beta_{r} = \frac{1}{n}\mathop \sum \limits_{i = 1}^{n} \left( {p_{i:n} } \right)^{r} x_{i}$$
(8)

where Pi:n = plotting position. Plotting position is a tool for visual inspection and can be used to compare sample frequency distribution with population frequency distribution.

Many literatures have demonstrated that graphical tools are not sufficient for selecting distributions (Ouarda et al. 1994; El Adlouni et al. 2008; Hebal and Remini 2011). Therefore non-graphical tools are also considered in this study. The Klomogorov–Smirnov and Chi square tests were used to verify the hypothesis concerning the parent distribution of the sample.

Results and discussions

Test for randomness

Autocorrelation functions were used to test randomness of the annual suspended sediment concentrations time series. Figure 3 shows ACFs for the annual suspended sediment concentrations time series of the selected stations across the study region. As the figure shows, all autocorrelation coefficients fall within the 95 % confidence limits. Therefore we can conclude that all of the investigated time series are strongly random. It may be attributable to the unregulated characteristics of the streams within the reach these measuring stations are located.

Fig. 3
figure 3

Autocorrelation plots for the annual suspended sediment concentrations of the Ghareguni (a) and YengiKand (b) stations

The run test’s statistic for suspended sediment concentrations are also given in Table 2. It is observed that all time series show homogeneity with having P value >0.05 which confirm the results of ACFs in favor of unregulated flows at the gauging stations.

Table 2 Results of homogeneity and stationary tests of the selected stations

Identification of best fit distributions

At this section, the priority of the candidate distributions which best fit to the SSC data set was examined by applying the plotting position formula as well as by Klomogorov–Sminov and Chi-square statistic criterion. The candidate regional distributions are generalized logistic, extreme value, normal, Pareto and Pearson-III distributions. The efficiency of each probability distribution to fit the SSC data was evaluated based on the Klomogorov–Smirnov and Chi square tests and results are summarized in Table 3. Results show that 2-parameter distributions identified as the best fitted distributions to the SSC time series. The only 3-parameter distribution in this study was GPAR distribution which fitted to the SSC data time series recorded at Alibeig station. The four distributions discriminated as the best fit distributions among the others are Logistic, EV1, GPAR and LN2 distributions, most of them are 2-parameter. Having well performance of GPAR distribution at the Alibeig station may be attributed to the local generation mechanisms resulted in complexity of SSC time series at this site. Tramblay et al. (2008) implied that 2-parametr distributions such as Log normal, Exponential, Weibul and Gamma distributions fit the annual maximum SSC better than 3-parameter distributions including the Generalized Pareto distribution so that time series of SSC of just 5 stations of total of 208 stations were fitted to GPAR distribution. Benkhaled et al. (2014) also suggested 2-parameter distributions to fit SSC rather than 3-parameter distributions. They showed that 2-parameter log normal distribution has the best fit among the others.

Table 3 Goodness-of-fit test using Klomogorove–Smirnov and Chi square tests and best fitted distribution

Frequency analysis and quantile estimation of SSC

Quantile estimation of the investigated variable (SSC in this study) is one of the main tasks of the frequency analysis. The best fitted distributions presented in the previous section were then used to calculate magnitude of the SSC quantiles at different 2, 5, 10, 20, 30, 50, 100 and 200 years return periods (Fig. 4). Table 4 shows the result of quantile estimation for the selected stations and at different return periods. It is clear from the table that maximum return period of extreme SSC data observed in the region is 10 years return period. There is not observation data for return period greater than 10 years. It means that maximum amount of SSC in all gauging stations over the region, correspond to 10 years return period and occur in sequence of each 10 years. Increasing return period (from 2 to 200 years), the quantile values increases with lower occurrence probability in each year. Lowest values of SSC are seen at Chapugh station. The SSC value for this station varies between 0.4 and 50 gr/L for 2- and 10-year return period respectively. Similar values are also presented for Chasb station in Table 4. The lowest value of SSC for this station is equal to 24.64 gr/L with occurrence of 2-year return period and maximum value is 527 gr/L with occurrence of 10 years return period. Sarcham and SalamatAbad stations show similar values of approximately 2000 and 3000 gr/L for 2 years return period and 149,000 and 121,000 gr/L for 10 years retun periods respectively. TakAghaj and Alibeig have the same condition as well. But two stations with greatest values of SSC occurred in 2 and 10 years return periods are YengiKand and Ghareguni stations showing 32,308 and 111,711 gr/L for each 2 years return period and 158,412 and 833,095 gr/L for 10 years return periods respectively.

Fig. 4
figure 4

At site plotting position and quantile estimations for the Annual suspended sediment concentrations of the selected stations

Table 4 At-site quantile estimations of stations in the study region

Conclusions

Estimation of suspended sediment concentrations (SSC) is an essential task for watershed science researchers and reservoir dam operators for optimum land use planning and management and determining the life time of the hydraulic structures. This study was conducted to provide probability occurrence of maximum SSC quantiles for different return periods in Sefidroud basin, Northwest Iran. The results showed that in comparison to the 3-parameter distributions, 2-parameter distributions have better performances to fit annual maximum SSC data time series except for Alibeig station. For the remaining stations, 2-parameter distributions preferred to 3-parameter distributions. Among the 2-parameter distributions, Logistic, EV1 and LN2 distributions showed better performances. Logistic distribution showed the best fit to the SSC data time series of Ghareguni, Sarcham and Chapugh gauging stations. EV1 was selected as the best fitting distribution at the YengiKand and Chasb stations. For Takaghaj and SalamatAbad stations, 2-parameter LN2 distribution performed better than others. Previous studies (Tramblay et al. 2008; Benkhaled et al. 2014) also indicated the better performances of 2-parameter distributions to fit annual SSC data time series.

After the selection of the best fit distributions by Goodness-of-fit tests, frequency analysis of SSC and quantile estimation was done by the selected distributions. Results showed that Ghareguni and YengiKand stations with the Logistic and EV1 distributions showed maximum suspended concentrations in all return periods. Other stations including Chasb, Takaghaj and Alibeig stations showed the lower SSC quantities. The maximum rates of SSC in all over the region were observed at 10 years return period with the probability occurrence of 10 % in each year. The results of this study emphasized on the extending biological and structural soil conservation project in the region especially for upstream watersheds of Ghareguni and YengiKans stations. Nevertheless the results of this study showed better performances for 2-parameter distributions rather than 3-parameter distributions, however it is recommended to conduct SSC frequency analysis for other regions with different climate and land use schemes.