1 Introduction

Runoff estimation in ungauged watersheds is one of the most common problems in applied hydrology. Thus, simple methods for predicting runoff in watersheds are particularly important in hydrologic applications, such as flood design or water balance calculation models (Ponce and Hawkins 1996; Steenhuis et al. 1995). The Soil Conservation Service Curve Number (SCS-CN) method was originally developed by the U.S. Department of Agriculture, Soil Conservation Service to predict direct runoff volumes for given rainfall events and mainly for the evaluation of storm runoff in small agricultural watersheds. The methodology has been documented in detail in the National Engineering Handbook, Section 4: Hydrology (NEH-4) (SCS 1956, 1964, 1971, 1985, 1993, 2004). Due to its simplicity, it became one of the most popular techniques among the engineers and the practitioners, mainly for small catchment hydrology (Mishra and Singh 2006). Detailed descriptions of the method are provided by Hawkins (1993), Steenhuis et al. (1995), Ponce and Hawkins (1996), Grove et al. (1998), McCuen (2002), Michel et al. (2005), Soulis et al. (2009) and Soulis and Valiantzas (2012).

The SCS-CN method has been adopted for various regions and for various land uses and climatic conditions (Mishra and Singh 1999; Romero et al. 2007; King and Balogh 2008; Elhakeem and Papanicolaou 2009). Furthermore, beyond its original scope for the evaluation of storm runoff it has become an integral part of more complex, long-term watershed models such as Soulis and Dercas (2007), Geetha et al. (2008), Singh et al. (2008), Tyagi et al. (2008), Adornado and Yoshida (2010).

The SCS-CN method is based on the following equation calculating runoff from rainfall depth,

$$ \matrix{ {Q = \frac{{{{\left( {P - {I_a}} \right)}^2}}}{{P - {I_a} + S}}}{for}{P > {I_a}} \\ {Q = 0}{for}{P \leqslant {I_a}} \\ }<!end array> $$
(1)

where P is the total rainfall, I a is the initial abstraction, Q is the direct runoff and S is the potential maximum retention. Based on a second assumption, that the amount of initial abstraction is a fraction of the potential maximum retention

$$ {I_a} = \lambda S $$
(2)

Equation 1 becomes

$$ Q = \frac{{{{\left( {P - \lambda S} \right)}^2}}}{{P + \left( {1 - \lambda } \right)S}} $$
(3)

The potential maximum retention S is expressed in terms of the dimensionless curve number (CN) through the relationship

$$ S = \frac{{25400}}{{CN}} - 254 $$
(4)

with S, in mm, taking values from 0, when S → ∞, to 100, when S = 0. The determination of all the NEH-4 SCS-CN values commonly used in hydrologic practice, assume the initial abstraction rate to be set to the constant value, λ = 0.2, in order that S (or its transformation CN) remains the only unknown parameter of the method. However, the validity of this assumption has been investigated in many studies (Woodward et al. 2003; Mishra et al. 2004, 2006; Jain et al. 2006; Elhakeem and Papanicolaou 2009). Woodward et al. (2003) analysing event rainfall-runoff data from several plots recommended using λ = 0.05, while Elhakeem and Papanicolaou (2009) suggested the use of a nonlinear relationship between initial abstraction I a and potential maximum retention S.

CN values can be obtained from tables for various soil types, land cover and land management conditions, however CN estimation based on real data from local or nearby similar watersheds is preferable. In order to estimate S from real data, Eq. 3 can be solved by the quadratic formula (Chen 1982)

$$ S = \frac{P}{\lambda } + \frac{{\left( {1 - \lambda } \right)Q - \sqrt {{{{\left( {1 - \lambda } \right)}^2}{Q^2} + 4\lambda PQ}} }}{{2{\lambda^2}}} $$
(5)

Combining Eqs. 4 and 5, CN value can be directly estimated from rainfall and runoff data

$$ CN = \frac{{25400}}{{\frac{P}{\lambda } + \frac{{\left( {1 - \lambda } \right)Q - \sqrt {{{{\left( {1 - \lambda } \right)}^2}{Q^2} + 4\lambda PQ}} }}{{2{\lambda^2}}} + 254}} $$
(6)

Though, even when the CN value is determined from measured P-Q data, the calculated CN values via Eq. 6 actually vary significantly from storm to storm on any watershed. For this reason, various methods to estimate a single CN value characterizing the watershed from measured runoff data were proposed (Hjelmfelt 1980; Hawkins et al. 1985; Hawkins 1993; Bonta 1997). However, the proposed methods lead to different CN values and in spite of the great number of studies on the SCS-CN method and its widespread use, there is not an agreed procedure to estimate CN from measured runoff data.

Hawkins (1993) in his study on the asymptotic determination of runoff curve numbers from measured runoff analysing a significantly large number of watersheds, where CNs are calculated from real P-Q storm data, he concluded that a secondary systematic correlation almost always emerges in watersheds between the calculated CN value and the storm rainfall depth itself. In most of the watersheds, these calculated CNs approach a constant value with increasing rainfall depth that is assumed to characterize the watershed. Depending on the type of correlation between rainfall and CN values calculated from measured runoff distinguished watersheds in three categories namely watersheds with “standard” behaviour, “violent” behaviour, and “complacent” behaviour. In the “standard” behaviour, which is the most common, the larger calculated CN values correspond to the smaller rainfall depths, and decline progressively with increasing storm size, approaching a stable near constant asymptotic CN value with increasingly larger storms. A typical example of a natural watershed presenting this behaviour is given in Fig. 1. In this case, Hawkins (1993) suggests the estimation of a single asymptotic CN value, observed for very large storm sizes, to be used to characterize such watersheds. In less common cases of watersheds the calculated CN declines steadily with increasing rainfall with no obvious tendency to approach a constant value (“complacent” behaviour, Fig. 1). According to Hawkins (1993), an asymptotic curve number cannot be safely determined from data for this behaviour. In the last behaviour concerning also a small number of watersheds the calculated CNs have an apparently constant value for all rainfall depths except very low rainfall depths where CN increases suddenly (“violent” behaviour). Hjelmfelt et al. (2001) presented additional examples of watersheds presenting similar behaviors.

Fig. 1
figure 1

Two-CN model curves proposed by Soulis and Valiantzas (2012) fitted to the data presented by Hawkins (1993) for the “standard” (Coweeta watershed #2, North Carolina) and the “complacent” (West Donaldson Creek, Oregon) behaviour watersheds

The previously developed methodologies for estimating CNs from measured data aim at the determination of a single asymptotic CN value characterizing the watershed hydrologic response for high rainfall depths. The observed deviations from the asymptotic behaviour for lower rainfall depths are not essentially taken into consideration and are rather attributed to various sources of temporal variability e.g. high rainfall intensities or wetness of the watershed that eventually posed in doubt the validity of the CN method. Consequently, the resulting CN values fail to describe the watershed response in small and medium rainfall events, limiting the applicability of the method to the estimation of peak runoff values. It must also be noted that the above methods fail to determine a final CN value in “complacent” watersheds or when the available runoff dataset does not contain events large enough to determine the final CN value.

To overcome the above issues, Soulis and Valiantzas (2012) investigated the novel hypothesis that the observed correlation between the calculated CN value and the rainfall depth in a watershed reflects the effect of the inevitable presence of soil-cover complex spatial variability along watersheds. They showed that the presence of spatial variability in the watersheds produces a progressive decrease in the calculated CNs as the storm size decreases and for excessively large storm sizes the CN tends to stabilize in an asymptotic “composite” CN value. Based on this hypothesis, they introduced the simplified concept of a two-CN heterogeneous system to model the observed CN-rainfall variation by reducing the CN spatial variability into two classes. The behaviour of the CN-rainfall function produced by the proposed two-CN system concept was found to be similar to the variation observed in natural watersheds. In Fig. 1, the two-CN model curves proposed by Soulis and Valiantzas (2012) are fitted to the data presented by Hawkins (1993) for the “standard” (Coweeta watershed #2, North Carolina) and the “complacent” (West Donaldson Creek, Oregon) behaviour watersheds.

In this study, the above concept is extended in order to facilitate the determination of SCS-CN parameter values from rainfall-runoff data in heterogeneous watersheds. This method exploits the observed correlation between the calculated CN values and the rainfall depths in order to identify the spatial distribution of CN values along the watershed taking into account the specific characteristics of the watershed. The proposed method utilizes the available rainfall-runoff data, remote sensing data and GIS techniques to provide information on spatial watershed characteristics that describe hydrological behaviour, which is valuable in environmental hydrologic analysis and modeling. Furthermore, it could facilitate the estimation of CN values corresponding to specific soil-land cover complexes by utilizing data coming from not entirely homogeneous watersheds. The proposed method is demonstrated in the small scale experimental watershed of Lykorrema stream in Greece.

2 Materials and Methods

2.1 Spatial Identification Methodology

Previous studies on the estimation of CN values from measured data ignore the effect of spatial heterogeneity on hydrologic response and focus on the determination of a single CN value characterizing the entire watershed, although spatial heterogeneity is a very common characteristic of natural watersheds. Soulis and Valiantzas (2012) investigated the effect of spatial heterogeneity on the estimation of CN values from measured data and they showed that the effort to determine a single CN value in watersheds characterized by significant spatial heterogeneity may produce a correlation between the calculated CN value and the rainfall depth similar to the one observed by Hawkins (1993) and Hjelmfelt et al. (2001). Furthermore, they modeled the observed CN-rainfall variation with a simplified two-CN heterogeneous system, reducing the CN spatial variability into two classes.

However, more than two CN value categories generally appear in natural watersheds. Thus, in order to more closely describe the real conditions of natural watersheds it would be preferable if the number of the CN value categories identified was equal to the number of soil-land cover complexes distinguished in each watershed. However, every added CN category requires the determination of two extra parameters (the corresponding CN value and the area it covers), giving rise to the non-convergence and non-unique solution problems when the inverse solution procedure is applied. Though, the involved parameters have a clear physical meaning allowing the position of constraints taking into account the specific characteristics of the examined watershed and facilitating the determination of the remaining parameters based on the available rainfall-runoff measurements.

In this manner, the proposed methodology extends the number of CN value categories identified to the actual number of soil-land cover complexes existing in the watershed. The proposed method exploits the available rainfall-runoff data, remote sensing data, and GIS techniques to identify the spatial distribution of CN values along the watershed taking into account the specific characteristics of the watershed.

Assuming that the total Q at the watershed’s outlet is the sum of the partial Q i (i = 1 to n) coming from each of the n different soil-land cover complexes existing in the watershed, which is characterized by a CN = CNi and covers an a i percentage of the total area, Eq. 6 yields:

$$ CN = \frac{{25400}}{{\frac{P}{\lambda } + \frac{{\left( {1 - \lambda } \right)\sum\limits_{{\iota = 1}}^n {{Q_{\iota }}} - \sqrt {{{{\left( {1 - \lambda } \right)}^2}{{\left( {\sum\limits_{{\iota = 1}}^n {{Q_{\iota }}} } \right)}^2} + 4\lambda P\sum\limits_{{\iota = 1}}^n {{Q_{\iota }}} }} }}{{2{\lambda^2}}} + 254}} $$
(7)

where:

$$ \matrix{ {{Q_i} = 0}{if}{\lambda P < \frac{{25400}}{{C{N_i}}} - 254} \\ {{Q_i} = \frac{{{a_i}}}{{100}}\frac{{{{\left( {P - \lambda \left( {\frac{{25400}}{{C{N_i}}} - 254} \right)} \right)}^2}}}{{P + \left( {1 - \lambda } \right)\left( {\frac{{25400}}{{C{N_i}}} - 254} \right)}}}f{\lambda P \geqslant \frac{{25400}}{{C{N_i}}} - 254} \\ }<!end array> $$
(8)

and

$$ \sum\limits_{{i = 1}}^n {{a_i}} = 100 $$
(9)

The corresponding CNi values can be estimated based on Eq. 7 accomplishing the following steps:

  1. 1.

    The measured P and Q values are sorted separately and then realigned on a rank order basis to form P-Q pairs of equal return period following the frequency matching technique (Hjelmfelt 1980; Hawkins 1993; Hjelmfelt et al. 2001). Then the measured P-Q data are transformed to the equivalent P-CN data using Eq. 6.

  2. 2.

    The watershed is divided in a set of n relatively uniform subareas corresponding to the main soil-land cover complexes distinguished in the watershed by overlaying the available land cover and soil data. The resulting subareas are clearly spatially identified along the watershed and the corresponding area percentages (a i , i = 1 to n) are calculated using a GIS.

  3. 3.

    Additional constraints are posed, if needed, based on specific characteristics of the watershed (i.e. the CN of water bodies or impervious areas like paved roads can be set equal to 100)

  4. 4.

    Equation 7 is fitted to the transformed CN-P measured data curve yielding a set of best estimates for the CN i values corresponding to each soil-land cover complex.

  5. 5.

    The estimated CN values are assigned to the corresponding subareas in the GIS, yielding the spatial distribution of CN values along the watershed.

2.2 Case Study Site

The proposed method is demonstrated in the small scale experimental watershed of Lykorrema stream (15.2 km2), which is situated in the east side of Penteli Mountain, Attica, Greece, and is centered at approximately N38.02° and E23.55° (Fig. 2). This watershed was selected because it has been presented in the international literature as an example of the “complacent” behaviour, and detailed geographical data were available (Soulis et al. 2009).

Fig. 2
figure 2

Map of the Lykorrema experimental watershed

The region is characterized by a Mediterranean semi-arid climate with mild, wet winters and hot, dry summers. Precipitation occurs mostly in the autumn–spring period. The yearly average precipitation value for the last 6 years studied is 690 mm. The watershed is divided in two sub-watersheds. The Upper Lykorrema watershed (7.84 km2) presents a relatively sharp relief, with elevations ranging between 280 m and 950 m. Its average elevation is 560 m and its average slope is as high as 36 %. The Lower Lykorrema watershed (7.36 km2) also presents a relatively sharp relief with elevations ranging between 146 m and 643 m. Its average elevation is 310 m and its average slope is 21 %. The watershed is dominated by coarse soils with high hydraulic conductivities and a smaller part is covered by medium textured soils presenting relatively high hydraulic conductivities. The dominant vegetation type in the watershed is pasture with a few scattered tufts of trees. There is also a dense road network, mainly in the lower part of the watershed, where a settlement exists. A small part of the watershed is covered by bare rock. The Upper and Lower Lykorrema experimental watersheds are operated from the Agricultural University of Athens, Greece and the National Technical University of Athens, Greece, respectively. Detailed description of the hydrology, climate and physiography of Lykorrema experimental watershed and of the available geographical and hydro-meteorological databases are provided by Soulis (2009) and Soulis et al. (2009).

2.3 Storm Events

The study area is equipped with a dense hydro-meteorological network, which is fully operational since September 2004. For the current analysis, all the storm events producing significant direct runoff that took place from September 2004 to August 2008 were used (30 events). A storm event was considered significant when the value of peak flow rate in the hydrograph was greater than 0.15 m3/s for the Upper Lykorrema watershed and 0.25 m3/s for the entire watershed. The end of an event was defined when a six-hour period without rainfall occurred. The watershed areal rainfall estimation was made using the Thiessen polygons method, while base-flow was separated using the constant slope graphical method (Dingman 2002). The potential maximum retention S and the CN value were directly estimated from the measured rainfall and runoff data using Eqs. 5 and 6. Details on the characteristics of these events are provided by Soulis et al. (2009).

3 Results and Discussion

In order to demonstrate the use of the proposed methodology, the CN values characterizing the Lykorrema watershed were estimated based only on the data coming from the Upper Lykorrema watershed. Then, the resulted CN values were validated using all the available data.

It must be noticed that in the following analysis, the constant value of λ = 0.2 corresponding to the standard case is initially examined. This value of λ was selected to obtain CN values that are compatible with those provided by the classical SCS-CN method documentation. Nevertheless, the same analysis can be applied for other λ values, as well.

Exploiting the available spatial data coming from a detailed soil survey in the area and a detailed land cover classification based on remote sensing techniques, the three main soil-land cover complexes existing in the Lykorrema watershed were spatially identified and the corresponding area percentages were calculated (Fig. 2 and Table 1). The first complex includes areas characterized by sandy loam (SL) soils with very high hydraulic conductivity that are covered by a mixture of pasture and shrublands with a few scattered tufts of trees. The second complex includes areas with similar land cover as the first one but characterized by sandy clay loam (SCL) soils with relatively high hydraulic conductivity. The last complex includes almost impervious areas i.e. roads, buildings, areas covered with bare rock and streams.

Table 1 Characteristics of the three main soil-land cover complexes present in the Lykorrema watershed

The measured P and Q data of the Upper Lykorrema watershed were ordered to form P-Q pairs of equal return periods and transformed to the equivalent P-CN data, as it was previously described. Following, Eq. 7 was fitted to the resulted P-CN data curve for the Upper Lykorrema watershed yielding the best estimates of the CN values corresponding to the three soil-land cover complexes existing in the watershed (Fig. 3 and Table 1). The resulted CN values were validated in comparison to the original rainfall runoff data (i.e. before being ordered and realigned) to demonstrate that the SCS-CN method, using the CN values obtained by the proposed CN determination methodology, provides adequate predictions of the watershed’s runoff response (Fig. 4a and b).

Fig. 3
figure 3

Equation 7 fitted to the P-CN data curve of the Upper Lykorrema watershed for λ = 0.2 and λ = 0.05

Fig. 4
figure 4

Predicted runoff values in comparison to the original measured runoff values, against the total rainfall depth, for (a) the Upper and (b) the Entire Lykorrema watersheds

In Fig. 5 the spatial distribution of the identified CN values along the watershed is illustrated, highlighting the ability of the proposed methodology to provide information on spatial watershed characteristics that drive hydrological behavior. In Fig. 4a and b the predicted runoff values using the CN values resulted by the proposed methodology (Table 1) are plotted in comparison to the original measured runoff values, against the total rainfall depth, for the Upper (Fig. 4a) and the Entire (Fig. 4b) Lykorrema watersheds. It can be seen that there is good agreement between the predicted and the measured runoff values for both watersheds (R 2 = 0.863 and R 2= 0.916 for the Upper and the Entire Lykorrema watersheds respectively).

Fig. 5
figure 5

Identified CN values spatial distribution

For comparison purposes the above analysis was also carried out using the alternative constant value of λ = 0.05 as suggested by Woodward et al. (2003), as well as other researchers. The fitting procedure provided slightly inferior results to the standard case of λ = 0.2 (Fig. 3). The resulted CNi optimal values were CN1 = 10, CN2 = 34, and CN3 = 99. It must be noticed that these CN values are not directly comparable to the CN values estimated using λ = 0.2, since according to Eq. 6 the CN values depend on the λ value. The previous problem of the non-correspondence of the CN values obtained for the two different values of λ is also discussed by Woodward et al. (2003). Runoff predictions based on the CN values resulted by the proposed methodology for λ = 0.05 are also slightly inferior to those of the standard case (Fig. 4a and b). The above results demonstrate that the proposed method is applicable for other than λ = 0.2 values of λ as well; however, in this case the resulted CN values are different and they are not compatible with the classical CN values provided in the original SCS-CN method documentation. It is also worth noting that, if λ is considered as an additional free parameter, then the fitting procedure presents serious problems of non-uniqueness of the solution (e.g. different initial guess values of λ applied in the 4-parameter optimization procedure leads to different final estimates of optimized parameters).

The runoff predictions obtained by the proposed heterogeneous system approach are also compared to the runoff predictions obtained by the SCS-CN method using a single optimum CN value for the cases of λ = 0.05 and λ = 0.2 in Fig 4a and b. It can be observed that the proposed method (for both λ = 0.05 and λ = 0.2) provides superior runoff predictions than the SCS-CN method using a single CN value.

Usually, lower initial abstraction ratios are selected for watersheds characterized by low CN values (Woodward et al. 2003). Low CN values correspond to soil, land cover and land management conditions favoring high total rainfall abstractions (S). Low CN values also correspond to high initial abstractions (I a ), which is in accordance to the previous soil, land cover and land management conditions. However, in many watersheds characterized by low CN values, runoff response may observed even for low rainfall depths. In these cases, the runoff response observed for low rainfall depths can be modeled by selecting low initial abstraction ratios (e.g. λ = 0.05).

In contrast, in the present method, the runoff generated for low rainfall depths is attributed to the inevitable presence of spatial variability in natural watersheds; i.e. at low rainfall depths runoff response is generated in parts of the watershed characterized by high CN values (impervious areas, roads, wet areas, streambeds etc.). As rainfall depth increases other parts of the watershed, characterized by lower CN values, may contribute to the runoff response as well. The proposed approach is in line with the physical characteristics of natural watersheds and it justifies the CN-P variation observed in natural watersheds (Soulis and Valiantzas 2012). Additionally, if the value of λ = 0.2 is selected, then the estimated CN values are directly compatible with the original SCS-CN method documentation. Besides, the proposed approach is valid for other λ values, as well. Further study to additional experimental watersheds with known characteristics may improve the selection of the appropriate initial abstraction ratio value.

4 Summary and Conclusions

Soulis and Valiantzas (2012) stated the hypothesis that the correlation between the calculated CN value and the rainfall depth, which was observed in previous studies (e.g. Hawkins 1979; 1993; Hjelmfelt et al. 2001), reflects the effect of the inevitable presence of soil-cover complex spatial variability along watersheds. Based on this hypothesis, they introduced the concept of a simplified idealized heterogeneous system composed by two different CN values. The behaviour of the CN-P function produced by this system was analysed systematically and it was found similar to the CN-P variation observed in natural watersheds. They also demonstrated that the two-CN system approach provides reasonable accuracy in predicting runoff volumes and it performs better than the previous original method.

In this study, a methodology to determine the spatial distribution of the SCS-CN parameter values from rainfall-runoff data in heterogeneous watersheds is proposed based on the same hypothesis. This methodology exploits the observed correlation between the calculated CN values and the rainfall depths in combination with specific characteristics of the watershed, such as soil and land cover types, in order to extend the number of the identified CN value categories to the actual number of soil-land cover complexes, which are present in the watershed.

The proposed methodology was calibrated and validated using the detailed hydro-meteorological and spatial data available for the Lykorrema experimental watershed in Greece. It was demonstrated that the proposed methodology is able to provide reasonable accuracy in predicting runoff volumes and at the same time to effectively describe the CN values spatial heterogeneity that drives the watershed’s hydrological behaviour. This last capability, i.e. to provide information on CN values spatial distribution and thus spatially distributed runoff estimations, can be proven valuable in spatial distributed hydrological modeling and environmental hydrologic analysis. Furthermore, it allows the estimation of CN values for specific soil-land cover complexes in heterogeneous watersheds, which may facilitate future studies aiming at the adaptation and the documentation of the SCS-CN method in various regions and for various land uses.