Introduction

The estimation of saturated soil water content (θ s ) is important for the knowledge of hydrological behaviour of watersheds. Measuring this parameter is time- and finance-consuming. Therefore, indirect estimation of the soil hydraulic properties has been concerned. Pedotransfer functions (PTFs) are functions applied to estimate the soil hydraulic properties using basic soil properties such as sand, silt, and clay percentages, bulk density (BD) and other easy to measure properties (Bouma and Van Lanen 1987). A detailed review of PTFs can be found in vereecken et al. (2010). Multiple-linear regression method (Mayr and Jarvis 1999; Tomasella et al. 2000), group Method of Data Handling (Pachepsky and Rawls 1999) and neural network analysis (ANN) (Schaap et al. 1998; Minasny and McBratney 2002; Minasny et al. 2004) have been used to develop θ s PTFs. Tamari et al. (1996) have presented a review of ANN applicability to estimate soil hydraulic properties. Two problems that should be concerned in developing ANN-based models are included: (1) the learning algorithm may not get optimum weights to minimize prediction errors, (2) a number of weights that are difficult to easy interpretation (Schaap et al. 2001). Therefore, ANN approach is not always applicable to develop the accurate and reliable θ s PTFs (Twarakavi et al. 2009). Adaptive neuro-fuzzy inference system (ANFIS) can be an appropriate approach using the learning capability of an ANN for parameter optimization. ANFIS have been used for database management, signal processing, hydrologic applications (Jang 1993; Nayak et al. 2004) and developing PTFs for estimation of paddy soils CEC (Seyedmohammadi et al. 2016). Researchers have not assessed the potential of this approach in saturated soil water study. There is no discussion in the literature of applications of ANFIS approach for developing θ s PTFs and evaluation of member function types. An important advantage of this technique is that it does not require the model structure to be known a priori compared with most of the approaches (Nayak et al. 2004). Nayak et al. (2004) showed that the ANFIS outperforms ANNs and other traditional methods in terms of prediction errors, computational speed and efficiency. The objective of this study was to evaluate the applicability of ANFIS-based PTFs and different member functions in estimating θ s to overcome the problems with the other approaches in some soils of Iran’s flood spreading areas. In addition, the effect of different input variables to develop θ s PTFs was investigated.

Description of ANFIS

Since the detailed review of adaptive networks of ANFIS have been provided by Jang (1993), only a briefly description is presented in this paper. The adaptive network is a multi-layer feed-forward network in which each node performs a particular function on incoming signals and a set of parameters pertaining to this node (Jang 1993). This structure has five layers including fuzzy layer, product layer, normalized layer, de-fuzzy layer and total output layer. A detail about the functioning of the ANFIS is as follows (Jang 1993):

Layer 1 Every node (i) in this layer is an adaptive node with node function. Ol,i is the output of the ith node of the layer l:

$$\begin{aligned} {\text{O}}_{{ 1,{\text{i}}}} = \, \mu_{\text{Ai}} ( {\text{x)}}\quad \, \;\;{\text{for i }} = { 1},{ 2},{\text{ or}} \hfill \\ {\text{O}}_{{ 1,{\text{i}}}} = \, \mu_{{{\text{Bi}} - 2}} ( {\text{y)}}\quad {\text{ for i }} = { 3},{ 4,} \hfill \\ \end{aligned}$$
(1)

where x (or y) is the input to node i, and Ai is a linguistic label associated with this node. Therefore, O1,i is the membership function of Ai. The bell shaped membership functions (MF) can be computed as follows:

$$\mu_{A} (x) = \frac{1}{{1 + \left| {\frac{{x - c_{i} }}{{a_{i} }}} \right|^{2b} }},$$
(2)

where ai, bi and ci are the parameter set to be learnt.

Layer 2 Each node in this layer denoted as Π which multiply incoming signals and sending the product out.

$${\text{O}}_{{ 2,{\text{i}}}} = {\text{ w}}_{\text{i}} = \, \left( {\mu_{\text{Ai}} \left( {\text{x}} \right)} \right) \times \left( {\mu_{\text{Bi}} \left( {\text{y}} \right)} \right) \quad {\text{ i }} = { 1},{ 2,}$$
(3)

where each node represents the firing strength of the rule. w1, w2 are the weight functions of the next layer.

Layer 3 Each node in this layer is labeled as N. O3,i indicates the Layer 3 output as shown in Eq. 4:

$$O_{3,i} = \bar{w} = \frac{{w_{i} }}{{w_{1} + w_{2} }}.$$
(4)

Layer 4 This layer’s nodes are adaptive with node functions.

$${\text{O}}_{{ 4,{\text{i}}}} = {\text{ w}}_{\text{i}} {\text{f}}_{\text{i}} = {\text{ w}}_{\text{i}} \left( {{\text{p}}_{\text{i}} {\text{x }} + {\text{ q}}_{\text{i}} {\text{y }} + {\text{ r}}_{\text{i}} } \right),$$
(5)

where wi denotes the output of layer 3 and pi, qi, ri are the parameters set.

Layer 5 This layer’s single node computes the overall output as the summation of all incoming signals as:

$$O_{5,i} = \sum\nolimits_{i} {\bar{w}} f_{i} = \frac{{\sum\nolimits_{i} {w_{i} f_{i} } }}{{\sum\nolimits_{i} {w_{i} } }},$$
(6)

where O5,i denotes the layer 5 output.

Materials and methods

The θ s PTFs were developed using the database established by Mahdian and Kamali (2010). Some flood spreading areas in Iran were chosen and soil properties data including sand%, silt%, clay%, Organic carbon and bulk density (BD) were collected. Then, the content of pores of medium size (0.2–30 µm) was calculated. Soil properties have been measured based on USDA standards. Data were divided into approximately 80% for training and the 20% for testing. Then, the performance of ANFIS-based PTFs and different member functions for θ s predictions were assessed. In the present study, the Sugeno fuzzy model (Takagi and Sugeno 1985) and back propagation algorithm were applied for training. A set of models was developed based on the ANFIS approach for the water content at θ s . The selection of best inputs is an important issue in the ANFIS-based models development. ANFIS-based PTFs were developed by varying the above mentioned input variables. In the first set of models (P1) (SSC), sand%, silt%, and clay% were employed as inputs. The second set of models (P2) (SSCBD) employed sand%, silt%, clay% and BD as input variables, while the input data of the third set of models (P3) were included medium porosity (0.2–30 µm) in addition to those inputs used for P1. The fourth set of models (P4) used all input variables mentioned above together. The fifth set of models (P5) used all input variables mentioned above together in addition to organic carbon (OC). Moreover, considering a hierarchical approach, SSCBDθ33 model employed sand, silt and clay percentages, bulk density and water retention point at −33 kPa potential. SSCBD θ33θ1500 added the water content at −1500 kPa to the input variables of the SSCBDθ33 model. In the present investigation, different eight MFs (Gaussian curve (gaussmf), Π-shaped (pimf), Triangular-shaped (trimf), Trapezoidal-shaped (trapmf), Generalized bell-shaped (gbellmf), composed of product of two sigmoidally shaped (psigmf), composed of difference between two sigmoidal (dsigmf) and Gaussian combination (gaussmf)) were assessed for developing θ s ANFIS-based PTFs. The PTFs performance was investigated based on the RMSE, ME, and R2. The MFs of the ANFIS takes values between 0 and 1. Therefore, input data were normalized so as to lie in the range of 0–1 (Masters 1993). The models development was done using MATLAB™ (The MathWorks Inc. USA).

Result and discussion

PTFs performance of the saturated soil water content

The brief statistics of soil properties of the studied flood spreading areas are presented in Table 1. In this research, studied soils were coarse-textured and the bulk densities ranged from 1.02 to 1.95 g cm−3. The goodness of fit statistics on the normalized values (Table 2) for all the PTFs is presented in Table 3. In this research, it was observed that the variation of the performance criteria was significant for ANFIS-based PTFs with different MF types (Table 4). Ignoring the various input data set, the ANFIS-based models were able to estimate soil hydraulic properties with a reasonable accuracy, as can be evidenced by relatively the low RMSE values (Table 3). The fairly same RMSE values for estimation of θ s using different combinations of input variables were observed in Table 3 and Fig. 1. Soil structure and pore-size distribution may extensively influence soil water content in the air-entry and capillary region of the soil water retention curve (Merdun et al. 2006), which were only indirectly accounted for in the θ s PTFs by BD and medium porosity. Medium pore-size was included as an input variable to develop ANFIS-based PTFs of θ s , which was not effective to improve the model accuracy.

Table 1 Summary statistics of input data for training and testing ANFIS-based PTFs
Table 2 Summary statistics on the normalized values of studied soil hydraulic properties for training and testing pedotransfer functions
Table 3 The goodness of fit statistics during validation for the best selected PTFs
Table 4 The correlation coefficient between the estimated and measured saturated soil water content for different member functions (MF1 to MF8)
Fig. 1
figure 1

Measured and estimated soil water content (cm3 cm−3) at saturation (θ s ) by ANFIS-based pedotransfer functions. SSC sand, silt and clay, BD bulk density

Effect of input variables

From Table 3, it is apparent that all of the PTFs perform similarly as the RMSE did not vary significantly amongst different combination of inputs except. One other observation is that R2 values between measured and PTF-estimated θ s using ANFIS approach did not increase as some input predictors were used in the PTFs (from P1 to P5). In all of the θ s PTFs, correlation between estimations of θ s and corresponding observations was the low (Tables 3, 4). Nemes et al. (2003) and Vereecken et al. (2010) have stated that including extra input variables can improve the efficiency of ANN models. However, the results of the present study showed that increasing the number of input variables did not necessarily lead to improve the model performance and better estimations. This is consistent with earlier findings of Nemes et al. (2006) and Haghverdi et al. (2012). As mentioned above, there are many other important parameters such as soil salinity and clay mineralogy that may significantly influence the soil water content (Nemes et al. 2003), which were not accounted for as input data for water retention PTFs in the present research. Vegetation and topography characteristics may significantly affect soil hydraulic properties (Sharma et al. 2006), which were ignored in the current study. Considering vegetation and topography characteristics as similar among training and test datasets, if not included as estimators, may improve the performance of the models (Haghverdi et al. 2012).

Tomasella et al. (2000) reported the poor fits of retention points near saturation. Considering RMSE, it was seen that increasing the BD and porosity as predictors did not necessarily improve the performance of the PTFs. It seems that while evaluating the performance of any model for its accuracy and applicability in predicting saturated soil water content, it is not only important to evaluate the number and type of input variables but also the differences in member functions. Including more input variables for developing the θ s PTFs may be confusing and, especially when they have not significant effect, the development will be complicated. Extra input variables may also be employed to improve the performance of the models, such as soil structure (Minasny et al. 1999), soil chemical properties.

In the case of the ANFIS-based PTFs developed in this study, there are different models with various input data set and MF types. Therefore, it seems ANFIS approach provides more flexible θ s PTFs. To investigate the above mentioned reason and observed improvement, it is needed to do further research evaluating ANFIS in future researches. In the present study, the ANFIS model took 150–300 epochs, implying considerable savings in computational time for ANFIS models. The results suggested that the model building process can be simplified when an ANFIS model is developed compared to an ANN, and the ANFIS model preserves the full potential of ANN models in its performance (Nayak et al. 2004). Neuro-fuzzy techniques remove some of the shortcomings of ANNs. ANFIS had a good applicability in this study that is reported by Nayak et al. (2004), Aqil et al. (2007) and Dastorani et al. (2010).

A comparative analysis suggests that the proposed modeling approach may outperform ANNs and other traditional models in terms of computational speed, estimation errors and performance. Despite the general belief that ANN-based models are relatively successful, there are number of weights that do not allow easy interpretation (Schaap et al. 2001). Considering the above mentioned problems associated with ANN-based PTFs, ANN approach is not always able to improve the PTFs accuracy and reliability (Twarakavi et al. 2009).

The results demonstrated that the ANFIS showed good performance in terms of various statistical indices and the results are promising. It was also observed that the ANFIS model eases the model developing process. Therefore, it is important to develop the suitable PTFs using appropriate approaches (i.e. ANFIS) after the knowledge of limitations and weaknesses that need to be considered. Finally, a good performance may be obtained when pore size distribution would be measured in addition to the typical PTF inputs. It is noted that there are large uncertainties in estimation of ANFIS-based PTFs due to the lack of data for some textural classes (fine- and medium textured) in the training.

Conclusion

This study presented the application of an adaptive neuro fuzzy inference system (ANFIS) to saturated soil water content modeling. In this study, the ANFIS approach was successfully employed to develop saturated soil water content PTFs that used various input set. It was observed that an increase in some of the predictors (i.e. medium porosity) did not result in improved saturated soil water content prediction by PTFs. It was concluded that by increasing the BD as input variable to develop the ANFIS-based PTFs of saturated soil water content, the model accuracy was not improved and the observed difference was not statistically significant. The results were encouraging and suggest that an adaptive neuro-fuzzy approach is viable for modeling θ s . Although estimation errors of ANFIS for θ s estimation were relatively similar, the results were different for various member function types to estimate soil water content at θ s . There is a need for more application of the ANFIS approach especially for fine- and medium textured classes.