1 Introduction

In 1985, the first concrete face rockfill dam (CFRD) was constructed in California. The dam construction was followed by constructing a timber faced dumped rockfill dam that had been started in 1850 (Fell et al., 2005). Through the progress of engineering concepts and technology, this type of dam construction has become popular during recent years, especially in areas with shortage of impenetrable soil (Ozkuzukiran et al., 2006). In fact, this type of dam has been constructed for almost 150 years around the world. Over the past two decades, many of them were constructed with heights exceeding 150 m. For instance, Shuibuya, with a height of 233 m, and Mongjiadu, with a height of 179 m (both in China) are two examples (Zhou et al., 2011). China is one of the countries where CFRDs have become common, and as a result, a Chinese design code for CFRDs has been developed and is used in the field (Chinese National Committee on Large Dams, 1999). The history of construction and design of modern rockfill dams, including description of designing procedure of CFRDs, was presented by Cooke (1984). Along the same line of thought in Australia, there has been an attempt to provide the standard procedure of designing CFRDs based on the results of their own experience (Australian National Committee on Large Dams Incorporated, 1991).

One of the most common causes of dam failure is the settlement of crest and body, which leads to the formation of cracks across the body and downstream slope. These cracks increase drainage flow at the dam toe and cause instability, which is transmitted to the entire structure and eventually failure of the dam. In most of the dams, the crest settlement is not uniform, although it may be symmetrical (because the load is the maximize in axial points and reaches to zero in toes). The impact of non-uniform settlement (even if it is symmetrical) in various sectors of construction is not negligible. In conventional design procedures, the crest settlement is considered as 2% of the total height and in areas prone to earthquakes (due to earthquake effects) 1% of this amount will be added to the dam body height. However, the settlement for the entire dam crest cannot be accurately estimated (Kutzner, 1997). The crest settlement of rockfill dams will be consistently continued for a long time after construction. It is believed that a significant amount of the settlement occurs between 24–30 months after construction (Dascal, 1987). To monitor the dam, a good assessment of dam settlement after construction is required to warn the engineers against any potential problems (Habibagahi, 2002).

Considering the above discussion, it is obvious that prediction of dam crest settlement is of major importance. In most studies, one or several dams have been considered specifically to determine the influential parameters in predicting dam settlement (Park et al., 2005; Malla et al., 2007; Gikas and Sakellariou, 2008; Seo et al., 2009; Zhou et al., 2011). Based on these studies, it is noted that the dam height is the main factor in crest settlement (Kim and Kim, 2008). Among the few proposed relationships in this field, Clements (1984)’s equation was developed for the crest settlement of 68 rockfill dams after completion. He suggested an equation in the form of S=aH b, where S is the crest settlement, H is the dam height, a and b are constant numbers in which a=0.002 (at the initial impounding) and a=14×10−7 (after 10 years of operation), b=1.1 (at the initial impounding) and b=2.6 (after 10 years of operation). One of the limitations of this equation is that only one input parameter is used; however, the dam settlement prediction depends on several parameters (such as time and shape factor). As each parameter has an effect on another, the application of the empirical methods could become impractical. On the other hand, no specific relation has been developed for predicting settlement of CFRDs yet. To overcome these limitations, soft computing techniques can be used to develop a more accurate and reliable predictive method. In the present study, two intelligent methods, adaptive neuro-fuzzy inference system (ANFIS) and gene expression programming (GEP), widely used in solving complex engineering problems, have been utilized to propose new models for predicting CFRD crest settlement.

2 Intelligent methods

The dam behavior depends on many influential factors, which complicate the prediction of the settlement of dam structure. Methods derived from experimental modeling are not only time-consuming, but also do not provide reasonable estimates. Therefore, in such cases, the application of artificial intelligence based methods is recognized to be an appropriate substitute. In recent years, these methods have been widely used in problems related to geosciences and geotechnical engineering (Lazzari and Salvaneschi, 1994; Beiki et al., 2010; Fragos et al., 2010; Mollahasani et al., 2011; Mousavi et al., 2012).

2.1 Adaptive neuro-fuzzy inference system (ANFIS)

ANFIS is the integration of fuzzy logic (FL) and artificial neural network (ANN) introduced by Jang (1993). Fuzzy systems and ANNs include various advantages and disadvantages. A fuzzy system can model the qualitative aspects of human knowledge and reasoning processes, whereas it does not feature any earning capabilities. In other words, a fuzzy system cannot be trained. Nevertheless, neural networks are able to do self-training using datasets. Meanwhile, neural networks are implicit and they are unable to use human language (Kartalopoulos, 1996). To overcome these deficiencies, ANFIS has been proposed. ANFIS has the advantages of both fuzzy and neural systems (Srinivasan and Fisher, 1995; Jang and sun, 1997).

ANFIS systems are often applied in conjunction with the Takagi-Sugeno (TSK) fuzzy system. The main purpose of Takagi-Sugeno approach is to automate fuzzy modeling using measured data. In the fuzzy TSK method, for a system bearing two input parameters X and Y each describing a membership function, the following IF-THEN rules apply:

$${\text{If}}x = {A_1}{\text{andy = }}{{\text{B}}_1}{\text{then}}{{\text{f}}_{1(x,y)}} = {p_1}x + {q_1}y + {k_1},$$
(1)
$${\text{If}}x = {A_2}{\text{and}}y{\text{ = }}{{\text{B}}_2}{\text{then}}{{\text{f}}_{2(x,y)}} = {p_2}x + {q_2}y + {k_2},$$
(2)

where x (or y) is the input to node, i, p, q, and k are consequence parameters resulted from the training; and A and B are labels of the fuzzy set (small, large, etc.) defined as suitable membership functions. According to Fig. 1, ANFIS process acts in five steps (Jang et al., 1997; Kayadelen, 2011; Jalalifar et al., 2011).

  1. 1.

    The basic parameters of membership functions are defined, i.e., bell-shape membership functions are described as follows:

    $${\eta _A}(X) = {1 \mathord{\left/ {\vphantom {1 {\left( {1 + |X - {c_i})/{a_i}{|^{2b}}} \right),}}} \right. \kern-\nulldelimiterspace} {\left( {1 + |X - {c_i})/{a_i}{|^{2b}}} \right),}}$$
    (3)

    where η A is the bell-shaped membership function, and a, b, and c are parameters of the equation. Changes in each of these parameters will change bell-shape membership function symmetrically. In fact, various forms of membership functions for fuzzy sets were made. Parameters of this layer are defined as initial (or premise) parameters.

  2. 2.

    The firing strength of a rule, which represents the product of the incoming signals, is calculated by

    $${U_{2,i}} = {W_i} = \eta {A_i}(X)\eta {B_i}(Y),i = 1,2.$$
    (4)
  3. 3.

    The firing strength calculated in the second step is normalized using the following equation:

    $${U_{3,i}} = {\bar W_i} = {W_i}/({W_1} + {W_2}),i = 1,2,$$
    (5)

    where \(\overline {{W_i}} \) is the normalized firing strength.

  4. 4.

    The effect of each rule on output is calculated with an adaptive node function as

    $${U_{4,i}} = {\bar W_i}{f_i} = {\bar W_i}({p_i}X + {q_i}Y + {k_i}),$$
    (6)

    where p i , q i , and k i are the consequence parameters.

  5. 5.

    The final output of ANFIS model is

    $${U_{5,i}} = \sum\limits_i {{{\bar W}_i}} {f_i} = {{\left( {\sum\limits_i {{W_i}} {f_i}} \right)} \mathord{\left/ {\vphantom {{\left( {\sum\limits_i {{W_i}} {f_i}} \right)} {\left( {\sum\limits_i {{W_i}} } \right).}}} \right. \kern-\nulldelimiterspace} {\left( {\sum\limits_i {{W_i}} } \right).}}$$
    (7)
Fig. 1
figure 1

The first-order TSK fuzzy model (a) and ANFIS architecture (b) (Jalalifar et al., 2011; Kayadelen, 2011)

In this system, training means that with the use of training data, the non-linear parameters related to the fuzzy membership functions at the first level and the linear parameters of the forth layer are determined in a way that for each desired input, a favorable output is obtained. During this fuzzy-neural process, membership function parameters are regulated through the back propagation (BP) algorithm or in combination with the least squares (LS) method (Jang et al., 1997; Demuth and Beale, 2001).

Using various methods of identification, different techniques to construct ANFIS model are available such as grid partitioning (GP), subtractive clustering method (SCM), fuzzy c-means clustering (FCM) (Jalalifar et al., 2011). In this study, to identify premise membership functions, the three aforementioned methods were used.

2.1.1 Grid partitioning (GP)

In the GP method, each part of premise variables is recommended independently (Jang, 1993). In developing this expert model, membership functions of all premise variables are defined by using previous experiences and knowledge. Membership functions are designed to provide the concept of linguistic terms in a specific content. In most systems, no specific knowledge is available for these partitions. In these cases, domains of premise variable can simply be partitioned into a number of equally-spaced and equally-shaped membership functions. Using available input-output data, parameter of membership function can be optimized (Jalalifar et al., 2011).

2.1.2 Subtractive clustering method (SCM)

The SCM is a modified form of the mountain, originally introduced by Chiu (1994). When there is no clear view of the number of clusters that should be specified for data collection, this algorithm will be a fast method for finding the number of clusters and also its centers (MATLAB User’s Guide, 2006). This method assumes that each data will be analyzed as a clustering center (Delmirli and Muthukumaran, 2000). The point that has the maximum number of neighbors is selected as the cluster center. The data will be placed within a circle with a fuzzy radius. The algorithm will search for a new point between the points with more neighbors. Then, by finding a new point, other points with fewer neighbors will be ignored. This procedure continues until all points are tested. In this method, the smaller radius clustering increases the rules and clusters, and vice versa (MATLAB User’s Guide, 2006).

2.1.3 Fuzzy c-means clustering (FCM)

The FCM is the most common method of fuzzy clustering. This method will place the data in groups, based on their degree of membership. This method was introduced by Bezdek (1981) and, in fact, is an optimized method of clustering such as the K-means method (Dave and Krishnapuram, 1997; Chintalapudi and Kam, 1998).

Generally, in systems where no certain knowledge is available for recognizing membership functions, these functions are defined for all primary variables using previous experiences. Therefore, the GP method is utilized to meet the requirements. But the aim of clustering method is to categorize the huge data sets and to provide a simple representation of system behavior. In case the number of clusters is unknown, by using the SCM, the number and center of clusters can be found quickly. In the FCM method, which is proposed to improve SCM performance, the data are classified into pertinent groups based on their degrees of membership.

2.2 Gene expression programming (GEP)

Ferreira (2001) proposed for the first time an algorithm based on genetic algorithms (GAs) and genetic programming that was called gene expression programming (GEP) (Kayadelen, 2011). GEP is a new evolutionary algorithm that has been established to overcome many limitations of GA and genetic programming (Teodorescu and Sherwood, 2008; Kayadelen, 2011). GA is a machine learning model, inspired from the behavior evolution mechanism of nature (Sivanandam and Deepa, 2008). However, genetic programming was presented to automate programming and induction and was introduced by Koza (1992) after testing on the symbolic regression. Genetic programming can be viewed as a special type of GAs, where the size of chromosomes will be modified and changed by genetic operators. The genetic programming can be considered as a subset of GAs. The main difference between GA and genetic programming is that development programs in the second subset are as parse trees, but in the first subset it is a binary string of fixed length. GEP can be considered a new subset genetic programming, and the differences between these two are the ways of providing solutions. GEP has two main parts called chromosomes and expression tree (ET) (Ferreira, 2001; Kayadelen, 2011). Any mathematical information is coded in gene in chromosomes by way of bilingual and conclusive language called Karva Language (the language of the genes), and also is translated to the ETs by means of the language of ET (Kayadelen, 2011). Fig. 2 demonstrates the GEP algorithm that the selected process starts with five elements such as: function set, terminal set, fitness function, control parameters, and stop conditions (Teodorescu and Sherwood, 2008).

Fig. 2
figure 2

The algorithm of GEP (Teodorescu and Sherwood, 2008)

In this method, a mathematical function is defined as a chromosome including many genes, which is extended using the provided data. In GEP, it is encoded as simple strings of fixed-length which are subsequently expressed as ETs of different sizes and shapes. This algorithm randomly builds an initial chromosome, which indicates a mathematical function. Then, it is converted into an ET. In other words, this method creates a fixed-length string character to provide solutions. These solutions are tree-like structures, and these ETs can be written as mathematical functions. Each ET consists of two parts called ‘function set and terminal set’ (an example as shown in Fig. 3 (Kayadelen, 2011)). Then the next step is to compare predicted results with real values. The GEP process will stop if the error is in agreement with error standards that have been initially specified. If the error standards cannot be met, some of the chromosomes will be selected by roulette wheel sampling selection to find a new chromosome and mutation will occur (Kayadelen, 2011).

Fig. 3
figure 3

Schematic of a chromosome with one gene and its expression tree and corresponding mathematical equation (Kayadelen, 2011)

GEP has four main operators, including: selection, mutation, transposition, and crossover. Selection operator is responsible for selecting chromosome using roulette wheel sampling method with elitism to determine probability of producing offspring. In encoding sequence the chosen chromosome will change by mutation operator or in other words the selected chromosome will mutate. Transposition operator will duplicate a part of the chromosomes accidentally and put it in another location. Then coding in chromosomes (which are chosen randomly) will change through (crossover) combining. This is the base of GEP function (Ferreira, 2006). The rate of the operator is determined by users before analysis. The mutation rates are usually between 0.1 and 0.01. It also recommended that the rate of crossover and transposition be considered 0.1 and 0.4.

3 Database

The information of 24 CFRD has been used in this research, based on the information of previous studies in different regions of the world. The specifications of these dams are given in Table 1 (Kim and Kim, 2008).

Table 1 Specifications of some CFRDs used in this study (Kim and Kim, 2008)

In order to simulate the settlement prediction, data was divided into training and test sets. 80% (19 sets of data) were used for training and 20% (five sets of data) were used for the test. We attempted to select the test data that includes various dam heights, and then obtained simulating can be applicable in various heights. Three input parameters used were height (H), shape factor (S f), and time (t), in the prediction of dam crest settlement (CSCFRD). The complete information and the range of each parameter is presented in Table 2.

Table 2 Range of different parameters used in this study

Several parameters, such as height of dam, condition of the foundations, dam shape, duration of impounding, and geomechanical parameters of rock fills are considered influential in settlement. All of the available relations in predicting settlement of embankment dams depend on a single factor, namely the dam height. Since only the three parameters, dam height, time, and shape factor are common in all CFRDs, they have been selected as inputs (Fig. 4).

Fig. 4
figure 4

Cross plots of the relationship between crest settlement (CSCFRD) and independent variables (a) Dam height; (b) Shape factor; (c) Time; (d) Number of dam

Four statistical evaluation criteria were used to assess the performances of the applied intelligent methods. These criteria are mean absolute error (MAE), standard deviation (σ), root mean square error (RMSE), and determination of the coefficient (R 2), respectively, given by Eqs. (8)(11) (Kayadelen, 2011; Mousavi et al., 2012; Ozcan, 2012).

$${\text{MAE = (1/}}n)\sum\limits_{i = 1}^n {|{r_i}} - {p_i}|,$$
(8)
$$\sigma = \sqrt {(1/n)\sum\limits_{i = 1}^n {({e_i}} - \bar e{)^2}} ,$$
(9)
$${\text{RMSE}} = \sqrt {(1/n)\sum\limits_{i = 1}^n {({r_i}} - {p_i}{)^2}} ,$$
(10)
$${R^2} = {\left( {\frac{{\sum\limits_{i = 1}^n ( {p_i} - \bar p)({r_i} - \bar r)}}{{\sqrt {\sum\limits_{i = 1}^n ( {p_i} - \bar p{)^2}\sum\limits_{i = 1}^n ( {r_i} - \bar r{)^2}} }}} \right)^2},$$
(11)

where r is the actual value and p is the predicted value, \(\bar r\;{\text{and }}\bar p\) are the mean of actual and predicted values, respectively, e is the absolute error (r ip i ), \({\bar e}\) is the mean of absolute error \((1/n)\sum\limits_{i = 1}^n {({r_i} - {p_i})} \) and n is the number of data sets.

4 Predicting dam settlement using two intelligent methods

4.1 Predicting dam settlement using ANFIS method

In this research, MATLAB software has been used for ANFIS modeling. To predict the dam crest settlement, three ANFIS methods (GP, SCM, and FCM) were employed. Before modeling, in order to reduce the range of input and output parameters, all parameters were normalized in [−1, 1] interval. The training phase was completed with 19 data sets while the remaining 5 sets of raw data were applied for validation of ANFIS models. The RMSE rate has been used as a criterion for evaluation and comparison. In Tables 35 the results of each method to achieve the best ANFIS structure have been specified. By using the trial-and-error method, and according to the RMSE, the appropriate model was selected. Based on the GP results presented in Table 3, the third row has the minimum rate of RMSE. This ANFIS model has four membership functions (Gaussian2), parameters of which have been optimized by the hybrid algorithm (combination of BP and least squares). The system consists of five layers and 64 IF-THEN rules, which connect to each other by AND operator as displayed in Fig. 5 (p.597).

Table 3 Effect of different parameters on crest settlement ANFIS model (GP) and RMSE of each model
Table 4 Effect of different parameters on crest settlement ANFIS model (SCM) and RMSE of each model
Table 5 Effect of different parameters on crest settlement ANFIS model (FCM) and RMSE of each model
Fig. 5
figure 5

ANFIS structure made by GP method to predict CS CFRD based on three input parameters (height, shape factor, and time)

Furthermore, in ANFIS model, after 64th epochs of training, testing phase RMSE of the model reached its minimum value (0.0510). The value of R 2 is between the predicted and real rate of crest dam settlement for test data, which is equal to 0.9693 (Fig. 6a).

Fig. 6
figure 6

Coefficient of determination between real values of dam crest settlement and predictions of ANFIS (a) Grid partitioning (GP); (b) Subtractive clustering methods (SCM); (c) Fuzzy c-means clustering (FCM)

Table 4 shows the results of the second ANFIS (SCM) model based on different parameters. According to the RMSE rate, the most appropriate result is in row number 10. The cluster radius of this model is 0.56 and it is formed of 11 rules. The structure of the present model is given in Fig. 7. After 13th epochs of training, the RMSE of the testing phase reached its minimum value (0.0553). The value of R 2 is between the predicted and real rate of crest dam settlement for test data, which is equal to 0.8657 (Fig. 6b).

Fig. 7
figure 7

ANFIS structure built using SCM method to predict CS CFRD based on three input parameters (height, shape factor, and time)

In the third ANFIS model, which uses FCM, the optimum structure is shown in row 12 of Table 5. Fifteen Gaussian membership functions with application of hybrid method in the 100th epochs created the minimum value of RMSE for the test data. RMSE and R 2 values of this model are 0.0430 and 0.8848, respectively (Table 5 and Fig. 6c). ANFIS current structure has 15 rules. The structure has been demonstrated in Fig. 8. Table 6 can be proposed to indicate the overall results and the specifications of ANFIS three models. In this table, other characteristics of models, such as the number of linear and nonlinear parameters, are also shown.

Table 6 Different parameter types and their values used for the prediction of CS CFRD ANFIS model
Fig. 8
figure 8

ANFIS structure built using FCM method to predict CS CFRD based on three input parameters (height, shape factor, and time)

4.2 Predicting dam crest settlement using GEP method

In this study, the software GeneXProTools 4.0 was used to employ the GEP method. To obtain the most appropriate model for solving the problem, a series of adjustments made in the software were undertaken (Table 7). Determining the number of genes is one of the most important issues, affecting the accuracy and application of the model (the decrease or increase of the number of genes may lower the precision of the obtained equation or prolong it). According to the above conditions, only three genes were used. To communicate between the mapping functions, each ET of a function is used to connect models with more than one gene in order to calculate the final mapping function, which is composed of several genes (these adjustments are obtained from the trial-and-error method).

Table 7 GEP software configurations

For example, Eq. (12) obtained from this procedure is illustrated in Fig. 9.

Fig. 9
figure 9

Expression trees of Eq. (8)

Using the 19 sets of data (80%), two equations are obtained based on three input parameters (dam height (H), shape factor (S f), and time (t)) to predict dam crest settlement (CSCFRD). Eq. (12) has the constant, while Eq. (13) is without constant.

Five sets of data (20%) were used for the evaluation. The values of R 2 for the real dam crest settlement and GEP prediction (with and without constant) are 0.9603 and 0.9734, respectively (Fig. 10). Survey results to predict CSCFRD are also demonstrated in Table 8. Note that Eq. (12) (with constant) has a better performance in predicting CSCFRD.

$${\text{C}}{{\text{S}}_{{\text{CFRD}}}} = \left\{ {{{\cos }^2}\left( {\cos \left( {{{\cos }^2}\left( {at/{S_f}} \right)} \right)} \right)} \right\} \cdot \left\{ {\cos \left( {\log \left( {\tan \left( {(H + {S_f})/2} \right) + H + {S_f} + t} \right)} \right)} \right\} \cdot \left\{ { - \cos \left( {\cos \left( {aS_f^3/t} \right)} \right)} \right\},$$
(12)

where a is a constant number, and a=−64.803467

$${\text{C}}{{\text{S}}_{{\text{CFRD}}}} = \left\{ {1/\tan t + \left( {\tan (t + 2H) + \left( {H + {S_f} + (t/4)} \right)/2} \right)} \right\} \cdot \sqrt {\sqrt {\left( {\left( {(3{S_f} + t)/4} \right) - \tan t} \right){S_f}H} } \cdot \left\{ {{1 \mathord{\left/ {\vphantom {1 {|{{\tan }^{ - 1}}(H + 2t - S_f^4){|^{15}}}}} \right. \kern-\nulldelimiterspace} {|{{\tan }^{ - 1}}(H + 2t - S_f^4){|^{15}}}}} \right\}.$$
(13)
Table 8 GEP results to predict CS CFRD
Fig. 10
figure 10

Coefficients of determination between real values of dam crest settlement and predictions of the GEP (a) With constant (Eq. (12)); (b) Without constant (Eq. (13))

5 Discussion

Fig. 11 compares the predictions of all the two methods (ANFIS-based models and GEP-based models) for the test data at a glance. According to Fig. 11, the values predicted by these methods are very close to the real ones of the dam settlement. Proximity or overlapping of the lines shows that, the predictions of intelligent methods are well adapted to the real values. Meanwhile, there are some areas that have relatively large errors (especially in predicting fourth sets of data). High correlation coefficients and low error rates mean that both intelligent methods are excellent methods in modeling such behavior.

Fig. 11
figure 11

Comparing the real of crest settlement and predicted intelligent models ANFIS (GP, SCM, and FCM) and GEP (Eq. (12) (with constant number) and Eq. (13) (without constant number))

Error values (RMSE, MAE, and σ) and correlation of coefficient (R 2) of all intelligent methods are shown in Fig. 12. This figure displayed results of ANFIS model (GP, SCM, and FCM) and GEP equations (with and without constant). The rate of error in the GEP method was less than ANFIS and also the value of R 2 was larger in the GEP method. For ANFIS, the SCM method has more errors and lower R 2 values than other methods (GP and FCM). Comparing GP with FCM, although GP has higher R 2, the error rate of GP (RMSE, MAE, and σ) is larger than that of FCM. Note that the value of R 2 is more definitely that cannot assurance the appropriate results. Therefore, RMSE and MAE should also be considered. Comparing results of Eq. (12) with Eq. (13), it is found that Eq. (13) has a higher R 2 but the rate of error is far more, which has a better performance.

Fig. 12
figure 12

Comparing the rate of R 2, RMSE, MAE and σ for predicting crest settlement using ANFIS (GP, SCM and FCM) and GEP (Eq. (12) (with constant number) and Eq. (13) (without constant number))

Therefore, the preferred predicting results are GEP (Eq. (12)), GEP (Eq. (13)), FCM, GP, and SCM. On the other hand, unlike GEP, ANFIS does not provide a specific equation, and its development is just a coding in MATLAB software that will not be applicable as GEP method for further use. Thus, GEP had a better performance than ANFIS.

It is noteworthy that a particular equation to predict the CFRD crest settlement has not been provided. As the only equation to predict rockfill dams crest settlement is the Clements (1984)’s equation, its constants in different conditions are given in Table 9.

Table 9 Constants of Clements (1984)’s equation in different conditions

Based on the parameters given in Table 9 for the case of an initial impounding and using the equation S=0.002H 1.1, R 2 equals 0.3255. For the second case (10 years after operation), the data pertinent to dams under 10 years of life must be removed first. Consequently, only nine dams remain, and the value of R 2 from Clements (1984)’s equation (S=0.0000014H 2.6) will be equal to 0.2541. Fig. 13 illustrates the amount of accuracy of Clements’s prediction for both cases, in comparison to the real CSCFRD.

Fig. 13
figure 13

Comparison of the real and computed values by Clements (1984) ’s equation (a) At initial impounding; (b) After 10-years service

The two mentioned models indicate that Clements (1984)’s equation lacks the appropriate accuracy for these kinds of dams. In addition, another deficiency is the dependency of this equation to a single parameter, i.e., the height of dam. Increasing the number of input parameters will involve more effective parameters, so the results will be more practical and more reliable. Finally, the presented equations (Eqs. (12) and (13)), together with the appropriate number of inputs, are more accurate in predicting crest settlements of these types of dams (CFRDs) compared to other methods (e.g., empirical methods).

6 Conclusions

CFRDs are now being considered as an alternative at most sites to the embankment and rockfill dams, largely due to the cost and construction schedule. Numerous CFRDs are presently under construction throughout the world, and their popularity is increasing. In the present research, in order to predict the CFRD crest settlement, two approaches using ANFIS and GEP models were used, along with the data of all 24 dams. ANFIS includes FCM, SCM, and GP methods. Furthermore, two equations (with and without a constant) were proposed in the GEP models. Parameters such as H (dam height), S f (dam shape factor), and t (time after first operations) were considered as input parameters. For modeling, data of 24 CFRDs were compiled from reliable sources from seven countries around the world such that 80% of data were employed for training, and the remaining (20%) were used for test (validation) of the models. Results of both intelligent methods were good in predicting the settlement. However, the results of GEP were more reasonable than the ANFIS method. To conclude, this study indicates that intelligent methods are an appropriate tool for solving problems with complex mechanisms and multiple influential factors. These methods have no restrictions on the number of input parameters to predict the behavior of dams. It is noteworthy that the presented intelligent models could reveal the relation between input parameters and their effects on outputs, and have the ability to intelligently generalize the new data.