A comparative study of the performance of artificial neural network and multivariate regression in simulating springs discharge in the Caspian Southern Watersheds, Iran
 745 Downloads
Abstract
While there are different methods and models that can be applied to estimate the qualitative and quantitative parameters of water resources, unfortunately, no comprehensive qualitative and quantitative data exist about water resources in Iran. The present study is to compare the performance of the artificial neural network (ANN) and the multivariate regression methods in simulating spring discharge in the Caspian Southern Watersheds. Multivariate regression method was used by using SPSS software. Springs average discharge was considered as the dependent variable and other affecting factors as independent variables. Two linear models were presented for estimating the alluvial and karst springs discharge. Then, the models’ performance was evaluated and confirmed. Also, the artificial neural network was applied to simulate the alluvial and karst springs discharge. ANN performance was evaluated through two parameters: median root of square of the error and Pearson’s Rsquared statistics. The results showed that the most important factors of karst springs discharge were the porosity of aquifer formation and the site elevation; in case of the alluvial springs, the transmissivity of aquifer formation and the aquifer depth were the most important factors. Moreover, ANN efficiency in estimating springs discharge was higher than that of the multivariate regression method.
Keywords
Porosity Water table Alluvial and karst springs SimulationIntroduction
Throughout the world, water is considered the most vital element of life. In addition, due to factors such as population increase, development in agricultural, industrial and other human activities, this vital role has become more prominent (Khaleghi and Varvani 2018a, b). The scarcity of appropriate surface water resources and the increasing water demands have caused greater use of groundwater resources (Jang 2015). Furthermore, due to industrial developments, increase in population, and lack of observing the ecological standards, the possibility of water resources pollution has increased. Springs are one of the most important water resources that provide a part of human required water. In the field of water resources management, conducting qualitative and quantitative studies such as the study of springs discharge is so important. On the other hand, the computerized models have provided the tool for water resources management (Gualbert and Essink 2001). Recently, it has become widespread to use the computerized mathematical models in groundwater management. Furthermore, different methods and techniques have been presented for hydrologic and hydrogeological parameters simulations. Lallahem et al. (2005) came to this conclusion that the artificial neural network (ANN) is efficient in groundwater modeling. To evaluate the groundwater level, Ioannis et al. (2005) applied the ANN and found that, when there is limited data at hand, this model can present acceptable groundwater estimation. Using ANN, Krishna et al. (2008) presented a model for groundwater in Kakinada, a coastal city in India. She stated that this model along with the BP (back propagation) network, and the LM algorithm can present the best simulation. For presenting models for groundwater salinity, Hall et al. (2001) conducted a study in Thailand and presented some models for salinity management and estimation. Coppola et al. (2005) found that the ANN technology can serve as a powerful and accurate simulation and management tool that minimizes the degradation of groundwater quality to the extent possible by identifying appropriate pumping policies under variable and/or changing climate conditions. Nichols and Verry (2001) developed regression equations relating yearly groundwater recharge and stream flows to the seasonal precipitation amounts in north central Minnesota. In addition, the previous studies have shown that the impact of climate changes on groundwater is sitespecific (Brouyere et al. 2004). Alvis et al. (2005) investigated alluvial aquifer and its changes in the South Dakota. Their results showed that the water table depth and discharge values varied in different places. Auckenthaler et al. (2005) presented a linear model to simulate karst springs discharge in Switzerland, and their results showed the effect of aquifer formation on karst springs discharge. Worthington (1991) studied karst areas hydrology and his results showed that porosity volume of aquifer formation is an important factor on karst areas hydrologic conditions. Prohaska and Stevanovic (1993), Zhang et al. (1996), Dimitrov et al. (1997) and Bonacci (2001) presented some models to simulate karst springs discharge by using different methods. Their results showed the effects of formation type, precipitation, and water resources such as rivers on springs discharge. Generally, effective factors in alluvial springs discharge are the depth of groundwater, precipitation value and evaporation, topography and distance from water resources, and specifically, the porosity of aquifer formation and aquifer depth in karst springs (Brunner and Kinzelbach 2005; Zhang 2001). The purpose of this study is to use the ANN and multivariate regression to simulate alluvial and karst springs discharge and also to compare the performance of these methods in estimating the springs discharge in the Caspian Southern Watersheds.
Study area
Materials and methods
Artificial neural network
Overview of the artificial neural networks
The artificial neural network is comprised of neurons network. It takes the cue from the neurons’ biological counterparts, so the neurons that are capable of learning can be trained to find the appropriate solutions, recognize patterns, classify data, and even forecast the future events. As a result, they have found a wide range of applications in simulating very complex relationships and modeling many hydrologic problems including rainfall–runoff modeling and streamflow forecasting (Hsu et al. 2002; Riad et al. 2004; Mazvimavi et al. 2005). Such a network usually consists of many layers arranged in serial orders while each layer contains a group of neurons with the same pattern of connections to the neurons in the other layer(s). The first and last layers are used as input and output variables, while the transitional layers are usually connoted as the hidden layer that, depending on the complexity of the problem, can be either one or more. The weights to a neuron are automatically adjusted by training the network according to learning rules until it properly simulates the past data or performs the desired task. Mathematical functions, known as neuron transfer functions, are used to transform the input to output, for each neuron.
Feedforward backpropagation network (FFBP)
For FFBP, the number of hidden layers and the number of the nodes in the input and hidden layers were determined after trying various network structures. There are several methods to avoid overfitting in ANNs. These methods are summarized in Giustolisi and Laucelli (2005).
Analysis of the data
 1.
Rootmeansquared error (RMSE)
In this equation, Obs refers to observed values, calc to values calculated by the network and the model, and n to the number of data in each step. The nearer is RMSE to zero, the nearer are the observed and calculated values to each other, and the more accurate is the simulation in each step.$${\text{RMSE}} = \sqrt {\sum\limits_{i = 1}^{n} {\frac{{\left( {{\text{obs}}  {\text{calc}}} \right)}}{n}} }$$(3)  2.The Pearson’s Rsquared statistics (RSqr)where Qi is the observed value, Ôi is the modeled value, Ōi is the mean of the observed data, Õi is the mean of the modeled data, and n is the number of data in each stage of test and terrain (Eq. 4).$${\text{Rsqr}} = \left[ {\frac{{\sum\limits_{i = 1}^{n} {\left( {Qi  \overline{Qi} } \right) \cdot \left( {\hat{O}i  \tilde{O}i} \right)} }}{{\sqrt {\sum\limits_{i = 1}^{n} {\left( {Qi  \overline{Qi} } \right)^{2} \cdot \sum\limits_{i = 1}^{n} {\left( {\hat{O}i  \tilde{O}i} \right)^{2} } } } }}} \right]^{2}$$(4)

Q_{spring} is the mean spring discharge (m^{3}/s).

D is the mean depth of grand water (m).

L is the distance from water resources (m).

H is the site elevation (m).

P is the mean annual precipitation (mm).

W is the water table (m).

T is the mean transmissivity formation of aquifer formation (m^{2}/day).

Q_{spring} is the spring average discharge (m^{3}/s).

P is the porosity percentage of aquifer formation.

L is the distance from water resources (m).

H is the site elevation (m).

R is the annual mean precipitation (mm).

S is the site slope (degree).
Multivariate regression method
Statistical analysis was performed through SPSS software using a stepwise method. Spring average discharge was considered as independent variables. For karst springs, the aquifer formation porosity (%), aquifer depth, site elevation, annual precipitation, site slope, and distance from the water resources (lakes and rivers) were studied as independent variables. For alluvial springs, the transmissivity of aquifer formation (T), aquifer depth, water table depth, annual precipitation, site elevation, and distance from the water resources were considered as independent variables. Eightytwo alluvial springs in the Caspian Southern coasts and 80 karst springs in the central Alborz highlands were studied. Eighty percent of the sample springs were applied to propose the models, and 20 percent were used to validate the models’ efficiencies. Through this method, two linear models were presented to simulate alluvial and karst springs discharge (Formulas 5, 6). In the next step, efficiencies of the presented linear models for simulation of springs discharge were evaluated. In this stage (validation), the presented models were applied for simulating the spring’s discharges that were not used in the modeling stage.
Results
Correlation between springs discharge and their effective factors
Elevation  Precipitation  Porosity (aquifer formation)  Distance from water resources  

Karst spring discharge  − 0.37  0.01  0.68  0.25 
Elevation  Precipitation  Transmissivity of aquifer formation (T)  Distance from water resources  

Alluvial spring discharge  − 0.15  − 0.07  0.66  0.17 
The results obtained through the linear model with multivariate regression method (karst spring)
Model  Sum of squares  df  Mean square  F  Sig. 

Regression  278.79  5  55.79  10.517  > 0.01 
Residual  233.40  44  5.30  
Total  362.51  49 
The results obtained through the linear model with multivariate regression method (alluvial spring)
Model  Sum of squares  df  Mean square  F  Sig. 

Regression  217.65  2  108.82  25.71  > 0.01 
Residual  215.85  51  4.232  
Total  433.50  53 
That Q_{spring} is the mean spring discharge (l/s), P is the aquifer formation porosity (%), and H is the site elevation (m).
The results of karst springs discharge simulation by the use of ANN in the eight network structure in the testing stage
Models  Algorithms  The best structure  R  RMSE 

1  LM  3–12–1  0.767  0.98 
CG  3–8–1  0.52  2.7  
GDX  3–14–1  0.45  3.1  
2  LM  3–18–1  0.81  0.78 
CG  3–12–1  0.59  1.7  
GDX  3–18–1  0.71  1.2  
3  LM  2–8–1  0.702  0.96 
CG  2–12–1  0.57  1.7  
GDX  2–14–1  0.62  1.5  
4  LM  5–4–1  0.602  1.98 
CG  5–12–1  0.47  2.9  
GDX  5–8–1  0.51  2.56  
5  LM  3–8–1  0.475  2.929 
CG  3–10–1  0.35  4.3  
GDX  3–6–1  0.40  3.9  
6  LM  2–14–1  0.69  1.7 
CG  2–10–1  0.47  2.9  
GDX  2–14–1  0.51  2.1  
7  LM  2–10–1  0.39  3.7 
CG  2–12–1  0.27  5.5  
GDX  2–4–1  0.34  4.3  
8  LM  3–20–1  0.63  1.82 
CG  3–12–1  0.45  3.1  
GDX  3–16–1  0.49  2.9 
The results of alluvial springs discharge simulation by the use of ANN in the eight network structure in the testing stag
Models  Algorithms  The best structure  R  RMSE 

1  LM  3–20–1  0.61  1.3 
CG  3–12–1  0.45  2.9  
GDX  3–10–1  0.57  1.63  
2  LM  3–10–1  0.59  1.62 
CG  3–18–1  0.41  2.6  
GDX  3–14–1  0.51  2.1  
3  LM  2–14–1  0.45  2.89 
CG  2–8–1  0.38  3.72  
GDX  2–18–1  0.41  3.1  
4  LM  5–10–1  0.55  1.95 
CG  5–6–1  0.50  2.2  
GDX  5–18–1  0.47  2.4  
5  LM  3–12–1  0.67  1.1 
CG  3–6–1  0.34  3.9  
GDX  3–16–1  0.51  2.17  
6  LM  2–4–1  0.56  1.9 
CG  2–14–1  0.33  3.98  
GDX  2–8–1  0.48  2.29  
7  LM  2–14–1  0.79  0.78 
CG  2–12–1  0.54  1.9  
GDX  2–10–1  0.60  1.58  
8  LM  3–14–1  0.63  1.4 
CG  3–6–1  0.38  3.7  
GDX  3–10–1  0.52  2.01 
Discussion
In this study, using equal data springs discharge was simulated in the central Alborz highlands and the Caspian coasts through two methods of multivariate regression and artificial neural network. The results of these two methods indicated that the aquifer formation porosity (%) had the highest correlation with karst springs discharge (Zhang et al. 1996; Dimitrov et al. 1997). Also, elevation had a significant relationship with karst springs discharge. But in the case of the alluvial springs, the elevation factor was not important and did not reveal any the significant relationship with springs discharge. For alluvial springs discharge, both transmissivity (T) of aquifer formation and aquifer depth had significant relations with spring discharge and they were the most important factors in alluvial springs discharge (William 2003; Gholami et al. 2008). The evaluation of linear models performances showed that the maximum correlations between the simulated and observed values were 0.63 and 0.73 (karst and alluvial springs), respectively. Nonetheless, using the same data for evaluation of the Artificial ANN’s efficiency, the correlation between estimated and observed values was 81 and 79% (karst and alluvial springs, respectively) correspondingly.
Therefore, the efficiency and accuracy of the ANN in springs discharge simulation were greater than the multivariate regression. This finding was consistent with the results of the past studies that showed the high capability of the neural network with LM learning technique in estimating and simulating groundwater parameters (Prohaska and Stevanovic 1993). Generally, the neural network has the capacity to extract the dominant rules of the data, even for scrambled data. This characteristic of the neural network can be considered as the most outstanding characteristic of these techniques, in comparison with the other methods. Considering the appropriate type of learning technique, the proper number of neurons and hidden layer, the suitable type and the number of input factors, and also through applicable calibration, it can be said that this technique is an efficient and appropriate tool for estimating springs discharge in any investigation (Gholami et al. 2018).
Conclusions
The performance of the artificial neural network with a multilayered perceptron format and LM learning technique was proved in the current study. Considering the results of the network efficiency analysis for different models, and comparing the obtained results with the real data, it can be concluded that the second and seventh models, out of the whole sixteen suggested models, were the best for simulating karst and alluvial springs discharge. In this study, training the network through three learning techniques was investigated and the results indicated that in comparison with the CG, GDX learning techniques, the LM learning technique showed a higher learning speed as well as a greater error decrease. Therefore, through the application of the artificial neural network, it is possible to estimate springs discharge and other qualitative and quantitative water parameters even in dataless situations. Moreover, this approach is applicable in the field of water resources management. Also, for further studies, it is suggested that spring discharges modeling by using the other artificial intelligence.
Notes
Acknowledgements
We thank TAMAB (Water Resources Research Organization of Iran) for providing the data for aquifers and groundwater and also for helping us with the datapreprocessing.
References
 Alvis A, Hargrave R, Francisco E, Fischer D (2005) Aquifer vulnerability of the Inyan Kava Group, Blackhawk QuaDRANGLE, SOUTH Dakota. In: Western South Dakota hydrology conferenceGoogle Scholar
 Auckenthaler A, Reichert P, Huggenberger P (2005) Modeling discharge and microorganism transport in a karst aquifer. Geophys Res Abstr 7:01603Google Scholar
 Bonacci O (2001) Analysis of the maximum discharge of karst springs. Hydrogeol J 9(4):328–338. https://doi.org/10.1007/s100400100142 CrossRefGoogle Scholar
 Brouyere S, Carabin G, Dassargues A (2004) Climate change impacts on groundwater reserves: modeled deficits in a chalky aquifer. Geer Basin, Belgium. Hydrogeol J 12(2):123–134. https://doi.org/10.1007/s1004000302931 CrossRefGoogle Scholar
 Brunner P, Kinzelbach W (2005) Groundwater modeling in remote Chinese Basin—how can models be improved in areas where data are scarce? European Geosciences UnionGoogle Scholar
 Coppola EA, Mclane CF, Poulton MM, Szidarovszky F, Magelky RD (2005) Predicting conductance due to upcoming using neural networks. J Groundw 43(6):827–836. https://doi.org/10.1111/j.17456584.2005.00092 CrossRefGoogle Scholar
 Dimitrov D, Machkova M, Damyanov G (1997) On the karst spring discharge forecasting by means of stochastic modeling. In: Günay G, Johnson AI (eds) Karst waters and environmental impacts (Proceedings of international symposium Antalya, Turkey, 1995). Balkema, Rotterdam, pp 353–359Google Scholar
 Gholami V, Azodi M, Salimi Taghvaye (2008) Modeling of karst and alluvial springs discharge in the central Alborz highlands and on the Caspian Southern coasts. Casp J Environ Sci 6(1):41–45Google Scholar
 Gholami V, Booij MJ, Tehrani EN, Hadian MA (2018) Spatial soil erosion estimation using an artificial neural network (ANN) and field plot data. CATENA 163:210–218CrossRefGoogle Scholar
 Giustolisi O, Laucelli D (2005) Improving generalization of artificial neural networks in rainfall–runoff modeling. Hydrol Sci J 50(3):439–457. https://doi.org/10.1623/hysj.50.3.439.65025 CrossRefGoogle Scholar
 Gualbert H, Essink O (2001) Fresh groundwater supplyproblems and solutions center of hydrology (ICHU). Institute of Earth Science, Ocean & Coastal ManagementGoogle Scholar
 Hall N, Greiner R, Yangvanit S (2001) Modeling salinity management at from and catchment level in NSW and Thailand and Modsim 2001. Australian National University, CanberraGoogle Scholar
 Hsu K, Gupta HV, Sorooshian GS, Imam B (2002) Selforganizing linear output map (SOLO): an artificial neural network suitable for hydrologic modeling and analysis. J Hydrol 38(12):1–17. https://doi.org/10.1029/2001WR000795 CrossRefGoogle Scholar
 Ioannis N, Daliakopoulosa P, Coulibalya I, Tsanis K (2005) Groundwater level forecasting using artificial neural networks. J Hydrol 309:229–240. https://doi.org/10.1016/j.jhydrol.2004.12.001 CrossRefGoogle Scholar
 Jang CH (2015) Probabilitybased classifications for spatially characterizing the water temperatures and discharge rates of hot springs in the Tatun Volcanic Region, Taiwan. Environ Monit Assess 187(5):297. https://doi.org/10.1007/s1066101545208 CrossRefGoogle Scholar
 Khaleghi MR, Varvani J (2018a) Simulation of the relationship between river discharge and sediment yield in the semiarid river watersheds. Acta Geophys 66:109–119. https://doi.org/10.1007/s1160001801109 CrossRefGoogle Scholar
 Khaleghi MR, Varvani J (2018b) Sediment rating curve parameters relationship with Watershed, characteristics in the Semiarid River Watersheds. J Arab J Sci Eng. https://doi.org/10.1007/s1336901830927 CrossRefGoogle Scholar
 Krishna B, Satyajit Rao YR, Vijaya T (2008) Modeling groundwater levels in an urban coastal aquifer using artificial neural networks. Hydrol Process 22:1180–1188. https://doi.org/10.1002/hyp.6686 CrossRefGoogle Scholar
 Lallahem S, Mania J, Hani A, Najjar Y (2005) On the use of neural networks to evaluate groundwater levels in fractured media. J Hydrol 307:92–111. https://doi.org/10.1016/j.jhydrol.2004.10.005 CrossRefGoogle Scholar
 Maier HR, Dandy GC (2000) Neural networks for prediction and forecasting of water resources variables: a review of modeling issues and applications. Environ Model Softw 15:101–123CrossRefGoogle Scholar
 Mazvimavi D, Maijerink AM, Savenije HH, Stein A (2005) Prediction of flow characteristics using multiple regression and neural networks: a case study in Zimbabwe. Phys Chem Earth 30:639–647. https://doi.org/10.1016/j.pce.2005.08.003 CrossRefGoogle Scholar
 Nichols DS, Verry ES (2001) Streamflow and groundwater recharge from the small forested watershed in northcentral Minnesota. J Hydrol 89–103. PII: soo221694(01)003377Google Scholar
 Prohaska S, Stevanovic Z (1993) The development of the autocrossregression model for karst spring flow simulation. Theor Appl Karstol 6:151–155Google Scholar
 Riad S, Mania J, Bouchaou L, Najjar Y (2004) Predicting catchment flow in a semiarid region via an artificial neural network technique. Hydrol Proc J 18(13):2387–2393. https://doi.org/10.1002/hyp.1469 CrossRefGoogle Scholar
 Salehi F, Prasher SO, Amin S, Madani A, Jebelli SJ, Ramaswamy HS, Drury CT (2000) Prediction of annual nitrateN losses in drain outflows with artificial neural networks. ASAE 43(5):1137–1143CrossRefGoogle Scholar
 Samani N, GohariMoghadam M, Safavi A (2007) A simple neural network model for the determination of aquifer parameters. J Hydrol 340:1–11. https://doi.org/10.1016/j.jhydrol.2007.03.017 CrossRefGoogle Scholar
 William BW (2003) Conceptual models for karstic aquifers. Republished by permission from. In: Palmer AN, Palmer MV, Sasowsky ID (eds) Karst modeling: special publication 5. The Karst Waters Institute, Charles Town, pp 11–16Google Scholar
 Worthington SR (1991) Karst hydrogeology of the Canadian Rocky Mountains. PhD thesis, McMaster University, Hamilton, p 227Google Scholar
 Yasmin R (2008) Groundwater modeling of the northeastern part of barind tract its sustainable development and management, Bangladesh. Asian J Inf Technol 7(5):218–225Google Scholar
 Zhang M (2001) Informationstatistics evaluation of the effects of groundwater buried depth to upper soil and groundwater salinity. China Postdoctoral Preceding Science Press, Beijing, pp 221–224Google Scholar
 Zhang YK, Bai EW, Libra R, Rowden R, Huaibai L (1996) Simulation of spring discharge from a limestone aquifer in Iowa, USA. USA Hydrogeol J 4(1):41–54. https://doi.org/10.1007/s100400050087 CrossRefGoogle Scholar
Copyright information
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.