1 Introduction

The swelling of anhydrite-bearing clay rocks, also known as clay-sulfate rocks, has caused large problems in many geotechnical projects since more than 140 years ago with the construction of the Schanz railway tunnel in southern Germany (Schädlich et al. 2013). Despite advances in engineering methods and practical experiences gained during past decades, construction, e.g., tunneling, in swelling rocks is yet a challenging task because the accurate prediction of swelling pressures and deformations of clay-sulfate rocks is extremely difficult. Furthermore, ground heave, once the swelling is triggered by water inflow, may continue to develop for many years and it is practically impossible to stop the swelling (Schweizer et al. 2019).

The most remarkable problems with the swelling of clay-sulfate rocks are encountered during the construction of railway and road tunnels and viaducts. For example, some of the tunneling projects in Baden-Württemberg in Germany and Jura Mountains in Switzerland reported severe heave resulting from the uplift of materials below the tunnel floor (Anagnostou et al. 2010; Madsen and Müller-Vonmoos 1989; Wittke 2004; Wittke et al. 2007). In Spain, tunnels, e.g., Camp Magré, Lilla, and Puig Cabrer, constructed to provide a high-speed railway from Madrid to Barcelona, were affected by extreme sulfate-related heave (Alonso et al. 2013; Ramon et al. 2017).

A large-scale heave, causing catastrophic damage to buildings and roads, has taken place in the historic city of Staufen in southwest Germany, where geothermal drilling disturbed the hydrological system in the subsurface, allowing penetration of water into the clay-sulfate bearing formation of the Triassic Gipskeuper formation (“Gypsum Keuper”). Expansive hydration of the mineral anhydrite (CaSO4) led to substantial uplift of the ground and subsequently caused severe damages to infrastructures (Butscher et al. 2016, 2018; Jarzyna et al. 2021). Although the undertaken mitigation measures successfully decreased the uplift rate at the ground surface, they failed to completely stop the swelling process (Schweizer et al. 2019).

Swelling of clay-sulfate rocks may occur by clay swelling through osmotic water flow between surfaces of neighboring clay minerals, and sulfate swelling through the chemical transformation of anhydrite (CaSO4) into gypsum (CaSO4∙2H2O), which is the key process controlling the swelling phenomena (Butscher et al. 2016). Anhydrite dissolves in groundwater depending on temperature and ionic composition until the pore water becomes supersaturated with respect to gypsum. The dissolution of anhydrite then becomes accompanied by the crystallization of gypsum in particular when open spaces, e.g., fractures, are available. Gypsum crystals nucleate on existing gypsum crystals and surfaces of clay minerals. The transformation of anhydrite into gypsum proceeds until all the anhydrite present in the rock is converted to gypsum. This process (gypsification of anhydrite) is accompanied by about 61% increase in volume (Butscher et al. 2011, 2017; Jarzyna et al. 2021).

Since clay-sulfate rock swelling includes groundwater flow, mechanical swelling and chemical reaction, as well as their interaction, numerical models simulating the swelling should consider coupled thermo-hydro-mechanical-chemical processes (Butscher et al. 2016; Ramon et al. 2017). The calculation of chemical processes is usually performed by solving thermodynamic equations, which contributes a significant portion of the overall computational cost because compositions of liquid and mineral phases must be estimated at each iteration (e.g., Schweizer et al. 2018). Therefore, due to the high computational time, an efficient application of complex thermodynamic models, e.g., the Pitzer interaction model, in numerical simulation of coupled multi-physics processes in the subsurface environment is laborious (Hassanpouryouzband et al. 2021; Taherdangkoo et al. 2021).

As an alternative to complex thermodynamic models, artificial intelligence (AI) can be applied to determine the solubility of anhydrite and gypsum in the subsurface environment (Zarei et al. 2021). AI models do not require any pre-assumptions between input and output variables and are capable of handling complex nonlinear input–output relationships. However, AIs are subjected to some shortcomings, for instance, the stagnation in local minima during the training process and/or overfitting, which causes depictive outcomes. Optimization algorithms such as Bayesian Regularization have been proposed to overcome these issues and thus improve the performance of AI models (Benamara et al. 2020; Hassanpouryouzband et al. 2018; Taherdangkoo et al. 2020).

The primary goal of this study was to develop a robust neural network model able to determine the solubility of two calcium sulfates, gypsum and anhydrite, in aqueous solutions for further application in higher level numerical modeling. We used feed-forward and cascade-forward networks trained with a Bayesian regularization algorithm to perform the regression task. The network models were developed using an experimental dataset covering a wide range of conditions. A comparison analysis was designed to evaluate the performance of the most accurate network model with experimental data reported by Li and Demopoulos (2005, 2006), Marshall et al. (1964) and Marshall and Slusher (1966) as well as the Pitzer ion interaction model and two available data-driven models: the CMIS model (Rahmati et al. 2019) and the CSA-LSSVM model (Zarei et al. 2021).

2 Materials and Methods

2.1 Network Models

2.1.1 Feed-Forward Neural Network

Feed-forward neural network (FFNN) is a widely used neural network architecture. In a FFNN, information is processed in one direction, from input neurons, then through hidden neurons, and to output neurons. Therefore, the connections between the neurons do not form a cycle (Benamara et al. 2020; Svozil et al. 1997). The FFNNs are mainly employed for supervised learning tasks in cases where the input data are neither sequential nor time-dependent (Benardos and Vosniakos 2007; Lavine and Blank 2009).

2.1.2 Cascade-Forward Neural Network

Cascade-forward neural network (CFNN) is a specific type of ANNs recognized by its cascaded structure. Similar to the FFNN, the CFNN covers three categories of layers, namely input, hidden, and output layers (Nandagopal et al. 2017). However, the main specificity of CFNN is the cascade shape considered for linking neurons of different layers. This shape allows creating junctions between neurons of a given hidden layer with all the previous layers. This topology brings supplementary flexibility for dealing with complicated systems having larger numbers of input data. Besides, CFNNs have high degrees of generalization (Zeng et al. 2021; Zimmermann and Mattedi 2020).

2.1.3 Bayesian Regularization

Bayesian Regularization (BR) is a robust algorithm for optimizing weights and bias of various ANN types. BR can be considered as an improved version of the Levenberg–Marquardt algorithm as it involves the main principle of the latter with some modifications in the calculation procedure (Burden and Winkler 2009; MacKay 1992). In this context, the objective function aimed to be minimized is formulated to a weighted summation of squared network weights and squared errors. The coefficients of this function are determined by means of Bayes’ theorem (Burden and Winkler 2009; Foresee and Hagan 1997). The BR algorithm reduces the potential for overfitting while eliminating the need for the validation phase. Therefore, BR is especially suitable for small datasets (Taherdangkoo et al., 2020).

2.2 Existing Data-Driven Models

Rahmati et al. (2019) developed a committee machine intelligent system (CMIS) model to determine the solubility of gypsum in strong electrolyte solutions at temperatures ranging from 278.15 to 371.15 K. The CMIS model exhibited a reasonable predictive ability, showing good accordance with the experimental data. However, the CMIS model suffers from two deficiencies: (1) the model can only be applied to estimate the gypsum solubility, and (2) it is applicable for temperatures up to 371.15 K.

Recently, Zarei et al. (2021) developed a least-squares support vector machine tuned with coupled simulated annealing (CSA-LSSVM) to determine the solubility of gypsum, hemihydrate, and anhydrite in acid and salt-containing solutions at temperatures up to 373.15, 398.15, and 623.15 K, respectively. The model-associated deficiencies are (1) gypsum solubility in aqueous solutions can only be determined at temperatures up to 373.15 K, and therefore the applicability of the model in the geothermal field is limited. (2) the model fails to determine the solubility of calcium sulfates under some circumstances, for instance, it cannot accurately determine the solubility of gypsum in the system HCl + MgCl2 + CaCl2 + CaSO4 + H2O (Zarei et al. 2021).

2.3 Pitzer Model

The Pitzer ion interaction model has been widely used to describe the thermodynamic behavior of mixed electrolyte aqueous solutions. The Pitzer model is based on Debye–Hückel’s electrostatic theory for the long-range interactions and composition for the ion-specific short-range interactions in a solution. Harvie and Weare (1980) and Harvie et al. (1984) modified the Pitzer model by introducing asymmetrical electrostatic mixing terms into the model to improve the fit in multi-component systems. The mathematical expression and internal parameters of the model can be found in the relevant literature (Pitzer 1973; Pitzer and Mayorga 1973, 1974). The solubility products (Ksp) of the solid phases for calcium sulfates can be expressed as follows:

$${K}_{\mathrm{sp}}={m}_{\mathrm{Ca}}{\gamma }_{\mathrm{Ca}}{m}_{{\mathrm{SO}}_{4}}{\gamma }_{{\mathrm{SO}}_{4}},$$
(1)

where γ and m are the activity coefficients of different species and molality, respectively. aw is the water activity. The activity coefficients of calcium and sulfate ions contributing to gypsum precipitation are calculated by the Pitzer model as follows:

$$ln{\gamma }_{M}={Z}_{M}^{2}{F}^{\delta }+{\sum }_{a}{m}_{a}\left(2{B}_{M,a}+{ZC}_{M,a}\right)+{\sum }_{c}{m}_{c}\left(2{\Phi }_{M,c}+{\sum }_{a}{m}_{a}{\psi }_{M,c,a}\right)+{\sum }_{a}{\sum }_{{a}^{^{\prime}}}{m}_{a}{m}_{{a}^{^{\prime}}}{\psi }_{M,a,{a}^{^{\prime}}}+\left|{Z}_{M}\right|{\sum }_{c}{\sum }_{a}{m}_{c}{m}_{a}{C}_{c,a},$$
(2)
$$ln{\gamma }_{X}={Z}_{X}^{2}{F}^{\delta }+{\sum }_{c}{m}_{c}\left(2{B}_{X,c}+{ZC}_{X,c}\right)+{\sum }_{a}{m}_{a}\left(2{\Phi }_{X,a}+{\sum }_{c}{m}_{c}{\psi }_{X,c,a}\right)+{\sum }_{c}{\sum }_{{c}^{^{\prime}}}{m}_{c}{m}_{{c}^{^{\prime}}}{\psi }_{X,c,{c}^{^{\prime}}}+\left|{Z}_{X}\right|{\sum }_{c}{\sum }_{a}{m}_{c}{m}_{a}{C}_{c,a},$$
(3)
$$ln{a}_{w}=-\varphi {M}_{w}{\sum }_{i}{m}_{i},$$
(4)
$$\left(\varphi -1\right){\sum }_{i}{M}_{i}=\frac{-2{A}_{\varphi }{I}^{2/3}}{1+b{I}^{1/2}}+{\sum }_{c}{\sum }_{a}{m}_{c}{m}_{a}\left({B}_{c,a}^{\varphi }+Z{C}_{c,a}\right)+{\sum }_{c>{c}^{^{\prime}}}{\sum }_{{c}^{^{\prime}}}{m}_{c}{m}_{{c}^{^{\prime}}}\left({\varphi }_{c,{c}^{^{\prime}}}+{\sum }_{a}{m}_{a}{\psi }_{c,{c}^{^{\prime}},a}\right)+{\sum }_{a>{a}^{^{\prime}}}{\sum }_{{a}^{^{\prime}}}{m}_{a}{m}_{{a}^{^{\prime}}}\left({\varphi }_{a,{a}^{^{\prime}}}+{\sum }_{c}{m}_{c}{\psi }_{c,{a}^{^{\prime}},a}\right),$$
(5)

where M and X are Ca2+ and SO42−, respectively. m is the molality, and I is the ionic strength, a and a' are anionic species, c and c' are cationic species, and Z equals \({\sum }_{i}{m}_{i}\left|{Z}_{i}\right|\). B and C represent the binary interaction parameters between ions of different signs. \(\Phi\) is the interaction between ions of the same sign, and \({\psi }_{ijk}\) is the ternary interaction parameter. Mw is the water molecular weight, and \(\varphi\) is the osmotic coefficient.

2.4 Data Acquisition

Experimental data of two calcium sulfates; gypsum (CaSO4·2H2O) and anhydrite (CaSO4) in aqueous electrolyte solutions were collected from the literature. The compiled solubility data were mostly reported in molality; therefore, the remaining data were converted to molality using appropriate conversion factors. The dataset contains a total of 1912 experimental data points (Azimi and Papangelakis 2010; Barba et al. 1984; Block and Waters 1968; Blount and Dickson 1973; Bock 1961; Furby et al. 1968; Hill and Wills 1938; Innorta et al. 1980; Kleinert and Wurm 1952; Kumar et al. 2005, 2004, 2007, 2010; Li and Demopoulos 2005, 2006; Ling and Demopoulos 2004; Madgin and Swales 1956; Marshall and Slusher 1966; Marshall et al. 1964; Ostroff and Metler 1966; Scheuermann et al. 2019; Wang et al. 2013; Wu et al. 2010; Yeatts and Marshall 1972; Yuan et al. 2010).

The dataset covers gypsum solubility in aqueous solutions containing NaCl, CaCl2, MgCl2, KCl, FeCl2, FeCl3, Na2SO4, HCl, AlCl3, MgSO4, H2SO4, and NaOH at temperatures ranging from 273.65 to 383.15 K, and anhydrite solubility in solutions containing H2SO4, NaCl, CaCl2, MgCl2, Na2SO4, and HCl at temperatures from 298 to 473.15 K. The statistical analysis and distribution of input and output data are summarized in Table 1 and Fig. 1, respectively. The calcium sulfate solubility (molality) is a function of temperature (K), and ionic composition of different salts (molality):

Table 1 Statistical evaluation of the input and output parameter values based on 1912 entries
Fig. 1
figure 1

Distribution of input and output parameter values in terms of histogram plots (colour figure online)

$${C}_{{\mathrm{CaSO}}_{4}}=f\left(T,{C}_{\mathrm{NaCl}},{C}_{\mathrm{KCl}},{C}_{{\mathrm{CaCl}}_{2}},{C}_{{\mathrm{MgCl}}_{2}},{C}_{{\mathrm{FeCl}}_{2}},{C}_{{\mathrm{FeCl}}_{3}},{C}_{{\mathrm{Na}}_{2}{\mathrm{SO}}_{4}},{C}_{{\mathrm{H}}_{2}{\mathrm{SO}}_{4}},{C}_{\mathrm{HCl}},{C}_{{\mathrm{AlCl}}_{3}},{C}_{{\mathrm{MgSO}}_{4}},{C}_{\mathrm{NaOH}}\right).$$
(6)

3 Results and Interpretation

3.1 Network Models

We used feed-forward and cascade-forward neural networks to determine calcium sulfate solubilities. Each model has one hidden layer having different numbers of neurons. Following the ratio of 80:20, the input dataset was randomly divided into training, and test sets. A sample of 1532 data was specified for the training phase, and the remaining data were used to analyze the reliability and robustness of the network models. The model performance was evaluated based on the mean squared error (MSE). The optimum network has the lowest MSE value without any noticeable difference between MSE values of training and test datasets, the opposite of which is an overfitting sign. The FFNN network trained with the BR algorithm, FFNN-BR model, has 26 neurons in the hidden layer, while the CFNN-BR has 12 neurons in the hidden layer.

Regression plots of predicted solubility values by FFNN-BR and CFNN-BR models versus measured solubility values depicted in Figs. 2 and 3 show that both models have yielded an acceptable accuracy, indicated by an accumulation of the data points close to the unit slope line. The statistical analysis listed in Table 2 shows that the FFNN-BR model has better performance compared to the CFNN-BR model. The FFNN-BR model has the MSE and R2 values of 5.67 × 10−5 and 0.97, respectively.

Fig. 2
figure 2

Regression plots of the FFNN-BR model predicted values versus experimental values (colour figure online)

Fig. 3
figure 3

Regression plots of the CFNN-BR model predicted values versus experimental values (colour figure online)

Table 2 Statistical parameters of two network models to determine calcium sulfate solubility

The quality of the FFNN-BR predictions is further illustrated in Fig. 4 with experimental and predicted values covering each other, indicating a prediction with the minimum error. The outputs of this model show a good agreement with the experimental values. The histogram plot of standardized residuals of the FFNN-BR model is depicted in Fig. 5. The residuals are mostly distributed near zero, indicating the outstanding performance of the model. Our statistical analysis indicates that the developed FFNN-BR is able to accurately determine solubility of gypsum and anhydrite in acid and salt-containing solutions in the ranges of temperature for which the model was trained.

Fig. 4
figure 4

Comparison between calculated and measured values of calcium sulfate solubility for the FFNN-BR model (colour figure online)

Fig. 5
figure 5

Histogram of standardized residuals between the FFNN-BR predictions and experimental data for the train and test datasets (colour figure online)

3.2 Comparison Analysis

The predictions of the FFNN-BR model were compared with the experimental values of gypsum and anhydrite solubility in binary, ternary, and multi-component acid and salt-containing solutions reported by Li and Demopoulos (2005, 2006). The FFNN-BR model outputs were further compared with the Pitzer model as well as previously published data-driven models, including the CMIS model (Rahmati et al. 2019) and the CSA-LSSVM model (Zarei et al. 2021). Finally, the FFNN-BR was applied to determine experimental values of gypsum and anhydrite in aqueous sodium chloride solutions reported by Marshall et al. (1964) and Marshall and Slusher (1966). The designed comparison analysis provides an in-depth understanding on the behavior of the FFNN-BR model, and its overall performance to determine calcium sulfate hydrates solubilities in aquatic solutions.

Solubility of gypsum in HCl + CaCl2 + CaSO4 + H2O ternary system at three different temperatures, i.e., 295, 313, and 333 K, was measured by Li and Demopoulos (2006). The concentration of HCl is approximately 3 m. Figure 6 compares the experimental values with the predicted ones obtained from various models. The FFNN-BR outperforms the other models to determine gypsum solubility at 295 and 313 K. The results obtained from Pitzer and CMIS models are satisfactory. The CSA-LSSVM provides the least accurate predictions under these conditions. The Pitzer model is the most accurate to determine the gypsum solubility in this ternary system at 333 K. The FFNN-BR predictions are sufficiently accurate, following the expected trend.

Fig. 6
figure 6

Comparison of experimentally determined solubility of calcium sulfate as gypsum in the system HCl (3 mol/kg) + CaCl2 + CaSO4 + H2O at (a) 295, (b) 313, and (c) 333 K (Li and Demopoulos 2006) with the calculated values from Pitzer ion interaction and the FFNN-BR model of this study; and previously existing data-driven models CMIS (Rahmati et al. 2019) and CSA-LSSVM (Zarei et al. 2021) (colour figure online)

Solubility of gypsum in HCl + MgCl2 + CaSO4 + H2O ternary system at temperatures of 298, 323, and 353 K measured by Li and Demopoulos (2006) is compared with the predicted values, shown in Fig. 7. The concentration of HCl is approximately 3 m. The FFNN-BR and CMIS models are the most accurate in determining the solubility at 298 K. The CSA-LSSVM provides satisfactory performance. The FFNN-BR outperforms the other models at 323 K, while the CSA-LSSVM performs slightly better at 353 K. The Pitzer model shows the worst accuracy at predicting gypsum solubility in HCl + MgCl2 + CaSO4 + H2O ternary system, in particular at temperatures of 298 and 353 K.

Fig. 7
figure 7

Comparison of experimentally determined solubility of calcium sulfate as gypsum in the system HCl (3 mol/kg) + MgCl2 + CaSO4 + H2O at (a) 298, (b) 323, and (c) 353 K (Li and Demopoulos 2006) with the calculated values from Pitzer ion interaction and the FFNN-BR model of this study; and previously existing data-driven models CMIS (Rahmati et al. 2019) and CSA-LSSVM (Zarei et al. 2021)  (colour figure online)

The FFNN-BR model is the most accurate model predicting gypsum solubility in the multi-component system of HCl + MgCl2 + CaCl2 + CaSO4 + H2O at 323 and 333 K (Fig. 8). The experimental data were measured by Li and Demopoulos (2006). The analysis shows that both Pitzer and CMIS models are sufficiently accurate. The CSA-LSSVM predictions deviate significantly from the experimental values, and thus the model predictions are not satisfactory.

Fig. 8
figure 8

Comparison of experimentally determined solubility of calcium sulfate as gypsum in the system HCl + MgCl2 (1 mol/kg) + CaCl2 + CaSO4 + H2O at (a) 323, and (b) 333 K (Li and Demopoulos 2006) with the calculated values from Pitzer ion interaction and the FFNN-BR model of this study; and previously existing data-driven models CMIS (Rahmati et al. 2019) and CSA-LSSVM (Zarei et al. 2021)  (colour figure online)

The solubility of anhydrite in HCl + CaCl2 + CaSO4 + H2O system at temperatures of 298 and 313 K is measured by Li and Demopoulos (2005). Figure 9 shows that the Pitzer model is the most accurate model to determine the solubility values at 298 K. While the FFNN-BR model is highly accurate to determine the anhydrite solubilities at low CaCl2 concentrations (mol/kg ≤ 0.5), the model’s performance is lower at CaCl2 concentrations equal or above 1 m. Both FFNN-BR and CSA-LSSVM follow the trend observed in the experimental data. The FFNN-BR is the most accurate model to determine the solubility values at 313 K. The CSA-LSSVM predicted values are more accurate at 313 K as compared to the case of 298 K.

Fig. 9
figure 9

Comparison of experimentally determined solubility of calcium sulfate as anhydrite in the system HCl (3 mol/kg) + CaCl2 + CaSO4 + H2O system at (a) 298, and (b) 313 K (Li and Demopoulos 2005) with the calculated values from Pitzer ion interaction and the FFNN-BR model of this study; and previously existing data-driven model CSA-LSSVM (Zarei et al. 2021)  (colour figure online)

The CSA-LSSVM is the most accurate model to determine the anhydrite solubility in HCl + CaSO4 + H2O system at the temperature of 298 and 313 K reported by Li and Demopoulos (2005) (Fig. 10). The Pitzer model performs better at 313 K compared to 298 K. Although the performance of the FFNN-BR model is lower than the other two models, the predicted values are sufficiently accurate. The FFNN-BR model performs better at low HCl concentrations.

Fig. 10
figure 10

Comparison of experimentally determined solubility of calcium sulfate as anhydrite in the system HCl + CaSO4 + H2O system at (a) 298, and (b) 313 K (Li and Demopoulos 2005) with the calculated values from Pitzer ion interaction and the FFNN-BR model of this study; and previously existing data-driven model CSA-LSSVM (Zarei et al. 2021)  (colour figure online)

The FFNN-BR model exhibits excellent performance in predicting the solubility of anhydrite and gypsum in NaCl + CaSO4 + H2O system at 313.15, 353,15, and 448.15 K measured by Marshall et al. (1964) and Marshall and Slusher (1966) (Fig. 11). The model follows the expected trend observed in the experiments. For instance, the solubility of gypsum in aqueous NaCl solutions increases with the increase of NaCl concentrations and then experiences a decline (Fig. 11a). This behavior has been accurately captured by the FFNN-BR model.

Fig. 11
figure 11

Comparison of experimentally determined solubility of calcium sulfate as anhydrite in the system NaCl + CaSO4 + H2O system at (a) 333.15 and 448.15 K (Marshall et al. 1964) and (b) 353.15 K (Marshall and Slusher 1966) with the calculated values from the FFNN-BR model  (colour figure online)

4 Discussion

We follow from the presented results that the FFNN-BR model is able to determine the solubility of the calcium sulfates, gypsum and anhydrite, in complex acidic and electrolyte solutions with sufficient accuracy. The approach is, therefore, well suited for geological applications. In general, the model provides more accurate results as compared to the Pitzer model. In most cases studied, the FFNN-BR outperforms the CSA-LSSVM model, which is the only data-driven model that predicts the solubility of both gypsum and anhydrite in aqueous solutions. Similarly, in most cases, the predictions of the FFNN-BR are more accurate than the CMIS model, which is applicable for predicting the gypsum solubility. The FFNN-BR model is an accurate alternative to determine solubility of gypsum and anhydrite in mixed electrolyte solutions.

The FFNN-BR model can be further implemented in numerical modeling frameworks to address geotechnical problems such as swelling of clay-sulfate rocks. Numerical models that simulate rock swelling are usually coupled with a geochemical reaction model, e.g., PHREEQC (Charlton and Parkhurst 2011), to simulate the chemical processes. The FFNN-BR model is shown to be an effective tool to calculate solubility of calcium sulfates in mixed aquatic systems and is a promising alternative to coupling hydro-mechanical swelling models with geochemical reaction models such as PHREEQC. The FFNN-BR is more effective as compared to the Pitzer model, and has a large potential to reduce the overall computation run time required for modeling of complex THMC processes in the subsurface environment.

The FFNN-BR model should be applied under the ranges of conditions for which the model was trained. The application of the model outside of this specific domain is subjected to the modeler’s judgment to decide whether the prediction deviations are acceptable or not. Future studies may employ other data-driven models to predict solubility of calcium sulfate in aquatic systems. Other optimization algorithms such as metaheuristics algorithms can be tested to improve the performance of feed-forward neural network models. An accurate and sufficiently large dataset plays an important role in the development of data-driven models. Although a sufficiently large dataset is employed to develop the network models, future studies may enlarge the dataset to improve the accuracy and generalization ability of the model.

5 Summary and Conclusions

Modeling swelling behaviors of clay-sulfate rocks is complex as it involves complicated coupled thermo-hydro-mechanical-chemical processes. An accurate calcium sulfate solubility prediction framework is required for the numerical modeling of the chemical processes. This research employs feed-forward and cascade-forward neural network algorithms trained with Bayesian regularization to determine solubility of gypsum and anhydrite in aqueous electrolyte solutions. The compiled dataset includes experimental data covering a widespread range of temperature and salt concentrations. The FFNN-BR model is identified as the most accurate model for the task of calcium sulfate solubility prediction, yielding a R2 value of 0.966 and a MSE value of 0.0075. The FFNN-BR model presents an excellent performance in determining gypsum solubility. Although the model determines the anhydrite solubility with a slightly lower accuracy, the obtained values are sufficiently accurate. In general, the FFNN-BR model performs better compared to the Pitzer model and the other examined data-driven models. We conclude that the FFNN-BR model is a well-suited and robust tool which can be regarded as an alternative to more classical approaches for calcium sulfate solubility calculations. The proposed model can be further implemented in numerical modeling frameworks to effectively tackle complicated geotechnical problems, such as coupled THMC modeling of swelling processes of clay-sulfate rocks. This would reduce the required computation time and make numerical simulations more effective.