Impact of waste foundry sand on drainage behavior of sandy soil: an experimental and machine learning study

Kumar, Ankit; Parihar, Aditya

doi:10.1007/s43503-023-00019-x

Impact of waste foundry sand on drainage behavior of sandy soil: an experimental and machine learning study

Original Article
Open access
Published: 02 January 2024

Volume 3, article number 1, (2024)
Cite this article

Download PDF

You have full access to this open access article

AI in Civil Engineering Aims and scope Submit manuscript

Impact of waste foundry sand on drainage behavior of sandy soil: an experimental and machine learning study

Download PDF

1426 Accesses
Explore all metrics

Abstract

The study of drainage behavior is essential for using waste material in geotechnical applications. In this study, sandy soil was replaced with waste foundry sand (WFS) at an incremental interval of 20% by weight. Permeability (k) for each mix was acquired at three relative densities (R_D), i.e., 65%, 75% and 85%, by using the constant head method. Then the results were further processed with machine learning (ML) models to validate the experimental data. The experimental study demonstrated that k would decrease with the increase in relative density and WFS content. A rise in R_D from 65% to 85% resulted in a substantial reduction of up to 140% in the value of k. Moreover, the complete replacement of sand with WFS reduced the value of k by 36%, 51% and 57% for R_D of 65%, 75% and 85%, respectively. The total dataset of 90 observations was divided at a ratio of 63/13/15 into training/validation/testing datasets for ML-AI modeling. Input variables include percentage of sand (BS), replacement with WFS, total head (H), time interval (t) and outflow (Q); and k is the output variable. The methods of artificial neural network (ANN), random forest (RF), decision tree (DT) and multi-linear regression (MLR) are used for k prediction. It is found that the random forest approach performed outstandingly in these methods, with an R² value of 0.9955. The performance of all the proposed methods was compared and verified with Taylor's diagram. Sensitivity analysis showed that Q and R_D were the most influential parameters for predicting k values.

Sensitivity analysis and prediction of erodibility of treated unsaturated soil modified with nanostructured fines of quarry dust using novel artificial neural network

Article 07 July 2021

Geocell Mattress Reinforcement for Bottom Ash: A Comprehensive Study of Load-Settlement Characteristics

Article 28 July 2023

Predictive Models for Estimating the Coefficient of Permeability for Sands

Find the latest articles, discoveries, and news in related topics.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

As a property of soils, permeability measures the speed of water percolation. Low permeability (k-value) means the possible generation of excess pore water pressure and secondary consolidation (Lin et al., 2018; Zhang et al., 2021). The measurements of permeability are essential for geotechnical projects related to groundwater tables or precipitation water. The k-value of soil depends upon many factors, including: grain size properties, void ratio, fine content, over-consolidation ratio, drainage type, and density of soil or impurities, if any (Cashman & Preene, 2020; Smith, 2014).

The k-value can be identified in direct and indirect methods. For direct methods, tests are carried out on soil samples. In contrast, for indirect methods, the k-value is calculated from empirical formulas based on grain size properties and void ratio (Nagy et al., 2013; Osterhout, 1922). Some direct and indirect methods are listed in Table 1. It is not convenient to perform permeability tests on every soil sample. So, modeling is often used to provide a rough estimate of the actual k-value. The k-value exhibits up to 10 orders of magnitude, ranging from a coarse feature to a very fine feature.

Table 1 Direct and indirect methods for measuring the k-value

Full size table

Proposed earlier based on some assumptions, indirect methods are suitable for different conventional soils. However, for soils incorporating waste materials, the values of constants would change significantly. So, if various waste materials or composites are present, new equations must be developed based on experimental data. This gap can be bridged by developing certain models with artificial intelligence techniques (Baghbani et al., 2022; Shahin, 2013).

Sand is a kind of natural materials used in the majority of construction projects. In India, the expected demand for sand is 700 MT (in 2017), with an annual growth rate of 6%–7%. Mining has led to a 90% decline in sediment levels in key Asian rivers, putting local communities into risks of flooding, land loss, contaminated drinking water, and crop devastation (Ministry of Mines, 2018). India produces more than 3 million tons of WFS annually. The scarcity of natural sand drives practitioners to replace them with waste and by-products from industries. The replacement materials should meet the strength and other requirements specified in construction laws and codes. One such material which has been used in construction over recent decades is WFS. Many researchers have reported the viability of WFS in various geotechnical applications (Gedik et al., 2018; Heidemann et al., 2021; Javed, 1995; Siddique & Singh, 2011; Sinha et al., 2020; Tittarelli, 2018; Winkler & Bolshakov, 2000), hydraulic or fluid barriers (Abichou et al., 1998, 2000, 2002), retaining-wall backfills (Lee et al., 2001), highway sub-bases (Guney et al., 2006; Javed & Lovell, 1994; Mast & Fox, 1998; Partridge et al., 1999) and ground improvement (Vipulanandan et al., 2000). For all such applications, knowledge on the drainage behaviour of WFS or WFS-incorporating sand is necessary. FHWA (2004) has reported a range of permeability 10^–3–10^–6 cm/s.

This study proposes a model to measure the permeability of soils incorporating a certain kind of industrial materials. Sandy soil has been mixed with waste foundry sand at different ratios to cover the range of replacement, and the hydraulic behaviour at different densities has been observed. Few existing research is available that considers the relative density as a governing parameter in indirect methods. The main objective of this study is to explore the drainage behaviour of WFS-incorporated sand in retaining-wall backfills and earthen dam applications.

Sustainable geomaterials are rapidly replacing conventional materials. Due to their accuracy, soft computing methodologies have gained pervasive traction over the preceding two to three decades. Many models have been trained for evaluating material properties based on known input parameters (Dalkilic et al., 2023; Khatti & Grover, 2023a, 2023b, 2023c, 2023d, 2023e; Kumar et al., 2023; Länsivaara et al., 2023; Rabbani et al., 2023). Machine learning modeling can be broadly classified as an approximation, classification and forecasting (Sarker, 2021). This study will adopt artificial neural networks, multi-linear regression, decision trees, and random forest techniques. Several past studies on modelling of k-value by using AI techniques are listed in Table 2. In most cases, the input variable is the grain size properties of soil. The performance of models is dependent upon available data sets, correlation between variables, and standard deviation in values.

Table 2 AI methods used in the past studies on k-value

Full size table

2 Materials used

2.1 Sand

Sandy soil samples acquired in Punjab, India is used in this study. The grain size properties are presented in Fig. 1 and Table 3. The type of gradation highly impacts the permeability of soil. The specific gravity of soil is determined as per ASTM D854 (2002), while GSD parameters are as per ASTM D422-63 (2007). The soil is classified as poorly graded sand.

Table 3 Grain size properties of the materials used

Full size table

2.2 Waste foundry sand

In this study, WFS is acquired from an iron foundry located in Ludhiana, India. The grain sizes are presented in Fig. 1. The WFS grain sizes' distribution curve is found similar to that of Ottawa sand at the F65 grade (Carey et al., 2020). The grains of WFS are smaller than those of sand, so the specific gravity is found to be smaller. The index and engineering properties for this type of WFS are reported by Kumar and Parihar (2022). WFS is classified as poorly graded sand as per the USCS system (ASTM D2487, 2006; Casagrande, 1948).

3 Research methodology

This study is divided into four phases: data generation, data preparation, modeling, and getting conclusions based on the best-performed model (Fig. 2). The data in this study are generated from a series of laboratory scale tests. The experimental output data are further investigated for outliers and multicollinearity. The phases involved will be discussed in detail in the following sections.

3.1 Lab experiments

To cover a wide range of replacements with WFS and to explore the relative density, 18 different compositions are considered (Table 4). At least five distinct readings of permeability for each composition are measured. The time during which a particular amount of water is drained from the samples is also considered as one of the governing parameters.

Table 4 Proportions of different composite soils

Full size table

Permeability is affected by the relative density of soil, as liquid would take more time to flow through denser media. Test for R_D is a preliminary step for sample preparation in the permeability experiment.

3.1.1 Relative density

The relative density of soil composites indicates the compactness of cohesionless materials. The test in this study is conducted as per ASTM D4254 (2000). To measure the minimum density (${\rho }_{min}),$ the soil is poured down by free, falling from 2–3 cm height in a relative density mold of 3000 cc, and the mass of soil in the mold is noted down. The maximum density (${\rho }_{max})$ is determined by vibrating the filled mold at a frequency of 3,600 vibrations/min under a 115 kg surcharge for 8 min. After the vibration and removal of the load, the settlement of the loading plate is measured, and the reduced volume is thereby determined. Since the mass remains constant, a reduction in volume in the latter case results in increased density.

The density for different relative densities is calculated by using Eqs. 2 and 3. The values of densities for different relative densities with variation in WFS replacement level are shown in Fig. 8.

$${\rho }_{\Delta }={\rho }_{max}-{\rho }_{min}$$

(1)

$${\rho }_{d}={{\rho }_{max}-(R}_{D}*{\rho }_{\Delta })$$

(2)

3.1.2 Permeability

The permeability of soil indicates the degree of easiness of water to flow through a porous medium. Permeability doesn't depend upon the density and viscosity of the flow-in materials, like hydraulic conductivity. This study measures permeability for all considered cases (Table 4) as per ASTM D2434 (2019). As the soil composites are granular, the constant head method is performed on all compositions. The total head is maintained at 122.5 cm. A Permeameter with a height of 12.73 cm, a diameter of 10 cm, two operating valves, and an air vent is used (Fig. 3). De-aired water is allowed to flow through the sample with two-way drainage. At least five readings of quantity outflow per unit of time for each sample are noted. An equation given by Darcy law (Eq. 3) is adopted for calculating the k-value in m/day.

$$k=\frac{Q\times L}{h\times A}$$

(3)

where Q is the amount of drained water per unit time; h is water head; A is the cross-sectional area of sample; and L is the length of the sample.

3.2 AI-approaches

R programming offers a rich ecosystem of packages, and is specifically designed for machine learning, so it is a versatile tool for data scientists and researchers. Its packages, such as "caret", "mlr" and "tidymodels", provide a wide range of tools and functions to streamline the entire machine-learning workflow. These packages offer well-documented and efficient solutions, ranging from data preprocessing, feature engineering and model selection to training, validation and evaluation. This study uses Rstudio (V 1.4.1564) and R programming platform (V 4.3.1) to access soft computing techniques.

For training the model, data are initially split into three sections: Training data, Validation data and Test data. At the beginning, classifier training is done by using a training data set, followed by using the validation data set to tune the parameters, so as to estimate the skill of the machine learning model on unseen data. In the final stage, the performance of the classifier is tested by using a test data set. According to a widely used thumb rule, the number of data points should be at least ten times the input parameters (Alwosheel et al., 2018; Haykin, 2009). The minimum number of data points required in this study is 70. To avoid overfitting of the model, k-fold cross-validation is considered, with the value of k as a five-seed value of 42 (Fushiki, 2011). The dataset for training and validation combined should be 85% of the total dataset, with the nearest multiple of 5. The total 90 data points in this study are divided in the ratio of 62/13/15 for training/validation/testing, respectively (Fig. 4). The k here represents the fold for cross-validation; it should not be mistaken with k, which represents the permeability.

3.2.1 Artificial neural networks (ANN)

This computing technique's working principle is inspired by the biological neural network of human brains. This method was first proposed by MaCulloch and Pitts (1943). A group of simulated neurons make an artificial neural network. Every neuron functions as a node linked to other nodes via connections that resemble biological axon-synapse-dendrite connections. A weight is assigned to each link to indicate how strongly one node will affect others (Winston, 1992). Because they can reproduce and model non-linear processes, artificial neural networks have been applied in many disciplines. In civil engineering, they are widely used for soft computing (Lazarevska et al., 2014; Xu et al., 2022; Yang et al., 2021). The input-hidden-output layer schematic is presented in Fig. 5. The hit and trial method is used to select the number of hidden layers, which is found to be optimum at 10. The hyperparameters are optimized for computation time and respective error (Table 5). The activation function chosen for the neurons in the hidden layers is Rectified Linear Unit (ReLU), which helps introduce non-linearity into the model.

Table 5 Hyperparameters for ANN model

Full size table

3.2.2 Multilinear regression (MLR)

This approach reveals linear relationships between independent (y) and dependent (x) variables. Since multiple-regression takes into account many explanatory variables, it extends the ordinary least-squares regression. The generalized relation is given in Eq. 3, where 'a' represents the intercept and ${\prime}\epsilon {\prime}$ represents the error. The coefficient b_n is determined by minimizing the sum of the square of residuals after the model is evaluated with statistical parameters.

$$y=a+{b}_{1}{x}_{1}+{b}_{2}{x}_{2}+\cdots \cdots \cdots \cdots \cdots +{b}_{n}{x}_{n}+\epsilon$$

(4)

$$k=a {(BS)}^{{b}_{1}} {(WFS)}^{{b}_{2}} {(RD)}^{{b}_{3}} {(Q)}^{{b}_{4}} {(T)}^{{b}_{5}}$$

(5)

3.2.3 Decision tree model (DT)

Decision tree is a non-parametric supervised learning method, which can be used for classification and regression. It is aimed at discovering simplistic decision rules derived from data features, so as to build a model that can predict the value of a variable (Fig. 6). This technique is widely used in civil engineering fields, where decisions are often made based on variables' upper or lower limits. For example, if the permeability value is less than 10^–6 cm/s, the soil will be classified as clay (Desai & Joshi, 2010; Singh et al., 2020). Table 6 outlines the hyperparameters and their respective approximate values used for tuning a decision tree classifier, where the criterion is Gini impurity.

Table 6 Hyperparameters for DT model

Full size table

3.2.4 Random forest (RF)

In 1995, the first random decision forest method was developed by Ho (1995). In the fitting process, errors are computed, and the importance of variables is measured. The relevance of variables in a regression or classification task can be ranked by using random forests. Variables that create high values for this score are given higher weightage than those that produce low values. This method solves the problem of overfitting, since the output is based on majority voting or averaging. This technique is widely used in geotechnical engineering to calculate engineering and index properties (Dutta et al., 2019; Rauter & Tschuchnigg, 2021).

In this method, the number of trees for the prediction of the k-value is optimized by using an error rate curve, as shown in Fig. 7. More trees than the optimum value may increase the calculation time of the model; meanwhile, a less value can predict erroneous values. The error rate is found to vary insignificantly (can be considered constant) after 50 trees. Fundamental settings or hyperparameters that influence the behavior and performance of the model are tabulated in Table 7.

Table 7 Hyperparameters for RF model

Full size table

3.2.5 Limitations of AI models

1.
ANN: ANN requires a relatively large amount of data for training, so it may be computationally intensive. It is also often considered a kind of "black-box" models, which makes it a challenging task to interpret their decision-making process. Selecting the exemplary architecture and hyperparameters can be a trial-and-error process, and it is sensitive to initial conditions.
2.
MLR: MLR assumes a linear relationship between independent and dependent variables. It might not capture complex non-linear relationships in data. Additionally, it is sensitive to multicollinearity among the predictor variables, which can lead to unstable coefficient estimates.
3.
DT: Decision tree assumes that data are non-linearly separable, and it can lead to overfitting, especially when the tree depth is not adequately controlled. It may not perform well on highly imbalanced datasets; and biased trees might be created if one class dominates others.
4.
RF: Random forest is less interpretable than individual decision trees and can be computationally expensive for large datasets. It may not perform optimally when there is a high degree of multicollinearity in the features; and it may struggle with extrapolation as relying on the range of values seen in the training data.

4 Results and discussion

4.1 Experimental results

The variation in density is plotted against WFS content (Fig. 8). It can be seen that the density reduces by 25% with increased WFS content, because WFS exhibits lower dry density values. The experimental results of the k value for all cases are plotted in Fig. 9. As seen in the surface heat map, the permeability is decreasing as the relative density and WFS content increase. And the permeability also decreases with an increase in the replacement level of WFS. Fully replacement of sand with WFS reduces the k value by 36%, 51% and 57% for R_D values of 65%, 75% and 85%, respectively.

4.2 Statistical features of data sets

The descriptive statistical summary for the training, validation, testing and total data sets is given in Table 8. The summary features all necessary statistical parameters: count, lower and upper bound, mean, standard deviation, kurtosis and skewness. Standard deviation is the maximum in the validation dataset for all parameters. Kurtosis and skewness are reported for all datasets, purposed to measure the symmetries about the center point and the distribution pattern of data. The Pearsons coefficient between two input parameters shows their relationship, and the histogram represents the distribution of data values (Fig. 10).

Table 8 Statistical features of different data sets

Full size table

4.3 Performance of AI models

4.3.1 Results of ANN

The hit and trial method was used to select the number of hidden layers, which is found to be optimum at 10. The 5–10–1 network with 76 weights resulted in SSE value of 0.0782 with a skip-layer connection. More than half of the predicted points are on the negative side of the 1:1 line. Figure 11 shows the output of the ANN model. Error lines of ± 20% indicate that the maximum data points exist within that limit (Fig. 11a). The maximum error value is found to be -0.55 m/day, which is unacceptable (Fig. 11c). Consequently, this model is inadequate for determining the permeability of sand and WFS mixture.

4.3.2 Results of MLR

The results of MLR model are presented in Fig. 12. As can be seen, all data points get fitted between ± 15% error lines (Fig. 12a). The plot for actual and predicted values along data points is presented in Fig. 12b. The maximum value of the errors is 0.4 m/day (Fig. 12c). MLR assumes that the amount of errors in the residuals is similar at each point of the linear model. This scenario is known as homoscedasticity. This assumption is attributed to the low degree of reliability.

$$k=0.5051\frac{{(\text{BS})}^{0.001} {(Q)}^{0.03724}}{{(\text{RD})}^{0.492}{ (\text{WFS})}^{0.0591}{ (T)}^{0.003}}$$

4.3.3 Results of DT

The decision tree analysis predicts the k-values based on Q values. The decision-making process is showcased in Fig. 13. As seen, in a particular box, the k-value is given with the no. of observations (n) and the percentage of observations considered in that condition (%). Condition is written beneath the boxes; all boxes are filled with contrast counter colors (the larger the values, the darker the colors).

Figure 14a presents the scatter plot of actual and predicted k-values acquired from the decision tree model. It is shown that this model performs poorly in predicting the k-value of the sand-WFS mixture; so it is not recommended. A particular predicted value covers a wide range of actual values. The relative error value is 0.6 m/day, the highest among all models (Fig. 14c).

4.3.4 Results of RF

This approach performs better than other techniques. Results from the RF model are presented in Fig. 15. The error lines to fit the data in the scatter plot are drawn at 15% on the positive side and at 10% on the negative side. For actual k-values less than 3.5 m/day, the data points are around the 1:1 line, and the values greater than 3.5 m/day lean towards the negative error line (Fig. 15a). The maximum value of error is found to be 0.45 m/day (Fig. 15c).

4.4 Comparison of performance of different models

4.4.1 Performance parameters

The effectiveness of the proposed models is evaluated by the following performance parameters: coefficient of determination (R²), mean squared error (MSE), root mean square error (RMSE), performance index (PI), index of scatter (IOS), index of agreement (IOA), variance accounted for (VAF), and a20 index (Table 9). The mathematical expressions, ideal values and significance of each parameter are also listed in Table 9. Notation y represents actual data, $\overline{y }$ represents the mean of actual data, $\widehat{y}$ represents the predicted data, n is the number of data points, and 20 m represents the number of data points, which are in the range of $\pm$ 20% of the actual data.

Table 9 Insights into the considered performance parameters

Full size table

The values of these parameters for the training, validation and testing datasets are presented in Tables 10, 11 and 12, respectively. The values demonstrate the performance of different models. The values of R², MSE and RMSE for the testing dataset are less than those for the training and validation datasets.

Table 10 Performance parameters for the training dataset

Full size table

Table 11 Performance parameters for the validation dataset

Full size table

Table 12 Performance parameters for the testing dataset

Full size table

The values of R² for the training, validation and testing datasets are 0.96500, 0.96614 and 0.9126, respectively. The R² value for the ANN model is the least of all the proposed models for the testing dataset. For the training data, the values of R², MSE and RMSE are 0.96106, 0.04289 and 0.2071, respectively.

The performance parameters for RF are optimum for all datasets in a minimum error. The value of R² for the training, validation and testing datasets are 0.99314, 0.99374 and 0.9579, respectively. Contrast to the random forest, the multi-linear regression performs well, as the values of R² for the training, validation and testing datasets are 0.98066, 0.96854 and 0.9265, respectively. The values of R² for the training, validation and testing datasets are 0.96106, 0.95567 and 0.9338, respectively, which shows a poor correlation between the actual and predicted values.

4.4.2 Check for overfitting

Overfitting is a common challenge in machine learning. It means that a model learns the training data to a so excessive degree that it even captures the noises and random fluctuations, instead of only the genuine underlying patterns. As a result, an overfitted model performs well on the training data, but poorly on unseen or new data, leading to bad generalization. Understanding the issue of overfitting is crucial for building up accurate and reliable models on real-world tasks. In this study, the overfitting ratio is computed in Eq. 5. OFR confirms that the RF model is ideally fit.

$${\text{OFR}}=\frac{{{\text{RMSE}}}_{validation}}{{{\text{RMSE}}}_{training}}$$

(6)

4.4.3 Taylor's diagram

Taylor's diagram quantifies the degree of correspondence between the predicted and actual values. Figure 16 depicts a Taylor diagram, which graphically illustrates the following metrics for all the proposed methods: the value of the Pearson correlation coefficient, the root-mean-square error, and the standard deviation. As can be seen, the mark of RF is much closer to the actual value point than other marks are.

4.4.4 Distribution of residuals

The error distribution highlights the instances where a model consistently underperforms or overperforms along the data points. The error distribution comparison of all proposed AI approaches (Fig. 17) indicates that the random forest is the best-fit approach for the prediction of the k-value of sand-WFS mixtures. The box plots depict the lower and upper values of residuals with outlier points. Investigating these outliers can provide insights into unique scenarios or data points that require special attention. The performance of the models can be compared based on the distance of the median from the origin line. The results are consistent with the performance parameters. Patterns and trends identified from the models' error distribution indicate the absence of systematic errors. These patterns can help increase the robustness of the performance of the models. In addition, the error distribution results show that the models are well-calibrated, particularly in specific prediction ranges.

4.5 Sensitivity analysis

Sensitivity analysis is carried out to determine the most influential input parameters in predicting k-values. As RF is an outperforming approach, sensitivity analysis is done based on the RF method, and the results are presented in Table 13. This analysis is made by relucting one parameter, and the value of R² is noted. The parameters that cause a reduction in R² are influential. It is shown that R_D and Q are the most influential parameters, which highly impact the k-values of the composite soil.

Table 13 Sensitivity analysis based on the random forest method

Full size table

$$\widehat{{s}_{i}}=\widehat{{s}_{a}}-\widehat{{s}_{r}}$$

(7)

4.6 Comparison with the existing literature

The best architecture model in this study is the random forest for predicting the soil permeability. The proposed model in this study is compared with the models available from the existing literature (Table 14). The R² for the actual and predicted datasets of the available models is relatively lower than that of the model proposed in this study.

Table 14 Comparison of the best-performing model with the existing literature

Full size table

5 Conclusions

This study explores the drainage behavior of WFS-incorporated sand. The experimental research has been extended to include AI modeling. The following major conclusions are drawn from this study.

The permeability tends to decrease as the relative density of the soil increases. A notable reduction in the k-value, up to 140%, can be observed when the relative density is increased from 65% to 85%. Similarly, an increase in the replacement level of WFS is associated with the decrease in the permeability. When sand is completely replaced with WFS, there are reductions of 36%, 51%, and 57% in the k-values for the relative density of 65%, 75%, and 85%, respectively.
The R² value and other performance parameters indicate that the relationship between the actual and predicted values is most pronounced in the random forest method. The order of the performance of all the proposed models can be presented as RF > MLR > ANN > DT.
Taylor's diagram is used to verify the outcomes of all the considered AI approaches, and it proves the good performance of RF, as its mark is nearer to the actual value. The overfitting ratio for RF is close to 1, indicating a strong level of fitness of the model.
Sensitivity analysis demonstrates that Q and R_D are the most influential parameters for predicting k-values.

Data availability

Data will be made available on reasonable request.

Abbreviations

KNN:: k-nearest neighbors
SVM:: support vector machine
LightGBM:: light gradient boosting machine
RF:: random forest
GB:: gradient boosting
ANN:: artificial neural network
MLP:: multiple layer perceptron
RBF:: radial basis function
ANFIS:: adaptive neuro-fuzzy inference system
GPR:: gaussian process regression
Poly:: polynomial
PUK:: pearson universal kernel
TLBO:: teaching learning-based optimization
MEP:: multi-expression-programming
GEP:: genetic-expression-programming
GP:: gaussian process
MLR:: multi-linear-regression
CANFIS:: co-active neuro-fuzzy inference-system
MLP:: multilayer perceptron
DT:: decision tree
CART:: classification and regression trees
GMDH:: group method of data handling
GA:: genetic algorithm
k :: permeability
PI:: plasticity index
w _l :: liquid limit
w _p :: plastic limit
CC:: clay content
ρ :: density
wc:: water content
e :: void ratio
D ₁₀ :: diameter at 10% finer
D ₃₀ :: diameter at 30% finer
D ₆₀ :: diameter at 60% finer
G:: specific gravity
ρ _d :: dry density
D :: grain size
S :: % of sand
Fa:: % of fly ash
T :: time
H :: head
OC:: organic content
BD:: bulk density
PD:: particle density
UCS:: uniaxial compressive strength
Gp:: gas pressure
Temp:: temperature
σ′:: effective stress
Q :: rate of flow
A :: c/s area of sample
L :: length of flow within soil sample
h :: total hydraulic head
a :: c/s area of stand pipe
$\mu$ :: coefficient of viscosity
γ:: unit weight of water
h ₁, h ₂ :: hydraulic heads
t :: time interval
d _e :: effective grain size
c :: constant related to e
n :: porosity
S _S :: specific surface
BS:: percentage of sand
WFS:: fraction of waste foundry sand

References

Abichou, T., Benson, C. H., Edil, T. B., & Freber, B. W. (1998). Using waste foundry sand for hydraulic barriers. Proceedings of Recycled Materials in Geotechnical Applications, 86–99.
Abichou, T., Benson, C. H., & Edil, T. B. (2000). Foundry green sands as hydraulic barriers: Laboratory study. Journal of Geotechnical and Geoenvironmental Engineering, 126(12), 1174–1183. https://doi.org/10.1061/(ASCE)1090-0241(2000)126:12(1174)
Article Google Scholar
Abichou, T., Benson, C. H., & Edil, T. B. (2002). Foundry green sands as hydraulic barriers: Field study. Journal of Geotechnical and Geoenvironmental Engineering, 128(3), 206–215.
Article Google Scholar
Ahmad, M., Keawsawasvong, S., Bin Ibrahim, M. R., Waseem, M., Kashyzadeh, K. R., & Sabri, M. M. S. (2022). Novel approach to predicting soil permeability coefficient using Gaussian process regression. Sustainability, 14(14), 8781. https://doi.org/10.3390/su14148781
Article Google Scholar
Alwosheel, A., van Cranenburgh, S., & Chorus, C. G. (2018). Is your dataset big enough? Sample size requirements when using artificial neural networks for discrete choice analysis. Journal of Choice Modelling, 28, 167–182. https://doi.org/10.1016/j.jocm.2018.07.002
Article Google Scholar
ASTM D4254. (2000). Standard test methods for minimum index density and unit weight of soils and calculation of relative density. West Conshohocken: ASTM International.
Google Scholar
ASTM D854. (2002). Standard test methods for specific gravity of soil solids by water pycnometer. West Conshohocken: ASTM International.
Google Scholar
ASTM D2487. (2006). Classification and identification of soils for general engineering purposes. West Conshohocken: ASTM International.
Google Scholar
ASTM D422-63. (2007). Standard test method for particle-size analysis of soils. West Conshohocken: ASTM International.
Google Scholar
ASTM D2434. (2019). Standard test method for permeability of granular soils (constant head). West Conshohocken: ASTM International.
Google Scholar
Baghbani, A., Choudhury, T., Costa, S., & Reiner, J. (2022). Application of artificial intelligence in geotechnical engineering: A state-of-the-art review. Earth-Science Reviews, 228, 103991. https://doi.org/10.1016/j.earscirev.2022.103991
Article Google Scholar
Bui, Q.-A.T., Al-Ansari, N., van Le, H., Prakash, I., & Pham, B. T. (2022). Hybrid model: teaching learning-based optimization of artificial neural network (TLBO-ANN) for the prediction of soil permeability coefficient. Mathematical Problems in Engineering, 2022, 1–9. https://doi.org/10.1155/2022/8938836
Article Google Scholar
Carey, T. J., Stone, N., & Kutter, B. L. (2020). Grain size analysis and maximum and minimum dry density testing of Ottawa F-65 Sand for LEAP-UCD-2017. In Model Tests and Numerical Simulations of Liquefaction and Lateral Spreading (pp. 31–44). Springer International Publishing. https://doi.org/10.1007/978-3-030-22818-7_2
Casagrande, A. (1948). Classification and identification of soils. Transactions of the American Society of Civil Engineers, 113, 901–930.
Article Google Scholar
Cashman, P. M., & Preene, M. (2020). Permeability of soils and rocks. In Groundwater Lowering in Construction (pp. 73–92). CRC Press. https://doi.org/10.1201/9781003050025-5
Chen, J., Tong, H., Yuan, J., Fang, Y., & Gu, R. (2022). Permeability prediction model modified on Kozeny-Carman for building foundation of clay soil. Buildings, 12(11), 1798. https://doi.org/10.3390/buildings12111798
Article Google Scholar
Dalkilic, H. Y., Kumar, D., Samui, P., Dixon, B., Yesilyurt, S. N., & Katipoğlu, O. M. (2023). Application of deep learning approaches to predict monthly stream flows. Environmental Monitoring and Assessment, 195(6), 705. https://doi.org/10.1007/s10661-023-11331-5
Article Google Scholar
Darcy, H. (1856). Les Fontaines Publiques de la Ville de Dijon.
Desai, V. S., & Joshi, S. (2010). Application of decision tree technique to analyze construction project data (pp. 304–313). https://doi.org/10.1007/978-3-642-12035-0_30
Dutta, R. K., Gnananandarao, T., & Sharma, A. (2019). Application of random forest regression in the prediction of ultimate bearing capacity of strip footing resting on dense sand overlying loose sand deposit. Journal of Soft Computing in Civil Engineering, 3(4), 28–40. https://doi.org/10.22115/scce.2019.137910.1080
Article Google Scholar
Erzin, Y., Gumaste, S. D., Gupta, A. K., & Singh, D. N. (2009). Artificial neural network (ANN) models for determining hydraulic conductivity of compacted fine-grained soils. Canadian Geotechnical Journal, 46(8), 955–968. https://doi.org/10.1139/T09-035
Article Google Scholar
Feng, S., Barreto, D., Imre, E., Ibraim, E., & Vardanega, P. J. (2023). Use of hydraulic radius to estimate the permeability of coarse-grained materials using a new geodatabase. Transportation Geotechnics, 41, 101026. https://doi.org/10.1016/j.trgeo.2023.101026
Article Google Scholar
FHWA. (2004). Foundry sands facts for civil engineers. Federal Highway Adminstration.
Google Scholar
Fushiki, T. (2011). Estimation of prediction error by using K-fold cross-validation. Statistics and Computing, 21(2), 137–146. https://doi.org/10.1007/s11222-009-9153-8
Article MathSciNet Google Scholar
Gedik, A., Lav, A. H., & Lav, M. A. (2018). Investigation of alternative ways for recycling waste foundry sand: An extensive review to present benefits. Canadian Journal of Civil Engineering, 45(6), 423–434. https://doi.org/10.1139/cjce-2017-0183
Article Google Scholar
Guney, Y., Aydilek, A. H., & Demirkan, M. M. (2006). Geoenvironmental behavior of foundry sand amended mixtures for highway subbases. Waste Management, 26(9), 932–945. https://doi.org/10.1016/j.wasman.2005.06.007
Article Google Scholar
Haykin, S. (2009). Neural networks and learning machines (Third). Pearson.
Google Scholar
Hazen, A. (1911). Discussion: Dams on sand foundations. Transactions, American Society of Civil Engineers, 73(11).
Heidemann, M., Nierwinski, H. P., Hastenpflug, D., Barra, B. S., & Perez, Y. G. (2021). Geotechnical behavior of a compacted waste foundry sand. Construction and Building Materials, 277, 122267. https://doi.org/10.1016/j.conbuildmat.2021.122267
Article Google Scholar
Ho, T. K. (1995). Random Decision Forests. 3rd International Conference on Document Analysis and Recognition, Montreal, 278–282.
Izadi, H., Roostaei, M., Hosseini, S. A., Soroush, M., Mahmoudi, M., Devere-Bennett, N., Leung, J. Y., & Fattahpour, V. (2022). A hybrid GBPSO algorithm for permeability estimation using particle size distribution and porosity. Journal of Petroleum Science and Engineering, 217, 110944. https://doi.org/10.1016/j.petrol.2022.110944
Article Google Scholar
Javed, S. (1995). Uses of waste foundry sands in civil engineering. Transportation Research R, 1486, 109–113.
Google Scholar
Javed, S., & Lovell, C. W. (1994). Use of waste foundry sand in highway construction. Joint Highway Research Project Report, Indiana Department of Transportation, 1–304.
Khatti, J., & Grover, K. S. (2021a). Computation of permeability of soil using artificial intelligence approaches. International Journal of Engineering and Advanced Technology, 11(1), 257–266. https://doi.org/10.35940/ijeat.A3220.1011121
Article Google Scholar
Khatti, J., & Grover, K. S. (2021b). Determination of permeability of soil for Indian soil classification system using artificial neural network technique. Invertis Journal of Science & Technology, 14(2), 49–57. https://doi.org/10.5958/2454-762X.2021.00005.6
Article Google Scholar
Khatti, J., & Grover, K. S. (2023a). Assessment of fine-grained soil compaction parameters using advanced soft computing techniques. Arabian Journal of Geosciences, 16(3), 208. https://doi.org/10.1007/s12517-023-11268-6
Article Google Scholar
Khatti, J., & Grover, K. S. (2023b). CBR Prediction of pavement materials in unsoaked condition using LSSVM, LSTM-RNN, and ANN approaches. International Journal of Pavement Research and Technology. https://doi.org/10.1007/s42947-022-00268-6
Article Google Scholar
Khatti, J., & Grover, K. S. (2023c). Prediction of compaction parameters for fine-grained soil: Critical comparison of the deep learning and standalone models. Journal of Rock Mechanics and Geotechnical Engineering. https://doi.org/10.1016/j.jrmge.2022.12.034
Article Google Scholar
Khatti, J., & Grover, K. S. (2023d). Prediction of UCS of fine-grained soil based on machine learning part 1: Multivariable regression analysis, gaussian process regression, and gene expression programming. Multiscale and Multidisciplinary Modeling, Experiments and Design, 6(2), 199–222. https://doi.org/10.1007/s41939-022-00137-6
Article Google Scholar
Khatti, J., & Grover, K. S. (2023e). Prediction of UCS of fine-grained soil based on machine learning part 2: Comparison between hybrid relevance vector machine and Gaussian process regression. Multiscale and Multidisciplinary Modeling, Experiments and Design. https://doi.org/10.1007/s41939-023-00191-8
Article Google Scholar
Kim, M. H., & Song, C. M. (2023). Prediction of the soil permeability coefficient of reservoirs using a deep neural network based on a dendrite concept. Processes, 11(3), 661. https://doi.org/10.3390/pr11030661
Article Google Scholar
Kozeny, J. (1927). Uber Kapillare Leitung der Wasser in Boden. Royal Academy of Science, Vienna Proc. Class I, 136, 271–306.
Google Scholar
Kumar, A., & Parihar, A. (2022). State-of-the-art review on sustainability in geotechnical applications of waste foundry sand. Indian Geotechnical Journal, 52(2), 416–436. https://doi.org/10.1007/s40098-021-00580-1
Article Google Scholar
Kumar, D. R., Samui, P., Wipulanusat, W., Keawsawasvong, S., Sangjinda, K., & Jitchaijaroen, W. (2023). Soft-computing techniques for predicting seismic bearing capacity of strip footings in slopes. Buildings, 13(6), 1371. https://doi.org/10.3390/buildings13061371
Article Google Scholar
Länsivaara, T. T., Farhadi, M. S., & Samui, P. (2023). Performance of traditional and machine learning-based transformation models for undrained shear strength. Arabian Journal of Geosciences, 16(3), 183. https://doi.org/10.1007/s12517-022-11173-4
Article Google Scholar
Lazarevska, M., Kneević, M., Cvetkovska, M., & Trombeva-Gavriloska, A. (2014). Application of artificial neural networks in civil engineering. Tehnicki Vjesnik-Technical Gazette, 21, 1353–1359.
Google Scholar
Lee, K., Cho, J., Salgado, R., & Lee, I. (2001). Retaining wall model test with waste foundry sand mixture backfill. Geotechnical Testing Journal, 24(4), 401–408. https://doi.org/10.1520/GTJ11137J
Article Google Scholar
Lin, D., Wu, H., & Hu, L. (2018). Excess Pore Pressure During One-Dimensional Self-weight Consolidation. In Proceedings of GeoShanghai 2018 International Conference: Fundamentals of Soil Behaviours (pp. 407–416). Springer Singapore. https://doi.org/10.1007/978-981-13-0125-4_45
Mast, D. G., & Fox, P. J. (1998). FHWA/IN/JTRP-98/18, Geotechnical performance of a highway embankment constructed using waste foundry sand. Joint Transportation Reasearch Program, Indiana Department of Transportation.
McCulloch, W. S., & Pitts, W. (1943). A logical calculus of the ideas immanent in nervous activity. The Bulletin of Mathematical Biophysics, 5(4), 115–133. https://doi.org/10.1007/BF02478259
Article MathSciNet Google Scholar
Ministry of Mines, I. (2018). Sand Mining Framework.
Nagy, L., akács, A. T., Huszák, T., Mahler, A., & Varga, G. (2013). Comparison of permeability testing methods. International Conference on Soil Mechanics and Geotechnical Engineering.
Osterhout, W. J. (1922). Direct and indirect determinations of permeability. Journal of General Physiology, 4(3), 275–283. https://doi.org/10.1085/jgp.4.3.275
Article Google Scholar
Partridge, B. K., Fox, P. J., Alleman, J. E., & Mast, D. G. (1999). Field demonstration of highway embankment construction using waste foundry sand. Transportation Research Record, 1670, 98–105. https://doi.org/10.3141/1670-13
Article Google Scholar
Pham, B. T., Nguyen, M. D., Al-Ansari, N., Tran, Q. A., Ho, L. S., van Le, H., & Prakash, I. (2021). A comparative study of soft computing models for prediction of permeability coefficient of soil. Mathematical Problems in Engineering, 2021, 1–11. https://doi.org/10.1155/2021/7631493
Article Google Scholar
Rabbani, A., Samui, P., & Kumari, S. (2023). Implementing ensemble learning models for the prediction of shear strength of soil. Asian Journal of Civil Engineering. https://doi.org/10.1007/s42107-023-00629-x
Article Google Scholar
Rauter, S., & Tschuchnigg, F. (2021). CPT data interpretation employing different machine learning techniques. Geosciences, 11(7), 265. https://doi.org/10.3390/geosciences11070265
Article Google Scholar
Rehman, Z., Khalid, U., Ijaz, N., Mujtaba, H., Haider, A., Farooq, K., & Ijaz, Z. (2022). Machine learning-based intelligent modeling of hydraulic conductivity of sandy soils considering a wide range of grain sizes. Engineering Geology, 311, 106899. https://doi.org/10.1016/j.enggeo.2022.106899
Article Google Scholar
Sarker, I. H. (2021). Machine learning: algorithms, real-world applications and research directions. SN Computer Science, 2(3), 160. https://doi.org/10.1007/s42979-021-00592-x
Article Google Scholar
Shahin, M. A. (2013). Artificial Intelligence in Geotechnical Engineering. In Metaheuristics in Water, Geotechnical and Transport Engineering (pp. 169–204). Elsevier. https://doi.org/10.1016/B978-0-12-398296-4.00008-8
Siddique, R., & Singh, G. (2011). Utilization of waste foundry sand (WFS) in concrete manufacturing. Resources, Conservation and Recycling, 55(11), 885–892. https://doi.org/10.1016/j.resconrec.2011.05.001
Article Google Scholar
Singh, B., Sihag, P., Pandhiani, S. M., Debnath, S., & Gautam, S. (2021). Estimation of permeability of soil using easy measured soil parameters: Assessing the artificial intelligence-based models. ISH Journal of Hydraulic Engineering, 27(sup1), 38–48. https://doi.org/10.1080/09715010.2019.1574615
Article Google Scholar
Singh, V. K., Kumar, D., Kashyap, P. S., Singh, P. K., Kumar, A., & Singh, S. K. (2020). Modelling of soil permeability using different data driven algorithms based on physical properties of soil. Journal of Hydrology, 580, 124223. https://doi.org/10.1016/j.jhydrol.2019.124223
Article Google Scholar
Sinha, A. K., Vinoth, M., Shankar, S. R. (2020). Characterization of Foundry Sand Waste Material for Road Construction. New Building Material & Construction World. https://www.nbmcw.com/article-report/infrastructure-construction/roads-and-pavements/characterisation-of-foundry-sand-waste-material-for-road-construction.html
Slichter, C. S. (1899). Heoretical investigation of the motion of ground waters.
Smith, I. (2014). Smith’s elements of soil mechanics (9th ed.). Wiley-Blackwell.
Google Scholar
Taylor, D. W. (1948). Fundamentals of soil mechanics. Soil Science, 66(2), 161. https://doi.org/10.1097/00010694-194808000-00008
Article Google Scholar
Terzaghi, K. (1925). Determination of permeability of clays. Engineering News-Record, 95(21), 832–836.
Google Scholar
Tittarelli, F. (2018). Waste foundry sand. In Waste and Supplementary Cementitious Materials in Concrete (pp. 121–147). Elsevier. https://doi.org/10.1016/B978-0-08-102156-9.00004-3
Torabi, M., Sarkardeh, H., & Mirhosseini, S. M. (2022a). Estimating the permeability coefficient of soil using CART and GMDH approaches. Water Supply, 22(8), 6756–6764. https://doi.org/10.2166/ws.2022.248
Article Google Scholar
Torabi, M., Sarkardeh, H., & Mirhosseini, S. M. (2022b). Prediction of soil permeability coefficient using GEP approach. Numerical Methods in Civil Engineering. https://doi.org/10.52547/nmce.2022.414
Article Google Scholar
Tran, V. Q. (2022). Predicting and investigating the permeability coefficient of soil with aided single machine learning algorithm. Complexity, 2022, 1–18. https://doi.org/10.1155/2022/8089428
Article Google Scholar
Uthayakumar, A., Mohan, M. P., Khoo, E. H., Jimeno, J., Siyal, M. Y., & Karim, M. F. (2022). Machine learning models for enhanced estimation of soil moisture using wideband radar sensor. Sensors, 22(15), 5810. https://doi.org/10.3390/s22155810
Article Google Scholar
Vipulanandan, C., Weng, Y., & Zhang, C. (2000). Designing flowable grout mixes using foundry sand, clay and fly ash. Proceedings of Sessions of Geo-Denver Advances in Grouting and Ground Modification, 215–233. https://doi.org/10.1061/40516(292)15
Wang, J., Yan, W., Zhijun, W., Wang, Y., Lv, J., & Zhou, A. (2020). Prediction of permeability using random forest and genetic algorithm model. Computer Modeling in Engineering & Sciences, 125(3), 1135–1157. https://doi.org/10.32604/cmes.2020.014313
Article Google Scholar
Winkler, E. S., & Bolshakov, A. (2000). Characterization of foundry sand waste.
Winston, P. H. (1992). Artificial intelligence (3rd ed.). Pearson.
Google Scholar
Xu, H., Chang, R., Pan, M., Li, H., Liu, S., Webber, R. J., Zuo, J., & Dong, N. (2022). Application of artificial neural networks in construction management: A scientometric review. Buildings, 12(7), 952. https://doi.org/10.3390/buildings12070952
Article Google Scholar
Yang, X., Guan, J., Ding, L., You, Z., Lee, V. C. S., Mohd Hasan, M. R., & Cheng, X. (2021). Research and applications of artificial neural network in pavement engineering: A state-of-the-art review. Journal of Traffic and Transportation Engineering (english Edition), 8(6), 1000–1021. https://doi.org/10.1016/j.jtte.2021.03.005
Article Google Scholar
Yilmaz, I., Marschalko, M., Bednarik, M., Kaynar, O., & Fojtova, L. (2012). Neural computing models for prediction of permeability coefficient of coarse-grained soils. Neural Computing and Applications, 21(5), 957–968. https://doi.org/10.1007/s00521-011-0535-4
Article Google Scholar
Zhang, L., Dang, F., Gao, J., & Ding, J. (2021). Measurement and investigation on 1-D consolidation permeability of saturated clay considering consolidation stress ratio and stress history. Geofluids, 2021, 1–21. https://doi.org/10.1155/2021/6616331
Article Google Scholar

Download references

Funding

Not applicable.

Author information

Authors and Affiliations

Department of Civil Engineering, Thapar Institute of Engineering and Technology, Patiala, 147004, India
Ankit Kumar & Aditya Parihar

Authors

Ankit Kumar
View author publications
You can also search for this author in PubMed Google Scholar
Aditya Parihar
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization, methodology, software, data curation, writing—original draft preparation, visualization, investigation, formal analysis (AK): Supervision, writing—reviewing and editing (AP).

Corresponding author

Correspondence to Ankit Kumar.

Ethics declarations

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kumar, A., Parihar, A. Impact of waste foundry sand on drainage behavior of sandy soil: an experimental and machine learning study. AI Civ. Eng. 3, 1 (2024). https://doi.org/10.1007/s43503-023-00019-x

Download citation

Received: 06 June 2023
Revised: 19 November 2023
Accepted: 29 November 2023
Published: 02 January 2024
DOI: https://doi.org/10.1007/s43503-023-00019-x

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Impact of waste foundry sand on drainage behavior of sandy soil: an experimental and machine learning study

Abstract

Similar content being viewed by others

Sensitivity analysis and prediction of erodibility of treated unsaturated soil modified with nanostructured fines of quarry dust using novel artificial neural network

Geocell Mattress Reinforcement for Bottom Ash: A Comprehensive Study of Load-Settlement Characteristics

Predictive Models for Estimating the Coefficient of Permeability for Sands

Explore related subjects

1 Introduction

2 Materials used

2.1 Sand

2.2 Waste foundry sand

3 Research methodology

3.1 Lab experiments

3.1.1 Relative density

3.1.2 Permeability

3.2 AI-approaches

3.2.1 Artificial neural networks (ANN)

3.2.2 Multilinear regression (MLR)

3.2.3 Decision tree model (DT)

3.2.4 Random forest (RF)

3.2.5 Limitations of AI models

4 Results and discussion

4.1 Experimental results

4.2 Statistical features of data sets

4.3 Performance of AI models

4.3.1 Results of ANN

4.3.2 Results of MLR

4.3.3 Results of DT

4.3.4 Results of RF

4.4 Comparison of performance of different models

4.4.1 Performance parameters

4.4.2 Check for overfitting

4.4.3 Taylor's diagram

4.4.4 Distribution of residuals

4.5 Sensitivity analysis

4.6 Comparison with the existing literature

5 Conclusions

Data availability

Abbreviations

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation