1 Introduction

Data mining methods, including regression analysis and artificial neural networks (ANNs), are increasingly used to predict the physical and mechanical properties of soils and to optimize techniques for determining soil engineering properties (Yuanyou et al. 1997; Yang & Rosenbaum 2002; Boadu et al. 2013; Varghese et al. 2013).

The literature review conducted by the authors showed that statistical tools are often used to specify the relationships between individual geotechnical parameters of natural soils and rocks determined in laboratory tests (Penumadu and Zhao 1999; Lee et al. 2003; Park and Lee 2011; Gurocak et al. 2012; Khanlari et al. 2012; Tizpa et al. 2015; Kim et al. 2021). Such correlations are also successfully used to assess the relationship between the mechanical properties of various soil mixtures and the substances intended to improve or modify them (Najjar and Basheer 1996; Debnath and Dey 2017; Dehghanbanadaki et al. 2019). Artificial neural networks have also been applied to in-situ investigations (Zhou and Wu 1994; Chan et al. 1995; Abu Kiefa 1998; Nejad et al. 2009; Emami and Yasrobi 2017). Statistical methods and ANNs are also widely used in the interpretation of geotechnical data to assess slope stability with numerical modelling methods (Sakellariou and Ferentinou 2005; Wang et al. 2007; Mustafa et al. 2012; Lian et al. 2015; Ray et al. 2020; Li et al. 2022) or subsidence (Kanayama et al. 2014; Ghiasi and Koushki 2020).

Particle size analysis is performed to determine the particle size composition of a soil (ISO 2017). It yields the percentage content of the individual fractions present in the soil, and thus the type and name of the tested soil. Determining the soil type is an initial laboratory step that allows the soil's properties to be predicted and the scope of further laboratory tests of the physical and mechanical properties to be defined (Vangla and Latha 2015; Liu et al. 2020; Guo et al. 2020).

Particle size analysis can be performed by (1) the mechanical method (sieve analysis), (2) the sedimentation method (hydrometer analysis), or by a combination of these two methods. In the case of cohesive soils, it is necessary to use the hydrometer method, which is much more labour-intensive and time-consuming (Barman and Choudhury 2020). For sieve analysis, the preparation of the test sample is limited to drying it. Hydrometer analysis additionally requires washing the dried sample, boiling the suspension and then bringing it to ambient temperature. The subsequent testing process takes approximately 30 min for sieve analysis, whereas for hydrometer analysis it extends to more than 24 h (Myślińska 1992; ISO 2016).

The basic objective of this article was to verify the possibility of using ANNs (the radial basis function network, RBF, and the multilayer perceptron, MLP) to predict the hydrometer reading after 24 h from the results of measurements carried out within 4 h of mixing the suspension, which would significantly shorten this type of test. An additional question was whether the measurement time could be reduced even further and how this would affect the quality of the prediction of the subsequent measurements. The results obtained with ANNs were compared with those obtained using linear regression models with all considered predictors (REG), stepwise regression models (SREG), and classification and regression trees (CRT).

2 Materials and methods

The granulometric composition analyses were carried out at the Geotechnical and Geomechanical Research Laboratory of the Department of Hydrogeology and Engineering Geology, Faculty of Geology, Geophysics and Environmental Protection of the AGH University of Krakow. The research was conducted in 2020–2021. The database included analyses of 693 samples of natural soils of various genesis and type (from low-cohesive to very cohesive). The samples were collected at depths between 0.5 m and 9.7 m below ground level at various sites in the Śląskie, Świętokrzyskie, Małopolskie and Podkarpackie provinces. The specific density of these soils ranged from 2.66 to 2.70, values typical for cohesive soils with varying clay fraction content. All the tested soils were mineral or low-organic soils, with an organic matter content not exceeding 4.3%. Test specimens were prepared from material previously dried to constant mass at 105 °C; their mass ranged from 41.25 g for very cohesive soils to 152.28 g for low-cohesive soils with a high sand fraction content, with an average of about 60 g.

The study was carried out using a combination of the mechanical (sieve analysis) and sedimentation (hydrometer analysis) methods in accordance with ISO 17892-4 (ISO 2016). Soil particles larger than 0.063 mm were subjected to sieve analysis, while the smaller particles were used to prepare the suspension tested by the hydrometer method.

Sedimentation methods, based on the fractionation of soil in a water suspension, rely on Stokes's law, according to which the free-fall (settling) velocity of a spherical particle is proportional to the square of its diameter and to the difference between its specific density and that of the liquid (water) in which it falls, inversely proportional to the viscosity of that liquid, and proportional to the gravitational acceleration (Myślińska 1992).

Since Stokes's law applies to spherical particles, and most soil particles are irregular rather than spherical, the concept of an equivalent diameter (dT) is introduced.

Assuming that, at the beginning of the sedimentation analysis, the carefully mixed soil suspension is homogeneous and the particles fall freely and independently of each other, the Stokes's-law formula can be transformed to obtain the size of the falling particles. Knowing the viscosity and specific density of water, the specific density of the soil skeleton and the gravitational acceleration, which are constant for a given test, and substituting the distance-to-time ratio for the velocity, it is possible, by conducting the hydrometer analysis, to determine the equivalent diameter (dT) of the particles that have travelled the distance (HR) in the time (T). The reading times are specified in the ISO 14688-2 standard (ISO 2017): hydrometer readings are taken after 30 s, 1, 2, 5, 15 and 30 min, and after 1, 2, 4 and 24 h. The temperature of the suspension is also recorded at each reading. The percentage of particles of the calculated diameter is determined by a formula that involves measuring the density of the suspension with the hydrometer; the result is the percentage of all particles with diameters smaller than dT.
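
For reference, a commonly used form of this transformation (a sketch consistent with the above description, where \(\eta\) denotes the viscosity of water, \({\rho }_{s}\) the specific density of the soil skeleton, \({\rho }_{w}\) the density of water and \(g\) the gravitational acceleration) is:

$$d_{T}=\sqrt{\frac{18\eta {H}_{R}}{\left({\rho }_{s}-{\rho }_{w}\right)gT}},$$

so each pair of sedimentation distance \({H}_{R}\) and time \(T\) yields one equivalent diameter.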

Table 1 summarizes the basic descriptive statistics of the analysed variables.

Table 1 Descriptive statistics of analysed variables

The hydrometer readings decrease over time, on average from 23.8 after 30 s to 1.0 after 24 h. The interquartile range also decreases, from 9.7 after 30 s to 3.3 after 24 h.

Comparison of the skewness with its standard error indicates left-skewness of the hydrometer readings after 30’’, 1’ and 2’, and right-skewness of the readings after 15’, 30’, 1 h, 2 h, 4 h and 24 h, as well as of the temperature. Therefore, the median and interquartile range are the more adequate sample characteristics.

The hydrometer reading after 24 h is strongly positively correlated (Table 2) with earlier measurements, especially with those obtained at a shorter time interval. On the one hand, this can make linear methods such as linear regression very useful for predicting measurement values after 24 h from previous measurements. On the other hand, these methods may be contraindicated by the collinearity of the predictors, which is noticeable here. The hydrometer reading after 24 h is not statistically significantly correlated with temperature, which may mean that the inclusion of this predictor in the model will not translate into an improvement in its quality. The dependence of the hydrometer reading after 24 h on other features and the distributions of these features are presented in Figures S1a–j (Supplementary Material).

Table 2 Pearson’s correlations. * Correlation is significant at 0.05 level (2-tailed). ** Correlation is significant at 0.01 level (2-tailed)

Several models were built for the hydrometer reading after 24 h prediction, including:

  • linear regression models with all predictors (REG)

  • stepwise regression models (SREG)

  • classification and regression trees (CRT)

  • artificial neural networks: radial basis function network (RBF) and multilayer perceptron (MLP).

In all models, the predictors were the hydrometer readings after 30’’, 1’, 2’, 5’, 15’, 30’, 1 h, 2 h and 4 h. The usefulness of temperature as a predictor in the model was also checked.

The linear regression model for a target variable \(Y\) and predictors \({X}_{1}, {X}_{2}, \dots , {X}_{p}\) has the form:

$$Y={\beta }_{0}+{\beta }_{1}{X}_{1}+{\beta }_{2}{X}_{2}+\dots +{\beta }_{p}{X}_{p}+\varepsilon ,$$

where \(\varepsilon\) is a random error with a zero-mean normal distribution and the coefficients \({\beta }_{0}, {\beta }_{1}, \dots , {\beta }_{p}\) are estimated with the least squares method. As some predictors may turn out to be statistically insignificant, the stepwise method of variable selection was used. In this method, at each step a variable is added or removed according to a criterion based on the value of the \(F\) statistic. For details, see Larose and Larose (2015) and IBM SPSS (2021).
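
As an illustration, the following is a minimal Python sketch of fitting the full model (REG) and a stepwise-like selection (SREG). Note that scikit-learn's SequentialFeatureSelector chooses variables by cross-validated score rather than by the F-statistic criterion used in SPSS, and the data file and column names are hypothetical:

```python
# Minimal sketch: full linear regression (REG) and stepwise-like selection (SREG).
import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.feature_selection import SequentialFeatureSelector

df = pd.read_csv("hydrometer.csv")                    # hypothetical data file

predictors = ["r30s", "r1min", "r2min", "r5min", "r15min",
              "r30min", "r1h", "r2h", "r4h"]          # hydrometer readings
X, y = df[predictors], df["r24h"]                     # target: reading after 24 h

reg = LinearRegression().fit(X, y)                    # REG: all predictors

# SREG analogue: forward selection by cross-validated score (not the F statistic).
selector = SequentialFeatureSelector(LinearRegression(), direction="forward",
                                     n_features_to_select="auto", tol=1e-4, cv=10)
selector.fit(X, y)
selected = [p for p, keep in zip(predictors, selector.get_support()) if keep]
sreg = LinearRegression().fit(X[selected], y)
```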

The classification and regression trees (CRT) method (Breiman et al. 1984) recursively partitions the records into subsets with similar values of the target variable. It thus produces a tree in which each decision node, starting from the initial one (the root), has exactly two edges leading to nodes at the next lower level. The CRT algorithm builds the tree by conducting, for each decision node, an exhaustive search over all available predictors and all possible splitting values, selecting the optimal split for quantitative target variable prediction according to the least squares deviation impurity measure. The predicted value of the quantitative target variable for an observation falling into a given node is the average value of this variable over the training-set records in that node. This means that weaker prediction results (in comparison with other methods) should be expected, since the number of different possible outcomes for the quantitative target variable is limited by the number of terminal nodes (leaves) in the tree. For details, see Larose and Larose (2015) and IBM SPSS (2021).

Two ANN models are implemented in PS IMAGO PRO (IBM SPSS Statistics). The first is the multilayer perceptron (MLP), which has an input layer, one or two hidden layers, and an output layer. For each quantitative predictor, there is one neuron in the input layer. The number of neurons in the hidden layers can be chosen automatically. The output layer has one neuron for the quantitative target variable. Each neuron in a given layer is connected to all neurons in the next layer. The connections have assigned weights, which are initially numbers in the range [0; 1]. The output of each neuron in the hidden and output layers is the value of the activation function applied to the linear combination of the input signals and weights. The activation function for the hidden layers can be the hyperbolic tangent or the sigmoid function; for the output layer, the identity function is additionally available. The weights are corrected in the learning process by the backpropagation algorithm so that the error function, defined as the sum of squared errors, reaches a minimum (Larose and Larose 2015; Rojas 1996).

The second model of ANN is the radial basis function (RBF) network. Compared to MLP, it has only one hidden layer in which the number of neurons depends on the number of groups that form the observations in the predictor space. Only connections between the hidden layer and the output layer have assigned weights. The weights do not require multiple corrections and are fitted by the least-squares method (IBM SPSS 2021).
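
Since scikit-learn has no built-in RBF network, the following is a minimal NumPy sketch under simplifying assumptions (Gaussian basis functions with a common, fixed width; centers found by k-means clustering; a normalized hidden layer loosely mirroring the softmax activation used later in this article). It illustrates the structure described above rather than reproducing the exact SPSS algorithm:

```python
# Minimal normalized RBF network for regression: hidden-layer centers from
# clustering, output weights fitted by linear least squares. X, y are NumPy arrays.
import numpy as np
from sklearn.cluster import KMeans

def rbf_features(X, centers, width):
    # Gaussian activations, normalized per sample (softmax-like hidden layer)
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    H = np.exp(-d2 / (2 * width ** 2))
    H /= H.sum(axis=1, keepdims=True)
    return np.column_stack([H, np.ones(len(X))])      # append bias term

def fit_rbf(X, y, n_centers=4, width=1.0):
    km = KMeans(n_clusters=n_centers, n_init=10, random_state=0).fit(X)
    H = rbf_features(X, km.cluster_centers_, width)
    w, *_ = np.linalg.lstsq(H, y, rcond=None)         # least-squares output weights
    return km.cluster_centers_, width, w

def predict_rbf(X, centers, width, w):
    return rbf_features(X, centers, width) @ w
```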

Model quality was assessed using the repeated cross-validation method, which effectively increases the precision of the error estimates while still maintaining a small bias (Kuhn and Johnson 2013; James et al. 2017). The records are divided \(m\) times into \(k\) groups of similar size. For each such split, the following procedure is repeated \(k\) times. Successively, each of the \(k\) groups of records becomes a test set, and the remaining groups together are treated as a training set, on which a model is built. Then the model is checked on the test set. In this way, \(k\) measures of model quality are obtained for each of the \(m\) considered partitions. This gives together \(k\times m\) measures of model quality, which are finally averaged.
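
A hedged sketch of this procedure with scikit-learn, using \(k=10\) folds and \(m=5\) repeats to match the setting applied later in this article (X and y as in the earlier sketch):

```python
# Repeated k-fold cross-validation: k*m quality measures, finally averaged.
from sklearn.model_selection import RepeatedKFold, cross_validate
from sklearn.linear_model import LinearRegression

cv = RepeatedKFold(n_splits=10, n_repeats=5, random_state=0)   # k=10, m=5
scores = cross_validate(LinearRegression(), X, y, cv=cv,
                        scoring=("neg_mean_absolute_error",
                                 "neg_mean_squared_error"))
mae = -scores["test_neg_mean_absolute_error"].mean()   # average of 50 MAE values
mse = -scores["test_neg_mean_squared_error"].mean()    # average of 50 MSE values
```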

As a measure of model quality, mean absolute error (MAE) and mean squared error (MSE) were considered. MAE for target variable \(Y\) is defined as:

$$\mathrm{MAE}\left(Y\right)=\frac{1}{n}\sum_{i=1}^{n}\left|{y}_{i}-{\widehat{y}}_{i}\right|,$$

and MSE is defined as:

$$\mathrm{MSE}\left(Y\right)=\frac{1}{n}\sum_{i=1}^{n}{\left({y}_{i}-{\widehat{y}}_{i}\right)}^{2},$$

where \({y}_{i}\) denotes the observed and \({\widehat{y}}_{i}\) denotes the predicted value of the target variable \(Y\) for the \(i\) th observation (\(i=1, 2,\dots , n\), where \(n\) is the sample size) (Larose and Larose 2015).
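
These definitions translate directly into code; a small sketch with NumPy arrays of observed and predicted values:

```python
import numpy as np

def mae(y, y_hat):
    return np.mean(np.abs(y - y_hat))    # mean absolute error

def mse(y, y_hat):
    return np.mean((y - y_hat) ** 2)     # mean squared error
```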

3 Results

The final regression model built on the entire dataset is presented in Table 3. The fit of the model is very high, with the determination coefficient \({R}^{2}=0.921\), but, as was initially supposed, most of the variables, except the readings after 15’, 30’ and 4 h, are statistically insignificant. The same was the case with models built in the cross-validation procedure.

Table 3 Coefficients \(\beta\) of the regression model built on the entire dataset and their significances

Therefore, it was necessary to select variables and build models using the stepwise method, with the probability for inclusion equal to 0.05 and the probability for removal equal to 0.1. The models obtained in the cross-validation procedure contained the readings after 15’, 30’ and 4 h and usually one additional variable, e.g. the reading after 30’’, 2’ or 1 h. This model instability is due to the observed collinearity of the predictors.

The classification and regression tree models had a maximum depth of 5, with the minimum number of cases set to 10 in the parent node and to 5 in the child node. Most of the splits in the trees were based on the reading after 4 h, and the importance of the predictors, measured as the sum of the improvements over the splits based on a given variable, was lower for the earlier measurements.
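
For reference, an approximate scikit-learn counterpart of this configuration is sketched below; mapping the SPSS parent/child node minima onto min_samples_split and min_samples_leaf is an assumption, not an exact replication of the CRT implementation used:

```python
# Regression tree (CRT analogue) with the settings reported above.
from sklearn.tree import DecisionTreeRegressor

crt = DecisionTreeRegressor(criterion="squared_error",  # least squares deviation
                            max_depth=5,
                            min_samples_split=10,       # ~ min cases in parent node
                            min_samples_leaf=5)         # ~ min cases in child node
crt.fit(X, y)
importances = crt.feature_importances_  # sum of impurity improvements per predictor
```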

Predictors introduced into the ANN were standardized by subtracting the mean and dividing by the standard deviation (Z-score standardization). The RBF network had softmax as the activation function in the hidden layer and identity in the output layer. The obtained networks had 4 neurons in the hidden layer; this number was set automatically.

The analysis compared MLP models with one and two hidden layers and with different activation functions in the hidden layers (hyperbolic tangent and sigmoid) and the output layer (hyperbolic tangent, sigmoid and identity). The entire network training procedure was carried out only on the training set; in particular, the test set was not used to determine the moment of stopping the learning. The stop condition was a fixed number of 1000 learning epochs.
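
A minimal scikit-learn sketch of the variant with sigmoid hidden layers and identity output is given below; MLPRegressor always uses an identity output and trains by backpropagation, so it matches only this configuration, and the hidden-layer sizes here are placeholders (in the study they were chosen automatically). The standardization step mirrors the Z-score preprocessing described above:

```python
# MLP with sigmoid ('logistic') hidden activation and identity output,
# preceded by Z-score standardization of the predictors.
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.neural_network import MLPRegressor

mlp1 = make_pipeline(StandardScaler(),
                     MLPRegressor(hidden_layer_sizes=(8,),    # one hidden layer
                                  activation="logistic",      # sigmoid
                                  max_iter=1000,              # learning epochs
                                  early_stopping=False))      # fixed stop condition

mlp2 = make_pipeline(StandardScaler(),
                     MLPRegressor(hidden_layer_sizes=(8, 4),  # two hidden layers
                                  activation="logistic",
                                  max_iter=1000,
                                  early_stopping=False))
mlp1.fit(X, y)
```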

The quality of all models was evaluated using the tenfold cross-validation method repeated 5 times. The average MAE and MSE values were calculated and are presented in Table 4.

Table 4 Average MAE and MSE values obtained by repeating the tenfold cross-validation procedure 5 times for models of regression (REG), stepwise regression (SREG), classification and regression tree (CRT), radial basis function network (RBF), and multilayer perceptron (MLP) with one (1) or two (2) hidden layers, hyperbolic tangent (tanh) or sigmoid (sig) activation function for the hidden layers, and identity (id), hyperbolic tangent (tanh) or sigmoid (sig) activation function for the output layer

The best multilayer perceptron models, with one or two hidden layers, sigmoid activation in the hidden layers and identity in the output layer, adequately predict the hydrometer readings after 24 h, especially their positive values. For negative readings, the prediction may be slightly overestimated (Figs. 1 and 2).

Fig. 1

Scatterplot of the hydrometer reading after 24 h predicted by the multilayer perceptron with one hidden layer and sigmoid activation function for the hidden layer, plotted against the true value of this reading. Predicted values were obtained for one (of five) exemplary repetitions of the tenfold cross-validation procedure. The line \(\widehat{y}=y\) is marked

Fig. 2

Scatterplot of the hydrometer reading after 24 h predicted by the multilayer perceptron with two hidden layers and sigmoid activation function for the hidden layers, plotted against the true value of this reading. Predicted values were obtained for one (of five) exemplary repetitions of the tenfold cross-validation procedure. The line \(\widehat{y}=y\) is marked

MLP models with the sigmoid function in the hidden layers (one or two) and the identity function in the output layer were used to test the possibility of reducing the observation time to 2 h and to 1 h. In each of these cases, the measurement values at the later time points were predicted. The results are presented in Tables 5 and 6.

Table 5 Average MAE and MSE values obtained by repeating the tenfold cross-validation procedure 5 times for MLP models with one hidden layer. The observation time was successively shortened to 4 h, 2 h and 1 h, and on this basis the remaining readings were predicted by the model
Table 6 Average MAE and MSE values obtained as a result of repeating the tenfold cross-validation procedure 5 times for MLP models with two hidden layers

The square root of the mean squared error is 0.8264 for the MLP with one hidden layer and 0.8334 for the MLP with two hidden layers.

4 Discussion

The labour intensity and high cost of determining the particle size distribution using the hydrometer method have long led scientists to search for alternative methods. Although it is a basic test in engineering geology, it is of key importance for the further classification of the soil and the selection of more advanced laboratory tests.

Attempts to create alternative methods for determining the particle size composition were made, for example, by Barman and Choudhury (2020), who presented a classification of soil images using a multiclass SVM with a linear kernel function. However, they emphasize that this method is adequate solely for determining soil texture characteristics for agricultural purposes. The accuracy of the system has not been verified in determining the exact content of individual soil fractions, which is of key importance in geotechnical classification. Ghasemy et al. (2019) proposed a mathematical approach based on comparing the results of a combination of sedimentation and spectrophotometric methods. However, the laboratory tests were performed on only 17 samples and, although the initial assumptions were confirmed, the accuracy of the results was not determined. This makes it impossible to compare the results of this experimental method across all soil types.

Owji et al. (2014) showed that, for hydrometer readings using the Bouyoucos method, it is possible to shorten the reading time to 2 h, but only to determine the overall texture of the soil; such a procedure is not sufficiently precise. They also showed that each subsequent hydrometer reading significantly influences the determination of the content of the finer fractions, and it is precisely this content that is of key importance in the final classification of cohesive soils, which is very detailed and requires specifying the content of the clay and silt fractions with high accuracy. Adiku et al. (2005) indicated that the hydrometer reading at any time can be predicted from an exponential equation, provided that the reading after 4.5 min (R4.5) and an experimentally determined exponent B are known. The agreement between the calculated and measured R values, determined using their equations, was R2 = 0.96. However, this method is not universal for all soil types, only for those similar in type and genesis.

Fragomeni et al. (2021) used multiple regression analysis and stepwise regression analysis to evaluate the relationships between different geotechnical parameters and showed the possibility of developing predictive models whose effectiveness and reliability are better than others. They also indicated that their use saves time and money in laboratory research. On the other hand, Gołębiowska and Hyb (2008) formulated conclusions regarding the uncertainty of the parameters determined in the hydrometer analysis, i.e. the equivalent diameter (dT) and the corresponding particle content (ZT). They determined the mean uncertainty of dT to be 3% (dT ± 3% dT), and the uncertainty of the particle content (ZT) to be 8% of the determined value for hydrometer readings R below 5 (ZT ± 8% ZT) and 3% for larger readings (ZT ± 3% ZT).

The uncertainty of the dT and ZT values for the models developed in this research was determined by comparing the actual value of the equivalent particle diameter (dT_real), calculated from the actual reading (R_real), with the dT values calculated from the R values provided by the models. The predicted R values obtained in the five repetitions of the validation were averaged for the MLP with the sigmoid function in the hidden layers (one, R_ONE, or two, R_TWO) and the identity function in the output layer. On this basis, dT_ONE, dT_TWO, ZT_ONE and ZT_TWO were then calculated.

Subsequently, the differences between the real dT and ZT values and the values calculated from the predicted R readings were determined. The results are summarized in Table 7.
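
A hedged sketch of this comparison is given below, assuming the effective depth HR is a linear function of the hydrometer reading (a common calibration form, HR = A − B·R); all constants and readings are placeholders, so the numbers are illustrative only:

```python
# Illustrative comparison of dT computed from real vs. predicted 24 h readings.
# All constants below are placeholders, not the calibration used in the study.
import numpy as np

A, B = 0.20, 0.004            # hypothetical hydrometer calibration: H_R = A - B*R [m]
ETA = 1.0e-3                  # dynamic viscosity of water [Pa*s] at ~20 degC
RHO_S, RHO_W = 2680.0, 998.0  # soil skeleton and water density [kg/m^3]
G = 9.81                      # gravitational acceleration [m/s^2]
T24 = 24 * 3600.0             # sedimentation time of the 24 h reading [s]

def d_t(r, t=T24):
    """Equivalent diameter [m] for hydrometer reading r after time t (Stokes's law)."""
    h_r = A - B * r                      # effective depth travelled by the particles
    return np.sqrt(18.0 * ETA * h_r / ((RHO_S - RHO_W) * G * t))

r_real = np.array([0.5, 2.0, 6.5])       # hypothetical actual 24 h readings
r_pred = np.array([0.7, 1.8, 6.9])       # hypothetical averaged model predictions
print((d_t(r_real) - d_t(r_pred)).mean())  # mean dT difference, as in Table 7
```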

Table 7 The mean values of the differences between the real dT and ZT values and the dT and ZT values calculated from the predicted R values based on MLP with one hidden layer (MLP 1 sig id) and MLP with two hidden layers (MLP 2 sig id) with sigmoid activation function in hidden layers and identity in the output layer

The presented results show that both the dT and ZT values calculated from the predicted R values lie within the acceptable uncertainty limits for these determinations. In the case of ZT, a higher error is noticeable for readings R_real > 5, which is due to the much smaller number of samples whose reading after 24 h exceeded this value. Since the mean error for the entire dataset is similar to the error for readings R ≤ 5, the solution presented in this article can be used for the entire dataset without differentiating the samples by the value of the last R reading.

The machine learning methods used in this article (linear regression, CRT and MLP) treat the hydrometer readings as separate predictors; they do not take into account the fact that the readings were taken in a specific order, at moments separated by a known number of minutes. The problem of predicting hydrometer readings could probably also be analysed as a time series forecasting problem, for which more advanced dedicated techniques (e.g. recurrent neural networks) can be used. This may be the subject of further research. However, it should be borne in mind that the results may not be satisfactory, because the studied time series are short, consisting of observations at only a few time points. Moreover, the analysed series (hydrometer readings) are monotonic (non-increasing), which simplifies the situation and means that less advanced methods may be sufficient.

5 Conclusions

Optimization of the research process can be achieved through the construction of new equipment or the improvement of existing equipment or research methods. However, this process is neither simple nor fast, and such solutions are often unfavourable or not financially optimal. The use of statistical tools, including neural networks, is a much simpler, faster and cheaper solution. It only requires a sufficient amount of data and basic statistical software.

The methodology presented in this article demonstrates the possibility of using neural networks to predict hydrometer readings after 24 h, which allows the test to be significantly shortened and laboratory procedures to be optimized without compromising the credibility of the obtained results. The calculations also showed the possibility of predicting the hydrometer readings after 1 h, 2 h and 4 h. For these times the accuracy is lower, but it can still be used to determine the grain size composition of soils with less differentiation and a lower content of clay and silt fractions.

Considering the uncertainty of hydrometric determinations, the obtained forecast error is lower than this uncertainty, and therefore neural networks can be used to predict the results of this type of test. However, a precondition for a laboratory to use neural networks to predict readings is the collection of a sufficiently large database of full hydrometric test readings for soils that may differ in type and origin but occur in a defined geographical area. It is also recommended to periodically update and verify the model results by performing control tests.