Introduction

Climate change, and finding resilient solutions to mitigate its impact, is a major challenge for modern engineering. Transportation infrastructure is particularly exposed to unprecedented weather conditions such as sudden, heavy rainfall. Hydraulic earth structures, such as levees and dams, are likewise susceptible to damage [30, 50]. The primary potential failure modes include internal erosion, overtopping, and slope instability; internal erosion alone accounted for 45% of the 126 reported dam failure cases overseas [32]. The crushed rock shoulder, designed to reduce drop-off and enhance safety, is particularly affected. However, there is currently no method to rationally evaluate the erosion resistance of crushed rock shoulder materials, nor a test-based design method for selecting erosion-resistant shoulder materials.

One rational approach is to use hydrodynamics-based computer software such as FLOW-3D Hydro [17, 26, 38, 44, 51], which can estimate sediment transport with dedicated computational models. Different mechanisms, such as bedload transport, suspended load transport, entrainment, and deposition, are coupled through mass conservation. The software, however, requires many input parameters, such as particle diameter, sediment density, critical Shields number, bedload coefficient, entrainment parameter, roughness, molecular diffusion coefficient, and turbulent diffusion multiplier. Obtaining these parameters is not straightforward; it requires a solid understanding of the hydrodynamic conditions, making the software challenging to use in practice without proper training and experience.

Another common and simplified approach to describing erosion is the excess shear stress model presented in Eq. (1) [1, 2, 4, 5, 11, 15, 21,22,23,24,25, 29, 35, 36, 43, 45, 46].

$$\dot{\epsilon}_{r}={k}_{d}{\left({\tau }_{e}-{\tau }_{c}\right)}^{a}$$
(1)

where \(\dot{\epsilon}_{r}\) is the erosion rate (m/sec), \({k}_{d}\) is the erodibility coefficient (m³/N-sec), \({\tau }_{e}\) is the fluid-induced shear stress (Pa), \({\tau }_{c}\) is the critical shear stress (Pa), and “\(a\)” is an empirical exponent that depends on the soil type. The exponent “\(a\)” is suggested to be 1 for cohesive soils and 1.5 for non-cohesive soils [46]. Equation (1) is dimensionally correct only when “\(a\)” equals 1; despite this inconsistency, the equation is widely used in practice because of its simplicity. The excess shear stress model describes the erosion process with two parameters: the critical shear stress, which governs the ultimate erosion depth, and the erodibility coefficient, which governs the erosion rate. Extracting these parameters from erosion tests is not straightforward, however; it demands a data regression procedure and a thorough understanding of the testing method and the hyperbolic assumption underlying the theory. Once the critical shear stress and the erodibility coefficient are evaluated, the model can conveniently predict the erosion behavior of field soils.
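As a minimal illustration of Eq. (1), the sketch below evaluates the erosion rate once the two erosion parameters are known; the numerical values of \({k}_{d}\), \({\tau }_{c}\), and \({\tau }_{e}\) are hypothetical and serve only to demonstrate the calculation.

```python
def erosion_rate(tau_e, tau_c, k_d, a=1.0):
    """Excess shear stress model, Eq. (1): erosion rate in m/s.

    tau_e : fluid-induced shear stress (Pa)
    tau_c : critical shear stress (Pa)
    k_d   : erodibility coefficient (m^3/(N*s))
    a     : empirical exponent (1 for cohesive, 1.5 for non-cohesive soils)
    """
    excess = tau_e - tau_c
    # No erosion occurs below the critical shear stress
    return k_d * excess**a if excess > 0 else 0.0

# Hypothetical parameter values for illustration only
print(erosion_rate(tau_e=12.0, tau_c=5.0, k_d=1e-6, a=1.0))  # ~7e-6 m/s
```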

Different techniques have been developed to evaluate the erosion properties of soils, such as the traditional flume test [19, 20, 40,41,42], the Rotating Cylinder Test [34], the Hole Erosion Test [52], the Erosion Function Apparatus [12], the Jet Erosion Test [24], and the Mini Jet Erosion Test [24]. However, their inherent scale of erosion is not large enough to reproduce the erosion of shoulder rocks subjected to heavy rains.

The availability of big data and increased computational power have ushered in a new era of data-driven approaches that enhance engineering problem-solving abilities. Notably, AI-based machine learning techniques are gaining significant attention and acceptance as powerful tools to address these challenges. Machine learning models effectively handle highly nonlinear problems while revealing unknown correlations among various parameters [47, 48]. These models learn patterns between input and output data, and once these patterns are learned, they can be used to make predictions with reasonable accuracy [10]. Al-Swaidani et al. [6] employed machine-learning models to estimate the strength of problematic clayey soils treated with nano lime and nano pozzolan. They found that the ANN technique can effectively predict the California Bearing Ratio (CBR) and plasticity index of expansive clayey soils. Aregbesola and Byun [7] implemented different machine learning methodologies to classify geogrid reinforcement in stabilized and unstabilized aggregate specimens based on a few aggregate properties. Remarkably, all models achieved a minimum accuracy of 0.9 in predicting unstabilized specimens, and the results suggested that this methodology can effectively determine the type and presence of geogrid reinforcement in aggregates. A review paper by Fatehnia and Amirinia [16] explored the use of AI for predicting the load-bearing capacity of pile foundations. They argue that AI methods such as ANNs offer a significant advantage over traditional methods by providing more accurate predictions due to their ability to handle the complexities of material behavior. The ANN-based modeling approach has also been used to study soil erosion. Harris and Boardmann [27] proposed expert systems and ANNs as an alternative to traditional mathematical models for predicting erosion in the South Downs region of Sussex, England. ASCE [8] details the application of neural networks to rainfall-runoff modeling, streamflow forecasting, and reservoir operations. Licznar and Nearing [33] utilized ANNs to quantitatively predict the amount of soil erosion resulting from rainfall runoff on highway shoulders.

The erosion resistance of soils and rocks may vary due to differences in shape, size, angularity, and material composition. Therefore, employing machine learning methodologies such as ANNs to predict erosion resistance under specific conditions may offer distinct advantages over conventional testing methods, such as cost reduction, lower labor requirements, and time savings.

This study investigates the erosion resistance of commonly used highway shoulder materials through experimental testing and a machine-learning approach. Ten distinct gravel types/gradations were selected for erosion resistance testing. These tests were conducted using the large-scale University of Nebraska-Lincoln Erosion Testing Bed (UNLETB). The erosion results were classified into three performance categories: well-performing, poorly performing, and not acceptable. Because ANNs are data-hungry models, a method was devised to generate additional synthetic data within the bounds of the test results. The ANN model was then trained to predict the performance category of a rock material from parameters characterizing the gradation curve (D10, D30, D60, Cu, Cc). The accuracy of the developed model's predictions was evaluated using the hold-out method and the k-fold cross-validation method. Once trained and assessed, the model can be used to determine the suitability of rocks for erosion resistance on highway shoulders. Detailed discussions of the UNLETB system, test results, ANN training methodology, and performance verification are provided in the subsequent sections.

UNLETB and erosion test results

The idea for the University of Nebraska-Lincoln Erosion Testing Bed (UNLETB) was inspired by the University of Mississippi Erosion Testing Bed (UMETB) [27, 44], which was used to analyze the erosion of levee soils in New Orleans during Hurricane Katrina. The UNLETB concept is to capture the erosion profile under a plunging circular water jet with a waterproof video camera (GoPro 10), as shown in Fig. 1.

Fig. 1
figure 1

a Conceptual Design of UNLETB Front and Top Views, and b UNLETB After Fabrication

UNLETB consists of a large outer tank, sump pumps, PVC pipes, a sample box, a waterproof camera, and a glass plate. Considering that the flow should be sufficient to erode the particles and that the nozzle diameter should be relatively larger than the maximum particle size, the nozzle diameter and flow rate were designed to be 7.62 cm and 4000 cm³/sec, respectively. The tank is filled with water, and pumps circulate the water through the PVC nozzle as a jet onto a 20 cm × 20 cm × 20 cm sample box. The sample box has an acrylic face with 1 cm grids. A glass plate is placed atop the sample box until the flow stabilizes. Then, the camera is switched on to capture video images of the erosion process. Finally, the video images are analyzed frame by frame to determine the erosion depth at the desired time steps.

The materials for this study were selected based on the recommendations of the Nebraska Department of Transportation (NDOT), considering availability and the materials currently used in shoulders. The gradation curves and USCS (Unified Soil Classification System) symbols of the ten provided materials are shown in Fig. 2. Material names are kept as the local names provided by NDOT.

Fig. 2
figure 2

Gradation Curves of the Tested Materials

Gradation parameters such as D10, D30, and D60 represent the particle sizes corresponding to 10%, 30%, and 60% passing, respectively. Additionally, the coefficient of uniformity (Cu) and the coefficient of curvature (Cc) are commonly used in soil characterization and classification. These parameters are believed to affect the erosion of relatively large particles such as those used for highway shoulders. The gradation parameters of the tested materials are presented in Table 1; a short computational sketch of these parameters follows the table.

Table 1 Gradation parameters of the tested materials
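For reference, the gradation parameters in Table 1 follow directly from the gradation curve. The sketch below shows one way to interpolate D10, D30, and D60 (in log-size space, mirroring the logarithmic particle-size axis) and to compute Cu = D60/D10 and Cc = D30²/(D10·D60); the sieve data in the example are illustrative, not taken from the tested materials.

```python
import numpy as np

def gradation_parameters(sizes_mm, percent_finer):
    """Interpolate D10, D30, D60 from a gradation curve and compute Cu, Cc.

    sizes_mm      : particle sizes (mm), any order
    percent_finer : corresponding percent passing values
    """
    order = np.argsort(percent_finer)
    pf = np.asarray(percent_finer, dtype=float)[order]
    # Interpolate in log-size space, as gradation curves are plotted on a log axis
    log_d = np.log10(np.asarray(sizes_mm, dtype=float))[order]
    d10, d30, d60 = 10 ** np.interp([10, 30, 60], pf, log_d)
    cu = d60 / d10                # coefficient of uniformity
    cc = d30**2 / (d10 * d60)     # coefficient of curvature
    return d10, d30, d60, cu, cc

# Illustrative (hypothetical) sieve data: size in mm vs. percent finer
sizes = [25.0, 19.0, 9.5, 4.75, 2.0, 0.425, 0.075]
finer = [100, 92, 70, 48, 30, 12, 3]
print(gradation_parameters(sizes, finer))
```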

The erosion test results obtained with UNLETB for the selected materials are depicted in Fig. 3. The erosion curves are interpreted in terms of erosion depth and erosion rate. The step-like shape of the curves results from the dislodgement of roughly 1 cm rock particles, each registering as 1 cm of erosion. It is evident from Fig. 3 that while some materials, such as Gravel Surface Course, exhibit erosion depths reaching up to 20 cm, others, such as 1.5 in Rock Aggregate, show negligible erosion. The negligible erosion of that material may be attributed to the presence of large-sized crushed aggregates, which may not be suitable for highway shoulders due to potential tire damage. Based on the erosion depths depicted in Fig. 3, two groups of erosion curves can be observed. In the first group (green box in Fig. 3), the erosion depth of specimens varies between 5 and 10 cm, while in the second group (red box in Fig. 3), the erosion depths vary between 10 and 20 cm. Materials with lower erosion depths indicate higher erosion resistance, while those with higher erosion depths suggest lower erosion resistance.

Fig. 3
figure 3

Erosion Test Results from UNLETB

Accordingly, this work classifies the gradation curves into three categories according to experimental observations. The proposed categories of gradation curves are named well-performing (WP), poorly performing (PP), and not acceptable (NA) based on their erosion resistance performance, as depicted in Fig. 4.

Fig. 4
figure 4

Proposed Classification Categories

The left side of Fig. 4 illustrates the three proposed categories, with the well-performing curves enclosed in the green box, the poorly performing curves in the red box, and the not acceptable curves in the blue box. The corresponding erosion results for each box (gradation category) are highlighted in the same color in the right panel. Upon visual observation, it is evident that the erosion depths of the gradation curves within the green box remain within an acceptable range, those within the red box show excessive erosion depths, and the gradation curves within the blue box are deemed not acceptable because of their large particle size. This classification, based on the experimental data and visual observation, offers a simplified yet effective approach to categorizing the erosion performance of a material.

Synthetic data generation

Supervised machine learning methodologies rely on a rich dataset to accurately estimate the unknown function that maps inputs to outputs. The limited data from the tests presented a challenge for training the ANN model. To overcome this, a scheme is proposed to enlarge the database by generating synthetic data. The synthetic data for training the ANN model are generated by uniformly shifting an experimental gradation curve a small distance on the logarithmic scale, toward larger particle sizes (left) and toward smaller particle sizes (right), thus mimicking the gradation of a new material. The overall process of synthetic gradation curve generation is illustrated in Fig. 5.

Fig. 5
figure 5

a Synthetic Data Generation Process from a Single Curve and b All Synthetic Gradation Curves in their Respective Erosion Performance Groups

By utilizing this approach, all the original experimental gradation curves were used to create additional synthetic gradation curves, thereby increasing the diversity and quantity of the training dataset. Although a far larger number of synthetic data points could be generated with this strategy, 364 curves were deemed sufficient for training the ANN model in this work. Each synthetically generated gradation curve depicted in Fig. 5b was assigned to an erosion performance group, denoted by blue, green, and red colors representing NA, WP, and PP, respectively. The range of the synthetic gradation curves, as shown in Fig. 5b, spans from 10% to 90% percent finer because the input parameters (D10, D30, D60, Cc, Cu) required to train the ANN model fall within this range.

With knowledge of the D10, D30, D60 values of the synthetic gradation curves and their respective positions in the gradation plot (i.e., whether they fall into the well-performing (WP), poorly performing (PP), or not acceptable (NA) category), the ANN model was trained efficiently to predict erosion resistance performance.
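A minimal sketch of this shifting scheme is given below, assuming each gradation curve is stored as an array of particle sizes with fixed percent-finer values and that a multiplicative shift factor produces a uniform translation on the logarithmic size axis; the function names, step size, and example curve are hypothetical.

```python
import numpy as np

def shift_gradation_curve(sizes_mm, shift_factor):
    """Shift a gradation curve uniformly on the log size axis.

    shift_factor > 1 moves the curve toward larger particle sizes,
    shift_factor < 1 toward smaller sizes; percent-finer values are unchanged.
    """
    return np.asarray(sizes_mm, dtype=float) * shift_factor

def generate_synthetic_curves(sizes_mm, n_each_side=5, step=0.05):
    """Create synthetic curves by small shifts to both sides of the original."""
    curves = []
    for k in range(1, n_each_side + 1):
        curves.append(shift_gradation_curve(sizes_mm, (1 + step) ** k))   # coarser
        curves.append(shift_gradation_curve(sizes_mm, (1 + step) ** -k))  # finer
    return curves

# Hypothetical original curve sizes (mm); each synthetic curve inherits the
# erosion-performance label (WP, PP, NA) of the parent experimental curve.
original = [25.0, 19.0, 9.5, 4.75, 2.0, 0.425]
synthetic = generate_synthetic_curves(original)
print(len(synthetic))  # 10 synthetic curves from one experimental curve
```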

Machine learning approach

In this study, an ANN was used as a supervised machine learning model for predicting the erosion resistance behavior of different gradations of highway shoulder rocks. ANN models use interconnected nodes in a layered structure that mimics the way the human brain processes information. These models can learn from data sets and predict system behavior without prior knowledge of the input-output relationships [9, 39]. A feed-forward network trained with the backpropagation algorithm under supervised learning was implemented to predict erosion behavior. This method improves the ANN's prediction ability by continuously learning and adjusting the model based on new data through a corrective feedback system.

A multiclass classification ANN model predicting material suitability for highway shoulders was trained using the test data and the generated synthetic data. The performance of the trained ANN model in predicting erosion resistance was evaluated using the hold-out and k-fold cross-validation techniques. Additionally, the impact of different combinations of input parameters, network architectures, and other hyperparameters on the model's accuracy was tested. The overall workflow for extending the UNLETB findings by incorporating the erosion test results into an ANN-based system that accurately and conveniently predicts the erosion resistance of rocks of various gradations is shown in Fig. 6.

Fig. 6
figure 6

Flowchart for ANN-Based Erosion Resistance Prediction System

Identification of input parameters

It is crucial to include all input parameters that directly or indirectly influence the outputs of the ANN during the training process. Simultaneously, it is essential to eliminate redundant parameters from the high-dimensional dataset to reduce the dimensionality of the input space, thereby improving the predictive performance of the ANN model. This process is commonly referred to as the feature selection technique. There are five parameters (D10, D30, D60, Cu, and Cc) that characterize the gradation curve and can be used as the inputs for the ANN model in this study. Two combinations were tested in order to select the best set of input parameters. The first combination included gravel size parameters D10, D30, and D60 as the input parameters. In contrast, in the second combination, uniformity coefficient (Cu) and coefficient of curvature (Cc) were also included as inputs, in addition to D10, D30, and D60. The determination of the most effective combination for predicting erosion resistance performance will be based on the model test results.

Data preprocessing

The results from the sieve analyses, the erosion tests, and the synthetic data were collected in a database. It is generally good practice to normalize or pre-process the input features in the dataset before training to achieve better performance [53]. Pre-processing typically accelerates the learning process and balances the training across all variables by ensuring they are treated equally [18]. Therefore, all the data in the database were rescaled from their original range to a common range between 0 and 1 using the normalization equation given in Eq. (2), where Xnorm is the normalized value, Xinp is the actual input value, Xmin is the smallest value, and Xmax is the largest value in the input dataset. This way, the original distribution of the data was retained, but the scale was changed by applying a uniform scaling factor.

$${X}_{norm}= \frac{{X}_{inp}- {X}_{min}}{{X}_{max}- {X}_{min}}$$
(2)
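Eq. (2) is standard min-max scaling; a short sketch is given below, with illustrative feature values.

```python
import numpy as np

def min_max_normalize(x):
    """Rescale an input feature to the [0, 1] range, Eq. (2)."""
    x = np.asarray(x, dtype=float)
    return (x - x.min()) / (x.max() - x.min())

# Illustrative D60 values (mm) for a handful of gradation curves
print(min_max_normalize([4.0, 9.5, 12.7, 19.0, 25.0]))
```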

The outputs of the ANN model are categorical data (WP, PP, NA), so they must be encoded to numerical values before use in training. A popular technique, one-hot encoding [37], was used, where each label (class) is mapped to a binary vector. To achieve this, each categorical value was first transformed into an integer value, and each integer value was then represented as a binary vector in which all elements are set to zero except for the element at the index corresponding to the integer value, which is set to one, as shown in Fig. 7.

Fig. 7
figure 7

One Hot Encoding of Categorical Data
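A compact sketch of the described two-step encoding (label to integer, then integer to binary vector), implemented here with plain NumPy; the class ordering WP, PP, NA is an assumption for illustration.

```python
import numpy as np

def one_hot_encode(labels, classes=("WP", "PP", "NA")):
    """Map categorical labels to integers, then to binary vectors (one-hot)."""
    index = {c: i for i, c in enumerate(classes)}   # e.g. WP -> 0, PP -> 1, NA -> 2
    encoded = np.zeros((len(labels), len(classes)))
    for row, label in enumerate(labels):
        encoded[row, index[label]] = 1.0            # set only the matching position
    return encoded

print(one_hot_encode(["WP", "NA", "PP"]))
# [[1. 0. 0.]
#  [0. 0. 1.]
#  [0. 1. 0.]]
```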

Model configuration

The ANN model was created in a three-step process: (1) training using 70% of the data, (2) testing using 15% of the data, and (3) validating using the remaining 15% of the data in the database. If the model meets the training criteria in the first step, it moves on to the testing step, where its performance is evaluated using a previously unseen test dataset. If the model does not meet the training criteria, it returns to the first step for additional training. The type and number of hyperparameters used in the ANN model were decided by trial and error. Burian et al. [14] suggest that the accuracy of an ANN model and its overall performance tend to improve with a decrease in the number of hidden neurons. Selecting an appropriate architecture, that is, determining the optimal number of layers and the number of nodes within each layer, is a crucial and challenging aspect of developing a neural network model. There is no standard process for determining the optimal ANN architecture. Hence, the ANN architecture (number of layers and neurons) was gradually increased, starting with a single hidden layer containing three neurons, until no further improvement in the predictions was observed. The proposed neural network used the Rectified Linear Unit (ReLU) activation function in the hidden layer, known for its speed and improved performance [3]. The output layer of this multiclass classification ANN model represents the class vector, a task facilitated by the softmax activation function. As suggested by Bridle [13], the softmax function transforms the vector of numerical outputs into probabilities. The architecture of the proposed ANN model is shown in Fig. 8.

Fig. 8
figure 8

Architecture of the Proposed Neural Network

To train the constructed ANN model, the Adam optimizer proposed by Kingma and Ba [31] was used. The Adam optimizer is based on the stochastic gradient descent method and is known for its computational efficiency and low memory requirements. The configuration parameters (alpha, beta1, beta2, and epsilon) of this optimizer were kept at their default values. The categorical cross-entropy loss function was used during training to calculate the difference between the predicted probabilities of the classification model and the actual outputs. The equation for the categorical cross-entropy loss function is presented in Eq. (3), where \({y}_{i}\) represents the actual output for a given input and \(\widehat{{y}_{i}}\) denotes the predicted probability for that input.

$$Loss= -\sum_{i=1}^{\text{output size}}{y}_{i}\,\text{log}(\widehat{{y}_{i}})$$
(3)
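The paper does not state which software framework was used to build the network. As one possible implementation, the sketch below reproduces the described configuration in Keras: three inputs (D10, D30, D60), a single ReLU hidden layer, a three-class softmax output, the Adam optimizer with default settings, and the categorical cross-entropy loss of Eq. (3). The eight hidden neurons correspond to the best-performing model reported later in the Results.

```python
from tensorflow import keras
from tensorflow.keras import layers

# One possible implementation of the described configuration (the framework used
# in the study is not stated): 3 inputs (D10, D30, D60), one ReLU hidden layer,
# and a 3-class softmax output for the WP, PP, and NA categories.
model = keras.Sequential([
    keras.Input(shape=(3,)),
    layers.Dense(8, activation="relu"),     # 8 neurons: best model per the Results
    layers.Dense(3, activation="softmax"),  # class probabilities via softmax
])

# Adam optimizer with default settings and categorical cross-entropy loss, Eq. (3)
model.compile(optimizer="adam",
              loss="categorical_crossentropy",
              metrics=["accuracy"])

# Hypothetical training call on normalized inputs and one-hot labels:
# model.fit(X_train, y_train, epochs=300, validation_split=0.15, verbose=0)
```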

Validation of trained model

Two validation techniques were used to evaluate the performance of the model. The first is the hold-out method, in which the data are divided into separate sets: one for training and the other for testing. However, this technique can sometimes lead to a biased estimate of model performance. For this reason, a second validation technique, k-fold cross-validation proposed by Jung and Hu [28], was also used. In k-fold cross-validation, the dataset is divided into k subsets. The hold-out method is then repeated k times, with each of the k subsets serving once as the test set while the remaining k-1 subsets are used for training the model. The value of k in this work was set to five, and the average performance over all k tests was calculated. Both validation techniques are illustrated in Fig. 9.

Fig. 9
figure 9

Validation Techniques Employed for ANN Model
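A brief sketch of the five-fold cross-validation loop, assuming NumPy arrays X (normalized inputs) and y (one-hot labels) and a hypothetical build_model() helper that returns a freshly compiled network such as the one sketched earlier:

```python
import numpy as np
from sklearn.model_selection import KFold

def k_fold_accuracy(build_model, X, y, k=5, epochs=300):
    """Average test accuracy over k folds.

    build_model : hypothetical helper returning a freshly compiled model
    X, y        : NumPy arrays of normalized inputs and one-hot labels
    """
    scores = []
    for train_idx, test_idx in KFold(n_splits=k, shuffle=True, random_state=0).split(X):
        model = build_model()                              # new model for each fold
        model.fit(X[train_idx], y[train_idx], epochs=epochs, verbose=0)
        _, acc = model.evaluate(X[test_idx], y[test_idx], verbose=0)
        scores.append(acc)
    return float(np.mean(scores))                          # average performance
```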

Evaluation of ANN model

The number of accurate predictions made by an ANN model is used to assess the model's robustness in classification problems. It plays a critical role in determining the model's classification accuracy and overall reliability. For this purpose, a confusion matrix [49] is commonly used to evaluate the performance of an ANN classification model and indicate how well the model predicts the correct class label for a data set. Each row in the confusion matrix represents the actual class of the samples, and each column represents the predicted class. A confusion matrix typically contains four elements: true positive (TP), true negative (TN), false positive (FP), and false negative (FN). A confusion matrix for two output predictions is presented in Fig. 10.

Fig. 10
figure 10

A Basic Confusion Matrix

TP represents the number of specimens correctly predicted as positive by the model and lies on the diagonal of the confusion matrix. TN refers to the number of specimens correctly predicted as negative, determined by adding up the values in all rows and columns except the row and column of the class in question. FP represents the number of specimens incorrectly predicted as positive, calculated by summing all the values in the column of that class excluding the TP value. FN refers to the number of specimens incorrectly predicted as negative, determined by adding all the values in the row of that class excluding the TP value. The TP, TN, FP, and FN values in the confusion matrix are used to evaluate the ANN model performance through various metrics, such as Accuracy, Precision, Recall, and F1 score, and to identify areas for improvement.
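The per-class counts described above can be obtained directly from a multiclass confusion matrix; the sketch below assumes rows are actual classes and columns are predicted classes, and the example values are loosely based on the averaged matrix reported later in Fig. 12.

```python
import numpy as np

def class_counts(cm):
    """Per-class TP, FP, FN, TN from a multiclass confusion matrix
    (rows = actual classes, columns = predicted classes)."""
    cm = np.asarray(cm, dtype=float)
    tp = np.diag(cm)              # correct predictions lie on the diagonal
    fp = cm.sum(axis=0) - tp      # column sum minus TP: wrongly predicted as this class
    fn = cm.sum(axis=1) - tp      # row sum minus TP: this class predicted as another
    tn = cm.sum() - tp - fp - fn  # everything else
    return tp, fp, fn, tn

# Illustrative values loosely based on the averaged matrix in Fig. 12 (WP, PP, NA order)
cm = [[27.0, 0.6, 0.0],
      [0.0, 25.0, 0.0],
      [0.0, 0.0, 22.0]]
print(class_counts(cm))
```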

The accuracy metric represents the overall accuracy of the multiclass classification ANN model. It is defined as the total number of correct predictions (the sum of TP over all classes) divided by the total number of predictions, which equals the sum of TP and FP over all classes. The precision metric evaluates the model's ability to identify positive cases accurately and measures the accuracy of predicting a particular class. It is calculated by dividing the number of TP predictions for a class by the sum of the TP and FP predictions. The recall metric represents the ability of the model to detect all positive cases. It is calculated as the number of TP predictions divided by the sum of the TP and FN predictions. The F1 score metric combines the precision and recall measurements into a single value. It is calculated as the harmonic mean of the precision and recall values, which gives equal weight to both measurements. A high F1 score means the model performs well in both identifying positive cases accurately and detecting all positive cases. A low F1 score, on the other hand, indicates the model needs improvement in either precision or recall. The accuracy, precision, recall, and F1 score can be calculated using Eqs. (4)-(7).

$$Accuracy=\frac{\sum \text{True Positives}}{\sum \text{True Positives}+ \sum \text{False Positives}}$$
(4)
$$Precision = \frac{\text{True Positive}}{\text{True Positive}+\text{False Positive}}$$
(5)
$$Recall= \frac{\text{True Positive}}{\text{True Positive}+\text{False Negative}}$$
(6)
$$F1\, Score= 2\times \frac{Precision\times Recall}{Precision+Recall}$$
(7)
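Using the class-based counts, Eqs. (4)-(7) can be evaluated directly; the sketch below, fed with the counts in Table 3, should reproduce the metrics reported later in Table 4.

```python
import numpy as np

def classification_metrics(tp, fp, fn):
    """Per-class precision, recall, F1 (Eqs. 5-7) and overall accuracy (Eq. 4)."""
    tp, fp, fn = (np.asarray(v, dtype=float) for v in (tp, fp, fn))
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    accuracy = tp.sum() / (tp.sum() + fp.sum())   # correct predictions over all predictions
    return precision, recall, f1, accuracy

# Class-based counts from Table 3 (WP, PP, NA order)
print(classification_metrics(tp=[27, 25, 22], fp=[0, 1, 0], fn=[1, 0, 0]))
# precision ~ [1.00, 0.96, 1.00], recall ~ [0.96, 1.00, 1.00],
# F1 ~ [0.98, 0.98, 1.00], overall accuracy ~ 0.99
```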

Results

ANN training was initially set to run for 1000 iterations, but little improvement in the loss value was observed after 300 iterations. As a result, the network was trained for 300 epochs. The best-performing ANN model was determined after conducting trials with two combinations of input parameters, different numbers of hidden layer neurons, and two different validation methods. Table 2 summarizes the various models and the optimal parameter values for each.

Table 2 Various ANN models based on different parameters

As shown in Table 2, there is little difference in the accuracy values when the Cu and Cc parameters are included in the input layer. Therefore, to reduce the size of the ANN model and the associated computational time, the first input combination with three variables (D10, D30, D60) was selected. The accuracy values obtained with hold-out validation and k-fold cross-validation differed slightly, mostly because hold-out validation is susceptible to variance, especially when the dataset is small. Therefore, the results from k-fold cross-validation were accepted as an accurate estimate of the model's generalization performance because this method can better detect overfitting. Various trials were conducted using combinations of different numbers of hidden layer neurons and epochs. Initially, training started on a model with three neurons in the hidden layer trained for 32 epochs. Using this model, an accuracy of approximately 67% was achieved. The model's performance improved as the number of neurons and epochs increased. The model with eight neurons in the hidden layer, trained for 300 epochs, showed the best performance among all the models. Therefore, the architecture of the best-performing ANN model consisted of an input layer with three input parameters (D10, D30, D60), one hidden layer with eight neurons, and one output layer with three outputs. The accuracy of this ANN model reached 99% on both the training and testing data, indicating that the model does not suffer from overfitting. As a result, the overall relative improvement in the accuracy of the model was about 48% compared with the initial model. Using the ANN model, the erosion resistance behavior of rocks can be predicted with 99% accuracy based on the D10, D30, and D60 values obtained from a sieve analysis test. Figure 11 illustrates the improvement in the accuracy of the best trained model as the number of iterations increases.

Fig. 11
figure 11

Accuracy vs. Epochs of the Trained Model

Performance evaluation

The average confusion matrix of the validation dataset using k-fold cross-validation is shown in Fig. 12. This matrix compares the actual target values (true labels) with those predicted by the ANN model and gives a holistic view of how well the classification model works and what types of mistakes are present. The main diagonal cells (top left to bottom right) represent the average number of correctly predicted outputs, while the off-diagonal cells represent the average number of wrongly predicted outputs across all five folds. As is clearly visible from Fig. 12, most of the off-diagonal values are zero, indicating that the ANN model exhibits excellent classification capabilities. Out of 75 instances in the validation dataset, 74 (27 + 25 + 22) correct predictions were made, as presented on the main diagonal. The cell in the first row and second column indicates that, on average, 0.6 samples (roughly one sample) were incorrectly predicted as belonging to the PP class when, in reality, they belong to the WP class. The model accurately predicted the rest of the instances in the PP and NA classes.

Fig. 12
figure 12

Average Confusion Matrix

The data in the confusion matrix can be used to evaluate the model performance, first by calculating the matrix elements, i.e., TP, TN, FP, and FN, and then by calculating various metrics based on Eqs. (4)-(7). The values of these confusion matrix elements are presented in Table 3.

Table 3 Class-based true positives, true negatives, false positives, and false negatives

The TP values for the WP, PP, and NA classes from Table 3 indicate that the model made accurate predictions of 27 instances being positive for the WP class, 25 instances being positive for the PP class, and 22 instances being positive for the NA class. The TN value for the WP class was 45, meaning the model correctly identified 45 instances as not belonging to the WP class. Similarly, the TN values for the PP and NA classes were identified as 49 and 53, respectively. The FP values for the WP, PP, and NA classes were calculated as 0, 1, and 0, respectively, meaning the model made one incorrect prediction for the PP class, labeling an instance as positive (i.e., belonging to the PP class) when actually it was negative (i.e., not belonging to the PP class). The FN value of 1 for the WP class was identified, meaning the model incorrectly predicted an instance in the WP class as negative (i.e., not belonging to the WP class) when, in fact, it was positive (i.e., belonging to the WP class).

The performance of the ANN model was quantified using the evaluation metrics given in Eqs. (4)-(7): the overall accuracy and the class-based precision, recall, and F1 score. The values of these calculated metrics are presented in Table 4.

Table 4 Class-based precision, recall, F1 score values and overall accuracy

The precision values for the WP, PP, and NA classes were determined as 1, 0.96, and 1, respectively. This indicates that 100% of the instances predicted as WP or NA were correct, while 96% of the instances predicted as PP were correct. The recall values for the WP, PP, and NA classes were 0.96, 1, and 1, respectively, meaning that 96% of the actual WP cases were correctly identified by the model, while all actual PP and NA cases were identified, indicating perfect recall for those classes. The F1 scores for the WP, PP, and NA classes were 0.98, 0.98, and 1, respectively. An F1 score of 0.98 for WP and PP suggests that the model achieved a good balance between precision and recall, while the perfect F1 score of 1 for NA indicates perfect precision and recall for that class. Finally, the overall accuracy of the ANN model was calculated to be 99%, confirming that 99% of all predictions were correct.

Conclusion

In this paper, the erosion resistance of highway shoulder rocks was evaluated based on experimental studies conducted using the large-scale UNLETB. An ANN multiclass classification model was developed to enable convenient prediction of the erosion performance of shoulder rocks without requiring specialized testing equipment. The model was trained with a successful strategy for generating synthetic data, with the aim of categorizing the erosion performance of rock materials into three groups: Well Performing (WP), Poorly Performing (PP), and Not Acceptable (NA). The best-performing ANN model was obtained after testing various combinations of input parameters, model architectures, training iterations, and validation techniques. The performance of the trained model was assessed using evaluation metrics such as class-based precision, recall, F1 score, and overall accuracy. Based on the results obtained from this study, the following conclusions can be drawn:

  • The ANN model achieved 99% accuracy in its predictions and successfully distinguished the different erosion behaviors of shoulder materials across the three performance groups (WP, PP, and NA) based on information from the gradation curves. Extensive testing of the model's performance using various evaluation methodologies yielded exceptionally favorable outcomes.

  • The successful implementation of the ANN classification model, combined with its ability to accurately categorize erosion into three groups, highlights the potential for the application of machine learning techniques in solving complex problems in the field of geotechnical engineering.

  • This work provides valuable insights into the behavior of shoulder rocks under erosion and can support engineers and researchers in making informed decisions regarding shoulder material selection where erosion resistance is required.