Introduction

An abrasive belt grinding comprises a belt grinding and a contact control wheel that serves as a machining tool designed to perform grinding, deburring, and finishing operations of any metallic material (Wang et al., 2022a). In the past few years, abrasive belt grinding has gradually developed in high-precision manufacturing industries with advanced equipment (Zhu et al., 2018). In particular, for a turbine blade manufacturing industry for aero-engines and their combustion chambers, even the supporting parts for the combustion chambers such as a bose-hole in Trent 700 combustor, the abrasive belt grinding machining process has shown to be appropriate for complex and curved products due to its capability to adapt to the workpiece surfaces and its low-temperature effect (Caesarendra et al., 2018a). Moreover, supported by the development of advanced scientific technologies including modern control applications and robotic arms with multiple axes, it is possible to work on complex surfaces while maintaining uniform quality (Caesarendra et al., 2018b).

The outcome of a grinding process using an abrasive belt to improve the quality of the workpiece depends on the physical condition of the abrasive belt and the parameters applied during the machining process such as grit roughness size, belt rotation speed, feed rate, compressive force, hardness of the contact wheel to surface of workpiece (Pandiyan et al., 2017). Understanding the five main factors is a challenging factor in the high-precision manufacturing industry because of its high nonlinearity and complexity (Pandiyan et al., 2020a). A worn-out grinding belt greatly affects material removal and the quality of workpiece surface roughness, however, once the workpiece surface material is removed, it will also eventually intensify the wear on the belt surface. Wear causes an abrasive belt to change its original roughness that affects the interaction between the grinding belt and workpiece which in turn seriously reduces grinding performance and surface quality (Chen et al., 2022).

Direct and indirect sensors are two technology clusters for monitoring the condition of systems including the abrasive belt (Kurada & Bradley, 1997). Direct sensing provides an immediate and accurate assessment of the belt grinding condition, but it has limited applications by being out of reach during the machining process. On the other hand, indirect sensors are capable of capturing the occurrence of process parameters such as vibration, force, temperature, etc., calculated and analyzed for belt grinding conditions based on reliable signal characteristics and correlated tool wear conditions. In an automated machining process, monitoring equipment offers a reduction in line production downtime and maintains product quality (Caesarendra et al., 2021). An experiment of online drill bit wear monitoring was carried out to investigate the effect of drill bit wear on cutting force when drilling copper alloy workpieces (Klaic et al., 2018). Meanwhile, the dominance of the average coefficient of cutting force in the tangential direction and flank wear has been investigated by Liang et al., (2022) by combining cutting parameters such as feed per tooth, depth, and cutting speed to analyze milling tool wear. A more sensitive monitoring feature with a residual variation auto-encoder based on multiple sensor signals has been developed to monitor carbide-coated tool wear conditions in impeller machining processes (Ou et al., 2022).

Acoustic Emission (AE) sensors have been applied in surface finishing to capture acoustic waves from the interaction of the abrasive grinding wheel with a workpiece to determine the quality of the polishing process (Wang et al., 2022b). To verify more clearly, Nakai et al., (2015) have calculated the condition of the grinding wheel based on the AE and force sensors which receive signals from the machining process using an intelligent system. Furthermore, Twardowski et al., (2021) have identified the wear of diamond-coated carbide milling tools based on focused acoustic emission signals across the entire vibration frequency spectrum into a classification tree algorithm for the learning process. A series of previous studies have revealed monitoring of tool wear with hard types and rigid grinding using sensor equipment systems and coding algorithms to predict decisions as listed in Table 1. Tool condition monitoring has not been considered the appropriate tool; therefore, the proposed research is to develop suitable abrasive belt condition monitoring to reduce the research gap. Monitoring the condition of belt grinding is a highly appropriate step in ensuring the quality of machining processes. By proactively monitoring, we can identify potential issues or wear on the belt grinding equipment at an earlier stage. This aids in preventing further damage, minimizing downtime, and, of course, maintaining the quality of machining outcomes. Thus, the adage "prevention is better than cure" holds true in this context.

Table 1 Experiment in grinding and cutting tool wear monitoring throughout 2022

Most of the previous research explored the wear conditions of grinding and cutting tools in the machining process while monitoring tool conditions for grinding belts were still limited to investigations. Machining processes using traditional rigid grinding have been developed into abrasive belt grinding with its advantages including the ability to reach complex workpiece surface geometries (Pandiyan et al., 2020b). Grinding belts are equipped with polymer wheels for contact control and belt placement tracks that cover complex surface areas. Due to the dynamic and interrelated operational parameters, the grinding process with abrasive belts is non-linear and non-uniform (Pandiyan & Tjahjowidodo, 2017).

In the context of the grinding process with abrasive belts, non-linearity refers to the fact that the relationship between the input parameters (such as belt speed, pressure, and grit size) and the output parameters (such as surface roughness, material removal rate, and grinding force) is not linear. This means that a change in one input parameter does not result in a proportional change in the output parameter. Instead, the relationship between the input and output parameters is complex and can be influenced by other factors such as the properties of the workpiece material and the abrasive belt. On the other hand, non-uniformity refers to the fact that the grinding process with abrasive belts is not uniform across the entire surface of the workpiece. This means that different parts of the workpiece may experience different levels of material removal, surface roughness, and other output parameters. The non-uniformity can be caused by various factors such as the wear of the abrasive belt, the unevenness of the workpiece surface, and the non-uniform distribution of the abrasive particles on the belt (Pandiyan et al., 2020a).

Abrasive belt grinding has three main components, i.e., a force control wheel, an abrasive-coated belt, and a prime mover wheel. Depending on the application, a belt grinding is selected based on its grit size and it is used continuously until it reaches its wear limit before it is replaced by the new one. The performance of the belt grinding process is influenced by the feed rate, depth of feed, compressive force, direction, and feeding angle (Zhang et al., 2005). Although the factors affecting the performance of abrasive coated belts have been detected, compared to other types of tools, belt grinding tools exhibit apparent difficulties in managing wear during their operating time. Belt wear causes degradation of the abrasive grains that result in decreasing grinding performance and the workpiece surface becomes defective (Pandiyan et al., 2016).

Belt grain wear could occur massively, and it has a serious impact on tool integrity for surface smoothing, therefore, Pandiyan et al. (2019) detected sensor parameter data during the milling process and then analyzed it with a decision algorithm from Machine Learning (ML) to monitor belt condition. Belt grinding that loses its integrity can greatly differ the mechanism of contact between an abrasive grain and workpiece surface, whereby as wear of the grains grows, the function of grinding abrasive changes from cutting ability to only shearing and plowing (Pandiyan & Tjahjowidodo, 2019). The common strategy for monitoring tool wear status is by practicing a conventional method, namely checking directly by interrupting the process relying on the capability of employee operators. The entire production process might come to a complete halt due to measurement activities and assessing the eligibility of the tool to maintain workpiece quality, especially if it requires unloading and loading back to its original position with difficult reference settings. The erratic challenge of belt wear demands manufacturing machinery to be equipped with intelligent decision methods. This research is proposed to reveal monitoring of abrasive belt grinding using ML which analyzes the vibration signal of the accelerometer and force sensors.

Manufacturing process automation embedded with artificial intelligence might be a breakthrough to address the challenges of abrasive belt grinding, increasing productivity and reducing human intervention. In recent years, intelligent tool monitoring, especially in abrasive belt grinding, has attracted many researchers to reveal how to ensure tool life, as Pandiyan et al. (2018) have explained the evolution of degradation by classifying Genetic Algorithm (GA) and Support Vector Machine (SVM) from the accelerometer and compressive force data in the belt grinding process installed on the robotic arm. During surface finishing by abrasive belt grinding, Discrete Wavelet Decomposition (DWD) and Fast Fourier Transform (FFT) decomposed the raw signal simultaneously captured from an omnidirectional capacitance microphone. Moreover, the condition of the belt based on the response of the extracted sound signal is modeled using Optimally Pruned Extreme Learning Machine (OP-ELM) to predict the wear factor of belt grains (Zhang et al., 2018). Numerous researchers have employed acoustic emission installed in research objects, for instance, an application of sound signals to detect grinding belt wear has been reported by Zhang et al. (2015) by building a regression model with classification in SVM to predict belt condition utilizing OP-ELM. Cheng et al. (2020) demonstrated that the integration of sound and current signals on the abrasive belt significantly enhances the accuracy of tool wear prediction. They constructed a Bayesian network through sensor fusion and compared it with 315 experimental data. Their findings revealed that Random Forest (RF) models can generate more precise predictions. Similarly, Wu et al. (2017) implemented a strategy involving various ML techniques, including Support Vector Regression (SVR), Artificial Neural Network (ANN), Random Forests (RF), and feed-forward back propagation.

Despite significant advancements in recent decades, particularly in robotic belt grinding of superalloys, there remains a scarcity of advanced techniques and methods to achieve precision profile finishing and desired properties (Ren et al., 2023). Li et al. (2017) introduced a novel approach for modelling and controlling automatic grinding processes with belt grinding tools; however, its trajectory accuracy and rigidity pose challenges, limiting its implementation for accurate grinding. While industrial robots offer flexibility, accuracy, and consistency in machining operations, their implementation encounters various limitations, including material and tool diversity, low adaptability to environmental changes, flexibility issues, complex tool path preparation processes, and quality control challenges (Makulavičius et al., 2023). A prominent restriction in robot-based machining tasks is inadequate rigidity at the tool center point (TCP), impacting machining accuracy (Verl et al., 2019). Despite a comprehensive review of calibration and measurement techniques, trajectory planning, force control, and surface integrity analysis by Zhu et al. (2020), issues persist regarding grinding accuracy and poor surface quality.

In this paper, an abrasive belt grinding system integrated into a robot arm equipped with an accelerometer is established, based on vibration signal response experiments carried out to determine the prediction of belt grain wear. The purpose of this study is to monitor the condition of the abrasive belt grinding predicted using ML codes from correlation data of vibration and force sensors in the three directions and to study the potential to incorporate it in an online monitoring system in the future. The paper is structured as follows: Section "Introduction" provides a literature review on a brief development of tool monitoring methods which focused on monitoring abrasive belt grinding using the ML method, while Section "Machine learning models" reveals several ML methods used in this study including K-Neural Networks (KNN), Support Vector Machine (SVM), Multi-layer Perceptron (MLP), and Decision Tree (DT), while the data captured by sensors discussed in Section "Materials and methods", and processed into ML method is discussed in Section "Experimental Settings and Discussion". Furthermore, Section "Conclusions" presents the conclusions of this study.

Machine learning models

The present investigation used the following machine learning classification and prediction algorithms:

K-nearest neighbors (KNN)

KNN, a common classification algorithm for non-parametric behavior patterns like tool conditions, predicts data groups by calculating distances between tested and training points. Selecting K points close to the tested data, it computes probabilities for the training data group, choosing the group with the highest score (Junior et al., 2018).

The default distance metric of the nearest neighbour algorithm is the Euclidean distance, the so-called L2 distance, which calculates the distance between two points, x[a] and x[b] that can be written as follows:

$$ d\left( {{\text{x}}^{\left[ a \right]} ,{\text{x}}^{\left[ b \right]} } \right) = \sqrt {\mathop \sum \limits_{j = 1}^{m} \left( {x_{j}^{\left[ a \right]} - x_{j}^{\left[ b \right]} } \right)^{2} } , $$
(1)

In constructing a classification, the simple embodiment of the KNN method tries to predict a target class label as the class label most frequently represented among the K training examples closest to the specified query point.

Support Vector Machine (SVM)

It is an aggregate of supervised learning methods as a robust classification, outliers’ detection, and an efficiency regression technique that maximizes model prediction accuracy.

All data are separated linearly, hyperplane f(x) = 0 can be written as,

$$ f\left( {\text{x}} \right) = {\text{w}}^{{\text{T}}} {\text{x}} + b = \mathop \sum \limits_{j = 1}^{n} {\text{w}}_{{\text{j}}} {\text{x}}_{{\text{j}}} + b = 0, $$
(2)

where the w component is an n-dimensional vector, and the b component is a scalar number that determines the optimal separation hyper-plane that renounces the maximum margin of both classes. Kernel tricks are utilized in non-linear classification data problems where they transform the incoming data into a larger dimensional feature space to build the possibility of applying linear separation. The non-linear nature of kernel functions is used in the SVM method ϕ: XF converts data from the input data space X to features space F. In space F, the discriminant function can be written as follows:

$$ f\left( {\text{x}} \right) = {\text{w}}^{{\text{T}}} \emptyset {\text{x}} + b, $$
(3)

SVM with non-linear data can be derived by classifying it to be a linear partition by mapping correctly the input data x (in matrix form) into the features space and training SVM for mapped features ϕ(x). From the training examples, linear combinations can articulate weight vectors, i.e., \({\text{w}}=\sum_{i=1}^{n}{\alpha }_{i}{{\text{x}}}_{{\text{i}}}\) accordingly Eq. 3 takes the form as the following formula:

$$ f\left( {\text{x}} \right) = \mathop \sum \limits_{i = 1}^{n} \alpha_{{\text{i}}} {\text{x}}_{{\text{i}}}^{T} {\text{x}} + b, $$
(4)

In the features space, F this expression takes the formula as follows:

$$ f\left( {\text{x}} \right) = \mathop \sum \limits_{i = 1}^{n} \alpha_{i} \emptyset ({\text{x}}_{{\text{i}}} )^{{\text{T}}} \emptyset ({\text{x}}_{{\text{j}}} ) + b, $$
(5)

Afterward, nonlinear SVM can be trained by exchanging the results of Eq. 2 with kernel analogues K (xi, xj).

$$ K\left( {{\text{x}}_{{\text{i}}} ,{\text{x}}_{{\text{j}}} } \right) = \emptyset ({\text{x}}_{{\text{i}}} )^{{\text{T}}} \emptyset ({\text{x}}_{{\text{j}}} ), $$
(6)

Furthermore, the resulting classifier for non-linear SVM is represented by the kernel function as follows:

$$ f\left( {\text{x}} \right) = \mathop \sum \limits_{i = 1}^{n} \alpha_{i} K({\text{x}}_{{{\text{i}},}} {\text{x}}_{{\text{j}}} ) + b, $$
(7)

Multi-layer perceptron (MLP)

Each neuron k at layer l for the input x can be derived using Eq. 8. The weight vector learned at each mini-batch is denoted as W while the bias for each neuron is denoted as b. Each input value will be passed to the non-linear activation function denoted as g, which is in our case, the ReLU activation function.

$$ h_{k}^{l} \left( x \right) = g(h^{l - 1} \left( x \right)W_{k}^{l} + b), $$
(8)

The training process will be undertaken by using an optimization technique called gradient descent (Agarap, 2018) which processes the score from the loss function. The learning mechanism for parameter w is based on the gradient of the loss function L that is shown in Eq. 9. The time for updating the parameter is denoted by t while the learning rate that controls the influence from the gradient is denoted by α. The biased learning also proceeds with a similar process by taking the gradient from the loss function L.

$$ w^{t} = w^{t - 1} - \alpha . \frac{{\partial L^{t - 1} }}{{\partial w^{t - 1} }}, $$
(9)

Decision Tree (DT)

There are several algorithms that can be employed to construct the decision tree including ID3 (Goodfellow et al., 2016), C4.5 (Quinlan, 1985), and Classification and Regression Trees (CART) (Quinlan, 1993). The Scikit-Learn library (Berk, 2020) that uses the CART algorithm which constructs the binary tree to form a predictor was implemented in the present study.

The lower score shows the purest selection of the node leading to higher performance. The Gini impurity is calculated using Eq. 10.

$$ H\left( {Q_{m} } \right) = \mathop \sum \limits_{k} p_{mk} \left( {1 - p_{mk} } \right), $$
(10)

where H is the loss function. The data at node m of the tree is denoted by Qm. The k denotes the index of the labels of the samples and Pmk denotes the prediction probability at node m.

Materials and methods

In order to predict the belt abrasive wear which affects the surface polishing quality of machining workpieces, two types of sensors, i.e. three-axis accelerometer and three-axis dynamometer are utilized in this research. The research methodology including the experiment preparation and the data acquisition process is detailed in the following sub-sections.

Sensor devices and experimental setup in the polishing process

An abrasive belt grinder is attached to the end-effector of the multi-axis robotic arm to experiment with the grinding of mild steel workpieces as shown in Fig. 1. To deliver the direction movement in the grinding, we use the multi-axis ABB 6660–205-193 robotic arm, while the workpiece grinding process employs a belt grinder with electric power which rotates at 11,000 rpm in no-load conditions. Generally, abrasive belts are used for surface reduction or grinding and surface smoothing or polishing processes with a belt size of about 19.05 to 203.2 mm in width and about 457.2 mm in length.

Fig. 1
figure 1

Abrasive belt grinding experiments with show accelerometer and dynamometer positions

The abrasive belt is fed with a compressive force controlled by an ATI Industrial automation (Omega 160) force sensor at three Cartesian coordinates (x, y, and z) wrapped around a closed metal box. By adding a custom-made mount, the belt-grinding abrasive is affixed to a force sensor installed on the end-effector of ABB's robot arm. A multi-axis force transducer, i.e., the ATI force sensor, provides uniform control of the force input in the Z direction/normal direction. Force sensor from the normal direction facilitates the operation of the belt grinding to measure the compressive force constantly on the workpiece during material removal, while this research applies a fixed 20 N abrasive belt grinding force. Table 2 lists the parameters of other research conditions comprehensively.

Table 2 Parameters of research conditions

Pre-research by carrying out the grinding process on the workpiece has obtained four classifications of abrasive belt conditions for different surface wear and was used as a basis for belt grouping during the experiment as described in Table 3. Four groups representing different abrasive belt wear levels corresponding to the usage time were observed on a few workpieces as output variable values during the study. Meanwhile, the applied input parameters such as grinding speed, force control, and feed rate are controlled at a fixed value as written in Table 2.

Table 3 Belt condition with four monitoring durations

Vibration signals due to mechanical contact between the abrasive belt grinding and workpiece during this experiment were captured by the accelerometer from Kistler 8763B500, a triaxial type vibration sensor, where the sensor is mounted on the belt tension control rod arm as shown in Fig. 1. Kistler 9254 dynamometer is placed at the bottom of the mild steel workpiece holder. The force sensor has a physical form with a very low overall height of 32 mm and a large measurement range of up to ± 10 kN. This dynamometer is equipped with a cover plate measuring 95 × 105 mm, used to measure three components of the resultant force vector and three components of the resultant moment vector. The analog vibration signal generated from the grinding process is converted into the digital signal format by NI DAQ, a data acquisition module with 2 kHz for sampling frequency, meanwhile, communication with the computer utilizes the LabVIEW software platform and saves the vibration data into an Excel file. DEWESoft platform is utilized to support the performance of the NI DAQ device in capturing force signals from dynamometers that travel in normal and tangential directions. The signals captured by an accelerometer and force sensors are synchronized offline respecting the time duration.

Planning and configuring belt grinding lines were carried out using the built-in software of ABB's robotic studio. Figure 2 illustrates the abrasive belt path planning for the linear paths carried out in this work having five distinct zones from point A to point E. In order to achieve the required in feed force, the abrasive belt grinding is set at points B-C, and then it continues to move to the point C-D position where the feeding force on the surface of the workpiece is completely controlled during the study. After surface feeding, the abrasive belt leaves the workpiece as shown in the D-E point positions. The entire range of grinding processes works in dry and unlubricated conditions.

Fig. 2
figure 2

Abrasive belt grinder tool path in the experiment

Data acquisition and collection

Triggered vibration signals and forces generated by performing grinding on mild steel workpieces using the abrasive belt with four distinct conditions, namely new belt, after 5 min usage time, after 15 min usage time, and in a completely worn-out state. Specifically, the new grain belt is operated to grind the surface of the mild steel workpiece for 5 min continuously, then the condition of the belt and workpiece is observed and recorded, while the vibration and force signals have been documented automatically in an Excel file on the computer. After that, the belt grinding was operated again for 15 min continuously, while the conditions of the belt and workpiece were observed again as before. In the final process, the abrasive belt is re-operated until worn-out for about 30 min and observing the condition the belt and workpiece are in worn-out condition.

This research has applied an ML method, which is utilized to predict wear on abrasive belt grinding by installing a 3-axis dynamometer and a 3-axis accelerometer sensor as operational data. The accelerometer and force features are captured and displayed via the LabVIEW platform and stored in an Excel file, which therefore serves as the raw training data for the ML model. However, the raw data extracted and injected into the ML algorithm does not guarantee the convergence of the solution, it must be searched by trying several ML methods. Therefore, several types of ML algorithms were investigated in this study including KNN, SVM, MLP, and DT methods to identify three axes of vibration and force sensors in order to obtain relevant axes representing the wear condition of belt grains. A sample of the raw signal from the accelerometer and compressive force sensor in the x-axis direction for different durations of abrasive belt operation is presented in Fig. 3a for vibration signals and (b) for compressive forces signal. The picture order from top to bottom in Fig. 3 represents the vibration and force signal signals for a brand-new belt, 5 min usage time, 15 min usage time, and worn-out belt.

Fig. 3
figure 3

Raw signals of x-axis direction accelerometer sensor and force sensor

Machining contact between an abrasive belt grinding and a workpiece generates a frequency of around 2 kHz in a period of 3.5 s and produces a raw dataset in the form of vibration and force signals reaching seven thousand points as illustrated in Fig. 3. Segmentation of raw data groups into smaller data groups is prepared before completing the convolution. The data has been grouped up to 200 s, although there is an overlap of about 10%, thus every 0.1-s reading of sample data from the belt grinding process, there are 200 data points that describe the dynamic characteristics of the grinding cycle (Pandiyan et al., 2018).

The assessment of tool state relies on scrutinizing diverse sensor data properties. This study involves the identification of various time-domain and frequency-domain features crucial for real-time differentiation among four distinct tool life cycles. The research methodology employs the Welch spectral density extraction approach to extract information from a signal, including position and peak frequency values. Additionally, the computation of signal energy distribution features within the specified frequency band is conducted using a periodogram. The selection of signal attributes is guided by a priori knowledge concerning the anticipated characteristics of the signals to be categorized. For prognosticating tool belt status monitoring, a comprehensive set of 27 unique features is derived from both the time and frequency domains of each sensor channel. The data, sourced from six channels encompassing AE sensors, 3-axis accelerometers, and 2-axis dynamometers, expands the feature space to 162 variables, with each channel contributing 27 variables. This detailed analysis enhances the understanding of tool conditions through an intricate exploration of sensor data, providing a robust foundation for real-time monitoring and forecasting of tool life cycles.

Experimental settings and discussion

Various ML models have been applied to analyze the wear level of belt grains such as KNN, SVM, MLP, and DT from vibration and force signals generated by mechanical contact between the abrasive belt grinding and workpiece (Pedregosa et al., 2011). The objective is to examine these popular ML methods in the application of abrasive belt grinding intelligence condition monitoring. To provide an objective comparison analysis, the ML methods undergo similar stages presented in Fig. 4.

Fig. 4
figure 4

An examination procedure of ML methods for intelligence abrasive belt condition monitoring

Prior to comparing the algorithms, the vibration data was split into data training and data testing. The strategy of splitting the dataset into two groups consists of training and testing sets, where 80% and 20% of the raw dataset are for training and testing, respectively. The certainty for each partition leads to a balanced data set that applies to all algorithms in the ML method in which each built class has a similar number of samples. This separation process with a balanced set embodied in the implementation is critical in assessing machine learning models because accuracy metrics in unbalanced data sets do not represent true performance (Brodersen et al., 2010). The detailed procedure can be seen in Fig. 5.

Fig. 5
figure 5

Predefined procedure for data preparation used throughout all experimentation

The selected ML methods were tuned in advance to get their peak performance, except for the LR method as it does not require crucial parameters to be tuned. Each hyperparameter in the ML model except the LR method was tuned by enumerating each possible set of choices. Then, the accuracy and latency analysis as presented in Fig. 4 were compared based on K-fold cross-validation. To measure the latency the model was run ten times. Each algorithm has been compared using accuracy metrics in two different evaluations: 10-fold cross-validation (10-fold CV) and standardized performance in test sets. A 10-fold CV is used in the training set to compare each algorithm and to determine good hyperparameters. Due to the small number of samples in the raw data, the 10-fold CV will avoid bias in terms of comparative analysis. For the test set, the model has been trained with all data points in the training set and utilizes the model to be evaluated in the test set.

In terms of scale, the scale for accuracy is a real number in [0, 1]. The value of 0 shows faulty performance while 1 shows flawless performance. In other words, this score can be transformed into a percentage where 0 is 0% while 1 is 100%. For the latency, we use the seconds as the scale.

This study used three common set separations i.e., the training set, validation set, and testing set. In the first stage, the dataset was divided into two parts: the training set and the testing set. The validation set was taken from the training set through K-fold cross-validation. This will assess the performance of the model with different elements of the train and validation set that potentially reveal its true performance.

Ten number of K was suggested for a small number of datasets as suggested in previous studies (Aguilera et al., 2010; Marcot & Hanea, 2021). Using K = 10, the evaluation will take 10% of the training set (204 samples) as the validation set which results in roughly 20 samples per validation. A high number of K will provide less bias for comparison during parameter/hyper-parameter selection. For observing its capability, a comparison of each method using a test set containing roughly 52 samples was conducted.

Hyperparameter optimization

Before analyzing the performance of several ML methods, each algorithm was optimized to determine its peak performance in terms of accuracy and efficiency. Bayesian Optimization (BO) (Shahriari et al., 2016) is employed to find the optimal hyperparameter. The BO method is an efficient algorithm for finding the optimal hyperparameter that works based on a surrogate model and acquisition function. The objective function is designed to search the hyperparameters that maximize the accuracy while minimizing the latency of the model’s prediction which is shown in Eq. 11 with the search space for each machine learning model that is defined in Table 4.

$$ \max KFoldCV\left( f \right) + \left( {1 - meanLatency\left( f \right)} \right) $$
(11)
Table 4 Search space for each machine learning model

In the objective function defined in Eq. 11, we aid the BO to search the hyperparameter for each model such that the models balancing the accuracy performance with the latency. The BO will maximize the function by finding the maximum cross validation through function \(KFoldCV(.)\) and minimizing the mean latency through maximizing the inverse of the function \(meanLatency(.)\). The mean latency was calculated by recording the running time ten times using the test set. The model itself is denoted as f.

Results and discussions

The score of the performance of optimized models is detailed in Table 5. Bold style text shows the best of its criteria. Starting from the objective score obtained from BO, the highest performance was achieved by RF which contributed to the highest 10-fold CV score. The objective score indicates the tradeoff between accuracy and latency which shows higher performance than DT and other models. The lowest objective score was obtained by SVM which was slightly lower than KNN and MLP. SVM, KNN, and MLP are not a suitable approach to handling our problem. On the other hand, tree-based models yielded high performance which are DT, RF, and XGBoost.

Table 5 The testing results of the machine learning models and their accuracy

In test accuracy, all samples in the train set were included the validation set during 10-Fold CV which provided flawless performance for a tree-based decision model of 100% accuracy. These models are DT, RF, and XGBoost. Although the RF produced the highest accuracy with 10-fold CV, it still has equivalent accuracy with DT and XGBoost. Furthermore, DT has less complexity than RF as it utilizes merely one decision tree which becomes competitive with the RF model. On the other hand, KNN, SVM, and MLP yielded the lowest performance which was below 30% of test accuracy. From these results, it observed that DT, RF, and XGBoost is a potential and an effective approaches to tackling abrasive belt grinding problems.

For the efficient models, DT and XGBoost produced the lowest latency compared to other models. Both models achieved a computational time of roughly 0.004 s for running the test set. Meanwhile, XGBoost provided the highest latency with approximately 0.02 s which becomes the lowest performance among all models. Other models like KNN, SVM, and MLP yielded a latency of about 0.01 s but with low test accuracy.

Identifying the most appropriate model for addressing a given problem necessitates thorough consideration and is dependent upon different factors. In addition, it observed that DT, RF, and XGBoost are the best candidates compared to KNN, SVM, and MLP. In a tree-based model, all approaches could achieve 100% of test accuracy. Hence, accounting for the accuracy of the model alone, DT, RF, and XGBoost can be considered as a good preference. Furthermore, considering the latency that may be suitable in embedded devices, DT and RF are the most preferable choices as they can yield the lowest latency. However, since RF employs a different decision tree, it may provide a slightly lower performance than DT which provides more stable results in terms of latency.

It is imperative to acknowledge that the efficacy of the model under discussion is contingent upon the dataset used for training. Hence, different datasets may yield disparate results, thereby necessitating a reevaluation of the model. This paper provides insight into the potential baseline models that have to be assessed in other datasets with similar tasks/problems, namely DT, RF, and XGBoost.

A DT visualization from the optimal hyperparameter is depicted in Fig. 6. The left arrow appointing the left node of the tree in the figure represents the right or true condition of the parent node while the right arrow manifests the opposite (wrong or false condition). Traversing from the root of the tree, the main condition is at the position 137 ≤ 0.117. For the true condition, the conditional execution from the expression 143 ≤ 705.725 resulted in classification for class worn-out and used (both with 51 samples). Meanwhile, for the right child node from the root, the same position 137 will be assessed through condition ≤ 0.156. The true condition from this node resulted in 48 samples classified as new. If the condition is not satisfied, the node will traverse the next child node from condition 48 ≤ 407500. The true condition from this node classified the 51 samples as brand new while the other 3 samples from the false condition were classified as New. In DT method, “new” and “used” label refer to “5 minutes” and “15 minutes” usage time, respectively.

Fig. 6
figure 6

Decision tree visualization from the optimal hyperparameter

Creating a decision tree visualization with only two features considered (137 and 143) allows us to understand the importance of these features in the context of the grinding problem. Feature 137 and 143 might represent specific measurements or characteristics observed during the grinding process. For example, feature 137 could represent the vibration level detected during grinding, while feature 143 might represent the temperature of the grinding tool. These two features are crucial because they provide valuable insights into the condition and performance of the grinding process.

In the grinding problem, the timeline is likely a critical factor influencing the values of features 137 and 143. As the grinding process progresses over time, the vibration level (137) and temperature (143) of the grinding tool may change accordingly. For instance, at the beginning of the grinding process, the vibration level might be relatively low, and the temperature might be stable. However, as the grinding continues, the vibration level could increase due to wear and tear on the grinding tool, while the temperature might also rise due to friction generated during the process. Therefore, these features capture dynamic changes over time, making them essential for monitoring and controlling the grinding process effectively.

The constructed decisions, such as "137 <= 0.117" and "143 <= 706," represent the splitting criteria used by the decision tree to partition the data into different branches. These decisions are based on the values of features 137 and 143. For example, "137 <= 0.117" means that if the value of feature 137 is less than or equal to 0.117, the data point will follow one branch of the tree, while if it is greater than 0.117, it will follow another branch. Similarly, "143 <= 706" indicates that the data point will be split based on whether the value of feature 143 is less than or equal to 706 or not.

These decisions are constructed based on the algorithm's optimization process to maximize the information gain at each split, leading to more homogeneous subsets of data. In the context of the grinding problem, these decisions could represent critical thresholds for controlling the process. For example, if the vibration level (feature 137) exceeds a certain threshold, it might indicate an abnormal condition that requires intervention to prevent damage to the grinding tool or the workpiece. Similarly, if the temperature (feature 143) exceeds a certain threshold, it could signal overheating, which may affect the quality of the finished product. Therefore, the decision tree provides a systematic approach to decision-making based on these important features, helping to ensure the efficiency and effectiveness of the grinding process.

Conclusions

Monitoring the condition of the abrasive belt grinding, belt surface wear has been affected by the workpiece (geometry and material hardness) and tool conditions (type and nature of the abrasive material, belt speed and tension, tool compression force, in-feed speed, and tool quality). During the grinding process, the surface layer is peeled off, and the grains are micro- and macro-grinded resulting in a rough-sized surface topography of the abrasive belt. The dynamic features and interrelated changes have resulted in the workpiece grinding process being un-linear and non-uniform.

In principle, this study has examined the effect of vibration and force on the surface grinding process of mild steel on the wear of the grain coating of belts using various machine learning algorithm predictions. This new strategy has been proposed to assess belt grinding surface wear with machine learning utilities that complement each other and facilitate the interpretation of belt surface wear parameters.

Based on experimental activities and analytical discussions on the grinding process using abrasive belts presented in this paper, the following statements have concluded:

  • Each algorithm has been compared using the tenfold CV accuracy metric as a training set to compare each algorithm and determine good hyperparameters.

  • Tree-based models, Decision Tree (DT), Random Forest (RF), and XGBoost method generate the highest average accuracy compared to Multi-Layer Perceptron (MLP), Support Vector Machine (SVM), and K-Nearest Neighbor (KNN).

  • Decision Tree (DT) and Random Forest (RF) yielded high performance with low latency merely about 0.004 s in evaluating the test set.

The findings of this research offer crucial insights for effective process management. By implementing these research findings, it can enhance operational efficiency, reduce downtime, and improve equipment reliability. The challenge may lie in integrating changes without disrupting production. Lean practices can serve as a solution by targeting waste elimination and continuous process improvement. Thoughtful implementation can have a significant positive impact on overall performance.

Future works will study the possibility of direct sensing for online applications which consist of two main stages: (1) the integration of the prediction models through ensemble learning; and (2) and the application of Internet of Thing (IoT) based machine learning cloud system for online prediction. The ensemble learning combines the predictions of multiple models to improve overall performance and robustness. In the context of online applications for the grinding process, ensemble learning can enhance decision-making by leveraging both the predictions form machine learning models and real-time sensor data. A preliminary study on an online machine learning prediction to predict a chamfer length of deburring process in the aerospace industry based IoT which employed the Amazon Web Services Machine Learning (AML) platform has been conducted and presented in Caesarendra et al., (2018a). This will give an insight regarding the possibility of the online application of the intelligent manufacturing of either workpiece or tool condition monitoring.