Introduction

Artificial neural network (ANN) is defined as a computational modelling method for experimental data based on input conditions. ANN simulates the human brain working. ANN is composed of inputs, artificial neurons of elements with different weights [1]. The importance of ANN comes from its ability to use nonlinear models, high capability to learn with high speed, and no assumptions are needed as in the case of analytical modelling. ANN performance can be improved by adding experimental data [2]. The artificial neural network is a successful prediction method for machinability measures such as cutting forces, surface roughness, and material removal rate [3,4,5]. In this regard, S. Sada [6] used ANN to predict the material removal rate and the surface roughness for mild steel turning. His model was based on the feedforward network. He concluded that the Levenberg–Marquardt (LM) and scaled conjugate gradient (SCG) training algorithms are convenient for predicting model for turning. N. Nagaraj et al. [7] carried ANN model for surface roughness, and material removal rate was measured during grooving assisted with hot air for soda glass workpieces. Their model properties were learngdm for learning function, mean square error (MSE) for performance function, and 10 hidden neurons.

Ultrasonic-assisted drilling (UAD) is recommended for most brittle materials such as glass, carbon fiber-reinforced plastics, and metal matrix composites. UAD process is a hybrid of the ultrasonic machining and drilling. The process parameters are the cutting speed, feed rate, abrasive tool concentration, frequency, and amplitude. The experimental outputs can be the thrust force, surface roughness, edge chipping, and geometrical aspects [8,9,10]. The main force resulting during UAD is the thrust force in z-axis due to the ultrasonic vibration and feed rate. The thrust force is required to predict the cracks behavior, the edge chipping, and the surface roughness. In this regard, the high thrust force increases with the feed rate increase and the decrease of spindle speed [9, 11, 12]. Moreover, the lower thrust force reduces the hole edge tearing size [13]. Wu et al. [14] measured mean and maximum thrust force during their experiments of alumina UAD and found that they decreased at high spindle speed and low feed rate. Ning et al. [15] found similar trends when carried out carbon fiber-reinforced plastic drilling and stated that the reason for this trend is the lower contact time between the abrasive tool and the workpiece in UAD. The intermitted contact in UAD [16] and the reduction of the friction due to the increasing the cutting speed [17] are the main reasons for the reduction in the thrust force.

Modelling the thrust force and surface roughness for UAD is an important objective for reducing the experiments by predicting them for further conditions and finding optimum conditions [18,19,20]. J. M. Baraheni et al. developed statistical model for UAD of composite of LY160/HA-11/T700. They found that the low feed rate and high spindle speed had a significant effect on reduction of the thrust force hence reduction of the fibers delamination [21]. Many mathematically based process physics models were developed. Pei et al. [22] proposed that the shape of the removed chips is an ellipsoid. Liu et al. [23] assumed the abrasives were in the shape of octahedron and the fracture zone was in tetrahedron shape. Although there are many trials of mathematical and statistical modelling of thrust force of UAD, there is no trails to used artificial neural network.

Surface roughness also is a crucial output for machining processes because it judges the level of the machined part finish. N. Guba et al. [24] found that the UAD improved the hole surface finish compared to the conventional drilling. Generally, The higher spindle speed and the lower feed rate reduced the surface roughness (Ra) as found in many previous studies [25,26,27]. Zhang et al. [9] tested UAD of K9 glass and found that Ra increased marginally in case of higher vibration amplitude. Moreover, high feed rate and low spindle speed deteriorated the surface quality. Jiao et al. [28] developed statistical model for the surface roughness during UAD of alumina and found that the feed rate was the significant factor. Moreover, surface roughness (Ra) decreased with spindle speed, increase of grit size, and decrease of feed rate. J. Airao et al. [29] investigated the effect of different cooling strategies on the surface roughness during ultrasonic-assisted turning. They found that using minimum quantity lubrication (MQL) and CO2 reduced the surface roughness greatly compared to dry cutting.

The main objective of this paper is using the artificial neural network to model the machinability variables: maximum and mean thrust force and the surface roughness measured by Ra. The models are developed to monitor the effect of the tool concentration, cutting speed and feed rate on thrust force, and surface roughness to continuously adjust the cutting conditions to maintain these outputs within the acceptable ranges. Four models are developed: one model for each variable separately, and the other model was developed based on the three variables. A comparison was held between the output of the models of single variable and the output of the multivariable to find the capability of the multivariable model to express the variables. Some of these experimental results without artificial neural network modelling were published before in these references [8, 30, 31].

Development of neural network model

Experimental work related to ANN

The experimental work related to this study involved measuring the mean and maximum thrust force and surface roughness for UAD. The used material as workpiece was soda glass due to their importance in biomedical and chemical applications. The workpiece material chemical composition is \({\mathrm{SiO}}_{2}\) (69–74%), \({Na}_{2}\) O (10–16%), and CaO (5–14%). The experiments were carried out using the DMG-MORI ULTRASONIC 20. Linear maximum rotational speeds with and without ultrasonic vibration are 40,000 rpm and 42,000 rpm, respectively. The used cutting tools were electroplated diamond core drills (SCHOTT Ltd. Corp.) with outer diameter and wall thickness of 6 mm and 0.5 mm, respectively. Figure 1 shows the experimental setup.

Fig. 1
figure 1

Experimental setup

The thrust force on the cutting tool was measured using Kistler dynamometer (force sensor) type 9272. The mean and maximum values of thrust force are taken in this study. The surface roughness was measured in terms of the arithmetical mean value (Ra) using the surface roughness tester TR-200. The ultrasonic vibration conditions were 28 kHz of frequency and 10 m\(\upmu\) of amplitude. The experimental parameters were the abrasive particles concentration of the cutting tool, the spindle speed, and the feed rate as shown in Table 1. The measured outputs for maximum and mean thrust force (N) and the surface roughness (\(\mu m)\) are shown in Table 2.

Table 1 Experimental levels of the selected parameters
Table 2 The actual and normalized experimental results of the thrust force (maximum and mean) and the surface roughness

Artificial neural network model development

Four models are developed in this study: one model is developed for mean and maximum thrust force and the surface roughness Ra separately and then one model to predict the three responses directly. The model conditions are defined form the literature review. The feedforward architecture is selected in this study with multi-hidden layers with sigmoid functions (Eq. (1)) and one layer with linear function (Eq. (2)) before the output layer. Due to the sigmoid function range from 0 to 1, the experimental true results are normalized as shown in Table 2 [2, 32, 33]. Equation (3) was used to normalize the experimental data.

Many trials were carried out for defining suitable number of hidden layers. Figure 2 shows the mean squared error (MSE) for three runs for models for the maximum and mean thrust force and the surface roughness with the number of hidden layers. It is found that the MSE decreases with increasing the number of the hidden layers while increasing the number of hidden layer increases the time of model running. It is noticed that a large decrease for the MSE occurs at 10 hidden layers, and it is suitable for the running time. The percentages of training, validation, and testing data were taken from similar publish work [32, 34]. They are selected to be 70%, 15%, and 15%, respectively. The most selected training function is Levenberg–Marquardt as most of similar problems [6, 34, 35]. The mean squared error is used to measure the models performance. Matlab R2020b-academic was used to carry out the model runs.

Fig. 2
figure 2

The effect of the number of hidden layers on the mean squared error for results of a mean thrust force, b maximum thrust force, and c surface roughness Ra

$$Y=\frac{1}{1+\mathrm{exp}\left(-x\right)}$$
(1)
$$Y=ax+b$$
(2)
$${x}_{i}=\frac{{d}_{i}-{d}_{min}}{{d}_{max}-{d}_{min}}$$
(3)

where \({x}_{i}\) is the normalized data corresponding to the experimental data (\({d}_{i}\)). The symbols \({d}_{min}\) and \({d}_{max}\) are the minimum and maximum experimental data values.

Results and discussion

Figure 3a shows the ANN architecture which is used to predict only output (mean or maximum thrust force or surface roughness), and Fig. 3b shows the architecture was used to model the three outputs at the same time. Figure 3c shows the detailed architecture developed. Table 3 shows the neuron weights and biases for all layers. During ANN working, the neuron weights are arranged in matrix corresponding to the terms of (w) in Eq. 4 where each raw represents a layer, where \({w}_{\mathrm{1,1}}\), \({w}_{\mathrm{2,1}}\), \({w}_{\mathrm{1,2}}\), and \({w}_{\mathrm{2,2}}\) equal to − 0.2, 1.5, − 1.1, and − 1.7, and so on. The dot product of each weight (w) column with the input vector then is summed to the corresponding bias (b) and then (x) in Eq. 4 is compensated in Eq. (1).

Fig. 3
figure 3

The neural network architecture for a one output model and b three outputs models and c detailed used of neural network architecture

Table 3 The neurons weights and biases for the neural network for the mean and maximum thrust force and surface roughness
$$x=wp+b=\left[\begin{array}{ccc}{w}_{\mathrm{1,1}}& {w}_{\mathrm{2,1}}& {w}_{\mathrm{3,1}}\\ {w}_{\mathrm{1,2}}& {w}_{\mathrm{2,2}}& {w}_{\mathrm{3,2}}\\ \begin{array}{c}\begin{array}{c}.\\ .\end{array}\\ {w}_{\mathrm{1,10}}\end{array}& \begin{array}{c}\begin{array}{c}.\\ .\end{array}\\ {w}_{\mathrm{1,10}}\end{array}& \begin{array}{c}\begin{array}{c}.\\ .\end{array}\\ {w}_{\mathrm{1,10}}\end{array}\end{array}\right]\left[\begin{array}{c}N/H\\ v\\ f\end{array}\right]+\left[\begin{array}{c}b1\\ b2\\ \begin{array}{c}.\\ b10\end{array}\end{array}\right]$$
(4)

The ANN model for the mean thrust force

Figure 4 shows the regression and performance plots for the developed model for the mean thrust force which is developed by neural nertwork technique. The regression plots (Fig. 4a) show that the artificial neural network model output is close to the experimental data. The regression coefficients (R2) for the data used for training, validation, and test are 0.92, 0.93, and 0.93 respectively which are expressed in the high agreement of the model. When all data are fitted, the regression coefficient is 90%. The models of the true experimental data and the neural network output for training, validation, test, and the overall data are shown in Eq. 5, Eq. 6, Eq. 7, and Eq. 8, respectively. In Fig. 4b, the mean squared error decreased greatly for the training, validation, and test data with increasing the number of the training trials. The best performance for both validation and tested data at the third epoch about mean squared error (MSE) equals to 0.0296 and 0.009, respectively.

Fig. 4
figure 4

Neural network outputs for the mean thrust force. a Regression plot and b performance plot

$$\mathrm{Output }\left(\mathrm{Fz mean}-\mathrm{training}\right)=0.83 \times \mathrm{ target}+0.11$$
(5)
$$\mathrm{Output}\left(\mathrm{Fz mean}-\mathrm{validation}\right)=0.54 \times \mathrm{ target}+0.35$$
(6)
$$\mathrm{Output}\left(\mathrm{Fz mean}-\mathrm{test}\right)=0.74 \times \mathrm{ target}+0.13$$
(7)
$$\mathrm{Output }\left(\mathrm{Fz mean}-\mathrm{all}\right)=0.72 \times \mathrm{ target}+0.18$$
(8)

The ANN model for maximum thrust force

The regression plots for fitting all experimental results of the maximum thrust force and the portions of data used for training, validation, and the test in addition to the performance plots are developed by neural nertwork technique as shown in Fig. 5a. The coefficients of regression (R2) of the developed models show high representation for the experimental data especially in case of training (R2 = 96.6%), validation (R2 = 94.4), and all data (94%), while test data is quite lower (R = 86%). The equations correlate with the experimental results, and the output of the neural network for training, validation, and test data is Eq. 9, Eq. 10, and Eq. 11, respectively, and Eq. 12 correlates all results with ANN outputs. Figure 5b shows that the mean squred error decreases for the training, validation, and test data with training neural network model in many trails. The least MSE for validation data occurs at the trail 8 and in case of the tested data occurs at the sixth trail.

Fig. 5
figure 5

Neural network outputs for the surface roughness Ra. a Regression plot and b performance plot

$$\mathrm{Output }\left(\mathrm{Fz max}-\mathrm{training}\right)=0.92 \times \mathrm{target}+0.038$$
(9)
$$\mathrm{Output}\left(\mathrm{Fz max}-\mathrm{validation}\right)=0.85 \times \mathrm{ target}+0.024$$
(10)
$$\mathrm{Output}\left(\mathrm{Fz max}-\mathrm{test}\right)=1.2 \times \mathrm{ target}+0.053$$
(11)
$$\mathrm{Output }\left(\mathrm{Fz max}-\mathrm{all}\right)=0.91 \times \mathrm{ target}+0.036$$
(12)

The ANN model of surface roughness

The model with conditions is stated in section of development neural network model conditions with the normalized data of the surface roughness. Figure 6a shows the regression fitting of the true experimental results after dividing into training, validation, and test data in addition to the regression of all data. These models achieve higher coefficients of regression (R2) for the surface roughness measurements (Ra) compared to the model of the mean thrust force. The coefficients of regression for training, validation, and test portions and all data are 95%, 95%, 91%, and 92%, respectively. Equations (1215) are derived from the developed neural network to correlate to the true experimental results (target) and the output for training, validation, test, and all data. Figure 6b indicates that the mean squared error of the fitted values with the ANN trails decreases for the training, validation, and test data. The least MSE for validation data occurs at the fifth trail.

Fig. 6
figure 6

Neural network outputs for the maximum thrust force (a), regression plot (b), and performance plot (c)

$$\mathrm{Output }(\mathrm{Ra}-\mathrm{training})=0.89\times \mathrm{target}+0.056$$
(13)
$$\mathrm{Output}(\mathrm{Ra}-\mathrm{validation})=1.3\times \mathrm{target}+0.12$$
(14)
$$\mathrm{Output}(\mathrm{Ra}-\mathrm{test})=1.4\times \mathrm{target}+0.28$$
(15)
$$\mathrm{Output }(\mathrm{Ra}-\mathrm{all})=0.99\times \mathrm{target}+0.0091$$
(16)

The ANN model for mean and maximum thrust force and surface roughness

This section concerns with the neural network model with the output of three machinability measures: the mean and maximum thrust force and the surface roughness Ra. This model is devolved for the manufacturing engineers for optimizing the three machinability measurements at the same conditions. It is developed with the same conditions of the previous models with the normalized data of the three experimental outputs. Figure 7a shows the regression of the fitted data of potions of the training, validation, and test data in addition to the regression of fitting all data. The coefficients of regression (R2) for this model for training, validation, and test data are 94%, 85.8%, 83.4%, and 90%, respectively. They are a little low compared to the coefficient of regression of the previous models may be due to the variation of the three measurements, but it is important for the optimization of the three outputs at the same time. Equations (1720) are developed from the neural network to correlate the true experimental results (target) and the output. Figure 7b indicates that the mean squared error of the fitted values with the ANN trail decreases for the training, validation, and test data. The least MSE for validation data occurs at the third trail.

Fig. 7
figure 7

Neural network outputs for the three meansured outputs. a Regression plot and b performance plot

$$\mathrm{Output }\left(\mathrm{training}\right)=0.87 \times \mathrm{target}+0.065$$
(17)
$$\mathrm{Output}(\mathrm{validation})=0.69\times \mathrm{target}+0.086$$
(18)
$$\mathrm{Output}\left(\mathrm{test}\right)=0.69\times \mathrm{ target}+0.11$$
(19)
$$\mathrm{Output }(\mathrm{all})=0.81\times \mathrm{target}+0.081$$
(20)

Figure 8 shows the comparison between the true experimental results, predicted output of the model developed for each machinability measurement separately and the output from the general model based on each measurement. The predicted outputs are developed based on Eqs. (5), (9), and (13) for the mean (Fz mean) and maximum (Fz max) thrust force and the surface roughness (Ra, respectively. Equation (17) is used to find the output for the three measurements. These equations are used because of their high coefficients regresion. Regarding Fig. 8a, the predicted outputs of the mean thrust force of the general model are closer to the real experimental data compared to the outputs of the model of the mean thrust force only. Figure 8b shows that the outputs of the model of the maximum thrust force are closer to the true experimental results compared to the general model. In case of the surface roughness Ra, the experimental results and the output of Ra model are nearly the same, while the general model produces an error in the range from 2.8 to 8.1% as shown in Fig. 8c. Generally, the models developed by the neural networks with the previous stated conditions are suitable representative for the experimental results. The errors between the experimental results and the predicted results from the models were calculated, and Table 4 summarizes the error range for each model predicted values compared to the experimental results.

Fig. 8
figure 8

Comparison between the experimental results, output of the model for each machinability measurement, and the output of the general model from the three measurements. a Mean thrust force (Fz mean), b maximum thrust force (Fz max), and surface roughness (Ra)

Table 4 The models error compared to the experimental results

Conclusions

The ANN models of mean and maximum thrust force and the surface roughness Ra of soda glass drilled using UAD are developed. The experimental conditions of tool concentration, feed rate, and spindle speed were modeled based on the experimental results using the neural network and using the feedforward architecture and 10 hidden layers. Three models were developed for each experimental variable individually based on one output model, and then, a three outputs model is developed for the three experimental outputs at the same time. The generated models give high coefficient of regressions. The general model controls the three outputs at the same time and saves the manufacturing engineers controlling for the inputs and lower thrust force and good surface roughness. The coefficients of regression for the developed model are quit high and suitable for these results. The following points conclude the paper outputs:

  1. 1.

    The artificial neural networks can be used to model machinability measures (mean and maximum thrust force and surface roughness) for ultrasonic-assisted drilling process with reasonable presentation for the experimental results.

  2. 2.

    The three machinability measures can be modeled in one model to find optimum conditions for reducing the maximum and minimum thrust force and the surface roughness.

  3. 3.

    Generally, modelling each variable individually is more efficient in terms of coefficient of regression compared to one model for the three variables.