1 1. Introduction

The accelerated progression of technological advancements, coupled with the rising demand for superior products and novel materials, has culminated in the evolution and refinement of production methodologies [1]. Although conventional machining processes such as turning, milling, and drilling were used for over a century in various demanding tasks in the manufacturing industry and many advancements, such as the use of advanced coolants and cooling methods, the use of specially designed tools or assistive technologies has been developed in order to alleviate some of the common issues restricting the high performance of these processes such as tool wear, machining chatter and chip control, improvement of their efficiency is still limited. Especially, non-conventional processes have shown to be especially suited for efficient machining of difficult-to-cut materials, including hardened steels, titanium alloys, and nickel superalloys as well as ceramics and composite materials. A particularly interesting non-conventional method is laser beam machining (LBM), which is a cutting-edge manufacturing process that employs a high-energy laser beam to sculpt materials with intricate precision, often dealing with variables that are nonlinear, imprecise, and multifaceted in nature [2]. Due to its non-conventional nature, LBM does not require physical contact between the workpiece and the tool, reducing the need for expensive and complex tooling for the machining setup and also eliminating the drawback of cutting tool wear during conventional machining processes [3].

LBM is a sophisticated subtractive manufacturing technique, whose efficacy is governed by intricate interrelationships among several key parameters, relevant to both the machine tool employed and the workpiece material, namely laser power, laser beam spot size, feed rate, use of assisting gas, thermo-mechanical properties of the workpiece, etc [4]. The laser’s power dictates the energy intensity imparted, while its pulse duration and frequency modulate the temporal distribution of this energy [5, 6]. Spot size, derived from the focal length of the employed lens, determines the spatial concentration of the laser beam, influencing the resolution and precision of machining [7, 8]. The feed rate represents a balance between achieving desired material removal rates and minimizing thermal effects [9]. Assisting gases not only influence the ejection of molten material but can also chemically interact with the workpiece, affecting surface characteristics [10]. Material-specific properties, such as thermal conductivity and reflectivity, play pivotal roles in the beam-material interaction dynamics [11]. Ambient conditions, particularly temperature and humidity, might subtly influence beam propagation and absorption. Understanding and calibrating these parameters, in harmony, are paramount for ensuring optimal, reproducible, and scientifically consistent results in LBM applications [12].

Based on the mentioned above complex LBM physical phenomena, an optimization of these parameters is crucial to increase the precision and minimize the energy and materials waste [7, 13]. Apart from experimental work and statistical analysis of the findings based on methods such as ANOVA, addressing the complexities inherent in this process requires advanced computational tools, and this is where the Adaptive neuro-fuzzy inference systems (ANFIS) comes into play [14]. ANFIS, an innovative model that blends the adaptive capabilities of artificial neural networks (ANN) [15] with the logical reasoning of fuzzy inference systems (FIS) [16], is uniquely poised to understand and model the nuanced parameters of LBM. By exploiting the individual strengths of ANN and FIS, ANFIS offers a compelling strategy for function approximation and system modeling, especially relevant in the dynamic world of laser manufacturing. From an engineering viewpoint, the brilliance of ANFIS resides in its adeptness at navigating the challenging terrains of nonlinearities and imprecisions, thereby paving the way for more accurate and efficient laser machining outcomes [17].

The efficacy of LBM is deeply contingent upon its input parameters, and for this purpose many researchers have conducted comprehensive experimental and numerical studies in order to determine the correlation between these parameters and some of the outcomes of LBM. It is widely accepted that a paramount machining performance metric is surface roughness [18]. To delineate the correlation between surface quality and these parameters in LBM drilling, Muthuramalingam et al. [19] undertook an optimization study on a titanium grade 5 alloy utilizing a CO2 laser. Through the Taguchi gray relationship analysis (GRA), it was discerned that the laser power (LP) and stand-off distance (SoD) significantly influence the surface roughness (SR) [19]. Specifically, a higher LP combined with a reduced SoD results in a diminished SR. Chatterjee et al. [20] in their study used artificial intelligence techniques to predict the quality characteristics of Ti6Al4V and AISI 316 after laser hole drilling. From the analytical evaluation, it is discerned that the Multi Gene Genetic Programming (MGGP) model exhibits a diminished root mean square error (RMSE) relative to the ANFIS model when evaluating performance metrics. This data suggests that MGGP holds a heightened potentiality and robustness in prognosticating the performance parameters associated with the laser beam micro-drilling operation. Conclusively, the MGGP models manifest superior prediction fidelity in juxtaposition with the ANFIS model. Numerous researchers have endeavored to devise an AI algorithm tailored for predicting and optimizing process outcome such as material removal rate (MRR) and kerf width in LBM, especially given that a reduced kerf width is integral to ensuring optimal cut quality [21].

Campanell et al. [22] used an ANN model to identify the values of scan speed and pulse frequency that correlate with a predetermined depth of MRR, thereby enhancing the efficiency of the process of laser milling. The results show that there is an inverse relationship where scanning speed (V) decreases with a rise in the ablation depth. Ablation depths below 2.5 μm correspond with higher velocities, approaching the upper limit of 2000 mm/s considered in this study. Conversely, greater ablation depths exceeding 10 μm align with scan speeds in the 460 to 510 mm/s range. Rajamaniaet al. [23] performed an ANFIS modelling and whale optimization algorithm to predict the MRR, kerf taper angle, and SR during a fiber laser cutting of Hastelloy C276. The results show that ANFIS model, having the least prediction errors, is more efficient than the multiple linear regression models approach. Whale optimization algorithm proves to be the best method for optimizing the LBC process, with optimal parameters being 3 bar of gas pressure, 319.8 mm/min cutting speed, 5.93 J pulse energy, and 2.97 mm stand-off distance. Finally, Bakhtiyari et al. [24] employed ANN and ANFIS algorithms to represent the cadmium plasma attributes under conditions of local thermodynamic equilibrium during laser ablation. The study results indicate that the suggested machine learning algorithms proficiently create precise models to forecast the temporal and spatial changes of the laser-induced plasma (LIP). The need of implementing the diode-based LBM (DLBM) process is increasing predominantly [25]. The higher laser-power diode can increase the plasma energy developed across the machining zone [26].

Apart from surface quality and MRR, the kerf width (KW) is also an important quality metric on accessing the cutting quality of LBM process, as it indicates not only the material wastage but also the cutting dimensional accuracy [27]. More specifically, Löhr et al. [28] performed a comprehensive analysis of the deviations of the kerf profile during CO2 laser processing of PMMA workpieces, showing also that focal point position, cutting speed, and laser power are the most important parameters regarding kerf width. Pramanik et al. [29] investigated the effect of various parameters, including sawing angle, laser power, duty cycle, pulse frequency, and scanning speed on the kerf quality during fiber laser micro-machining of Ti-6Al-4 V. Comparable results on the effect of process parameters on kerf quality were also obtained in another study by the same authors on stainless steel workpieces, who analyzed also the kerf taper angle [30]. Analysis of experimental data indicated that all parameters were statistically significant but pulse frequency, duty cycle, and scanning speed were the most important. Son and Lee [31] correlated the kerf width with volume energy during LBM and found that higher kerf widths are produced under higher volume energy and that laser power has a more significant effect on kerf width than cutting speed. Naskar et al. [32] conducted experiments on LBM of aluminum alloy 7075 and analyzed the effect of laser power, pulse frequency, and assist gas pressure on kerf width and heat-affected zone (HAZ). It was found that all parameters were important and that kerf width is minimized at low laser power due to less thermal energy incorporation, moderate assist gas pressure, and higher pulse frequency.

Bakhtiyari et al. [7] analyzed comprehensively the influence of several parameters during LBM, underlined the importance of kerf width as a quality indicator, and mentioned that high laser power leads to increase of kerf width as well as waviness of the produced profiles whereas increased cutting speed reduces the kerf width. Genna et al. [33] conducted experiments under five different parameters, such as workpiece thickness, feed rate, assist gas pressure, and focus position, and argued that increased cutting speed reduces kerf width due to the reduced irradiation time and lower energy but the increase of kerf width observed in some cases can be attributed to the change in cut front inclination. Lind et al. [34] studied the effect of beam shape on kerf geometry during LBM and developed a model for determining the maximum possible cutting speed depending on the geometry of kerf. Kusuma and Huang [35] conducted a comprehensive comparison of different soft computing models for kerf width prediction during LBM, showing that a random forest model was able to achieve the lowest MAPE in both training and testing. Khoshaim et al. [36] performed laser machining experiments on PMMA sheets and found out that kerf deviation was mostly affected by workpiece thickness and cutting speed. Increased cutting speed as well as gas pressure and laser power increased the achieved kerf width [37, 38].

Even though several research efforts were performed on the prediction of LBM process quality metrics, only less research attention was provided on enhancing the process of the diode-based LBM, and also few amounts of attention were given to the prediction of KW during the machining process by appropriate soft computing models [39]. As KW indicates the overcut made during the cutting process and indirectly represents materials wastage that happened during the machining process [40], apart from reducing the surface roughness, it is essential to reduce KW as much as possible due to the high cost of titanium alloy specimens. Hence, it needs to be predicted before the machining process for choosing the optimal process parameters. Moreover, it is important that the influence of the DLBM process factors on KW and surface roughness has to be investigated for achieving better quality during diode laser machining. Thus, in the present work, a comprehensive experimental work based on Taguchi orthogonal array design is conducted in order to investigate the influence of process parameters on kerf width and surface roughness of DLBM processed specimens at first and then, an ANFIS-based predictive model and a GA-ANFIS model for KW obtained during DLBM were developed based on the experimental data and evaluated regarding their capabilities, in comparison with other established methods, such as regression methods and regression tree models. Finally, an optimization procedure by the gray relational–based (GRB) multi-response optimization method was carried out in order to determine the optimum conditions for obtaining the highest quality of the produced features based on the minimization of both kerf width and surface roughness values.

2 Experimental methodology

In this work, the effect of using advanced AI methods for the prediction of kerf quality during laser machining of titanium alloy is investigated, along with the determination of optimum process parameters values which can lead to reduced kerf width and surface roughness. A comprehensive experimental work is conducted, taking into account five different process parameters under multiple levels each and then the correlation of these parameters with kerf width and surface roughness is determined. In order to improve the precision and robustness of the prediction of kerf width, apart from the usually developed regression models, several AI models, namely decision trees (DT), adaptive neuro-fuzzy inference system (ANFIS), and a hybrid genetic algorithm - adaptive neuro-fuzzy inference system (GA-ANFIS) model, are developed and compared based on multiple metrics in order to determine the most preferable approach for the specific problem. Moreover, the hybrid GA-ANFIS model is compared to other state-of-the-art models such as support vector machine (SVM), multilayer perceptron neural network, Gaussian process regression, and kernel regression in order to further prove the superiority of this method. Finally, a gray relational–based optimization method is employed in order to provide the parameter combination which leads to minimum kerf width and surface roughness values.

2.1 Experimental design

The aerospace and industrial sectors make extensive use of titanium (Ti-6Al-4 V) alloy. Thus, due to its frequent use in demanding applications, it was chosen for the current study. In this study, the milling process was performed over the workpiece specimens using a customized power diode-assisted laser beam machining process (DLBM) process [41]. The power supply unit (PSU) of 12 V has been connected to the laser power module, which has an input electrical power of 10 W and 20 W with a laser driver module for controlling the laser beam [42]. The process parameters in the DLBM process such as standoff distance (SOD), feed rate (FR), duty cycle (DC), and frequency (F) along with powder diode (PD) were selected to examine their effects on the thickness of the KW [27]. The range of process variable settings for process factors was chosen as indicated in Table 1. Figure 1 shows the graphical representation of the laser milling.

Fig. 1
figure 1

Graphical representation of the laser milling process

Kerf width (KW) is the removed material width during the machining process, indicated in the schematic diagram of Fig. 2. Celestron’s digital microscope software was utilized to acquire images and to measure the kerf width. The proposed study used a USB digital microscope to measure the kerf width of machined specimens. The digital microscope was set to a magnification of 52.834% to capture the top surface of the titanium workpiece. In this experiment, the kerf width was measured based on ten different positions for each test, using the ruler function in Celestron Micro Capture Pro. The ruler was first calibrated with a Vernier scale to ensure accuracy, and the ten sample measurements were then averaged to determine the kerf width of each sample. The surface roughness of the cut surface was quantitatively measured using an Alicona Infinite focus microscope by setting up lambda filter (cutoff wavelength) on 80 μm. Each machining trial was replicated two times to enhance the measurement accuracy and all the performance characteristic values used for analysis were obtained from computing the mean values of the repeated trials. The surface roughness was measured from scanned areas at the center of the machined surface.

Fig. 2
figure 2

Schematic diagram of top and bottom kerf widths in a section of a cut created by LBM

Table 1 Process factors in the DLBM study

By using an experimental design based on L25 Taguchi orthogonal array for four parameters under five levels and duplicating the design twice, as laser power was varied at two levels, 50 different experimental training combinations were carried out in total. The number of levels for the different variables was determined by analyzing the literature and discussing with laser machining experts in this field. Especially regarding laser power, it was observed that the suggested two laser power values can be related to the limiting values for obtaining optimal values for the process. Moreover, the choice of laser power depends on the hardware requirements and commercially availability for diodes used for the same process. This approach not only allows a very comprehensive evaluation of the effect of each parameter on kerf width during laser machining of titanium alloy but also maintains a relatively low required time and cost for the experiments.

2.2  Fuzzy inference system (FIS)–based prediction approach

A fuzzy inference system (FIS) is a type of artificial intelligence model that operates based on fuzzy logic. Fuzzy logic is a mathematical framework that deals with uncertainty and imprecision, allowing for the representation and manipulation of vague and uncertain information. It is particularly useful in situations where conventional binary logic (true/false) for decision making may not be suitable.

The fuzzy inference system consists of four main components:

Fuzzification

This step involves converting crisp (exact) input data into fuzzy sets. Fuzzy sets are characterized by membership functions that assign a degree of membership to each element of the set. These membership functions define the degree of similarity between the input data and the fuzzy sets.

Fuzzy rule base

The fuzzy rule base contains a set of IF-THEN rules that express the relationship between the fuzzy inputs and fuzzy outputs. Each rule consists of a condition (antecedent) and a conclusion (consequent) linked by linguistic terms. The condition part uses the fuzzy sets obtained from the fuzzification step, and the conclusion part defines the fuzzy sets that represent the desired output.

Inference engine

The inference engine evaluates the fuzzy rules based on the fuzzy input values to determine the corresponding fuzzy outputs. The degree of activation of each rule is determined by the membership values of the input data in the respective fuzzy sets.

Defuzzification

In many applications, the fuzzy output needs to be converted back into crisp values. Defuzzification is the process of aggregating the fuzzy outputs and obtaining a single crisp output value. Various methods can be used for defuzzification, such as centroid, mean of maximum, or weighted average.

Fig. 3
figure 3

Tuning process of the fuzzy inference system

Tuning a fuzzy inference system (FIS) involves adjusting its parameters and rules to optimize its performance for a specific task or application. Proper tuning is essential to ensure that the FIS can accurately model the underlying system, make accurate predictions, or control a process effectively [43]. There are several techniques for tuning a FIS, depending on the type of FIS and the nature of the problem at hand [44]. There are various algorithms and optimization techniques that can be used for tuning fuzzy inference systems. Some of the common ones are grid search, genetic algorithms, particle swarm optimization (PSO), simulated annealing, differential evolution, ant colony optimization (ACO), and particle competition and cooperation optimization (PCCO). The process of tuning involves evolutionary learning to solve the optimization problem as shown in Fig. 3.

2.3 Adaptive neuro-fuzzy inference system (ANFIS)

ANFIS is a hybrid computational model that combines the capabilities of artificial neural networks (ANNs) and fuzzy logic to perform inference and learning tasks [13]. Fuzzy logic systems use fuzzy sets and rule base for inferring output parameters based on input data. Development of the rule base and structure of fuzzy systems requires an expert knowledge about the system. The fuzzy rules are defined based on linguistic variables and membership functions, which determine the degree of membership of input variables to different fuzzy sets. An artificial neural network (ANN) is used to approximate the consequent part of fuzzy rules. The ANN is responsible for tuning the parameters (weights and biases) of the nodes in the output layer, which correspond to the consequent part of the fuzzy rules. ANFIS was developed as a tool for modelling and predicting complex systems based on input-output data [45].

Fig. 4
figure 4

ANFIS architecture and learning process

The architecture of an ANFIS typically consists of several layers as shown in Fig. 4, each performing specific functions in the modelling and inference process [46, 47]. The functions of different layers in ANFIS are listed below.

Input layer

The input layer receives the crisp input variables or data. Each input node represents an input variable.

Membership function layer

The membership function layer fuzzifies the crisp input data into fuzzy linguistic variables. Each node in this layer represents a membership function associated with a specific input variable. Membership functions define the degree of membership of an input value to a particular fuzzy set.

Normalization layer

The normalization layer computes the normalized firing strength of each rule. It takes the fuzzy membership degrees from the membership function layer as inputs and normalizes them.

Rule layer

The rule layer computes the firing strength (activation level) of each rule. It combines the normalized membership degrees from the normalization layer based on the fuzzy rules. Each node in this layer represents a specific rule.

Aggregation layer

The aggregation layer aggregates the outputs of the rules. It combines the outputs of the rule layer using the firing strengths obtained in the previous layer.

Output layer

The output layer provides the final output of the ANFIS model. It takes the aggregated outputs from the aggregation layer and performs defuzzification to convert the aggregated output into a crisp value.

Parameter learning

The most important feature of the ANFIS algorithm is its ability to adjust the parameters (weights and biases) of the network to optimize the output. Forward Pass Perform is a forward pass through the network, passing the input data through the fuzzy inference system. It computes the outputs of each node based on the fuzzy rules and membership functions. Error calculation compares the computed outputs with the desired outputs and calculate the error. Backward Pass (gradient calculation) propagates the error back through the network and calculates the gradients of the error with respect to the parameters. Parameters such as weights and biases are updated using an optimization algorithm (e.g., gradient descent) to minimize the error. This process iterates for a specified number of iterations or until convergence. It allows the ANFIS model to adjust its parameters and improve its performance on the training data.

2.4 Genetic algorithm tuned adaptive neuro-fuzzy inference system (GA-ANFIS)

Genetic algorithm (GA) is a heuristic search and optimization algorithm inspired by the process of natural selection and genetics in biology. The basic idea behind a GA is to start with a population of possible solutions to a problem, and then use a process of selection, crossover, and mutation to generate new solutions that may be better than the previous ones. The solutions are encoded as chromosomes, which are strings of binary or real-valued numbers that represent potential solutions to the problem. The key advantages of GA are that it can handle a wide range of problems and search spaces, and it can work in parallel to generate multiple solutions simultaneously.

Genetic algorithm-tuned ANFIS is a hybrid machine learning approach that combines the ANFIS with GA optimization to improve the performance of ANFIS as shown in Fig. 5. In a GA-tuned ANFIS approach, the GA algorithm is used to optimize the parameters of the ANFIS model, such as the membership functions and the scaling factors, to improve the accuracy of the model [48]. The GA algorithm searches for the optimal set of parameters by evaluating the fitness of different parameter sets using a fitness function, which is typically the mean squared error between the predicted and actual values. The advantage of using GA-tuned ANFIS is that it can improve the accuracy of ANFIS by optimizing the parameters more efficiently than traditional trial-and-error methods. It can also handle complex and non-linear systems, and adapt to changing environments [49].

Fig. 5
figure 5

Structure of genetic algorithm–tuned ANFIS system

2.5 K-fold cross-validation

K-fold cross-validation is a technique used in machine learning to evaluate the performance of a model. In this method, the dataset is divided into K equal-sized partitions, or “folds.” The model is then trained K times, each time using K−1 folds for training and one fold for testing. For example, if we choose K = 5, the data would be split into five folds, each containing one-fifth of the data. The model would be trained five times, each time using a different fold as the test set and the remaining four folds as the training set. The results from each of the five tests are then averaged to produce a final evaluation metric, such as accuracy or error rate [50].

The advantage of using K-fold cross-validation is that it provides a more accurate estimate of the model’s performance than simply splitting the data into a single training and testing set. By using multiple folds, we are able to train and test the model on different parts of the dataset, which can help to reduce the impact of any particular subset of the data having an unusual distribution. Another advantage of K-fold cross-validation is that it allows us to make better use of our limited dataset, particularly when the amount of data available for training is small. By repeatedly training the model on different parts of the dataset, it is possible to get a better estimate of the model’s true performance [51].

Finally, it is necessary to note that the performance of the predictive models is evaluated using several standard performance measures as listed below.

Mean squared error (MSE)

This is one of the most common measures used to evaluate the accuracy of regression models. It calculates the average squared difference between the predicted values and the actual target values. MSE is given as follows:

$$MSE=\frac{1}{n}\sum _{i=1}^{n}{\left({y}_{m}-{y}_{p}\right)}^{2}$$
(1)

where n is number of data samples, and ym is the experimental value, while yp is the predicted value.

Root mean squared error (RMSE)

RMSE is the square root of the MSE. It provides a measure of the typical prediction error and is often preferred when the errors have significant magnitude variations. RMSE is given as follows:

$$RMSE=\sqrt{\frac{1}{n}\sum _{i=1}^{n}{\left({y}_{m}-{y}_{p}\right)}^{2}}$$
(2)

Mean absolute error (MAE)

MAE is another measure of the prediction error, but it takes the absolute difference between predicted and actual values. It is less sensitive to outliers than MSE.

$$MAE=\frac{1}{n}\sum _{i=1}^{n}\left|{y}_{m}-{y}_{p}\right|$$
(3)

R-squared (R²)

R-squared measures the proportion of variance in the target variable that is predictable from the input variables. It provides an indication of how well the model fits the data, with values ranging from 0 to 1. Higher R-squared values indicate a better fit.

$${R}^{2}=\frac{\left[\sum _{i=1}^{n}{\left({y}_{m}-\stackrel{-}{{y}_{m}}\right)}^{2}\right]-\left[\sum _{i=1}^{n}{\left({y}_{m}-{y}_{p}\right)}^{2}\right]}{\left[\sum _{i=1}^{n}{\left({y}_{m}-\stackrel{-}{{y}_{m}}\right)}^{2}\right]}$$
(4)

3 Results and discussion

3.1 Analysis and discussion of the experimental results

By using an experimental design based on Taguchi L25 orthogonal array for four parameters under five levels and performing the same experiments with two different levels of laser power, 50 different experimental training combinations were carried out in total, as can be seen in Table 2. The typical KW and surface roughness (Ra) values presented in Table 2 were then thoroughly analyzed based on statistical methods in subsections 3.1.1 and 3.1.2 respectively.

Table 2 Experimental results

3.1.1 Analysis of kerf width values

At first, as can be seen from Table 2, a considerable amount of experimental tests has been carried out in this work, by varying the values of five process parameters within an appropriate range, relevant to the practical conditions often selected for this process. Furthermore, by observing the results of Table 2, it becomes obvious that the values of kerf width vary considerably within the range of different process parameters chosen for the experiment. More specifically, values as low as 0.46 mm up to 2.1019 mm were recorded under different conditions, indicating that the experimental investigation was comprehensive enough, as it was able to capture different probable cases of material removal during DLBM.

Thus, based on the analysis of the experimental results, the main effects plot regarding kerf width can be observed in Fig. 6. At first, it can be clearly observed that the largest variation occurs for the laser power even though its values vary in the range of 10–20 W, as it can induce a considerable variation of the produced kerf width. Furthermore, feed rate has also a noticeable impact on the kerf width as well as duty cycle, to a slightly lesser extent, whereas standoff distance (SOD) and frequency have a less pronounced effect on the variation of kerf dimension. In the case of laser power (LP), feed rate (FR), duty cycle (DC), and SOD, a clear increasing trend can be observed as the values of these parameters increase, implying that it is not favorable to use high values of these parameters when the main objective of the investigation is to achieve low kerf width, although some of these parameters are expected to increase the MRR. On the other hand, frequency exhibits a more complex trend by initially increasing kerf width, then decreasing its values and finally reaching a moderate value at the highest frequency.

Fig. 6
figure 6

Main effects plot of kerf width

The observed trends can be explained based on the mechanics of material removal during laser machining of titanium alloy. Increased laser power contributes to a higher amount of heat being introduced to the workpiece during LBM; thus, it is expected that the zone of melting and evaporation, as well as the HAZ, will be larger when a higher laser power is used. This is evident from most of the experimental works in the relevant literature and was particularly noted in the work of Son and Lee [33], who particularly correlated volume energy and kerf dimensions. The increase of kerf width by increase of SOD can be explained due to the changes in the focusing of the beam as well as energy distribution when it acts from different distances. At higher standoff distances, the divergence of the beam becomes higher, with the spot size being slightly increased. Thus, HAZ and material removal increase, with a wider kerf width developing.

Moreover, the trend between the cutting speed and the kerf width was found to be almost a linearly increasing one. In general, a higher feed rate is related to a shorter time of interaction between the laser beam and the workpiece, so a lower amount of energy is transmitted to the workpiece [52]. Thus, the increased amount of heat is not sufficient to evaporate or not even melt a considerable amount of material around the irradiated region, resulting in less material removal and narrower top kerf width, whereas also taper increases due to the increased lag and larger difference between the size of top and bottom kerf width. However, in a few cases where relatively high cutting speeds were used, it was shown that the use of higher feed rate can lead to higher kerf widths [53]. This phenomenon was observed by Rao et al. in sheet machining and was attributed to higher vibrations observed in the direction perpendicular to the cutting direction. A non-linear correlation of cutting speed and kerf width at different ranges of cutting speed was also noted by Darwish et al. [54] and Kondayya and Krishna [55]. A plausible explanation is relevant to the possibility of a “self-focusing” or “self-induced absorption” phenomenon due to variations in cut front formation [33], which can be caused especially by the formation of a dense and turbulent plume which leads to higher absorption [56] and eventually enlarges the kerf width. Moreover, this phenomenon can be relevant to the material properties of titanium workpiece, such as low thermal conductivity or higher thermal expansion coefficient leading to lower heat dissipation, higher heat accumulation, and material distortion.

The duty factor is relevant to the ratio of the time the laser is operating to the total cycle time, implying that for higher values, the laser is operating for the larger percentage of the cycle. Thus, higher values of duty factor can be directly correlated with larger kerf width, as more energy is delivered to the workpiece, leading to more intense melting and vaporization; thus, more energy is available to melt the top surface of the kerf [29, 57]. Moreover, the pulse frequency is related to the number of laser pulses per unit time, so that higher frequency is related to smaller exposure time for each pulse and results in a narrower kerf width but also less uniform [29], as it was observed for frequencies over 18 kHz in the present study.

Regarding the relative importance and significance of process parameters, analysis of variance (ANOVA) was performed on the experimental data, as can be seen in Table 3. Every parameter, except from pulse frequency, is proven to be statistically significant. This result is also confirmed based on the calculation of the delta factor in the results of the main effects plot by Taguchi analysis, as the most important parameter is proven to be laser power, whereas feed rate, duty cycle, SOD, and frequency have a smaller contribution to kerf width, accordingly.

Table 3 ANOVA results for kerf width

3.1.2 Analysis of surface roughness values

Regarding surface roughness, it can be observed from Table 2 that the values of Ra vary between 0.450 and 1.823 μm during the DLBM experiments. Thus, it is possible that the choice of the appropriate values of the different process parameters can provide the desired surface roughness value. The results depicted in the main effects plot of Fig. 7 present the main trends occurring for the surface roughness values in respect to the process parameters.

Fig. 7
figure 7

Main effects plot for surface roughness

Laser power is revealed to be again a dominant factor, as the increase of laser power between 10 and 20 W leads to a radical increase of surface roughness of the specimens in any case. Moreover, it is shown that the feed rate is also an important parameter for surface roughness given that it contributes to an almost linear increase of surface roughness. The duty cycle is another important parameter for regulating surface roughness, with a generally positive correlation with Ra, except for the range [70%, 80%] when a slight decrease of Ra was observed. Finally, SOD and frequency are the least important parameters with increased SOD resulting in a moderate increase of Ra and frequency leading to a slight increase of Ra up to 18 kHz and then to a slight decrease for values up to 20 kHz.

These trends can also be explained based on the specific characteristics of the laser beam machining process. When laser power is high, the more intense thermal phenomena such as melting and evaporation result in a rougher surface being developed. Moreover, the increase of duty cycle also contributes to a higher amount of energy being introduced to the workpiece leading to more radical alterations of surface topography, whereas the use of different frequencies seems to have a negligible effect on surface roughness as its variation is much lower than the variation induced by other parameters. Finally, in order to justify the importance of each parameter, in Table 4, the ANOVA results for surface roughness are depicted.

Table 4 ANOVA results for surface roughness

ANOVA results from Table 4 confirm the qualitative observations from the main effects plot for surface roughness as it is determined that all parameters except from frequency are statistically important for surface roughness, with laser power being the most important, followed by feed rate, duty cycle, and SOD. Thus, in order to achieve the desired surface roughness, it is important to choose appropriate values mainly for laser power, feed rate, and duty cycle. However, as it was stressed before, in practical applications, it is crucial to take into account multiple objectives, such as kerf width or MRR, as it will be performed in subsection 3.4.

3.1.3 Microscope observations of surface quality after DLBM

Finally, the experimental results prove the considerable potential of diode laser machining as the appropriate control of the process parameters can lead to the achievement of the desired kerf quality with sufficient precision. It was shown that the main parameters required to be varied for efficient regulation of kerf width are laser power and feed rate, as two of the main factors related to the heat input into the workpiece. However, the exact values of process parameters for efficient material removal by DLBM should be selected carefully in accordance with other performance metrics such as MRR, surface roughness, or power consumption, in order to be able to achieve not only high quality but also high productivity during DLBM of titanium alloys. Figure 8 shows SEM images of the bottom surfaces. Figure 8a is an image machined with 10-W laser power. It can be seen that the presence of pockmarks  may be caused by the expulsion of molten material or gas entrapment during the material’s solidification. There is also the presence of microcracks caused by thermal stress due to the rapid heating and cooling during the laser milling process. On the other hand, in Fig. 8b, it is shown the bottom surface machined by 20-W laser power. This image has more rough surface with an intense presence of craters; these are depressions on the surface but might be larger and have a more defined edge. Craters can result from the impact of the laser pulse on the surface causing material removal. Additionally, there is a presence of globules which are spherical particles that have solidified on the surface. They could be remnants of molten material that cooled rapidly into spherical shapes due to surface tension.

Fig. 8
figure 8

SEM images that show surfaces machined by 10-W laser power (a) and by 20-W laser power (b)

3.2 Development of regression model

In order to gain further insight into the correlation of process parameters and kerf width, it is important to develop a regression model, which will take into account the obtained experimental results. Thus, after some preliminary investigations, a non-linear regression model was developed, including only the statistically significant terms, as follows:

$$K_W=1.17661\ast10^{-6}LP^{0.843893}\ast SOD^{0.798317}\ast FR^{0.936836}\ast DC^{0.921306}$$
(5)

This model has a MSE value of 0.0095629, RMSE value of 0.09779, MAPE value of 6.866%, and ΜΑΕ value of 0.078629. The values of these metrics indicate that a sufficient level of correlation between process parameters and kerf width can be established through this model. However, it is also necessary to investigate further the possibility to develop a more precise and robust model based on AI techniques in order to be able to generalize more sufficiently and further reduce the errors.

3.3 Development of DT regression model

A relatively simple but useful machine learning (ML) model is decision tree (DT). This type of ML model not only is primarily effective for classification purposes but can also be used for regression. Especially, regression tree (RT) is an algorithm pertinent to the DT methods, which can be regarded as a supervised learning algorithm, able to handle both numerical and categorical values. The algorithm of RT is relatively simple, as it is partitioning the feature space in a recursive way, eventually creating smaller subsets based on the values of input parameters. Each node of the tree involves a splitting criterion in order to split the data into child nodes in the optimum way until the stopping criterion is true, e.g., maximum tree depth. The final model has a tree structure with the leaf nodes having the final predicted values of the target response based on specific values of the input parameters which lead to this node.

RT have various advantages such as direct interpretability, given that a visual representation of the tree structure can be obtained in order to under the function of the model and identify the most important features, handling of missing data, as the missing values are omitted during splitting, and establishment of non-linear correlations but they can suffer from overfitting, as most ML models, and they exhibit high computational costs due to large datasets.

The regression tree model developed in this work has a depth of four levels. The predictor importance estimated by the model is comparable to the results of ANOVA as the ranking of input variables was as follows: laser power, feed rate, duty factor, SOD, and frequency. Frequency had an importance value of 0 and was neglected in the RT model. The model was found to have a MSE value of 0.0148, RMSE value of 0.1216, MAPE value of 1.1741%, and ΜΑΕ value of 0.0822. The values of these metrics indicate that the model was able to capture the correlation between input parameters and kerf width at a satisfactory level. However, compared to the non-linear regression model, MSE, RMSE, and MAE values are slightly higher but MAPE value was considerably decreased.

3.4 Performance of developed ANFIS model

There were in total 50 machining experiments that were conducted on titanium alloy specimens under various process variables’ combinations, as shown in Table 2. The experimental values of KW which are presented in Table 2 are utilized as the training data in ANFIS under the aforementioned machining settings, and the experimental dataset was randomly divided into training (40 numbers) and testing (10 numbers) datasets. The training dataset is used to train the predictive models and the testing dataset is used to validate the performance of the models. Training an ANFIS model involves adjusting its parameters to fit the training data. Since the aim of the study is to accurately predict KW, ANFIS model is developed to estimate the KW based on the input factors.

In an ANFIS structure, the input membership functions and rule base are tuned by ANN by an iterative learning process. ANFIS architecture and training parameters are shown in Table 5. So, the ANN tunes the fuzzy inference system to adapt to the input variables to accurately predict KW. Figure 9 shows the scatter plot denoting the deviation from actual experimental KW with the predicted values for both training and testing datasets. It is inferred from these results that the FIS is tuned well using the ANN in the ANFIS model with an R2 value of 0.99 for the training data but have only 0.94 value for the testing dataset. Also, the error parameters MSE, RMSE, and MAE are comparatively larger for the testing dataset compared to the training dataset. This is mainly due to the overfitting issue. This justifies the requirement of an additional algorithm to tune the ANFIS structure to improve the prediction results.

Table 5 ANFIS parameters
Fig. 9
figure 9

Scatter diagram between measures and predicted values for both training and test data for ANFIS model

3.5 Performance of developed ANFIS model

Using a genetic algorithm (GA) to tune an adaptive neuro-fuzzy inference system (ANFIS) and then validating its performance is a powerful approach to achieve accurate and adaptable modeling for predicting kerf width. The ANFIS structure used in this model is similar as in Table 5 and the settings of GA parameters used to tune ANFIS are given in Table 6. The initial population of chromosomes with random parameter values is created. The crossover and mutation operations are applied to generate a new population of chromosomes. The iteration process repeats the selection, crossover, and mutation steps for a defined number of generations or until the convergence criterion is met. Thus, the GA-ANFIS model for predicting KW has been developed. The structure of the rule surfaces between the input and output variables for the GA-ANFIS model is shown in Fig. 10. Figure 11 shows the scatter plot denoting the deviation from actual experimental KW with the predicted values for both training and testing datasets for the GA-ANFIS model.

Table 6 GA parameters for tuning ANFIS
Fig. 10
figure 10

Rule surfaces show relationship between input and output variables in the GA-ANFIS model

The performance of GA-ANFIS is evaluated and compared with the ANFIS model using the performance measures such as MSE, RMSE, MAE, and coefficient of determination (R2) as shown in Table 7. As per the results, it is evident that the predicting performance of the GA-ANFIS model is better with less value for error parameters (MSE, RMSE, MAE) and a higher R2 value of 0.98. This makes sure that there is no or less overfitting as the prediction accuracy for testing data is better as compared to training data. In comparison with the nonlinear regression model and the regression tree model, both ANFIS and GA-ANFIS exhibit superior MSE, RMSE, and MAE values.

Fig. 11
figure 11

Scatter diagram between measures and predicted values for both training and test data for the GA-ANFIS model

Table 7 Performance comparison of ANFIS and GA-ANFIS models

The performance of developed GA-ANFIS was additionally compared with standard regressions models such as linear regression, support vector machine (SVM), multilayer perceptron neural network, Gaussian process regression, and kernel regression, apart from the regression models presented in subsections 3.2 and 3.3 in order to conduct a more comprehensive comparison with state-of-the-art models. These models are developed using MATLAB Regression Toolbox. Table 8 shows the performance comparison of the GA-ANFIS model with standard regression models for testing data. As the GA-ANFIS model exhibits less value for error parameters (MSE, RMSE, MAE) and a higher R2 value, it can be inferred that it has better prediction characteristics.

Table 8 Performance comparison of GA-ANFIS models with standard regression models

3.6 Fivefold cross-validation of the GA-ANFIS model

Fivefold cross-validation is a technique used to assess the performance of the prediction model by dividing the dataset into five subsets or folds. The model is trained and evaluated five times, each time using a different fold as the validation set and the remaining folds as the training set. This method provides a more robust estimate of a model’s generalization performance compared to a single train-test split. The dataset is randomly divided into five equal-sized subsets (folds). During each iteration, four folds are used for training, and the remaining fold is used for validation. Fivefold cross-validation provides a more reliable assessment of a model’s performance compared to a single train-test split, as it accounts for potential variations in the data distribution. It reduces the risk of overfitting or underfitting on a particular dataset partition. Table 9 shows the performance comparison of ANFIS and GA-ANFIS models for different trials of fivefold cross-validation. A box plot, also known as a box-and-whisker plot, is a graphical representation that displays the distribution of a dataset. It shows the median, quartiles, and potential outliers of the data. In fivefold cross-validation, a box plot can help visualize the distribution of performance metrics MSE, RMSE, MAE, and R2 across the different folds as shown in Fig. 12. The GA-ANFIS model has less variation of the performance metrics compared to the ANFIS model across different folds. It infers that GA-ANFIS for predicting kerf width model has less dependency on dataset, thus avoiding overfitting or underfitting issues.

Fig. 12
figure 12

Box plot of performance metrics for different prediction algorithms in different trials of 5-fold validation

Table 9 Performance comparison of ANFIS and GA-ANFIS models for different trails of 5-fold cross-validation

3.7 Multi-response optimization of kerf width and surface roughness

In the present work, an attempt was made to adopt gray relational–based (GRB) multi-response optimization (MRO) for optimizing kerf width and surface roughness along with prediction of kerf width [58, 59]. The GRB approach can provide better accuracy among many MRO methods. The steps involved in GRB approach are explained in Fig. 13.

Fig. 13
figure 13

Steps involved in GRB multi-response optimization

Both kerf width and surface roughness were considered as the smaller the better characteristics. The surface roughness of the cut surface was quantitatively measured using an Alicona Infinite focus microscope by setting up lambda filter (cut off wavelength) on 80 μm. Each machining trial was replicated two times to enhance the measurement accuracy and all the performance characteristic values used for analysis was obtained from computing the mean values of the repeated trials. The surface roughness was measured from scanned areas at the centre of the machined surface [60]. Table 10 shows the S/N ratio and normalized S/N ratio of the performance measures. Table 11 depicts the gray relational coefficient of all the experiments conducted.

Table 10 S/N ratio and normalized S/N ratio arrays
Table 11 Gray relational coefficient and gray relational grade for the present study

The average gray relational grade on all levels of significant factors shows that the overall effect of responses in the machining process is as shown in Table 12. Therefore, average gray relational grade analysis was utilized to obtain the most optimum combination of process parameters on all levels. It was noted the highest average gray relational grade as 20 W (level 2) for laser power, 22 mm (level 5) for standoff distance, 300 mm/min (level 5) for feed rate, 85% (level 5) for duty cycle, and 18 kHz (level 3) for frequency.

Table 12 Identification of optimal process parameters

The confirmation tests were carried out to ensure the accuracy of the optimal process parameters and optimization algorithm. In the confirmation experiment, the machining trial has been conducted with the computed optimum machining parameters and the responses have been analyzed. The estimated gray relational grade can provide an expected value of gray relational grade after the confirmation test [61]. It is used to statistically determine the amount of improvement that can be expected by machining the material surface with the optimized machining parameter combination. The estimated gray relational grade (Gp) for the optimum level of process parameters is computed from Eq. 6.

$$G_{p}=G_{m}+\sum(G_{i}-G_{m})$$
(6)

Where Gm denotes the mean of the overall gray relational grade and Gi denotes the mean of the gray relational grade for each process parameter on its optimum level. The gray relational grade has been improved by 4.59% from the gray relational grade for the optimized parameters.

4 Conclusion

In this investigation, an endeavor was made to introduce the GA-tuned ANFIS technique to predict kerf width while machining grooves in a titanium alloy workpiece using the DLBM process and use gray relational–based optimization approach in order to determine the optimum process parameters for minimizing both kerf width and surface roughness. After the experimental results were analyzed by appropriate statistical techniques, the ANFIS model and GA-tuned ANFIS model were validated by comparing values from prediction and experiments for analyzing the accuracy of models and the GRB method leading to the optimum parameters for improving part quality. The following conclusions were made based on the observations:

  • The analysis of the experimental results revealed that laser power plays the most significant role regarding kerf width, followed by feed rate, duty factor, and SOD, whereas frequency is not statistically significant. Moreover, similar conclusions were also drawn regarding surface roughness, with laser power being again identified as the dominant parameter for regulating surface quality.

  • Non-linear regression model and regression tree model can provide a sufficient level of accuracy for the prediction of kerf width with the former exhibiting lower MSE, RMSE, and MAE values, whereas the latter exhibits lower MAPE.

  • The FIS was effectively fine-tuned using an artificial neural network (ANN) within the ANFIS model, yielding an impressive R2 value of 0.99 during training. However, this strong performance might suggest overfitting, as the R2 value dropped to 0.94 when tested on an independent dataset.

  • The predicting performance of the GA-ANFIS model was better with less value for error parameters (MSE, RMSE, MAE) and a higher R2 value of 0.98. This infers that the problem of overfitting is reduced significantly by usage of genetic algorithm. Moreover, both of these networks outperform the non-linear regression and regression tree models.

  • Additional comparison of the proposed GA-ANFIS models with various regression models such as linear regression, SVM, Gaussian process regression, multilayer perceptron neural network, and kernel regression proved that this model outperforms every other model in terms of MSE, RMSE, MAE, and R2.

  • In the fivefold cross-validation, the GA-ANFIS model has less variation of the performance metrics compared to the ANFIS model across different folds. This implies the dependency of the prediction model on the training data is reduced significantly by usage of genetic algorithm.

  • The optimal process parameters were found as 20 W (level 2) for laser power, 22 mm (level 5) for standoff distance, 300 mm/min (level 5) for feed rate, 85% (level 5) for duty cycle, and 18 kHz (level 3) for frequency.

Finally, it should be noted that it is planned to improve the prediction accuracy by implementing deep neural networks and other advanced machine learning algorithms in future as well as developing real-time online prediction and preventive control system for the DLBM process.