Bulging behavior
In this section, the bulging behavior of the different tool paths described in section 2.2 is summarized for a reduced selection of geometries and pressure levels. The development of the product height ha(i) can be seen in Fig. 6. Fundamental findings indicate the great influence of the pressure p and the number of cycles n on the product height. Either a higher pressure or a higher number of cycles leads to a larger product height. However, when the pressure was set too high and cracks occurred, the forming process could not be continued and the parts did not achieve the maximum height that was obtainable with a lower pressure. Other than the curve of manufacturing a middle-sized part with a pressure of 0.475 bar, which has an almost homogeneous slope at any point, most curves are characterized by a non-linear trend with an increasing growth rate. The height difference Δha(i), Eq. 3, describes this issue in more detail and is shown in Fig. 7.
$${\varDelta h}_a(i)={h}_{\mathrm{a}}(i)-{\mathrm{h}}_a\left(i-1\right)$$
(3)
Especially, the small parts, Fig. 7a, and the middle-sized parts, Fig. 7b, show an almost linear development of the curve. Corresponding to Fig. 6b, the height difference does not increase for the pressure p = 0.475 bar. In contrast, the large parts behave different at the beginning, where the tool start point is closer towards the center of the blank. The height difference of the large parts is almost constant and lower than the height difference of the smaller parts within the first ten cycles. It can be deduced that the pressure has a lower influence on the height growth within these cycles. However, beyond the 10th cycle, the influence of the pressure is considerable and the height difference increases. Regarding all shapes, a high risk of cracks can be related to a combination of high pressure values and height differences Δha(i) ≥ 3 mm.
The development of the vertical force Fa(i) is depicted in Fig. 8 where, generally, a higher pressure causes a higher force. The force is initially in a range from 230 N to 300 N and increases strongly during the first ten cycles. After that, the forces curves flatten, stagnate or even decrease in some cases. The vertical force reaches values between 360 N and 460 N in the end of the manufacturing process. It can be concluded that in high pressure situations, forces Fa(i) ≥ 460 N are an indicator for imminent cracks. Moreover, the force increases at a slower rate for larger parts than for the smaller ones. For example, in the fifth cycle where pressure p = 0.575 bar, the force Fa(5) amounts to 304 N for a large part whereas it is already at 405 N for a middle-sized part. A possible explanation for this is the fact that the start point of the tool is located closer towards the blank’s center and consequently the forming condition might be different there.
In summary, there is no linear or constant relationship between the product height ha(i) and the pressure p. Additionally, the parameters are continuously changing throughout the manufacturing process, rendering it more prone to failure. Prior to using ANNs for learning the interrelations among the parameters, the data generated from the experiments is first described and undergoes pre-processing in the following section.
Data description and extrapolation
During training, the ANN used in order to predict pressure has five inputs and one output, namely: Geometry (Table 1), circle diameter of the tool path da(i), current cycle i, height difference Δha(i), average forming force Fa(i) and pressure pt(i), respectively. While geometry is an ordinal categorical variable, all other variables and outputs are continuous. The original dataset consists in total of 803 observations for all geometries, as shown in Table 3. From there on, it undergoes a series of adjustments. First, invalid cycles within the dataset are removed. Invalid cycles are defined as instances after a crack appears in the part. On the one hand, these instances give rise to false measurements of the height, and on the other hand, the forming process cannot be continued once a crack appears, since the pressure in the pressure chamber can no longer be maintained. A benchmark ANN is trained using the remaining 782 observations and compared with a better performing one that is trained with extrapolated data. The reasons behind extrapolating are two-fold: first, to project and extend the known pressure-height relationship into areas which are not covered by the experiments, and second, to generate more observations. For all points, the extrapolation occurs linearly and begins at p = 0.35 bar, which is considered to be the minimum pressure at which deformation can take place for pure Aluminum, as used with comparable hardening states in [1] and in the experiments conducted in this work. This procedure can be seen in Fig. 9, as well as in Table 4. Though the extrapolation is linear, it was only performed on each cycle independently. Consequently, it does not affect the non-linear relationship between the product height ha(i) and the pressure p.
Table 3 Number of samples within the dataset, before and after eliminating anomalies and extrapolation Table 4 Two consecutive cycles for a small-sized truncated cone after extrapolating the first cycle After extrapolating, the data contains 3128 observations to be used for training and testing purposes. The input and output variables were normalized using the max-min normalization method before training. Each independent variable x was mapped to the range [−1, 1] through the transformation by the expression:
$$\frac{\left(x-{x}_{\mathrm{min}}\right)}{\left({x}_{\mathrm{max}}-{x}_{\mathrm{min}}\right)}\ast \left(\max -\min \right)+\min$$
Where min, max are the defined feature ranges, x the current value and xmin, xmax the minimum and maximum value in the dataset respectively. Similarly, each independent variable y was mapped to the range [0, 1].
Offline pressure prediction
In the case presented in the paper, where the target is a continuous variable, the mean squared error (MSE) is chosen as the loss function to be minimized. MSE measures the average square difference of the experimental (actual) value and the value predicted by the ANN. Additionally, the non-linear activation function σ for the input and hidden layers is selected to be the Rectified Linear Unit (ReLU), with the final output layer undergoing no transformation. ReLU converges relatively fast and is efficient in multiplying gradients over multiple layers [27], which makes it one of the most widely used non-linear activation functions for deep networks. The exhaustive search showed that the best performing model consists of 3 hidden layers with 500, 250, and 150 neurons respectively in each layer. Further hyperparameters are presented in Table 5. To properly estimate the generalization error of the learner, the dataset is split into three subsets, namely training (90%), validation (5%) and testing (5%). The test set is not involved in training the learner, and is only called after training to estimate the true generalization error of the algorithms as best as possible. The model is ultimately trained using the listed hyperparameters 25 times, where the data is re-shuffled for every replication, resulting in a different train / test split for every run. The latter is performed to account for the stochastic nature of neural networks. The average of these 25 runs for the model trained on the extrapolated dataset is shown in Table 6b. A different ANN model was trained on non-extrapolated data and serves as a benchmark to compare the results with (Table 6a). This model undergoes an exhaustive search as well, which results in a different set of hyperparameters to be used. Moreover, other machine learning models such as linear regression and polynomial regression were found to be less accurate, as depicted in Table 6c, d. The models achieved satisfactory results across multiple performance metrics, with model b outperforming its counterparts. The relatively high value of the coefficient of determination R2 = 0.984 achieved indicates that the model explains the variance of p with respect to the input parameters sufficiently well. The mean absolute error (MAE) presented for the test set shows that on average, a prediction of a new pressure point in the range of [0,1] deviates 0.018 bar from the real value of said pressure point. Finally, it can be deduced from Fig. 10 that model b is not overfit on the training data; the loss (MSE) decrease for the training and validation sets is relatively close, and the model can thus generalize its learning on samples it has not come across before. Therefore, model b is selected to be used in the following sections for controlling the pressure levels.
Table 5 ANN model parameters for the extrapolated dataset Table 6 Performance across several metrics for: a ANN non-extrapolated data and b ANN extrapolated data c linear regression d polynomial regression Influence on geometrical accuracy
The geometrical accuracy consists of the height difference of each cycle along with the final accumulated height. At first, the development of the height difference during the manufacturing process is analyzed for constant and dynamically adjusted pressure, in Figs. 11 and 12 respectively. Ideally, the height difference Δha(i) should be 1.75 mm for each cycle. If it is not the case, then the final height would either be exceeded or not achieved. For the uncontrolled process, the highest pressure levels from Table 1 which did not lead to cracks were selected as constant setpoints. From Fig. 11, it can be seen that the height difference starts of smaller than the value of 1.75 mm for all parts. It increased continuously to reach values exceeding 1.75 mm thereafter. The height difference had also reached the critical value of 3 mm during the manufacturing of the large part. As for the case where pressure was dynamically adjusted, it can be seen in Fig. 12 that any deficit or excess of the height difference was compensated by the controller. Furthermore, the control concept helped prevent the height difference from exceeding the critical value of 3 mm for all geometries.
The comparison of the final height for parts manufactured using constant and dynamically adjusted pressure is shown in Fig. 13, where the final contour was measured by a 3D-scanner. In the case of constant pressure, the small and middle-sized parts exceeded the target height value whereas the large part remained well below it. Using dynamically adjusted pressure, all parts underwent significant improvements in terms of accuracy. It is also worth noting that the final height reached was consistently slightly above the target height. This is caused by elastic bulging which occurs at the beginning of the forming process along with springback at the end after pressure release in the chamber. These phenomena lead to a mismatch between the height measured by the laser sensor and the height determined by the 3D-scanner. Ideally, this mismatch can be compensated through adjustment of the initial height hoff according to the values in Table 7, which was not performed for the experiments.
Table 7 Elastic bulging prior to the first cycle and springback after pressure release as average value for all geometries By controlling the product height ha(i), the wall angle α was indirectly influenced. The wall angles α corresponding to the cross sections of Fig. 13 are shown in Fig. 14. The wall angle never remained at a constant value without the control and also exceeded the target value of 60.26° defined in Table 2. In contrast, the controlled process led to a wall angle which is more in agreement with the target value. Thus, controlling the pressure can lead to more homogenous sheet thinning in the part wall.