Abstract
Fiber reinforced polymers (FRP) provide favorable properties such as weight-specific strength and stiffness that are central for certain industries, such as aerospace or automotive manufacturing. Liquid composite molding (LCM) is a family of often employed, inexpensive, out-of-autoclave manufacturing techniques. Among them, resin transfer molding (RTM), offers a high degree of automation. Herein, textile preforms are saturated by a fluid polymer matrix in a closed mold.Both impregnation quality and level of fiber volume content are of crucial importance for the final part quality. We propose to simultaneously learn three major textile properties (fiber volume content and permeability in X and Y direction) presented as a three-dimensional map based on a sequence of camera images acquired in flow experiments and compare CNNs, ConvLSTMs, and Transformers. Moreover, we show how simulation-to-real transfer learning can improve a digital twin in FRP manufacturing, compared to simulation-only models and models based on sparse real data. The overall best metrics are: IOU 0.5031 and Accuracy 95.929 %, obtained by pretrained transformer models.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
1 RTM and permeability measurements
When it comes to weight-specific mechanical characteristics like strength and stiffness, fiber reinforced polymers (FRP) are superior to other engineering materials. FRP, especially when using carbon fibers, can replace conventional steel or aluminum, which lowers fuel use and CO2 emissions throughout the product lifecycle. These composite materials are built up from a polymer matrix that has fiber reinforcement. Liquid composite molding (LCM) techniques are some of the most well-known and cost-effective out-of-autoclave manufacturing methods for FRP components production. For medium volumes (1,000 to 10,000 components per year), resin transfer molding (RTM, [1]) is a regularly used manufacturing method. Under pressure, a liquid thermoset polymer (a resin) is injected into a mold cavity containing the reinforcement material, such as a fabric made of carbon or glass fibers (called the preform). During the injection procedure, this results in a “flow front” that separates saturated from dry material (cf. Figure 3).
The permeability properties of the preform, which explain how well a porous medium transmits fluids, have a major impact on the dynamics of the flow front. In FRP, (i) fiber volume content (\(V_f\)) and (ii) preform layout primarily determine the preform permeability. Localized changes in preform permeability can take place at locations where the wall thickness varies or at curved sections of the component (compression or even folding of the preform on the inner surface of the curvature). Additionally, they could result from manual treatment of the fibrous structure or from the material’s flawed qualities such as missing or misplaced fiber bundles (e.g., fringing of fiber bundles, breakage of filaments). For one type of textile, the permeability might vary by up to 20 %, according to Tifkitsis [2]. Local differences in permeability might result in subpar impregnation quality or even dry areas, which will reduce the mechanical performance of the final composite part. These variations also affect the temporal progression of the flow front. Before or during an industrial RTM process, these changes are often unknown for a specific preform; therefore, it would be ideal to have a map of the permeability and \(V_f\) values in order to identify problems as soon as possible.
We suggest using machine learning (ML) models that, given an observed sequence of flow front images during an injection procedure in a permeameter, predict changes in the permeability and \(V_f\), resulting in a sequence-to-instance task (cf. Figure 1). The training data is collected from two sources: (1) flow experiment data from a permeameter, and (2) flow simulation data from PAM RTM, a commercially available FEM simulation tool for RTM processes (see Sect. 2). This assignment is a component of a broader pipeline that ultimately results in the prediction of a textile property map from in-mold sensory data. For such, an intermediate step that converts sensors to flow front images is required, such as employing transposed convolutions [3], generative models, or analytical techniques [4]. To sum up, our contributions are as follows:
-
For this sequence-to-instance task in an engineering context, we compare several ML models (see Tables 4 and 5).
-
We demonstrate the effectiveness of sim-to-real transfer learning in low data regimes by using transfer learning to adapt a model trained in the simulation domain to real data (see Table 6).
This method can be viewed as a component of the process’ digital twin [5]. The component can be regarded as acceptable if it does not exhibit excessive fluctuations in permeability and \(V_f\).
This paper substantially extends preliminary results discussed in [6]: In order to collect real injection data, we conducted additional physical experiments. We used real data not only for evaluation but also for training purposes and measured the quality effects. Finally, we extended the models to not only predict deviations of \(V_f\) but also of the permeability in X and Y direction. We also take into account textiles with anisotropic characteristics which increases both the difficulty and the realism of the proposed scenario.
1.1 Related Work
The composites processing literature has taken an interest in online measurements of permeability or its variation. The issue of in-situ monitoring the filling status of RTM processes using lineal sensors is taken into consideration by Tifkitsis et al. [2]. They use Kriging models to forecast the arrival times of the flow front at specific sensors. The same simulation program we used, PAM-RTM®, is utilized to generate the RTM simulations that are needed to fit these substitute models. However, we take into account the entire spatial expansion of the flow front (i.e., a high-dimensional prediction) while they just take into account two measurement lines across a plate. By using Darcy’s law [7] to calculate it, Wei et al.[8] estimate global and local permeability during the injection with pressure and flow front sensors:
with volume-average flow velocity \(\varvec{\upsilon }\), fluid viscosity \(\eta \), permeability tensor \(\textsf{K}\), and pressure gradient \(\nabla p\). Simulating liquid composite molding relies heavily on Darcy’s law. The planar, anisotropic permeability tensor is modeled as follows for 2-dimensional flow (which is covered in the work at hand):
\(k_x\) and \(k_y\) represent the permeability in the x and y directions, respectively, whereas \(k_{xy}\) describes how the flow in one major direction is dependent on a pressure gradient in the other major direction. A Regularising Ensemble Kalman filter Algorithm (RENKA) was demonstrated by Matveev et al. [9] as a means of detecting permeability changes throughout an RTM process. To reduce the computational complexity of pure Bayesian inversion, they introduce this filtering technique. However, the algorithm is less usable than a fully data-driven ML approach because it requires nontrivial mathematical steps and must be modified for new component designs. CNNs were utilized by Gonzales et al. [10] to monitor changes in the flow front caused by pressure sensors to detect permeability changes. However, they are only able to identify discrete, rectangular changes in permeability by using the data of all recorded runs. Our method is more flexible since it creates the full permeability map of the preform rather than just identifying specific elements of the patch, like its length, width, or center point. Moreover, the flow fronts in each of the aforementioned works differ because they all employ a lineal injection gate rather than a central one. CNNs are used by Caglar et al. [11] to measure the permeability of textiles at the microscopic level. Other papers integrate RTM analysis and ML in addition to online permeability estimation: Stieber et al. [3] describe a learning-based dry spot classifier that solely operates on simulated data. Transfer learning from simulated to actual data is a suggested research area, particularly for low-data regimes. Weiss et al., Stieber and Hofer et al. [12,13,14] All the above works employ rectilinear flow, whereas our work employs radial flow. From the manufacturing perspective, this a more realistic case since it is two dimensional instead of one dimensional and thus is applicable to a broader range of processes. Besides online permeability estimation, other papers combine RTM analysis and ML: Stieber et al. [3] present a learning-based dry spot classifier, working only on simulated data. Especially for low-data regimes, transfer learning from simulated to real data is a suggested research direction [12,13,14].
2 Data regime
Training neural networks for online prediction of preform textile properties usually requires a large amount of data paired with knowledge about material properties and experimental parameters. Such data sets are rarely available, as (i) RTM runs are elaborate in terms of experimental time as well as cost, and (ii) RTM molds, that are industrially employed and not prototypically or in landmark research projects, typically show a very limited number of sensors, that are mostly limited to only temperature and pressure signals [15,16,17].
In this work, we propose to use data sets from both, real experiments and simulation runs, in order to outperform models based on simulation data only. In particular, the data sets used for this work stem either from the optical permeameter testrig depicted in Fig. 3 or from an FEM-based flow simulation, run in PAM-RTM®.
2.1 Materials
2.1.1 Test fluid
Comparative rheometric tests revealed that the color pigment does not affect the viscosity of the test fluid.
For the flow experiments on the optical permeameter, standard plant oil was used as a test fluid. This is common practice in experimental permeability characterization [18] to avoid handling of chemically reactive thermosets. The viscosity of the test fluid was experimentally characterized in a temperature range between 15 and 30 °C, which covers the range typically seen in non-conditioned research labs. In this range, a nearly linear decreasing trend was found between about 90 and 48 mPas. Thus, the viscosity of the test fluid is well comparable to that of uncured (this is the relevant condition during impregnation of the fibrous preform) epoxy resin commonly used in RTM. In our flow experiments, the fluid temperature was measured in the feeding line and then interpolated into the above-mentioned characteristics in order to obtain the fluid viscosity relevant for the respective experiment. In order to enhance the contrast of saturated against unsaturated preform regions in the digital images acquired during the flow experiments, the test fluid was colored with a red color pigment (Sudanred IV, Sigma Aldrich). Comparative rheometric tests revealed that the color pigment does not affect the viscosity of the test fluid.
2.1.2 Fabric
The experiments were run with a glass fiber, 2/2 twill woven fabric, type Hexion 1202 of Hexcel, with a nominal areal weight of 290 g/m² and in-plane permeability characteristics as listed in Table 1. The material is commercially available and well known from recent international benchmark exercises (IBE) on in-plane permeability characterization [18]. The research group co-authoring this paper has participated in these IBE with their optical permeameter and the in-plane permeability data used for the simulation runs were found by interpolating into the corresponding results of principal in-plane permeability \(k_x\) and \(k_y\) vs. volume-averaged fiber volume content \(V_f\). This \(V_f\) was estimated according to
with the number of layers n, areal weight (grammature) \(m_A\), cavity height h and fiber mass density \(\rho _f\). The areal weight was determined by weighing each preform prior to the flow experiment. The mass density of the glass fibers was taken from the technical data sheet with 2550 kg/m³. The material shows principal flow directions well aligned with the orientation of the woven fiber bundles and thus: \(k_{xy} \approx 0\), cf. Equation 2.
2.1.3 Preform and Patches
The flow experiments and resulting data presented in Sect. 2.2 are based on specifically manipulated preforms of the woven fabric. The preform is created by homogeneous stacking of the fabric layers in uniform orientation A rectangular patch of the fabric was introduced in a particular location of the preform, varying in terms of: (i) number of layers, (ii) orientation of the fabric, (iii) \(V_f\), and thus, (iv) in-plane permeability. Table 1 lists the most relevant properties of preform and patch region.
Several different patch locations were chosen for the flow experiments to provide a meaningful data set for the subsequent training of the neural network. Figure 2 shows the geometric details of such a configuration.
2.2 Data from flow experiments
2.2.1 Optical permeameter
For the study at hand, an optical permeability characterization cell (briefly termed permeameter for the remainder of the work) was used, which follows the radial flow technique combined with optical flow front tracking. Using an optical permeameter offers the advantage that almost the entire flow front can be tracked in the form of planar images instead of sensory time series at certain locations only or, in current industrial practice, even no in-mold sensors at all. Moreover, the flow front images acquired (cf. Figure 3) align closely with those from an FEM-based process simulation (cf. Figure 1) which enables a sim-to-real transfer approach.
The experimental data used for this work was acquired with the permeameter depicted in (Fig. 3). The mold of this cell can be seen as a flat plate RTM mold with a central injection gate and optically transparent top mold half, which allows for tracking of the flow front using an industrial camera system [19].
2.2.2 Image and data processing
To use the experimental data for passing them through a model trained on simulation-only data, the acquired sequence images had to be pre-processed (cf. Figure 4): (i) The images were rotated to align them with the images created by the simulation, (ii) empty zones outside the fabric were removed, (iii) image areas showing the stiffening frame of the mechanical setup were removed and (iv) parts of the flow front, occluded by the stiffening frame, were supplemented by a specifically developed, automated mechanism. The latter involves fitting an elliptical geometry model to selected data points along the fluid flow front [19] and extrapolating parabolic models for the major and minor ellipse axis length [20], respectively.
2.2.3 Experimental test plan and data augmentation
Two series of flow experiments were run with specific dedication for the work at hand which are briefly termed RealPermSmall (from 2020) and RealPermLarge (from 2021). Table 2 provides an overview of the entirety of data sets.
To benefit the most from this limited number of real-world samples, the injection experiments were designed such that the patches were located in one quadrant of the preform only. Data augmentation through flipping the resulting image sequences along the \(x-\) and \(y-\)direction, respectively, yielded additional image sequences
2.3 Data from flow simulation runs
To obtain data from simulation runs, briefly termed the simulated data, a strategy comparable to previous works of the authors, in particular, [3] and [6], was followed involving an automated pipeline: Starting with a two-dimensional representation of a preform with anisotropic material properties, briefly termed preform, small patches (rectangular or circular) with varying in-plane permeability and \(V_f\) were randomly inserted. Subsequently, fluid flow through the preform was numerically predicted through a commercial RTM simulation software, resulting in a sequence of “label images”, or briefly labels (see Fig. 1), over the percentage of preform filling.
Simulation data were generated in several evolutionary stages, Table 3 gives an overview of the evolution of data sets generated for this work. The difference in \(V_f\) of the 2021 and 2022 data, respectively, and the setup specified in Table 1 arises from slight deviations between nominal and real values of the areal weight of the fabric. The latter were measured directly before starting the flow experiments and were found within the tolerances specified in the technical data sheet of the material.
The most important aspects of the data sets are specified in the “properties” column of Table 3. It is an evolution towards more realistic flow simulations. The first campaign, SimIso in 2019, represents a very first try targeting a single image classification task, where the sampling over time did not matter. The SimIsoCustom data set from 2020 introduced regular sampling but was still based on isotropic material with (unrealistically) strong changes in \(V_f\). With the creation of the SimAniso data set in 2021, the simulation was driven further towards reality: preform properties were chosen according to real flow experiments and anisotropic in-plane permeability was introduced as an extra layer of complexity. Furthermore, \(V_f\) for both, preform and patch region, was sampled according to the probability found in the flow experiments (cf. Figure 12 in the Appendix). Finally, SimAnisoChannel represents a data set covering geometrically small (i.e. 1..2 mm) flow channels along the patch edges in order to reflect “race tracking”, an effect well known for LCM [21].
In Fig. 5 we show the timing of simulation and reality in two different cases: one without a patch of different \(V_f\), thus homogenous preform and one with a patch of higher \(V_f\). It shows great correspondence of the simulation carried out with PAM RTM and the real experiment, the flow in y direction in the particular example with patch is a little slower.
In the following subsections, particularities of flow simulations in preforms with (i) anisotropic in-plane permeability and (ii) inhomogeneities introduced by patches with properties deviating from the preform are described. These are particularly important for the application of machine learning techniques in the FRP domain as the simulation-reality gap is widened, if not taken care of.
2.3.1 Flow in isotropic preforms
In flow simulations on isotropic material, the material properties can directly be derived from each other, i.e. increasing \(V_f\) results in a lower in-plane permeability, and the changes are equivalent in \(x-\) and \(y-\) direction. In other words, an ML model predicting \(V_f\) would automatically allow for an analytic computation of in-plane permeability, assuming that a corresponding equation is known for a certain material. In addition, the orientation of the patch is irrelevant as it does not affect directional permeability values.
2.3.2 Flow in anisotropic preforms
Restricting an ML model to predict only \(V_f\) in an anisotropic material, information about in-plane permeability and their principal directions cannot be deduced. However, it is crucial to know the principal flow directions of the fabric to be able to reason from fluid flow, in particular when introducing patches with unknown fabric orientation. Hence, consideration of in-plane anisotropy was the major driver towards a model that predicts all three properties, i.e. \(V_f\), \(k_x\), and \(k_y\), from sequences of flow front images.
In Fig. 6 (left), the effect of anisotropic in-plane permeability in a preform with a rectangular patch is highlighted with two different patch locations. The patch exhibits a higher number of fabric layers and thus, a higher level of \(V_f\). Moreover, the orientation of the fabric layers in the patch is rotated by \(90^{\circ }\) with respect to the remaining preform, which is indicated by the arrows pointing in the direction of major in-plane permeability. The top left image shows the expected impact: the flow front is trailing inside the patch region as a result of reduced permeability in the vertical flow direction. However, the bottom left image shows an opposite situation: the flow front inside the patch region is leading compared to the surrounding preform. This effect is caused by (i) the particular relation of in-plane permeability values (see Table 1) and (ii) the level of pressure gradient driving the fluid flow in this particular region of preform and patch, respectively. Although numerically correct, this flow pattern is not observed in the flow experiments. There, the flow front is trailing inside the patch region, which is explained by an additional effect, known as “race tracking”.
2.3.3 Race tracking
In the simulation environment, the edges along the patch region can be implemented as “ideal”, i.e. without any gap between preform and patch. The mesh properties change instantaneously from a cell of the preform to a cell in the patch region. However, this is not reflecting reality. There, despite the highest possible efforts in manual handling, a gap remains which forms a flow channel around the patch. This channel, typically in the size of 1..2 mm, shows a level of axial permeability which is typically one to two orders of magnitude higher compared with the surrounding fibrous and thus, causes the fluid flow to advance along the edges of the patch. This effect is well-known for LCM and referred to as “race tracking” [21].
To reflect this effect in the flow simulation, 2 mm wide flow channels were added around the rectangular patches of the data set SimAnisoChannel. The permeability of the corresponding mesh cells was specified with an equivalent isotropic permeability, also employed in this context by Bickerton et al. [21] which is one order of magnitude higher than the major in-plane permeability of the preform. The results of these changes are shown in Fig. 6 (right) and are considered to be well in line with observations in the flow experiments.
2.4 Final remarks
Although a fairly “basic” situation with strictly two- dimensional flow in a plate-shaped mold and homogeneous fabric material was considered in this work, flow simulations meeting the real-world situation require a decent amount of expert knowledge. However, such flow simulations scale very well. Flow experiments by contrast are very costly in a lab or industrial environment and they do scale only linearly. As a result, limited data is available.
To combine data from flow experiments and flow simulations, image and data pre-processing are required as shown in Fig. 4. These pre-processing steps also include the sampling of the image sequences over the filling percentage of the visible flow front, while adding padding to create sequences of the same length.
3 Approach
The modified textile property maps provoke altered flow fronts and need to be rediscovered by the ML models. These models get a sequence of flow front images as input, as shown in Fig. 1. In previous work [6], we predicted the \(k_x\) permeability as a first feasibility check of this approach. In this work, we predict all three major properties of the fluid flow through the textile:
-
\(V_f\) - volume-averaged fiber volume content
-
\(k_x\) - permeability in x direction
-
\(k_y\) - permeability in y direction
3.1 Data split
The training/test and validation set were fixed over all experiments. All real data samples flipped both in x and y direction made up the test set, while the training and validation sets were made up of the remaining three flip configurations: not flipped, flipped in x, and flipped in y direction.
This is a reasonable split of the data since all our models use convolutions for feature extractions. Convolutions are invariant to translation but, in general, not to rotation [22], so the model must generalize well, to be able to predict dry spots in places with a different flow front direction.
3.2 Sim-to-real transfer learning
We use Sim-to-Real Transfer Learning for our models. First, the models are trained on a larger data set from simulation only. Then, the trained networks are trained again on a smaller data set containing data from real experiments with a smaller learning rate. Decreasing the learning rate in the second training step helps to not alter the weights of the network too much, which can lead to decreasing performance. In general, this approach enables the usage of much larger and more capable models for data sets that are normally too small, as described in Sect. 1.1. In this Transfer Learning approach, the model trained on simulated data is used as an initialization point or warm start for the model trained on real-world data. This helps the model trained on real-world data to learn faster and achieve better performance.
3.3 Models
All models are trained end-to-end, from the input sequence of flow front images to the property map (cf. Figure 7).
Several neural networks are suitable to address this 3-dimensional sequence-to-instance task. As a baseline model, we employed a convolutional neural network (CNN) with four conv2D layers. From a subsampling step, we get 100 single-channel images of the time steps of the injection process. Since 2D convolutions work over any number of channels, we use the 100 time steps as independent channels for the first convolutional layer.
The second proposed model relies on the Transformer mechanism [23] which works on one-dimensional embeddings. Hence, the input image sequence needs to be converted into an embedding sequence. To do so, a fully convolutional encoder creates feature vectors for the individual images and is trained end-to-end in the sequence-to-instance pipeline. For the encoder part, the sequence length is fused with the batch dimension, so the convolutions work over every single picture separately. After the encoder, the dimensions are restored to utilize the sequence capabilities of a Transformer (cf. Fig. 7). The Transformer output vector is used to create the property map using 2D transposed convolutions. Summarizing, the model has an encoder – Transformer – decoder structure.
The last approach emphasizes the temporal aspect by using a Convolutional Long Short Term Memory (ConvLSTM) architecture [24]. In contrast to regular LSTMs, ConvLSTMs work over sequences of two-dimensional matrices, instead of one-dimensional vectors, which makes them suitable for our task.
We improved all models compared to [6] by using leaky ReLU instead of ordinary ReLU activation functions in the image reconstruction part and by adding batch normalization to the network. These changes resulted in faster and more stable training.
4 Evaluation
Based on Stieber et al.’s work [6], which involved using networks trained on simulated data for inference on real data, we wanted to investigate the following research questions:
-
1.
How do the network architectures proposed in Sect. 3.3 compare against each other?
-
2.
Are the real-world samples enough to learn the task by themselves and how can simulation data help?
-
3.
How much performance can be added by retraining neural networks with real data that have been pre-trained on simulated data?
-
4.
How does the simulation fidelity affect the output of the trained models?
4.1 Metrics
But first, we had to select a set of suitable metrics apart from the “expert eye” thatwere used on the output images. Finding a statistic that accurately reflects observed performance proved to be more challenging than anticipated because, for example, pixel wise accuracy frequently emphasizes “blurry” forecasts. Since a significant section of the property maps had nearly similar base properties, accuracy alone was not enough. Therefore, we permitted \(\varepsilon \)-tolerances to continue classifying specific pixels as correct. This \(\varepsilon \) between prediction and label was manually determined to be 0.03. Values above 3 % reduce the metric’s sensibility, whereas values below 3 % increase it. Aside from that, intersection over union (IOU) was our method. This metric is concerned with the size and position of the introduced patch, which is the extent of the observed variation in the attributes. To calculate it, we introduced certain thresholds for every property, based on the label, for instance: for \(V_f\), within the RealPermLarge dataset (from 2021), the lower bound would be 0.38 whereas the upper bound is 0.41. As listed in Table 2, \(V_f\) was designed to be 0.4, but manual handling of the textile leads to this (measured) spread between 0.38 and 0.41. These values stem from inspecting the overall dataset for one property (here \(V_f\)), binning it into 100 between 0 and 1, and taking the bins with the most entries as the bins that represent the most of the textile. Thus, for \(V_f\) in this case, the border values are 0.38 an 0.41. Afterward, a binary image is made with pixels falling into the desired window of values and values out of that window, i.e. black pixels would be all values \(>0.41\) and \(<0.38\) and white pixels would be the rest. Then the IOU is calculated according to Eq. 4, with this binary image and a binary version of the label.
The IOU metric is calculated for all three different properties independently and we also produce the mean of the three values. For accuracy, we also calculate both the three independent values and one of all channels together. In most tables in the following, we show the aggregated metrics of the values but we also added more extensive evaluations with all independent metrics to Sect. 1 (appendix).
4.2 Results
How do the network architectures proposed in Section 3.3compare against each other?
To start with the first research question, we show the resulting metrics from all three proposed architectures on the SimAnisoChannel data set in Table 4. It shows that the ConvLSTM yields the best results on the simulation test set in terms of both accuracy and IOU, albeit at the longest training duration.
Are the real-world samples enough to learn the task by themselves and how can simulation data help?
Table 5 shows that the Transformer outperforms the ConvLSTM when only using real data as a test set and when using only 10 real samples for re-training, which answers research question 2. These were sampled from the RealPermSmall and RealPermLarge datasets (cf. Section 2.2) to have a good starting point of what only few samples can do. For the retraining step, no weights were frozen and the learning rate was kept constant at 1e-4. It is also important to state that the Transformer trains much faster than the ConvLSTM and thus yields results more promptly during development and evaluation (cf. Table 4).
How much performance can be added by retraining neural networks with real data that have been pre-trained on simulated data?
From this point, we investigated, how much the best model – the retrained Transformer benefits from additional real data for training. This is a particularly interesting question since the question before implementing data-driven models for specialized use cases often is: “How much real data are you going to need?” We address this question by using our augmented training data set of 288 injections and training the Transformer model both from scratch and from a checkpoint stemming from the training on the simulated data. An overview of how the models benefit from pre-training is given in Table 6. The results can also be seen in Fig. 11. A possible explanation is that when the Transformer is trained from scratch using only the real data, it is only able to predict dry spot presence, ignoring the offset to left or right and the orientation of the patch. However, it can differentiate between dry spot locations in the Y direction. The number of samples has seemingly no impact on the overall performance; the metrics are erratic and stay on a certain level. When the training is started with a sim-data-only checkpoint, the performance is much better: predictions of orientation, location, and extent are much more accurate which is also reflected by the metrics. Additionally, the performance increases almost linearly with the number of real data samples used.
When using different networks on different amounts of data, we take into account that the Transformer has many parameters and thus may show underwhelming performance on a data set as small as the real-world data set we use. Therefore we utilize the CNN with its much smaller parameter number as a second baseline for real-data-only training. In Fig. 11 we compare the CNN and the Transformer regarding real-data-only learning and transfer learning capabilities. The figure shows that while the parameter-wise smaller CNN outperforms the bigger Transformer accuracy-wise, they are on par in terms of IOU. But when leveraging the simulation data via pre-training, the Transformer outperforms the CNN by far.
The overall best metrics are hit at a data set size of 240: IOU 0.5031 (up 0.12 from 10 samples) and Accuracy 95.929 % (up 0.6 % from 10 samples). This answers research question 3, one example of very well defined property maps is shown in Fig. 8 and another in Fig. 9, with less perfect outcomes. The latter occurs much more seldom than the former, but we wanted to include imperfect outcomes in the evaluation as well.
How does the simulation fidelity affect the output of the trained models ( SA vs SAC )?
To investigate research question 4, on how the quality of the simulation influences the overall performance of the Sim-to-Real Model, we took the best-performing model, the Transformer, and tested it on a lower-quality simulation data set. In Table 3, and the paragraph on anisotropy (2.3.2), we explain why using anisotropic textile features changes the phenomenology of flow fronts drastically and why we introduced channels to simulate race-tracking. To test our model on a dataset that is farther away from showing realistic phenomena, we tested the Transformer on a dataset without channels for race-tracking. The comparison can be made in Table 7. The accuracies are all over 90 %, with better values for the transfer-learning models each, and the models pre-trained on the more elaborate SimAnisoChannel data are each better than their SimAniso counterpart. When inspecting the IOU values, it becomes obvious, that the SimAnisoChannel data models perform better than the SimAniso models. This difference can also be observed in Fig. 10. While both transfer learning models show better results than the models trained on simulation data only, there are obvious differences in performance when observing “R1”: The Transfer SA (T/SA) model produces a misaligned patch compared to the Transfer SAC (T/SAC) model. When inspecting the models that were only trained on simulation data and have never seen real data in training, the differences are even more apparent: In “R2”, the No Transfer SA (NT/SA) model produces strong artifacts whereas the No T/SAC model shows light artifacts and produces the patch in the almost correct size and position. These excerpts from the dataset reinforce the claim taken from the metrics, that better simulation datasets ease the transfer learning process from simulation to reality (Fig. 11).
5 Discussion and future work
In-mold sensor data would be necessary to apply the results of this paper to an industrial setting [5]. A modular approach could first take sensor data and map them to flow front images and then feed those outputs directly into the models presented here. Alternatively, the mapping from sensor data to permeability deviation maps could be learned end-to-end, given a small dataset of respective real data.
6 Conclusion
We set out to investigate how machine learning can be used to infer material properties during the RTM process. Generating huge amounts of real data for the RTM process is time-consuming, expensive, and therefore undesirable. Instead, we chose to leverage sim-to-real learning, by generating large amounts of simulated RTM data, and only a small amount of real data. We trained three different models, a simple Conv2D net, a ConvLSTM, and a novel Encoder-Transformer-Decoder and evaluated their performance. Our research shows that leveraging simulated data for pretraining can greatly improve performance compared to a model trained solely on a small amount of real data. Further, we investigated how little data is truly needed to have a real-world improvement. We found that even as little as 10 samples can be enough if the data is generated carefully with respect to expert knowledge. To add a little more detail, using 10 real samples with a network that was pre-trained with a well-crafted simulation, we gain 25 percent points in accuracy on a non-pretrained network. In terms of IOU, we gain 0.20 points. The difference between two data sets based on better or worse simulations is 1.4 percent points in accuracy, from 93.9 to 95.3 and 0.08 in IOU, from 0.30 to 0.38. That shows that only small amounts of data can be used when a simulation for pretraining is available and better simulations leverage the results even further. Additionally, we showed how the quality of the simulated data can impact model performance. Our results suggest that keeping the domain shift between real and simulated as small as possible can help to improve the quality of the predictions.
References
Babb DA, Richey WF, Clement K, Peterson ER, Kennedy AP, Jezic Z et al inventors; Dow Chemical Co, assignee. Resin transfer molding process for composites. https://patents.google.com/patent/US5730922A/en
Tifkitsis KI, Skordos AA (2020) Real time uncertainty estimation in filling stage of resin transfer molding process. Polymer Composites 41(12):5387–5402. https://doi.org/10.1002/pc.25803
Stieber S, Schröter N, Schiendorfer A, Hoffmann A, Reif W (2021) FlowFrontNet: Improving carbon composite manufacturing with CNNs. In: ECML PKDD. Springer International Publishing pp 411–426. https://link.springer.com/10.1007/978-3-030-67667-4_25
Achzet M, Schlech T, Linscheid FF, Faber J, Sause MGR (2022) Ultraschallbasierte prozessüberwachung am beispiel eines T-RTM prozesses. In: DGZfP-Jahrestagung pp 1–11
Stieber S, Hoffmann A, Schiendorfer A, Reif W, Beyrle M, Faber J et al (2020) Towards real-time process monitoring and machine learning for manufacturing composite structures. In: IEEE Symposium on Emerging Technologies and Factory Automation, ETFA. vol. 2020-Sept pp 1455–1458
Stieber S, Schroter N, Fauster E, Schiendorfer A, Reif W (2021) PermeabilityNets: Comparing neural network architectures on a sequence-to-instance task in cfrp manufacturing. In: 2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA). IEEE pp 694–697. https://ieeexplore.ieee.org/document/9680220/
Darcy HPG (1856) Les fontaines publiques de la ville de Dijon. V. Dalamont
Wei BJ, Chang YS, Yao Y, Fang J (2016) Online estimation and monitoring of local permeability in resin transfer molding. Polymer Composites 4;37(4) pp 1249–1258. https://doi.org/10.1002/pc.23290
Matveev M, Endruweit A, Long A, Iglesias M, Tretyakov M (2021) Part I: Bayesian inversion algorithm for estimation local variations in permeability and porosity of reinforcements using experimental data. Composites Part A 143(2021):1–22. https://doi.org/10.1016/j.compositesa.2021.106323
González C, Fernández-León J (2020) A machine learning model to detect flow disturbances during manufacturing of composites by liquid moulding. J Composit Sci 4(2):71. https://doi.org/10.3390/jcs4020071
Caglar B, Broggi G, Ali MA, Orgéas L, Michaud V (2022) Deep learning accelerated prediction of the permeability of fibrous microstructures. Composites Part A: Applied Science and Manufacturing 158 (Feb) https://doi.org/10.1016/j.compositesa.2022.106973
Weiss K, Khoshgoftaar TM, Wang D (2016) A survey of transfer learning. J Big Data 12;3(1):9. https://doi.org/10.1186/s40537-016-0043-6
Stieber S (2018) Transfer learning for optimization of carbon fiber reinforced polymer production. Organic Computing: Doctoral Dissertation Colloquium 2018:61–74
Hofer S, Bekris K, Handa A, Gamboa JC, Mozifian M, Golemo F et al (2021) Sim2Real in robotics and automation: applications and challenges. IEEE Transactions on Automation Science and Engineering 4;18(2):398–400. https://doi.org/10.1109/TASE.2021.3064065
Fauster E : personal communication
Chaloupka A : personal communication
Konstantopoulos S, Fauster E, Schledjewski R (2014) Monitoring the production of FRP composites: A review of in-line sensing methods. Express Polym Lett 8(11):823–840. https://doi.org/10.3144/expresspolymlett.2014.84
May D, Aktas A, Advani SG, Berg DC, Endruweit A, Fauster E et al (2019) In-plane permeability characterization of engineering textiles based on radial flow experiments: A benchmark exercise. Composites Part A 121:100–114. https://doi.org/10.1016/j.compositesa.2019.03.006
Fauster E, Berg DC, Abliz D, Grössing H, Meiners D, Ziegmann G et al (2019) Image processing and data evaluation algorithms for reproducible optical in-plane permeability characterization by radial flow experiments. J Compos Mater 53(1):45–63. https://doi.org/10.1177/0021998318780209
Fauster E, Berg DC, May D, Blößl Y, Schledjewski R (2018) Robust evaluation of flow front data for in-plane permeability characterization by radial flow experiments. Advanced Manufacturing: Polymer and Composites Science 4(1):24–40. https://doi.org/10.1080/20550340.2018.1439688
Bickerton S, Advani SG (1999) Characterization and modeling of race-tracking in liquidcomposite molding processes. Compos Sci Technol 11;59(15) pp 2215–2229. https://doi.org/10.1016/S0266-3538(99)00077-9
Weiler M, Cesa G (2019) General E(2) - Equivariant steerable CNNs. Advances in Neural Information Processing Systems 32(2)
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN et al (2017) Attention is all you need. In: Advances in Neural Information Processing Systems pp 5999–6009
Shi X, Chen Z, Wang H, Yeung DY, Wong WK, Woo WC (2015) Convolutional LSTM network: A machine learning approach for precipitation nowcasting. In: Advances in Neural Information Processing Systems pp 802–810 (Jan)
Funding
Simon Stieber was funded by the Bavarian State Ministry of Economic Affairs and Media, Energy and Technology (StMWi) within the project MAI CC4 CosiMo (Composites for sustainable Mobility). The experiments conducted in the RealPermLarge experiment campaign at the Processing of Composites Group (Montanuniversität Leoben) in 2021 were also financed by this project. All other authors declare no conflicts of interest. Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Contributions
All authors contributed to the study conception and design, the main conceptual idea was formed by Simon Stieber advised by Wolfgang Reif. Material preparation, data collection were performed by Ewald Fauster and Marcel Bender. Preprocessing and analysis of experimental data was done by Simon Stieber and Ewald Fauster. Simulation design, execution and data collection was done by Simon Stieber. The main machine learning experiment design was executed by Simon Stieber and Alexander Schiendorfer. Machine learning model and metric design and code implementation was done by Niklas Schröter and Simon Stieber. The initial draft was mainly written by Simon Stieber and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
The authors have no relevant financial or non-financial interests to disclose
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Appendix
1.1 Detailed results
1.2 Further information on real data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Stieber, S., Schröter, N., Fauster, E. et al. Inferring material properties from FRP processes via sim-to-real learning. Int J Adv Manuf Technol 128, 1517–1533 (2023). https://doi.org/10.1007/s00170-023-11509-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00170-023-11509-8