Abstract
A sequential nonlinear multiscale method for the simulation of elastic metamaterials subject to large deformations and instabilities is proposed. For the finite strain homogenization of cubic beam lattice unit cells, a stochastic perturbation approach is applied to induce buckling. Then, three variants of anisotropic effective constitutive models built upon artificial neural networks are trained on the homogenization data and investigated: one is hyperelastic and fulfills the material symmetry conditions by construction, while the other two are hyperelastic and elastic, respectively, and approximate the material symmetry through data augmentation based on strain energy densities and stresses. Finally, macroscopic nonlinear finite element simulations are conducted and compared to fully resolved simulations of a lattice structure. The good agreement between both approaches in tension and compression scenarios shows that the sequential multiscale approach based on anisotropic constitutive models can accurately reproduce the highly nonlinear behavior of bucklingdriven 3D metamaterials at lesser computational effort.
Introduction
Recent progress in Additive Manufacturing (AM) and microfabrication technologies has opened up a whole new set of possibilities for the realization of metamaterials, i.e., artificial microstructures that exhibit properties and behaviors extending beyond the capabilities of ordinary materials [1]. In the context of mechanical metamaterials [2, 3], besides characteristics such as low mass density and high stiffness or strength [4, 5], in recent years also elastic flexibility and tailored buckling [6, 7], pentamode, auxetic and chiral effects [8,9,10], negative thermal expansion [11], or programmable behavior such as deployability and selfassembly [12,13,14] have been explored. Such mechanical metamaterials are realized by cellular microstructures, often beam lattices, which can be fabricated not only from stiff metals, but also from flexible, elastically deformable and multifunctional materials such as polymers, elastomers, or hydrogels. These metamaterials pave a pathway for a wide variety of multifunctional applications, such as energy harvesting and storage [15], biomedics and medical devices [16], soft robotics [17], and many more [3].
The mechanical modeling of such flexible beam lattices is particularly challenging, since their microstructure and soft material constituents allow for geometrically large elastic deformations that are accompanied by instabilities due to strut buckling [7, 18], resulting in a highly nonlinear and anisotropic effective behavior. Fullscale simulation using 3D continuum finite elements (FE) is generally not feasible and also (geometrically) nonlinear 3D beam element discretizations can become computationally expensive and only feasible for lattices with a moderate amount of cells [19]. Furthermore, to resolve instabilities and ensure convergence of iterative solution schemes, nonlinear postbuckling analysis methods, such as bucklingmode perturbations [18, 20], are required, which incur further computational effort and complication.
Multiscale simulation approaches are generally preferred for microstructured objects and metamaterials, which have already been widely applied to lattice structures in the linear, infinitesimal strain regime, also for multiscale design and topology optimization [21,22,23,24]. Such approaches require that the microstructure is periodic and sufficiently small compared to the macroscale, i.e., that scales are separated. Then, the effective behavior for the microstructure can be homogenized from a representative unit cell (RUC) to relate the stressstrain response from microscale to macroscale, where the structure can then be regarded as a continuum [25].
In the context of nonlinear problems, the microtomacro transition can either be performed concurrently, using FE^{2} methods, or sequentially, using homogenized effective constitutive models [26, 27]. The concurrent approach has already been successfully applied to beam lattice structures in [20, 28], even including highergradient and curvature effects. However, it is computationally expensive, since solving a macroscale problem requires the solution of many microscale problems—one for each FE quadrature point in each iteration. Furthermore, in [20] only relatively small compressive strain ranges were investigated for bucklingdominated 3D lattices (\(0.6\)% for a BCC structure and \(5\)% for an Octet structure), which indicates that the concurrent multiscale approach also suffers from stability issues. For sequential nonlinear multiscale simulation, the main challenge is to formulate and identify an effective constitutive model, which—in the case of elastic beam lattices—should be hyperelastic for finite elastic deformations, reflect the anisotropy/material symmetry of the microstructure, and closely represent the actual microstructural behavior, which is highly nonlinear due to differences in stretch, bending, or bucklingdominated deformations. For simpler 2D and moderately nonlinear 3D lattices, analytical constitutive models were developed in [29,30,31,32]. However, for bucklingdriven 3D lattices, analytically formulated models do not provide the necessary flexibility [33], which suggests the use of datadriven methods. In recent years, many such datadriven constitutive modeling approaches have been presented for anisotropic, hyperelastic, and inelastic material behavior, e.g., using reduced basis models [34, 35], clustering techniques such as selfconsistent clustering analysis [36,37,38], polynomial or spline interpolation [39,40,41], or machine mearning (ML) with artificial neural networks (ANN) [42,43,44,45,46,47,48,49,50]. To the best of our knowledge, only in [51] ANNbased constitutive models have been applied to the multiscale simulation of highly geometrically nonlinear, bucklingprone microstructures. However, the work is restricted to 2D applications, material symmetry is not considered in the material model formulation, and the accuracy of the effective constitutive model itself is not assessed.
Generally, to ensure both accuracy and efficiency, datadriven constitutive models should:

(1)
Be flexible in terms of their formulation so that highly nonlinear behavior can be represented;

(2)
Incorporate as much structure of the problem as possible to ensure generalization capabilities (i.e., they should be physicsinformed, e.g., by being formulated as hyperelastic, objective and respecting material symmetry);

(3)
Require only sparse calibration data or involve smart data acquisition strategies to save computational or experimental resources; and

(4)
Allow efficient evaluation to save computation time and effort when employed in (multiscale) finite element simulations.
Thus, the goal of this work is to develop a nonlinear sequential multiscale method for the simulation of the highly nonlinear finite deformation behavior of elastic 3D beam lattices using accurate and efficient, yet flexible MLbased constitutive models based on ANNs, inspired by our previous work [33]. On the microscale, the RUC of a bodycentered cubic (BCC) lattice is modelled as geometrically exact 3D beam structure with a linear elastic constitutive model, which results in a nonlinear, hyperelastic effective behavior. However, as mentioned above, these microstructures are prone to instabilities due to beam buckling, which—in practice—occur due to imperfections during manufacturing. For nonlinear postbuckling analysis, here only geometrical perturbations of the beam centerline are considered, while material and crosssection shape are assumed as ideal, in contrast to, e.g. [52, 53]. While usually eigenforms of the structure are prescribed as perturbations, see [18, 20, 53,54,55], here a purely stochastic approach is pursued. As we will demonstrate, this not only reduces the manual effort of bucking eigenvalue analysis, but also significantly reduces parasitic stresses in the homogenized responses, cf. [18]. The homogenized effective energy potentials and stresses obtained for various applied deformation gradients are then used as training and validation data for ANNbased constitutive models. To assesses their accuracy, generalization capabilities and efficiency in terms of required training data and evaluation times, three types of material models are compared: a hyperelastic model fulfilling exactly the cubic anisotropy, a hyperelastic model approximating the cubic anisotropy through data augmentation, cf. [44], and an elastic, stressbased model also approximating the cubic anisotropy through data augmentation. Hereby, it is shortly remarked that [44] considers data augmentation only in terms of the values of the elastic energy density, whereas the present work extends this approach also to stress values. In terms of material symmetry, the approaches of this work rely on the group symmetrization of [33] and the approximation through data augmentation. These approaches are applicable to arbitrary symmetry groups without any of the restrictions of invariantbased approaches relying on structural tensors and humandriven decisions on which invariants to consider. Finally, the hyperelastic ANNbased constitutive models are implemented into a nonlinear FEM so that the sequential multiscale simulation can be executed and verified in terms of convergence behavior and comparison with fullscale simulation of BCC lattice structures. Altogether, we summarize the main contributions of this work as:

Development, ANNbased implementation and validation of a sequential multiscale method for highly nonlinear, bucklingdriven 3D beam lattices.

Evaluation of different anisotropic, ANNbased effective constitutive models for finite deformations in terms of accuracy and computation times. For learning the material symmetry, data augmentation approaches for strain energy densities and stresses are shown to increase the network training times, but significantly reduce their inference times.

Evaluation of efficient data generation strategies for the homogenization of microstructural instabilities using a stochastic perturbation approach, avoiding further computational expenses of eigenformbased approaches.
The manuscript is organized as follows. The description of the framework for the simulation and homogenization of the microscale RUCs is presented in Sect. 2, including the stochastic perturbation approach and its verification. Then, the constitutive modeling framework using ANNs is established in Sect. 3, the models are trained with the microscale data and compared against each other in terms of accuracy and computational performance. In Sect. 4, the nonlinear FE multiscale simulations are performed with the calibrated models and compared to fullscale simulations of lattice structures. Finally, the manuscript concludes with a summary and outlook in Sect. 5.
Microstructure modeling and simulation
Beam modeling of lattice unit cells
In this work, beam lattices are mechanically modeled as geometrically exact, sheardeformable 3D beam structures [56, 57]. This beam model captures large deformations and rotations of the beam centerline \(\varvec{r}:[0,L]\rightarrow \mathbb {R}^3\) and its crosssection, which is described by an orthonormal field \(\varvec{R}:[0,L]\rightarrow SO(3)\). However, due to the use of a linear elastic constitutive model that is characterized by the Young’s modulus E and Poisson’s ration \(\nu \) of an isotropic material, it is only applicable to moderate strains. The balance equations of linear and angular momentum read as:
where \(\varvec{f}\) and \(\varvec{m}\) are the internal forces and moments, which are computed in terms of the kinematic unknowns \(\varvec{r}\) and \(\varvec{R}\), \(\bar{\varvec{f}}\) and \(\bar{\varvec{m}}\) are externally applied distributed forces and moments, which are always zero here for homogenization purposes, and \('=d/ds\) denotes the arclength derivative. As boundary conditions (BCs) at \(s=0\) and \(s=L\), either centerline positions and rotations (essential / Dirichlet BC) or forces and moments (natural / Neumann BC) must be prescribed.
For solving the boundary value problems (BVPs), here the beam model is numerically discretized by the isogeometric collocation method from [58]. First, the centerline curve is parameterized as a Bspline or nonuniform rational Bspline (NURBS) curve:
where \(N_i:[0,L]\rightarrow \mathbb {R}\) are \(C^{p1}\)continuous NURBS basis functions of a certain degree \(p>0\) with \(\ell =np\) internal elements and \(\varvec{r}_i\in \mathbb {R}^3\) are the n corresponding control points—similar to shape functions and nodal displacements in classical FEM. For more details on NURBS and isogeometric methods, see [59] and [60], respectively. Here the crosssection rotations \(\varvec{R}_h\) are parameterized in terms of unit quaternions and then also discretized as NURBS curves. Substituting these discretizations of the kinematic unknowns into the balance equations (1) and evaluating them at n suitable collocation points then yields a nonlinear system of equations, which can be solved using Newton’s method in order to determine the deformed configuration of the beam, see [58] for details.
Beam structures consisting of several beams can be modeled by aggregating the discertizations and nonlinear systems of the individual beams into one. Joints, where two or more beam endpoints meet at a node and are coupled together, are here assumed to be rigid, i.e., the centerline displacements (and thus the positions in the current configuration) and the changes of the crosssection orientations have to be equal for all k coupled endpoints:
Here, we use superscript indices to indicate quantities that relate to the endpoints (nodes) of different beams, i.e., \(r^j_h\) is the centerline position of the jth beam of the joint, which is given by either the first or last control point of the discretization of that beam. Furthermore, forces and moments have to be transferred at the joints, i.e., the balances of forces and moments have to be fulfilled:
where \(I^j=1\) if \(s=0\) for the beam endpoint, i.e., \(\varvec{r}^j=\varvec{r}^j_h(0)=\varvec{r}^j_1\), or \(I^j=1\) if \(s=L\), i.e., \(\varvec{r}^j=\varvec{r}^j_h(L)=\varvec{r}^j_n\). These constraints must also be implemented into the resulting nonlinear system, see [58] for details.
Stochastic perturbation approach for nonlinear postbuckling analysis
Since beams and beam structures are prone to structural instabilities such as buckling, nonlinear solution schemes such as NewtonRaphson and arclength methods, in which converged solutions are computed iteratively for incrementally increasing applied loads or displacements, often suffer from convergence problems at the onset of buckling. Nonlinear postbuckling methods try to overcome this issue typically by applying some perturbation to the BVP, such that the instability and ambiguity of the solution are overcome. These perturbations, which are typically applied to the undeformed geometric configuration in the context of numerical methods, can also be associated with uncertainties in geometric parameters, e.g., stemming from manufacturing inaccuracies and tolerances.
Here, a stochastic, geometric perturbation approach is employed, in which the centerline curves of the initially perfectly straight struts of a beam lattice structure are perturbed. Since the struts are initially straight, they are simply defined by their two endpoints and parameterized as linear line segments, i.e., “NURBS” with \(p=1,\ell =1,n=2\). Now, order elevation to increase the degree of the NURBS to \(p>1\) (prefinement) and subdivison into \(\ell >1\) equidistant elements (hrefinement) are performed, yielding a \(C^{p1}\)continuous NURBS curve with \(n=p+\ell \) control points, which still exactly represent a straight line, compare (2) and see [59] for details.
Now, the struts are perturbed normal to the tangential direction \(\varvec{r}'\) of the beam, i.e., within the crosssection plane. Therefore only two parameters are needed to perturb each individual control point \(\varvec{r}_i\): the magnitude \(a_i\in \mathbb {R}\) and the angle \(\psi _i\in [\frac{\pi }{2},+\frac{\pi }{2})\) of the perturbation (around the axis defined by \(\varvec{d}^3=\varvec{r}'\) and starting from the \(\varvec{d}^1\)axis, as defined by the orthonormal frame \(\varvec{R}=(\varvec{d}^1,\varvec{d}^2,\varvec{d}^3)\)). From a practical point of view, the magnitude is assumed to be normally distributed around a mean of 0 with a variance of \(\sigma ^2\), i.e., \(a_i\in \mathcal {N}(0,\sigma ^2)\), and the angle uniformly distributed, i.e., \(\psi _i\in \mathcal {U}(\frac{\pi }{2},+\frac{\pi }{2})\), where \(+\frac{\pi }{2}\) is excluded from the distribution as opposed to a uniform distribution as in literature [61]. These uncorrelated random perturbations are applied to the inner (nonend) points of each beam, yielding the perturbed control points as:
However, for boundary points that are connected to other beams, as it is always the case for the nodes of a periodic lattice structure, the tangential direction of the beam is ambiguous and it must be ensured that \(\varvec{r}^1=\cdots =\varvec{r}^k\) holds, similar to (3). Thus, the perturbation:
with three separate perturbations \(a_{x,y,z}\in \mathcal {N}(0,\sigma ^2)\), is applied to the node, i.e., for each beam endpoint of the joint, \(j=1,\ldots ,k\). For the RUC of a lattice microstructure, this applies not only to nodes inside the cell, but also to periodically connected nodes on opposing boundary faces and edges, which have to be identified for that purpose.
The stochastic perturbation procedure is illustrated in Fig. 1, which shows the centerlines at a joint with 4 beams in the unperturbed configuration and in 3 random perturbations. In summary, the perturbations are controlled by three parameters: the NURBS refinement given by p and \(\ell \), as well as the magnitude given in terms of the variance \(\sigma ^2\).
Microstructural homogenization framework
The present work aims at the multiscale simulation of the mechanical behavior of elastic beam lattices subject to finite deformations. For this purpose, a separation of scales between the macroscopic lattice structure and its periodic microstructure are assumed. Furthermore, the effective constitutive behavior of the microstructure is characterized by a representative unit cell (RUC), which is given by a domain \(\varOmega _0\subset \mathbb {R}^3\), and assumed to be hyperelastic, i.e., an effective elastic energy density W exists. For any quantity defined on the microscopic continuum \(\phi :\varOmega _0\rightarrow \mathbb {R}\), or in the same way also for tensorvalued quantities, averages over the reference configuration of the RUC are denoted as:
where \(V_0=\varOmega _0\) is the volume of the RUC. Hereby, a local elastic energy density \(W^\mu \) depending on the local deformation gradient \(\varvec{F}^\mu \) within the RUC is assumed as known. The local first PiolaKirchhoff stress tensor is defined as \(\varvec{P}^\mu = {\partial W^\mu }/{\partial \varvec{F}^\mu }\). The effective deformation gradient \(\varvec{F}\) is defined as:
The effective elastic energy density W, also referred to as the effective potential, is defined as:
The effective potential W depends implicitly on the effective deformation gradient \(\varvec{F}\) through the solution of the microscopic boundary value problem of the RUC, i.e., \(W = W(\varvec{F})\). Further, the effective potential W fully describes the effective constitutive behavior of the hyperelastic RUC. Hereby, the effective stress is defined as:
To fulfill the HillMandel condition [26]:
within this work affine and periodic displacement boundary conditions are used to impose an effective \(\varvec{F}\) on the RUC, which implies:
For the homogenization of the effective behavior of a beam lattice microstructure, each beam of the RUC is first perturbed as described in Sect. 2.2. For instance, Fig. 2 shows a bodycentered cubic beam lattice with perturbed struts. Then, the RUC is subjected to several effective deformation modes, which are specified in terms of a displacement gradient \(\varvec{H}=\varvec{F}\varvec{I}\), which is applied to the boundary nodes of the RUC either as an affine displacement (Dirichlet, “d”) or periodic displacement (“p”) BC:
Here, \(\varvec{r}^j\in \mathbb {R}^3,\, j\in \mathcal {B}\) are the reference centerline positions of each node on the boundary of the RUC, where \(\mathcal {B}\) is the set of all boundary nodes, or \(\varvec{r}^{j_+},\varvec{r}^{j_}\in \mathbb {R}^3\), \( \{{j_},{j_+}\}\in \mathcal {B}^\pm \) are the pairs of nodes on opposite boundaries, respectively, and \(\varvec{u}^j,\varvec{u}^{j_+},\varvec{u}^{j_}\in \mathbb {R}^3\) their corresponding displacements. To realize scenarios that can correspond to practical, experimentally conductable mechanical tests, e.g., uniaxial tension with prescribed deformation in the \(x_1\)direction, unconstrained displacements in \(x_2,x_3\)direction and only \(P_{11}\ne 0\), “d” and “p” BC may vary componentwise, see Table 1. The displacement gradients are applied in a stepwise manner, i.e., \(\varvec{H}=\varvec{H}(\lambda )\), where the factor \(\lambda \) is increased in steps of 0.003 until the desired end values of \(\lambda _{stop}=\pm 0.3\) (for tension and compression) are reached, resulting in 201 data points for each mode and perturbation. It should be noted also that the rotational BC for the nodes are always periodic boundary condition (PBC), i.e., all connecting corners need to rotate in the same way independent of the applied displacement gradient.
For discrete microstructures such as beam lattices, the resulting effective strain energy densities \(W(\varvec{F})\) and stresses \(\varvec{P}(\varvec{F})\) resulting from these simulations can in fact not be computed as volume averages via (9) and (12), since there is not microscopic continuum. However, the effective strain energy density \(W(\varvec{F})\) can be computed as the sum of all strain energies of the beams of the RUC, divided by the volume \(V_0\) of the RUC. Furthermore, by converting the volume averages to boundary integrals, the averaged first PiolaKirchhoff stress tensor is computed as:
where \(\varvec{f}^j\) are the internal forces at the node \(\varvec{r}^j\), cf. (1) and (4).
Numerical results and verification
The numerical framework for the finite strain homogenization of the effective behavior of beam lattice microstructures based on nonlinear postbuckling analysis using a stochastic perturbation approach is now being applied to a BCC lattice RUC. Here, and for the reminder of the manuscript, the size of the microstructure is taken as \(L=10\) mm, i.e., \(V_0=10^3 \hbox { mm}^{3}\), the aspect ratio is \(a=d/L=0.1\), i.e., the beam diameters are \(d=2r=1\) mm, and the material parameters are \(E=0.53\) MPa and \(\nu =0.45\). Furthermore, the stochastic perturbations are applied with a spline refinement of \(p=2,\ell =8\) and a variance \(\sigma ^2=0.1r\). These parameters were heuristically determined such that impact of perturbations on the results at small strains is minimized, while reliable convergence is achieved in the (post) buckling regimes. Furthermore, the following verifications were performed to ensure the validity of the approach:
Consistency checks.
To assess the impact of instabilities and perturbations on the numerical results, first a simple consistency check between the numerically computed \(W(\varvec{F})\) and \(\varvec{P}(\varvec{F})\) is performed. Consider the implicit dependency with respect to the process parameter \(\lambda \) and expand \(W(\lambda )=W(\varvec{F}(\lambda ))\) around some \(\hat{\lambda }\) as follows:
We now focus our attention on the tangent \(t(\hat{\lambda }) = {\partial W}/{\partial \lambda }(\hat{\lambda })\) and apply the chain rule:
In view of the discrete simulation results, the identities in (16) motivate the introduction of the following quantities for the secondorder finite difference approximation of \(t(\lambda _n) = t_n\) at an incremental simulation step n:
where \(W_n=W(\lambda _n)\), etc. The tangent approximations \(t_n^{W}\) and \(t_n^{P}\) can be computed independently based on the results for \(W, \varvec{F}\) and \(\varvec{P}\) at steps \(\lambda _n\). The consistency of the simulation results can then be evaluated based on the relative error:
The present work considers the results of a simulation campaign for \(\varvec{H}(\lambda )\) as sufficiently consistent and acceptable if in a maximum of 6 out of the 201 simulation steps \(e_n\) surpasses \(10\%\), 3 for \(\lambda < 0\) and 3 for \(\lambda > 0\). Otherwise, a new random perturbation is applied an the simulation is rerun. All numerical results shown in the following passed this consistency check.
Buckling bands and outliers
During compression on multiple axis, e.g., during the volumetric mode, it occurs that simulation results for some “outlier” perturbations are stiffer compared to the majority of other perturbations and buckle at higher stresses, see Fig. 3 (top). It can be seen that while the majority of runs follows roughly the same path, i.e., is on the same “band”, there are some simulations following a somewhat stiffer path and buckling only later, i.e., at a higher magnitude of applied strain. From this second band, some simulations jump back onto the first band, while the rest remains on this stiffer band. Overall, it seems that this stiffer band represents an unstable equilibrium path and thus at higher compression many solutions fall back to the softer, stable band. In order to eliminate these outliers, all simulation runs, where the trace of \(\varvec{P}\) is greater than 1.25 times the average of all runs at \(\lambda =0.018\), were discarded, see Fig. 3 (bottom). From the 250 simulations with random perturbations executed, we obtained 206 perturbations on the lower band, i.e., with a stable equilibrium path according to the aforementioned criterion. This results in a rejection of 44 perturbations and a rejection rate of \(17.6\%\).
Comparison with reference data
The data obtained by the stochastic perturbation approach is now compared to reference data from [18], where deterministic perturbations were applied from a preceding computation of buckling eigenmodes of the RUC. Figures 4, 5, and 6 show the reference data, all stochastic perturbations (206 for each mode after discarding outliers), and their mean values for the components of \(\varvec{P}\) over \(\lambda \) for uniaxial, biaxial, and shearcombined deformation modes, respectively. Due to the applied boundary conditions and symmetries of the structure that result in a cubic anisotropy of the effective behavior, only some stress components should be nonzero and some of those should be equal, e.g., \(P_{11}=P_{22}\ne 0\) for biaxial deformation. As can be seen in the figures, the perturbation approach with a fixed, deterministic magnitude and shape introduces “parasitic” nonzero stress components, such as \(P_{12}\) and \(P_{21}\) in uniaxial compression in Fig. 4, which are fairly substantial compared to \(P_{11}\). With the stochastic approach used here, these parasitic stresses vanish (almost) in the mean values, while the paths and values of the main, nonzero stress components agree well for both approaches. This shows that the stochastic perturbation approach delivers reliable results and is able to diminish parasitic stress effects, which are undesirable for fitting effective constitutive models using the homogenized microstructural simulation data.
Effective constitutive modeling with ANNs
Basic material theory considerations
In the present work, some materialtheoretic aspects are considered for the constitutive behavior and following model formulation, namely material symmetry and objectivity.
For the lattice structures considered here, anisotropic behavior is expected corresponding to the symmetry group of the hyperelastic RUC. We, therefore, consider the standard anisotropy conditions for hyperelastic materials for the effective potential, see, e.g., [62]:
where \(Inv^+\) denotes the set of all invertible secondorder tensor with positive determinant and \(\mathrm {G}\) refers to the material symmetry group. The present work considers only finite material symmetry groups and denotes the number of elements in \(\mathrm {G}\) as \(\#(\mathrm {G})\). The material symmetry conditions for the effective potential (20) imply the following conditions on the effective first PiolaKirchhoff stress tensor:
For the present work \(\mathrm {G}\) correspond to the symmetry group of cubic materials, which for hyperelastic materials is sufficiently described by the cubic group of \(\#(\mathrm {G}) = 24\) rotation operations.
Additionally, material objectivity, see, e.g., [63],
is to be considered in the model formulation. Material objectivity can be achieved through an intermediate variable dependency on the right CauchyGreen tensor \(\varvec{C}= \varvec{F}^T\varvec{F}\) or Green’s strain tensor \(\varvec{E}= (\varvec{C} \varvec{I})/2\), i.e.:
Hereby it should be noted that the corresponding second PiolaKirchhoff stress tensor \(\varvec{S}= \varvec{F}^{1}\varvec{P}\) fulfills then:
Further materialtheoretic properties are not taken into account in this work. Thereby, it should be noted that, for instance, material stability, polyconvexity and further properties are appealing, but they lie outside of the scope of the present investigation. It should also be noted that the symmetries of the unperturbed RUC are broken by any perturbation approach used for nonlinear postbuckling analysis, see Sect. 2.2. However, since the perturbations are small and random, we still assume that the effective behavior complies to the prescribed material anisotropy.
Constitutive models based on ANNs
Formulation of constitutive models
On the one hand, the present work approaches the formulation of constitutive models based on the ideas of [33] and, on the other hand, focusing on the approximation of some materialtheoretic properties through partial data augmentation, cf. [44].
The work of [33] proposes to use machine learning and artificial neural networks at the core of the model formulation. More specifically, feedforward neural networks (FFNNs) are used for the formulation of hyperelastic and elastic models. The hyperelastic model of [33] uses a core FFNN potential \(\hat{W}^\mathrm {FFNN}(\varvec{E})\) and a group symmetrization with respect to \(\mathrm {G}\), yielding the model:
The approach (25) fulfills per definition all material symmetry conditions (20), (21) and the material objectivity condition (22). The corresponding stress derived of (25) will be denoted as:
The present work further consider two additional models in view of the approximation of properties, which may be expensive in terms of model complexity. The hyperelastic model of [33] uses a group symmetrization for the exact fulfillment of the known material symmetry. The present work investigates how well this property can be approximated by 1.) a hyperelastic model directly using a FFNN:
and 2.) an elastic model also directly using a FFNN for the second PiolaKirchhoff tensor:
The models (27) and (28), of course, do not fulfill, in general, the corresponding material symmetry conditions. But since the transformation rules are known, a simple dataaugmentation can be considered such that the direct models (27) and (28) may be able to intrinsically approximate the material symmetry.
The symmetrized model (25) is, from a computational point of view, far more expensive than the direct ones (27) and (28) due to calling \(\#(\mathrm {G})\) times the core FFNNs in the group symmetrization for every given input \(\varvec{F}\). It is therefore expected that the direct models (27) and (28) have an advantage in terms of evaluation speed.
Implementation and technical details
Implementation framework All three material models \(W^\mathsf {Wsym}(\varvec{F})\) as in (25), \(W^\mathsf {Wdir}(\varvec{F})\) as in (27) and \(\varvec{P}^\mathsf {Pdir}(\varvec{F})\) as in (28) have been implemented in Python, version 3.7.8, with the TensorFlow library, version 2.3.1 [64]. All subsequent timings correspond to a system with a Nvidia GTX 970 GPU with 4GB of GDDR5 RAM, an Intel Xeon E31231 v3 CPU and 24GB of DDR3 RAM, on which the operating system is Microsoft Windows 10 Pro N, version 20H2.
Networks From an implementation point of view, it should be noted that both potential FFNNs \(\hat{W}^\mathrm {FFNN}(\varvec{E})\) of (25) and (27) can be easily implemented with a sixdimensional input (corresponding to the six degrees of freedom of \(\varvec{E}\)) and a onedimensional output. Analogously, the stress FFNN \(\varvec{S}^\mathrm {FFNN}(\varvec{E})\) of (28) can be implemented with a sixdimensional input and output (corresponding to the six degrees of freedom of \(\varvec{E}\) and \(\varvec{S}\)). All FFNNs have been implemented using the swish activation function in all hidden layers and the identity activation function in the output layer. All resulting networks are then infinitely differentiable. The architecture of the implemented FFNNs for the three considered models is tabulated in Table 2.
Taking [33] as a reference, 3 hidden layers seem to be sufficient for the approximation of the effective constitutive behavior of the cubic beam lattice materials investigated therein, such that the present work follows this result. Hereby, the symmetrized hyperelastic model \(W^\mathsf {Wsym}\) uses in the present work 12 neurons per hidden layer. The direct hyperelastic model \(W^\mathsf {Wdir}\) considers 18 neurons per hidden layer since it needs higher internal complexity in order to approximate the material symmetry from the given data. The elastic model \(\varvec{P}^\mathsf {Pdir}\) considers 22 neurons per hidden layer due to additional complexity needed due to the loss of the hyperelasticity structure.
Vector matrix operations Further, the matrix operation \(\varvec{Q}^T\varvec{E}\varvec{Q}\) is linear in \(\varvec{E}\), such that a matrixvectoroperation on the chosen six degrees of freedom of \(\varvec{E}\) can be constructed. For instance, the vector:
can be used to carry the six degrees of freedom of \(\varvec{E}\) and the operation \(\varvec{Q}^T\varvec{E}\varvec{Q}\) can be performed as \(\underline{\underline{Q}}\underline{E}\) with the \(6 \times 6\) matrix:
Stress computation through automatic differentiation. The gradient \(\varvec{S}(\varvec{E}) = {\partial \hat{W}}/{\partial \varvec{E}}\) is relevant for the hyperelastic models \(W^\mathsf {Wsym}\) and \(W^\mathsf {Wdir}\) for the computation of stresses. Here, the automatic differentiation routines of TensorFlow have been used. The differentiation is carried out with respect to the vector \(\underline{E}\) such that a corresponding sixdimensional stress vector \(\underline{S}= {\partial \hat{W}}/{\partial \underline{E}}\) is obtained, which is then recasted as the full symmetric tensor \(\varvec{S}\) with basic tensor operations. The final stress output for the first PiolaKirchhoff stress is then computed as \(\varvec{P}= \varvec{F}\varvec{S}\).
Data augmentation and considered datasets
For any hyperelastic simulation yielding a hyperelastic dataset:
here, the group augmentation is defined as:
More explicitly, for a dataset D containing \(\#(D)\) data points \((\varvec{F}, W, \varvec{P})\), the dataset \(\mathrm {G}\star D\) contains \(\#(\mathrm {G}) \cdot \#(D)\) corresponding data points. For the investigation of cubic unit cells at hand, a dataset D is augmented 24fold. Consideration of augmented datasests for the training of the direct models (27) and (28) offers the option of approximating the known material symmetry intrinsically.
It should be shortly noted that the groupbased data augmentation \(\mathrm {G}\star D\) goes beyond the approach of [44]. In [44] only \(\varvec{F}\) and W were considered for Nickel and no illustration of the stress behavior is displayed. Therefore, in [44] it remains unclear how well a data augmentation only on \(\varvec{F}\) and W works for the approximation of the group symmetry in terms of the stress evaluation. The present investigation is concerned directly with the resulting stresses \(\varvec{P}\) of the unit cells, since in engineering the stresses are of major importance. Calibration of hyperelastic models based only on \(\varvec{F}\) and W could potentially yield models with fairly similar \(W(\varvec{F}) = W(\varvec{F}\varvec{Q})\) for some \(\varvec{Q}\in G\), but with vastly different gradients/stresses. In order to avoid this scenarios from the beginning, the data augmentation is immediately extended to \(\mathrm {G}\star D\) in order to optimize models based (1) on readily available data for \(\varvec{F}\), W and \(\varvec{P}\), and (2) all corresponding implications of the material symmetry.
In the following, the calibration dataset \(D^c\) without any group augmentation corresponds to the simulation results collected from the first five modes of Table 1, namely uniaxial, biaxial, volumetric, planar and shear, averaged over all 206 considered runs. This means that in \((\varvec{F}, W, \varvec{P})\), \(\varvec{F}\) is the average of the deformation gradient over all 206 runs for the corresponding fixed simulation step and corresponding simulation mode. For W and \(\varvec{P}\) the analogous averages are computed, which—based on the small absolute deviations visible in Figs. 4, 5 and 6—are considered as sensible. Each simulation mode has 201 simulation steps, i.e., \(D^c\) contains \(\#(D^c) = 5 \cdot 201 = 1005\) data points. The testing modes of Table 1, namely biaxial0.5, biaxial0.33 and shearcombined, denoted by the dataset \(D^t\), are excluded from the calibration dataset \(D^c\), but monitored during calibration in order to avoid overfitting. The group augmented calibration dataset \(\mathrm {G}\star D^c\) will be used only for the direct approaches with corresponding objective functions, as to be presented in the following section.
Training approach
The standard mean squared error (\(\mathrm {MSE}\)) of a model \(\mathsf {M}\in \{\mathsf {Wsym},\mathsf {Wdir},\mathsf {Pdir}\}\) with respect to a hyperelastic dataset D is considered here as a suitable objective or loss function:
The symmetrized hyperelastic model \(\mathsf {Wsym}\) is trained with \(\mathrm {MSE}(D^c)\), while the direct alternative \(\mathsf {Wdir}\) is trained with \(\mathrm {MSE}(\mathrm {G}\star D^c)\) based on the augmented calibration dataset. The elastic direct model \(\mathsf {Pdir}\) is trained with \(\mathrm {MSE}^P(\mathrm {G}\star D^c)\), i.e., only based on the stress data.
The implemented models are trained with their respective loss functions using the Adam optimizer, cf. [65], at what \(D^t\) is used as validation dataset. The training of a model is stopped if the loss function with respect to \(D^t\) does not improve over 50 epochs. Model instances were created and trained until for every model an instance was found yielding an \(\mathrm {MSE}^P(\mathrm {G}\star D^t)\) value below 1000 for the stresses.
Results
The calibrated models \(W^\mathsf {Wsym}\), \(W^\mathsf {Wdir}\) and \(\varvec{P}^\mathsf {Pdir}\) are now compared based on the stress error \(\mathrm {MSE}^P\). Hereby, for every single simulation mode the \(\mathrm {MSE}^P\) is computed, and its average and standard deviation with respect to \(\mathrm {G}\) are calculated. As shown in Table 3, the hyperelastic model \(W^\mathsf {Wsym}\) yields, of course, the same value for \(\mathrm {MSE}^P\) for every \(\varvec{Q}\in \mathrm {G}\) and zero standard deviation with respect to \(\mathrm {G}\), since it fulfills the material symmetry by construction. Based on the chosen training settings, \(W^\mathsf {Wsym}\) yields, however, approximately twice the \(\mathrm {MSE}^P\) compared to its direct alternative \(W^\mathsf {Wdir}\) for the deformation modes of the calibration dataset \(D^c\). The elastic direct model \(\varvec{P}^\mathsf {Pdir}\) yields the best quantitative results in terms of \(\mathrm {MSE}^P\) with respect to the modes in \(D^c\), but has the highest standard deviations for \(\varvec{Q}\in \mathrm {G}\).
After comparing the errors w.r.t. the calibration modes in \(D^c\), the generalization qualities of the different models, i.e., the ability of a model to predict data it has not seen yet in training, is assessed using the three test modes. The first two test modes are essentially (nonequi) biaxial deformations, where both axes are experiencing different strains. Both modes lead to similar results, as can be seen in the biaxial0.5 and biaxial0.33 rows in Table 3. It should be noted that the first model tends to maintain the same \(\mathrm {MSE}^P\) as in the training modes, whereas the other two models are giving higher \(\mathrm {MSE}^P\). Lastly, not only different tension/compression strains are applied to the models, but also a scenario, where tension/compression is combined with shear strains as depicted by shearcombined. Here, the generalization abilities of the models can be seen even more clearly. Row shearcombined in Table 3 shows that \(\mathsf {Wsym}\) and \(\mathsf {Wdir}\) maintain somewhat accurate predictions, but \(\mathsf {Pdir}\) shows a much higher \(\mathrm {MSE}^P\). These observations suggest that the generalization abilities of the hyperelastic model including the symmetrization \(\mathsf {Wsym}\) are slightly better than without in \(\mathsf {Wdir}\), while the elastic, stressbased model \(\mathsf {Pdir}\) generalizes much worse, though providing far more accurate results on the training data.
These findings can be confirmed in Fig. 7, which shows that the \(\mathsf {Pdir}\) model performs better on the data used for training. In this context, it should be highlighted that the data augmentation is actually crucial for the success of the sleeker models, as can be seen in comparison between Fig. 8, where the fully calibrated models are shown, and Fig. 9, where the models trained with the nonaugmented data used for calibration of \(\mathsf {Wsym}\) are shown. The comparison demonstrates the deterioration of the prediction quality w.r.t. rotations for the \(\mathsf {Wdir}\) and \(\mathsf {Pdir}\) models when the material symmetry is not part of the calibration data, i.e., when the calibration only uses \(D^c\) instead of \(G\star D^c\).
When looking at the generalization abilities of the models, the stress model \(\mathsf {Pdir}\) (trained again with data augmentation) still delivers accurate predictions of the test data for the shearcombined deformation mode shown in Fig. 10 (though not as accurate as for the training data), its much worse accuracy w.r.t. symmetry rotations can be seen in Fig. 11. This explains the large standard deviation for \(\mathsf {Pdir}\) in Table 3 when considering the shearcombined scenario. Furthermore, the visualizations indicate the good generalization properties of the \(\mathsf {Wsym}\) and \(\mathsf {Wdir}\) models and also show that the direct hyperelastic model maintains reasonable accuracy w.r.t. material symmetry.
Performance aspects
Besides the quality of their fit of the microstructure simulations, also the performance of the ANNbased constitutive models should be considered when choosing a network for later use in a macroscale FE computation. Therefore, a comparison of the execution times of the three models \(\mathsf {Wsym}, \mathsf {Wdir}\), and \(\mathsf {Pdir}\) is conducted. For this purpose, all three models are build in TensorFlow and a number of random deformation gradients \(\varvec{F}\) is used as input for the network. In Table 4, the number of deformation gradients \(\varvec{F}\) samples the model has to evaluate is shown in the first column. In the other columns, the corresponding evaluation time each model needed is shown as the average of 100 executions. The sample sizes are doubled as long as the system is able to run the computation. Generally, the times for each model remain roughly constant until a certain number of samples is reached, since evaluations can be performed in parallel by the GPU. Furthermore, it can be seen that the \(\mathsf {Wsym}\) model is about one order of magnitude slower compared to the \(\mathsf {Wdir}\) model, though the architecture is slightly smaller, see Table 2, which is due to the symmetry group transformations involved, compare (25). Modelling the stresses directly through the \(\mathsf {Pdir}\) model results in a minor speedup, as it reduced the time needed by the hyperelastic potential model \(\mathsf {Wdir}\) by about a third, which probably stems from the additional automatic differentiation required for obtaining \(\varvec{P}=\partial W/\partial \varvec{F}\) for the potentialbased, hyperelastic models. Considering that the stress model has the worst generalization abilities compared to \(\mathsf {Wdir}\) and that \(\mathsf {Wsym}\) is an order of magnitude slower compared to the direct model, the \(\mathsf {Wdir}\) model seems the most reasonable choice to use in production.
For comparison, the time needed to compute the response of one randomly perturbed RUC is—depending on the prescribed deformation \(\varvec{F}\)—around 45 s. So even in the case that one singular perturbation is sufficient, the constitutive model evaluations are faster by a factor of about 250 (for \(\mathsf {Wsym}\)) up to 4600 (for \(\mathsf {Wdir}\)). Since multiple runs with different perturbations are necessary in order to minimize the parasitic stresses, as well as due to the fact that parallelization is possible for the models, these factors merely form the lower bound for the speedup. Thus, a sequential multiscale simulation using an ANNbased constitutive model, which will be demonstrated in the following Section, can be expected to show a similar speedup compared to a concurrent FE^{2} multiscale approach.
Macroscale simulation and verification
Nonlinear FEM with ANNbased material models
For the sequential multiscale simulation of elastic beam lattices, the structures are now macroscopically modelled as hyperelastic continua, which are governed by the weak form:
where \(\varOmega \subset \mathbb {R}^3\) is the domain occupied by the microstructured continuum, \(\varvec{u}:\varOmega \rightarrow \mathbb {R}^3\) the sought displacement conforming to the Dirichlet boundary condition \(\varvec{u}=\bar{\varvec{u}}\) on \(\varGamma _d\subset \partial \varOmega \), \(\varvec{f}:\varOmega \rightarrow \mathbb {R}^3\) the internal body forces, and \(\varvec{g}:\varGamma _n\subset \partial \varOmega \rightarrow \mathbb {R}^3\) the traction forces applied as Neumann boundary conditions. The variation of the strain energy can be further expressed as:
compare (23) and (24). Here, the effective behavior of lattice unit cells is incorporated in the formulation through the homogenized constitutive models, i.e., the ANNbased potentials \(W^\mathsf {Wsym}\) and \(W^\mathsf {Wdir}\), as introduced and trained above.
The finite element discretization of this nonlinear mechanical problem is then implemented in the opensource high performance multiphysics finite element software NGsolve, version 6.2.2102 [66]. NGsolve provides a convenient implementation of variational models, which is here used with the ANNbased potentials \(W^\mathsf {Wsym}\) and \(W^\mathsf {Wdir}\). For this purpose, the weights and biases of the trained networks are exported from TensorFlow and all operations for their evaluation, as well as further derivations for linearization, are taken care of by NGsolve.
Numerical results and verification
For the application and verification of the multiscale implementations, a cube of size \(0.1\times 0.1\times 0.1\) m^{3} with BCC lattice microstructure (with aspect ratio \(a=0.1\)) is investigated. The lattice cube is subject to a clamped stretch, i.e., the left and right surface are completely fixed and both surfaces are forced to part from each other using DirichletBCs. In this scenario, no external forces are present, i.e., \(\varvec{f}=0,\,\varvec{g}=0\). The deformation is characterized by the parameter \(\lambda \), which describes the incrementally applied strain on the right surface as a fraction of the position of the right side, whilst all other displacements on the sides are set to be zero:
with \(\varGamma _{left} = \{X_1=0\}\) and \(\varGamma _{right} = \{X_1=0.1\ \text {m}\}\).
Convergence study
To verify the implementation with the ANNbased potentials, first a finite element convergence study is carried out. For this purpose, tetrahedron meshes of degree 1, 2, and 3 are constructed and subsequently refined. The simulations were then executed up to \(\lambda =0.1\) using the \(\mathsf {Wdir}\) model. Figure 12 shows the resulting total strain energies \(\Pi (\varvec{u})=\tfrac{1}{2}\int _\varOmega \hat{W}(\varvec{E}(\varvec{u}))\, dV\) at \(\lambda =0.1\) plotted against the number of elements in the refined meshes. The lines represent the different element degrees, i.e., the degree 1 represents linear tetrahedrons, 2 quadratic, and 3 cubic. It can be observed that all element formulations converge with higher element count and that the accuracy and speed of convergence increases with degree. This behavior is, of course, to be expected and simply confirms that the constitutive models and their linearizations must be correctly implemented in the FE context. When considering the performance apart from the pure stability, the solutions that seem most promising are the quadratic tet with 596 elements, resulting in an energy of \(30.4\ \text {mJ}\) and the cubic tet with 122 elements, resulting in an energy of \(30.4\ \text {mJ}\) as well. Compared to the minimal energy the computations showed of \(30.1\ \text {mJ}\), this is an error of about \(1\%\), which seems acceptable for the following further evaluations.
Comparison with fully simulated beam lattices
After having verified that the homogenized constitutive models are suitable for FE calculations, it is to be verified whether the effective models and sequential multiscale simulation accurately describe the behavior of beam lattices. For this purpose, the same setup as described above with a lattice cube subject to clamped tension was implemented using lattices of \(5\times 5\times 5\) and \(7\times 7\times 7\) BCC RUCs.
In order to assess the reasonable agreement of the homogenized model and the fully resolved, direct lattice simulation with beam elements, the simulation just as described above is conducted for tension with \(\lambda \) up to \(+0.25\) and for compression with \(\lambda \) up to \(0.15\), where possible. Not all models show good convergence in compressive scenarios as is discussed in the next paragraph. The resulting strain energies, as well as the resulting reaction forces on \(\varGamma _{right}\), are plotted over the applied strain \(\lambda \) in Fig. 13. For the fullyresolved lattices, we executed a maximum of 5 simulations with random perturbations for both compression and tension. Up until loss of convergence, these simulations are indistinguishable in terms of their energy and force values and thus only the ones with the highest attained strains are plotted. Even though only relatively few unit cells are used in order to reduce the computational effort of the fully resolved simulations, i.e., no proper separation of scales applies, the homogenized models result in strain energies and forces close to those of the fullscale simulations. Especially the strain energy plot shows good agreement of the homogenized models with the full scale simulation. The forces of the different simulations also agree reasonably well; especially on the compression side the buckling effect in the homogenized models is clearly visible in the flattening of the curve. Whilst being not as sharp as on the fullscale simulation, which also applies to the data fit itself as shown in Fig. 7, this nevertheless shows the capability of the generated models and the multiscale method to predict the bucklingdominated behavior of metamaterials fairly well. Furthermore, the results from the tension simulations (at \(\lambda =0.25\)) are visualized in Fig. 14, while the results form the compression simulations (at \(\lambda =0.05\)) are shown in Fig. 15. Also here, a good visual agreement of the deformed shapes can be observed.
It should also be noted that the homogenized models are numerically more stable compared to the fullscale simulations, which also require perturbations in order to resolve the onset of buckling, resulting in the ability to compute higher compressive strains. However, in compression, at some point convergence issues also occur for the continuum FEM with \(\mathsf {Wdir}\) (at \(\lambda \approx {0.09}\)) and \(\mathsf {Wsym}\) (at \(\lambda \approx {0.14}\)), which can be seen from the prematurely terminated compression curves in Fig. 13. This might probably be related to loss of ellipticity and a lack of uniqueness of the solutions obtainable from the ANNbased models, which could potentially be resolved by incorporating polyconvexity conditions [67] into the models in future work. Despite that, the results demonstrate the ability of the obtained models to predict the behaviour of the metamaterials under large compressive strains, where the effect of the buckling is clearly visible in the flattening of the curves. Even though the ANNbased models also exhibit convergence issues, they are still much faster and more stable than direct numerical simulations.
Conclusions
A sequential nonlinear multiscale method for the simulation of elastic beam lattices was developed in this work, showing that metamaterials subject to large deformations and instabilities can be efficiently and accurately described using highly flexible, datadriven effective constitutive models formulated upon artificial neural networks.
For the generation of calibration data for effective constitutive models, a stochastic perturbation approach for nonlinear postbuckling analysis of RUCs has been introduced, which does not require preceding eigenmode analysis and mode selection. By considering multiple instances and averaging, parasitic stress components of the effective stressstrain responses could be significantly reduced compared to an eigenmodebased reference approach with a single perturbation. Using this method, homogenized energy densities and stress tensors of the BCC unit cell were obtained for various prescribed deformation modes (uniaxial tension, volumetric tension, shear, etc.).
The generated data was then used to train and evaluate three different ANNbased constitutive models: (1) \(\mathsf {Wsym}\), a hyperelastic model incorporating the finite symmetry group of the RUC, (2) \(\mathsf {Wdir}\), a hyperelastic model trained to show anisotropic behavior through data augmentation, and (3) \(\mathsf {Pdir}\), an elastic, stressbased model also trained through data augmentation. By comparing the resulting MSE of the PiolaKirchhoff stress tensor, it could be observed that the stress model \(\mathsf {Pdir}\), whilst possessing superior qualities on the training dataset, showed inferior generalization abilities when evaluated on the validation dataset. In contrast, the hyperelastic models \(\mathsf {Wsym}\) and \(\mathsf {Wdir}\) showed good generalization abilities with only slightly increased errors with respect to the validation dataset. In terms of computational performance, the direct hyperelastic model \(\mathsf {Wdir}\) evaluates much faster than the model \(\mathsf {Wsym}\) due the computational expenses of the \(\mathsf {Wsym}\) model through the group symmetrization. This performance advantage of the \(\mathsf {Wdir}\) model puts up with a comparably small loss in prediction quality when reproducing the anisotropy of the cell, as well as in a neglectable loss in the generalization ability on the test data. This suggests that hyperelastic models should be preferred and that approximation of the material symmetry through data augmentation is favorable if computational performance is prioritized over accuracy and physical exactness of the material model.
The homogenized, anisotropic, hyperelastic material models were then implemented into a nonlinear FE code for sequential multiscale simulation and delivered good results when compared to fullscale simulations of BCC lattice structures. Besides the advantages of computational effort and therefore timesavings, the homogenized continuum models also prevail in the aspect of complexity. The convergence behavior of the continuum models is not dependent on perturbations for postbuckling analysis. Further, it is less intrusive and much easier to implement a model based on the hyperelastic energy density compared to implementing a model based on the internal structure of a lattice and add corresponding lattice perturbations throughout the macroscopic structure. Therefore, the multiscale approach speeds up the development process in two ways: the generation of the system to be simulated and the computation of the simulation itself.
The presented sequential multiscale framework based on anisotropic (hyper)elastic ANNmodels could be readily applied to the multiscale modeling of other beam lattice RUCs, as well as other types of elastic microstructures, such as (minimal) surfacebased foams or composites. In this context, other symmetry groups and parametric microstructures could be investigated, cf. [68], even with potentially varying symmetries. Further, future work should be directed towards including further desirable properties into the constitutive formulations, such as material stability, and extend the framework towards thermodynamically consistent timedependent and inelastic modeling, which is particularly important for applications of additively manufactured, polymeric lattices.
Data Availability Statement
The authors provide access to the complete simulation data through the public GitHub repository https://github.com/CPShub/simdata.
References
 1.
Wegener M (2013) Metamaterials beyond optics. Science vol 342(6161). Publisher: American Association for the Advancement of Science Section: Perspective, pp. 939940. https://doi.org/10.1126/science.1246545
 2.
Barchiesi E, Spagnuolo M, Placidi L (2019) Mechanical metamaterials: a state of the art. Math Mech Solids 24(1):212–234. https://doi.org/10.1177/1081286517735695
 3.
Surjadi JU et al (2019) Mechanical metamaterials and their engineering applications. Adv Eng Mater 21(3):1800864. https://doi.org/10.1002/adem.201800864
 4.
Zheng X et al (2014) Ultralight, ultrastiff mechanical metamaterials. Science 344(6190):1373–1377. https://doi.org/10.1126/science.1252291
 5.
Nguyen BD, Cho JS, Kang K (2016) Optimal design of “Shellular’’, a microarchitectured material with ultralow density. Mater Design 95(Supplement C):490–500. https://doi.org/10.1016/j.matdes.2016.01.126
 6.
Frenzel T, Findeisen C, Kadic M, Gumbsch P, Wegener M (2016) Tailored buckling microlattices as reusable lightweight shock absorbers. Adv Mater 28(28):5865–5870. https://doi.org/10.1002/adma.201600610
 7.
Bertoldi K, Vitelli V, Christensen J, van Hecke M (2017) Flexible mechanical metamaterials. Nat Rev Mat vol 2(11). Number: 11 Publisher: Nature Publishing Group, pp 1–11. https://doi.org/10.1038/natrevmats.2017.66
 8.
Bückmann T, Thiel M, Kadic M, Schittny R, Wegener M (2014) An elastomechanical unfeelability cloak made of pentamode metamaterials. In: Nature Communications 5.1. Number: 1 Publisher: Nature Publishing Group, p 4130. https://doi.org/10.1038/ncomms5130
 9.
Babaee S, Shim J, Weaver JC, Chen ER, Patel N, Bertoldi K (2013) 3D soft metamaterials with negative poisson’s ratio. Adv Mater 25(36):5044–5049. https://doi.org/10.1002/adma.201301986
 10.
Frenzel T, Kadic M, Wegener M (2017) Threedimensional mechanical metamaterials with a twist. Science vol 358(6366). Publisher: American Association for the Advancement of Science Section: Report, pp 10721074. https://doi.org/10.1126/science.aao4640
 11.
Qu J, Kadic M, Naber A, Wegener M (2017) Microstructured twocomponent 3D metamaterials with negative thermalexpansion coefficient from positive constituents. Sci Rep vol 7(1). Number: 1 Publisher: Nature Publishing Group, p 40643. https://doi.org/10.1038/srep40643
 12.
Ding Z, Weeger O, Qi H, Dunn M (2018) 4D Rods: 3D structures via programmable 1D composite rods. Mater Design 137:256–265. https://doi.org/10.1016/j.matdes.2017.10.004
 13.
Bodaghi M, Liao WH (2019) 4D printed tunable mechanical metamaterials with shape memory operations. Smart Mater Struct 28(4):045019. https://doi.org/10.1088/1361665X/ab0b6b
 14.
Ge Q et al. (2021) 3D printing of highly stretchable hydrogel with diverse UV curable polymers. Sci Adv 7(2). Publisher: American Association for the Advancement of Science Section: Research Article, eaba4261. https://doi.org/10.1126/sciadv.aba4261
 15.
Chen Y, Qian F, Zuo L, Scarpa F, Wang L (2017) Broadband and multiband vibration mitigation in lattice metamaterials with sinusoidallyshaped ligaments. Extreme Mech Lett 17:24–32. https://doi.org/10.1016/j.eml.2017.09.012
 16.
Hosseinzadeh HR (2018) Metamaterials in medicine: a new era for future orthopedics. Orthop Res Online J. https://doi.org/10.31031/oproj.2018.02.000549
 17.
Cheng NG, Gopinath A, Wang L, Iagnemma K, Hosoi AE (2014) Thermally tunable, selfhealing composites for soft robotic applications. Macromol Mater Eng 299(11):1279–1284. https://doi.org/10.1002/mame.201400017
 18.
Jamshidian M, Boddeti N, Rosen D, Weeger O (2020) Multiscale modelling of soft lattice metamaterials: Micromechanical nonlinear buckling analysis, experimental verification, and macroscale constitutive behaviour. Int J Mech Sci 188:105956. https://doi.org/10.1016/j.ijmecsci.2020.105956
 19.
Weeger O, Boddeti N, Yeung SK, Kaijima S, Dunn M (2019) Digital design and nonlinear simulation for additive manufacturing of soft lattice structures. Addit Manuf 25:39–49. https://doi.org/10.1016/j.addma.2018.11.003
 20.
Glaesener RN, Träff EA, Telgen B, Canonica RM, Kochmann DM (2020) Continuum representation of nonlinear threedimensional periodic truss networks by onthefly homogenization. Int J Solids Struct 206:101–113. https://doi.org/10.1016/j.ijsolstr.2020.08.013
 21.
Vigliotti A, Pasini D (2012) Linear multiscale analysis and finite element validation of stretching and bending dominated lattice materials. Mech Mater 46:57–68. https://doi.org/10.1016/j.mechmat.2011.11.009
 22.
Arabnejad S, Pasini D (2013) Mechanical properties of lattice materials via asymptotic homogenization and comparison with alternative homogenization methods. Int J Mech Sci 77:249–262. https://doi.org/10.1016/j.ijmecsci.2013.10.003
 23.
Wang Y, Xu H, Pasini D (2017) Multiscale isogeometric topology optimization for lattice materials. Comput Methods Appl Mech Eng. Special Issue on Isogeomric Analysis: Progress and Challenges, 316:568–585. https://doi.org/10.1016/j.cma.2016.08.015
 24.
Latture RM, Begley MR, Zok FW (2018) Design and mechanical properties of elastically isotropic trusses. J Mater Res 33(03):249–263. https://doi.org/10.1557/jmr.2018.2
 25.
Geers M, Kouznetsova V, Brekelmans W (2010) Multiscale computational homogenization: Trends and challenges. J Comput Appl Math 234(7):2175–2182. https://doi.org/10.1016/j.cam.2009.08.077
 26.
Geers MGD, Kouznetsova VG, Matouš K, Yvonnet J (2017) Homogenization Methods and Multiscale Modeling: Nonlinear Problems. In: Encyclopedia of Computational Mechanics. Second Edition. John Wiley & Sons, Ltd
 27.
Matouš K, Geers MGD, Kouznetsova VG, Gillman A (2017) A review of predictive nonlinear theories for multiscale modeling of heterogeneous materials. J Comput Phys 330:192–220. https://doi.org/10.1016/j.jcp.2016.10.070
 28.
Glaesener RN, Lestringant C, Telgen B, Kochmann DM (2019) Continuum models for stretchingand bendingdominated periodic trusses undergoing finite deformations. Int J Solids Struct. https://doi.org/10.1016/j.ijsolstr.2019.04.022
 29.
Pal RK, Ruzzene M, Rimoli JJ (2016) A continuum model for nonlinear lattices under large deformations. Int J Solids Struct 96:300–319. https://doi.org/10.1016/j.ijsolstr.2016.05.020
 30.
El Nady K, Goda I, Ganghoffer JF (2016) Computation of the effective nonlinear mechanical response of lattice materials considering geometrical nonlinearities. Comput Mech 58(6):957–979. https://doi.org/10.1007/s0046601613267
 31.
El Nady K, Dos Reis F, Ganghoffer JF (2017) Computation of the homogenized nonlinear elastic response of 2D and 3D auxetic structures based on micropolar continuum models. Compos Struct 170:271–290. https://doi.org/10.1016/j.compstruct.2017.02.043
 32.
Damanpack AR, Bodaghi M, Liao WH (2019) Experimentally validated multiscale modeling of 3D printed hyperelastic lattices. Int J NonLinear Mech 108:87–110. https://doi.org/10.1016/j.ijnonlinmec.2018.10.008
 33.
Fernández M, Jamshidian M, Böhlke T, Kersting K, Weeger O (2021) Anisotropic hyperelastic constitutive models for finite deformations combining material theory and datadriven approaches with application to cubic lattice metamaterials. Comput Mech 67:653–677. https://doi.org/10.1007/s00466020019547
 34.
Fritzen F, Kunc O (2018) Twostage datadriven homogenization for nonlinear solids using a reduced order model. Eur J Mech A/Solids 69:201–220. https://doi.org/10.1016/j.euromechsol.2017.11.007
 35.
Kunc O, Fritzen F (2019) Finite strain homogenization using a reduced basis and efficient sampling. Math Comput Appl 24(2):56. https://doi.org/10.3390/mca24020056
 36.
Liu Z, Bessa MA, Liu WK (2016) Selfconsistent clustering analysis: an efficient multiscale scheme for inelastic heterogeneous materials. Comput Methods Appl Mech Eng 306:319–341. https://doi.org/10.1016/j.cma.2016.04.004
 37.
Tang S, Zhang L, Liu WK (2018) From virtual clustering analysis to selfconsistent clustering analysis: a mathematical study. Comput Mech 62(6):1443–1460. https://doi.org/10.1007/s004660181573x
 38.
Cheng G, Li X, Nie Y, Li H (2019) FEMCluster based reduction method for efficient numerical prediction of effective properties of heterogeneous material in nonlinear range. Comput Methods Appl Mech Eng 348:157–184. https://doi.org/10.1016/j.cma.2019.01.019
 39.
Yvonnet J, Gonzalez D, He QC (2009) Numerically explicit potentials for the homogenization of nonlinear elastic heterogeneous materials. Comput Methods Appl Mech Eng 198(33):2723–2737. https://doi.org/10.1016/j.cma.2009.03.017
 40.
Yvonnet J, Monteiro E, He QC (2013) Computational homogenization method and reduced database model for hyperelastic hetereogeneous structures. Int J Multiscale Comput Eng. https://doi.org/10.1615/IntJMultCompEng.2013005374
 41.
Ibañez R et al (2017) Datadriven nonlinear elasticity: constitutive manifold construction and problem discretization. Comput Mech 60(5):813–826. https://doi.org/10.1007/s0046601714401
 42.
Man H, Furukawa T (2011) Neural network constitutive modelling for nonlinear characterization of anisotropic materials. Int J Numer Meth Eng 85(8):939–957. https://doi.org/10.1002/nme.2999
 43.
Le BA, Yvonnet J, He QC (2015) Computational homogenization of nonlinear elastic materials using neural networks. Int J Numer Meth Eng 104(12):1061–1084. https://doi.org/10.1002/nme.4953
 44.
Ling J, Jones R, Templeton J (2016) Machine learning strategies for systems with invariance properties. J Comput Phys 318:22–35. https://doi.org/10.1016/j.jcp.2016.05.003
 45.
Ibáñez R, AbissetChavanne E, González D, Duval JL, Cueto E, Chinesta F (2019) Hybrid constitutive modeling: datadriven learning of corrections to plasticity models. IntJ Mater Form 12(4):717–725. https://doi.org/10.1007/s122890181448x
 46.
Yang H, Guo X, Tang S, Liu WK (2019) Derivation of heterogeneous material laws via datadriven principal component expansions. Comput Mech. https://doi.org/10.1007/s0046601901728w
 47.
González D, GarcíaGonzález A, Chinesta F, Cueto E (2020) A datadriven learning method for constitutive modeling: Application to vascular hyperelastic soft tissues. Materials 13(10):1–17. https://doi.org/10.3390/ma13102319
 48.
Linka K, Hillgärtner M, Abdolazizi KP, Aydin RC, Itskov M, Cyron CJ (2020) Constitutive artificial neural networks: a fast and general approach to predictive datadriven constitutive modeling by deep learning. J Comput Phys. https://doi.org/10.1016/j.jcp.2020.110010
 49.
Liu M, Liang L, Sun W (2020) A generic physicsinformed neural networkbased constitutive model for soft biological tissues. Comput Methods Appl Mech Eng 372:113402. https://doi.org/10.1016/j.cma.2020.113402
 50.
Masi F, Stefanou I, Vannucci P, MaffiBerthier V (2021) Thermodynamicsbased Artificial Neural Networks for constitutive modeling. J Mecha Phys Solids. https://doi.org/10.1016/j.jmps.2020.104277
 51.
Xue T et al (2020) A datadriven computational scheme for the nonlinear mechanical properties of cellular mechanical metamaterials under large deformation. Soft Matter 16(32):7524–7534. https://doi.org/10.1039/D0SM00488J
 52.
Talha M, Singh B (2014) Stochastic perturbationbased finite element for buckling statistics of FGM plates with uncertain material properties in thermal environments. Compos Struct 108:823–833. https://doi.org/10.1016/j.compstruct.2013.10.013
 53.
Schafer BW, GrahamBrady L (2006) Stochastic postbuckling of frames using Koiter’s method. Int J Struct Stab Dyn 06(03):333–358. https://doi.org/10.1142/S0219455406001976
 54.
Salerno G, Lanzo AD (1997) A nonlinear beam finite element for the postbuckling analysis of plane frames by Koiter’s perturbation approach. Comput Methods Appl Mech Eng 146(3):325–349. https://doi.org/10.1016/S00457825(96)012406
 55.
Casciaro R, Garcea G, Attanasio G, Giordano F (1998) Perturbation approach to elastic postbuckling analysis. Comput Struct 66(5):585–595. https://doi.org/10.1016/S00457949(97)001120
 56.
Simo JC, VuQuoc L (1986) A threedimensional finitestrain rod model. Part II: Computational aspects. Comput Methods Appl Mech Eng 58(1):79–116. https://doi.org/10.1016/00457825(86)900794
 57.
Antman SS (2005) Nonlinear problems of elasticity. Vol. 107. Applied mathematical sciences. Springer, New York. https://doi.org/10.1007/0387276491
 58.
Weeger O, Yeung SK, Dunn ML (2017) Isogeometric collocation methods for Cosserat rods and rod structures. Comput Methods Appl Mech Eng 316:100–122. https://doi.org/10.1016/j.cma.2016.05.009
 59.
Piegl L, Tiller W (1995) The NURBS Book. Monographs in visual computing. Springer, Berlin, Heidelberg. https://doi.org/10.1007/9783642973857
 60.
Hughes T, Cottrell J, Bazilevs Y (2005) Isogeometric analysis: CAD, finite elements, NURBS, exact geometry and mesh refinement. Comput Methods Appl Mech Eng 194(39):4135–4195. https://doi.org/10.1016/j.cma.2004.10.008
 61.
Pham H (2006) Springer handbook of engineering statistics. Springer Handbooks. Springer, London. https://doi.org/10.1007/9781846282881
 62.
Šilhavý M (1997) The mechanics and thermodynamics of continuous media. Springer, Berlin Heidelberg. https://doi.org/10.1007/9783662033890
 63.
Truesdell C, Noll W, Antman SS (2004) The nonlinear field theories of mechanics, 3rd edn. Springer, Berlin and New York
 64.
Abadi M et al. (2015) TensorFlow: LargeScale Machine Learning on Heterogeneous Systems. Software available from www.tensorflow.org
 65.
Kingma DP, Ba J (2017) Adam: A Method for Stochastic Optimization. arXiv preprint on arxiv:1412.6980v9
 66.
Netgen/NGSolve. Software available from www.ngsolve.org
 67.
Ball J (1976) Convexity conditions and existence theorems in nonlinear elasticity. Arch Ration Mech Anal 63(4):337–403. https://doi.org/10.1007/BF00279992
 68.
Fernández M, Fritzen F, Weeger O (2021) Material modeling for parametric finite hyperelasticity based on machine learning with application in optimization of metamaterials. ResearchGate preprint
Acknowledgements
M.F. and O.W. would like to thank the Department of Mechanical Engineering at TU Darmstadt for financial support.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Affiliations
Contributions
All authors contributed to the study conception and design. The development and implementation of the microscale beam simulations of the RUC and the perturbation approach were conducted by TG and OW. Simulations of the RUC and corresponding data processing were performed by TG. The development and implementation of the effective material models were conducted by TG and MF. The implementation of the effective material models in NGSolve, execution of the macroscopic/multiscale FE computations and corresponding data processing were carried out by TG. The simulations of the fully resolved structures were conducted by TG and OW. All authors contributed to the analysis of the microscopic and macroscopic behavior of the simulated structures. The first draft of the manuscript was written by TG and subsequently refined by MF and OW. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Gärtner, T., Fernández, M. & Weeger, O. Nonlinear multiscale simulation of elastic beam lattices with anisotropic homogenized constitutive models based on artificial neural networks. Comput Mech (2021). https://doi.org/10.1007/s0046602102061x
Received:
Accepted:
Published:
Keywords
 Nonlinear multiscale simulation
 Metamaterials
 Constitutive modeling
 Anisotropic hyperelasticity
 Machine learning