Abstract
A variational model of pressure-dependent plasticity employing a time-incremental setting is introduced. A novel formulation of the dissipation potential allows one to construct the condensed energy in a variationally consistent manner. For a one-dimensional model problem, an explicit expression for the quasiconvex envelope can be found which turns out to be essentially independent of the original pressure-dependent yield surface. The model problem can be extended to higher dimensions in an empirical manner. Numerical simulation exhibits well-posed behavior showing mesh-independent results.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Classical models for soils and granular materials are formulated in the framework of linearized plasticity based on pressure-dependent flow rules. While this approach works very well in engineering applications, it cannot explain complex structures seen in experiments conducted in granular media like clay, silt or sand, see in particular [1, Figure 12]. The aim of this paper is both the investigation and the modeling of these complex structures via relaxation methods, a well-established approach based on variational analysis. In a first step, the analysis will be based on a time discretization scheme which involves a variational formulation for the problems at given finite time increments. The corresponding variational problems will be studied concerning their stability properties and the main features of minimizing microstructures, which are closely related to the complexity of the experimental data. Moreover, the dependence of the main features of the minimizing structures on the given yield functions will be investigated. The goal is to employ the variational approach in order to describe the essential model features with as few parameters as possible, thus establishing this type of models for plasticity in soil mechanics.
Starting with [2,3,4], it was realized that experimentally observed microstructures may be explained based on energy minimization. This observation made it possible to use variational methods in order to understand complex material behavior. For example, the shape memory effect is related to an accommodation of macroscopic deformations on a microscopic scale through complex patterns. These patterns are determined through mathematical compatibility conditions which require in the nonlinear theory that two adjacent elastic deformation gradients \(\textbf{A}\) and \(\textbf{B}\) be rank-one connected in the sense that \(\textbf{A} - \textbf{B} = \textbf{a}\otimes \textbf{n}\). The vector \(\textbf{n}\) provides geometric information since it represents the orientation of planes that can connect the two states \(\textbf{A}\) and \(\textbf{B}\). Minimization of the original energy with respect to such compatible microstructures leads to the relaxed energy. This approach allows one on the one hand to formulate well-posed boundary-value problems that lend themselves to numerical solutions. The minimizing microstructures, on the other hand, will provide information on the expected deformation patterns. Applications of this approach specifically to elastoplastic materials can be found in [5, 6].
Since the explicit construction of microstructures requires to determine rank-one connections, which is algebraically a challenging problem, it is not surprising that there are only a few examples with applications to elastic materials for which a closed form of \(W^{qc}\) has been found [7,8,9,10,11,12,13]. In the linearized setting, the compatibility condition concerns symmetric stains \(\textbf{A}\) and \(\textbf{B}\) whose difference is symmetrized rank-one convex, i.e., \(\textbf{A}-\textbf{B} = \textbf{a}\odot \textbf{b}\) with \(\textbf{a}\odot \textbf{b} = \frac{1}{2}(\textbf{a}\otimes \textbf{b} + \textbf{b}\otimes \textbf{a})\). Examples for relaxation results in this context can be found in [14,15,16,17]. Numerical relaxation schemes are employed in [18]. A combination of an analytical approximation of the relaxed energy density combined with a finite element simulation can be found in [19,20,21]. A novel algorithmic approach for the computation of the relaxed energy density with applications to problems in phase transformations and models in plasticity was recently proposed in [22,23,24], see also [25, 26].
The paper is organized as follows: In Sect. 2, standard models of pressure-dependent plasticity are reviewed. In Sect. 3, we bring these models into a variational setting suitable for treatment within relaxation theory. In Sect. 4, this setting is specialized in order to obtain a one-dimensional model problem for which analytical results can be obtained. Section 5 contains the main results of this paper from the mathematical point of view. An explicit expression for the relaxed energy is derived in the one-dimensional case, i.e., the quasiconvex envelope of the energy density, which, in the given situation with one spatial variable, is equal to the convex envelope. In Sect. 6, the one-dimensional relaxed energy is, in a heuristic manner, extended to three dimensions, and in Sect. 7 numerical results are presented for both models, first for the relaxed energy derived for the one-dimensional problem, then for an extension of this energy to three dimensions, for which no rigorous mathematical results are available yet. More extensive results will be given in [27]. We will close with conclusion and outlook in Sect. 8.
2 Pressure-dependent plasticity
The most important class of materials exhibiting pressure-dependent plastic behavior are soils and granular media. They are usually heterogeneous mixtures of fluids (usually air and water) and particles (usually clay, silt, sand, and gravel) that have little to no cementation [28]. The shear strength is provided by friction and interlocking of the particles.
The understanding of deformations of solid materials is of key importance in applications. In contrast to elastic materials, which return to their original configuration upon unloading, elastoplastic materials may undergo a permanent change of their configuration if sufficiently large deviatoric strains occur in the material. On the other hand, it is frequently assumed that they may support arbitrary hydrostatic pressures. Usually this material behavior is modeled in a linearized setting via a convex set \(\mathbb {K}\subset \mathbb {R}^{n\times n}_{\textrm{sym}}\) which describes the set of admissible stresses in the material which do not lead to permanent deformations. Only for stresses on \(\partial \mathbb {K}\), permanent plastic deformation occurs and this deformation is formulated via a flow rule which determines the evolution of the plastic part of the total elastoplastic deformation. Consequently, the set \(\mathbb {K}\) is an infinite cylinder
where \(\textbf{Id}\) denotes the identity matrix, K is a compact subset in \(\mathbb {R}^{n\times n}_{\textrm{D}}\) and
is the set of all symmetric and trace free matrices. Here and in the following we will use boldface symbols for vector-valued or matrix-valued objects.
Thus, the basic problem in the theory of linear elastoplasticity can be formulated as follows. Given a material body described in its reference configuration by a domain \(\Omega \subset \mathbb {R}^n\), find a deformation \(\textbf{y}:\Omega \rightarrow \mathbb {R}^n\) and a decomposition of the symmetrized displacement gradient \(\varvec{\varepsilon }\) defined by
into an elastic and a plastic part, \(\varvec{\varepsilon }=\varvec{\varepsilon }_\textrm{e} + \varvec{\varepsilon }_\textrm{p}\), \(\varvec{\varepsilon }_\textrm{p} \in \mathbb {R}^{n\times n}_{\textrm{D}}\), such that the following system of equations holds:
-
(i)
constitutive equation \(\varvec{\sigma } = \mathbb {C}\,\varvec{\varepsilon }_\textrm{e}\),
-
(ii)
balance of momentum \(\textbf{u}_{tt}-\text {div} \varvec{\sigma } = \textbf{f}\) or the equilibrium condition \(-\text {div}\varvec{\sigma } = \textbf{f}\),
-
(iii)
stress constraint \(\varvec{\sigma } \in \mathbb {K}\),
-
(iv)
associated flow rule \((\varvec{\xi } -\varvec{\sigma }):\dot{\varvec{\varepsilon }}_\textrm{p} \le 0\) for every \(\varvec{\xi }\in \mathbb {K}\).
It is important to note that the assumption (iv) is not realistic for materials in which frictional effects dominate as in clay, sand, gravel or other granular materials. Here, pressure dependence needs to be taken into account and this leads to a set of admissible stresses that cannot be modeled by an infinite cylinder; instead one is led to consider cones in the space of principle stresses, classical models being the Drucker–Prager and Mohr–Coulomb criteria [29]. To be specific, let \(\pi =- 1/3 \, \text {tr}\varvec{\sigma }\) denote the pressure, then the convex elastic domain is given as
where \(K_\pi \) is a compact subset in \(\mathbb {R}^{n\times n}_{\textrm{D}}\) and \(-\infty \le \pi _\textrm{min}<\pi _\textrm{max}\le \infty \). In order to comply with experimental observations, plastic deformation should still be volume preserving. For this purpose, the flow rule (iv) is then replaced by the non-associated flow rule and the full system of equations in its quasistatic form is given by
-
(i)
constitutive equation \(\varvec{\sigma } = \mathbb {C}\,\varvec{\varepsilon }_\textrm{e}\),
-
(ii)
the equilibrium condition \(-\text {div}\varvec{\sigma } = \textbf{f}\),
-
(iii)
stress constraint \(\varvec{\sigma } \in \mathbb {K}\),
-
(iv’)
non-associated flow rule \((\varvec{\xi } -\varvec{\sigma }):\dot{\varvec{\varepsilon }}_\textrm{p} \le 0\) for every \(\varvec{\xi }\in \mathbb {K}_\pi \).
Note that \(\dot{\varvec{\varepsilon }}_\textrm{p}\) is now normal to the boundary of \(\mathbb {K}_\pi \) as subset of \(\mathbb {R}^{n\times n}_{\textrm{D}}\) but not normal to \(\mathbb {K}\) as subset of \(\mathbb {R}^{n\times n}_{\textrm{sym}}\) (Fig. 1). This observation lies at the heart of the problems concerning a variational formulation to be discussed in the following.
3 Variational model
In order to be able to apply relaxation theory, we have to reformulate the setting given in the previous section. For this purpose, we will employ a variational approach for the description of inelastic processes, see [30,31,32,33,34,35]. Let us consider a physical system described by (sets of) external, i.e., controllable, state variables, in our case given by the total strain \(\varvec{\varepsilon }\), and internal state variables \(\textbf{z}\). Here \(\textbf{z}=\{\varvec{\varepsilon }_\textrm{p},p\}\), where \(\varvec{\varepsilon }_\textrm{p}\) denotes plastic strain and \(p\ge 0\) a hardening variable. We assume that the system behavior may be defined using only two scalar potentials: free energy \(\psi (\varvec{\varepsilon },\textbf{z})\) and dissipation potential \(\Delta (\varvec{\varepsilon },\textbf{z},\dot{\textbf{z}})\).
The evolution of the internal variables is described by
Note that Eq. (3.1) may be written as a stationarity condition of the minimization problem
We assume a free energy of the form
Here, p is an auxiliary internal variable, K the bulk modulus, \(\mu \) the shear modulus, and \(\beta \) a hardening modulus. The function \(\rho (\text {tr} \varvec{\varepsilon })\) governs pressure dependence. For constant \(\rho (\text {tr} \varvec{\varepsilon })\), Eq. (3.3) corresponds to the free energy of an isotropic elastoplastic material with kinematic hardening. In this work, we will assume \(\rho \) to be a concave function inside an interval \([\xi _\textrm{min},\xi _\textrm{max}]\) and zero outside of it. The function \(\rho (\text {tr} \varvec{\varepsilon })\) for a typical model, e.g., capped Drucker–Prager, see [28], can be seen in Fig. 2. The stress is given as
Note that the pressure is given by the relation \(\pi = - K \text {tr} \varvec{\varepsilon } - \rho ^\prime (\text {tr}\varvec{\varepsilon }) p\). Here, the second term may be interpreted as a consolidation pressure due to an inelastic volume change governed by the parameter p. A variationally more consistent approach would be to replace Eq. (3.3) by an energy of the form
where \(\theta _\textrm{p}\) now represents the inelastic volume change explicitly. However, we would like to keep our formulation simple within the present work and plan to discuss this formulation in a subsequent paper.
The dissipation potential will be assumed as
This procedure differs from the standard one, as for example implemented in [31]. In the standard approach, Eq. (3.3) is replaced by
and Eq. (3.6) by
However, the formulation in Eq. (3.8) would be problematic with respect to the variational formulation which does not allow the dissipation potential to depend on an external variable. Moreover, it would lead to complications when it comes to integrating it up in order to obtain a dissipation distance, as will be done later in the text. Minimizing trajectories connecting different states will be non-monotone leading to behavior that is questionable from a physical point of view.
Minimization in Eq. (3.2) gives, taking into account the expressions in Eqs. (3.3) and (3.6),
and
Note that Eq. (3.9) is equivalent to the yield condition
and the flow rule
where \(\kappa \ge 0\) is a consistency parameter satisfying the Kuhn–Tucker conditions
Note that Eq. (3.10) defines p as equivalent plastic strain.
Having established a variational scheme for pressure-dependent plasticity now, we will employ a time-incremental approach as proposed in [31, 33] in order to investigate the evolution of material microstructures. For this purpose, we consider a time increment \(\Delta t=[t_n,t_{n+1}]\). Given the external forces and boundary conditions at time \(t_{n+1}\) and the values of the internal variables \(\textbf{z}_n\) at time \(t_n\), we seek the values \(\varvec{\varepsilon }_{n+1}\) and \(\textbf{z}_{n+1}\) at time \(t_{n+1}\).
A time-incremental version of the dissipation-potential is given by the dissipation distance
The extremal principle Eq. (3.2) can now be replaced by
Minimization with respect to p gives immediately
Then, substitution of Eq. (3.16) into Eq. (3.15) and minimization with respect to \(\varvec{\varepsilon }_\textrm{p}\) yields
Solving for \(\varvec{\varepsilon }_\textrm{p}\) in Eq. (3.17)yields
where \(\left[ x\right] _+=\max \{0,x\}\). This allows to introduce the condensed energy, see [31, 33], via
Substitution of Eqs. (3.16) and (3.18) into Eq. (3.19) gives, up to an irrelevant constant term of the form \(\frac{\beta }{2} \left| \varvec{\varepsilon }_{\textrm{p},n} \right| ^2\),
The condensed energy can now be considered as formally elastic and dependent on the parameters \(\varvec{\varepsilon }_{\textrm{p},n},p_n\), governing the behavior during the time increment \(\Delta t=[t_n,t_{n+1}]\). At the end of the time step, the internal variables can be updated using Eqs. (3.16) and (3.18). This way, the inelastic behavior can be approximated by solving a sequence of elastic problems.
4 A one-dimensional model problem
It is well known that quasiconvex envelopes can be calculated analytically only in rare special cases. In order to circumvent this difficulty, we restrict ourselves to a one-dimensional problem for which the quasiconvex envelope coincides with the convex envelope. We consider a state of specific deformation corresponding to a slender bar of material under uniform tension or compression and shear. We assume the bar to be supported by stiff fibers in a way that prevents bending deformations. The nonvanishing displacements and strains are in Cartesian coordinates
where we neglected terms of the form \(y u^{\prime \prime }\) and \(z u^{\prime \prime }\), which are small due to the slenderness of the bar. For the plastic strains, we assume that only the components \(\varepsilon _{\textrm{p}}:= \varepsilon _{\textrm{p}yx} = \varepsilon _{\textrm{p}xy}\) are different from zero. The free energy assumes the form
Substitution of Eq. (4.2) into Eq. (4.3) gives
As before, minimization with respect to \(\varepsilon _{\textrm{p}}\) and p once again leads to the condensed energy
For the remainder of the paper, we will focus on the case \(p_n=0\). This will simplify the mathematical treatment significantly and will allow to keep the derivations stringent and instructive. In mechanical terms, this means that we will describe the initiation of microstructures rather than their evolution. However, the general case will remain subject of future investigations.
It is convenient for the mathematical treatment following in the subsequent section to introduce a dimensionless formulation. For this purpose, we define modified variables and quantities by
giving, using \(p_n=0\),
Finally, we may drop the multiplicative constant \(2\mu \) and write \(z_n = \varepsilon _{\textrm{p},n}\). Since \(\psi _\textrm{cond}\) is the free energy density in the variational functional that needs to be minimized in the first time step, it is crucial to understand the convexity properties of \(\psi _\textrm{cond}\) in order to employ the direct method in the calculus of variations. It turns out that the concavity of r leads to a lack of convexity of \(\psi _\textrm{cond}\) and the approach via relaxation [36] requires one to characterize the quasiconvex envelope of \(\psi _\textrm{cond}\), which, in the scalar case, reduces to the convex envelope. This task is accomplished in the next section.
5 Characterization of the relaxed energy for the one-dimensional model
For any function \(f: \mathbb {R}^2\rightarrow \mathbb {R}\), we denote the convex envelope of f by \(f_\textrm{c}\) and, if f is differentiable at \(y\in \mathbb {R}^2\), the linear Taylor polynomial of f about the point y by \(T_y f\). Fix \(b>0\) and \(y_{\textrm{min}}, y_{\textrm{max}}\in \mathbb {R}\) with \(y_{\textrm{min}}< y_{\textrm{max}}\). Define the set of admissible dissipation functions by
Fix \(y_{\textrm{mid}}:= (y_{\textrm{min}}+y_{\textrm{max}})/2\) and define the piecewise affine function \(r_0\in \mathcal {R}\) by
see Fig. 2 for a sketch with \(b=0.095\). Note that the slope of \(r_0\) is only related to the hardening parameter b and, depending on the relative size of \(r_{\textrm{max}}=\max r\) and b, the two curves may have intersections or not. Our theory covers the case of small values of b for which the two curves do not intersect. For \(r\in \mathcal {R}\), the condensed energy is given by
Since the energy depends only on \(y_2-z_n\), it is sufficient to calculate the convex envelope \(f_\textrm{c}^{(r)}\) of \(f^{(r)}:= f^{(r)}_0\) with \(z_n=0\); the convex envelope of \(f^{(r)}_{z_n}(y_1,y_2)\) is given by \(f_\textrm{c}^{(r)}(y_1, y_2- z_n)\). If the dissipation function is uniquely defined from the context, we simply write f instead of \(f^{(r)}\). Define the disjoint sets
whose union is \(\mathbb {R}^2\), see Fig. 3. These sets describe the regions in which the condensed energy \(f^{(r)}\) is given by different algebraic expressions, depending on the sign of \(\vert y_2\vert -r(y_1)\). The first lemma describes a monotonicity property of the map \(r\mapsto f^{(r)}\).
Lemma 1
Consider two dissipation functions \(r, \bar{r}\in \mathcal {R}\). If \(y_1\in \mathbb {R}\) and \(r(y_1)\le \bar{r}(y_1)\), then for all \(y_2\in \mathbb {R}\) the inequality \(f^{(r)}(y_1, y_2)\le f^{(\bar{r})}(y_1, y_2)\) holds. In particular, if \(r\le \bar{r}\), then \(f^{(r)}\le f^{(\bar{r})}\) and \(g: \mathbb {R}^2\rightarrow \mathbb {R},\ g(y)= \frac{1}{2}y_1^2 + \frac{1}{2} \frac{b}{b+1} y_2^2\) is a convex lower bound of \(f^{(r)}\), which coincides on \(\widetilde{Y}\) with \(f^{(r)}\).
Proof
For any \(y\in \mathbb {R}^2\) with \(r(y_1)\le \bar{r}(y_1)\), it follows that \(\vert y_2\vert \ge \vert y_2\vert - r(y_1)\ge \vert y_2\vert - \overline{r}(y_1)\). Therefore, we have \(y_2^2\ge (\vert y_2\vert - r(y_1))_+^2\ge (\vert y_2\vert - \overline{r}(y_1))_+^2\) and the inequality
is obtained from Eq. (5.3). \(\square \)
Since Lemma 1 establishes a lower bound on \(f_\textrm{c}\), we may use the results in [37, Theorem 2.1] which state the following important property. For a lower semicontinuous function \(f: \mathbb {R}^d\rightarrow \mathbb {R}\), which satisfies the growth condition \(\lim _{\vert y\vert \rightarrow \infty } f(y)/\vert y\vert = \infty \), the convex envelope of f is a real-valued convex function and for any \(y\in \mathbb {R}^d\) the value \(f_\textrm{c}(y)\) is a convex combination of q function values \(f(y^{(i)})\), \(1\le i\le q\le d+1\). Moreover, the subgradient \(\partial f_\textrm{c}(y)\) in the sense of convex analysis, see [38], is nonempty and for any \(v\in \partial f_c(y)\) the hyperplane \(z\mapsto f(y)+\langle v,z-y\rangle \) is a supporting hyperplane at all the points \(y^{(i)}\), that is,
and
Since Eq. (5.6) implies \(f(y^{(i)})- \langle v, y^{(i)} \rangle = f(y^{(j)})- \langle v, y^{(j)} \rangle \) for all \(i,j\in \{1,\ldots ,q\}\) and Eq. (5.7) is equivalent to \(v\in \partial f(y^{(i)})\), a natural approach to calculate the convex envelope of a continuous function \(f: \mathbb {R}^d\rightarrow \mathbb {R}\) is the strategy to find pairwise distinct points \(y^{(1)},\ldots ,y^{(q)}\in \mathbb {R}^d\) with \(q\le d+1\) that satisfy the subgradient condition
and to define, for all \(y\in \text {Conv}(\{y^{(1)},\ldots ,y^{(q)}\})\), the convex hull of the points \(y^{(1)},\ldots ,y^{(q)}\), a candidate for the convex envelope of f by
It is useful to note that the representation (5.9) is related to the tangent plane of f in one of the points \(y^{(i)}\) in case that f is differentiable at \(y^{(i)}\). In fact, if for some direction \(w\in \mathbb {R}^d\) and some \(y\in \mathbb {R}^d\) the directional derivative \(D_w f(y)\) exists, then \(v\in \partial f(y)\) implies \(\langle v, w \rangle = D_w f(y)\). If f is even differentiable in y, then \(\{\partial f(y)\}=\{\nabla f(y)\}\). This leads to the observation that, if \(y^{(1)},\ldots ,y^{\left( q) \right) }\in \mathbb {R}^d\) is a solution of Eq. (5.8) and if f is differentiable in some \(y^{(i)}\), then for all \(y\in \text {Conv}(\{y^{(1)},\ldots ,y^{(q)}\})\) the function \(f_{\textrm{c}}\) is given by
see Fig. 4. The calculation of the subgradient involves an evaluation of the values of f on the whole space, and the solution of Eq. (5.8) is in general challenging and the necessary calculations are presented in “Appendix B.”
Theorem 2
Recall the definition of \(\widetilde{Y}\) in (5.4) and define the disjoint sets
as shown in the right panel in Fig. 5. Then, the convex envelope \(f_\textrm{c}\) of f is given by
The proof is presented in detail in Appendix B and is based on the verification of the geometry for the calculation of \(f_c\) in the left panel in Fig. 5 which formalizes the intuitive picture for the convex hull of a function in Fig. 4. In fact, \(f_c=f\) on \(Y_1\) and for \(y\in Y_2\) the convex hull \(f_{\textrm{c}}(y)\) is obtained as an average of two function values where the red line segments intersect the green and the blue line, respectively. Finally, for \(y\in Y_3\) the value of \(f_{\textrm{c}}(y)\) is an average of three points, namely of the function values in the three corners of the triangle \(Y_3\). The convex hull coincides with the tangent plane to f at \(y_*^+\) as indicated in (5.10).
Corollary 3
For all \(r\in \mathcal {R}\) with \(\sqrt{b} \cdot \frac{y_{\textrm{max}}-y_{\textrm{min}}}{2} \le r\left( \frac{y_{\textrm{min}}+y_{\textrm{max}}}{2} \right) \) the equality
holds.
Proof
In the proof of the previous theorem, we already showed the inequality
which implies the claimed equality by the monotonicity in taking convex envelopes. \(\square \)
Remark 4
More generally, one can even show that without any restrictions on r the convex envelope of \(f^{(r)}\) does not change while replacing r by \(\min \{r,r_0\}\). Corollary 3 is a special case of this statement, since Eq. (B1) implies \(\min \{r,r_0\}= r_0\) by concavity of r on \((y_{\textrm{min}}, y_{\textrm{max}})\).
6 Extension of the results to three dimensions
The convex envelope in the one-dimensional case for the density \(f(y_1,y_2,\varepsilon _{\textrm{p},n},p_n)\) given in Theorem 2 may be generalized in a natural way to three dimensions by employing a substitution inspired by the transformation of variables defined in Eq. (4.6). In this sense, we define a formally relaxed energy as
Observe that, when applying the same substitution to the one-dimensional condensed energy given in Eq. (4.7), we recover the original three-dimensional condensed energy given in Eq. (3.20).
The relations of the relaxed energies according to Fig. 5, and their derivatives with respect to the strain tensor, i.e., the corresponding relaxed stress tensor, are given with
on the domains \(Y_i\), \(i=1,\ldots ,4\) by the following expressions. Domain \(\mathrm {Y_1}\):
Domain \(\mathrm {Y_2}\):
Domain \(\mathrm {Y_3}\):
Domain \(\mathrm {Y_4}\):
7 Numerical results
7.1 The one-dimensional model problem
We make use of the dimensionless formulation introduced in Eq. (4.6). The function \(r(y_1)\) is selected to fit the yield surface of a modified Drucker–Prager model employing a piecewise quadratic function given by
where \( y_\textrm{min} = -0.058 \), \(y_\textrm{max} = 0.00107 \), \( y_0 = -0.0385\) and \(r_\textrm{max} = 0.016 \), see Fig. 6. We choose a hardening parameter \(b=0.095\) to insure that we are within the regime of small values of b treated in Sect. 5.
We model a one-dimensional specimen of length \(L=1\), represented by a domain \(\Omega = (0,1)\), employing a discretization using \(n=80\) standard linear finite elements. The specimen is fixed at the left-hand side, \(u(0)=v(0)=0\), and subjected to given displacements \(u(L)=u_\textrm{ext}\) and \(v(L)=v_\textrm{ext}\) in x- and y-direction, respectively, at the right-hand side. The affine solution is given by \(x\mapsto (u(x), v(x))=x(u_\textrm{ext}, v_\textrm{ext})\) with constant derivative \((u_{\textrm{ext}},v_{\textrm{ext}})\). We fix \(u_{\textrm{ext}}\) throughout our calculations and vary \(v_{\textrm{ext}}\) exploring four different regions in the phase diagram Fig. 5 which correspond to the four panels in Fig. 7: \((u_{\textrm{ext}}, v_{\textrm{ext}})\) in (a) \(Y_1\), (b) \(Y_2\cap \{y_2\le r(y_1)\}\), (c) \(Y_2\setminus \{y_2\le r(y_1)\}\) and (d) \(Y_3\), respectively.
The total energies are then functions of the vectors of nodal displacements \(\textbf{u}=\{u_1,\ldots ,u_{n+1}\}\), \(\textbf{v}=\{v_1,\ldots ,v_{n+1}\}\) with \(u_1 = v_1 = 0\) and \(u_{n+1} = u_\textrm{ext}\), \(v_{n+1} = v_\textrm{ext}\), given as
for the condensed and the relaxed energy, respectively. The minimization is performed using the Mathematica function FindMinimum, which employs a gradient-based local search. Note that, for the relaxed energy, the minimum is obtained for the affine displacements \(u_i=\frac{i-1}{n} u_\textrm{ext}\), \(v_i=\frac{i-1}{n} v_\textrm{ext}\).
We add a random perturbation of amplitude \(\alpha \) to the exact solution given above as an initial guess for the minimization procedure. Hence, let \(\textrm{rand}(i)\) be a random function possessing a uniform distribution within the interval \([-1,1]\). Then, the initial guess is for \(i=2,\ldots ,n\) given by
The results are depicted in Fig. 7. The numerical simulations reflect the predictions based on the phase diagram in Fig. 5. We expect the calculations employing the condensed energy to return microstructures consisting of up to three distinct points in \(y_1\)-\(y_2\)-space. These microstructures will approximate the actual minimum more or less successfully depending on the quality of the initial guess. This behavior is precisely reflected by the points in red color corresponding to the condensed energy in Fig. 7. The calculations employing the relaxed energy, on the other hand, are expected to return the actual minimum up to round-off errors, which is precisely what is found. Due to the fact that the graph of the relaxed energy contains affine (flat) areas, the variational problem with the relaxed energy does have a unique affine minimizer, the displacement gradients are not expected to take only values in the local minima of the nonconvex free energy leading to the formation of microstructure, and can in fact take any value on the affine parts of the energy. The only restriction is the global restriction, that the total energy of the displacement is equal to the minimum. This behavior is reflected once again by the points in green color corresponding to the relaxed energy in Fig. 7. In the following, we comment in Fig. 7 in detail.
In case (a) with \(v_\textrm{ext}\in [0,0.0036]\), both the relaxed and the condensed energy are convex with \(f=f_{\textrm{c}}\) and the corresponding minimizers are affine with gradient \((u_{\textrm{ext}},v_{\textrm{ext}})\). For larger values of \(v_{\textrm{ext}}\in [0.0036, 0.014]\), the function f is locally convex but different from its relaxation \(f_{\textrm{c}}\) which is obtained by a mixture of two states and which is affine on lines parallel to the boundary of the region \(Y^4\), see the red lines in the left panel in Fig. 5. Thus, the calculation with the condensed energy is expected to discover the microstructure supported on two points, while the calculation with \(f_{\textrm{c}}\) can use all the gradients on the affine part. In fact, the second panel in Fig. 7 shows exactly this behavior. The case (c) corresponds to \(v_{\textrm{ext}}\in [ 0.014, 0.063]\) and shows the same behavior of case (b), the only difference being that f is not locally convex at the point \((u_{\textrm{ext}},v_{\textrm{ext}})\). Finally, for \(v_{\textrm{ext}}\in [0.0163, 0.15]\) the condensed energy f is not locally convex and the relaxed energy \(f_{\textrm{c}}\) is affine on the triangle \(Y_3\). Therefore, the range of the gradient of the numerical solution for the calculation with the condensed energy is expected to be located in the three corners of the triangle, while the gradient of the solution with the relaxed energy can explore the full triangle. The last panel in Fig. 7 confirms this prediction.
The procedure employing the condensed energy does not recover the exact minimum, but produces values which are between 10 and 20% higher. However, the values are lower than the energy of the condensed energy evaluated at the affine solution. The approximate minimizer attempts to mimic the construction of the relaxed energy by concentrating points at the corresponding positions within the \(y_1\)–\(y_2\)-plane. The procedure employing the relaxed energy returns the exact minimum with working precision in all cases.
7.2 Three-dimensional model
The generalization to a three-dimensional model derived in Sect. 6 is tested through a two-dimensional boundary value problem, namely the square plate with a circular hole. The plate is fixed at the left-hand side and subject to displacements in compression in x-direction along the right-hand side increasing in time. Two unstructured meshes are computed, a coarse one with 1793 elements and a finer mesh with 4917 elements. The meshes are shown in Fig. 8.
Finite element computations are performed using the finite element analysis program FEAP [39]. Standard hexahedral (8-node) elements are used, they are restricted in the third direction employing plane strain conditions, which apply throughout all examples. Stresses are calculated as analytical derivatives of the energies with respect to the strains. Tangent operators are calculated as second derivatives of the energies with respect to the strains, numerically via perturbation. Equilibrium states are obtained by standard Newton–Raphson procedure.
For the computations, we use the same material parameters as in the one-dimensional case taken from the material model with bulk modulus \(K = 3.9 \,\text {GPa}\), shear modulus \(\mu = 2.8\, \text {GPa}\), internal friction angle \(\phi = 32^\circ \), cohesion \(c= 25 \,\text {MPa}\), Poisson’s ratio \(\nu = 0.25\), and tensile strength \(\sigma _\textrm{t} = 5\, \text {MPa}\), introduced in [40]. Under increasing load, the elements in the neighborhood of the hole start to plastify first. Therefore, we compare the results for the model employing the relaxed energy at the marked elements adjacent to the hole for both meshes. The values are taken for a specific integration point at these elements. We would like to stress that a comparison with the model employing the condensed energy is not possible, as the lack of convexity causes severe numerical instabilities. Therefore, only the behavior of the relaxed model can be inferred from the results below.
The final external displacement is reached employing 100 load steps. The corresponding strains switch from domain \({Y_1}\) into domain \({Y_2}\), inducing an initiation of microstructure. In the top two rows of Fig. 9, we show the stresses and plastic strains (calculated as the difference of total and elastic strains) at the elements marked in Fig. 8 as a function of the external displacements. The point marked in cyan color in Fig. 8 corresponds to the Gauss integration point closest to the vertical axis of symmetry. The point marked in red color in Fig. 8 correspond to the Gauss integration point closest to the diagonal. In order to compare global quantities as well, we display in the bottom row average stresses versus average strains, where the averages are calculated along the path marked in magenta color in Fig. 8.
The distribution of the stress \(\sigma _{yy}\) can be seen in Fig. 10. The relaxed model shows similar stress distribution for the coarse and fine mesh. Microstructure is initiating at the same positions and load step despite the different spatial discretization. Contours of the plastic strains are shown in Fig. 11 for a state shortly after microstructure initiation at \(u_\textrm{ext}= 0.00678\) mm. For comparison, the stresses at the same instant are shown in Fig. 12. It can be seen that for both meshes the results agree very well for local and global quantities. This demonstrates that the relaxed model is mesh independent and capable of capturing the material behavior in different zones of relaxation.
8 Conclusion and outlook
We established a variational model for pressure-dependent plasticity as common in models of soil mechanics. Our approach employed a novel aspect by using a characteristic function as dissipation potential. That way, it was possible to establish a dissipation distance as well as a condensed energy connected to a time-incremental setting in a consistent manner. This allowed us to investigate the initiation of microstructure via relaxation theory by calculating the quasiconvex envelope of the condensed energy. For a one-dimensional model problem, for which the quasiconvex envelope coincides with the convex envelope, we succeeded in arriving at a closed-form expression. Interestingly, the quasiconvex envelope turned out to be largely independent of the yield surface of the original model for small values of the hardening parameter b. The proof of this surprising fact illustrates that even the microstructure can be chosen independently of the yield surface. Numerical simulations confirmed the predictions of the analytical results and illustrated the qualitatively different behavior of the variational models using the condensed and the relaxed energy, respectively. In particular, the simulations employing the condensed energy succeeded in finding the necessary oscillations. It was possible to extend the one-dimensional formulation to higher dimensions in an empirical manner. Numerical simulations for a paradigmatic model problem provided strong evidence for the superior predictive power of the relaxed model.
However, problems left for future investigations are manifold. The characterization of the relaxed energy for large values of b is a challenging task since the affine curves in Fig. 5 bounding the domain with \(f=f_\textrm{c}\) intersect the graphs of the functions \(\pm r\) and become nonlinear. It is expected that the qualitative behavior for the relaxation involving at most three points in the phase diagram remains unchanged for an intermediate range of values for b and that for sufficiently large values of b two points will suffice.
From the mathematical side, the treatment of the higher-dimensional case is completely open. In fact, our heuristic model provides a guaranteed lower bound for the energy in the system but may, in general, fail to be the largest lower bound and thus may not be optimal. Concerning variational modeling, there are still problems to be solved connected to the non-associated character of pressure-dependent plasticity. Finally, the treatment of several successive time increments is still open as well as establishing fully evolutionary models.
References
Wolf, H., König, D., Triantafyllidis, T.: Experimental investigation of shear band patterns in granular material. J. Struct. Geol. 25, 1229–1240 (2003). https://doi.org/10.1016/S0191-8141(02)00163-3
Ball, J.M., James, R.D.: Fine phase mixtures as minimizers of energy. Arch. Ration. Mech. Anal. 100, 13–52 (1987). https://doi.org/10.1007/BF00281246
Ball, J.M., James, R.D.: Proposed experimental tests of a theory of fine microstructure and the two-well problem. Philos. Trans. R. Soc. Lond. A 338, 389–450 (1992)
Chipot, M., Kinderlehrer, D.: Equilibrium configurations of crystals. Arch. Ration. Mech. Anal. 103, 237–277 (1988). https://doi.org/10.1007/BF00251759
Kochmann, D.M., Hackl, K.: The evolution of laminates in finite crystal plasticity: a variational approach. Contin. Mech. Therm. 23(1), 63–85 (2011)
Hackl, K., Heinz, S., Mielke, A.: A model for the evolution of laminates in finite-strain elastoplasticity. ZAMM J. Appl. Math. Mech./Z. Angew. Math. Mech. 92(11–12), 888–909 (2012). https://doi.org/10.1002/zamm.201100155
Carstensen, C., Plecháč, P.: Numerical solution of the scalar double-well problem allowing microstructure. Math. Comput. 66, 997–1026 (1997). https://doi.org/10.1090/S0025-5718-97-00849-1
DeSimone, A., Dolzmann, G.: Macroscopic response of nematic elastomers via relaxation of a class of \(\rm {SO}(3)\)-invariant energies. Arch. Ration. Mech. Anal. 161, 181–204 (2002). https://doi.org/10.1007/s002050100174
Šilhavý, M.: Rank-1 convex hulls of isotropic functions in dimension 2 by 2. In: Proceedings of Partial Differential Equations and Applications (Olomouc, 1999), vol. 126, pp. 521–529 (2001)
Šilhavý, M.: Ideally soft nematic elastomers. Netw. Heterog. Media 2, 279–311 (2007)
Conti, S., Dolzmann, G.: Relaxation of a model energy for the cubic to tetragonal phase transformation in two dimensions. Math. Models. Methods Appl. Sci. 24, 2929–2942 (2014)
Khan, M.S., Hackl, K.: Modeling of Microstructures in a Cosserat Continuum Using Relaxed Energies, pp. 103–125. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-75940-1_6
Khan, M.S., Hackl, K.: Modeling of Microstructures in a Cosserat Continuum Using Relaxed Energies: Analytical and Numerical Aspects, pp. 57–87. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-90051-9_3
Kohn, R.V., Strang, G.: Optimal design and relaxation of variational problems. II. Commun. Pure Appl. Math. 39, 139–182 (1986). https://doi.org/10.1002/cpa.3160390202
Lurie, K.A., Cherkaev, A.V.: On a certain variational problem of phase equilibrium. In: Material Instabilities in Continuum Mechanics (Edinburgh, 1985–1986). Oxford Science Publication. Oxford University Press, New York, pp. 257–268 (1988)
Kohn, R.V.: The relaxation of a double-well energy. Contin. Mech. Thermodyn. 3, 193–236 (1991). https://doi.org/10.1007/BF01135336
Pipkin, A.C.: Elastic materials with two preferred states. Q. J. Mech. Appl. Math. 44, 1–15 (1991). https://doi.org/10.1093/qjmam/44.1.1
Bartels, S., Carstensen, C., Hackl, K., Hoppe, U.: Effective relaxation for microstructure simulations: algorithms and applications. Comput. Methods Appl. Mech. Eng. 193(48), 5143–5175 (2004). https://doi.org/10.1016/j.cma.2003.12.065
Carstensen, C., Conti, S., Orlando, A.: Mixed analytical-numerical relaxation in finite single-slip crystal plasticity. Contin. Mech. Thermodyn. 20, 275–301 (2008)
Miehe, C., Lambrecht, M.: Analysis of microstructure development in shearbands by energy relaxation of incremental stress potentials: large-strain theory for standard dissipative solids. Int. J. Numer. Methods Eng. 58, 1–41 (2003). https://doi.org/10.1002/nme.726
Miehe, C., Lambrecht, M., Gürses, E.: Analysis of material instabilities in inelastic solids by incremental energy minimization and relaxation methods: evolving deformation microstructures in finite plasticity. J. Mech. Phys. Solids 52, 2725–2769 (2004)
Conti, S., Dolzmann, G.: An adaptive relaxation algorithm for multiscale problems and application to nematic elastomers. J. Mech. Phys. Solids 113, 126–143 (2018). https://doi.org/10.1016/j.jmps.2018.02.001
Conti, S., Dolzmann, G.: Numerical study of microstructures in single-slip finite elastoplasticity. J. Optim. Theory Appl. 184(1), 43–60 (2020). https://doi.org/10.1007/s10957-018-01460-0
Conti, S., Dolzmann, G.: In: Mariano, P.M. (Ed.) Numerical Study of Microstructures in Multiwell Problems in Linear Elasticity. Adv. Mech. Math. Springer, Cham, pp. 1–29 (2021)
Bartels, S.: Linear convergence in the approximation of rank-one convex envelopes. M2AN Math. Model. Numer. Anal. 38, 811–820 (2004). https://doi.org/10.1051/m2an:2004040
Bartels, S.: Reliable and efficient approximation of polyconvex envelopes. SIAM J. Numer. Anal. 43, 363–385 (2005). https://doi.org/10.1137/S0036142903428840
Jezdan, G.: Relaxation-based modeling of inelastic materials. Ph.D. thesis, Ruhr-Universität Bochum (2023) (in preparation).
Mitchell, J.K., Soga, K., et al.: Fundamentals of Soil Behavior, vol. 3. Wiley, Hoboken (2005)
Lubliner, J.: Plasticity Theory. Dover Books on Engineering. Dover Publications, Mineola, NY (2008)
Moreau, J.J.: La notion de sur-potentiel et les liaisons unilatérales en elastostatiques. C. R. Acad. Sci. Paris Série A 267, 954–957 (1968)
Carstensen, C., Hackl, K., Mielke, A.: Non-convex potentials and microstructures in finite-strain plasticity. Proc. R. Soc. A 458, 299–317 (2002). https://doi.org/10.1098/rspa.2001.0864
Miehe, C., Schotte, J., Lambrecht, M.: Homogenization of inelastic solid materials at finite strains based on incremental minimization principles. Application to the texture analysis of polycrystals. J. Mech. Phys. Solids 50(10), 2123–2167 (2002)
Ortiz, M., Repetto, E.: Nonconvex energy minimization and dislocation structures in ductile single crystals. J. Mech. Phys. Solids 47(2), 397–462 (1999)
Hackl, K.: Generalized standard media and variational principles in classical and finite strain elastoplasticity. J. Mech. Phys. Solids 45, 667–688 (1997). https://doi.org/10.1016/S0022-5096(96)00110-X
Hackl, K., Fischer, F.D.: On the relation between the principle of maximum dissipation and inelastic evolution given by dissipation potentials. Proc. R. Soc. Lond. A 464, 117–132 (2008)
Dacorogna, B.: Direct Methods in the Calculus of Variations. Applied Mathematical Sciences, vol. 78, p. 308. Springer, Berlin (1989)
Griewank, A., Rabier, P.J.: On the smoothness of convex envelopes. Trans. Am. Math. Soc. 322(2), 691–709 (1990). https://doi.org/10.2307/2001721
Rockafellar, R.T.: Convex Analysis. Princeton Mathematical Series. Princeton University Press, Princeton (1970)
Taylor, R.L.: FEAP—Finite Element Analysis Program. University of California, Berkeley (2014). http://www.ce.berkeley/feap
Baotang Shen, J.S., Barton, N.: An approximate nonlinear modified Mohr–Coulomb shear strength criterion with critical state for intact rocks. J. Rock Mech. Geotech. Eng. 10, 645–652 (2018)
Behr, F.: Variational models in pressure dependent plasticity. Ph.D. thesis, Universität Regensburg (2023) (in preparation).
Acknowledgements
The authors gratefully acknowledge the funding by the German Research Foundation (DFG) within the Priority Program 2256 “Variational Methods for Predicting Complex Phenomena in Engineering Structures and Materials” within project 441211072/441468770 “Variational modeling of pressure-dependent plasticity - a paradigm for model reduction via relaxation.”
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests
The authors declare that there are no competing interests.
Additional information
Communicated by Andreas Öchsner.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix A Convex analysis
The construction of \(f_\textrm{c}\) in Theorem 2 is based on the construction of a candidate \(f_\textrm{a}\) and a decomposition of \(\mathbb {R}^2\) into disjoint regions with the property that \(f_\textrm{a}\) is globally continuous and convex on the interior of the different regions. This appendix collects the results that are needed in order to prove that \(f_\textrm{a}\) is convex on \(\mathbb {R}^2\). The proofs follow from classical results in convex analysis, see [41] for more information.
Definition 5
Suppose that \(\Omega \subset \mathbb {R}^d\) is open and that \(f:\Omega \rightarrow \mathbb {R}\). Then, f is said to be locally convex, if for all \(x\in \Omega \) there exists an \(r_x>0\) with \(B(x,r_x)\subset \Omega \) and f convex on \(B(x,r_x)\). A vector \(v\in \mathbb {R}^d\) is called a local subgradient for f at \(x\in \Omega \) if there exists an \(r_x>0\) with \(B(x, r_x)\subset \Omega \) and if for all \(y\in B(x, r_x)\) the inequality \(f(y) \ge f(x) + \langle v, y-x \rangle \) holds. The set of all local subgradients is denoted by \(\partial _\textrm{loc}f(x)\) and referred to as the local subdifferential.
The subdifferential in the sense of convex analysis is denoted by \(\partial f(x)\). Then, by definition, \(\partial f(x)\subset \partial _\textrm{loc}f(x)\) and if f is differentiable, then \(\partial _\textrm{loc}f(x) \subset \{ f'(x) \}\); however, the local subdifferential may be empty if f the graph of f does not lie above the tangent plane in a sufficiently small neighborhood of x.
The construction of the convex envelope of the condensed energy relies on a local construction and the verification that the local construction leads to a convex function. The proof uses the following results.
Lemma 6
Fix a, \(b\in \mathbb {R}\), \(a<0<b\), and \(f_a\in C^1([a,0];\mathbb {R})\), \(f_b\in C^1([0,b];\mathbb {R})\) convex with \(f_a(0)=f_b(0)\), \(f_a'(0)=f_b'(0)\). Then, \(f=f_a \chi _{[a,0)} + f_b\chi _{[0,b]}\in C^1([a,b];\mathbb {R})\) is convex.
Lemma 7
Suppose that \(f\in C(\mathbb {R}^d)\), B, \(C\subset \mathbb {R}^d\) open and disjoint, C convex and f locally convex on \(B\cup C\), \(f\in C^1(\text {int}(B\cup \overline{C}))\). Then, f is locally convex on \(\text {int}(B\cup \overline{C})\).
Lemma 8
Suppose that \(d\ge 2\), \(\Omega \subset \mathbb {R}^d\) is open and \(f:\Omega \rightarrow \mathbb {R}\) is continuous. The following statements are equivalent:
-
(i)
f is locally convex in \(x\in \Omega \);
-
(ii)
for \(x\in \Omega \) there exists an \(r_x>0\) with \(B(x, r_x)\subset \Omega \) such that \(\partial _\textrm{loc}f(y)\ne \emptyset \) for all \(y\in B(x, r_x)\).
Lemma 9
Suppose that \(a_i\), \(b_i\in \mathbb {R}\), \(i=1,2\) with \(a_1<b_1<a_2<b_2\) and that \(f\in C^0([a_1, b_2])\) with \(f\big \vert _{[a_1,a_2]}\), \(f\big \vert _{[b_1,b_2]}\) convex. Then, f is convex.
Lemma 10
Suppose that \(\Omega \subset \mathbb {R}^n\) is open and that \(f\in C(\Omega )\) is locally convex. Then, for any pair a, \(b\in \Omega \) with \(\{\lambda a + (1-\lambda )b:\lambda \in [0,1]\}\subset \Omega \), the convexity inequality holds on [a, b]. In particular, if \(\Omega \) is convex, then f is convex on \(\Omega \).
Lemma 11
Suppose \(\Omega \subset \mathbb {R}^d\) is convex and \(f\in C(\mathbb {R}^d)\). If \(f\big \vert _\Omega \) is convex and if there is a convex function \(g:\mathbb {R}^d\rightarrow \mathbb {R}\) with \(g\le f\) on \(\mathbb {R}^d\) and \(g= f\) on \(\mathbb {R}^d\setminus \Omega \), then f is convex.
Appendix B Proof of Theorem 2
Based on the discussion at the beginning of Sect. 5, for the construction of the convex envelope of the condensed energy \(f^{(r)}\), we first derive pairs and triples of points \(y^{(1)},\ldots ,y^{(q)}\in \mathbb {R}^2\) (\(q= 2,3\)), in which \(f^{(r)}\) is differentiable in the variable \(y_2\) and in which \(\partial _2 f^{(r)}\) is equal. Then, we derive a candidate for the direction \(v\in \mathbb {R}^d\) satisfying the equation in Eq. (5.8). However, at this point we do not know whether this v in Eq. (5.8) is indeed a subgradient of f in all the points \(y^{(1)},\ldots , y^{(q)}\). For those pairs and triples for which we expect this to be true, we do not show explicitly that the first part in Eq. (5.8) is satisfied, rather we apply the construction described in Eqs. (5.9) and (5.10) and prove in Theorem 2 that this construction leads to the convex envelope. It is a remarkable result of the subsequent analysis that the explicit formulas for the relaxed energy show a very different behavior with respect to the dependence on the dissipation function r. In fact, for small values of b relative to r, i.e., for
which is the case of relevance in soil mechanics, the relaxed energy is in fact independent of r. Therefore, we restrict the analysis in this paper to the case of small b, the general case will be treated in [41]. In fact, this assumption implies \(r_0\le r\) by concavity of r. In the following we consider r as fixed, write \(f= f^{(r)}\) and assume without loss of generality that \(r_0<r\) on \((y_{\textrm{min}},y_{\textrm{max}})\).
In order to implement the strategy just outlined, we construct a candidate \(f_\textrm{a}\) for \(f_\textrm{c}\). In a first step, we search for points with common supporting planes in \(Y_0\) and \(\widetilde{Y}\), see Fig. 3. Assume thus that \((y,\widetilde{y})\in Y_0\times \widetilde{Y}\) satisfy Eq. (5.8). Since f is differentiable in y, \(v= \nabla f(y)\) is the only possible choice for v in Eq. (5.8). If \(\widetilde{y}\in \text {int}(\widetilde{Y})\), then f is differentiable in \(\widetilde{y}\) and \(\nabla f(y)= v\in \partial f(\widetilde{y})\subset \{\nabla f(\widetilde{y})\}\) implies \(\widetilde{y}_1= \partial _1 f(\widetilde{y})= \partial _1 f(y)= y_1\), a contradiction. Consequently we assume that \(\widetilde{y}\in \partial \widetilde{Y}\), that is, \(\widetilde{y}_1\in \{y_{\textrm{min}}, y_{\textrm{max}}\}\). The existence of \(\partial _2 f(\widetilde{y})\) implies \(\partial _2 f(\widetilde{y})= v_2= \partial _2 f(y)\) and equivalently
Equation (5.8) is equivalent to \(f(\widetilde{y}) = (T_y f)(\widetilde{y})\). By definition of f and since \(f\vert _{Y_0}\) is a quadratic function, these conditions lead to
In view of \(\nabla ^2 f(y) = \text {Id}\), the algebraic condition can be simplified to
One defines \(s= \vert \widetilde{y}_1-y_1\vert \) and substitutes the relation between \(\widetilde{y}_2\) and \(y_2\) to obtain \(y_2 = \pm \sqrt{b} s\). Since \(\widetilde{y}_1\in \{y_{\textrm{min}}, y_{\textrm{max}}\}\), one obtains four one-parameter families of pairs of points,
where s can take all values in \((y_{\textrm{min}},y_{\textrm{max}})\) with \(\alpha _{\textrm{min}}^\pm (s)\in Y_0\) (or \(\alpha _{\textrm{max}}^\pm (s)\in Y_0\)).
For the construction of the convex envelope only the values \(0< s< (y_{\textrm{max}}- y_{\textrm{min}})/2=: s_*\) will be relevant, since the curves \(\alpha _{\textrm{min}}^\pm \) and \(\alpha _{\textrm{max}}^\pm \) intersect in the point
For each \(y\in \mathbb {R}^2\), which lies on a connecting straight line of two corresponding points of the curves \(\alpha _{\textrm{min}}^\pm \) and \(\beta _{\textrm{min}}^\pm \), i.e., \(y= t\cdot \alpha _{\textrm{min}}^\pm (s)+ (1-t)\cdot \beta _{\textrm{min}}^\pm (s)\) with \(s\in (0,s_*)\) and \(t\in [0,1]\), we define \(f_\textrm{a}(y)= t\cdot f(\alpha _{\textrm{min}}^\pm (s))+ (1-t)\cdot f(\beta _{\textrm{min}}^\pm (s))\) according to Eq. (5.9) and we use the same construction for \(\alpha _{\textrm{max}}^\pm \) and \(\beta _{\textrm{max}}^\pm \). Note that \(f_\textrm{a}\) is affine along these connecting lines and the area, which is covered by those lines consists of four triangular shaped sets denoted by \(Y_2\) in Fig. 5. The fact that \(\alpha _{\textrm{min}}^\pm \) and \(\alpha _{\textrm{max}}^\pm \) intersect in \(y_*^\pm \) means that the tangential plane of f in \(y_*^\pm \) touches the graph in two other points, namely
Since the pairs \((\alpha _{\textrm{min}}^\pm (s),\beta _{\textrm{min}}^\pm (s))\) and \((\alpha _{\textrm{max}}^\pm (s),\beta _{\textrm{max}}^\pm (s))\) satisfy the equality in Eq. (5.8) for all \(s\in (0,s_*)\), the triples \((y_*^\pm ,y_{*\textrm{min}}^\pm ,y_{*\textrm{max}}^\pm )\) also satisfy Eq. (5.8). Therefore, we define \(f_\textrm{a}\) for y in one of the triangles \(\text {Conv}(\{y_*^+, y_{*\textrm{min}}^+, y_{*\textrm{max}}^+\})\cup \text {Conv}(\{y_*^-, y_{*\textrm{min}}^-, y_{*\textrm{max}}^-\})\) (\(Y_3\) in Fig. 5) according to Eq. (5.10) as the unique affine function which coincides with f in the vertices of each of these triangles,
Finally, consider a pair of two different points \((\widetilde{y}^{(1)}, \widetilde{y}^{(2)})\in \partial \widetilde{Y}\) satisfying Eq. (5.8). For the existence of a supporting hyperplane of f, which touches the graph of f in \(\widetilde{y}^{(1)}\) and \(\widetilde{y}^{(2)}\), we require equal partial derivatives in the second component, i.e.,
With \(\widetilde{y}^{(1)}_1, \widetilde{y}^{(2)}_1\in \{y_{\textrm{min}},y_{\textrm{max}}\}\) and \(\widetilde{y}^{(1)}\ne \widetilde{y}^{(2)}\) we get \(\widetilde{y}^{(1)}= y_{\textrm{min}}\) and
\(\widetilde{y}^{(2)}_1= y_{\textrm{max}}\) and obtain for \(s> 0\) the pairs of curves
Since we already constructed our candidate \(f_\textrm{a}\) for the convex envelope on the set \((y_{\textrm{min}},y_{\textrm{max}})\times [- \frac{b+1}{\sqrt{b}} s_*, \frac{b+1}{\sqrt{b}} s_*]\), we restrict s to have values \(s> \frac{b+1}{\sqrt{b}} s_*\). For each \(y\in \mathbb {R}^2\), which lies on a connecting straight line of two corresponding points of the curves \(\alpha _\infty ^\pm \) and \(\beta _\infty ^\pm \), i.e., \(y= t\cdot \alpha _\infty ^\pm (s)+ (1-t)\cdot \beta _\infty ^\pm (s)\) with \(s\in (\frac{b+1}{\sqrt{b}} s_*,\infty )\) and \(t\in [0,1]\) (represented by the two unbounded sets forming \(Y_4\) in Fig. 5), we define \(f_\textrm{a}(y)= t\cdot f(\alpha _\infty ^\pm (s))+ (1-t)\cdot f(\beta _\infty ^\pm (s))\) according to Eq. (5.9).
After these preparations, we turn to the proof of the theorem and divide it into several steps. Denote by \(f_\textrm{a}\) the formula on the right-hand side in the assertion of the theorem in (5.15). We are going to show in Step 1 that \(f_\textrm{a}\) is differentiable on \(\mathbb {R}^2{\setminus } (\{y_{\textrm{min}},y_{\textrm{max}}\}\times \mathbb {R})\) and convex. In Step 2 we verify that \(f_\textrm{a}\) is indeed the expression we obtain from the previously described construction and \(f_\textrm{c}\le f_\textrm{a}\). Finally in Step 3 we prove \(f_\textrm{a}\le f\) and conclude that \(f_\textrm{a}\) coincides with \(f_\textrm{c}\) by \(f_\textrm{c}\le f_\textrm{a}\le f\) and \(f_\textrm{a}\) being convex.
Step 1: Differentiability and convexity of \(f_\textrm{a}\). The differentiability of \(f_\textrm{a}\) on \(\text {int}(\widetilde{Y})\cup (y_{\textrm{min}},y_{\textrm{max}})\times \mathbb {R}\) follows from
and verifying the continuity of \(\nabla f_\textrm{a}\) along the boundaries of the different regions. With the notation \(e_1\odot e_2:= \frac{1}{2} (e_1\otimes e_2+ e_2\otimes e_1)\), the second derivatives are given by
In fact, \(\nabla ^2 f_\textrm{a}\) is positively semidefinite on the interior of each domain \(\widetilde{Y}, Y_1, Y_2, Y_3, Y_4\), consequently \(f_\textrm{a}\) is locally convex on those domains. Since \(f_\textrm{a}\) is differentiable on \((y_{\textrm{min}},y_{\textrm{max}})\times \mathbb {R}\), an iterative application of Lemma 7 gives us the local convexity (and hence the convexity by Lemma 10) of \(f_\textrm{a}\) on \((y_{\textrm{min}},y_{\textrm{max}})\times \mathbb {R}\). A short calculation shows that \(g: \mathbb {R}^2\rightarrow \mathbb {R},\ g(y)= \frac{1}{2}y_1^2+ \frac{1}{2}\frac{b}{b+1}y_2^2\) is a convex lower bound of \(f_\textrm{a}\), and since \(f_\textrm{a}\) and g coincide on \(\widetilde{Y}\) and \(f_\textrm{a}\) is convex on \(\mathbb {R}{\setminus } \widetilde{Y}= (y_{\textrm{min}},y_{\textrm{max}})\times \mathbb {R}\), the convexity of \(f_\textrm{a}\) follows by Lemma 11.
Step 2: \(f_\textrm{c}\le f_\textrm{a}\). Since the calculation of the explicit formula of the candidate \(f_\textrm{a}\) derived from the previously described construction is extensive, we just prove that the given formula is the one, which is obtained by this construction. Refer to Fig. 5 for a sketch of the various domains.
On \(\widetilde{Y}\cup Y_1\): Since by assumption \(r_0\le r\) and hence \(Y_1\subset Y_0\), \(f_\textrm{a}\) coincides with f in \(Y_1\) and in \(\widetilde{Y}\) and \(f_\textrm{c}\le f= f_\textrm{a}\), as asserted.
On \(Y_2\): The set \(Y_2\) is exactly the set of points, which are covered by the straight line segments connecting the corresponding pairs of points on the four pairs of curves \((\alpha _{\textrm{min}}^\pm , \beta _{\textrm{min}}^\pm )\) and \((\alpha _{\textrm{max}}^\pm ,\beta _{\textrm{max}}^\pm )\). Assume \(y\in Y_2\) with \(y_1< y_{\textrm{mid}}\) and \(y_2 > 0\) (the other cases are analogous). There exists \(s\in (0,(y_{\textrm{max}}- y_{\textrm{min}})/2)\) and \(t\in (0,1)\) with \(y= t\cdot \alpha _{\textrm{min}}^+(s)+ (1-t)\cdot \beta _{\textrm{min}}^+(s)\). To show that the given expression for \(f_\textrm{a}(y)\) results from the construction described above, we notice that \(f_\textrm{a}(\alpha _{\textrm{min}}^+(s))= f(\alpha _{\textrm{min}}^+(s))\) and \(f_\textrm{a}(\beta _{\textrm{min}}^+(s))= f(\beta _{\textrm{min}}^+(s))\) and that \(f_\textrm{a}\) is affine along the connecting line between \(\alpha _{\textrm{min}}^+(s)\) and \(\beta _{\textrm{min}}^+(s)\). This can be seen by \(\beta _{\textrm{min}}^+(s)- \alpha _{\textrm{min}}^+(s)=(s-y_{\textrm{min}})\cdot (-1,\sqrt{b}^{-1})^T\) and using \(y_2\), \(y_{\textrm{mid}}- y_1> 0\) to obtain
Since \(f_\textrm{c}\) must lie below the linear interpolation, we obtain \(f_\textrm{c}\le f_\textrm{a}\).
On \(Y_3\): By construction, \(f_\textrm{a}\) and f have the same values in \(y_*^\pm , y_{*\textrm{min}}^\pm , y_{*\textrm{max}}^\pm \) and on \(\text {Conv}(y_*^+, y_{*\textrm{min}}^+, y_{*\textrm{max}}^+)\) and \(\text {Conv}(y_*^-, y_{*\textrm{min}}^-, y_{*\textrm{max}}^-)\) the function \(f_\textrm{a}\) is affine due to \(\nabla ^2 f_\textrm{a}= 0\) on \(\text {int}(Y_3)\). By \(y_*^\pm \in Y_0\), f is differentiable in \(y_*^\pm \) with \(\nabla f(y_*^\pm )= \nabla f_\textrm{a}(y_*^\pm )\). Now it follows in view of Eq. (B10) that \(f_\textrm{a}\) is the above constructed function.
On \(Y_4\): Similar to the argument on \(Y_2\), we recognize that for any \(y\in Y_4\) with \(\pm y_2> 0\) there is an \(s\in (0,\frac{b+1}{\sqrt{b}}(y_{\textrm{max}}- y_{\textrm{min}})/2)\) and \(t\in (0,1)\) with \(y= t\cdot \alpha ^\pm (s)+ (1-t)\cdot \beta ^\pm (s)\). We have \(f_\textrm{a}(\alpha _\infty ^\pm (s))= f(\alpha _\infty ^\pm (s))\) and \(f_\textrm{a}(\beta _\infty ^\pm (s))= f(\beta _\infty ^\pm (s))\), while \(f_\textrm{a}\) is affine along the connecting line between \(\alpha _\infty ^\pm (s)\) and \(\beta _\infty ^\pm (s)\) since \(\beta _\infty ^\pm (s)- \alpha _\infty ^\pm (s)= (y_{\textrm{max}}-y_{\textrm{min}},0)\) and in the formula of \(f_\textrm{a}\) on \(Y_4\) the quadratic term in \(y_1\) cancels out. Finally, we get \(f_\textrm{c}\le f_\textrm{a}\) since any value of \(f_\textrm{a}\) is a convex combination of two or three function values of f.
Step 3: The inequality \(f_\textrm{a}\le f\). We show \(f_\textrm{a}\le f^{(r_0)}\le f\), where the second inequality is an immediate consequence of Lemma 1. For the first inequality, recognize that \(f_\textrm{a}= f^{(r_0)}\) holds on \(\widetilde{Y}\cup Y_1\cup Y_2\) and the inequality only has to be shown on \(Y_3\) and \(Y_4\).
On \(Y_3\), the estimate follows by
On \(Y_4\), we can calculate
For \(y_1\in (y_{\textrm{min}},y_{\textrm{mid}}]\), we have
and for \(y_1\in (y_{\textrm{mid}},y_{\textrm{max}})\) we have
which gives us the desired estimate on \(Y_4\).
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Behr, F., Dolzmann, G., Hackl, K. et al. Analytical and numerical relaxation results for models in soil mechanics. Continuum Mech. Thermodyn. 35, 2019–2041 (2023). https://doi.org/10.1007/s00161-023-01225-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00161-023-01225-9