Abstract
In this article we study quantitative rigidity properties for the compatible and incompatible two-state problems for suitable classes of \(\mathcal{A}\)-free differential inclusions and for a singularly perturbed \(T_{3}\) structure for the divergence operator. In particular, in the compatible setting of the two-state problem we prove that all homogeneous, first order, linear operators with affine boundary data which enforce oscillations yield the typical \(\epsilon ^{\frac{2}{3}}\)-lower scaling bounds. As observed in Chan and Conti (Math. Models Methods Appl. Sci. 25(06):1091–1124, 2015) for higher order operators this may no longer be the case. Revisiting the example from Chan and Conti (Math. Models Methods Appl. Sci. 25(06):1091–1124, 2015), we show that this is reflected in the structure of the associated symbols and that this can be exploited for a new Fourier based proof of the lower scaling bound. Moreover, building on Rüland and Tribuzio (Arch. Ration. Mech. Anal. 243(1):401–431, 2022); Garroni and Nesi (Proc. R. Soc. Lond., Ser. A, Math. Phys. Eng. Sci. 460(2046):1789–1806, 2004, https://doi.org/10.1098/rspa.2003.1249); Palombaro and Ponsiglione (Asymptot. Anal. 40(1):37–49, 2004), we discuss the scaling behavior of a \(T_{3}\) structure for the divergence operator. We prove that as in Rüland and Tribuzio (Arch. Ration. Mech. Anal. 243(1):401–431, 2022) this yields a non-algebraic scaling law.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Rigidity and flexibility properties associated with (nonlinear) differential inclusions for the gradient have been objects of intensive study. They arise in a variety of applications, including the analysis of PDEs, e.g. the study of regularity of elliptic systems [5–7], fluid dynamics [8, 9], geometry [10–13] and various settings in the materials sciences, e.g. the study of patterns in shape-memory alloys [14–19]. Motivated by applications of microstructures in composites [20, 21], optimal design problems [22–24] and micromagnetics [25, 26] as well as by recent developments on more general differential inclusion problems [27–29], in this article we study two instances of quantitative rigidity and flexibility properties of differential inclusions for more general operators. On the one hand, we consider constant-coefficient, homogeneous, linear differential operators for which we discuss quantitative versions of the compatible and incompatible two-state problems. On the other hand, we investigate quantitative properties of a \(T_{3}\) structure for the divergence operator.
1.1 On Quantitative Results for the Two-Well Problem for \(\mathcal{A}\)-Free Differential Inclusions
\(\mathcal{A}\)-free differential inclusions arise in many different settings, including linearized elasticity [30, 31], liquid crystal elastomers [32, 33] and the study of the Aviles-Giga functional [34, 35] to name just a few examples. They have been systematically investigated in the context of compensated compactness theory in classical works such as [36–39] but also in more recent literature on compensated compactness theory [40–46], in truncation results [47], in classical minimization and regularity questions in the calculus of variations [48–51] and in the context of fine properties of such operators in borderline spaces [27, 52–54]. In the recent articles [27, 28] general \(\mathcal{A}\)-free versions of the incompatible two-well problem in borderline spaces and the study of \(T_{N}\) structures have been initiated. Motivated by these applications, in the first part of this article, we seek to study quantitative versions of the compatible and incompatible two-state problems.
Before turning to the setting of general \(\mathcal{A}\)-free differential inclusions, let us recall the analogous “classical” setting for the gradient: Inspired by problems from materials science and phase transformations, the exact and approximate rigidity properties of differential inclusion problems for the gradient [14] (see also [18, 55–57]) with and without gauge invariance have been considered. For two energy wells without gauge invariances this amounts to the study of the differential inclusion
for \(A,B \in \mathbb{R}^{d\times d}\), \(A \neq B\) and \(\Omega \subset \mathbb{R}^{d}\) a bounded Lipschitz domain. It is well-known that depending on the compatibility of the wells, a dichotomy arises:
-
On the one hand, for incompatible wells, i.e. if \(B-A \in \mathbb{R}^{d \times d}\) is not a rank-one matrix, the differential inclusion (1) is rigid both for exact and approximate solutions: Indeed, if \(A\), \(B\) are incompatible, (1) only permits solutions with constant deformation gradient, which is in the following referred to as the rigidity of the two-state problem for exact solutions. Moreover, in this setting one has that for sequences \(\nabla u_{k}\) with \(\operatorname {dist}(\nabla u_{k}, \{A,B\}) \rightarrow 0\) in measure, it necessarily holds that along a subsequence \(\nabla u_{k} \rightarrow A\) or \(\nabla u_{k} \rightarrow B\) in measure. We will refer to this as rigidity of the two-state problem for approximate solutions.
We remark that various far-reaching generalizations of these results have been obtained: For settings with \(SO(d)\) symmetries a quantitative version of such a result was deduced in [58]; a further proof was found in [59]. The one-state problem with continuous symmetry group was studied in [60, 61].
The article [27] investigates a similar problem for two incompatible wells for a general constant coefficient, homogeneous, linear differential operator \(\mathcal{A}(D)\), providing qualitative rigidity results for the associated exact and approximate differential inclusions, including \(L^{1}\)-based frameworks.
-
On the other hand, if the wells are compatible, i.e. if \(B-A \in \mathbb{R}^{d \times d}\) is a rank-one matrix, then simple laminate solutions of (1) exist, in which the deformation gradient oscillates between the two fixed values \(A\), \(B\) and is a one-dimensional function depending only on the direction determined by the difference \(B-A\). Due to the failure of rigidity on the exact level, also rigidity on the approximate level cannot be expected without additional regularization terms.
In the first part of this article we seek to consider quantitative, \(L^{2}\)-based variants of these type of results for more general, linear differential operators \(\mathcal{A}(D)\). In this context, we will consider the following two guiding questions:
-
Quantitative incompatible rigidity. For a constant coefficient, homogeneous, linear differential operator \(\mathcal{A}(D)\) and two incompatible wells, i.e. \(A,B \in \mathbb{R}^{n}\) such that \(B-A \notin \Lambda _{\mathcal{A}}\), cf. (5) for the definition of the wave cone, do we have a quantitative rigidity result in terms of domain scaling for prescribed boundary data which are a convex combination of the two states? Here the dimension \(n\) depends on the operator \(\mathcal{A}(D)\); for \(\mathcal{A}(D) = \text{curl}\) (which corresponds to the gradient setting from (1) above) we would for instance consider \(n = d \times d\).
More precisely, we seek to study the following question: Let \(A,B \in \mathbb{R}^{n}\) be such that \(B-A \notin \Lambda _{\mathcal{A}}\). Is it true that
$$\begin{aligned} E_{el}(u,\chi )= \int _{\Omega} |u - \chi _{A} A - \chi _{B} B|^{2} dx \geq C(A,B,\lambda )|\Omega |, \end{aligned}$$if \(u: \mathbb{R}^{d} \rightarrow \mathbb{R}^{n}\), \(\mathcal{A}(D)u =0\) in \(\mathbb{R}^{d}\), \(\chi := A \chi _{A} + B \chi _{B} \in \{A,B\}\) in \(\Omega \) and \(\chi _{A}, \chi _{B} \in \{0,1\}\) with \(\chi _{A}+\chi _{B} =1\) and if for some \(\lambda \in (0,1)\) we have that \(u=F_{\lambda} := \lambda A + (1-\lambda ) B \) in \(\mathbb{R}^{d}\setminus \overline{\Omega}\)?
-
Quantitative compatible rigidity. Let us next consider two compatible wells \(A,B \in \mathbb{R}^{n}\), i.e. let \(A,B \in \mathbb{R}^{n}\) be such that \(B-A \in \Lambda _{\mathcal{A}}\), see (5) below, and let us again consider boundary data \(F_{\lambda}:= \lambda A + (1-\lambda ) B\) for some \(\lambda \in (0,1)\) as above and with the set of admissible functions given by \(\mathcal{D}_{F_{\lambda}}\) in (10). For a singularly perturbed energy similarly as in (15) is it true that as in [62, 63] also in the setting of a more general constant coefficient, homogeneous, linear differential operator \(\mathcal{A}(D)\) the following bound holds
$$\begin{aligned} \inf _{\chi \in BV(\Omega ;\{A,B\})}\inf _{u\in \mathcal{D}_{F_{\lambda}}} (E_{el}(u,\chi ) + \epsilon \int _{ \Omega}|\nabla \chi |) \geq C \epsilon ^{2/3}? \end{aligned}$$
In what follows, we will formulate the set-up, the relevant operator classes and our results on these questions.
1.2 Formulation of the Two-State Problem for \(\mathcal{A}\)-Free Operators in Bounded Domains
Following [27, 28, 48], we consider a particular class of linear, homogeneous, constant-coefficient operators. The operator \(\mathcal{A}(D): C^{\infty}(\mathbb{R}^{d};\mathbb{R}^{n}) \to C^{ \infty}(\mathbb{R}^{d};\mathbb{R}^{m})\) of order \(k \in \mathbb{N}\) is given in the form
where \(\alpha \in \mathbb{N}^{d}\) denotes a multi-index of length \(|\alpha |:=\sum _{j=1}^{d} \alpha _{j}\) and \(A_{\alpha} \in \mathbb{R}^{m \times n}\) are constant matrices. Seeking to study microstructure, in the sequel we are particularly interested in non-elliptic operators. Here the operator \(\mathcal{A}(D)\) is said to be elliptic if its symbol
is injective for all \(\xi \neq 0\). If \(\mathcal{A}(D)\) is not elliptic, there exist vectors \(\xi \in \mathbb{R}^{d} \setminus \{0\}\) and \(\mu \in \mathbb{R}^{n} \setminus \{0\}\) such that
The collection of these vectors \(\mu \in \mathbb{R}^{n} \setminus \{0\}\) forms the wave cone associated with the operator \(\mathcal{A}(D)\):
The relevance of the wave cone \(\Lambda _{\mathcal{A}}\) for compensated compactness and the existence of microstructure is well-known. For instance, for any pair \((\mu ,\xi )\) as in (4) it is possible to obtain \(\mathcal{A}\)-free simple laminate solutions. These are one-dimensional functions \(u(x):=\mu h(x \cdot \xi )\), where \(h: \mathbb{R} \rightarrow \{0,1\}\), which obey the differential constraint \(\mathcal{A}(D) u = 0\) due to the choice of \(\mu \), \(\xi \) and \(u \in \{0,\mu \}\). More generally, if \(k=1\), and for \(\mu \in \Lambda _{\mathcal{A}}\) (see, for instance, [64]) it holds that
is a solution to the differential equation \(\mathcal{A}(D) u = 0\) for vectors \(\xi _{1}, \dots , \xi _{\ell} \in \mathbb{R}^{d}\setminus \{0\}\) forming a basis of the vectorspace
For the row-wise curl operator (\(n=d \times d\)) this is an at most one-dimensional space, while for the row-wise divergence operator (\(n = m \times d\)), it is a space of possibly higher dimension as \(V_{\operatorname {div},\mu} = \ker \mu \), leading to substantially more flexible solutions of the associated differential inclusions than for the curl.
In order to study microstructures arising as solutions to the two-state problem, for the above specified class of operators, analogously as in the gradient setting, we consider the following \(\mathcal{A}\)-free differential inclusion with prescribed boundary values:
with \(\mathcal{K} \subset \mathbb{R}^{n}\), and \(\Omega \subset \mathbb{R}^{d}\) an open, bounded, simply connected domain, with appropriately prescribed boundary data. For the two-state problem we consider \(\mathcal{K} = \{A,B\} \subset \mathbb{R}^{n}\).
Now, in analogy to the gradient setting, on the one hand, we call the differential inclusion for the two-state problem (7) incompatible if it is elliptic in the sense that \(B-A \notin \Lambda _{\mathcal{A}}\). In this case it is proved in [27] that both the exact and approximate differential inclusion (7) are rigid. We emphasize that incompatibility in particular excludes the presence of simple laminates. On the other hand, the differential inclusion (7) is said to be compatible if \(B-A \in \Lambda _{\mathcal{A}}\). In this case, also in the setting of more general operators, a particular class of solutions to (7) consists of (generalized) simple laminates. Moreover, in the compatible setting, we further distinguish a particular case: We consider the subspace
This is the space of values that are (algebraically) unconstrained by \(\mathcal{A}(D)\), meaning that for all \(u \in L^{2}(\mathbb{R}^{d}; I_{\mathcal{A}})\), we have that \(\mathcal{A}(D)u = 0\) without taking any regularity constraints on \(u\). In the case of \(I_{\mathcal{A}} = \{0\}\), the operator \(\mathcal{A}(D)\) belongs to the class of cocanceling operators, introduced in [65].
We seek to study both settings and the resulting microstructures quantitatively in the spirit of scaling results as, for instance, in the following non-exhaustive list involving different physical applications [2, 22, 62, 63, 66–77]. To this end, for \(\Omega \subset \mathbb{R}^{d}\) an open, bounded, Lipschitz set, we introduce elastic and surface energies and consider their minimization for prescribed, not globally compatible boundary data \(F_{\lambda }= \lambda A + (1-\lambda )B\) for some \(\lambda \in (0,1)\), where again the set of states is given by \(\mathcal{K} = \{A,B\}\).
Motivated by the applications from materials science, we study the following “elastic energy”
which we minimize in the following admissible class of deformations
For ease of notation, here and in what follows, we often use the convention that \(\chi := \chi _{A} A + \chi _{B} B\) with \(\chi _{A}, \chi _{B} \in L^{2}(\Omega ;\{0,1\})\) and \(\chi _{A} + \chi _{B} = 1\) in \(\Omega \). Moreover, we further use the notation
In addition to the “elastic” energy contributions, we also introduce a surface energy contribution of the form
and consider the following singularly perturbed elastic energy for \(\epsilon >0\)
and correspondingly
We note that this can be defined for an arbitrary set of states \(\mathcal{K}\) and suitable boundary data \(F_{\lambda}\).
With these quantities in hand, we can formulate the following quantitative rigidity results for the two-state problems:
Theorem 1
Let \(d,n,m,k \in \mathbb{N}\), \(n > 1\). Let \(\Omega \subset \mathbb{R}^{d}\) be a bounded Lipschitz domain, and let \(\mathcal{A}(D)\) be as in (2) with the wave cone \(\Lambda _{\mathcal{A}}\) given in (5) and let \(I_{\mathcal{A}}\) be given in (8). Let \(A,B \in \mathbb{R}^{n}\) and let \(\chi = \chi _{A} A + \chi _{B} B \in L^{2}(\Omega ;\{A,B\})\). Further, let the elastic and surface energies \(E_{el}\), \(E_{surf}\) be given as in (9) and (11), respectively. For \(\lambda \in (0,1) \) set \(F_{\lambda }= \lambda A + (1-\lambda )B \in \mathbb{R}^{n}\) and consider \(\mathcal{D}_{F_{\lambda}}\) as in (10). The following results hold:
-
(i)
Incompatible case: Assume that \(B-A \notin \Lambda _{\mathcal{A}}\). Then there is a constant \(C= C(A,B) >0\) such that,
$$\begin{aligned} \inf _{\chi \in L^{2}(\Omega ;\{A,B\})}\inf _{u\in \mathcal{D}_{F_{\lambda}}} E_{el}(u,\chi ) \geq C (\min \{\lambda , 1- \lambda \})^{2}|\Omega |. \end{aligned}$$ -
(ii)
Compatible case: Assume that \(\mathcal{A}(D)\) is one-homogeneous, i.e. \(k=1\) in (2), and that \(B-A \in \Lambda _{\mathcal{A}} \setminus I_{\mathcal{A}}\). Then, there exist \(C= C(\mathcal{A}(D),A,B,\Omega , d, \lambda ) >0\) and \(\epsilon _{0}=\epsilon _{0}(\mathcal{A}(D),A,B,\Omega , d, \lambda )>0\) such that for \(\epsilon \in (0,\epsilon _{0})\)
$$\begin{aligned} \inf _{\chi \in BV(\Omega ;\{A,B\})}\inf _{u\in \mathcal{D}_{F_{\lambda}}} (E_{el}(u,\chi ) + \epsilon E_{surf}(\chi )) \geq C \epsilon ^{2/3}. \end{aligned}$$Furthermore, if we assume \(\mathcal{A}(D) = \operatorname {div}\) and \(\Omega = [0,1]^{d}\), then there exists a constant \(c = c(A,B,\lambda ) > 0\) such that for \(\epsilon > 0 \) we also have the matching upper bound
$$\begin{aligned} \inf _{\chi \in BV(\Omega ;\{A,B\})} \inf _{u \in \mathcal{D}_{F_{ \lambda}}} (E_{el}(u,\chi ) + \epsilon E_{surf}(\chi )) \leq c \epsilon ^{2/3}. \end{aligned}$$ -
(iii)
Super-compatible case: Assume that \(A-B \in I_{\mathcal{A}}\). Then,
$$\begin{aligned} \inf _{\chi \in BV(\Omega ;\{A,B\})}\inf _{u\in \mathcal{D}_{F_{\lambda}}} (E_{el}(u,\chi ) + \epsilon E_{surf}(\chi )) = 0. \end{aligned}$$
We highlight that in our discussion of the compatible case, we have restricted ourselves to operators of order one. This is due to the fact that for higher order operators it is expected that more complicated microstructures may arise. This is also reflected in the Fourier space properties of the symbol \(\mathbb{A}\). We refer to Sect. 3.5, see Proposition 3.10, for a brief discussion of this, illustrating that the scaling may, in general, be no longer of the order \(\epsilon ^{\frac{2}{3}}\) in the higher order setting.
Let us discuss a prototypical example of the above results:
Example 1.1
As an example of the above differential inclusion, we consider the case in which \(\mathcal{A}(D) = \operatorname {div}: C^{\infty}(\mathbb{R}^{d};\mathbb{R}^{m \times d}) \to C^{\infty}(\mathbb{R}^{d};\mathbb{R}^{m})\) row-wise for matrix fields \(u: \mathbb{R}^{d} \to \mathbb{R}^{m \times d}\). In this case the boundary value problem under consideration turns into the following differential inclusion:
for some \(F_{\lambda}:= \lambda A + (1-\lambda ) B\), \(\lambda \in (0,1)\). Such differential inclusions are related to applications in shape-optimization as, for instance, in [22].
The divergence is applied row-wise to matrix fields \(u : \mathbb{R}^{d} \to \mathbb{R}^{m \times d}\), i.e. \(\mathcal{A}(D)u = \sum _{j=1}^{d} (\partial _{j} u) e_{j} \) with the matrix vector product. Hence \(\mathbb{A}(\xi )M = M \xi \) for \(M \in \mathbb{R}^{m \times d}\) and thus the wave cone is given by
Moreover the divergence is a cocanceling operator as \(I_{\operatorname {div}} = \{0\}\).
We emphasize that Example 1.1 is indeed a prototypical example and plays a central role in the study of first order operators in that all first order operators can be reduced to this model operator by a suitable linear transformation, see [28, Appendix] and also Sect. B below. We emphasize that this reduction is particularly useful if the differential inclusion is incompatible or if the boundary data are in \(\Lambda _{\mathcal{A}} \setminus I_{\mathcal{A}}\). As a consequence, quantitative lower bound estimates for incompatible differential inclusions for first order operators, e.g. for \(T_{N}\) structures as qualitatively studied in [28], or for compatible, but not super-compatible boundary data can be deduced from the ones of the divergence operator (see Proposition B.2 and, in general, the discussion in Sect. B). A reduction to an equivalent problem for a modified operator and modified boundary data to the setting involving a cocanceling operator will be discussed in Sect. 3.4, see Proposition 3.8 and Corollary 3.9.
1.3 Quantitative Rigidity of a \(T_{3}\) Structure for the Divergence Operator
In the second part of the article, building on the works [2–4] and motivated by the highlighted considerations on the role of the divergence operator, we study the quantitative rigidity of the \(T_{3}\) configuration
where \(\Omega = [0,1]^{3}\), \(u: \mathbb{R}^{3} \rightarrow \mathbb{R}^{3\times 3}\) and
By virtue of the results from [3, 4] this problem is flexible for approximate solutions but rigid on the level of exact solutions:
-
More precisely, on the level of exact solutions to (13), only constant solutions \(u \equiv A_{j}\) for \(j\in \{1,2,3\}\) obey the differential inclusion.
-
Considering however approximate solutions, i.e. sequences \((u_{k})_{k\in \mathbb{N}}\) such that
$$\begin{aligned} \operatorname {dist}(u_{k}, \{A_{1},A_{2},A_{3}\}) \rightarrow 0 \text{ in measure as } k \rightarrow \infty , \ \operatorname {div}u_{k} = 0 \text{ for all } k \in \mathbb{N}, \end{aligned}$$there exists a sequence of approximate solutions \((u_{k})_{k\in \mathbb{N}}\) for (13) such that there is no subsequence which converges in measure to one of the constant deformations \(\{A_{1},A_{2},A_{3}\}\).
Compared to the setting of the gradient, for the divergence operator rigidity for approximate solutions is already lost for the three-state problem (while this arises only for four or more states for the gradient [78, 79], see also [18] for further instances in which the Tartar square was found and used).
As in [2] we here study a quantitative version of the dichotomy between rigidity and flexibility: We consider the singularly perturbed variant of (13) as in Sect. 1.2
under the constraint \(u\in \mathcal{D}_{F}\), cf. (10), with \(\mathcal{A}(D) = \operatorname {div}\) and where \(F\in \{A_{1},A_{2},A_{3}\}^{qc}\) (see Sect. 2 for the definition of the \(\mathcal{A}\)-quasi-convexification of a compact set). Here the function \(\chi \in BV(\Omega ; \{A_{1},A_{2},A_{3}\})\) denotes the phase indicator of the “phases” \(A_{1}\), \(A_{2}\), \(A_{3}\), respectively.
Adapting the ideas from [2] to the divergence operator in three dimensions, we prove the following scaling result:
Theorem 2
Let \(\Omega = [0,1]^{3}\) and let \(\mathcal{K} = \{A_{1},A_{2},A_{3}\}\) with \(A_{j}\), \(j\in \{1,2,3\}\) given in (14), \(F\in \mathcal{K}^{qc}\setminus \mathcal{K}\), \(E_{\epsilon}\) be as in (15) above for the divergence operator \(\mathcal{A}(D) = \operatorname {div}\), and consider \(\mathcal{D}_{F}\) given analogously as (10). Then, there exist constants \(c = c(F) >0\) and \(C = C(F) > 1\) such that for any \(\nu \in (0,\frac{1}{2})\) there is \(\epsilon _{0} = \epsilon _{0}(\nu ,F)>0\) and \(c_{\nu}>0\) such that for \(\epsilon \in (0,\epsilon _{0})\) we have
Let us comment on this result: As in [2] we obtain essentially matching upper and lower scaling bounds with less than algebraic decay behavior as \(\epsilon \rightarrow 0\), reflecting the infinite order laminates underlying the \(T_{3}\) structure and the fact that the problem is “nearly” rigid. As in [2] a key step is the analysis of a “quantitative chain rule in a negative Sobolev space” which results from the interaction of Riesz type transforms and a nonlinearity originating from the “ellipticity” of the differential inclusion. Both in the upper and the lower bound, these estimates for the divergence operator however require additional care due to the three-dimensionality of the problem. In the upper bound construction this is manifested in the use of careful cut-off arguments; in the lower bound, a more involved iterative scheme has to be used to reduce the possible regions of concentration in Fourier space.
Similarly, as in [76] the scaling law from Theorem 2 is obtained as a consequence of a rigidity estimate encoding both the rigidity and flexibility of the \(T_{3}\) differential inclusion (in analogy to the \(T_{4}\) case from [76, Proposition 3]).
Proposition 1.2
Let \(\Omega = [0,1]^{3}\) and let \(\mathcal{K} = \{A_{1},A_{2},A_{3}\}\) with \(A_{j}\), \(j\in \{1,2,3\}\) given in (14), \(F\in \mathcal{K}^{qc}\setminus \mathcal{K}\). Let \(\chi _{jj}\) denote the diagonal entries of the matrix field \(\chi \in BV(\Omega ;\mathcal{K})\) and denote by \(E_{\epsilon}^{per}(\chi ;F)\) the periodic singularly perturbed energy (see (32) and Sect. 4.3.2). Then, there exists \(\epsilon _{0} >0\) such that for any \(\nu \in (0,1)\) there is a constant \(c_{\nu}>0\) such that for \(\epsilon \in (0,\epsilon _{0})\) we have
We emphasize that in parallel to the setting of the Tartar square, this estimate quantitatively encodes both rigidity and flexibility of the differential inclusion, as it measures the distance to the constant state (and thus reflects rigidity of the exact differential inclusion) but also quantifies the “price” for this in terms of a “high energy” scaling law (and thus reflects the underlying flexibility of the approximate problem). Moreover, due to the flexibility of the differential inclusion, we stress that such an estimate can only be inferred for a combination of elastic and surface energies.
1.4 Outline of the Article
The remainder of the article is structured as follows: After briefly recalling relevant notation and facts on convex hulls related to the operator \(\mathcal{A}(D)\), we first discuss the compatible and incompatible two-state problems in Sect. 3. Here we begin by discussing the incompatible setting, which is a consequence of direct elliptic estimates in Sect. 3.2 and then turn to the lower bounds for the compatible setting in Sect. 3.3. The super-compatible case is then treated in Sect. 3.4. It is also in this section that we discuss a reduction to cocanceling operators. We revisit the scaling of a prototypical higher order operator from [1] in Sect. 3.5 and explain how our scheme of deducing lower bounds also yields a quick Fourier based proof of the lower scaling bound from [1].
In Sect. 4 we then turn to the \(T_{3}\) differential inclusion, for which we first prove upper bounds in Sect. 4.1 and then adapt the ideas from [2] to infer essentially matching lower bounds in Sect. 4.2.
In the Appendix, we complement the general lower bounds for first order differential operators with upper bounds for the specific case of the divergence operator (Sect. A). Moreover, in Sect. B.1 we discuss the reduction to the divergence operator.
2 Notation
In this section we collect the notation which is used throughout the article.
-
For a set \(U\), we denote by \(d_{U}\) the (possibly smoothed-out) distance to this set: \(d_{U}(x) = \operatorname{dist}(x,U) = \inf _{y \in U}|x-y|\) and denote by \(\chi _{U}\) the (in some places smoothed-out) indicator function of this set.
-
For a set \(U\) with finite measure and a function \(f: U \to \mathbb{R}^{n}\) we denote the mean by \(\langle f \rangle = \frac{1}{|U|} \int _{U} f dx\).
-
For a function \(f \in L^{2}(\mathbb{T}^{d})\) or \(f \in L^{2}(\mathbb{R}^{d})\) we denote the Fourier transform by
$$\mathcal{F}(f)(k) = \hat{f}(k) = (2 \pi )^{-\frac{d}{2}}\int _{ \mathbb{T}^{d}} e^{- i k \cdot x} f(x) dx\ (k \in \mathbb{Z}^{d}) $$or
$$\mathcal{F}(f)(\xi ) = \hat{f}(\xi ) = (2 \pi )^{-\frac{d}{2}} \int _{ \mathbb{R}^{d}} e^{-i \xi \cdot x} f(x) dx\ (\xi \in \mathbb{R}^{d}). $$For a function \(h \in L^{\infty}(\mathbb{R}^{d}) \), we denote by \(h(D)\) the corresponding Fourier multiplier
$$\begin{aligned} h(D)f = \mathcal{F}^{-1}(h(\cdot )\hat{f}(\cdot )). \end{aligned}$$ -
We usually denote the phase indicator by \(\chi \in L^{2}(\Omega ;\mathcal{K})\), with the component functions given by \(\chi = (\chi _{i})\), \(i\in \{1,\dots ,n\}\). Moreover, we use \(F \in \mathcal{K}^{qc}\) (where \(\mathcal{K}^{qc}\) is introduced below) as the exterior data.
-
The set of admissible ’deformations’ \(u\) for an operator of order \(k\geq 1\) is given by \(\mathcal{D}_{F} := \mathcal{D}_{F}^{\mathcal{A}} := \{u \in L^{2}_{loc}( \mathbb{R}^{d} ; \mathbb{R}^{n}): \mathcal{A}(D) u = 0 \text{ in } \mathbb{R}^{d}, \ u = F \text{ in } \mathbb{R}^{d} \setminus \bar{\Omega}\}\). Here the equation \(\mathcal{A}(D)u=0\) is considered in a distributional sense.
-
As introduced in Sect. 1.2, for \(F \in \mathcal{K}^{qc}\) we consider the elastic energy with \(u \in \mathcal{D}_{F}\), \(\chi \in L^{2}(\Omega ;\mathcal{K})\):
$$\begin{aligned} E_{el}(u,\chi ) = \int _{\Omega }|u-\chi |^{2} dx, \ E_{el}(\chi ;F) = \inf _{u \in \mathcal{D}_{F}} E_{el}(u,\chi ), \end{aligned}$$(16)and, for \(\chi \in BV(\Omega ;\mathcal{K})\), the surface energy as the total variation norm:
$$\begin{aligned} E_{surf}(\chi ) = \Vert \nabla \chi \Vert _{TV(\Omega )} = \int _{ \Omega }|\nabla \chi |. \end{aligned}$$(17) -
The total energy with \(\epsilon >0\) is given by \(E_{\epsilon}(u,\chi ) = E_{el}(u,\chi ) + \epsilon E_{surf}(\chi )\).
-
We write \(f \sim g\) if there are constants \(c,C>0\) such that \(c f \leq g \leq C f\).
-
We use the notation \(\Vert \cdot \Vert _{\dot{H}^{-1}}\) for the homogeneous \(H^{-1}\) semi-norm for \(f \in H^{-1}(\mathbb{T}^{3};\mathbb{R})\):
$$\begin{aligned} \Vert f \Vert _{\dot{H}^{-1}}^{2} = \sum _{k \in \mathbb{Z}^{3} \setminus \{0\}} \frac{1}{|k|^{2}} |\hat{f}(k)|^{2}. \end{aligned}$$
We further recall the notions of the \(\Lambda _{\mathcal{A}}\)-convex hull of a set (see [64]) and the \(\mathcal{A}\)-quasi-convex hull of a set:
-
Let \(\Lambda _{\mathcal{A}} \subset \mathbb{R}^{n}\) be the wave cone from (5) and let \(K\subset \mathbb{R}^{n}\) be a compact set. For \(j\in \mathbb{N}\), we then define \(K^{(j)}\) as follows:
$$\begin{aligned} K^{(0)} &:= K, \\ K^{(j)} &:=\{M \in \mathbb{R}^{n}: \ M = \lambda A + (1-\lambda ) B: \ A,B \in K^{(j-1)}, \ B-A \in \Lambda _{\mathcal{A}}, \ \lambda \in [0,1] \}. \end{aligned}$$Moreover, we define the \(\Lambda _{\mathcal{A}}\)-convex hull \(K^{lc}\):
$$\begin{aligned} K^{lc}:= \bigcup _{j=0}^{\infty} K^{(j)}. \end{aligned}$$We define the order of lamination of a matrix \(M \in \mathbb{R}^{n}\) to be the minimal \(j\in \mathbb{N}\) such that \(M\in K^{(j)}\). In analogy to the gradient case, we will also refer to \(K^{lc}\) as the laminar convex hull.
-
We recall that the \(\mathcal{A}\)-quasi-convex hull \(K^{qc}\) of a compact set \(K \subset \mathbb{R}^{n}\) is defined by duality to \(\mathcal{A}\)-quasi-convex functions [48]. We recall that the \(\operatorname {div}\)-quasi-convex hull of the \(T_{3}\) matrices from (13), (14) has been explicitly characterized in [21, Theorem 2] to consist of the union of the closed triangle formed by the matrices \(S_{1},S_{2},S_{3}\) and the three “legs” formed by the line segments \(A_{j} S_{j}\). The matrices \(S_{1},S_{2},S_{3} \in \mathbb{R}^{3\times 3}\) are introduced in Sect. 4 below. Given the set \(\{A_{1},A_{2},A_{3}\} \subset \mathbb{R}^{3\times 3}\) from (13), (14), as in the gradient case, we denote its \(\mathcal{A}\)-quasi-convex hull by \(\{A_{1},A_{2},A_{3}\}^{qc}\).
3 Quantitative Results on the Two-State Problem
In this section, we study quantitative versions of the two-state problem for general, linear, constant coefficient, homogeneous operators, always considering the divergence operator as a particular model case.
Building on the precise formulation of the problem from (7), in Sect. 3.1, we first characterize the elastic energies in terms of the operator \(\mathcal{A}(D)\). With this characterization in hand, using ellipticity, we next prove the quantitative \(L^{2}\) bounds in the incompatible two-well case (Sect. 3.2) and lower \(\epsilon ^{\frac{2}{3}}\)-scaling bounds for the compatible case with first order, linear operators (Sect. 3.3). In Sect. 3.4, we prove Theorem 1(iii) and deduce a reduction to cocanceling operators. In Sect. 3.5 we briefly discuss the role of degeneracies in the symbol of the elastic energies which may arise for higher order, linear differential operators and which may thus lead to an alternative scaling behavior different from the \(\epsilon ^{\frac{2}{3}}\)-scaling bound.
Finally, in Sect. A we complement the lower bounds from this section with matching upper bounds for the special case of \(\mathcal{A}(D) = \operatorname {div}\). Similar constructions are also known for the gradient, the symmetrized gradient and lower dimensional problems from micromagnetics [1, 66, 73, 74, 80].
3.1 Elastic Energy Characterization
We begin by recalling an explicit lower bound for the elastic energy in terms of the operator \(\mathcal{A}(D)\) (see, for instance also [49, discussion before Lemma 1.17]) in an, for us, convenient form. In everything that follows, we will assume \(d > 1\).
Lemma 3.1
Fourier characterization of the elastic energy
Let \(d,n,k \in \mathbb{N}\). Let \(\Omega \subset \mathbb{R}^{d}\) be a bounded Lipschitz domain with associated indicator function \(\chi _{\Omega}\) and let \(\mathcal{K} \subset \mathbb{R}^{n}\). Let \(\mathcal{A}(D) = \sum _{|\alpha |=k} A_{\alpha }\partial ^{\alpha}\) be as in (2) with the symbol \(\mathbb{A}\), cf. (3), and \(E_{el}\) be as in (16) and \(F \in \mathcal{K}^{qc}\). Then there is a constant \(c=c(\mathcal{A}(D))>0\) such that for any \(\chi \in L^{2}(\Omega ;\mathcal{K})\), extended by zero outside of \(\Omega \),
where the infimum is taken over all \(u \in L^{2}_{loc}(\mathbb{R}^{d} ; \mathbb{R}^{n})\) that fulfill \(\mathcal{A}(D) u = 0\) in \(\mathbb{R}^{d}\).
Proof
The proof follows from a projection argument in Fourier space.
Step 1: Whole space extension, Fourier and pseudoinverse of the differential operator. We begin by transforming our problem to one on the whole space \(\mathbb{R}^{d}\), introducing the whole space extension \(w := u - F\):
where we have extended all functions in the integrand by zero outside of \(\Omega \).
In the following, we write \(\tilde{\chi} := \chi -F\chi _{\Omega}\). Fourier transforming the expression for the elastic energy then leads to
Further, seeking to deduce a lower bound, we relax the boundary data for \(w\), obtaining
Minimizing the integrand \(\hat{w}\) (and still denoting the minimizer by \(\hat{w}\)) for each fixed mode \(\xi \in \mathbb{R}^{d} \setminus \{0\}\), we infer that
where we view \(\mathbb{A}(\xi ) \mathbb{A}(\xi )^{*}: \operatorname {ran}\mathbb{A}(\xi ) \to \operatorname {ran}\mathbb{A}(\xi )\) as an isomorphism. This directly implies the claimed lower bound for \(E_{el}(\chi ;F)\) in terms of the symbol \(\mathbb{A}\) and its pseudoinverse.
Step 2: Proof of the final estimate. Now to show the final estimate for the elastic energy, we seek to bound \(|\mathbb{A}(\xi )^{\ast }(\mathbb{A}(\xi )\mathbb{A}(\xi )^{\ast})^{-1} \mathbb{A}(\xi ) x|\) for every \(\xi \in \mathbb{R}^{d}\), \(x\in \mathbb{R}^{n}\) from below in terms of \(|\mathbb{A}(\xi ) x|\). As the projection operator is zero-homogeneous, we can reduce to \(\xi \in \mathbb{S}^{d-1}\), and thus can use the continuity of \(\mathbb{S}^{d-1} \ni \xi \mapsto \mathbb{A}(\xi )\) and the compactness of \(\mathbb{S}^{d-1}\) for the desired bound: For any \(x\in \mathbb{R}^{n}\), \(\xi \in \mathbb{S}^{d-1}\) it hence holds
As \(\mathcal{A}(D) \neq 0\), we have that \(0<\sup _{\zeta \in \mathbb{S}^{d-1}}|\mathbb{A}(\zeta )|\leq C < \infty \). Dividing by this and plugging this into the expression with the pseudoinverse, we obtain
which concludes the argument. □
We emphasize that we are relaxing the boundary conditions for \(\hat{w}(\xi ) := \Pi _{\ker \mathbb{A}(\xi )} \hat{\tilde{\chi}}(\xi )\) as we do not calculate the projection of \(\chi \) onto \(\mathcal{D}_{F}\), hence the above Fourier bounds only provide lower bounds for the elastic energy.
We apply the lower bound from Lemma 3.1 to the two-well problem:
Corollary 3.2
Let \(d,n \in \mathbb{N}\). Let \(\Omega \), \(\mathcal{A}(D)\), \(\mathbb{A}\), \(E_{el}\) be as in Lemma 3.1. Consider \(\mathcal{K} = \{A,B\} \subset \mathbb{R}^{n}\) with \(F_{\lambda }= \lambda A + (1-\lambda )B\) for \(\lambda \in (0,1)\). Then there exists a constant \(C = C(\mathcal{A}(D)) > 0\) such that for any \(\chi = \chi _{A} A + \chi _{B} B \in L^{2}(\Omega ;\mathcal{K})\), extended to \(\mathbb{R}^{d}\) by zero, it holds
Proof
Using the expression of \(F_{\lambda}\) in terms of \(A\), \(B\), \(\lambda \) yields \(A- F_{\lambda }= (1-\lambda )(A-B)\) and \(B-F_{\lambda }= - \lambda (A-B)\). Thus, the fact that \(\chi = \chi _{A} A + \chi _{B} B\) and Lemma 3.1 imply
□
Remark 3.3
The divergence operator
As seen in Example 1.1, in the case of \(\mathcal{A}(D) = \operatorname {div}\), we have for \(u \in C^{\infty}(\mathbb{R}^{d};\mathbb{R}^{d \times d})\) (note that we chose square matrices out of simplicity)
With this we can calculate
This, in particular, shows that the adjoint operator is given by \(\mathbb{A}(\xi )^{*}: \mathbb{R}^{d} \to \mathbb{R}^{d \times d}\) with
Therefore \(\mathbb{A}(\xi )\mathbb{A}(\xi )^{*} x = (x \otimes \xi ) \xi = | \xi |^{2} x \), and the projection in the lower bound for the elastic energy of Lemma 3.1 takes the desired form
Furthermore it holds
3.2 The Incompatible Two-Well Problem and Scaling
As a first application of the Fourier characterizations from the previous section, we prove a quantitative lower bound for the incompatible two-well problem. We emphasize that – as in [27] – this argument is an elliptic argument and thus can be applied to all linear, constant coefficient homogeneous operators. Indeed, the following result holds:
Proposition 3.4
Let \(d,n \in \mathbb{N}\). Let \(\Omega \subset \mathbb{R}^{d}\) be a bounded Lipschitz domain, let \(\mathcal{A}(D)\) be given in (2) and \(A,B \in \mathbb{R}^{n}\) with \(B-A \notin \Lambda _{\mathcal{A}}\), cf. (5), further let \(F_{\lambda }= \lambda A+ (1-\lambda )B\) for some \(\lambda \in (0,1)\), and let \(E_{el}\) be as in (16) with \(\mathcal{D}_{F_{\lambda}}\) given in (10). Then there is \(C = C(A,B, \mathcal{A}(D)) > 0\), such that for any \(\chi \in L^{2}(\Omega ;\{A,B\})\)
Proof
By virtue of Corollary 3.2, we have the lower bound
where the constant only depends on the operator \(\mathcal{A}(D)\).
As \((A-B) \notin \ker \mathbb{A} (\xi )\) for all \(\xi \in \mathbb{R}^{d}\) and thus \(|\mathbb{A}(\xi )(A-B)| > 0\) for any \(\xi \in \mathbb{R}^{d}\), by continuity of \(\xi \mapsto \mathbb{A}(\xi )\) and compactness of \(\mathbb{S}^{d-1}\), this implies \(|\mathbb{A}(\frac{\xi}{|\xi |})(A-B)| \geq C(A,B, \mathcal{A}(D)) > 0\). Hence,
where \(\Omega _{A}:=\operatorname {supp}(\chi _{A})\subset \overline{\Omega}\) and \(\Omega _{B}:=\operatorname {supp}(\chi _{B})\subset \overline{\Omega}\). □
Remark 3.5
We emphasize that this result can be viewed as an incompatible nucleation bound.
3.3 The Compatible Two-Well Case and Scaling
We next turn to the setting of two compatible wells and restrict our attention to first order operators. In this case, we claim the following \(\epsilon ^{\frac{2}{3}}\)-lower scaling bound. This is in analogy to the situation for the gradient which had first been derived in the seminal works [62, 63].
Proposition 3.6
Let \(d,n \in \mathbb{N}\). Let \(\Omega \subset \mathbb{R}^{d}\) be a bounded Lipschitz domain and let \(\mathcal{A}(D)\) be a first order operator as in (2). Let \(A,B \in \mathbb{R}^{n}\) be such that \(B-A\in \Lambda _{\mathcal{A}}\setminus I_{\mathcal{A}}\) where \(\Lambda _{\mathcal{A}}\) and \(I_{\mathcal{A}}\) are given in (5) and (8), and define \(F_{\lambda} := \lambda A + (1-\lambda )B\) for some \(\lambda \in (0,1)\). Let \(E_{\epsilon}:= E_{el} + \epsilon E_{surf}\) be given in (12) with \(\mathcal{D}_{F_{\lambda}}\) defined in (10). Then, there exist \(C = C(\mathcal{A}(D),A,B,\Omega , d, \lambda )>0\) and \(\epsilon _{0}=\epsilon _{0}(\mathcal{A}(D),A,B,\Omega , d, \lambda )>0\) such that for any \(\epsilon \in (0,\epsilon _{0})\) we have
In order to deal with the compatible case, we invoke the following (slightly generalized) auxiliary results from [2], see also [23, 71], which we formulate for a general Fourier multiplier \(m\):
Lemma 3.7
Elastic, surface and low frequency cut-off
Let \(d,n,N\in \mathbb{N}\). Let \(\Omega \subset \mathbb{R}^{d}\) be a bounded Lipschitz domain. Let \(m: \mathbb{R}^{d} \to \mathbb{R}^{N}\) be a linear map and denote by \(V := \ker m\subsetneq \mathbb{R}^{d}\) its kernel and by \(\Pi _{V}: \mathbb{R}^{d} \to V\) the orthogonal projection onto \(V\). Let \(f \in BV(\mathbb{R}^{d};\{-\lambda ,0,1-\lambda \})\) for \(\lambda \in (0,1)\) with \(f = 0\) outside \(\Omega \) and \(f \in \{-\lambda ,1-\lambda \}\) in \(\Omega \). Consider the elastic and surface energies given by
Then the following results hold:
-
(a)
Low frequency elastic energy control. Let \(\mu > 1\), then there exists \(C=C(m,\Omega )>0\) with
$$\begin{aligned} \Vert \hat{f}\Vert _{L^{2}(\{\xi \in \mathbb{R}^{d}: \ |\Pi _{V}(\xi )| \leq \mu \})}^{2} \leq C \mu ^{2} \tilde{E}_{el}(f). \end{aligned}$$ -
(b)
High frequency surface energy control. There exists \(C=C(d,\lambda )>0\) such that for \(\mu > 0\) it holds
$$\begin{aligned} \Vert \hat{f}\Vert _{L^{2}(\{\xi \in \mathbb{R}^{d}: |\xi |\geq \mu \})}^{2} \leq C \mu ^{-1}( \tilde{E}_{surf}(f) + \operatorname {Per}(\Omega )). \end{aligned}$$
Proof of Lemma 3.7
Since the property (b) is directly analogous to the one from [2, Lemma 2], we only discuss the proof of (a) which requires some (slight) modifications with respect to [2] (and [23]). We thus present the argument for this for completeness. We split \(\xi = \xi ' + \xi '' \), where \(\xi ' \in V^{\perp }\neq \{0\}\), \(\xi '' \in V = \ker m\). With this in hand and by the linearity of \(m\) it holds
for some matrix representation \(M\) of \(m\). Hence, using \(\xi ' \perp \ker m\), there is \(c = c(m) >0\) such that
With this in hand, we argue similarly as in [23] and [2]: For \(a,b\in \mathbb{R}\), \(a,b\geq 0\) and the orthogonal splitting \(\xi = \xi ' + \xi ''\) from above, it holds that
Using the notation \(f(\xi ) = f(\xi ',\xi '')\), setting
and using Plancherel’s identity, the \(L^{\infty}-L^{1}\) bounds for the Fourier transform and Hölder’s inequality, we obtain that
In particular, the constant \(a\) is well-defined. Returning to (18), we consequently deduce that for \(b \in (0, 1)\)
Choosing \(b= \mu ^{-1} < 1\) and noting that \(\{\xi \in \mathbb{R}^{d}: |\Pi _{V}(\xi )|\leq \mu \} = \{\xi \in \mathbb{R}^{d}: \ |\xi ''| \leq \frac{1}{b}\}\) implies the claim. □
With Lemma 3.7 in hand, we turn to the proof of the lower bound in Proposition 3.6:
Proof of the lower bound in Proposition 3.6
Since \(B-A \in \Lambda _{\mathcal{A}}\setminus I_{\mathcal{A}}\), there exists \(\xi \in \mathbb{R}^{d}\setminus \{0\}\) such that \(\mathbb{A}(\xi )(B-A)=0\). As \(\mathcal{A}(D)\) is a first order operator, we have that \(\mathbb{A}(\xi )\) is linear in \(\xi \in \mathbb{R}^{d}\). In particular, we have that the set, cf. (6),
is a linear space. Rewriting \(\mathbb{R}^{d} \ni \xi = \xi ' + \xi '' \) with \(\xi ' \in V_{B-A}^{\perp}\) and \(\xi '' \in V_{B-A}\) as in the proof of Lemma 3.7, then Corollary 3.2 implies that
for a constant \(C\) depending on the operator \(\mathcal{A}(D)\) and on \(A-B\).
Now, setting \(m(\xi ):= \mathbb{A}(\xi )(A-B)\) and \(f:=(1-\lambda ) \chi _{A} - \lambda \chi _{B}\), yields the applicability of Lemma 3.7 with \(V = V_{B-A}\subsetneq \mathbb{R}^{d}\). This is the only place where we use the assumption that \(A-B\notin I_{\mathcal{A}}\). We deduce that by (19) and the decomposition of \(\mathbb{R}^{d}\) into the two regions from Lemma 3.7 we have for \(\mu > 1\)
where the constant \(C\) depends on \(\mathcal{A}(D)\), \(A\), \(B\), \(\Omega \), \(d\), \(\lambda \). Now choosing \(\mu = \epsilon ^{-\frac{1}{3}}>1\), noting that then \(\mu ^{-1}\epsilon ^{-1} = \epsilon ^{-\frac{2}{3}} \) for \(\epsilon < 1\), we obtain
where \(E_{\epsilon}(\chi ;F_{\lambda}):= E_{el}(\chi ;F_{\lambda}) + \epsilon E_{surf}(\chi )\). Using the lower bound
then implies that
Finally, for \(\epsilon \in (0,\epsilon _{0})\) and \(\epsilon _{0} = \epsilon _{0}(\mathcal{A}(D),A,B,\Omega , d, \lambda )>0\) sufficiently small, the perimeter contribution on the right hand side can be absorbed into the left hand side, which yields the desired result. □
3.4 The Super-Compatible Setting: Proof of Theorem 1(iii) and Reduction to Cocanceling Operators
In this subsection we will show that if we are in the setting in which the estimates of the previous subsection degenerate, i.e., \(\mathbb{A}(\xi )(A-B)=0\) for all \(\xi \in \mathbb{R}^{d}\), then in fact there can be no non-trivial bound from below. We will however also show that, in general for pairwise not super-compatible wells, it is possible to reduce to an equivalent minimization problem in the setting of cocanceling operators for suitably modified boundary data.
Proof of the super-compatible case in Theorem 1
It suffices to give an upper bound construction with zero total energy. To this end, we consider \(\chi =A\chi _{\Omega }\) and \(u=A \chi _{\Omega} + F_{\lambda} \chi _{\mathbb{R}^{d} \setminus \overline{\Omega}}\) and observe that
As a consequence, \(u\) is admissible in the definition of the elastic energy and the elastic energy vanishes. Moreover, since \(\chi = A\) in \(\Omega \) we also have the vanishing of the surface energy. This concludes the argument. □
We will show that for two not super-compatible wells, we can always assume that \(I_{\mathcal{A}}=\{0\}\), in which case we work in the class of cocanceling operators introduced by Van Schaftingen in [65].
Proposition 3.8
Let \(n,d,k \in \mathbb{N}\), let \(\Omega \subset \mathbb{R}^{d}\) be a bounded Lipschitz domain, \(\mathcal{A}(D)\) a differential operator of order \(k\) as in (2) with \(I_{\mathcal{A}}\) given in (8). For \(\chi \in L^{2}(\Omega ; \mathcal{K})\) for some compact set of states \(\mathcal{K} \subset \mathbb{R}^{n}\) let \(E_{el}(\chi ; F)\) be as in (9) for \(F \in \mathcal{K}^{qc}\), cf. Sect. 2. Then for the restricted operator \(\tilde{\mathcal{A}}(D): C^{\infty}(\mathbb{R}^{d};I_{\mathcal{A}}^{ \perp}) \to C^{\infty}(\mathbb{R}^{d};\mathbb{R}^{m})\) there exists a function \(\chi _{\perp} \in L^{2}(\Omega , \Pi _{I_{\mathcal{A}}^{\perp}} \mathcal {K})\) such that
Here we denote the orthogonal projection of \(F\) onto \(I_{\mathcal{A}}^{\perp}\) by \(F_{\perp}\).
Proof
We use the orthogonal decomposition \(\mathbb{R}^{n} = I_{\mathcal{A}}^{\perp }\oplus I_{\mathcal{A}}\) to write
with \(u_{\perp} : \mathbb{R}^{d} \to I_{\mathcal{A}}^{\perp}\), \(u_{I}: \mathbb{R}^{d} \to I_{\mathcal{A}}\), \(\chi _{\perp} : \Omega \to I_{\mathcal{A}}^{\perp}\), \(\chi _{I}: \Omega \to I_{\mathcal{A}}\). By orthogonality we can also split the elastic energy
Defining the restricted operator \(\tilde{\mathcal{A}}(D): C^{\infty}(\mathbb{R}^{d};I_{\mathcal{A}}^{ \perp}) \to C^{\infty}(\mathbb{R}^{d};\mathbb{R}^{m})\), \(\tilde{\mathcal{A}}(D)u_{\perp} := \mathcal{A}(D)u_{\perp}\) and the restricted space of admissible functions as in (10)
we see that for \(u \in \mathcal{D}_{F}\) it holds \(u_{\perp }\in \mathcal{D}^{\tilde{\mathcal{A}}}_{F_{\perp}}\), with \(F_{\perp }= \Pi _{I_{\mathcal{A}}^{\perp}} F\), and \(u_{I} = F - F_{\perp}\) outside \(\Omega \).
Thus, after minimizing the elastic energy in \(u\), it holds
As we have seen in the proof of Theorem 1(iii), the second term involving \(u_{I}\) vanishes and hence,
This reduces the elastic energy to the case of a cocanceling operator as indeed \(I_{\tilde{A}} = \{0\}\). □
As the surface energy does not depend on the operator \(\mathcal{A}(D)\), this result yields:
As a corollary, we apply this to the \(N\)-well problem:
Corollary 3.9
Finitely many, pairwise not super-compatible wells
Under the same assumptions as in Proposition 3.8, with \(\chi \in BV(\Omega ;\mathcal{K})\), for the special case that \(\mathcal{K}:=\{B_{1},\dots ,B_{N} \}\subset \mathbb{R}^{n}\) for \(N \in \mathbb{N}\) such that \(B_{j}\), \(j\in \{1,\dots ,N\}\), are pairwise not super-compatible, i.e. \(B_{i} - B_{j} \notin I_{\mathcal{A}}\) for \(i \neq j\), there exists a constant \(C=C(B_{1},\dots ,B_{n})>1\) such that
In particular, it hence holds that
Proof
Writing \(\chi = \sum _{j=1}^{N} B_{j} \chi _{\Omega _{j}}\) with \(\chi _{\Omega _{j}} \in BV(\Omega ;\{0,1\})\), \(\sum _{j=1}^{N} \chi _{ \Omega _{j}} = 1\) in \(\Omega \), we can calculate
where we denote the reduced boundary of a set \(E\) with finite perimeter by \(\partial ^{\ast }E\) and used the notation from above for \(B \in \mathbb{R}^{n}\) to write \(B_{\perp }= \Pi _{I_{\mathcal{A}}^{\perp}} B\).
By assumption, for \(i < j\) it holds \(B_{i} - B_{j} \notin I_{\mathcal{A}}\), and therefore also the projection satisfies \((B_{i} - B_{j})_{\perp }\neq 0\). This implies \(|(B_{i} - B_{j})_{\perp}| > 0\) for all tupels \((i,j)\) such that \(i< j\) and hence there is a constant \(0 < C^{-1} < \frac{|B_{i}-B_{j}|}{|(B_{i}-B_{j})_{\perp}|} < C\) such that
This together with Proposition 3.8 concludes the proof. □
We emphasize that Corollary 3.9 in particular holds in the context of Theorem 1. Therefore in the statement of Theorem 1(ii) we can assume without loss of generality that \(I_{\mathcal{A}} = \{0\}\). In fact, note that in the crucial bound of Corollary 3.2 we have
for \(|\xi |=1\).
3.5 Some Remarks on the Compatible Two-State Problem for Higher Order Operators
We conclude our discussion of lower scaling bounds by commenting on the case of the compatible two-well problem for higher order operators. Here the situation is still less transparent, yet some remarks are possible.
Indeed, on the one hand, it is known that, in general, for operators of order \(k\geq 2\) the two-well problem does not have to scale with \(\epsilon ^{\frac{2}{3}}\). In order to illustrate this, we consider the specific operator \(\mathcal{A}{(D)}:= \operatorname{curl} \operatorname{curl}\). This operator is the annihilator of the symmetrized gradient \(e(v):= \frac{1}{2}(\nabla v + (\nabla v)^{t})\). We consider the following quantitative two-state problem (for \(d=2\))
with \(e(v) \in \mathcal{D}^{\mathcal{A}}_{F}\), \(\chi \in BV([0,1]^{2};\{0,1 \})\), \(\alpha \in (0,1)\) and study the corresponding minimization problem with prescribed boundary data \(F:= \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} \).
Proposition 3.10
Let \(\mathcal{E}_{\epsilon}(v,\chi )\) and \(F\) be as in (20). Then there exists \(\epsilon _{0}= \epsilon _{0}(\alpha )>0\) such that for \(\epsilon \in (0,\epsilon _{0})\) it holds that
We remark that this observation is not new; indeed, a geometrically nonlinear version of this had earlier been derived in [1, Theorem 1.2]. As observed in [1] the reason for the different scaling in Proposition 3.10 and [1, Theorem 1.2], compared to the more standard \(\epsilon ^{\frac{2}{3}}\) behavior from Theorem 1, consists of the higher degeneracy of the multiplier associated with the energy which is manifested in the presence of only one possible normal in the (symmetrized) rank-one condition. On the level of the multiplier this can be seen as \(\mathbb{A}(\xi )(e_{2} \otimes e_{2}) = \xi _{1}^{2}\), opposed to \(\mathbb{A}(\xi )(\frac{e_{1} \otimes e_{2} + e_{2} \otimes e_{1}}{2}) = - \xi _{1} \xi _{2}\), has only one root of multiplicity two instead of two roots of single multiplicity on the unit sphere. For convenience of the reader and in order to illustrate the robustness of the above approach within geometrically linear theories, we present an alternative short proof (of the lower bound) of Proposition 3.10 based on our Fourier theoretic framework. We note that in the geometrically linear setting this provides an alternative to the approach from [1] in which the lower bound for the energy is deduced by a local “averaging” argument, considering the energy on representative domain patches with the expected scaling behavior.
Proof
Step 1: Lower bound. We note that the lower bound for this setting directly follows from our arguments above: Indeed, for \(A -B = 2\alpha e_{2} \otimes e_{2}\), we obtain that
With this in hand, an analogous argument as in Lemma 3.7 and, in particular, in (18) implies that
where we have used that in this situation the multiplier is given by \(m(\xi ) = \mathbb{A}(\xi )(B-A) \sim \xi _{1}^{2}\). The different exponent of \(\mu \) in (21) (compared to the one from Lemma 3.7(a)) is a consequence of the degeneracy of the symbol \(m(\xi )\) and the higher order of the operator \(\operatorname{curl} \operatorname{curl}\) (or put, more concretely, the quadratic dependence \(\xi _{1}^{2}\)). Hence, replacing the bound from Lemma 3.7(a) by the one from (21) and carrying out the splitting as in the proof of Theorem 1, we obtain the following optimization problem: For \(f:=1 + \alpha (1-2\chi )\)
Choosing \(\mu \sim \epsilon ^{-\frac{1}{5}}\) and rearranging the estimates then imply the claim.
Step 2: Upper bound. The associated improved upper bound makes use of the vectorial structure of the problem in contrast to the essentially scalar “standard \(\epsilon ^{2/3}\) construction” (see the arguments below). Recalling that the nonlinear construction from the proof of [1, Lemma 2.1] also yields a construction with the desired scaling for the geometrically linearized problem, we do not carry out the details of this but refer to [1, Lemma 2.1] for these. □
On the other hand, the arguments from [1, 73, 74, 81] show that still for \(\mathcal{A}(D) = \operatorname{curl} \operatorname{curl}\) if \(A -B = \gamma ( e_{1} \otimes e_{2} + e_{2} \otimes e_{1})\) for \(\gamma \in \mathbb{R}\setminus \{0\}\), then one recovers the \(\epsilon ^{2/3}\) scaling for the symmetrized gradient differential inclusion. In this case, the symbol reads \(m(\xi ) = \mathbb{A}(\xi )(B-A) \sim \xi _{1} \xi _{2}\) and the operator is “less degenerate”.
We expect that the scaling behavior of general higher order operators is in many interesting settings directly linked to the degeneracy of the symbol \(\mathbb{A}(\xi )(B-A)\). We plan to explore this in future work.
4 Quantitative Rigidity of the \(T_{3}\) Structure from (13), (14) for \(\mathcal{A}(D)=\operatorname {div}\)
In this section, we consider the \(T_{3}\) structure for the divergence operator introduced in (13), (14). The upper bound construction is given by an approximate solution of the type described in the introduction. The lower bound is motivated by the rigidity of exact solutions as outlined in Sect. 4.3.1.
4.1 The Upper Bound Construction – an Infinite Order Laminate
To begin with, we construct an infinite order laminate similar to the one for the Tartar square (cf. [2, 82, 83] for quantitative versions of this). This is based on [3] and will yield the upper bound estimate from Theorem 2. We recall that for the divergence operator, instead of requiring rank-one connectedness for the existence of a laminate as for the curl, in our three-dimensional set-up we need rank-one or rank-two connectedness as can be seen from the wave cone for the divergence operator, cf. (5). As a consequence, for two matrices \(A,B \in \mathbb{R}^{3 \times 3}\) such that \(\operatorname {rank}(B-A) \leq 2 \) there exists a piecewise constant map \(u: \mathbb{R}^{3} \to \mathbb{R}^{3 \times 3}\) such that \(u \in \{A,B\}\) a.e. in \(\Omega \) and \(\operatorname {div}u = 0 \). The lamination can be done in any direction of the kernel \(\ker (B-A) \neq \{0\}\).
Considering now the matrices \(A_{1}\), \(A_{2}\), \(A_{3}\) given in (14), we observe that \(\operatorname {rank}(A_{i} - A_{j}) = 3 \) for \(i \neq j\). Following [3], we introduce auxiliary matrices \(S_{1},S_{2},S_{3} \in \mathbb{R}^{3 \times 3}\).
It then holds for \(i=1,2,3\) that
where \(A_{4} = A_{1}\), \(S_{4} = S_{1}\). As proved in [21, Theorem 2], the \(\mathcal{A}\)-quasi-convex hull \(\{A_{1},A_{2},A_{3}\}^{qc}\) can be explicitly characterized as the convex hull of the matrices \(S_{1}\), \(S_{2}\), \(S_{3}\) together with the “legs” given by the line segments \(A_{j} S_{j}\) for \(j\in \{1,2,3\}\), cf. Fig. 1.
The diagonal matrices \(A_{1}\), \(A_{2}\), \(A_{3}\), \(S_{1}\), \(S_{2}\), \(S_{3}\) with the dashed lines depicting the connections in the wave cone for \(\mathcal{A}(D)=\operatorname {div}\) and the Voronoi-regions of \(A_{i}\) shown by the lines. As shown in [21] the set \(\mathcal{K}^{qc}\) is given by the inner triangle formed by \(S_{1} S_{2} S_{3}\) and the “legs” connecting \(S_{j}\) and \(A_{j}\) for \(j\in \{1,2,3\}\)
For simplicity and definiteness, we first assume, that \(u = F = S_{3}\) outside \(\Omega =[0,1]^{3}\). In this setting we prove the following energy estimate:
Proposition 4.1
Let \(\Omega = [0,1]^{3}\), \(\mathcal{K} = \{A_{1},A_{2},A_{3}\}\) for the particular choice of matrices in (14) and let \(E_{\epsilon}\) be as in (12) and \(\epsilon \in (0,1)\). Then for any \(r\in (0,\frac{1}{4})\) with \(r^{-1} \in 4\mathbb{N}\) there are sequences \(u^{(m)}\in W^{ 1,\infty}(\mathbb{R}^{3}; \mathbb{R}^{3\times 3})\), such that \(\operatorname {div}u^{(m)} = 0\), \(u^{(m)} = F := S_{3}\) outside \([0,1]^{3}\), and \(\chi ^{(m)} \in BV(\mathbb{R}^{3}; \mathcal{K})\), and a constant \(C = C(F)>1\) with
In order to achieve this, in the next subsections, we iteratively construct a higher and higher order laminate (depending on \(\epsilon >0\)). As in the setting of the Tartar square, we keep track of the surface and elastic energy contributions which arise in this process.
4.2 Proof of the Upper Bound from Theorem 2
We split the proof of the upper bound from Theorem 2 into several steps which we will carry out in the next sections and then combine in Sect. 4.2.4.
First, we start by a simple lamination of \(A_{1}\) and \(S_{1}\) to obtain regions in which \(u \in \mathcal{K}\) holds and to satisfy the exterior data condition \(u = F = S_{3} = \frac{1}{2}(A_{1}+S_{1})\) outside \(\Omega \). This is followed by a similar construction replacing \(S_{1}\) by a lamination of \(A_{2}\) and \(S_{2}\) achieving a second order laminate. Iterating the procedure of replacing \(S_{j}\) by a lamination of \(A_{j+1}\) and \(S_{j+1}\) (with the convention that \(A_{4} = A_{1}\), \(S_{4} = S_{1}\)) yields Proposition 4.1. Finally, we optimize the parameter \(r\) and the number of iterations depending on \(\epsilon \) in order to show the desired upper bound estimate in Proposition 4.3.
As we will use a potential for the laminates, we will define a “profile-function” once in a more general form and will then refer to this in our construction for the higher order laminates.
Lemma 4.2
Let \(R = [a_{1},b_{1}] \times [a_{2},b_{2}] \times [a_{3},b_{3}] \subset \mathbb{R}^{3}\) be an axis-parallel cuboid, then for any direction \(e_{j}\) with \(j \in \{1,2,3\}\) and any scale \(r>0\) such that \(\frac{b_{j}-a_{j}}{r} \in \mathbb{N}\) there is a continuous function \(f_{j}(\,\cdot \,;R,r):R \to \mathbb{R}^{3\times 3}\) satisfying the following properties:
-
The function \(f_{j}(\,\cdot \,;R,r)\) only depends on the \(j\)-th coordinate \(x_{j}\) and is \(r\)-periodic.
-
It holds \(f_{j}(x;R,r) = 0\) if \(x \in R\) is such that \(x = (x_{1},x_{2},x_{3})\) lies in one of the planes characterized by \(x_{j} \in a_{j} + r\mathbb{N}_{0}\).
-
The matrix-valued function \(\operatorname{curl} f_{j}(x;R,r)\) only attains the values \(\pm \frac{S_{j}-A_{j}}{2}\), where \(A_{j}\), \(S_{j}\) is given as in (14) and in (22), respectively. Furthermore, the volumes of the level sets are equal, i.e. \(|\{x \in R:f_{j}(x;R,r) = S_{j}\}| = |\{x \in R: f_{j}(x;R,r)=A_{j} \}| = \frac{1}{2}|R|\).
Proof
Without loss of generality by a translation, we may assume that \(R = [0,b_{1}] \times [0,b_{2}] \times [0,b_{3}]\) for \(b_{1},b_{2},b_{3} >0\). We consider the continuous one-periodic extension of the function
Furthermore, we define the matrices
satisfying \(e_{j} \times M_{j} = S_{j} - A_{j}\). With this at hand we define \(f_{j}(\,\cdot \,;R,r): R \to \mathbb{R}^{3\times 3}\):
It follows directly, that \(f_{j}(\,\cdot \,;R,r)\) is continuous, only depends on \(x_{j}\), is \(r\)-periodic, and vanishes for \(x_{j} \in r\mathbb{N}_{0}\). Lastly, we note that \(\operatorname{curl} f_{j}(x;R,r) = h'(\frac{x_{j}}{r}) e_{j} \times M_{j} \in \{\pm \frac{S_{j}-A_{j}}{2}\}\) and that indeed \(|\{x \in R:f_{j}(x;R,r) = S_{j}\}| = |\{x \in R: f_{j}(x;R,r)=A_{j} \}| = \frac{1}{2}|R|\). □
4.2.1 First Order Laminates
We use a potential \(v: \mathbb{R}^{3} \to \mathbb{R}^{3 \times 3}\) to construct our laminates attaining the prescribed exterior data, i.e. we consider the row-wise curl: \(u = \operatorname{curl} v\).
As \(S_{3} = \frac{1}{2} A_{1} + \frac{1}{2} S_{1} \) and \((S_{1} - A_{1}) e_{1} = 0\) the first order lamination is in the \(e_{1}\)-direction.
We seek to use Lemma 4.2 to construct \(v\), but have to adapt the boundary condition. For this we define a mapping \(\tilde{S}_{3}: \mathbb{R}^{3} \to \mathbb{R}^{3 \times 3}\) with \(\operatorname{curl} \tilde{S}_{3} = S_{3} = F\). A possible choice for \(\tilde{S}_{3}\) is given by the following matrix-valued function
Furthermore, we also define the cut-off function
With this, we define the (continuous) potential for \(r \in (0,\frac{1}{4})\), \(r^{-1} \in 4 \mathbb{N}\) by using Lemma 4.2 for \(j=1\), \(R=[0,1]^{3}\):
where \(d_{\partial \Omega}(x)\) denotes a smoothed-out distance function to the boundary \(\partial \Omega \). Without change of notation, we consider the (continuous) extension of \(v^{(1)}\) to \(\mathbb{R}^{3}\) by \(\tilde{S}_{3}(x)\), which is possible, as \(v^{(1)}(x) = \tilde{S}_{3}(x)\) on \(\partial \Omega \).
We then set
and note that in \(\Omega \) it holds
With these considerations, we have obtained the following properties: It holds \(\operatorname {div}u^{(1)} = \operatorname {div}(\operatorname{curl} v^{(1)}) = 0\) in \(\mathbb{R}^{3}\), \(u^{(1)}(x) = F\) in \(\mathbb{R}^{3}\setminus \overline{\Omega}\) and for \(x \in \Omega \setminus d_{\partial \Omega}^{-1}([0,\frac{3}{8}r])\) we have \(u^{(1)} \in \{A_{1},S_{1}\}\). In other words, our deformation \(u^{(1)}\) is a divergence-free function satisfying the desired boundary conditions and which, outside of the cut-off region, is a solution to the differential inclusion \(u^{(1)}\in \{A_{1},S_{1}\}\).
With the higher order laminates in mind we rephrase this using the decomposition of \(\Omega \) into the three disjoint parts consisting of the \(S_{1}\)-cells, the \(A_{1}\)-cells and the cut-off region. To be more precise, we define
Indeed it holds \(\Omega = R^{(1)} \cup Q^{(1)} \cup C^{(1)}\) and \(|C^{(1)}| = 6 \frac{3}{8}r\) and by Lemma 4.2 we know \(|R^{(1)}| \leq \frac{1}{2} |\Omega | = \frac{1}{2}\).
Choosing \(\chi ^{(1)} = \Pi _{\mathcal{K}} u^{(1)}\) as the pointwise orthogonal (with a fixed choice for the not uniquely defined points) projection of \(u^{(1)}\) onto \(\mathcal{K}\), see Fig. 2 for an illustration, up to a uniformly bounded constant, the elastic energy can be bounded by the measure of the region in which \(u^{(1)}(x) = S_{1}\) and the cut-off region:
Furthermore, the surface energy is bounded by counting the interfaces at which \(\chi ^{(1)}\) may jump. This consists of at most \(\frac{2}{r}\) interfaces in the interior and at most \(\frac{2}{r}+ 2 \cdot 6 \leq \frac{8}{r}\) new interfaces in the cut-off region. For \(r \in (0,\frac{1}{4})\), the surface area is thus controlled by
4.2.2 Second Order Lamination
After this first order lamination, the differential inclusion \(u \in \mathcal{K}\) with \(\operatorname {div}u = 0\) holds only in \(Q^{(1)}\). In order to further reduce the energy, we now replace each of the \(\frac{1}{r}\) many cuboids in \(R^{(1)}\) for which \(u^{(1)} = S_{1} = \frac{1}{2} A_{2} + \frac{1}{2} S_{2}\) by a lamination in the \(e_{2}\)-direction. For this, we modify the potential \(v^{(1)}\) in these regions:
For \(x \in R^{(1)}\) and for \(r^{2} \in (0,\frac{r}{2})\), \(r^{-2}+\frac{3}{4r} \in \mathbb{N}\), we define with the help of Lemma 4.2
and \(v^{(2)}(x) = v^{(1)}(x)\) else. Here, and in the following, we use the notation \(f_{j}(x;R^{(1)},r)\) for \(R^{(1)}\) which is not a cuboid but an union of disjoint cuboids and mean depending on \(x\) the corresponding connected cuboid. By this \(v^{(2)}\) defines a continuous map, as inside \(R^{(1)}\) the cut-off attains the constant value one and therefore \(v^{(1)}(x) = f_{1}(x;\Omega ,r) + \tilde{S}_{3}(x)\) for \(x \in \partial R^{(1)}\). By construction, the map \(u^{(2)} := \operatorname{curl} v^{(2)}\) is divergence free, i.e. the interfaces are compatible.
As in the construction for first order laminates we set \(\chi ^{(2)} = \Pi _{\mathcal{K}} u^{(2)}\), and define the sets
Then indeed again we have the decomposition \(R^{(1)} = R^{(2)} \cup Q^{(2)} \cup C^{(2)}\) with \(|R^{(2)}| \leq \frac{1}{2} |R^{(1)}|\) and \(|C^{(2)}| \leq \frac{1}{r} \frac{3}{8} r^{2} 6\) as the volume of the cut-off region can be bounded by the number of cells in \(R^{(1)}\) times \(\frac{3}{8}r^{2}\) times six times the area of the biggest face.
As the elastic energy vanishes in \(Q^{(2)}\), we obtain
Indeed, this follows from the fact that we have improved our deformation in half the volume of the region in which \(u^{(1)} \notin \mathcal{K}\) but have added a new cut-off region in each cuboid in which we do the second order lamination. For the surface energy it holds
as we add at most \(\frac{4}{r^{2}}+12 \leq \frac{10}{r^{2}}\) many new faces in each one of the \(\frac{1}{r}\) many cuboids and as each surface has a surface area of size at most \(\frac{r}{2}\).
4.2.3 Iteration: (m+1)-th Order
Without loss of generality, we assume that the \((m+1)\)-th order lamination will be in \(e_{1}\)-direction, i.e. \(m=3j\) for some \(j \in \mathbb{N}\). Else, we only have to adapt the corresponding roles of the directions.
We define iteratively the \((m+1)\)-th potential with the help of the sets \(R^{(m)}\), for this we set (for given \(v^{(m)}, u^{(m)} = \operatorname{curl} v^{(m)}\))
Inside \(R^{(m)}\) we then define \(v^{(m+1)}\) by
and \(v^{(m+1)}(x) = v^{(m)}(x)\) else for \(x \notin R^{(m)}\). By induction we see that \(v^{(m+1)}\) is continuous, that for \(x \in R^{(m)}\) such that \(\phi (\frac{d_{\partial R^{(m)}}(x)}{r^{m+1}})=1\) it holds that \(u^{(m+1)} := \operatorname{curl} v^{(m+1)} \in \{A_{1},S_{1}\}\) and that we have the decomposition \(R^{(m-1)} = R^{(m)} \cup Q^{(m)} \cup C^{(m)}\) with \(|R^{(m)}| \leq \frac{1}{2} |R^{(m-1)}|\), \(|C^{(m)}| \leq C 2^{-m} \frac{r^{m}}{r^{m-1}} = C 2^{-m} r\) for a universal constant \(C>0\) independent of \(m\), \(r\). This follows from the fact, that the number of cuboids in \(R^{(m)}\) is bounded by \(C \frac{ 2^{-m}}{r^{m-1}r^{m-2}r^{m-3}}\) and that the biggest face of each cuboid has an area of at most \(r^{m-2} r^{m-3}\).
As the previous cut-offs still contribute to the total energy, we obtain the following elastic energy bound (\(\chi ^{(m+1)} := \Pi _{ \mathcal{K}} u^{(m+1)}\))
This bound resembles the (iterative) decomposition of \(\Omega = R^{(1)} \cup Q^{(1)} \cup C^{(1)} = R^{(m+1)} \cup \bigcup _{k=1}^{m+1} Q^{(k)} \cup \bigcup _{k=1}^{m+1} C^{(k)}\) and the fact that \(u^{m+1} = \chi ^{(m+1)}\) in \(\bigcup _{k=1}^{m+1} Q^{(k)}\).
For the surface energy, we again calculate the new contribution of the next order lamination and then sum over all previous ones. Each new face has a surface area of at most \(\frac{r^{m}}{2} \frac{r^{m-1}}{2}\). In each \(S_{3}\)-cell, we add \(\frac{r^{m-2}}{r^{m+1}}\) new faces in the lamination and at most \(\frac{r^{m-2}}{r^{m+1}}+12\) ones for the cut-off, cf. Fig. 3. Since we have at most \(C \frac{2^{-m-1}}{r^{m} r^{m-1} r^{m-2}}\) new cells the surface energy is increased by
yielding the following overall surface energy bound:
For the total energy this implies
With this we have shown the claimed upper bound and have thus concluded the proof of Proposition 4.1. □
4.2.4 Combining the Estimates: Proof of the Upper Bound in Theorem 2
With the previous construction in hand, we conclude the upper estimate from Theorem 2:
Proposition 4.3
Let \(\Omega = [0,1]^{3}\), \(\mathcal{K} = \{A_{1},A_{2},A_{3}\}\) for \(A_{1},A_{2},A_{3} \in \mathbb{R}^{3 \times 3}\) be given in (14), let \(\epsilon \in (0,1)\), and let \(E_{\epsilon}\) be as in (12), \(F\in \mathcal{K}^{qc} \setminus \mathcal{K}\) and \(\mathcal{D}_{F}\) given in (10). Then there are constants \(c>0\) and \(C>1\), only depending on the boundary data \(F\), such that
Proof
Step 1: Conclusion of the argument for \(F=S_{3}\). Using the sequences constructed in the construction from Proposition 4.1, implies
Optimizing the value of \(r\) depending on \(\epsilon >0\), we require that \(r \sim \epsilon ^{\frac{1}{m+1}}\). Finally, we balance the resulting contributions and seek for the optimal number of iterations \(m\). This is given by \(2^{-m} \sim \epsilon ^{\frac{1}{m+1}}\), that is \(m \sim |\log \epsilon |^{\frac{1}{2}}\).
Plugging this into the upper bound results in
This concludes the proof for the special case \(F = S_{3}\).
Step 2: Conclusion of the argument for a general boundary datum. The situation of other boundary data \(F\in \mathcal{K}^{qc}\) can be reduced to the one from Proposition 4.3 by at most two further iterations. Indeed, a general matrix \(F\in \mathbb{R}^{3\times 3}\) in the convex hull of \(S_{1}\), \(S_{2}\), \(S_{3}\), can be represented as
with \(\lambda , \nu _{1}, \nu _{2} \in [0,1]\), \(j,k\in \{1,2,3\}\) and \(k\neq j\). Hence, after two additional iterations compared to the argument from above, we arrive at similar iterative procedures as in the previous subsections. In case that \(F\) is an element of one of the legs \(S_{j} A_{j}\) a single iteration suffices to reduce the situation to the above argument. This proves the result for a general boundary condition \(F\in \mathcal{K}^{qc} \setminus \mathcal{K}\). □
4.3 Proof of the Lower Bound
In this section, we present the proof of the lower bound from Theorem 2. To this end, similarly as in [2], we mimic and quantify the analogous argument from the stress-free setting which we briefly recall in the following Sect. 4.3.1 and for which we will provide a number of auxiliary results in Sect. 4.3.2. The main argument, given in Sect. 4.3.3, will then consist of a bootstrap strategy, similar to [2], in which we iteratively reduce the possible regions of mass concentration in Fourier space.
Contrary to the previous section, in what follows we will work in a periodic set-up. Since the energy contributions on periodic functions provides a lower bound on the energy contributions of functions with prescribed Dirichlet boundary conditions, we hence also obtain the desired lower bound for the setting of Dirichlet boundary conditions. Indeed, for the elastic energy this is immediate; for the surface energy there is at most an increase by a fixed factor (see the discussion in Lemma 4.5 in Sect. 4.3.2).
4.3.1 The Stress-Free Argument
We begin by recalling the argument for the rigidity of the exact inclusion, as we will mimic this on the energetic level.
Proposition 4.4
Let \(u: [0,1]^{3} \rightarrow \mathbb{R}^{3 \times 3}\) be a solution to the differential inclusion (13)-(14). Then, there exists \(j\in \{1,2,3\}\) such that \(u \equiv A_{j}\) in \([0,1]^{3}\).
Proof
In the exactly stress-free setting in which the differential inclusion is satisfied exactly, i.e. \(u \in \{A_{1},A_{2},A_{3}\}\), the observation that \(\operatorname {div}u=0\) and that \(u\) is a diagonal matrix leads to the following three equations
where \(u_{jj}\) denote the diagonal components of the matrix \(u\). As a consequence,
Next we note that the values of \(u_{jj}\) determine the ones for \(u_{kk}\) if \(j\neq k\), i.e. there are functions \(h_{k,j}\) such that \(h_{k,j}(u_{jj}) = u_{kk}\). Hence, comparing the functions \(u_{11}\) and \(u_{22}\), we first obtain that \(u_{11}\) and \(u_{22}\) can only be functions of \(x_{3}\). Indeed it holds
and analogously \(\partial _{1} u_{22}(x) = 0\). Comparing this to \(u_{33}\), we obtain that all three functions must be constant. Hence, any solution to the (exact) differential inclusion must be constant and \(u\) is equal to one of the three matrices \(A_{1}\), \(A_{2}\), \(A_{3}\) globally. The exact problem is hence rigid. □
Using the ideas from [2], we seek to turn this into a corresponding scaling result. The main difference that arises can be seen in the qualitative rigidity argument above: Instead of comparing only two diagonal entries like in [2], we have to compare twice to deduce that the map is constant. This will be seen in the quantitative argument for the lower bound below. Whereas in [2] there are cones around a single axis (the diagonal entries only depend on one variable), we consider cones around a plane (the diagonal entries depend on two variables). Furthermore, the bootstrap argument will be slightly modified as it resembles the comparison of the diagonal entries in the qualitative argument given above.
4.3.2 Reduction to the Periodic Setting and Auxiliary Results for the Elastic Energy
In this subsection, we provide a number of auxiliary results which we will exploit in the following bootstrap arguments for deducing the lower bound. As a first step, we reduce to the situation of periodic deformations.
Lemma 4.5
Let \(\Omega = [0,1]^{3}\), let \(\mathcal{K}:=\{A_{1},A_{2},A_{3}\}\) be as in (14) and \(F\in \mathcal{K}^{qc} \setminus \mathcal{K}\). Let \(E_{\epsilon}(u,\chi )\) be given by (15) and set
Let further \(\mathcal{D}_{F}^{per}:=\{u:\mathbb{R}^{3} \rightarrow \mathbb{R}^{3 \times 3}: \ \operatorname {div}u=0 \textit{ in } \mathbb{R}^{3}, \ \langle u \rangle = F \}\), where \(\langle u \rangle := \int _{\mathbb{T}^{3}} u(x) \mathrm{d}x\). Assume that \(E_{\epsilon}(u,\chi )\leq 1\) and that there is \(\epsilon _{0} >0\) such that for any \(\nu \in (0,\frac{1}{2})\) there is \(c_{\nu}>0\) such that for any \(\epsilon \in (0,\epsilon _{0})\) it holds that
Then, there exists a constant \(C>1\) such that for \(\tilde{\epsilon}_{0}=\tilde{\epsilon}_{0}(\nu )>0\) sufficiently small
for all \(\epsilon \in (0,\tilde{\epsilon}_{0})\).
Proof
In order to infer the lower bound, we show that any function \(u:[0,1]^{3} \rightarrow \mathbb{R}^{3}\) with constant boundary data can be associated with a suitable periodic function which has the boundary data of \(u\) as its mean value and satisfies related energy estimates. Indeed, for given \(u \in \mathcal{D}_{F}\), we view it as a function on \(\mathbb{T}^{3}\) by restriction. By the prescribed boundary data it still satisfies the differential constraint and further the mean value property. Moreover,
Next, viewing \(\chi :[0,1]^{3} \rightarrow \mathcal{K}\) as a periodic function \(\tilde{\chi}:\mathbb{T}^{3} \rightarrow \mathcal{K}\), we infer that
Now, due to (24), we obtain that
For \(\tilde{\epsilon}_{0}(\nu )>0\) sufficiently small, the last right hand side term may thus be absorbed into the left hand side, yielding the desired result. □
With this result in hand it suffices to consider the periodic set-up in the remainder of this section. This will, in particular, allow us to rely on the periodic Fourier transform in deducing lower bounds for the elastic energy. In what follows all (semi-)norms will thus be considered on the torus. With a slight abuse of notation, we will often omit this dependence.
Lemma 4.6
Let \(F \in \mathcal{K}^{qc}\setminus \mathcal{K}\), where \(\mathcal{K} = \{A_{1},A_{2},A_{3}\}\) with \(A_{j}\) in (14), and \(E_{el}^{per}\) and \(\mathcal{D}_{F}^{per}\) be as in Lemma 4.5and as in (10). Then, it holds for any \(\chi \in L^{2}(\mathbb{T}^{3};\mathcal{K})\) and for \(E_{el}^{per}(\chi ;F) := \inf _{u \in \mathcal{D}_{F}^{per}} \int _{ \mathbb{T}^{3}} \big| u- \chi \big|^{2} dx \)
where \(\chi _{i,i}\) are the diagonal entries of \(\chi \) and \(\hat{\chi}\) is the (discrete) Fourier transform of \(\chi \).
Proof
We first calculate \(E_{el}^{per}(u,\chi )\) in Fourier space
which allows us to characterize minimizers of the elastic energy.
In order to minimize this elastic energy in \(u \in \mathcal{D}_{F}^{per}\), \(\hat{u}\) has to be the (pointwise) orthogonal projection of \(\hat{\chi}\) onto the orthogonal complement of \(k\), as the differential constraint \(\operatorname {div}u = 0\) reads \(i \hat{u} k = 0\) in Fourier space. Noting that the row-wise orthogonal projection of a matrix \(M\) onto \(\operatorname {span}(k)\) can be written as \(\Pi _{k}(M) = (M \frac{k}{|k|}) \otimes \frac{k}{|k|}\), the optimal \(\hat{u}\) is given by
for any \(k \in \mathbb{Z}^{3} \setminus \{0\}\).
Returning to our energy, this yields
and shows the claim. □
Next, following the ideas from [2], for \(\mu ,\lambda >0\), \(j=1,2,3\), we introduce the cones
and their corresponding cut-off functions \(m_{j,\mu ,\lambda}(k) \in C^{\infty}(C_{j,2\mu ,2\lambda}\setminus \{0\};[0,1])\) fulfilling \(m_{j,\mu ,\lambda} = 1\) on \(C_{j,\mu ,\lambda}\), \(\operatorname {supp}(m_{j,\mu ,\lambda}(k))\subset C_{j,2\mu ,2\lambda}\) and the decay properties in Marcinkiewicz’s multiplier theorem (see, for instance, [84, Corollary 6.2.5]). The corresponding cut-off multiplier is thus defined by
Furthermore, we use the following results which are shown in [2, Lemma 2, Lemma 3, and Corollary 1]. Following the conventions from [75], with slight abuse of notation compared to our setting in the first part of the article, in the whole following section, we now use \(d\) to denote the degree of some suitable polynomials and no longer the dimension of the ambient space which in the whole section is simply fixed to be equal to three.
Lemma 4.7
Let \(\beta ,\delta , \mu ,\lambda >0\). Let \(m_{i,\mu ,\lambda}(D)\) denote the Fourier multipliers associated with the cones \(C_{i,\mu ,\lambda}\) for \(i\in \{1,2,3\}\) as defined in (25) with the corresponding multipliers given in (26). Let \(f_{i} \in L^{\infty}(\mathbb{T}^{3}) \cap BV(\mathbb{T}^{3})\) for \(i=1,2,3\) and let \(h_{j,i}: \mathbb{R} \to \mathbb{R}\) be nonlinear polynomials (of degree \(d\)) with \(h_{j,i}(0) = 0\) such that \(h_{j,i}(f_{i}) = f_{j}\) for \(i \neq j\). If
then there exist constants \(C = C(h_{i,j},\Vert f_{i} \Vert _{\infty})\), \(C' = C'(h_{i,j},\Vert f_{i} \Vert _{\infty}, d)\), \(C_{0} = C_{0}(h_{i,j}, \Vert f_{i} \Vert _{\infty},d)>0\) such that for any \(\gamma \in (0,1)\) we have for any \(i \neq j\)
Here we choose the constants such that \(C_{0} > 2C+2C^{\prime \,2}+3\).
Let us comment on these bounds: The functions \(f_{i}\) are representing the diagonal entries of the phase indicator \(\chi \in BV([0,1]^{3};\mathcal{K})\). Thus the first estimate corresponds to a first frequency localization by exploiting the surface energy control for the high frequencies and the ellipticity of the elastic energy away from the cones \(C_{j,\mu ,\lambda}\). It can be viewed as a quantified version of the statement that \(u_{11}\) is a function only depending on \(x_{2}\), \(x_{3}\) in Proposition 4.4. The second estimate is a commutator bound that arises from the nonlinear relation \(h_{i,j}(f_{j}) = f_{i}\) for \(i \neq j\). The third estimate combines the first two bounds. The second and third estimate will form the core tool to iteratively decrease the Fourier support of the characteristic functions of our phase indicators. We will detail this in the remainder of the article.
Remark 4.8
It is possible to make the mappings \(h_{j,i}\) for \(i,j \in \{1,2,3\}\), \(i \neq j\) explicit for the choice of matrices \(A_{1}\), \(A_{2}\), \(A_{3}\), cf. (14). To this end, we may, for instance, consider
4.3.3 Comparison Argument in Fourier Space
In this section, we carry out the iterative bootstrap argument which allows us to deduce the final rigidity result.
As a first step of the bootstrap argument, we invoke the results from above which allow us to decrease the region of potential Fourier concentration from the cone \(C_{1,\mu ,\lambda}\) to a cone \(C_{1,\mu ,\lambda '}\) with \(\lambda ' < \lambda \). This resembles (23) in the exactly stress-free setting in a quanitified version. As \(f_{1}\) is determined by \(f_{2}\) and \(f_{3}\) with the help of \(h_{1,j}\) we can reduce \(\lambda \), similarly as in our reduction of the dependences of \(u_{11}\) in Proposition 4.4.
Lemma 4.9
Under the same conditions as in Lemma 4.7and for
it holds
Here the constant \(C_{0}\) is chosen to be the same as in Lemma 4.7.
Remark 4.10
We remark that the interval for \(\lambda '\) is chosen such that it holds \(C_{1,2\mu ,2\lambda} \setminus C_{1,2\mu ,2\lambda '} \subset \{ \max \{|k_{2}|,|k_{3}|\} > 4 d \mu \lambda \} \cap C_{1,2\mu ,2 \lambda}\) and the one for \(\mu \) such that \(\frac{1}{\sqrt{2}} \frac{4d\lambda \mu}{\sqrt{1-4\mu ^{2}}} < \lambda \), i.e. the interval for \(\lambda '\) is non-empty.
Proof
We first observe that the Fourier transform of \(h_{i,j}(m_{j,\mu ,\lambda}(D)f_{j})\) is given by a convolution of the function \(m_{j,\mu ,\lambda} \hat{f}_{j}\) with itself. Hence, the support of \(h_{i,j}(m_{j,\mu ,\lambda}(D)f_{j})\) is contained in the \(d\)-fold Minkowski-sum of \(C_{j,2\mu ,2\lambda}\) with itself. For \(j=2,3\) it therefore holds that
Further we introduce the sets \(K_{2} = \{|k_{2}| > 4 d \mu \lambda \}\), \(K_{3} = \{|k_{3}| > 4 d \mu \lambda \}\) and consider the corresponding Fourier multipliers of the smoothed-out indicator functions \(\chi _{K_{2}}(D)\), \(\chi _{K_{3}}(D)\), \(\chi _{K_{2} \cup K_{3}}(D)\). This implies for \(i=2,3\)
With this we can show that the Fourier mass concentrates in a cone with smaller truncation parameter, depicted in Fig. 4: Indeed, in Fourier space \(\chi _{K_{2} \cup K_{3}} \leq \chi _{K_{2}} + \chi _{K_{3}}\). Thus,
Invoking (27) together with the bounds from Lemma 4.7, then implies for \(i=2,3\)
Combined with (28) this yields
We observe that by the choice of the parameters \(C_{1,2\mu ,2\lambda} \setminus C_{1,2\mu ,2\lambda '} \subset (K_{2} \cup K_{3}) \cap C_{1,2\mu ,2\lambda}\), and thus \(|m_{1,\mu ,\lambda}(k) - m_{1,\mu ,\lambda '}(k)| \leq \chi _{K_{2} \cup K_{3}}(k) m_{1,\mu ,\lambda}(k)\). Therefore,
In conclusion, (for \(C_{0} \geq C\))
□
Let us stress that the decomposition into \(K_{2}\) and \(K_{3}\) is in analogy to the two comparisons from Proposition 4.4 in order to show that \(u_{11}\) is constant. The set \(K_{2}\) resembles the comparison of \(u_{11}\) and \(u_{22}\) to show that \(u_{11}\) is constant in \(x_{2}\) and the set \(K_{3}\) resembles the comparison of \(u_{11}\) and \(u_{33}\) to show that \(u_{11}\) does not depend on \(x_{3}\).
Applying the previous result for all three directions simultaneously then yields the following corollary which will serve as the induction basis for the subsequent inductive bootstrap argument.
Corollary 4.11
Induction Basis
Let \(\beta ,\delta ,\mu ,\lambda >0\) and let \(C_{i,\mu ,\lambda}\) be the cones in (25) with corresponding multiplier \(m_{i,\mu ,\lambda}(D)\), cf. (26), for \(i=1,2,3\). Further let \(f_{i}\), \(h_{j,i}\) be functions as in Lemma 4.7. Let \(C_{0}>0\), \(\gamma \in (0,1)\) and \(d\geq 0\) be the constants from Lemma 4.7. For \(\lambda _{0} = \lambda > 0\), \(\mu \in (0,\frac{1}{2\sqrt{2d^{2}+1}})\), \(\ \lambda _{1} \in ( \frac{4d\mu \lambda _{0}}{\sqrt{2}\sqrt{1-4\mu ^{2}}},\lambda _{0})\) it holds that
With Corollary 4.11 in hand, we now iteratively further decrease the Fourier supports. To this end, we will invoke the commutator bounds from Lemma 4.7.
Lemma 4.12
Iteration process
Let \(\beta ,\delta >0\) and let \(C_{i,\mu ,\lambda}\) be the cones in (25) with corresponding multiplier \(m_{i,\mu ,\lambda}(D)\), cf. (26), for \(i=1,2,3\). Further let \(\mu \in (0, \frac{1}{2 \sqrt{2d^{2}+1}})\), \(\lambda >0\), and let \(f_{i}\), \(h_{j,i}\) be functions as in Lemma 4.7. Let \(C_{0}>0\), \(\gamma \in (0,1)\) and \(d\geq 0\) be the constants from Lemma 4.7. Let \(\lambda _{k} > 0\) be a sequence for \(k \in \mathbb{N}\) with \(\lambda _{0} = \lambda \) and \(\lambda _{k} \in ( \frac{4d\mu \lambda _{k-1}}{\sqrt{2}\sqrt{1-4\mu ^{2}}},\lambda _{k-1})\). It then holds for every \(k \in \mathbb{N}\setminus \{0\}\)
Proof
We prove the statement by induction on \(k\) with the induction basis given by Corollary 4.11. Assume that for some arbitrary but fixed \(k \in \mathbb{N}\) it holds
Now for the induction step \(k \mapsto k+1\) we carry out the same argument as above to show
We argue as in the induction basis and present the calculations only for the first term on the left hand side of (30). By the triangle inequality,
The first contribution is already of the desired form. It thus remains to consider the second contribution. For this we consider an analogous argument as before: Let \(K_{2}^{k} := \{|k_{2}| > 4 d \mu \lambda _{k}\}\), \(K_{3}^{k} := \{|k_{3}| > 4 d \mu \lambda _{k}\}\). Then, since \(\chi _{K_{j}^{k}}(D) h_{1,j}(m_{j,\mu ,\lambda _{k}}(D) f_{j}) = 0\) on \(K_{j}^{k}\),
Again for the second right hand side contribution in (31) we use Lemma 4.7:
Using the inductive hypothesis (29), overall, we arrive at the following upper bound
Arguing symmetrically for \(f_{2}\) and \(f_{3}\) then yields:
Using that \(2C^{\prime \,2} \leq C_{0}\), \(1-\gamma \in (0,1)\) and that \(\frac{C_{0}}{\gamma ^{24 d}} \geq 3\), leads to
This concludes the proof. □
Now with the inductive procedure of reducing regions of Fourier space concentration in hand, we turn to the proof of the lower bound in Theorem 2 and the argument for Proposition 1.2.
Proof of the lower bound in Theorem 2 and proof of Proposition 1.2
We argue in two steps, first fixing the free parameters and then exploiting the boundary conditions.
Step 1: Choice of parameters and proof of Proposition 1.2. We seek to invoke Lemma 4.12 expressing the bounds in terms of our energies, i.e. setting \(\delta = E_{el}^{per}(\chi ;F):=\inf _{u \in \mathcal{D}_{F}} E_{el}^{per}(u, \chi )\), \(\beta = E_{surf}^{per}(\chi )\), and \(\lambda ^{-1} = \mu ^{-2} \epsilon \). Moreover, we use the notation
By virtue of Lemma 4.6 we obtain that for \(f_{i} := \chi _{i,i}\) indeed \(\sum _{j=1}^{3} \Vert \partial _{i} f_{i} \Vert _{\dot{H}^{-1}}^{2} \leq \delta \). It follows directly that also \(\sum _{i=1}^{3} \Vert \nabla f_{i} \Vert _{TV} \leq \beta \) and therefore the conditions in Lemma 4.7 are fulfilled. As a consequence, the above iteration in Lemma 4.12 is applicable. With this in mind, we choose \(\mu = \epsilon ^{\alpha}\) for some \(\alpha >0\) to be specified. Therefore,
and thus since without loss of generality \(E_{\epsilon}^{per}(\chi ;F) := \inf _{u \in \mathcal{D}_{F}} E_{el}^{per}(u, \chi ) + \epsilon E_{surf}^{per}(\chi ) \leq 1\) (having the upper bound from Proposition 4.3 in mind), we deduce
We further choose
where \(M=M(d) = 2\sqrt{2d^{2}+1} > 2\). This is admissible in the sense of the assumptions in Lemma 4.9 since \(\frac{\lambda _{k}}{\lambda _{k-1}} = 2\sqrt{2d^{2}+1}\mu \in ( \frac{1}{\sqrt{2}}\frac{4d\mu}{\sqrt{1-4\mu ^{2}}},1)\).
Next, for \(\alpha = \alpha (\epsilon ) \in (0,1)\) and \(\epsilon >0\) we choose \(k \in \mathbb{N}\) to be given by
This ensures that \(\lambda _{k} \leq \frac{1}{2}\). In what follows, we will choose the parameters \(\epsilon \), \(\alpha \) such that \(\frac{\log M}{|\log \epsilon |}\leq \frac{\alpha}{2}\) which implies \(k \leq \frac{4}{\alpha}\). Exploiting the discrete Fourier transform, then yields
and hence, by (33) results in the estimate
Using that \((1-\gamma )^{\frac{4}{\alpha}} \geq 1 - \frac{4}{\alpha}\gamma \), we set \(\gamma := \frac{\alpha}{8} \in (0,1)\) which leads to the bound
Next, we fix the parameter \(\alpha >0\): Observing that for any \(\nu >0\) there exists \(\alpha _{0}>0\) such that for \(\alpha \in (0,\alpha _{0})\) it holds that
we choose \(\alpha = |\log \epsilon |^{-\frac{1}{2+\nu}}\) for all \(\epsilon \in (0,\epsilon _{0})\) with \(\epsilon _{0}>0\) still to be chosen. In particular, for \(\epsilon _{0}>0\) sufficiently small, such that \(|\log \epsilon |^{\frac{1}{2}} \geq 2 \log M\), (34) holds and also \(\frac{\log M}{|\log \epsilon |}\leq \frac{\alpha}{2}\) holds as required above.
As a consequence, for \(\nu ':=\frac{\nu}{4+2\nu} \in (0,\frac{1}{2})\) we arrive at
Step 2: Conclusion. In order to conclude the estimate, we derive a lower bound for \(\sum _{j=1}^{3} \Vert f_{j} - \langle f_{j} \rangle \Vert _{L^{2}}^{2}\). For this we recall that \(f_{j} = \chi _{j,j}\) is the \(j\)-th diagonal entry of the phase indicator \(\chi \) and hence \(\sum _{j=1}^{3} \Vert f_{j} - \langle f_{j} \rangle \Vert _{L^{2}}^{2} = \Vert \chi - \langle \chi \rangle \Vert _{L^{2}}^{2}\). Thus, by the mean value condition in \(\mathcal{D}_{F}^{per}\),
and, furthermore, as the left hand side is independent of \(u\),
Overall this implies for the total energy \(E_{\epsilon}^{per}(\chi ;F) = E_{el}^{per}(\chi ;F) + \epsilon E_{surf}^{per}( \chi )\)
Finally, solving for \(E_{\epsilon}^{per}(\chi ;F)\) shows the desired estimate
The desired claim follows by an application of Lemma 4.5. □
Data Availability
Data sharing not applicable to this article as no datasets were generated or analysed during the current study.
References
Chan, A., Conti, S.: Energy scaling and branched microstructures in a model for shape-memory alloys with \(SO(2)\) invariance. Math. Models Methods Appl. Sci. 25(06), 1091–1124 (2015)
Rüland, A., Tribuzio, A.: On the energy scaling behaviour of a singularly perturbed Tartar square. Arch. Ration. Mech. Anal. 243(1), 401–431 (2022)
Garroni, A., Nesi, V.: Rigidity and lack of rigidity for solenoidal matrix fields. Proc. R. Soc. Lond., Ser. A, Math. Phys. Eng. Sci. 460(2046), 1789–1806 (2004). https://doi.org/10.1098/rspa.2003.1249
Palombaro, M., Ponsiglione, M.: The three divergence free matrix fields problem. Asymptot. Anal. 40(1), 37–49 (2004)
Müller, S., Šverák, V.: Convex integration for Lipschitz mappings and counterexamples to regularity. Ann. Math. 157(3), 715–742 (2003)
Kirchheim, B., Müller, S., Šverák, V.: Studying nonlinear PDE by geometry in matrix space. In: Geometric Analysis and Nonlinear Partial Differential Equations, pp. 347–395. Springer, Berlin (2003)
Székelyhidi, L. Jr: The regularity of critical points of polyconvex functionals. Arch. Ration. Mech. Anal. 172(1), 133–152 (2004)
De Lellis, C., Székelyhidi, L. Jr.: The Euler equations as a differential inclusion. Ann. Math. 170(3), 1417–1436 (2009)
De Lellis, C., Székelyhidi, L. Jr.: The \(h\)-principle and the equations of fluid dynamics. Bull. Am. Math. Soc. 49(3), 347–375 (2012)
Kuiper, N.H.: On \(C^{1}\)-isometric imbeddings. II. In: Indagationes Mathematicae (Proceedings), vol. 58, pp. 683–689. Elsevier, Amsterdam (1955)
Nash, J.: \(C^{1}\) isometric imbeddings. Ann. Math. 60(3), 383–396 (1954)
Conti, S., De Lellis, C., Székelyhidi, L. Jr.: h-principle and rigidity for \(C^{1, \alpha }\) isometric embeddings. In: Nonlinear Partial Differential Equations, pp. 83–116. Springer, Berlin (2012)
De Lellis, C., Székelyhidi, L. Jr.: On h-principle and Onsager’s conjecture. Newsl. - Eur. Math. Soc. 95, 19–24 (2015)
Ball, J.M., James, R.D.: Fine phase mixtures as minimizers of energy. In: Analysis and Continuum Mechanics, pp. 647–686. Springer, Berlin (1989)
Ball, J.M., James, R.D.: Proposed experimental tests of a theory of fine microstructure and the two-well problem. Philos. Trans. R. Soc. Lond. A, Math. Phys. Eng. Sci. 338(1650), 389–450 (1992)
Bhattacharya, K., Firoozye, N.B., James, R.D., Kohn, R.V.: Restrictions on microstructure. Proc. R. Soc. Edinb., Sect. A, Math. 124(5), 843–878 (1994)
Müller, S., Šverák, V.: Convex integration with constraints and applications to phase transitions and partial differential equations. J. Eur. Math. Soc. 1(4), 393–422 (1999)
Müller, S.: Variational models for microstructure and phase transitions. In: Calculus of Variations and Geometric Evolution Problems, pp. 85–210. Springer, Berlin (1999)
Rüland, A.: The cubic-to-orthorhombic phase transition: rigidity and non-rigidity properties in the linear theory of elasticity. Arch. Ration. Mech. Anal. 221(1), 23–106 (2016)
Palombaro, M.: Rank-\((n-1)\) convexity and quasiconvexity for divergence free fields. Adv. Calc. Var. 3(3), 279–285 (2010)
Palombaro, M., Smyshlyaev, V.P.: Relaxation of three solenoidal wells and characterization of extremal three-phase H-measures. Arch. Ration. Mech. Anal. 194(3), 775–822 (2009)
Kohn, R.V., Wirth, B.: Optimal fine-scale structures in compliance minimization for a uniaxial load. Proc. R. Soc. A, Math. Phys. Eng. Sci. 470(2170), 20140432 (2014)
Kohn, R.V., Wirth, B.: Optimal fine-scale structures in compliance minimization for a shear load. Commun. Pure Appl. Math. 69(8), 1572–1610 (2016)
Potthoff, J., Wirth, B.: Optimal fine-scale structures in compliance minimization for a uniaxial load in three space dimensions (2021). arXiv:2111.06910
Kohn, R.V., DeSimone, A., Otto, F., Müller, S.: Recent analytical developments in micromagnetics. Sci. Hyst. 2, 269–381 (2006)
DeSimone, A., Knüpfer, H., Otto, F.: 2-d stability of the Néel wall. Calc. Var. Partial Differ. Equ. 27(2), 233–253 (2006)
De Philippis, G., Palmieri, L., Rindler, F.: On the two-state problem for general differential operators. Nonlinear Anal. 177, 387–396 (2018)
Sorella, M., Tione, R.: The four-state problem and convex integration for linear differential operators (2021). arXiv:2107.10785
Skipper, J.W.D., Wiedemann, E.: Lower semi-continuity for \(\mathcal{A}\)-quasiconvex functionals under convex restrictions (2019). arXiv:1909.11543
Bhattacharya, K.: Microstructure of Martensite. Oxford Series on Materials Modelling. Oxford University Press, London (2003)
Bhattacharya, K.: Comparison of the geometrically nonlinear and linear theories of martensitic transformation. Contin. Mech. Thermodyn. 5(3), 205–242 (1993)
Warner, M., Terentjev, E.M.: Liquid Crystal Elastomers, vol. 120. Oxford University Press, London (2007)
Cesana, P., Della Porta, F., Rüland, A., Zillinger, C., Zwicknagl, B.: Exact constructions in the (non-linear) planar theory of elasticity: from elastic crystals to nematic elastomers. Arch. Ration. Mech. Anal. 237(1), 383–445 (2020)
Lamy, X., Lorent, A., Peng, G.: Rigidity of a non-elliptic differential inclusion related to the Aviles–Giga conjecture. Arch. Ration. Mech. Anal. 238(1), 383–413 (2020)
Lamy, X., Lorent, A., Peng, G.: On a generalized Aviles-Giga functional: compactness, zero-energy states, regularity estimates and energy bounds (2022). arXiv:2203.05418
Tartar, L.: Compensated compactness and applications to partial differential equations. In: Nonlinear Analysis and Mechanics: Heriot-Watt Symposium, vol. 4, pp. 136–212 (1979)
DiPerna, R.J.: Compensated compactness and general systems of conservation laws. Trans. Am. Math. Soc. 292(2), 383–420 (1985)
Tartar, L.: The compensated compactness method applied to systems of conservation laws. In: Systems of Nonlinear Partial Differential Equations, pp. 263–285. Springer, Berlin (1983)
Murat, F., Tartar, L.: H-convergence. In: Topics in the Mathematical Modelling of Composite Materials, pp. 21–43. Springer, Berlin (2018)
Müller, S.: Rank-one convexity implies quasiconvexity on diagonal matrices. Int. Math. Res. Not. 1999(20), 1087–1095 (1999)
Faraco, D., Székelyhidi, L. Jr.: Tartar’s conjecture and localization of the quasiconvex hull in \(\mathbb{R}^{2\times 2}\). Acta Math. 200(2), 279–305 (2008)
Guerra, A., Raiţă, B., Schrecker, M.R.I.: Compensated compactness: continuity in optimal weak topologies. J. Funct. Anal. 283(7), 109596 (2022)
Arroyo-Rabasa, A., De Philippis, G., Rindler, F.: Lower semicontinuity and relaxation of linear-growth integral functionals under PDE constraints. Adv. Calc. Var. 13(3), 219–255 (2020)
Raiţă, B.: Potentials for \(\mathcal{A}\)-quasiconvexity. Calc. Var. Partial Differ. Equ. 58(3), 105 (2019)
Guerra, A., Raiţă, B., Schrecker, M.: Compensation phenomena for concentration effects via nonlinear elliptic estimates (2021). arXiv:2112.10657
Raiţă, B.: A simple construction of potential operators for compensated compactness (2021). arXiv:2112.11773
Behn, L., Gmeineder, F., Schiffer, S.: On symmetric div-quasiconvex hulls and divsym-free \(L^{\infty}\)-truncations (2021). arXiv:2108.05757
Fonseca, I., Müller, S.: \(\mathcal{A}\)-Quasiconvexity, lower semicontinuity, and young measures. SIAM J. Math. Anal. 30(6), 1355–1390 (1999)
Kristensen, J., Raiţă, B.: An introduction to generalized Young measures. Max-Planck-Institut für Mathematik in den Naturwissenschaften Leipzig 45 (2020)
Conti, S., Gmeineder, F.: \(A\)-Quasiconvexity and Partial Regularity (2020). arXiv:2009.13820
Gmeineder, F., Lewintan, P., Neff, P.: Optimal incompatible Korn-Maxwell-Sobolev inequalities in all dimensions (2022). arXiv:2206.10373
De Philippis, G., Rindler, F.: On the structure of \(\mathcal{A}\)-free measures and applications. Ann. Math. 184(3), 1017–1039 (2016)
Arroyo-Rabasa, A., De Philippis, G., Hirsch, J., Rindler, F.: Dimensional estimates and rectifiability for measures satisfying linear PDE constraints. Geom. Funct. Anal. 29(3), 639–658 (2019)
Breit, D., Diening, L., Gmeineder, F.: On the trace operator for functions of bounded \(\mathbb{A}\)-variation. Anal. PDE 13(2), 559–594 (2020)
Dacorogna, B.: Direct Methods in the Calculus of Variations, vol. 78. Springer, Berlin (2007)
Dacorogna, B., Marcellini, P.: Implicit Partial Differential Equations, vol. 37. Springer, Berlin (2012)
Pedregal, P.: Parametrized Measures and Variational Principles. Springer, Berlin (1997)
Chaudhuri, N., Müller, S.: Rigidity estimate for two incompatible wells. Calc. Var. Partial Differ. Equ. 19(4), 379–390 (2004)
De Lellis, C., Székelyhidi, L. Jr.: Simple proof of two-well rigidity. C. R. Math. 343(5), 367–370 (2006)
Friesecke, G., James, R.D., Müller, S.: A theorem on geometric rigidity and the derivation of nonlinear plate theory from three-dimensional elasticity. Commun. Pure Appl. Math. 55(11), 1461–1506 (2002)
Lamy, X., Lorent, A., Peng, G.: Quantitative rigidity of differential inclusions in two dimensions (2022). arXiv:2208.08526
Kohn, R.V., Müller, S.: Branching of twins near an austenite—twinned-martensite interface. Philos. Mag. A 66(5), 697–715 (1992)
Kohn, R.V., Müller, S.: Surface energy and microstructure in coherent phase transitions. Commun. Pure Appl. Math. 47(4), 405–435 (1994)
Barroso, A.C., Matias, J., Santos, P.M.: Differential inclusions and \(\mathcal{A}\)-quasiconvexity. Mediterr. J. Math. 3(14), 1–14 (2017)
Van Schaftingen, J.: Limiting sobolev inequalities for vector fields and canceling linear differential operators. J. Eur. Math. Soc. 15(3), 877–921 (2013)
Choksi, R., Kohn, R.V., Otto, F.: Domain branching in uniaxial ferromagnets: a scaling law for the minimum energy. Commun. Math. Phys. 201(1), 61–79 (1999)
Conti, S., Diermeier, J., Melching, D., Zwicknagl, B.: Energy scaling laws for geometrically linear elasticity models for microstructures in shape memory alloys. ESAIM Control Optim. Calc. Var. 26, 115 (2020)
Kohn, R.V.: Energy-driven pattern formation. In: International Congress of Mathematicians, vol. 1, pp. 359–383 (2007)
Conti, S., Kohn, R.V., Misiats, O.: Energy minimizing twinning with variable volume fraction, for two nonlinear elastic phases with a single rank-one connection. Math. Models Methods Appl. Sci. 32(08), 1671–1723 (2022). https://doi.org/10.1142/S0218202522500397
Knüpfer, H., Kohn, R.V.: Minimal energy for elastic inclusions. Proc. R. Soc. A, Math. Phys. Eng. Sci. 467(2127), 695–717 (2011)
Knüpfer, H., Kohn, R.V., Otto, F.: Nucleation barriers for the cubic-to-tetragonal phase transformation. Commun. Pure Appl. Math. 66(6), 867–904 (2013)
Rüland, A.: A rigidity result for a reduced model of a cubic-to-orthorhombic phase transition in the geometrically linear theory of elasticity. J. Elast. 123(2), 137–177 (2016)
Capella, A., Otto, F.: A quantitative rigidity result for the cubic-to-tetragonal phase transition in the geometrically linear theory with interfacial energy. Proc. R. Soc. Edinb., Sect. A, Math. 142(2), 273–327 (2012)
Capella, A., Otto, F.: A rigidity result for a perturbation of the geometrically linear three-well problem. Commun. Pure Appl. Math. 62(12), 1632–1669 (2009)
Rüland, A., Tribuzio, A.: On the energy scaling behaviour of singular perturbation models involving higher order laminates (2021). arXiv:2110.15929
Rüland, A., Tribuzio, A.: On scaling laws for multi-well nucleation problems without gauge invariances (2022). arXiv:2206.05164
Rüland, A., Taylor, J.M., Zillinger, C.: Convex integration arising in the modelling of shape-memory alloys: some remarks on rigidity, flexibility and some numerical implementations. J. Nonlinear Sci. 29(5), 2137–2184 (2019)
Tartar, L.: Some remarks on separately convex functions. In: Microstructure and Phase Transition, pp. 191–204. Springer, Berlin (1993)
Chlebík, M., Kirchheim, B.: Rigidity for the four gradient problem. J. Reine Angew. Math. 2002(551), 1–9 (2002)
Otto, F., Viehmann, T.: Domain branching in uniaxial ferromagnets: asymptotic behavior of the energy. Calc. Var. Partial Differ. Equ. 38(1), 135–181 (2010)
Diermeier, J.: Domain branching in geometrically linear elasticity (2013)
Winter, M.: An example of microstructure with multiple scales. Eur. J. Appl. Math. 8(2), 185–207 (1997)
Chipot, M.: The appearance of microstructures in problems with incompatible wells and their numerical approach. Numer. Math. 83(3), 325–352 (1999)
Grafakos, L.: Classical Fourier Analysis. Graduate Texts in Mathematics, vol. 249. Springer, Berlin (2014)
Acknowledgements
A.R. and C.T. gratefully acknowledge support through the Heidelberg STRUCTURES Excellence Cluster which is funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy EXC 2181/1 - 390900948.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing Interests
The authors have no competing interests to declare that are relevant to the content of this article.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix A: Branching, Upper Bound for the Two-State Problem for the Divergence Operator
We complement our lower bounds for the compatible two-well problem from Sect. 3 by an upper bound in the case of the divergence operator acting on matrix fields as introduced in Example 1.1. For simplicity, we only consider square matrices, i.e. \(m = d\), and like before in Sect. 3 only consider the case \(d>1\) in the following. For earlier, closely related, three-dimensional constructions in the context of compliance minimization problems we refer to [24]. While the \(\epsilon ^{\frac{2}{3}}\) construction is by now rather “standard” [1, 62, 63, 80], our argument does provide a slightly different perspective, in that, in arbitrary dimension, we can ensure boundary conditions on all faces of the domain \(\Omega =[0,1]^{d}\) (see the upper bound construction for [76, Theorem 3] for a similar construction for the gradient).
In deducing the upper bound for the divergence operator, we first provide a construction in a unit cell (Lemma A.1) and iterate this construction (Proposition A.3). This yields a construction which attains the boundary data in two directions. In order to attain these also on the remaining sides we use the flexibility of the wave cone for the divergence operator (Proposition A.5). We remark that in two dimensions there would be no modification with respect to the gradient construction since there the curl and divergence only differ by a rotation of 90 degrees.
In our unit cell branching construction, we do not work on the level of the potential, but directly consider the problem on the level of the wells. In this context, we recall the compatibility conditions for laminates formed by the divergence operator which is determined by the associated wave cone: For \(M\in \mathbb{R}^{d\times d}\), we have that \(M \in \ker \mathbb{A}(\xi )\) if and only it holds \(M \xi = 0\).
With this in hand, we introduce an auxiliary matrix which will play an important role in our construction: Let \(A,B \in \mathbb{R}^{d \times d}\) be such that \((B-A)e_{1} =0\) and let \(n = e_{1} + \gamma _{2} \nu \) for a unit vector \(\nu \) perpendicular to \(e_{1}\) and for some \(\gamma _{2} \in \mathbb{R}\), \(\gamma _{2} \neq 0\). We then define \(E_{\nu}\) by
and the associated “perturbed” matrix \(\tilde{A}_{\nu }= A + E_{\nu}\). By construction, this matrix obeys the identities
This in particular allows for interfaces of \(\tilde{A}_{\nu}\) and \(A\) with normal \(\nu \) and of \(\tilde{A}_{\nu}\) and \(B\) with normal \(n\) which we will use in our branching construction below.
Lemma A.1
For \(0< l< h \leq 1\), we define \(\omega = [0,l] \times [0,h] \times [0,1]^{d-2}\). Let \(A,B \in \mathbb{R}^{d \times d}\) be such that \((B - A)e_{1} = 0\) and let \(F_{\lambda }= \lambda A + (1-\lambda )B\) for some \(\lambda \in (0,1)\). Then there exists \(u: \mathbb{R}^{d} \to \mathbb{R}^{d\times d}\) such that
Furthermore, there exist \(\chi \in BV(\omega ;\{A,B\})\) and a constant \(C = C(A,B) > 0\) such that for any \(\epsilon >0\) the localized energy can be bounded by
Proof
We consider the following partition of the domain \(\omega \) into subdomains:
Based on this we define
where \(E_{e_{2}}\) is given in (A.1) for \(n = e_{1} - \frac{(1-\lambda )l}{2h} e_{2}\). We highlight that \(u\) is independent of \(x_{k}\) for \(k \geq 3\). By definition of \(E_{e_{2}}\), the characterization of the wave cone (5) for the divergence operator and the remarks on laminates in (A.2), this defines an divergence-free mapping. Further, as \((B- F_{\lambda})e_{1} = (A-F_{\lambda})e_{1} = 0\), the exterior data are attained in \(x_{1} \in (-\infty ,0)\cup (l,\infty )\).
To calculate the energy, we observe, that the only contribution to the elastic energy is given in \(\omega _{3}\). Hence,
As the surface energy is determined by the interfaces between \(\omega _{j}\) and \(\omega _{k}\), we obtain (\(l < h\))
This shows the claim. □
As for analogous constructions for the gradient, we will use this unit cell as a building block in order to achieve a self-similar construction attaining the boundary data on two directions. For this to be admissible in the sense of an \(\mathcal{A}\)-free map, we rely on the following lemma. It shows that for first order operators corners in which interfaces meet do not give rise to singularities.
Lemma A.2
Let \(d,n,m \in \mathbb{N}\) and let \(\mathcal{A}(D) = \sum _{j=1}^{d} A_{j} \partial _{j}: C^{\infty}( \mathbb{R}^{d};\mathbb{R}^{n}) \to C^{\infty}(\mathbb{R}^{d}; \mathbb{R}^{m})\) be a homogeneous, linear operator of degree one with symbol \(\mathbb{A}\) given in (3) and let \(\Omega _{j}\), \(j=1,\dots ,l\), be a polygonal set (the set is defined as the intersection of half spaces) with outer unit normal \(n_{j}\) such that
-
\(\mathbb{R}^{d} = \bigcup _{j=1}^{l} \Omega _{j}\),
-
the two sets \(\Omega _{j}\), \(\Omega _{j+1}\) have one common face (\(l+1 = 1\)),
-
and such that they meet in one point, i.e. \(\bigcap _{j=1}^{l} \Omega _{j} = \{x_{0}\}\),
cf. Fig. A.2. Assume further that \(B_{j} \in \mathbb{R}^{n}\), \(j=1,\dots ,l\), are such that \(B_{j} - B_{j+1} \in \ker \mathbb{A}(n_{j})\). Then the map
is \(\mathcal{A}\)-free.
Proof
First we note that \(u\) is indeed well-defined by the properties of \(\Omega _{j}\), further we notice that for \(M \in \mathbb{R}^{n}\) and \(U \subset \mathbb{R}^{d}\) a Lipschitz domain it holds for \(\varphi \in C^{\infty}_{c}(\mathbb{R}^{d};\mathbb{R}^{m})\)
With this it holds
Moreover on \(\partial \Omega _{j} \cap \partial \Omega _{j+1}\) it holds that \(n_{j+1} = - n_{j} \) and thus by the assumptions on \(B_{j}\)
As \(\varphi \) was arbitrary the claim follows. □
With Lemma A.2 in hand, we now iterate the unit cell-construction from Lemma A.1, as illustrated in Fig. A.1.
Self-similar construction in Proposition A.3. (Color figure online)
Setting of Lemma A.2
Proposition A.3
Let \(d,n \in \mathbb{N}\). Let \(\Omega = [0,1]^{d} \), let \(A,B \in \mathbb{R}^{n}\) be such that \(B-A \in \Lambda _{\operatorname {div}}\), cf. (5), and let \(F_{\lambda }= \lambda A + (1-\lambda ) B\) for some \(\lambda \in (0,1)\). Let \(E_{\epsilon}\) be as in (12). Then there exist \(u: \mathbb{R}^{d} \to \mathbb{R}^{d \times d}\) and \(\chi \in BV(\Omega ;\{A,B\})\) with \(\operatorname {div}u = 0\) in \(\mathbb{R}^{d}\) and \(u = F_{\lambda}\) for \((x_{1},x_{2}) \notin [0,1]^{2}\) such that for any \(\epsilon \in (0,1)\) and any \(N \in \mathbb{N}\)
for some constant \(C = C(A,B,\lambda ) > 0\).
Proof
Without loss of generality we may assume \((B-A)e_{1} = 0\), i.e. \(B-A \in \ker \mathbb{A}(e_{1})\) for \(\mathcal{A}(D) = \operatorname {div}\). Let \(\theta \in (\frac{1}{4},\frac{1}{2})\). We argue symmetrically in the upper and lower half of the cube, i.e. we give the construction of \(u\) on \([0,1] \times [\frac{1}{2},1] \times [0,1]^{d-2}\) and define \(u\) on the lower half by symmetry. We define for \(N \in \mathbb{N}\) and for \(j \in \mathbb{N}_{0}\)
Furthermore, let \(j_{0} \in \mathbb{N}\) be the maximal \(j \in \mathbb{N}\) such that \(l_{j} < h_{j}\). We set
for \(k \in \{0,1,\dots ,N2^{j}-1\}\), \(j \in \{0,1,\dots ,j_{0}\}\); for \(k \in \{0,1,\dots ,N 2^{j_{0}+1}-1\}\), \(j = j_{0}+1\) we set
Let \(u_{j}\), \(\chi _{j}\) in \([0,l_{j}] \times [0,h_{j}] \times [0,1]^{d-2}\) be given by Lemma A.1 for \(j=1,\dots ,j_{0}\). Further, in the layer \(j=j_{0}+1\) we interpolate with the desired boundary data by a cut-off argument: To this end, we introduce the cut-off function \(\phi : [0,\infty ) \to [0,1]\) and the profile \(h:[0,1] \to [0,\infty )\) by setting
We consider the function \(\tilde{u}_{j_{0}+1}: [0,l_{j_{0}}] \times [0, \frac{\theta ^{j_{0}+1}}{2}] \times [0,1]^{d-2}\) defined via
The associated phase indicator is defined by
We note that \(\chi _{j_{0}+1}(x) = h'(\frac{x_{1}}{l_{j_{0}+1}}) (A-B) + F_{ \lambda}\) and moreover for \(x_{2} < \frac{1}{2}\) it holds \(\tilde{u}_{j_{0}+1}(x) = \chi _{j_{0}+1}(x) \) and for \(x_{2} > \frac{3}{4}\) correspondingly \(\tilde{u}_{j_{0}+1}(x) = F_{\lambda}\). Furthermore, for \(x_{1} \in \{0,l_{j_{0}+1}\}\), we know \(h(\frac{x_{1}}{l_{j_{0}+1}}) = 0\) and thus \((\tilde{u}_{j_{0}+1}(x) - F_{\lambda}) e_{1} = \phi ( \frac{2 x_{2}}{\theta ^{j_{0}+1}}) h'\Big(\frac{x_{1}}{l_{j_{0}+1}} \Big) (A-B)e_{1} =0 \).
With the help of this construction we meet the prescribed data for \(x_{2} \geq 1\), and we can define \(u\) in the upper half of the full cube:
For the lower half of the cube we argue similarly, mirroring the unit cell construction of Lemma A.1, i.e. instead of \(E_{e_{2}}\) we consider \(E_{-e_{2}}\). We define \(\chi \) in \([0,1]^{d}\) analogously.
We note, that this defines a divergence free mapping, as all the laminations are in compatible directions as \((B-A)e_{1} = 0\) and by the choice of \(E_{e_{2}}\) in (A.1). Lemma A.2 shows, that this function is divergence-free even thought interfaces meet in corners. Moreover, we can bound the energy in the \(\omega _{j_{0}+1,k}\) cells for any \(k \in \{1,\dots ,N2^{j_{0}+1}-1\}\):
and, since \(l_{j_{0}+1} \geq h_{j_{0}+1}\) and \(\theta ^{j_{0}+1} \sim h_{j_{0}+1}\),
Furthermore, the surface energy is bounded by \(E_{surf}(\chi _{j_{0}+1};\omega _{j_{0}+1,k}) \leq C(A,B,\lambda ) h_{j_{0}+1}\).
Overall, we have a function defined on \([0,1]^{d}\) and can extend it to be \(F_{\lambda}\) for \((x_{1},x_{2}) \notin [0,1]^{2}\). The energy then can be bounded by
□
Remark A.4
We remark that in the situation of the divergence operator, there are situations with substantially more flexibility than for the gradient: If for the two wells \(A,B\in \mathbb{R}^{d\times d}\) it does not only hold that \((B-A)e_{1} = 0 \) but also that \((B-A)e_{2}=0\), there would not be any elastic energy contribution involved. In this situation, for the above construction, we would only have contributions to the surface energy, as then \(E_{e_{2}} = 0\) and hence \(A+E_{e_{2}} = A \in \{A,B\}\). In particular, for boundary data which are only attained on two directions this would yield a linear scaling law in \(\epsilon \). For curl free mappings as in gradient inclusions, this is not possible, as the direction of lamination is unique in that case, i.e. \(V_{\operatorname{rot},\lambda}\) is at most one-dimensional.
As a last auxiliary step towards the upper bound construction from Theorem 1, in order to achieve the exterior data on all sides of the unit cube, we adapt the branching construction similarly as in [75], as the construction from Proposition A.3 does not yet satisfy \(F_{\lambda}\) at, e.g., \(x_{3} =0\). Thus, we combine Proposition A.3 with a further domain splitting for which we split \([0,1]^{d}\) into different regions. In each region, we prescribe a different direction for the branching construction from Proposition A.3. To this end, we use that the choice of \(e_{2}\) in the above results was arbitrary and we also can choose any other direction \(e_{j}\) for \(j \in \{2,\dots ,d\}\). Combined with compatibility conditions at the resulting interfaces, this will allow us to deduce the desired branching construction.
Proposition A.5
Under the same assumptions as in Proposition A.3there exist \(u: \mathbb{R}^{d} \to \mathbb{R}^{d \times d}\), \(\chi \in BV(\Omega ;\{A,B\})\) and a constant \(C = C(A,B,\lambda ) > 0\) such that \(u \in \mathcal{D}_{F_{\lambda}}\) for \(F_{\lambda }= \lambda A + (1-\lambda )B\) (\(\lambda \in (0,1)\)) and for any \(\epsilon \in (0,1)\) and \(N \in \mathbb{N}\) it holds
Proof
For simplicity, we first carry out the details for the case \(d=3\) and then only comment on the changes in the case of arbitrary dimension. We split \([0,1]^{3}\) into the following four parts and use different branching directions in each part: Let
and consider the upper (\(\Omega _{j}^{+}\)) and lower (\(\Omega _{j}^{-}\)) halves separately as in the proof of Proposition A.3.
Next, we define \(u_{2}\) in \(\Omega _{2}^{+}\) and \(u_{3}\) in \(\Omega _{3}^{+}\) using Proposition A.3: The function \(u_{2}\) is given by the function from Proposition A.3 above, whereas \(u_{3}\) is obtained from \(u_{2}\) by exchanging roles of \(e_{2}\) and \(e_{3}\), i.e. the branching is done in \(e_{3}\) direction and we use \(E_{e_{3}}\) instead of \(E_{e_{2}}\). For \(\nu \in \{e_{2}, e_{3}\}\) the error matrix \(E_{\nu}\) is given in (A.1). We then define the overall deformation \(u\) in \(\Omega _{2}^{+} \cup \Omega _{3}^{+}\) by
This construction is depicted in Fig. A.3. As above \(u\) is defined in the lower halves \(\Omega _{2}^{-}\), \(\Omega _{3}^{-}\) by symmetry.
We claim that this overall construction is divergence-free. In the individual regions \(\Omega _{2}^{+}\) and \(\Omega _{3}^{+}\) this follows by Proposition A.3. It thus remains to discuss the compatibility at the interface \(x_{2}=x_{3}\). Since all other values of \(u\) are given by (matching domains in which) \(u\in \{A,B\}\), it suffices to discuss the compatibility of the error matrices \(E_{e_{2}}\) and \(E_{e_{3}}\) at this interface. To this end, we however note that \((E_{e_{3}}-E_{e_{2}}) \zeta = 0\) for all \(\zeta \in \operatorname {span}(e_{2},e_{3})\), i.e. also this interface is admissible. This shows that \(u\) indeed defines a divergence free map.
The upper bound for the elastic and surface energies from Proposition A.3 remains valid, thus yielding the claimed estimate which concludes the proof of the proposition.
In order to show the \(d\)-dimensional result, we split \([0,1]^{d}\) into \(2d-2\) regions \(\Omega _{j}^{\pm} := \{x \in [0,1]^{d} : \pm (x_{j}-\frac{1}{2}) \geq 0,\frac{1}{2}-|x_{j}-\frac{1}{2}| = \min _{2 \leq k \leq d} \frac{1}{2} - |x_{k} - \frac{1}{2}|\}\) for \(j = 2,\dots ,d\) and argue as above. □
Finally, with Proposition A.5 in hand, we immediately obtain the proof of the upper bound construction from Theorem 1 for the divergence operator.
Proof of the upper bound in Theorem 1
In order to deduce the upper bound of Theorem 1, we choose \(N \sim \epsilon ^{-\frac{1}{3}}\), which shows the claim. □
Remark A.6
Generalizations
Building on the ideas from the gradient case and the ones from above one can formulate (rather restrictive) conditions, allowing for similar constructions for more general linear, constant coefficient differential operators. A key difficulty here consists in the “hard form” of the prescribed boundary conditions. When considering “softer forms” of these, as for instance in [66], constructions for general constant coefficient operators with the desired boundary conditions would be feasible under much more general conditions by using Fourier theoretic arguments as in [74]. We do not pursue these ideas here but postpone this to possible future work.
Appendix B: On the Role of the Divergence Operator
Following [28], in this section we highlight the relevance of the divergence operator which also partially motivates our discussion of the scaling law for the \(T_{3}\) problem from Sect. 1.3.
To this end, we first recall that considering any first order homogeneous constant coefficient differential operator \(\mathcal{A}(D) = \sum _{j=1}^{d} A^{j}\partial _{j}: C^{\infty}( \mathbb{R}^{d};\mathbb{R}^{n}) \to C^{\infty}(\mathbb{R}^{d}; \mathbb{R}^{m})\), we can rewrite \(\mathcal{A}(D) u = \operatorname {div}\omega _{1}(u)\) with a linear map \(\omega _{1}: \mathbb{R}^{n} \to \mathbb{R}^{m \times d}\). Indeed, let us define
Let \(u \in C^{\infty}(\mathbb{R}^{d};\mathbb{R}^{n})\), then it holds \(\mathcal{A}(D) u = (\operatorname {div}\circ \omega _{1}) (u)\) for the row-wise divergence.
Indeed this can be generalized for higher order operators \(\mathcal{A}(D)u = \sum _{|\alpha |=k} A^{\alpha }\partial ^{\alpha }u\), where \(A^{\alpha }\in \mathbb{R}^{m \times n}\) are coefficient matrices, as follows. For this we denote the space of symmetric \(k\) tensors on \(\mathbb{R}^{d}\) by \(\text{Sym}(\mathbb{R}^{d},k)\). Let the \(k\)-th order divergence be given as
For \(k=1\) this is exactly the row-wise divergence as mentioned above.
Remark B.1
This definition is natural in that sense that this operator (up to a sign) is the adjoint of the \(k\)-th derivative \(D^{k}\).
The linear map \(\omega _{k}: \mathbb{R}^{n} \to \mathbb{R}^{m} \otimes \text{Sym}( \mathbb{R}^{d},k)\) then takes the form
and by this choice it holds \(\mathcal{A}(D) u= (\operatorname {div}^{k} \circ \omega _{k})(u)\) for any \(u \in C^{\infty}(\mathbb{R}^{d};\mathbb{R}^{n})\) and \(\ker \omega _{k} = I_{\mathcal{A}} = \bigcap _{|\alpha |=k} \operatorname{ker} A^{\alpha}\). In what follows, we omit the \(k\) dependence of \(\omega = \omega _{k}\) in the notation.
With this in hand, it is possible to bound the energy for a general homogeneous linear operator \(\mathcal{A}(D)\) (of order \(k\)) by the corresponding energy for the (\(k\)-th order) divergence (cf. [28, Appendix] for the corresponding qualitative result in the case \(k=1\)).
Proposition B.2
Let \(d,n,k \in \mathbb{N}\), \(\mathcal{K} \subset \mathbb{R}^{n}\), and let \(\Omega \subset \mathbb{R}^{d}\) be a bounded Lipschitz domain. Let \(\mathcal{A}(D)\) be a \(k\)-th order homogeneous linear differential as in (2) and the elastic and surface energies be given by (16) and (17). Moreover let \(\omega =\omega _{k}\) be the linear transformation in (B.4) and \(\operatorname {div}^{k}\) the generalized \(k\)-th order divergence in (B.3) with the corresponding energies \(E_{el}^{\operatorname {div}^{k}}\), \(E_{surf}^{\operatorname {div}^{k}}\). Then there exist constants \(C_{1},C_{2} > 0\) such that for any \(\chi \in BV(\Omega ;\mathcal{K})\), \(F \in \mathbb{R}^{n}\)
Moreover if \(\mathcal{A}(D)\) is cocanceling (and thus \(\omega \) is injective), it also holds for all \(u \in \mathcal{D}_{F}\), \(\chi \in BV(\Omega ;\mathcal{K})\) (cf. (10))
Proof
In order to obtain the desired result, we use the pointwise bound \(|u-\chi | \geq C |\omega (u)-\omega (\chi )|\) and consider the adapted boundary data: For any \(u \in \mathcal{D}_{F}\) with \(\mathcal{D}_{F}\) denoting the set from (10), the composition \(\omega (u)\) satisfies \(\operatorname {div}^{k} \omega (u) = \mathcal{A}(D) u = 0\) in \(\mathbb{R}^{d}\) and \(\omega (u) = \omega (F)\) in \(\mathbb{R}^{d} \setminus \overline{\Omega}\). In other words, it holds that \(\omega (u) \in \mathcal{D}_{\omega (F)}^{\operatorname {div}^{k}}\) for the divergence operator and boundary data \(\omega (F)\). For the elastic energy (denoting by \(E_{el}^{\operatorname {div}^{k}}\), \(\mathcal{D}_{\omega (F)}^{\operatorname {div}^{k}}\) the energy and domain for the divergence operator) this implies
We emphasize that, in general, this only yields lower bound inequalities since \(\omega \) is possibly not injective and thus there may be deformations \(u\) with \(\mathcal{A}(D) u = 0\) and \(u \neq F\) outside \(\Omega \) but still fulfilling \(\omega (u) = \omega (F)\) outside \(\Omega \) (see the example in Sect. B.1 below). Replacing now \(\omega (u)\) by a general function \(w: \mathbb{R}^{d} \to \mathbb{R}^{m} \otimes \text{Sym}(\mathbb{R}^{d},k)\) such that \(w \in \mathcal{D}_{\omega (F)}^{\operatorname {div}^{k}}\) yields
Furthermore, as \(|\nabla \chi | \geq c |\nabla (\omega (\chi ))|\), we can also bound the surface energy
In the case of a cocanceling operator \(\omega \) is injective and we also have the bounds \(|u-\chi | \leq C|\omega (u)-\omega (\chi )|\), \(|\nabla \chi | \leq C | \nabla (\omega (\chi ))|\), thus in (B.5) and (B.6) also the matching upper bounds hold, which concludes the proof. □
As a consequence, lower bounds for the divergence operator often also imply lower bounds for more general operators. A particular setting (see [28]) for instance arises in the three state problem with \(\mathcal{K}=\{A_{1},A_{2},A_{3}\}\) being such that \(A_{j} - A_{k} \notin \Lambda _{\mathcal{A}}\). In this case also \(\omega (\mathcal{K})\) consists of three states which is a result of the fact that the kernel of \(\omega \) is given by
In particular, if we find a \(T_{3}\) structure for a general linear, homogeneous, constant coefficient, first order differential operator \(\mathcal{A}(D)\) such that it is mapped to the \(T_{3}\) structure in Sect. 4, we can exploit the same lower bound as for the divergence operator. In addition to the relevance of the divergence operator for applications, this argument serves as an additional motivation for focusing particularly on the divergence operator in this article.
Moreover with Proposition 3.8 in mind, also for pairwise non super-compatible wells, we can assume without loss of generality that \(I_{\mathcal{A}} = \{0\}\) and thus \(\omega \) in injective.
2.1 B.1 Comparison of the Two-State Problem for the Divergence Operator
In this section, we discuss the comparison between the general two-state problem for linear, homogeneous differential operators and the one for the (\(k\)-th order) divergence operator. In particular, this yields yet another proof of the compatible case in Theorem 1.
In the calculations from the first part of Sect. B, we notice that for an injective map \(\omega \) we can also bound the quantities \(E_{el}(u,\chi )\), \(E_{surf}(\chi )\) from above with the corresponding term in which \(u\), \(\chi \) are replaced by \(\omega (u)\), \(\omega (\chi )\); hence for \(I_{\mathcal{A}} = \{0\}\) it holds
We here emphasize that this only holds on the level of fixed \(u\), \(\chi \) and that after the minimization in \(u\), this does not necessarily yield a two-sided comparison of the energies any more. Indeed, while the lower bound estimates always hold (cf. Proposition B.2), this may not be true for the upper bound estimates. In fact, even if \(I_{\mathcal{A}} = \{0\}\), we can at the moment not exclude that there may be \(w \in \mathcal{D}_{\omega (F)}^{\operatorname {div}^{k}} \setminus \omega ( \mathcal{D}_{F})\). We postpone a further discussion of this to future work.
The advantage of \(I_{\mathcal{A}} = \{0\}\) is that we do not lose wells in that sense that for \(I_{\mathcal{A}} = \{0\}\) also \(\ker \omega = \{0\}\) and thus, \(\omega \) is injective. As seen above in Sect. 3.4 for two wells \(A,B \in \mathbb{R}^{n}\), \(A-B \notin I_{\mathcal{A}}\) we can restrict to \(\tilde{A}(D)\) which fulfills \(I_{\tilde{\mathcal{A}}} = \{0\}\). This implies that for two compatible wells, which are not super-compatible, in deducing lower scaling bounds, we can use the corresponding lower bounds of the divergence operator as we do not lose information.
Example B.3
In concluding this section, we give an example of an operator which is not cocanceling. Considering \(d=2\), \(n=3\), \(m=1\) and
implies that \(\omega : \mathbb{R}^{3} \to \mathbb{R}^{1\times 2}\), \(\omega (x) = (x_{2} ,\ x_{3})\) and \(\ker (\omega ) = \operatorname {span}(e_{1}) = I_{\mathcal{A}}\).
The reduced operator \(\tilde{\mathcal{A}}(D)\) would act on mappings taking values only in \(\{0\} \times \mathbb{R}^{2} \subset \mathbb{R}^{3}\).
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Raiţă, B., Rüland, A. & Tissot, C. On Scaling Properties for Two-State Problems and for a Singularly Perturbed \(T_{3}\) Structure. Acta Appl Math 184, 5 (2023). https://doi.org/10.1007/s10440-023-00557-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10440-023-00557-7