Abstract
A vectorial Modica–Mortola functional is considered and the convergence to a sharp interface model is studied. The novelty of the paper is that the wells of the potential are not constant, but depend on the spatial position in the domain \(\Omega \). The mass constrained minimization problem and the case of Dirichlet boundary conditions are also treated. The proofs rely on the precise understanding of minimizing geodesics for the degenerate metric induced by the potential.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Phase transitions phenomena are ubiquitous in nature. Examples are the spinodal decomposition in metallic alloys, the change in the crystallographic structure in metals, the order-disorder transitions, and the alterations of the molecular structures. In view of the wide range of physical and industrial applications where phase transitions are observed, it is of primary interest to understand the different mechanisms that govern these complex processes. Many physical models have been proposed over the years to capture the behavior of these phenomena and an enormous amount of insight has been gained by performing analytical studies. For this reason, the theoretical investigation of phase transitions is still currently an active field of research in the mathematical community. In the particular case of liquid-liquid phase transitions, the preferred model was proposed by van der Waals (see [61]) and was later independently rediscovered by Cahn and Hilliard (see [18]). This theory revolves around the study of the so called Modica–Mortola energy functional (often referred to as the Ginzburg–Landau free energy in the physics literature), which is the foundation of the model we consider in this paper.
The primary focus of this work is the study of the \(\Gamma \)-convergence of the family of functionals
where \(u \in H^1(\Omega ; \mathbb {R}^M)\), with \(M \ge 1\), and \(W :\Omega \times \mathbb {R}^M \rightarrow [0,\infty )\) is a locally Lipschitz potential such that, for all \(x \in \Omega \), \(W(x, p) = 0\) if and only if \(p \in \{z_1(x),\dots ,z_k(x)\}\). Here \(\Omega \) denotes an open bounded subset of \(\mathbb {R}^N\) with Lipschitz continuous boundary and, for \(i \in \{1, \dots , k\}\), the \(z_i :\Omega \rightarrow \mathbb {R}^M\) are given Lipschitz functions.
Our main contribution is the treatment of the case \(M \ge 2\) for x-dependent wells, thus providing a first vectorial counterpart to some of the results in [14, 58], where moving wells were considered in the scalar case. For the precise statement of our results we refer the reader to Sect. 1.2.
1.1 Previous works
Denote by \(\Omega \subset \mathbb {R}^N\) the container of the material, and assume that the system is described by a scalar valued phase (or order) parameter \(u :\Omega \rightarrow \mathbb {R}\), which for instance, in the case of a mixture of two or more fluids, represents the density. Stable equilibrium configurations are local minimizers of the Gibbs free energy. Under isothermal conditions, consider
where the free energy density \(W_0:\mathbb {R}\rightarrow [0,\infty )\) is taken to be non-convex in order to support a phase transitions. If the material has two stable phases, the typical form of \(W_0\) is depicted in Fig. 1. In many situations, the physical interpretation of the phase parameter naturally imposes a constraint on the class of admissible functions for the minimization problem for (1). If u represents a density, this often takes the form of a volume constraint, i.e.,
for some \(m\in \mathbb {R}\). For \(W_0\) as in Fig. 1, let \((\alpha ,\beta )\) be the interval where \(W_0\) does not coincide with its convex envelope. To be precise, \(\alpha \) and \(\beta \) are chosen to satisfy
The numbers \(\alpha , \beta ,\mu \), where \(\mu {:}{=} W'_0(\alpha )\), are called Maxwells parameters (see [37]). Notice that if
then there is no phase transitions, in that solutions to the minimization problem for (1) subject to the constraint in (2) are constant. Here \(\mathcal {L}^N(\Omega )\) denotes the volume of the set \(\Omega \). Therefore, we assume that
and restrict our attention to admissible functions u taking values in the interval \([\alpha ,\beta ]\). Define
Notice that \(W(\alpha ) = W(\beta ) = 0\), that \(W > 0\) otherwise, and that, in view of the mass constraint (2), replacing \(W_0\) with W in (1) changes the free energy only by a constant.
As previously remarked by several authors, an energy of the form (1) cannot properly describe the physics of phase transitions. Indeed, given any region \(A \subset \Omega \) with
the phase variable which takes the value \(\alpha \) in A and \(\beta \) in \(\Omega \setminus A\) satisfies the mass constraint (2) and is a minimizer of the energy (1). This is not what it is observed in experiments, where for stable configurations the two phases are separated by an interface with minimal surface area. Therefore, in order to capture this behavior, a term that penalizes the creation of interfaces has to be added to the energy. Indeed, in the van der Waals–Cahn–Hilliard theory of phase transitions the following functional is considered
One can justify heuristically the choice of the singular perturbation in (3) by considering the idealized situation where the transition between the two phases takes place in a layer \(\Sigma _{\varepsilon }\), representing the diffuse separating interface. This is assumed to be an \(\varepsilon \)-tubular neighborhood of an \((N-1)\)-dimensional surface \(\Sigma \). In this case
where the last estimate is obtained by assuming that both u and \(|\nabla u|\) are bounded. Here with \(\mathcal {H}^{N-1}(\Sigma )\) we denote the surface area of \(\Sigma \). Therefore, in order to have an energy of order 1 we need to rescale the functional by a factor of \(1/\varepsilon \). Hence we consider
It was conjectured by Gurtin (see [38]) that in the limit as \(\varepsilon \rightarrow 0\), minimizers \(\{u_\varepsilon \}_{\varepsilon > 0}\) of (5) subject to the constraint (2) converge to a piecewise constant function u that partitions \(\Omega \) into two regions separated by an interface with minimal surface area. This conjecture was proved rigorously by Carr, Gurtin, and Slemrod for \(N = 1\) (see [19]), and by Modica (see [49]) and Sternberg (see [58]) for \(N \ge 2\) (see also [51, 52]), thus showing that the sharp interface limit of the phase field model (5) provides the minimal interface criterion observed in experiments. The mathematical framework used was that of \(\Gamma \)-convergence, a notion of convergence introduced by De Giorgi and Franzoni in [25]. To be precise, it was proved that the energy of the sequence \(\{u_\varepsilon \}_{\varepsilon > 0}\) converges, as \(\varepsilon \rightarrow 0\), to
where \(\Sigma \) is an interface having minimal area, separating two regions whose volume is determined by the mass constraint (2) (which is preserved in the limit), and
Observe that the factor \(\sigma \) represents the energy needed in order to have a phase transition, and it is independent of the position and of the orientation of the interface. The value of \(\sigma \) can be also characterized as
Functions achieving the minimum in (7) are called heteroclinic connections between \(\alpha \) and \(\beta \).
Let us remark that solutions to the minimal area problem enjoy some regularity properties: the interior regularity was studied by Gonzales, Massari, and Tamanini in [35], the behavior at the boundary of \(\Omega \) was investigated by Grüter in [36], while the connectivity of the interface was the focus of the paper [60] by Sternberg and Zumbrun.
After the early works mentioned above, the mathematical study of phase transitions has flourished. Since the literature on this problem and its variants is vast, here we limit ourselves at recalling only the main contributions that are close to the problem we consider in this paper: the static problem for first order phase transitions.
The case where the material has more than two stable phases requires vector-valued phase variables \(u:\Omega \rightarrow \mathbb {R}^M\). Indeed, even if one considers a potential \(W :\mathbb {R}\rightarrow [0,\infty )\) having more than two wells, minimizers of (3) will converge to piecewise constant functions taking only two values, which are selected by the mass constraint. A key ingredient in the treatment of the vectorial case is the study of the relation between the value in (7) where the minimization problem is suitably adapted to the vectorial case, and the geodesic distance between two of the wells \(\alpha _i,\alpha _j\in \mathbb {R}^M\) with respect to the metric induced by the degenerate conformal factor \(2 \sqrt{W}\), namely
The importance of this relation was first observed by Sternberg in [58] for the case of a potential W vanishing on two disjoint closed simple curves in \(\mathbb {R}^2\). The case of a phase variable \(u :\Omega \rightarrow \mathbb {R}^M\) and a potential supporting \(k = 2\) stable phases was treated by Sternberg in [59] when \(M=2\) and by Fonseca and Tartar in [33] when \(M\ge 2\), while the general case \(k \ge 2\) was investigated by Baldo in [11]. In these works the limiting energy is shown to be of the form
where \(\alpha _1,\dots ,\alpha _k\in \mathbb {R}^M\) denote the wells of the potential W, and \(\Omega _i\subset \Omega \) is the region where the phase variable \(u\in BV(\Omega ;\mathbb {R}^M)\) takes the value \(\alpha _i\). Here \(\partial ^*\Omega ^i\) denotes the essential boundary of the set of finite perimeter \(\Omega ^i\) (see Definition 2.14).
A further generalization was studied by Barroso and Fonseca in [12], where the authors considered singular perturbations of the form \(h(x, \varepsilon \nabla u(x))\) and vector-valued phase variables. Moreover, the fully coupled singular perturbations, i.e., Gibbs free energies of the form
were the main focus of the work [55] by Owen and Sternberg in the scalar case and of [32] by Fonseca and Popovici for vector-valued phase variables. For functionals defined as in (10), the sharp interface limiting energy was shown to be of the form
In this case, if x is in the interface separating \(\Omega _i\) and \(\Omega _j\) and \(\nu \in \mathbb {S}^{N-1}\), the energy density \(\sigma \) is given by the so called cell problem
where \(Q_\nu \subset \mathbb {R}^N\) is a unit cube centered at the origin having two faces orthogonal to \(\nu \), and the function v ranges among all Lipschitz functions taking the value \(\alpha _i\) on one of these two faces, \(\alpha _j\) on the other one, and it is 1-periodic in the directions orthogonal to every other face of the cube \(Q_\nu \).
The boundary of the container \(\Omega \) could enter into play either via an interaction energy, or by forcing the phase variable to assume a specific value (not necessarily corresponding to a stable phase). The first case was studied by Modica in [50] where he considered the energy
Here \(\mathcal {E}_\varepsilon \) is defined as in (5), \(\tau :\mathbb {R}\rightarrow [0,\infty )\) is a 1-Lipschitz function, and \(\mathrm {Tr}\,u\) denotes the trace of u on \(\partial \Omega \). The author showed that the sharp interface limit is given by (see (6))
where \(\hat{\tau }\) is a modified contact energy (for a precise definition see [50, Equation (6)]). Owen, Rubinstein, and Sternberg treated the case where admissible functions for the minimization problem for (5) are constrained to satisfy a Dirichlet boundary condition \(u = g_\varepsilon \) on \(\partial \Omega \) (see [54]). The limiting problem was shown to be
where \(g_\varepsilon \rightarrow g\) in a suitable sense and the distance \(d_W\) is the one induced by the degenerate metric as in (8).
The case where the zero level set of W has a more complicated topology was considered by several authors. The particular situation in which the potential vanishes on two disjoint \(C^1\) curves in \(\mathbb {R}^2\) was considered by Sternberg in [58]. The case where the set Z of zeros of W is a generic compact set in \(\mathbb {R}^M\) was studied by Ambrosio in [5], where by considering the canonical quotient space F of \((\mathbb {R}^M, d_W)\), together with the canonical projection map \(\pi :\mathbb {R}^M \rightarrow F\), the author was able to prove that the family of functionals in (5) \(\Gamma \)-converges to
More recently, Lin, Pan, and Wang (see [47]) characterized the asymptotic behavior of sequences of minimizers satisfying a Dirichlet boundary condition in the specific case where Z is the union of two smooth disjoint manifolds \(N^+, N^- \subset \mathbb {R}^M\), under the assumption that
where d denotes the distance function, and f is smooth and behaves linearly in a neighborhood of the origin. Their proofs rely on the fact that geodesics for \(d_W\) are shown to be segments joining two points of minimal distance between \(N^+\) and \(N^-\).
There are physically relevant cases where the phase variable u is not expected to possess derivatives; thus a different singular perturbation is needed. An example is the continuum limit of the Ising spin system on a lattice, where u represents the magnetization density. In this case the appropriate energy to consider is
where K is a ferromagnetic Kac potential, which is assumed to be nonnegative and integrable. This energy was studied by Alberti and Bellettini in [1, 2], where they proved that the discrete nature of the problem does not affect the form of the limiting energy, which in turn was shown to still be an anisotropic perimeter functional as in (11). Notice that the integrability assumption on the potential K excludes the classical seminorm for fractional Sobolev spaces \(W^{s,p}(\Omega )\) for \(0< s < 1\) and \(p > 1\). The one dimensional case for \(s=\frac{1}{2}\) and \(p = 2\) was considered by Alberti, Bouchitté, and Seppecher in [3], where the authors identified the \(\Gamma \)-limit of the family of functionals
Here \(I \subset \mathbb {R}\) is a bounded interval and \(\varepsilon \lambda _\varepsilon \rightarrow l \in (0,\infty )\). The one dimensional case for \(p > 2\) was considered by Garroni and Palatucci in [34]. The \(\Gamma \)-convergence for the nonlocal perimeter functional
in the case \(N \ge 2\) and \(s \in (0,1)\) was studied by Savin and Valdinoci in [57].
The Euler–Lagrange equation for minimizers of the functional (5) subject to the mass constraint (2) was investigated by Luckhaus and Modica in [48] (see [21] for the anisotropic case). The authors considered the equation
where \(\lambda _\varepsilon \in \mathbb {R}\) is the Lagrange multiplier associated to the mass constraint, and proved that
as \(\varepsilon \rightarrow 0\). Here H denotes the mean curvature of the limiting interface. Formula (14) is known in the physics literature as the Gibbs-Thomson relation.
The minimal area interface principle serves as a first selection criterion to choose which of the (infinitely many) minimizers of (1) is physically relevant. More refined information can be obtained by considering the \(\Gamma \)-convergence expansion (see [9, 16]) of the energy (3):
where each \(\mathcal {E}^{(i)}\) is the \(\Gamma \)-limit of the family of functionals
and \(\mathcal {E}^{(0)}_\varepsilon {:}{=}\mathcal {E}_\varepsilon \). The characterization of the functional \(\mathcal {E}^{(1)}\) was carried out by [9, 10, 13, 19, 24, 45, 46] in several cases of interest.
Variants of phase transitions models of the form (5) could also be used to investigate more intricate situations. For instance, the interaction between phase transitions and homogenization phenomena is described by the functional
Here the periodic structure of the material is modeled by the periodicity of the function W in the first variable. The sharp interface model was derived by Braides and Zeppieri in [17] for the one dimensional case \(N = 1\) and by Fonseca, Hagerty, Popovici, and the first author in [22] (see also [20]) for \(N > 1\). When the homogenization takes place at the level of the singular perturbation we refer to [7, 8, 27, 28].
All the previous works are based on (variants of) model (5), which describes phenomena where the system is assumed to be under isothermal conditions. There are physically relevant situations however, where this is not the case. For instance, consider a homogeneous mixture of a binary system in thermal equilibrium. If we quench the system below a critical temperature, then we would expect phase separation. Since the quenching takes place over a finite amount of time, the assumption of isothermal conditions is not plausible. In addition, there are situations where the phase separation process can be directed by using an external thermal activation (see [4] and the references therein). In all of these cases, the model (5) is not adequate to describe the physics of the phenomenon. A system of evolution equations aimed at modelling phase transitions under nonisothermal conditions was proposed by Penrose and Fife in [56] and by Alt and Pawlow in [4]. The free energy they considered reads as
where \(T :\Omega \rightarrow \mathbb {R}\) represents the temperature of the material (or any external field), and K is a given positive function. Here the unknowns of the problem are both the temperature distribution T and the phase parameter u. In particular, it could be the case where the wells of W depend themselves on the temperature, and thus are not necessarily the same for all points \(x \in \Omega \). The dependence of W on both of the unknowns poses analytical challenges.
In order to get some insight we assume the distribution of the temperature T to be given a priori and K to be constant. These simplifications allow us to consider a free energy of the form
where the potential \(W :\Omega \times \mathbb {R}^M \rightarrow [0, \infty )\) is such that \(W(x, p) = 0\) if and only if \(p \in \{z_1(x), \dots , z_k(x)\}\), and the \(z_i :\Omega \rightarrow \mathbb {R}^M\) are given functions representing the stable phases of the material at each point \(x \in \Omega \).
The functional (15) was considered by Ishige in [40] (see also [39]) in the vectorial case, i.e. \(M > 1\), when \(k = 2\) and \(z_1\), \(z_2\) are constants. To the best of our knowledge, there are only two papers that considered the case where the functions \(z_i\) are nonconstant: [58] by Sternberg and [14] by Bouchitté. They both treated the scalar case, i.e. \(M = 1\), with two moving wells. A specific kind of potential in two dimensions is considered in the former work, while fully coupled singular perturbations in general dimension are treated in the latter. More precisely, in [14] the author considered an energy of the form
where \(f(x, u, 0) = 0\) if and only if \(u \in \{z_1(x), z_2(x)\}\). The wells \(z_1\) and \(z_2\) are allowed to coincide in a subset of \(\Omega \). The limiting functional was shown to be
for \(u \in BV_{loc}(\Omega _0)\), where \(\Omega _0 {:}{=} \{ x \in \Omega : z_1(x) \ne z_2(x) \}\), and \(\infty \) otherwise in \(L^1(\Omega )\). Here, for \(x \in \Omega \) and \(\nu \in \mathbb {S}^{N-1}\), we define
where the infimum is taken over all Lipschitz curves \(\gamma \) connecting \(z_1(x)\) and \(z_2(x)\). The scalar nature of the problem allows to implement techniques that are purely one dimensional and that cannot be adapted to the vectorial case.
In this paper we consider for the first time the energy (15) in the vectorial case, with \(k \ge 2\), and for functions \(z_i\) which are possibly nonconstant. In particular, we prove that any sequence \(\{u_{\varepsilon _n}\}_{n \in \mathbb {N}} \subset H^1(\Omega ; \mathbb {R}^M)\) such that
where \(\varepsilon _n \rightarrow 0\), converges (eventually extracting a subsequence) to a function \(u \in BV(\Omega ; \mathbb {R}^M)\) of the form
Here \(\{\Omega _1, \dots , \Omega _k\}\) is a Caccioppoli partition of \(\Omega \). Moreover, the limiting sharp interface energy is
where, for \(p, q \in \mathbb {R}^M\), \(d_W(x, p, q)\) is the geodesic distance induced by the degenerate conformal factor \(2 \sqrt{W(x, \cdot )}\). Notice that if the wells \(z_i\) are independent of x we recover (9). We refer to the next section for the precise statement of the results and for all the assumptions we require.
1.2 Statement of the main results
Let \(\Omega \subset \mathbb {R}^N\), \(N \ge 2\), be a bounded open set with Lipschitz continuous boundary. Throughout the paper we make the following assumptions on the potential W.
-
H.1
\(W :\overline{\Omega } \times \mathbb {R}^M \rightarrow [0, \infty )\) is locally Lipschitz continuous, i.e., Lipschitz continuous on every compact subset of \(\overline{\Omega } \times \mathbb {R}^M\). Moreover, for every \(x \in \overline{\Omega }\), \(W(x, p) = 0\) if and only if \(p \in \{z_1(x), \dots , z_k(x)\}\), where the functions \(z_i :\overline{\Omega } \rightarrow \mathbb {R}^M\) are Lipschitz continuous;
-
H.2
There exists \(\delta > 0\) such that
$$\begin{aligned} \min \left\{ |z_i(x) - z_j(x)| : x \in \overline{\Omega } \text { and } i \ne j \right\} \ge \delta ; \end{aligned}$$ -
H.3
There exists \(r > 0\) such that if \(p \in B(z_i(x), r)\) then
$$\begin{aligned} W(x, p) = \alpha _i |p - z_i(x)|^2, \end{aligned}$$where \(\alpha _i > 0\), for all \(i = 1, \dots , k\) and \(x\in \Omega \);
-
H.4
There exist \(R, S > 0\) such that \(W(x, p) \ge S|p|\), for all \(x \in \Omega \) and all p with \(|p| > R\).
Definition 1.1
For \(\varepsilon > 0\), let \(\mathcal {F}_\varepsilon :L^1(\Omega ; \mathbb {R}^M) \rightarrow [0, \infty ]\) be the functional defined via
In order to define the limiting functional, we need to introduce some notation.
Definition 1.2
For \(p, q \in \mathbb {R}^M\) consider the class
and let \(d_W :\overline{\Omega } \times \mathbb {R}^M \times \mathbb {R}^M \rightarrow [0, \infty )\) be the function defined via
It is immediate to verify that for all \(x \in \overline{\Omega }\), the function \(d_W(x, \cdot , \cdot ) :\mathbb {R}^M \times \mathbb {R}^M \rightarrow [0, \infty )\) defines a distance on \(\mathbb {R}^M\). The existence of solutions to the minimization problem (17), referred to as minimizing geodesics throughout the paper, is a classical problem which has been the subject of investigation of several studies. Since our proofs rely on a precise understanding of the dependence of minimizing geodesics on the variable x, our approach (see Proposition 3.2) requires more stringent assumptions on W than the ones required by Zuniga and Sternberg in [62, Theorem 2.5] (see also [53]), and is in spirit closer to the work of Sternberg [59].
We can now define our limiting functional. For all the relevant definitions we refer the reader to Sect. 2 (see, in particular, Definition 2.9).
Definition 1.3
Set
and let \(\mathcal {F}_0 :L^1(\Omega ; \mathbb {R}^M) \rightarrow [0, \infty ]\) be the functional defined via
Throughout the rest of the paper we fix \(\{\varepsilon _n\}_{n \in \mathbb {N}}\subset (0,\infty )\) with \(\varepsilon _n \rightarrow 0\) as \(n \rightarrow \infty \), and we write \(\mathcal {F}_n\) for \(\mathcal {F}_{\varepsilon _n}\). We are now in position to state our main result.
Theorem 1.4
Let W be given as in (H.1)–(H.4) and let \(\{u_n\}_{n \in \mathbb {N}} \subset H^1(\Omega ; \mathbb {R}^M)\) be such that
Then, eventually extracting a subsequence (not relabeled), we have that \(u_n \rightarrow u\) in \(L^1(\Omega ; \mathbb {R}^M)\), where \(u \in BV(\Omega ; z_1, \dots , z_k)\) is such that \(\mathcal {F}_0(u) < \infty \). Moreover, the sequence of functionals \(\mathcal {F}_n\) \(\Gamma \)-converges with respect to the \(L^1\)-topology to \(\mathcal {F}_0\).
The proofs we present are robust and can be adapted to work also for several variants of the problem. In this paper we focus on two of these: the mass constrained problem and the case of Dirichlet boundary conditions.
Fix \(\mathcal {M} = (m_1,\dots ,m_M)\in \mathbb {R}^M\) in such a way that
for every \(j = 1, \dots , M\), where \(z_i^j(x)\) denotes the jth component of \(z_i(x)\).
Theorem 1.5
Let W be given as in (H.1)–(H.4) and let \(\mathcal {M} \in \mathbb {R}^M\) be as above. For \(n\in \mathbb {N}\), let
Then the followings hold:
-
(i)
if \(\{u_n\}_{n \in \mathbb {N}} \subset H^1(\Omega ; \mathbb {R}^M)\) is such that
$$\begin{aligned} \sup \{ \mathcal {F}^{\mathcal {M}}_n(u_n) : n \in \mathbb {N}\} < \infty , \end{aligned}$$then, eventually extracting a subsequence (not relabeled), we have that \(u_n \rightarrow u\) in \(L^1(\Omega ; \mathbb {R}^M)\), where \(u \in BV(\Omega ;z_1,\dots ,z_k)\) is such that \(\mathcal {F}^{\mathcal {M}}_0(u) < \infty \). Here the functional \(\mathcal {F}^{\mathcal {M}}_0\) is defined via
$$\begin{aligned} \mathcal {F}_0^{\mathcal {M}}(u) {:}{=} \left\{ \begin{array}{ll} \mathcal {F}_0(u) &{} \text { if }u \in BV(\Omega ; z_1, \dots , z_k) \text { with } \int _\Omega u(x)\,dx=\mathcal {M},\\ &{}\\ \infty &{} \text { otherwise in } L^1(\Omega ; \mathbb {R}^M). \end{array} \right. \end{aligned}$$ -
(ii)
The sequence of functionals \(\mathcal {F}^{\mathcal {M}}_n\) \(\Gamma \)-converges with respect to the \(L^1\)-topology to \(\mathcal {F}^{\mathcal {M}}_0\).
Using the results in [41], we deduce the following corollary.
Corollary 1.6
Under the assumptions of Theorem 1.5, let \(u_0 \in BV(\Omega ;z_1,\dots ,z_k)\) be an \(L^1\)-isolated local minimizer for \(\mathcal {F}_0^{\mathcal {M}}\), namely there exists \(\lambda >0\) such that
for all \(v \in L^1(\Omega ;\mathbb {R}^M)\) with \(0< \Vert v - u_0 \Vert _{L^1(\Omega ; \mathbb {R}^M)} < \lambda \). Then there exists \(\{u_n\}_{n\in \mathbb {N}}\subset H^1(\Omega ;\mathbb {R}^M)\) where each \(u_n\) is an \(L^1\)-local minimizer for \(\mathcal {F}_n^{\mathcal {M}}\), such that \(u_n\rightarrow u_0\) in \(L^1(\Omega ;\mathbb {R}^M)\).
Next, we consider the case where a Dirichlet condition is imposed on the boundary of \(\Omega \).
Theorem 1.7
Let W be given as in (H.1)–(H.4) and fix \(g \in \mathrm{Lip}(\partial \Omega ; \mathbb {R}^M)\) with
For \(n \in \mathbb {N}\) define
Then the followings hold:
-
(i)
if \(\{u_n\}_{n\in \mathbb {N}} \subset H^1(\Omega ; \mathbb {R}^M)\) is such that
$$\begin{aligned} \sup \left\{ \mathcal {F}^{D}_n(u_n) : n \in \mathbb {N}\right\} < \infty , \end{aligned}$$then, eventually extracting a subsequence (not relabeled), we have that \(u_n \rightarrow u\) in \(L^1(\Omega ; \mathbb {R}^M)\), where \(u \in BV(\Omega ;z_1,\dots ,z_k)\) is such that \(\mathcal {F}^{D}_0(u) < \infty \). Here the functional \(\mathcal {F}^{D}_0\) is defined via
$$\begin{aligned} \mathcal {F}^{D}_0(u) {:}{=} \mathcal {F}_0(u) + \int _{\partial \Omega } d_W(x,\mathrm {Tr}\,u(x),g(x))\,d{\mathcal {H}}^{N-1}(x). \end{aligned}$$ -
(ii)
The sequence of functionals \(\mathcal {F}^{D}_n\) \(\Gamma \)-converges with respect to the \(L^1\)-topology to \(\mathcal {F}^D_0\).
1.3 Sketch of the strategy
Despite the fact that the strategy we have to follow is clear, the path to the proof of the main result (Theorem 1.4) is studded with technical difficulties.
First of all, we comment on the compactness result. For clarity of exposition, assume that \(k = 2\), i.e., there are only two wells, namely \(z_1, z_2\). From the energy bound and Young’s inequality we get
In the case where W is independent of x, and thus \(z_1, z_2\) are constant, the proof originally proposed by Modica in [49] (see also [33]) proceeds as follows: it can be checked that
where \(w(p){:}{=} d_W(p,z_1)\), and therefore the BV-compactness implies that \(w \circ u_n \rightarrow w \circ u\) in \(L^1(\Omega )\), where \(w\circ u \in BV(\Omega )\). From this, one can then deduce that \(u \in BV(\Omega )\) and that it only takes the values \(z_1, z_2\). In our case, since W depends on x, instead of (18) we get
where
Therefore, in order to apply BV-compactness for the sequence of functions \(\{g_n\}_{n\in \mathbb {N}}\), we need a control on the other derivatives as well. Notice that this does not come from the energy bound, and is achieved by showing that the function \(x \mapsto d_{W}(x,p,q)\) is Lipschitz continuous for every \(p,q \in \mathbb {R}^M\) (see Corollary 3.5). We prove this by first deriving a uniform upper bound on the Euclidean length of minimizing geodesics for our degenerate metric (see Proposition 3.2); we discuss this at the end of this section.
Here a note is a must. To the best of our knowledge, the strategy that we summarized above is the way to get compactness for this kind of problems. Indeed, in every papers that treated the issue (see, for example, [12, 32, 33]), suitable assumptions are required in order to use the argument described above.
We remark that in [11] it was assumed that (for a potential W independent of x)
for every \(p \not \in K\), where \(K {:}{=} [k_1, k_2]^M\). Since W is continuous, (19) implies that W is constant on \(\partial K\), which is a rather restrictive assumption on W. Moreover, since for \(M > 1\) the space \(H^1(\Omega ; \mathbb {R}^M)\) is not closed under truncation, we instead replace (19) with (H.4) and consider projections rather than truncations in order to reduce to a sequence \(\{u_n\}_{n\in \mathbb {N}}\) bounded in \(L^\infty (\Omega ;\mathbb {R}^M)\) (see Step 2 in Proposition 4.1).
The strategy we use to prove the liminf inequality is the blow-up method introduced by Fonseca and Müller in [31]. To summarize the argument it is not restrictive to assume that \(\Omega = Q \subset \mathbb {R}^N\) is the unit cube with faces orthogonal to the coordinate axes and that \(u \in BV(\Omega ; \mathbb {R}^M)\) is defined via
Let \(\{\rho _m\}_{m\in \mathbb {N}} \subset (0, 1)\) be such that \(\rho _m \rightarrow 0\), and consider the rescaled cubes \(Q_{\rho _m} {:}{=} \rho _m \overline{Q}\). Let \(\{u_n\}_{n \in \mathbb {N}}\) be a sequence of functions in \(H^1(\Omega ;\mathbb {R}^M)\) such that \(u_n \rightarrow u\) in \(L^1(\Omega ;\mathbb {R}^M)\). For the sake of the argument, assume in addition that \(u_n(x) = z_1(x)\) if \(x_N = -\rho _m/2\), and that \(u_n(x) = z_2(x)\) if \(x_N = \rho _m/2\). Our aim is to show that
Since the map \(x \mapsto W(x,p)\) is continuous, by an application of Young’s inequality and Tonelli’s theorem one expects to obtain
where in the previous to last line we used the notation \((0',t) = (0,\dots ,0,t)\in \mathbb {R}^N\), for \(t\in \mathbb {R}\). One possible way make this heuristics rigorous is the following:
where \(Q'_{\rho _m} {:}{=} \{x' : (x',0) \in Q_{\rho _m}\}\) and \(\widetilde{u}_n(x',s) {:}{=} u_n(x', \rho _m s)\). To conclude, one would need to prove that the function
is continuous. Notice that the minimization problem on the right-hand side of (21) is significantly different from the geodesic problem (17), in that in (21) the conformal factor depends also on the variable of integration s. One way to prove the continuity is to show that curves solving that minimization problem have uniformly finite Euclidean length (or at least that there exists one such curve enjoying this property) and then exploit the Lipschitz continuity of W. However, in the present work we choose to reason as follows. Define
Then one can show that
where
With this in hand, to conclude (see (20)) it is sufficient to show that
Notice that the function \(F_m\) vanishes on the set
In view of (H.2), we can assume m large enough so that the sets \(z_i(Q_{r_m})\) are pairwise disjoint. Let us remark that the advantage to work with (22) instead of (21) is that the latter is a purely geometric problem, i.e., the functional that we aim at minimizing does not depend on the specific choice of the parametrization. We are able to prove an explicit upper bound on the Euclidean length of certain solutions to the minimization problem in (22) (see Proposition 3.2). Furthermore, since the only property of Z that is needed in the proof is that it is the union of the images of a compact convex set through the \(z_i\)’s, the argument also works for the case where \(Z = \{ z_1(x),\dots , z_k(x)\}\) for some \(x \in \Omega \). The strategy we use is the following. First of all, we show that the specific behavior of the potential in a neighborhood of the wells yields
if \(z\in \mathbb {R}^M\) is sufficiently close to Z, where \(d_i(z)\) denotes the distance between z and \(z_i(Q_{\rho _m})\). Given \(p, q \in \mathbb {R}^M\) we want to show that solutions to (22) have uniformly finite Euclidean length. We only discuss the case where p and q belong to a neighborhood of \(z_i(Q_{\rho _m})\) for some \(i\in \{1,\dots ,k\}\) since the general case will be obtained by using the upper bound for each \(i\in \{1,\dots , k\}\) and (H.4) to get an upper bound of the length of geodesics outside of these neighborhoods. We consider three cases:
- (i):
-
If \(p, q \in z_i(Q_{\rho _m})\), then a minimizing geodesic is simply given by the image through \(z_i\) of a segment connecting two points in \(z_i^{-1}(\{p\})\) and \(z_i^{-1}(\{q\})\) respectively;
- (ii):
-
If \(p \in z_i(Q_{\rho _m})\) and \(q\not \in z_i(Q_{\rho _m})\), let us denote by \(q'\) one projection of q on \(z_i(Q_{\rho _m})\). Then the curves obtained by first connecting q and \(q'\) with a segment and then \(q'\) and p with a curve in \(z_i(Q_{\rho _m})\) is a solution to the minimization problem in (22). The proof of this uses the co-area formula and that each curve connecting p and q must traverse every level set of \(d_i\) lower than \(d_i(q)\). This latter fact follows by using (23);
- (iii):
-
If \(p, q \not \in z_i(Q_{\rho _m})\) we are able to prove the existence of a minimizing geodesic \(\gamma \in \mathcal {A}(p,q)\) with the property that
$$\begin{aligned} L(\gamma ) \le |p-p'|+|q-q'|+\mathrm{Lip}(z_i) \mathrm {diam}(Q_{\rho _m}). \end{aligned}$$Here \(L(\gamma )\) is the Euclidean length of \(\gamma \), and \(p'\) and \(q'\) denote projections on \(z_i(Q_{\rho _m})\) of p and q, respectively.
Next, we would like to comment on a hypothesis used to get the liminf inequality in the work [40] by Ishige. There the author considered potentials \(W :\Omega \times \mathbb {R}^M \rightarrow [0, \infty )\) such that, for all \(x \in \Omega \), \(W(x,p) = 0\) if and only if \(p \in \{ \alpha ,\beta \}\), for fixed \(\alpha , \beta \in \mathbb {R}^M\) and satisfy the following property: for each \(\lambda _1 > 0\) there exists \(\lambda _2 > 0\) such that for all \(p\in \mathbb {R}^M\) it holds
whenever \(|x-y| \le \lambda _2\). As remarked in [40], (24) is satisfied if, for example, \(W(x,p) = h(x) U(p)\), with \(h > 0\). We notice that assumption (24) does not hold even in the simple case of a single moving well. For this reason one cannot immediately adapt the proof in [40] to our case.
The construction of the recovery sequence is carried out as follows. Thanks to Lemma 4.4 we can assume
where \(\{\Omega _1,\dots ,\Omega _k\}\) is a Caccioppoli partition of \(\Omega \) and \(\partial \Omega ^i\cap \Omega \) is contained in a finite union of hyperplanes, for each \(i=1,\dots ,k\). For the sake of exposition, we just discuss how to build the recovery sequence in a neighborhood of a connected component \(\Sigma \) of \(\partial \Omega ^i \cap \Omega \) contained in an hyperplane. Without loss of generality, we can assume that \(\Sigma \subset \{x_N=0\}\). The quantity we want to approximate is
To fix the ideas, let us assume that \(u^-(x)=z_1(x)\), and \(u^+(x)=z_2(x)\) for all \(x\in \Sigma \). Consider a grid of \((N-1)\)-dimensional cubes \(Q'(\overline{y}_i,r_n)\subset \{ x_N=0 \}\sim \mathbb {R}^{N-1}\) of side length \(r_n>0\) and center \(\overline{y}_i\in \{ x_N=0 \}\). Identify a point \(x\in \mathbb {R}^N\) with the pair (y, t) where \(y\in \mathbb {R}^{N-1}\) and \(t\in \mathbb {R}\). Since the map \(x\mapsto d_W(x, z_1(x), z_2(x))\) is continuous (see Corollary 3.5), it is enough to approximate
The advantage of considering this discretization is the following: for each \((y,t)\in Q'(\overline{y}_i,r_n) \times (0,\tau _n)\), for some \(\tau _n>0\) with \(\tau _n\rightarrow 0\) as \(n\rightarrow \infty \), we can simply consider a suitable reparametrization of a geodesic \(\gamma _i\in \mathcal {A}( z_1(\overline{y}_i),z_2(\overline{y}_i) )\) for \(d_W\), instead of taking a different geodesic for each \(x\in Q'(\overline{y}_i,r_n)\). This comes at the cost of having to perform two gluing constructions in order for the function we define to have the required regularity \(H^1(\Omega ;\mathbb {R}^M)\). The first one is to use cut-off functions to transition between the geodesics considered in each adjacent cube. The second one is to match the value \(z_i(\overline{y}_i)\) with \(z_i(y,\tau _n)\). This will be done by using a linear interpolation. The technical difficulty is to show that the energy contribution of these gluing constructions is asymptotically negligible.
1.4 A discussion on the assumptions
It is immediate to verify that in view of (H.1), (H.3), and (H.4), there exists a positive number \(\eta \) such that
Let us mention here that while assumption (H.4) is required in order to obtain compactness of sequences with uniformly bounded energies, it is possible to prove the \(\Gamma \)-convergence results of Sect. 1.2 by assuming (H.1)–(H.3), and (25) in place of (H.4). We refer the reader to Sect. 5.4 for more details.
Finally, we notice that (H.2), (H.3), and (25) are only needed in oder to obtain the results of Sect. 3 (see Proposition 3.2 and Corollary 3.5). If the results of Sect. 3 could be obtained with weaker assumptions than (H.2) and (H.3), then the statements and the proofs of the main results would require a few adjustments, as we explain below. First of all, we notice that (H.2), (H.3), and (25) imply that
for all \(i \ne j \in {1, \dots ,k}\) and \(x \in \Omega \). Therefore, the functional \(\widetilde{\mathcal {F}_0} :L^1(\Omega ;\mathbb {R}^M)\rightarrow [0,+\infty ]\) defined as
is finite only if \(u\in BV(\Omega ;z_1,\dots ,z_k)\) with \(\mathcal {H}^{N-1}(J_u)<\infty \). In particular, it coincides with \(\mathcal {F}_0\) (see Definition 1.3). Notice that \(\widetilde{\mathcal {F}_0}\) is well defined since \(J_u\) is countably \(\mathcal {H}^{N-1}\)-rectifiable for all \(u \in L^1_{\mathrm{loc}}(\Omega ;\mathbb {R}^M)\) (see [26]). On the other hand, if (H.2) does not hold, i.e., if
then there could exist \(u \in L^1(\Omega ; \mathbb {R}^M)\) such that \(\widetilde{\mathcal {F}_0}(u) < \infty \), but u is not of bounded variation, as the following remark shows.
Remark 1.8
Take \(N = M = k = 2\), \(\Omega = (-1, 1)^2\), \(W(x, p) {:}{=} |p - z_1(x)|^2 |p - z_2(x)|^2\), where
Notice that W satisfies (H.1), (H.3), and (H.4), but not (H.2). Let \(f(t) {:}{=} \sin \left( t^{-2}\right) \), and consider the function
Fix \(x = (x_1,x_2) \in \Omega \) with \(x_1 > 0\) and let \(\gamma :[-1, 1] \rightarrow \mathbb {R}^2\) be the curve given by
As one can readily check, \(\gamma \in \mathcal {A}(z_1(x),z_2(x))\) and
Therefore, by means of a direct computation we see that
while on the other hand we have
Consequently, \(\widetilde{\mathcal {F}_0}(u) < \infty \), but u is not of bounded variation. Moreover, notice that the jump set of the function u is not the boundary of a partition of \(\Omega \).
Theorem 1.9
Let W be given as in (H.1) and (H.4), and assume that the conclusions of Proposition 3.2 and Corollary 3.5 hold true. Then the following hold:
-
(i)
if \(\{u_n\}_{n \in \mathbb {N}} \subset H^1(\Omega ; \mathbb {R}^M)\) is such that
$$\begin{aligned} \sup \left\{ \mathcal {F}_n(u_n) : n \in \mathbb {N}\right\} < \infty , \end{aligned}$$then, eventually extracting a subsequence (which we do not relabel), we have that \(u_n \rightarrow u\) in \(L^1(\Omega ;\mathbb {R}^M)\), where u is such that \(u(x) \in \{z_1(x), \dots , z_k(x)\}\) for \(\mathcal {L}^N\)-a.e. \(x \in \Omega \), and \(\widetilde{\mathcal {F}_0}(u) < \infty \).
-
(ii)
The sequence of functionals \(\mathcal {F}_n\) \(\Gamma \)-converges with respect to the \(L^1\)-topology to \(\widetilde{\mathcal {F}_0}\).
1.5 Outline of the paper
The paper is organized as follows. In Sect. 2 we introduce the notation and we recall the definitions of the mathematical objects we will need for our analysis. The Lipschitz continuity of the function \(x\mapsto d_W(x,z_i(x), z_j(x))\) is shown in Corollary 3.5. The proof makes use of a result obtained in the first part of Sect. 3, namely the fact that geodesics for \(d_W\) (and also for more degenerate conformal factors) joining two points in a compact subset of \(\mathbb {R}^M\) have uniformly bounded Euclidean lenght (see Proposition 3.2). The proof of Theorem 1.4 is divided in three parts: in Sect. 4.1 we prove the compactness result, while Sects. 4.2 and 4.3 are devoted at obtaining the liminf and the limsup inequality, respectively. Finally, in Sect. 5 we discuss how to suitably modify the arguments we used to prove Theorem 1.4 in order to obtain Theorems 1.5, 1.7, and 1.9.
2 Preliminaries
For the convenience of the reader, in this section we collect a few definitions and tools used throughout the paper.
2.1 Radon measures
Let \(\mathcal {M}(\Omega )\) be the space of finite Radon measures on \(\Omega \). We recall that in view of the Riesz representation theorem (see, for example, [30, Theorem 1.200]), if we denote by \(C_0(\Omega )\) the completion with respect to the \(L^\infty \) norm of the space of continuous functions with compact support in \(\Omega \), then the dual of \(C_0(\Omega )\) can be identified with \(\mathcal {M}(\Omega )\). The subset of \(\mathcal {M}(\Omega )\) consisting of all finite nonnegative Radon measures on \(\Omega \) will be denoted by \(\mathcal {M}^+(\Omega )\). For the sake of brevity, the results of this section are stated in the form that will be used in the paper; for this reason we refer the reader to the monographs [29, 30] for a more detailed treatment of these topics.
Definition 2.1
We say that a sequence \(\{\mu _n\}_{n\in \mathbb {N}}\subset \mathcal {M}^+(\Omega )\) weakly-\(*\) converges to \(\mu \in \mathcal {M}^+(\Omega )\), and we write \(\mu _n{\mathop {\rightharpoonup }\limits ^{*}}\mu \), if
for all \(\varphi \in C_0(\Omega )\).
The first result of this section gives a simple criterion for weak-\(*\) compactness of measures. For a proof see [30, Proposition 1.202].
Theorem 2.2
Let \(\{\mu _n\}_{n \in \mathbb {N}} \subset \mathcal {M}^+(\Omega )\) be a sequence of finite nonnegative Radon measures such that
Then there exist a subsequence (not relabeled) and a measure \(\mu \in \mathcal {M}^+(\Omega )\) such that \(\mu _n {\mathop {\rightharpoonup }\limits ^{*}}\mu \).
The following lemma is a key ingredient in the proof of the liminf inequality (see Proposition 4.2). For a proof see [30, Theorem 1.203(iii)].
Lemma 2.3
Let \(\{\mu _n\}_{n \in \mathbb {N}} \subset \mathcal {M}^+(\Omega )\) be a sequence of finite nonnegative Radon measures such that \(\mu _n{\mathop {\rightharpoonup }\limits ^{*}}\mu \). Then
for every Borel set \(A \subset \Omega \) with \(\mu (\partial A) = 0\).
Remark 2.4
For our purposes, the condition \(\mu (\partial A) = 0\) in Lemma 2.3 is not very restrictive. Indeed, fix \(x \in \Omega \) and let E be an open convex set that contains x. Consider the family \(\{E_\rho \}_{\rho > 0}\) of rescaled copies of E, i.e., let \(E^\rho {:}{=} x + \rho (E - x)\). Since by assumption \(\mu \) is a finite Radon measure, it is immediate to verify that the set
is at most countable. Indeed, take \(\overline{\rho } > 0\) such that \(E_{\overline{\rho }} \subset \Omega \). Then, for each \(m \in \mathbb {N}\), consider the set
Then
yielding that each \(A_m\) is at most finite (and not all of them can be non-empty). In particular, if \(\mu _n{\mathop {\rightharpoonup }\limits ^{*}}\mu \), the argument above shows that
for all but at most countably many values of \(\rho < \overline{\rho }\).
In an approximation result needed for the limsup inequality (see Proposition 4.3) we will also make use of another notion of convergence for measures.
Definition 2.5
Let \(\{\mu _n\}_{n \in \mathbb {N}} \subset \mathcal {M}^+(\Omega )\) be a sequence of finite nonnegative Radon measures. We say that \(\mu _n\) converges in \((C_b(\Omega ))'\) to \(\mu \in \mathcal {M}^+(\Omega )\) if
for all \(\varphi \in C_b(\Omega )\). Here \(C_b(\Omega )\) denotes the space of continuous bounded functions on \(\Omega \).
Since \(C_0(\Omega )\subset C_b(\Omega )\), if \(\mu _n\) converges in \((C_b(\Omega ))'\) to \(\mu \), then \(\mu _n{\mathop {\rightharpoonup }\limits ^{*}}\mu \). The opposite is true if, in addition, we know that the limit measure does not charge the boundary of \(\Omega \), as shown in the next result (for a proof see [30, Proposition 1.206]).
Lemma 2.6
Let \(\{\mu _n\}_{n \in \mathbb {N}} \subset \mathcal {M}^+(\Omega )\) be a sequence of finite nonnegative Radon measures such that \(\mu _n{\mathop {\rightharpoonup }\limits ^{*}}\mu \) for some \(\mu \in \mathcal {M}^+(\Omega )\), and assume that
Then \(\mu _n\) converges in \((C_b(\Omega ))'\) to \(\mu \).
We conclude this list of results on Radon measures with a well-known theorem from measure theory. The result presented below allows to recover the absolutely continuous part of a measure with respect to another via a differentiation process. For a proof we refer the reader to [30, Theorem 1.153 and Remark 1.154].
Theorem 2.7
(Besicovitch derivation theorem). Let \(\mu ,\nu \in \mathcal {M}^+(\Omega )\), and write \(\nu =\nu ^{ac}+\nu ^s\), where \(\nu ^{ac}\ll \mu \), and \(\nu ^s\perp \mu \). Let \(C\subset \mathbb {R}^N\) be an open convex set that contains the origin. Then there exists a Borel set \(S\subset \Omega \) with \(\mu (S)=0\) such that
and
for all \(x\in \Omega \setminus S\).
2.2 Functions of bounded variation
We start by recalling basic definitions and well known properties of functions of bounded variation and sets of finite perimeter. We refer the reader to [6] for more details.
Definition 2.8
Let \(u \in L^1(\Omega ;\mathbb {R}^M)\). We say that u is a function of bounded variation if its distributional derivative Du is a finite matrix-valued Radon measure on \(\Omega \). In particular,
In this case we write \(u \in BV(\Omega ; \mathbb {R}^M)\).
Definition 2.9
Let \(u \in L^1(\Omega ;\mathbb {R}^M)\). We define \(J_u \subset \Omega \), the jump set of u, as the set of points \(x \in \Omega \) such that there exist distinct vectors \(a, b \in \mathbb {R}^M\) and a direction \(\nu \in \mathbb {S}^{N - 1}\) for which
where
and
If \(x\in J_u\), we denote the triple \((a, b,\nu )\) as \((u^+(x), u^-(x), \nu _u(x))\). Notice that \((u^+(x), u^-(x),\nu _u(x))\) is unique up to replacing \(\nu _u(x)\) with \(-\nu _u(x)\) and interchanging \(u^+(x)\) and \(u^-(x)\).
Remark 2.10
Notice that using cubes instead of balls in Definition 2.9 yields an analogous characterization of the jump set. In order to keep the notation as simple as possible, in the proof of the liminf inequality (see Proposition 4.2) it will be convenient to consider cubes with two faces orthogonal to the vector \(\nu _u\).
The next result concerns the structure of the jump set and the decomposition of the distributional derivative of a function of bounded variation. For a proof see [6, Theorem 3.78].
Theorem 2.11
(Federer–Vol’pert). The jump set \(J_u\) of a function \(u\in BV(\Omega ;\mathbb {R}^M)\) is countably \(\mathcal {H}^{N-1}\)-rectifiable, i.e., there exist Lipschitz continuous functions \(f_i :\mathbb {R}^{N-1} \rightarrow \mathbb {R}^N\) such that
where each \(K_i\) is a compact subset of \(\mathbb {R}^{N-1}\). Moreover,
where \(D^c u\) denotes the so called Cantor part of the distributional derivative.
We now focus on the special class of functions of bounded variations which consists of characteristic functions of sets.
Definition 2.12
Let \(E\subset \Omega \). We say that E has finite perimeter in \(\Omega \) if its characteristic function \(\mathbb {1}_E :\Omega \rightarrow \{0,1\}\), defined as
is of bounded variation in \(\Omega \).
For sets of finite perimeter, we have two notions of boundary coming from measure theory.
Definition 2.13
Let \(E\subset \mathbb {R}^N\) be a set of finite perimeter in \(\Omega \). We call reduced boundary of E, denoted with \(\mathcal {F} E\), the set of points \(x \in \mathrm {supp}|D \mathbb {1}_E|\cap \Omega \) for which the limit
exists in \(\mathbb {R}^N\) and is such that \(|\nu _E(x)|=1\).
Definition 2.14
Let \(E\subset \mathbb {R}^N\) be an \(\mathcal {L}^N\)-measurable set. For \(t\in [0,1]\) we define
the set of points of density t for E. The set \(\partial ^* E{:}{=}\mathbb {R}^N\setminus (E^1\cup E^0)\) is called the essential boundary of E.
The relation between these two notions of measure theoretic boundary is specified in the following theorem (see [6, Theorem 3.61]).
Theorem 2.15
(Federer). Let \(E\subset \mathbb {R}^N\) be a set of finite perimeter in \(\Omega \). Then
and
In a similar fashion as the Federer–Vol’pert theorem, the reduced boundary enjoys some structure properties (see [6, Theorem 3.59]).
Theorem 2.16
(De Giorgi). Let \(E \subset \mathbb {R}^N\) be a set of finite perimeter in \(\Omega \) and for all \(x_0 \in \mathcal {F}E\) and \(\rho > 0\) let \(E_\rho {:}{=} \frac{E-x_0}{\rho }\). Then \(\mathcal {F}E\) is countably \(\mathcal {H}^{N-1}\)-rectifiable and
locally in \(L^1(\Omega )\) as \(\rho \rightarrow 0\), where \(H {:}{=} \{ x\in \mathbb {R}^N : x\cdot \nu _E(x_0)\ge 0 \}\). Moreover,
Finally, we define the notion of Caccioppoli partitions. This will be useful in the approximation results in order to get the limsup inequality (see Sect. 4.3).
Definition 2.17
A partition \(\{\Omega ^i\}_{i \in \mathbb {N}}\) of \(\Omega \) is called a Caccioppoli partition if each \(\Omega ^i\) is a set of finite perimeter in \(\Omega \).
Remark 2.18
Notice that for every \(u \in BV(\Omega ; z_1, \dots , z_k)\) (see Definition 1.3) there exists a Caccioppoli partition \(\{\Omega ^1,\dots ,\Omega ^k\}\) of \(\Omega \) such that \(u(x) = z_i(x)\) for \(\mathcal {L}^N\)-a.e. \(x \in \Omega ^i\), for all \(i \in \{1,\dots ,k\}\), and
Moreover, using [6, Theorem 3.84], it is possible to show that the distributional derivative of each \(u\in BV(\Omega ;z_1,\dots ,z_k)\) has no Cantor part.
2.3 \(\Gamma \)-convergence
We now recall the basic definition and some properties of \(\Gamma \)-convergence that will be used throughout the paper (for a reference see [15, 23]).
Definition 2.19
Let (X, d) be a metric space. We say that a sequence of functions \(F_n :X \rightarrow \mathbb {R}\cup \{\infty \}\) \(\Gamma \)-converges to \(F :X \rightarrow \mathbb {R}\cup \{\infty \}\), and we write \(F_n {\mathop {\longrightarrow }\limits ^{\Gamma -d}} F\), if the following hold:
- (i):
-
for every \(x \in X\) and every sequence \(\{x_n\}_{n \in \mathbb {N}}\) of elements of X such that \(x_n \rightarrow x\) we have
$$\begin{aligned} F(x) \le \liminf _{n \rightarrow \infty } F_n(x_n); \end{aligned}$$ - (ii):
-
for every \(x \in X\) there exists a sequence \(\{x_n\}_{n\in \mathbb {N}}\) of elements of X such that \(x_n \rightarrow x\) and
$$\begin{aligned} \limsup _{n \rightarrow \infty } F_n(x_n)\le F(x). \end{aligned}$$
A sequence \(\{x_n\}_{n\in \mathbb {N}}\) as in (ii) is called a recovery sequence for x.
We recall that the definition of \(\Gamma \)-convergence is primarily motivated by seeking minimal conditions which guarantee the convergence of minima and minimizers for a family of functionals (see, for example, [23, Corollary 7.20]). This is specified in the following theorem.
Theorem 2.20
Let (X, d) be a metric space, let \(F_n, F :X \rightarrow \mathbb {R}\cup \{\infty \}\) and assume that \(F_n {\mathop {\longrightarrow }\limits ^{\Gamma -d}} F\). For each \(n \in \mathbb {N}\), let \(x_n \in X\) be a minimizer of \(F_n\) on X. Then every cluster point of \(\{x_n\}_{n \in \mathbb {N}}\) is a minimizer of F and
3 Existence of minimizing geodesics
The purpose of this section is to collect some preliminary results concerning the existence of minimizing geodesics for possibly degenerate metrics with conformal factor F. To be precise, given a continuous nonnegative function F and \(p, q \in \mathbb {R}^M\), we study the minimization problem
where the class of admissible parametrizations \(\mathcal {A}(p,q)\) is given as in Definition 1.2. Notice that the value of the integral on the right-hand side of (26) is a purely geometric quantity, i.e., it is independent of the choice of the parametrization. Throughout the rest of the paper, we refer to any function \(\gamma \in \mathcal {A}(p,q)\) for which the infimum on the right-hand side of (26) is achieved as a minimizing geodesic. Moreover, we use the phrase sequence of almost minimizing geodesics to denote a sequence in \(\mathcal {A}(p,q)\) for which the infimum is achieved in the limit.
Let us remark that the existence of minimizing geodesics for (26) has been previously investigated by many authors. We mention here the work of Zuniga and Sternberg [62], where existence of solutions to the minimization problem is shown under very general assumptions on the conformal factor F. Of particular interest for our analysis is the special case where the conformal factor is given by
Indeed, we observe that for a fixed value of \(x \in \Omega \), if F is given as above, the distance function \(d_F\) defined in (26) is identically equal to the function \(d_W(x, \cdot , \cdot )\), introduced in Definition 1.2.
As the proofs of our main results rely on a precise understanding of minimizing geodesics for (26), and in particular on their dependence on the variable x when F is chosen as in (27), compared to [62] we require more stringent assumptions on the behavior of the potential W near the wells (see (H.3)). In turn, our approach is in spirit closer to that of Sternberg [59], where the author considered a singular perturbation of the conformal factor which renders the associated Riemannian metric conformal to the Euclidean metric and proceeded to prove a uniform bound with respect to the perturbation parameter. Our method, on the other hand, consists of proving a uniform bound on the Euclidean length of a sequence of almost minimizing geodesics. For technical reasons, we will need to consider conformal factors of the form
where \(\mathcal {R} \subset \Omega \).
In the following, given a function \(\gamma \in W^{1,1}(\mathcal {I};\mathbb {R}^M)\), where \(\mathcal {I} \subset \mathbb {R}\) is an open interval, we work with its representative in \(AC(\overline{\mathcal {I}};\mathbb {R}^M)\) and denote its Euclidean length by \(L(\gamma )\), i.e.,
For any two points \(p,q \in \mathbb {R}^M\), we let \(\ell _{p,q}\) be a parametrization in \(\mathcal {A}(p,q)\) of the line segment that joins p to q. To be precise, for \(t \in [-1,1]\) we let
We begin by presenting a compactness criterion for almost minimizing geodesics. The result states that the existence of a sequence of almost minimizing geodesic for (26) with a uniform bound on the Euclidean length of each element in the sequence implies the existence of a minimizing geodesic which enjoys the same bound. The proof is adapted from the classical result on the existence of shortest paths, i.e., minimizers of the length functional (28) (see, for example, [44, Theorem 5.38]).
Lemma 3.1
Given a continuous function \(F :\mathbb {R}^M \rightarrow [0,\infty )\) and \(p,q \in \mathbb {R}^M\), let \(d_F(p,q)\) be given as in (26), \(\{\gamma _n\}_{n \in \mathbb {N}} \subset \mathcal {A}(p,q)\) be a sequence of almost minimizing geodesics, i.e.,
and furthermore assume that \(L(\gamma _n) \le \Lambda \) for some positive constant \(\Lambda \) independent of n. Then there exists \(\gamma \in \mathcal {A}(p,q)\) with \(L(\gamma ) \le \Lambda \) such that
Proof
Notice that if \(p = q\) then there is nothing to do. Thus, we can assume without loss of generality that \(\gamma _n :[-1,1] \rightarrow \mathbb {R}^M\) is a parametric representation of a continuous simple rectifiable curve. In turn, it can be parametrized by arclength, i.e., there exists a function \(\varphi _n :[0,L(\gamma _n)] \rightarrow [-1,1]\) with the property that
is Lipschitz continuous, and in particular \(|v_n'(s)| = 1\) for \(\mathcal {L}^1\)-a.e. \(s \in (0,L(\gamma _n))\). Eventually extracting a subsequence (which we do not relabel), we can assume that \(L(\gamma _n) \rightarrow \lambda \) for some \(\lambda > 0\). Let \(\psi _n :[-1,1] \rightarrow [0, L(\gamma _n)]\) be defined via
and set \(w_n(t) {:}{=} v_n(\psi _n(t))\). Notice that the functions \(w_n \in \mathcal {A}(p,q)\) and satisfy
for \(\mathcal {L}^1\)-a.e. \(t \in (-1,1)\) and every n sufficiently large. Consequently, we are in a position to apply the Ascoli-Arzelá theorem to find a function \(\gamma :[-1,1] \rightarrow \mathbb {R}^M\) and a further subsequence (not relabeled) such that \(w_n \rightarrow \gamma \) uniformly. Furthermore, since \(L(\cdot )\) is lower semicontinuous with respect to pointwise convergence, we also get
Finally, in view of (30), we notice that for every \(s,t \in (-1,1)\) we have
and thus
This concludes the proof. \(\square \)
With Lemma 3.1 in hand, we can turn our attention back to the minimization problem (26).
Proposition 3.2
Let W be given as in (H.1)–(H.3), and assume that (25) holds for some \(\eta > 0\). Let \(\mathcal {R}\) be a convex compact subset of \(\Omega \), and denote by \(z_i(\mathcal {R})\) the set of points \(z \in \mathbb {R}^M\) such that \(z = z_i(x)\) for some \(x \in \mathcal {R}\). Assume that
whenever \(i \ne j\), where
define
and let \(d_F :\mathbb {R}^M \times \mathbb {R}^M \rightarrow [0,\infty )\) be given as in (26). Then for every \(p,q \in \mathbb {R}^M\) there exists a minimizing geodesic \(\gamma \in \mathcal {A}(p,q)\) for \(d_F(p,q)\) such that
where, for \(\eta \) as in (25),
and
Proof
In the following we let \(d_i(z)\) denote the distance from a point \(z \in \mathbb {R}^M\) to the set \(z_i(\mathcal {R})\). We recall that the functions \(d_i :\mathbb {R}^M \rightarrow [0,\infty )\) are Lipschitz continuous with Lipschitz constant at most 1. We divide the proof into several steps.
Step 1: We begin by showing that for \(\sigma \) as in (33), if \(z \in \mathcal {N}_{\sigma }(z_i(\mathcal {R}))\) then
To this end, for \(z \in \mathcal {N}_{\sigma }(z_i(\mathcal {R}))\) define \(A {:}{=} \{x \in \mathcal {R} : |z - z_i(x)| \le \sigma \}\) and \(B {:}{=} \{y \in \mathcal {R} : |z - z_i(y)| > \sigma \}\). We claim that
Indeed, for every \(x \in A\) we have
while for \(y \in B\), by (31) we obtain
provided that \(|z - z_j(y)| \le r\) for some \(j \ne i\), and \(W(y,z) \ge \eta \) otherwise, where \(\eta > 0\) is the constant introduced in (25). In turn, inequality (36) follows from (33), (37), and (38). Notice in particular that (36) implies that
and (35) readily follows.
Step 2: In this step we show that if \(p \in \mathcal {N}_{\sigma }(z_i(\mathcal {R}))\), and \(q \in z_i(\mathcal {R})\) realizes the distance, i.e., \(d_i(p) = |p - q|\), then the line segment that joins p and q is a minimizing geodesic for \(d_{F}(p,q)\). To see this, let \(\gamma \in \mathcal {A}(p,q)\) and notice that the map \(t \mapsto d_i(\gamma (t))\) is continuous, \(d_i(\gamma (-1)) = |p - q|\), and \(d_i(\gamma (1)) = 0\). Thus, by the mean value theorem, for every \(y \in (0,|p - q|)\) there exists \(t \in (-1,1)\) such that \(d_i(\gamma (t)) = y\). We recall that the composite function \(d_i \circ \gamma \) belongs to the space \(W^{1,1}((-1,1))\) and that
for \(\mathcal {L}^1\)-a.e. \(t \in (-1,1)\). Indeed, as remarked above, the function \(d_i\) is Lipschitz continuous with Lipschitz constant at most 1. Moreover, the composite function \(d_i \circ \gamma :(-1,1) \rightarrow \mathbb {R}\) is absolutely continuous and therefore it is also differentiable for \(\mathcal {L}^1\)-a.e. \(t \in (-1,1)\). If we now let \(t \in (-1,1)\) be a point where both \(\gamma \) and \(d_i \circ \gamma \) are differentiable we see that
which gives (39). By (35), (39), and by an application of the co-area formula (see [6, Theorem 3.2.6]), we get
On the other hand, if we let \(\ell _{p,q}\) be given as in (29), as one can readily check, we have that \(d_i(\ell _{p,q}(t)) = |\ell _{p,q}(t) - q|\) for every \(t \in [-1,1]\) and \(\mathcal {H}^0(\{t : |\ell _{p,q}(t) - q| = y\}) = 1\) for every \(y \in (0, |p - q|)\). In turn, by the co-area formula we conclude that
This proves our claim.
Step 3: If \(p,q \in z_i(\mathcal {R})\) then there are \(x_1,x_2 \in \mathcal {R}\) such that \(z_i(x_1) = p\) and \(z_i(x_2) = q\). Let
and notice that \(\psi (t) \in \mathcal {R}\) for every \(t \in [-1,1]\) since \(\mathcal {R}\) is convex by assumption. Let \(\gamma :[-1,1] \rightarrow \mathbb {R}^M\) be defined via \(\gamma (t) {:}{=} z_i(\psi (t))\). Then \(\gamma \in \mathcal {A}(p,q)\) is a minimizing geodesic for \(d_F(p,q)\), and furthermore
Consequently, we see that if \(p \in \mathcal {N}_{\sigma }(z_i(\mathcal {R}))\) and \(q \in z_i(\mathcal {R})\) then any parametrization in \(\mathcal {A}(p,q)\) of the line segment from p to a closest point on \(z_i(\mathcal {R})\), namely \(p'\), together with any curve with support contained in \(z_i(\mathcal {R})\) that connects \(p'\) to q gives a minimizing geodesic for \(d_{F}(p,q)\).
Step 4: This step is concerned with the proof of the more delicate case where p and q are distinct points in \(\mathcal {N}_{\sigma /2}(z_i(\mathcal {R}))\), neither of which lies on \(z_i(\mathcal {R})\). Throughout the step, we assume without loss of generality that \(d_i(p) \ge d_i(q)\). Let \(\{\gamma _n\}_{n \in \mathbb {N}} \subset \mathcal {A}(p,q)\) be a sequence of almost minimizing geodesics for \(d_F(p,q)\) and observe that it is possible to assume that
for every \((n,t) \in \mathbb {N}\times [-1,1]\). Indeed, if this is not the case then we can find two disjoint subintervals of \((-1,1)\), namely \(\mathcal {I}_1 {:}{=} (s_1,t_1)\) and \(\mathcal {I}_2 {:}{=} (s_2,t_2)\), such that
and
for all \(t \in \mathcal {I}_1 \cup \mathcal {I}_2\). In turn, we have
On the other hand, let \(p',q' \in z_i(\mathcal {R})\) be such that \(d_i(p) = |p - p'|\) and \(d_i(q) = |q - q'|\), and for \(x_1, x_2 \in \mathcal {R}\) such that \(p' = z_i(x_1)\) and \(q' = z_i(x_2)\) define
Then \(\gamma \in \mathcal {A}(p,q)\) is a parametric representation of the curve whose support constitutes of the line segments that join p to \(p'\) and \(q'\) to q, together with an arc in \(z_i(\mathcal {R})\) that connects \(p'\) and \(q'\). By means of a direct computation we see that
Comparing (42) and (44) shows that we can replace every \(\gamma _n\) for which (41) does not hold with the function \(\gamma \) defined in (43) and thus obtain a sequence of almost minimizing geodesics with the desired properties.
Next, we claim that there exists a minimizing geodesic for \(d_F(p,q)\) with Euclidean length bounded from above by
In view of Lemma 3.1, if the sequence \(\{\gamma _n\}_{n \in \mathbb {N}}\) admits a subsequence \(\{\gamma _{n_k}\}_{k \in \mathbb {N}}\) such that \(L(\gamma _{n_k}) \le d_i(p) + d_i(q) + \mathrm{Lip}(z_i)\mathrm{diam}(\mathcal {R})\) then there is nothing to do. Notice also that if there exists a subsequence \(\{\gamma _{n_j}\}_{j \in \mathbb {N}}\) with the property that
then an application of the co-area formula yields
and with similar computations in the interval \((t_j,1)\) we arrive at
where \(\gamma \) is the function defined in (43) and the last equality follows from (44). In turn, \(\gamma \) is a minimizing geodesic and the claim would follow in this case. Thus, throughout the rest of the step we assume that
for every \(n \in \mathbb {N}\). In particular, eventually passing to a subsequence (which we do not relabel), we can assume without loss of generality that \(\gamma _n :[-1,1] \rightarrow \mathbb {R}^M\) is a parametric representation of a continuous simple rectifiable curve. Hence, it can be parametrized by arclength, i.e., there exists a function \(\varphi _n :[0,L(\gamma _n)] \rightarrow [-1,1]\) with the property that
is Lipschitz continuous, and in particular \(|v_n'(s)| = 1\) for \(\mathcal {L}^1\)-a.e. \(s \in (0,L(\gamma _n))\), where
Our aim is to show that if (46) and (48) hold, then the function \(\gamma \) in (43) is a minimizing geodesic for \(d_F(p,q)\). We prove this claim in two substeps.
Substep 1: If \(d_i(p) = d_i(\gamma _n(t_n))\), then \(d_i(p) = d_i(q)\), and so
where in the last equality we have used (48). In this case the claim readily follows by comparing (44) and (49).
Substep 2: If \(d_i(p) > d_i(\gamma _n(t_n))\), let \(p' \in z_i(\mathcal {R})\) be given as above, let \(p_n\) be the point on the line segment that joins p to \(p'\) with the property that \(d_i(p_n) = d_i(\gamma _n(t_n))\), and define
Let \(w_n :[0, a_n + 2d_i(p_n)] \rightarrow \mathbb {R}^M\) be a parametrization by arclength of a simple closed arc of length \(a_n + 2d_i(p_n)\) with the following properties:
Let \(q_n\) be the point on the line segment that joins p and \(p'\) with the property that \(d_i(q_n) = d_i(q)\) and define \(f_n :[0, d_i(p) - d_i(p_n)] \rightarrow \mathbb {R}^M\) and \(g_n :[d_i(p) + d_i(p_n) + a_n, L(\gamma _n)] \rightarrow \mathbb {R}^M\) via
Notice that if \(d_i(q) = d_i(\gamma _n(t_n))\) then \(q_n = p_n\) and the interval of definition of \(g_n\) trivializes to a single point. Finally (see Fig. 2), we set
As one can readily check, when restricted to the interval \((0, d_i(p) + d_i(p_n) + a_n)\) the function \(W_n\) gives a parametrization by arclength of a simple arc of length \(d_i(p) + d_i(p_n) + a_n\). Moreover, if \(d_i(p) + d_i(p_n) + a_n < L(\gamma _n)\), then \(W_n\) restricted to the interval \((d_i(p) + d_i(p_n) + a_n, L(\gamma _n))\) gives a parametrization by arclength of a segment of length
We claim that
for every \(t \in [0,L(\gamma _n)]\), where \(v_n\) is the reparametrization of \(\gamma _n\) introduced in (47). To prove claim, we argue by contradiction by assuming first that there exists \(\overline{t} \in (0,d_i(p) - d_i(p_n))\) for which (53) does not hold, so that by (52)
Recalling that \(d_i\) is Lipschitz continuous with Lipschitz constant at most 1, and in view of (47), we see that
Combining (54) and (55) we arrive at a contradiction. Notice that for \(t \in (d_i(p) - d_i(p_n), d_i(p) + d_i(p_n) + a_n)\) inequality (53) is satisfied in view of (50)–(52), while for all the remaining values of t we can argue as above. Hence the claim is proved and therefore
for every \(t \in (0, L(\gamma _n))\). Moreover, in view of (51) and by means of a direct computation which uses the co-area formula, we see that
In particular, combining the inequalities above with (44) and (56) we obtain
Letting \(n \rightarrow \infty \) in the previous inequality shows that \(\gamma \) is a minimizing geodesic, thus proving our claim.
Step 5: Finally, we show the existence of a minimizing geodesic for any two distinct points \(p,q \in \mathbb {R}^M\). Given \(v \in \mathcal {A}(p,q)\) such that
let
If \(s_1 < 1\), let \(i_1\) be such that \(v(s_1) \in \overline{\mathcal {N}_{\sigma /2}(z_{i_1}(\mathcal {R}))}\), and define
Notice that if \(s_1 < 1\), then \(s_1\) and \(t_1\) denote the first and last instance for which the support of the curve parametrized by v can be found in the \(\sigma /2\)-neighborhood of \(z_{i_1}(\mathcal {R})\), respectively. Similarly, for \(j > 1\), if \(t_{j - 1} < 1\), we define \(s_j\), \(i_j\), and \(t_j\) inductively as follows:
if \(s_j < 1\) then the index \(i_j\) is such that \(v(s_j) \in \overline{\mathcal {N}_{\sigma /2}(z_{i_j}(\mathcal {R}))}\), and
For every \(j \in \{1, \dots , k\}\), let \(v_j :[s_j, t_j] \rightarrow \mathbb {R}^M\) be the reparametrization of the minimizing geodesic which connects \(v(s_j)\) to \(v(t_j)\) found as in the previous steps, and let \(V :[-1,1] \rightarrow \mathbb {R}^M\) be defined via
Then \(V \in \mathcal {A}(p,q)\),
and furthermore, we see from (45) that
where \(\mathcal {J}\) denotes the set of indices for which \(s_j \ne t_j\) and \(\Sigma _\sigma (\mathcal {R})\) is defined as in (34).
Let \(\{\gamma _n\}_{n \in \mathbb {N}} \subset \mathcal {A}(p,q)\) be a sequence of almost minimizing geodesics for \(d_F(p,q)\). Then, for every n sufficiently large we can find a function \(V_n \in \mathcal {A}(p,q)\) such that
Thus, we are in a position to apply Lemma 3.1. This concludes the the proof. \(\square \)
Remark 3.3
In view of (H.2), condition (31) is satisfied if, for example, \(\mathrm{diam}(\mathcal {R})\) is sufficiently small. Moreover, the convexity assumption on \(\mathcal {R}\) can be easily relaxed by requiring that for any two points \(x_1,x_2 \in \mathcal {R}\) there exists a path in \(\mathcal {R}\) with finite Euclidean length from one to the other. One must then change the constant on the right-hand side of (32) accordingly.
Remark 3.4
Notice that the right-hand side of (32) depends continuously on p and q. In particular, given \(\lambda > 0\), set
and observe that for every \(p,q \in B(0,\lambda )\), if \(\ell _{p,q}\) is defined as in (29), we have
Consequently, if we let
then for every \(p,q \in B(0,\lambda )\), Proposition 3.2 yields the existence of a minimizing geodesic \(\gamma \in \mathcal {A}(p,q)\) for \(d_F(p,q)\) such that
Finally, observe that if \(\mathcal {S} \subset \mathcal {R}\) then \(\Sigma _{\sigma }(\mathcal {R}) \le \Sigma _{\sigma }(\mathcal {S})\) (see (34)) and that in view of assumptions (H.1)–(H.3) and (25),
Therefore, the following hold:
- (i):
-
\(\Lambda (\lambda , \mathcal {R}) \ge \Lambda (\lambda , \mathcal {S})\),
- (ii):
-
\(\sup \left\{ \Lambda (\lambda , \{x\}) : x \in \Omega \right\} {=}{:} \Lambda _{W}(\lambda ) < \infty \).
Corollary 3.5
Under the assumptions of Proposition 3.2, the function \(d_W\) introduced in Definition 1.2 is Lipschitz continuous in x and locally Lipschitz continuous with respect to the variables p and q. In particular, \(d_W\) is locally Lipschitz continuous, i.e., Lipschitz continuous on every compact subset of \(\overline{\Omega } \times \mathbb {R}^M \times \mathbb {R}^M\).
Proof
Fix \(p, q \in \mathbb {R}^M\) and let \(x_1,x_2\) be any two points in \(\Omega \). Let \(\lambda \) be such that \(p,q \in B(0,\lambda )\) and notice that we can assume without loss of generality that \(d_W(x_1, p, q) \ge d_W(x_2, p, q)\) since in the other case the result follows from similar computations. It follows from an application of Proposition 3.2 with \(\mathcal {R} = \{x_2\}\), together with Remark 3.4, that there exists a minimizing geodesic for \(d_W(x_2,p,q)\), namely \(\gamma \), such that \(L(\gamma ) \le \Lambda _{W}(\lambda )\).
Since by assumption W is locally Lipschitz continuous, behaves quadratically near the wells, and is bounded away from zero away from the wells (see (H.1)–(H.3), (25)), we have that \(\sqrt{W}\) is also locally Lipschitz continuous. Thus there exists a constant \(\mathrm{Lip}(\sqrt{W}; \lambda )\), which also depends on \(\Lambda _W(\lambda )\), such that
for all \(t \in (-1,1)\). Consequently, using (58), we can estimate
On the other hand, for fixed \(x \in \Omega \) and \(p \in \mathbb {R}^M\) and for every \(q,q' \in B(0,\lambda )\) with \(q \ne q'\), if we let \(\ell _{q,q'} \in \mathcal {A}(q,q')\) be defined as in (29), then we have
where \(\mathcal {W}(\lambda )\) is given as in (57). The rest of the proof follows from similar considerations; we omit the details. \(\square \)
4 Proof of Theorem 1.4
4.1 Compactness
In this section we show that any sequence with bounded energy is precompact in \(L^1(\Omega ;\mathbb {R}^M)\).
Proposition 4.1
Let W be given as in (H.1)–(H.4) and let \(\{u_n\}_{n \in \mathbb {N}} \subset H^1(\Omega ;\mathbb {R}^M)\) be such that
Then, eventually extracting a subsequence (not relabeled), we have that \(u_n \rightarrow u\) in \(L^1(\Omega ;\mathbb {R}^M)\), where \(u \in BV(\Omega ;z_1,\dots ,z_k)\) is such that \(\mathcal {F}_0(u) <\infty \).
Proof
We divide the proof into several steps.
Step 1: In this first step we prove that the sequence \(\{u_n\}_{n \in \mathbb {N}}\) is bounded in \(L^1(\Omega ;\mathbb {R}^M)\) and equi-integrable. The proof is standard, but we report it here for the reader’s convenience. The proof we present is adapted from [11, Proposition 4.1] (see also [42, Theorem 1.6 and Theorem 2.4]). To this end, notice that in view of (H.4) and (59) we have that
Consequently, if we let \(E \subset \Omega \) be a measurable set, then
where in the last step we used (60). In particular, by taking \(E = \Omega \) we obtain that the sequence \(\{u_n\}_{n \in \mathbb {N}}\) is bounded in \(L^1(\Omega ;\mathbb {R}^M)\). Moreveor, for every fixed \(s > 0\), setting
and
as a consequence of (61) we obtain that for every \(n \ge \overline{n}\) and every measurable set \(E \subset \Omega \)
provided that \(\mathcal {L}^N(E) \le t\). This shows that the sequence \(\{u_n\}_{n \in \mathbb {N}}\) is equi-integrable.
Step 2: For R as in (H.4) and using the notation introduced in Remark 3.4, set
and let \(\varphi :[0,\infty ) \rightarrow [0,1]\) be a smooth cut-off function such that \(\varphi (\rho ) = 1\) for \(\rho \le R'\) and \(\varphi (\rho ) = 0\) for \(\rho \ge 2R'\). Let
Then \(W_1 :\Omega \times \mathbb {R}^M \rightarrow [0,\infty )\) satisfies (H.1)–(H.4) and moreover
for every \(x \in \Omega \) and \(z \in \mathbb {R}^M\). For every \(n \in \mathbb {N}\), define the function \(v_n :\Omega \rightarrow \mathbb {R}^M\) via
Notice that \(v_n \in H^1(\Omega ;\mathbb {R}^M) \cap L^{\infty }(\Omega ;\mathbb {R}^M)\) with \(\Vert v_n\Vert _{L^{\infty }(\Omega ;\mathbb {R}^M)} \le 2R'\), and that \(|\nabla v_n(x)| \le |\nabla u_n(x)|\) for \(\mathcal {L}^N\)-a.e. \(x \in \Omega \). We claim that
for \(\mathcal {L}^N\)-a.e. \(x \in \Omega \). Indeed, equality holds for \(\mathcal {L}^N\)-a.e. x such that \(u_n(x) \in B(0,2R')\), while if this is not the case then
Thus, by (59), (63), and (65) we see that
We conclude this step by remarking that \(R'\) (see (62)) is chosen in such a way that
for every \(x \in \Omega \) and every \(i,j \in \{1, \dots , k\}\).
Step 3: For \(i \in \{1, \dots , k\}\) and \(n \in \mathbb {N}\), let \(f_n^i :\Omega \rightarrow \mathbb {R}\) be the function defined via
The purpose of this step is to show that, up to the extraction of a subsequence (which we do not relabel), \(f^{i}_n \rightarrow f^{i}\) in \(L^1(\Omega )\) as \(n \rightarrow \infty \), for some \(f^i \in BV(\Omega )\). To prove the claim, it is enough to show that
where by \(\mathrm{Lip}(d_{W_1}; \lambda )\) we denote the Lipschitz constant of \(d_{W_1}\) on \(\Omega \times B(0,\lambda ) \times B(0,\lambda )\) (see Corollary 3.5). Indeed, (66) implies that
and so it follows from (68) and (69) that
Since, as one can readily check, the sequence \(\{f_n^i\}_{n \in \mathbb {N}}\) is bounded in \(L^1(\Omega )\), (70) yields that it is also bounded in \(W^{1,1}(\Omega )\). In turn, the claim follows by the Rellich–Kondrachov compactness theorem (see, for example, [44, Theorem 14.36]). The rest of the step is devoted to the proof of (68), which is adapted from [11, Proposition 2.1].
By the Meyer–Serrin approximation theorem, for every \(n \in \mathbb {N}\) there exists a sequence \(\{v_{n,k}\}_{k \in \mathbb {N}}\) of functions in \(H^1(\Omega ;\mathbb {R}^M) \cap C^1(\overline{\Omega };\mathbb {R}^M)\) such that
Moreover, eventually passing to a subsequence (which we do not relabel), for every \(n \in \mathbb {N}\) we can find a function \(h_n \in L^1(\Omega )\) such that for \(\mathcal {L}^N\)-a.e. \(x \in \Omega \)
Let \(f_{n,k}^i :\Omega \rightarrow \mathbb {R}\) be defined as
Then \(f_{n,k}^i\) is Lipschitz continuous in \(\Omega \) and therefore differentiable almost everywhere. Observe that for \(x, y \in \Omega \)
so that
Fix \(\tau > 0\), and set \(\Omega _{\tau } {:}{=} \{x \in \Omega : \mathrm{dist}(x,\partial \Omega ) > \tau \}\). Then, if \(x \in \Omega _{\tau }\), \(h \in \mathbb {R}\setminus \{0\}\) is such that \(|h| \le \tau \), and \(\nu \in \mathbb {S}^{N - 1}\), by setting \(y = x + h \nu \) in (73), we obtain
where the constant \(C_1\) is defined as
Let \(\ell _{n,k}^h \in \mathcal {A}(v_{n,k}(x),v_{n,k}(x + h \nu ))\) be a parametrization of the line segment that joins \(v_{n,k}(x)\) and \(v_{n,k}(x + h \nu )\) and notice that
If \(f_{n,k}^i\) is differentiable at \(x \in \Omega _{\tau }\), by (74) and (75), and in view of the continuity of \(W_1\), it follows that by letting \(h \rightarrow 0\) we get
Taking the supremum over all \(\nu \in \mathbb {S}^{N - 1}\) in (76), we obtain that for \(\mathcal {L}^N\)-a.e. \(x \in \Omega \)
In turn, by (71), (72), (77), and Lebesgue’s dominated convergence theorem we see that
as \(k \rightarrow \infty \). Next, using the notation introduced in Remark 3.4 (see (57)), we observe that by (72) and (77) we have that
By the monotone convergence theorem, letting \(\tau \rightarrow 0\) in the previous inequality, we conclude that \(\{f_{n,k}^i\}_{k \in \mathbb {N}}\) is bounded in \(H^1(\Omega ;\mathbb {R}^M)\). Therefore, eventually extracting a subsequence (which we do not relabel), there exists \(g_n^i \in H^1(\Omega ;\mathbb {R}^M)\) such that \(f_{n,k}^i \rightharpoonup g_n^i\) in \(H^1(\Omega ;\mathbb {R}^M)\) and \(f_{n,k}^i \rightarrow g_n^i\) in \(L^2(\Omega ;\mathbb {R}^M)\) as \(k \rightarrow 0\). Since \(f_{n,k}^i \rightarrow f_n^i\) in \(L^2(\Omega ;\mathbb {R}^M)\), we obtain that \(f_n^i\) must coincide with \(g_n^i\) almost everywhere in \(\Omega \), and therefore
Combining (78) and (79), we arrive at (68) with an application of the monotone convergence theorem, letting \(\tau \rightarrow 0\).
Step 4: Without loss of generality, we can assume that \(f_n^i \rightarrow f^i\) pointwise \(\mathcal {L}^N\)-a.e. in \(\Omega \) for every i. Next, let
and set
We claim that eventually extracting a subsequence, \(v_n \rightarrow u\) in \(L^1(\Omega ;\mathbb {R}^M)\). Notice that by Vitali’s convergence theorem, together with the results of Step 1, it is enough to show the existence of a subsequence which converges pointwise almost everywhere to u. To this end, we observe that (66) implies that
Thus, for \(\mathcal {L}^N\)-a.e. \(x \in \Omega \) we have that cluster points of the sequence \(\{v_n(x)\}_{n \in \mathbb {N}}\) belong to the set \(\{z_1(x), \dots , z_k(x)\}\). For \(i \ne j\), let \(B_{ij}\) be the subset of \(\Omega ^i\) consisting of all points x for which there exists a subsequence \(\{v_{n_m}(x)\}_{m \in \mathbb {N}}\) such that \(v_{n_m}(x) \rightarrow z_j(x)\). Arguing by contradiction, assume that \(\mathcal {L}^N(B_{ij}) > 0\) for some j. Then we have
Thus we have arrived at a contradiction. In particular, since \(v_n \rightarrow u\) in \(L^1(\Omega ;\mathbb {R}^M)\), we deduce that
for \(\mathcal {L}^N\)-a.e. \(x \in \Omega \), where in the last equality we have used (67) and (80). Finally, notice that by (60) and (64) we have that
and therefore, eventually extracting a subsequence, \(u_n \rightarrow u\) in \(L^1(\Omega ;\mathbb {R}^M)\) as it was claimed.
Step 5: This step is concerned with the proof of additional regularity properties of the function u, defined in (80). Let us remark that in view of (81), throughout the rest proof we can return to working with the potential W instead of \(W_1\). We begin by showing the following characterization of the jump set of u (see Definition 2.9):
where the sets \(U_{ij}\) are defined via
Notice that
where
so that to prove (82) it is enough to show that \(U_{ij} = V_{ij}\). To this end, fix \(x \in V_{ij}\) and observe that
holds for \(\mathcal {L}^N\)-a.e. \(y \in \Omega \). Consequently, from (81), (84), and (85) we see that
as \(\rho \rightarrow 0^+\). Similarly, one can show that
and thus we conclude that \(V_{ij} \subset U_{ij}\). To prove the reverse inequality, we begin by noticing that in view of (H.2) there exists a constant \(\omega \) such that
In turn, if \(x \in U_{ij}\) (see (83)) and \(\varepsilon > 0\) is given, there exists \(\overline{\rho } > 0\) such that if \(\rho \le \overline{\rho }\) then
Consequently, recalling that \(\Vert z_i\Vert _{L^{\infty }(\Omega ;\mathbb {R}^M)} \le R\) by (H.4), notice that for every such \(\rho \) we have
and therefore
Finally, letting \(\varepsilon \rightarrow 0^+\) yields the desired result. Notice that with similar computations one can show that
Hence \(x \in V_{ij}\), and therefore we have shown that also \(U_{ij} \subset V_{ij}\). The characterization of the jump set in (82) readily follows.
Next, observe that for \(U_{ij}\) defined as in (83), if \(\omega \) is the constant given in (86) then
where in the last equality we have used the fact that \(f^i \in BV(\Omega )\). Combining (80), (82), and (87) we see that \(u \in BV(\Omega ; z_1, \dots , z_k)\). Finally, notice that
This concludes the proof. \(\square \)
4.2 Liminf inequality
The goal of this section is to prove the following result.
Proposition 4.2
Let W be given as in (H.1)–(H.4). For \(u \in L^1(\Omega ;\mathbb {R}^M)\), let \(\{u_n\}_{n \in \mathbb {N}}\) be a sequence of functions in \(H^1(\Omega ;\mathbb {R}^M)\) such that \(u_n \rightarrow u\) in \(L^1(\Omega ;\mathbb {R}^M)\). Then
Proof
We divide the proof into several steps.
Step 1: Eventually extracting a subsequence (which we do not relabel), we can assume without loss of generality that
Consequently we are in a position to apply Proposition 4.1 for n large enough and conclude that \(u \in BV(\Omega ;\{z_1,\dots ,z_k\})\) (see Definition 1.3). In order to prove the liminf inequality we will use the blow-up method of Fonseca and Müller (see [31]). Let us consider the finite positive Radon measures \(\mu _n\in \mathcal {M}^+(\Omega )\) given by
In view of (88) we can further assume that \(\sup \left\{ \mu _n(\Omega ) : n \in \mathbb {N}\right\} < \infty \), and therefore, up to the extraction of a subsequence (which again we do not relabel), there exists a measure \(\mu \in \mathcal {M}^+(\Omega )\) such that \(\mu _n{\mathop {\rightharpoonup }\limits ^{*}}\mu \) (see Theorem 2.2). Let and let \(x_0 \in J_u\) be such that
Recall that (89) holds for \(\mathcal {H}^{N-1}\)-a.e. \(x_0\in J_u\). Let \(Q_\nu \subset \mathbb {R}^N\) be a unit cube centered at the origin with two faces orthogonal to \(\nu {:}{=} \nu _u(x_0)\), where the direction \(\nu _u(x_0)\) is given as in Definition 2.9, and for \(\rho > 0\) write \(Q(x_0,\nu , \rho ) {:}{=} x_0 + \rho Q_\nu \). Let \(b_1, \dots , b_{N - 1} \in \mathbb {R}^N\) be such that \(\{b_1, \dots , b_{N - 1}, \nu \}\) is an orthonormal bases for \(\mathbb {R}^N\). Then, for every point \(x \in \mathbb {R}^N\) there are constants \(y_1, \dots , y_{N - 1}, t \in \mathbb {R}\) such that
In the following we identify x with \((x', t)\), where \(x' \in \mathbb {R}^{N - 1}\) denotes the vector \((y_1, \dots , y_{N - 1})\). Let
and notice that with this notation at hand we can write
Moreover, we define
Since by assumption \(x_0 \in J_u\), there are two indices \(1 \le i_1 < i_2 \le k\) such that for every \(\varepsilon > 0\) there exists \(\overline{\rho } = \overline{\rho }(\varepsilon ) > 0\) with the property that for every \(\rho \le \overline{\rho }\)
In particular, if we let \(\Omega ^j {:}{=} \{x : u(x) = z_j(x)\}\), we see that
Notice that we can choose \(\overline{\rho }\) in such a way that it also satisfies
Combining (92) and (93) we obtain that for every \(j \ne i_1\)
where
In view of (94), there exists a measurable set \(E^+_j(\rho ) \subset (0,\rho /2)\) with
such that
Indeed, if we assume that (96) does not hold, then for every \(E \subset (0,\rho /2)\) with \(\mathcal {L}^1(E) = (2k - 3)\rho /4(k-1)\) we would have
Let \(E^+_{\rho } {:}{=} \bigcap _{j \ne i_1}E^+_j(\rho )\), and notice that by De Morgan’s law and (95)
A similar argument yields the existence of a set \(E^-_{\rho } \subset (-\rho /2,0)\) with the property that \(\mathcal {L}^1(E^-_{\rho }) \ge \rho /4\) and such the analogous statement to (96) holds for every \(j \ne i_2\). Thus, by letting \(\varepsilon = 1/m\) we can find two decreasing sequences \(\{\rho _m\}_{m \in \mathbb {N}}\) and \(\{r_m^+\}_{m \in \mathbb {N}}\) and an increasing sequence \(\{r_m^-\}_{m \in \mathbb {N}}\) such that
and with the property that \(\mu (\partial R_m(x_0,\nu )) = 0\) (see Remark 2.4), where
Moreover, eventually extracting a subsequence (not relabeled), we can assume that for \(\mathcal {L}^{N-1}\)-a.e. \(x' \in Q'(x_0,\nu ,\rho _m)\) it holds
and
Notice in particular that there are constants C, c such that
Therefore, by Besicovitch’s derivation theorem (see Theorem 2.7), Lemma 2.3, and (89) we obtain that
Thus, to prove the liminf inequality it is enough to show that
Step 2: Using the notation introduced in the previous step, we begin by noticing that
where
and \(\gamma _{m,n}(x',t) {:}{=} u_n(x',g_m(t))\). Then, if we set
we obtain
where for \(p,q \in \mathbb {R}^M\)
By (97), the continuity of \(d_{F_m}\), and Fatou’s lemma, we see that
Let
and notice that in view of (98) we can write
where \(\mathcal {L}^{N - 1}(T_m) = 0\). Observe that as a consequence of (100) and (102) we have
where C is a constant that only depends on W, \(z_i\), and \(z_j\). Recalling the definition of \(r_m^{\pm }\), if \(i \ne i_1\) we have
Similar computations hold if \(j \ne i_2\). Consequently, combining (103)–(105) we arrive at
for some positive constant C. Using the fact that for every \(p,q,z \in \mathbb {R}^M\)
and the continuity of \(d_W(x_0,\cdot ,\cdot )\), we obtain
Combining (106) and (107), we see that
Therefore, in order to prove (99) it is enough to show that
Step 3: This step is dedicated to the proof of (108). We claim that the sequence \(\{F_m\}_{m \in \mathbb {N}}\) converges uniformly to \(2\sqrt{W(x_0,\cdot )}\) on every compact subset of \(\mathbb {R}^M\). Indeed, as one can readily check, the map \(z \mapsto F_m(z)\) is continuous for every m, while the map \(m \mapsto F_m(z)\) is nondecreasing for every z. To conclude, notice that \(F_m(z) \rightarrow 2\sqrt{W(x_0,z)}\) for every \(z \in \mathbb {R}^M\) as \(m \rightarrow \infty \). Since the map \(z \mapsto 2\sqrt{W(x_0,z)}\) is continuous by assumption (see (H.1)), we are in a position to apply Dini’s convergence theorem. This proves the claim.
Let \(p {:}{=} z_{i_1}(x_0)\), \(q {:}{=} z_{i_2}(x_0)\) and, using the notation introduced in Proposition 3.2, notice that (see (H.2); see also Remark 3.3) there exists \(m_1\) such that if \(m \ge m_1\) then for every \(i \ne j\)
Thus, if \(m \ge m_1\), we are in a position to apply the results of Proposition 3.2 and Remark 3.4 and conclude that there exists a minimizing geodesic for \(d_{F_m}(p,q)\), namely \(\gamma _m \in \mathcal {A}(p,q)\), such that if we set
then
Consequently, given \(\varepsilon > 0\), we can find \(m_2 \ge m_1\) with the property that for every \(m \ge m_2\)
for every \(z \in B(0, \Lambda (\lambda , \mathcal {R}_{m_1}(x_0,\nu )) + \lambda )\). Then, from (109) and (110) we see that
Letting \(m \rightarrow \infty \) in the previous inequality we obtain
The desired inequality (108) follows immediately by letting \(\varepsilon \rightarrow 0^+\). \(\square \)
4.3 Limsup inequality
This section is devoted to proving the following result, which concludes the proof of Theorem 1.4.
Proposition 4.3
Let W be as is (H.1)–(H.4). Then, for every \(u \in BV(\Omega ; z_1, \dots , z_k)\) there exists a sequence \(\{u_n\}_{n \in \mathbb {N}}\) of functions in \(H^1(\Omega ; \mathbb {R}^M)\) such that \(u_n \rightarrow u\) in \(L^1(\Omega ; \mathbb {R}^M)\) and \(\mathcal {F}_n(u_n) \rightarrow \mathcal {F}_0(u)\).
We split the proof in three lemmas. In the first result presented below, we exhibit a sequence \(\{u_n\}_{n \in \mathbb {N}}\) of functions with polyhedral jump set which approximates the function u and furthermore satisfies \(\mathcal {F}_0(u_n) \rightarrow \mathcal {F}_0(u)\). This is a straightforward consequence of a classical result due to Baldo (see [11, Lemma 3.1]). In the following we say that \(U \subset \Omega \) is a polyhedral set if \(\partial U \cap \Omega \) is contained in a finite union of hyperplanes.
Lemma 4.4
Under the assumptions of Proposition 4.3, let \(u\in BV(\Omega ;z_1,\dots ,z_k)\). Then there exists a sequence \(\{u_n\}_{n \in \mathbb {N}}\) of functions in \(BV(\Omega ; z_1, \dots , z_k)\) of the form
where \(\Omega ^i_n\) is a polyhedral set for each \(i \in \{1,\dots ,k\}\) and each \(n \in \mathbb {N}\), such that \(u_n \rightarrow u\) in \(L^1(\Omega ; \mathbb {R}^M)\) and \(\mathcal {F}_0(u_n)\rightarrow \mathcal {F}_0(u)\).
Proof
Write \(u = \sum _{i=1}^k z_i\mathbb {1}_{\Omega ^i}\), where \(\{\Omega ^i\}_{i=1}^k\) is a Caccioppoli partition of \(\Omega \) (see Definition 2.17 and Remark 2.18). Thanks to [11, Proposition A.2] and as a consequence of the proof of [11, Lemma A.7], it is possible to find a sequence \(\{\{\Omega ^i_n\}_{i=1}^k\}_{n \in \mathbb {N}}\) of Caccioppoli partitions of \(\Omega \) with the following properties:
- (i):
-
\(\Omega ^i_n\) is a polyhedral set;
- (ii):
-
\(\mathbb {1}_{\Omega ^i_n}\rightarrow \mathbb {1}_{\Omega ^i}\) in \(L^1(\Omega )\);
- (iii):
-
\(\mu ^{ij}_n\) converges in \((C_b(\Omega ))'\) to \(\mu ^{ij}\)(see Definition 2.5), where for each \(i\ne j\in \{1,\dots ,k\}\), and \(n \in \mathbb {N}\) the measures \(\mu ^{ij}_n, \mu ^{ij} \in \mathcal {M}^+(\Omega )\) are given by
Since \(d_W\in C_b(\Omega )\) (see Corollary 3.5) and recalling that the \(z_i\)’s are bounded we get
where in the previous to last step we used property (iii) above. \(\square \)
A key ingredient in the proof of the limsup inequality is a reparametrization due to Modica (see [49, Proposition 2], and [11, Lemma 3.2]). Being crucial to our construction of the recovery sequence, we present the proof for the reader’s convenience.
Lemma 4.5
Under the assumptions of Proposition 4.3, fix \(\lambda >0\), \(\varepsilon >0\), \(x\in \Omega \), and \(p,q\in \mathbb {R}^M\). Let \(\gamma \in \mathcal {A}(p,q)\) be the parametrization of a curve of class \(C^1\) with \(\gamma '(s) \ne 0\) for all \(s \in (-1,1)\). Then there exist \(\tau >0\) with
and \(g \in C^1((0,\tau ); [-1,1])\) such that
for all \(t\in (0,\tau )\), \(g(0)=-1\), \(g(\tau )=1\), and
Proof
Define
and set \(\tau {:}{=}\Psi (1)\). Then \(\Psi \) is strictly increasing, and its inverse \(g :[0,\tau ] \rightarrow [-1,1]\) is of class \(C^1\) in \((0,\tau )\) and satisfies (112) for all \(t \in (0,\tau )\). Moreover,
Finally, by (112), a change of variables, and the subadditivity of the square root function we get
and (113) follows, thus concluding the proof. \(\square \)
We now turn to the technical construction of the recovery sequence.
Lemma 4.6
Under the assumptions of Proposition 4.3, let \(u \in BV(\Omega ; z_1,\dots ,z_k)\) be such that \(u = \sum _{i=1}^k z_i \mathbb {1}_{\Omega ^i}\), where, for each \(i=1,\dots ,k\), \(\partial \Omega ^i \cap \Omega \) is a polyhedral set. Then there exists a sequence \(\{u_n\}_{n \in \mathbb {N}}\) of functions in \(H^1(\Omega ; \mathbb {R}^M)\) such that \(u_n \rightarrow u\) in \(L^1(\Omega ; \mathbb {R}^M)\) and \(\mathcal {F}_n(u_n) \rightarrow \mathcal {F}_0(u)\) as \(n\rightarrow \infty \).
Proof
Throughout the proof we denote by C a positive constant independent of n, which could possibly differ from line to line. We divide the proof into several steps.
Step 1: (definitions of the main objects). For \(n\in \mathbb {N}\) define
Thanks to Proposition 3.2 and Remark 3.4, there exists \(\overline{L}>0\) such that for every \(x \in \Omega \) and \(i\ne j\in \{1,\dots ,k\}\) it is possible to find a minimizing geodesic \(\gamma \in \mathcal {A}(z_i(x),z_j(x))\) for \(d_W(x, z_i(x), z_j(x))\) satisfying
For each \(n\in \mathbb {N}\) set
For each \(i\in \{1,\dots ,k\}\) write
where \(p_i\in \mathbb {N}\), the \(\Sigma ^i_m\)’s are pairwise disjoint, and, for each \(m\in \{1,\dots ,p_i\}\), \(\Sigma ^i_m\) is contained in a hyperplane with normal \(\nu ^i_m \in \mathbb {S}^{N-1}\). Define the singular set of the partition \(\{\Omega ^i\}_{i = 1}^k\) as
For \(i \in \{1, \dots , k\}\) and \(m \in \{1, \dots , p_i\}\) set
where \(\mathcal {N}_{\Theta \varepsilon _n}(S) {:}{=} \{ x \in \mathbb {R}^N : |x-y| < \Theta \varepsilon _n \text { for some } y \in S \}\). Using the fact that \(\partial \Omega \) is Lipschitz, it is possible to find \(\overline{n}\in \mathbb {N}\) and \(\Theta >0\) such that for all \(n\ge \overline{n}\), \(i \in \{1, \dots , k\}\) and \(m\in \{1,\dots ,p_i\}\) it holds that \(A^n_{m,i} \subset \Omega \), and furthermore that either \(A^n_{m,i} = A^n_{s,j}\), or
for all \(n\ge \overline{n}\), \(i,j\in \{1,\dots ,k\}\), \(m\in \{1,\dots ,p_i\}\), and \(s\in \{1,\dots ,p_j\}\) with \(m\ne s\) if \(i = j\) (see Fig. 3). Without loss of generality, up to increasing the value of \(\overline{n}\), we can assume that for all \(i\in \{1,\dots ,k\}\) and \(m\in \{1,\dots ,p_i\}\) there exists only two different indexes \(j_1,j_2\in \{1,\dots ,k\}\) such that
and
For \(n\in \mathbb {N}\) set \(S_n{:}{=}\mathcal {N}_{\Theta \varepsilon _n}(S)\), and notice that
for some constant \(D>0\) depending only on \(\mathcal {H}^{N-2}(S)\).
Step 2: (definition of \(u_n\) close to \(\Sigma ^i_m\)). Let us first consider \(\Omega ^1\setminus S_n\). Fix \(m \in \{1,\dots , p_1\}\). Without loss of generality we can assume that \(\Sigma ^1_m\subset \{x_N=0\}\) and that the normal vector \(\nu ^1_m = e_N\) points outside \(\Omega ^1\). In the following we write points \(x \in \mathbb {R}^N\) as \(x = (y,t)\) where \(y \in \mathbb {R}^{N-1}\) and \(t \in \mathbb {R}\). For each \(n \in \mathbb {N}\), consider the sets
where \(\overline{y_j}\) ranges among the elements of \(\frac{r_n}{2}\mathbb {Z}^{N-1}\times \{0\}\). In the following we will consider only the indices j for which \(U_n^j \ne \emptyset \). Let \(\varphi _n^j\in C_c^\infty (\mathbb {R}^{N-1})\) be a function such that
where C is a positive constant independent of n, and we used the notation in (90). For every such index j, let \(\gamma ^j\in \mathcal {A}(u^-(\overline{y_j}), u^+(\overline{y_j}))\) be a minimizing geodesic for \(d_W(\overline{y_j}, u^-(\overline{y_j}), u^+( \overline{y_j}))\) such that \(L(\gamma ^j) < \overline{L}\), where \(\overline{L}\) is the constant given in (115). Then it is possible to find a sequence \(\{\gamma ^j_n\}_{n\in \mathbb {N}}\) of \(C^1\) curves in \(\mathcal {A}(u^-(\overline{y_j}), u^+(\overline{y_j}))\) with \((\gamma ^j_n)'(s)\ne 0\) for all \(s\in (-1,1)\) and such that
Let \(\tau ^j_n\in (0,\ell _n)\) and \(g^j_n\in C^1((0,\tau ^j_n))\) be given by Lemma 4.5 corresponding to the choice of
Extend \(g^j_n\) to the whole interval \((0,\infty )\) by setting \(g^j_n(s) {:}{=} 1\) for \(s \ge \tau ^j_n\). For each j, let \(s_j,q_j\in \{1,\dots ,k\}\) be such that \(u^-(\overline{y_j})=z_{s_j}(\overline{y_j})\), and \(u^+(\overline{y_j})=z_{q_j}(\overline{y_j})\). We then define the function \(u_n^j\) as follows:
- (i):
-
for \(y \in U_n^j\) and \(t \in (0, \ell _n)\) set
$$\begin{aligned} u^j_n(y,t) {:}{=} \gamma ^j_n(g^j_n(t)); \end{aligned}$$ - (ii):
-
for \(y \in U_n^j\) and \(t \in (\ell _n, \ell _n+\varepsilon _n)\) set
$$\begin{aligned} u^j_n(y,t) {:}{=} z_{q_j}(\overline{y_j}) + \frac{t - \ell _n}{\varepsilon _n} \left[ z_{q_j} \left( y, \ell _n+\varepsilon _n \right) - z_{q_j}(\overline{y_j}) \right] ; \end{aligned}$$ - (iii):
-
for \(y \in U_n^j\) and \(t \in (-\varepsilon _n,0)\) set
$$\begin{aligned} u^j_n(y,t) {:}{=} z_{s_j}(\overline{y_j}) - \frac{t}{\varepsilon _n} \left[ z_{s_j} \left( y, -\varepsilon _n \right) - z_{s_j}(\overline{y_j}) \right] . \end{aligned}$$
Then, for \((y,0) \in \Sigma ^1_m \setminus S_n\) and \(t \in (-\varepsilon _n, \varepsilon _n + \ell _n)\) define
Notice that, thanks to (118), \(u_n(y,t) = u_n^j(y,t)\) for \(y \in Q'(\overline{y_j},e_N, r'_n)\), and the number of non-zero terms in the sum is bounded above by \(3^N\). We repeat the same construction for all \(m \in \{1, \dots , p_1\}\), and notice that thanks to (116) the functions \(u_n\) are well defined, as the constructions in this step do not overlap.
Next, for
define \(u_n(x) {:}{=} z_1(x)\).
Finally, we repeat the same argument for \(i = 2, \dots , k\) in the sets
assuming that the normal \(\nu ^j_m\) points outside the set \(\Omega ^j\).
Step 3: (estimate of the Lipschitz constant of \(u_n\) close to \(\Sigma ^i_m\)). Reasoning as in the previous step, we assume without loss of generality that \(\Sigma ^i_m\subset \{x_N=0\}\). Using the fact that the number of non-zero terms in the definition of \(u_n\) (see (121)) is bounded above by \(3^N\), we have that
where in the second step we used (118) together with the fact that, by construction, and using (115) it holds
for some constant \(C>0\) independent of n. In order to estimate the second term in (122) we reason as follows. Let \(y_1, y_2\in U^j_n\) and \(t_1,t_2\in (0,\ell _n)\). Using (112), our choice of \(\varepsilon \) and \(\lambda \), and the fact that minimizing geodesics are uniformly bounded, we get
Let \(y_1, y_2\in U^j_n\) and \(t_1,t_2\in [\ell _n, \ell _n+\varepsilon _n)\). Then we can estimate
Similar computations show that, if \(y_1,y_2 \in U^j_n\) and \(t_1, t_2 \in (-\varepsilon _n,0)\), we get
Step 4: (definition of \(u_n\) in \(\Omega \)). We are now in position to define \(u_n\) in the whole \(\Omega \). Using the estimates (122), (124), (125), and (126), Kirszbraun’s theorem ensures that it is possible to extend \(u_n\) to a Lipschitz function, still denoted by \(u_n\), defined in the whole \(\Omega \) in such a way that the Lipschitz constant of the extension is controlled by the Lipschitz constant of the original function \(u_n\). It is immediate to verify that \(u_n \in H^1(\Omega ;\mathbb {R}^M)\) and that \(u_n \rightarrow u\) in \(L^1(\Omega ;\mathbb {R}^M)\) as \(n \rightarrow \infty \).
Step 5: (estimate of the energy). We claim that \(\mathcal {F}_n(u_n)\rightarrow \mathcal {F}_0(u)\) as \(n\rightarrow \infty \). To show this, we split \(\Omega \) into several pieces and compute the asymptotic behavior of the energy \(\mathcal {F}_n\) in each of them. For \(i\in \{1,\dots ,k\}\), let \(\widetilde{\Omega }^i_n\) be the set where \(u_n\equiv z_i\). Then
since \(W(x, u_n(x)) = 0\) for \(x \in \widetilde{\Omega }^i\), and \(u_n\) has uniformly bounded gradient in that region.
Fix one connected component \(\Sigma \) of \((J_u \cap \Omega ) \setminus S_n\). Without loss of generality, we can assume \(\Sigma \subset \{x_N = 0\}\). We first consider the transition region
which we split in three parts. Let us start with
Notice that
Indeed, the number of \((N-1)\)-dimensional cubes we consider is of the order of \(r_n^{1-N}\), for each of which we are integrating over a volume of the order of \(\varepsilon _n (r_n-r'_n)^{N-1}\). Since \(\Vert u_n\Vert _{L^\infty (\Omega ;\mathbb {R}^M)} \le C\) we have
for all \(x\in I_n\). Using (122), (126), (128), and (129) we get
as \(n\rightarrow \infty \), where in the previous to last step we used (114).
Next, we prove that the energy contribution in the region
is asymptotically negligible. We do this by noticing that
and that, in view of (124) and (129), we have that
as \(n\rightarrow \infty \). Finally, we consider
and notice that
In turn, by (125) and with similar computations to the ones in (130), we get
Therefore, from (130) –(132) we deduce that
We now consider the region
and observe that
Let \(i \in \{1,\dots ,k\}\) be such that \(U_n \subset \Omega ^i\). Then, for \(x \in Q'(\overline{y_j}, e_N, r'_n)\times \left( \ell _n, \ell _n+\varepsilon _n\right) \) it holds
where in the previous to last step we used the triangle inequality together with fact that \(u_n\) is a linear interpolation between \(z_i (\overline{y_j})\) and \(z_i(y,\ell _n+\varepsilon _n)\). Here \(K > 0\) is such that \(u_n(x), z_i(x) \in B(0,K)\) for all \(x \in Q'(\overline{y_j}, e_N, r'_n)\times \left( \ell _n, \ell _n+\varepsilon _n\right) \). Therefore, from (125) and (135) we get
as \(n\rightarrow \infty \), where the last step follows from (114) with analogous computations as those we used to deduce (130).
With a similar argument it is possible to show that
where
Moreover, using (117), (122), and (123) we also get
Finally, we prove that
where
Thanks to (133), we can equivalently consider the region
Moreover, the Lipschitz continuity of W gives
since \(N\ge 2\). Therefore, using the fact that for every j and every n
we see that
where in the last step we used the fact that the function \(x \mapsto d_W(x, u^+(x), u^-(x))\) is continuous on \(\Sigma \) (see Corollary 3.5), while the previous to last step follows from (120) together with the result of Lemma 4.5.
Finally, since the number of connected components of \((J_u\cap \Omega )\setminus S_n\) is finite, by (127), (133), (136)–(139) we conclude. \(\square \)
We are now in position to prove the limsup inequality.
Proof of Proposition 4.3
For \(u\in BV(\Omega ;z_1,\dots ,z_k)\), let \(\{v_n\}_{n\in \mathbb {N}}\) be the sequence of functions in \(BV(\Omega ;z_1,\dots ,z_k)\) provided by Lemma 4.4. In particular, recall that each \(v_n\) can be written as
where each \(\Omega ^i_n\) is a polyhedral set, that \(v_n \rightarrow u\) in \(L^1(\Omega ;\mathbb {R}^M)\), and that
By an application of Lemma 4.6, for each \(n \in \mathbb {N}\) there exists a sequence \(\{w^n_m\}_{m \in \mathbb {N}}\) of functions in \(H^1(\Omega ;\mathbb {R}^M)\) such that \(w^n_m \rightarrow v_n\) in \(L^1(\Omega ;\mathbb {R}^M)\) and
Therefore, by using a diagonalization argument, it is possible to find a sequence \(\{m_n\}_{n\in \mathbb {N}}\) such that the sequence \(\{u_n\}_{n\in \mathbb {N}}\) of functions in \( H^1(\Omega ;\mathbb {R}^M)\) defined as \(u_n {:}{=} w^n_{m_n}\) is such that \(u_n \rightarrow u\) in \(L^1(\Omega ;R^M)\) and
This concludes the proof. \(\square \)
5 Proofs of the variants of the main results
In this section we discuss how to suitably modify our arguments to obtain the proofs for the several variants we consider. In the following, in order to keep the notation as simple as possible, the value of the constant \(C > 0\) might change from one instance to another.
5.1 Proof of Theorem 1.5
To prove the compactness result, we notice that
for all \(u \in L^1(\Omega ; \mathbb {R}^M)\). Therefore, for any sequence \(\{u_n\}_{n \in \mathbb {N}} \subset L^1(\Omega ; \mathbb {R}^M)\), we have that
By an application of Proposition 4.1, we get that up to the extraction of a subsequence (which we do not relabel) \(u_n \rightarrow u\) in \(L^1(\Omega ; \mathbb {R}^M)\), for some \(u \in BV(\Omega ; z_1, \dots , z_k)\) with \(\mathcal {F}_0(u) < \infty \). Since
we also deduce that \(\mathcal {F}_0^{\mathcal {M}}(u) < \infty \).
Since the proof of the liminf inequality remains unchanged, we omit the details. Next, we discuss how to construct the recovery sequence. Our approach is inspired by that of [39]. To be precise, let \(u \in BV(\Omega ; z_1, \dots , z_k)\) with
and let \(\{w_n\}_{n \in \mathbb {N}} \subset BV(\Omega ; z_1, \dots , z_k)\) be the sequence provided by Lemma 4.4. Without loss of generality, we can assume that
For each \(n\in \mathbb {N}\), by applying Lemma 4.6 to the function \(w_n\), it is possible to find \(v_n \in H^1(\Omega ; \mathbb {R}^M)\) such that
and such that (see Step 1 and 2 in the proof of Lemma 4.6)
for some constant \(C > 0\) independent of n, where \(E_n {:}{=} \{ x \in \Omega : v_n(x) \ne w_n(x) \}\). Moreover, for \(n \in \mathbb {N}\), let
and define
Notice that \(\widetilde{v}_n\) satisfies the mass constraint, i.e.,
Notice also that in view of (142) and (144) we get
and so \(\Vert \widetilde{v}_n - w_n \Vert _{L^1(\Omega ;\mathbb {R}^M)}\le C \varepsilon _n^{\frac{5}{8}}\). Moreover, recalling that \(p \mapsto W(x,p)\) is smooth for \(p \in B(z_i(x), r)\) for all \(i\in \{1, \dots , k\}\), that \(\nabla _p W(x,z_i(x)) = 0\) for all \(x \in \Omega \), and since \(\Vert v_n\Vert _{L^\infty (\Omega ; \mathbb {R}^M)} \le C\) for some constant \(C > 0\) independent of n, an application of Taylor’s formula yields
Here \(|\xi _n(x) - v_n(x)| \le |\eta _n|\) for all \(x\in \Omega \). In view of the result of Lemma 4.4 together with (143)–(145) we get
This concludes the proof. \(\square \)
Remark 5.1
It would be interesting to understand if the mass constraint can play a role in weakening the assumptions on W. In this case, one might conjecture that the strategy implemented in [43] would allow one to drop the assumption (H.4). It is not immediately clear whether this is possible since (H.4) plays a crucial role in obtaining the pivotal estimate (70).
Remark 5.2
For the proof of Theorem 1.5, we opted to use a different approach than the one proposed in [11]. The reason for this is that, to the best of our understanding, the argument in [11] has a flaw that we were not able to correct.
5.2 Proof of Theorem 1.7
In the proof of the compactness result we only need to modify the definition of \(R'\) in (62) as follows:
This is done in order to get (67) and
for all \(i \in \{1,\dots ,k\}\) and \(x \in \partial \Omega \).
The liminf inequality is obtained as follows. Let \(u \in BV(\Omega ; z_1, \dots , z_k)\). Let \(\widetilde{\Omega } \supset \overline{\Omega }\) be an open bounded set with Lipschitz continuous boundary and define the function \(\widetilde{u} \in BV(\widetilde{\Omega }; \mathbb {R}^M)\) as
where \(\widetilde{g} \in \mathrm{Lip}(\widetilde{\Omega } \setminus \Omega ; \mathbb {R}^M)\) is such that \(\widetilde{g} = g\) on \(\partial \Omega \). We then conclude by repeating the argument in the proof of Proposition 4.2 to the function \(\widetilde{u}\).
The only change we need to make to the construction of the recovery sequence is in Lemma 4.6. Let \(u \in BV(\Omega ; z_1, \dots , z_k)\) be as in the statement of Lemma 4.6, and define \(\widetilde{u} \in BV(\widetilde{\Omega }; \mathbb {R}^M)\) as in (146). Extend W and \(z_1, \dots , z_k\) to locally Lipschitz functions defined in \(\widetilde{\Omega } \times \mathbb {R}^M \times \mathbb {R}^M\) and \(\widetilde{\Omega }\) respectively. Set \(z_{k+1} {:}{=} \widetilde{g}\). By applying the construction in the proof of Lemma 4.6 to \(\widetilde{u}\) in \(\widetilde{\Omega }\), we obtain a sequence \(\{\widetilde{u}_n\}_{n \in \mathbb {N}} \subset H^1(\widetilde{\Omega }; \mathbb {R}^M)\) such that
Moreover, by looking at the energy estimates (127), (133), (136)–(139), we get
This concludes the proof. \(\square \)
5.3 Proof of Theorem 1.9
Since the proof of the compactness result remains essentially unchanged, we omit the details. We only notice that since in (86) one can have \(\omega = 0\), we do not recover (87), and thus we cannot in general conclude that \(u \in BV(\Omega ; z_1, \dots , z_k)\).
Recall that in view of Remark 1.8, if \(u \in L^1(\Omega ,\mathbb {R}^M)\) is such that \(u(x) \in \{z_1(x), \dots , z_k(x)\}\) for \(\mathcal {L}^N\)-a.e. \(x\in \Omega \), the collection of sets \(\left\{ \{x \in \Omega : u(x) = z_i(x)\}\right\} _{i = 1}^k\) does not necessarily give a partition of \(\Omega \). On the other hand, as one can readily check, it is possible to find sets \(\Omega _1, \dots , \Omega _k\) which are measurable and pairwise disjoint and with the property that \(u(x) = z_i(x)\) for \(\mathcal {L}^N\)-a.e. \(x \in \Omega ^i\). In particular, this ensures that it is possible to write
almost everywhere in \(\Omega \), where \(\{\Omega ^i\}_{i=1}^k\) is a partition of \(\Omega \) in sets that are not necessarily of finite perimeter.
The proof of the liminf inequality follows closely that of Proposition 4.2; the only changes required are described here in detail for the reader’s convenience. For \(x_0 \in J_u\) as in (89), let \(i_1, i_2 \in \{1, \dots , k\}\) be such that for every \(\varepsilon > 0\) there exists \(\bar{\rho }\) with the property that (91) holds for every \(\rho \le \bar{\rho }\). Set
With these notations at hand, reasoning as in (92) we arrive at
where the sets \(\Omega ^j\) are defined as in (147) and \(\delta _1(x_0) {:}{=} \min \left\{ |z_{i_1}(x_0) - z_j(x_0)| : j \notin \mathcal {J}_1 \right\} \). Arguing as in the proof of Proposition 4.2 with (148) in place of (92) yields the following analogue to (106):
Notice that in view of (147) the sets \(\{Q_m'(i,j)\}_{i,j = 1}^k\) are pairwise disjoint. Moreover, since for every \(i \in \mathcal {J}_1\) and every \(j \in \mathcal {J}_2\) it holds
the rest of the proof follows without changes (see (107)).
In the proof of the limsup inequality we need an additional approximation before applying Lemma 4.4 since, as priviously discussed in Remark 1.8, functions \(u \in L^1(\Omega ;\mathbb {R}^M)\) with \(\widetilde{\mathcal {F}_0}(u) < \infty \) can be such that \(\mathcal {H}^{N-1}(J_u) = \infty \). The following result is adapted from [14, Lemma 4.3].
Lemma 5.3
Let \(u \in L^1(\Omega ;\mathbb {R}^M)\) with \(\widetilde{\mathcal {F}_0}(u) < \infty \) and \(u(x)\in \{z_1(x), \dots , z_k(x)\}\) for \(\mathcal {L}^N\)-a.e. \(x\in \Omega \). Then there exits a sequence \(\{u_n\}_{n \in \mathbb {N}} \subset BV(\Omega ; z_1, \dots , z_k)\) with
where \(\{\Omega ^i_n\}_{i=1}^k\) is Caccioppoli partition of \(\Omega \), such that \(u_n \rightarrow u\) in \(L^1(\Omega ;\mathbb {R}^M)\) and
Proof
We divide the proof into two steps.
Step 1: Let \(\{\Omega _i\}_{i = 1}^k\) be a partition of \(\Omega \) such that (147) holds, and notice that we can assume without loss of generality that for every \(i \in \{1, \dots , k\}\)
For \(i = 1, \dots , k - 1\) and \(j = i + 1, \dots , k\) consider the Lipschitz function \(\gamma _{ij} :\Omega \rightarrow [0,\infty )\) given by \(\gamma _{ij}(x){:}{=} |z_i(x) - z_j(x)|\) and, for \(t > 0\), set
The co-area formula (see [6, Theorem 2.93]) gives us
from which we infer that the function \(t \mapsto \mathcal {H}^{N-1}(\partial ^* E^{t}_{ij})\) belongs to the space \(L^1((0,\infty ))\). In turn, it is possible to find a sequence \(\{t_n\}_{n \in \mathbb {N}} \subset (0, \infty )\) with \(t_n \searrow 0\) such that, for all \(i = 1, \dots , k - 1\) and \(j = i + 1, \dots , k\), we have
and
For \(i\in \{1,\dots ,k\}\) and \(n\in \mathbb {N}\) set \(E^{t_n}_{ii}{:}{=}\emptyset \). For \(n \in \mathbb {N}\) define
and, for \(i=2,\dots ,k\),
Note that \(\{\Omega ^i_n\}_{i=1}^k\) is a partition of \(\Omega \). We claim that the sets \(\Omega ^i_n\) are of finite perimeter in \(\Omega \). To prove this, for each \(n \in \mathbb {N}\), let \(\omega _n > 0\) be such that
for all \(i\ne j\). For \(i \in \{1, \dots , k\}\) set \(J^i_u {:}{=} \{ x\in J_u : u^-(x)=z_i(x) \}\) and note that
where in the last step we used the fact that \(u(x)\in \{ z_1(x),\dots , z_k(x) \}\) for \(\mathcal {L}^N\)-a.e. \(x\in \Omega \). Using the fact that each of the sets \(E^{t_n}_{ij}\) has finite perimeter, and recalling that by assumption \(\widetilde{\mathcal {F}_0}(u) < \infty \), we conclude.
Step 2: Define
Since the sets \(\Omega ^i_n\) are of finite perimeter in \(\Omega \), it follows from [6, Theorem 3.84] that \(u_n \in BV(\Omega ; z_1, \dots , z_k)\), and that it is possible to write
Using the fact that \(\Vert z_i\Vert _{L^\infty (\Omega ;\mathbb {R}^M)} \le C < \infty \), we deduce that
Moreover, by (57) we infer that there exists \(C<\infty \) such that
for all \(x \in \Omega \) and all \(i \ne j\). Using (151), (153), and (154), we obtain
where \(E^{t_n} {:}{=} \bigcup _{i\ne j} E^{t_n}_{ij}\). This concludes the proof. \(\square \)
The rest of the proof of Theorem 1.9 follows by a diagonal argument without additional changes; therefore we omit the details. \(\square \)
5.4 Final remarks
We conclude this section by remarking that Proposition 4.2 and Proposition 4.3 can be proved independently of Proposition 4.1. Indeed, we only invoke our compactness result in Proposition 4.2 to deduce that for a given \(u \in L^1(\Omega ;\mathbb {R}^M)\) the following hold:
- (i):
-
the jump set \(J_u\) is countably \(\mathcal {H}^{N-1}\)-rectifiable;
- (ii):
-
if \(\{u_n\}_{n\in \mathbb {N}} \subset H^1(\Omega ;\mathbb {R}^M)\) with \(\sup _{n\in \mathbb {N}} \mathcal {F}_n(u_n)<\infty \) is such that \(u_n \rightarrow u\) in \(L^1(\Omega ;\mathbb {R}^M)\), then \(u(x) \in \{z_1(x),\dots ,z_k(x)\}\) for \(\mathcal {L}^N\)-a.e. \(x\in \Omega \).
Notice that, in view of the recent result [26], property (i) holds for all \(u \in L^1_{\mathrm{loc}}(\Omega ; \mathbb {R}^M)\), while (ii) is readily derived reasoning as follows: without loss of generality, up to the extraction of a subsequence, we can assume that \(u_n(x) \rightarrow u(x)\) for \(\mathcal {L}^N\)-a.e. \(x \in \Omega \). Let
and, arguing by contradiction, suppose that \(\mathcal {L}^N(A) > 0\). Then Fatou’s lemma gives
where \(C > 0\) is independent of n. Consequently, we see that assumption (H.4) is used only to deduce (25), which, together with (H.1)–(H.3), is needed to ensure that we are in a position to apply Proposition 3.2 and Corollary 3.5. In conclusion, the \(\Gamma \)-convergence results of Sect. 1.2 hold for a potential W satisfying (H.1)–(H.3) and (25) in place of (H.4).
References
Alberti, G., Bellettini, G.: A nonlocal anisotropic model for phase transitions: asymptotic behaviour of rescaled energies. Eur. J. Appl. Math. 9, 261–284 (1998)
Alberti, G., Bellettini, G.: A nonlocal anisotropic model for phase transitions. I. The optimal profile problem. Math. Ann. 310, 527–560 (1998)
Alberti, G., Bouchittè, G., Seppecher, P.: Aun rèsultat de perturbations singuliéres avec la norme \({H}^{\frac{1}{2}}\), C. R. Acad. Sci. Paris Sèr. I Math. 319, (1994)
Alt, H.W., Pawlow, I.: A mathematical model of dynamics of non-isothermal phase separation. Physica D 59, 389–416 (1992)
Ambrosio, L.: Metric space valued functions of bounded variation. Ann. Scuola Norm. Sup. Pisa Cl. Sci. (4) 17, 439–478 (1990)
Ambrosio, L., Fusco, N., Pallara, D.: Functions of Bounded Variation and Free Discontinuity Problems. Oxford Mathematical Monographs. The Clarendon Press, Oxford University Press, New York (2000)
Ansini, N., Braides, A., Chiadò Piat, V.: Interactions between homogenization and phase-transition processes. Tr. Mat. Inst. Steklova 236, 386–398 (2002)
Ansini, N., Braides, A., Chiadò Piat, V.: Gradient theory of phase transitions in composite media. Proc. R. Soc. Edinb. Sect. A 133, 265–296 (2003)
Anzellotti, G., Baldo, S.: Asymptotic development by \(\Gamma \)-convergence. Appl. Math. Optim. 27, 105–123 (1993)
Anzellotti, G., Baldo, S., Orlandi, G.: \(\Gamma \)-asymptotic developments, the Cahn–Hilliard functional, and curvatures. J. Math. Anal. Appl. 197, 908–924 (1996)
Baldo, S.: Minimal interface criterion for phase transitions in mixtures of Cahn–Hilliard fluids. Ann. Inst. H. Poincaré Anal. Non Linéaire 7, 67–90 (1990)
Barroso, A.C., Fonseca, I.: Anisotropic singular perturbations the vectorial case. Proc. R. Soc. Edinb. Sect. A 124, 527–571 (1994)
Bellettini, G., Nayam, A.-H., Novaga, M.: \(\Gamma \)-type estimates for the one-dimensional Allen–Cahn’s action. Asymptot. Anal. 94, 161–185 (2015)
Bouchittè, G.: Singular perturbations of variational problems arising from a two-phase transition model. Appl. Math. Optim. 21, 289–314 (1990)
Braides, A.: Gamma-Convergence for Beginners, vol. 22. Clarendon Press, Oxford (2002)
Braides, A., Truskinovsky, L.: Asymptotic expansions by \(\Gamma \)-convergence. Contin. Mech. Thermodyn. 20, 21–62 (2008)
Braides, A., Zeppieri, C.I.: Multiscale analysis of a prototypical model for the interaction between microstructure and surface energy. Interfaces Free Bound. 11, 61–118 (2009)
Cahn, J.W., Hilliard, J.E.: Free energy of a nonuniform system. I. Interfacial free energy. J. Chem. Phys. 28, 258–267 (1958)
Carr, J., Gurtin, M., Slemrod, M.: Structured phase transitions on a finite interval. Arch. Ration. Mech. Anal. 86, 317–351 (1984)
Choksi, R., Fonseca, I., Lin, J., Venkatraman, R.: Homogenization for an Allen–Cahn equation in periodic media: a variational approach. Preprint
Cicalese, M., Nagase, Y., Pisante, G.: The Gibbs–Thomson relation for non homogeneous anisotropic phase transitions. Adv. Calc. Var. 3, 321–344 (2010)
Cristoferi, R., Fonseca, I., Hagerty, A., Popovici, C.: A homogenization result in the gradient theory of phase transitions. Interfaces Free Bound. 21, 367–408 (2019)
Dal Maso, G.: An Introduction to \(\Gamma \)-Convergence. Springer, Berlin (1993)
Dal Maso, G., Fonseca, I., Leoni, G.: Second order asymptotic development for the anisotropic Cahn–Hilliard functional. Calc. Var. Partial Differ. Equ. 54, 1119–1145 (2015)
De Giorgi, E., Franzoni, T.: Su un tipo di convergenza variazionale. Atti Accad. Naz. Lincei Rend. Cl. Sci. Fis. Mat. Natur. (8) 58, 842–850 (1975)
Del Nin, G.: Rectifiability of the jump set of locally integrable functions. Preprint
Dirr, N., Lucia, M., Novaga, M.: \(\Gamma \)-convergence of the Allen–Cahn energy with an oscillating forcing term. Interfaces Free Bound. 8, 47–78 (2006)
Dirr, N., Lucia, M., Novaga, M.: Gradient theory of phase transitions with a rapidly oscillating forcing term. Asymptot. Anal. 60, 29–59 (2008)
Evans, L.C., Gariepy, R.F.: Measure Theory and Fine Properties of Functions. Textbooks in Mathematics, revised ed. CRC Press, Boca Raton (2015)
Fonseca, I., Leoni, G.: Modern Methods in the Calculus of Variations: \(L^p\) Spaces. Springer Monographs in Mathematics. Springer, New York (2007)
Fonseca, I., Müller, S.: Relaxation of quasiconvex functionals in \({\rm BV}(\Omega ,{ R}^p)\) for integrands \(f(x, u,\nabla u)\). Arch. Ration. Mech. Anal. 123, 1–49 (1993)
Fonseca, I., Popovici, C.: Coupled singular perturbations for phase transitions. Asymptot. Anal. 44, 299–325 (2005)
Fonseca, I., Tartar, L.: The gradient theory of phase transitions for systems with two potential wells. Proc. R. Soc. Edinb. Sect. A 111, 89–102 (1989)
Garroni, A., Palatucci, G.: A singular perturbation result with a fractional norm. In: Variational Problems in Materials Science, vol. 68 of Progress in Nonlinear Differential Equations Applications. Birkäuser, Basel, pp. 111–126 (2006)
Gonzalez, E., Massari, U., Tamanini, I.: On the regularity of boundaries of setsminimizing perimeter with a volume constraint. Indiana Univ. Math. J. 32, 25–37 (1983)
Grüter, M.: Boundary regularity for solutions of a partitioning problem. Arch. Ration. Mech. Anal. 97, 261–270 (1987)
Gurtin, M.E.: On the theory of phase transitions with interfacial energy. Arch. Ration. Mech. Anal. 87, 187–212 (1985)
Gurtin, M.E.: Some results and conjectures in the gradient theory of phase transitions. In Metastability and Incompletely Posed Problems. Springer, pp. 135–146 (1987)
Ishige, K.: Singular perturbations of variational problems of vector valued functions. Nonlinear Anal. 23, 1453–1466 (1994)
Ishige, K.: The gradient theory of the phase transitions in Cahn–Hilliard fluids with Dirichlet boundary conditions. SIAM J. Math. Anal. 27, 620–637 (1996)
Kohn, R., Sternberg, P.: Local minimizers and singular perturbations. Proc. R. Soc. Edinb. Sect. A 111, 69–84 (1989)
Leoni, G.: Gamma Convergence and Applications to Phase Transitions. CNA Lecture Notes (2013)
Leoni, G.: A remark on the compactness for the Cahn–Hilliard functional. ESAIM Control Optim. Calc. Var. 20, 517–523 (2014)
Leoni, G.: A First Course in Sobolev Spaces. Graduate Studies in Mathematics, vol. 181, 2nd edn. AMS, Providence (2017)
Leoni, G., Murray, R.: Second-order \(\Gamma \)-limit for the Cahn–Hilliard functional. Arch. Ration. Mech. Anal. 219, 1383–1451 (2016)
Leoni, G., Murray, R.: Local minimizers and slow motion for the mass preserving Allen–Cahn equation in higher dimensions. Proc. Am. Math. Soc. 147, 5167–5182 (2019)
Lin, F., Pan, X.-B., Wang, C.: Phase transition for potentials of high-dimensional wells. Commun. Pure Appl. Math. 65, 833–888 (2012)
Luckhaus, S., Modica, L.: The Gibbs–Thompson relation within the gradient theory of phase transitions. Arch. Ration. Mech. Anal. 107, 71–83 (1989)
Modica, L.: The gradient theory of phase transitions and the minimal interface criterion. Arch. Ration. Mech. Anal. 98, 123–142 (1987)
Modica, L.: Gradient theory of phase transitions with boundary contact energy. Ann. Inst. H. Poincaré. Analyse Non Linéaire, pp. 487–512 (1987)
Modica, L., Mortola, S.: Il limite nella \(\Gamma \)-convergenza di una famiglia di funzionali ellittici. Boll. Un. Mat. Ital. A (5) 14, 526–529 (1977)
Modica, L., Mortola, S.: Un esempio di \(\Gamma \)-convergenza. Boll. Un. Mat. Ital. B (5) 14, 285–299 (1977)
Monteil, A., Santambrogio, F.: Metric methods for heteroclinic connections. Math. Methods Appl. Sci. 41, 1019–1024 (2018)
Owen, N.C., Rubinstein, J., Sternberg, P.: Minimizers and gradient flows for singularly perturbed bi-stable potentials with a Dirichlet condition. Proc. R. Soc. Lond. Ser. A 429, 505–532 (1990)
Owen, N.C., Sternberg, P.: Nonconvex variational problems with anisotropic perturbations. Nonlinear Anal. 16, 705–719 (1991)
Penrose, O., Fife, P.C.: Thermodynamically consistent models of phase-field type for the kinetics of phase transitions. J. Chem. Phys. 43, 44–62 (1990)
Savin, O., Valdinoci, E.: \(\Gamma \)-convergence for nonlocal phase transitions. Ann. Inst. H. Poincaré Anal. Non Linéaire 29, 479–500 (2012)
Sternberg, P.: The effect of a singular perturbation on nonconvex variational problems. Arch. Ration. Mech. Anal. 101, 209–260 (1988)
Sternberg, P.: Vector-Valued Local Minimizers of Nonconvex Variational Problems, vol. 21, pp. 799–807. (1991) Current directions in nonlinear partial differential equations (Provo, UT, 1987)
Sternberg, P., Zumbrun, K.: Connectivity of phase boundaries in strictly convex domains. Arch. Ration. Mech. Anal. 141, 375–400 (1998)
Van der Waals, J.D.: The thermodynamics theory of capillarity under the hypothesis of a continuous variation of density. Verhaendel Kronik. Akad. Weten. Amsterdam 1, 386–398 (1893)
Zuniga, A., Sternberg, P.: On the heteroclinic connection problem for multi-well gradient systems. J. Differ. Equ. 261, 3987–4007 (2016)
Acknowledgements
The authors acknowledge the Center for Nonlinear Analysis at Carnegie Mellon University (NSF PIRE Grant No. OISE-0967140) where part of this work was carried out. The work of the first author has been supported by the National Science Foundation under Grant No. DMS-1411646 of Irene Fonseca during the period at CMU, by Grant Nos. EP/R013527/1 and EP/R013527/2 “Designer Microstructure via Optimal Transport Theory” of David Bourne during the period at Heriot-Watt University. The research of the second author was partially funded by the National Science Foundation under Grant Nos. DMS-1412095 and DMS-1714098 during the period at CMU, by the research support programs of Charles University under Grant Nos. PRIMUS/19/SCI/01 and UNCE/SCI/023, and by the Czech Science Foundation (GAČR) under Grant No. GJ19-11707Y. The authors would also like to thank Giovanni Leoni for his helpful insights.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by F.H. Lin.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Cristoferi, R., Gravina, G. Sharp interface limit of a multi-phase transitions model under nonisothermal conditions. Calc. Var. 60, 142 (2021). https://doi.org/10.1007/s00526-021-02008-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00526-021-02008-3