Abstract
We study the probability measures \(\rho \in \mathcal M(\mathbb R^{2})\) minimizing the functional
$$\begin{aligned} J[\rho ]=\iint \log \frac{1}{|x-y|}d\rho (x)d\rho (y)+d^2(\rho , \rho _0), \end{aligned}$$
where \(\rho _0\) is a given probability measure and \(d(\rho , \rho _0)\) is the 2-Wasserstein distance between \(\rho \) and \(\rho _0\). \(J[\rho ]\) appears in aggregation models when the movement of particles is advanced by the potential \(-\log |x|*\rho \). We prove the existence of minimizers \(\rho \) and show that the potential \(U^\rho =-\log |x|*\rho \) solves a degenerate obstacle problem, the obstacle being the transport potential. Every minimizer \(\rho \) is absolutely continuous with respect to the Lebesgue measure. The singular set of the free boundary of the obstacle problem is contained in a rectifiable set, and its Hausdorff dimension is \(< n-1\). Moreover, \(U^\rho \) solves a nonlocal Monge–Ampère equation, which after linearization leads to the equation \(\rho _t={\text {div}}(\rho \nabla U^\rho )\). The methods we develop use Fourier transform techniques. They work equally well in high dimensions \(n\ge 3\) for the energy
$$\begin{aligned} J[\rho ]=\iint |x-y|^{2-n}d\rho (x)d\rho (y)+d^2(\rho , \rho _0). \end{aligned}$$
1 Introduction
In this paper we are concerned with the minimization of the functional
$$\begin{aligned} J[\rho ]=\iint \log \frac{1}{|x-y|}d\rho (x)d\rho (y)+d^2(\rho , \rho _0) \qquad (1.1) \end{aligned}$$
among all probability measures \(\rho \) with finite second moment. Here \(d^2(\rho , \rho _0)=\inf _{\gamma }\frac{1}{2} \iint |x-y|^2d\gamma (x,y)\) is the square of the Wasserstein distance between \(\rho \) and a given probability measure \(\rho _0\), and \(\gamma \) is a joint probability measure with marginals \(\pi _{x\#}\gamma =\rho \), \(\pi _{y\#}\gamma =\rho _0\). The support of \(\rho \) is a priori unknown (or free) and our main goal is to analyze the regularity of the free boundary, i.e. the boundary of the set where \(\rho \not =0\).
An analogous problem arises in high dimensions if we replace the logarithmic kernel by \(K(x-y)={|x-y|^{2-n}}, n\ge 3\). The methods we employ do not depend on the dimension. We focus on the logarithmic kernel since the potential \(U^\rho =-\rho *\log |x|\) may change sign and log-interaction phenomena have a number of important applications [29, 32] (in Sect. 6 we also give a connection with random matrices).
An interesting feature of the variational problem for \(J[\rho ]\) is that it leads to an obstacle problem involving the potential of the optimal transport of \(\rho \) to \(\rho _0\). Let \(U^\rho \) be the logarithmic potential (or the Newtonian potential if \(n\ge 3\)) of the probability measure \(\rho \) and \(\psi \) the potential of the transport map. Then formally we have
Since \(\Delta U^\rho =-2\pi \rho \), it follows that
Thus combining (1.2) and (1.3) we have the obstacle problem
In this formulation the position of the obstacle is a priori unknown, as opposed to the classical case [7]. Note that \(\psi \) is a semiconvex function, hence from Aleksandrov’s theorem it follows that \(D^2 \psi \) exists a.e. Consequently, the first equation in (1.4) is satisfied in the a.e. sense provided that \(\rho \) is absolutely continuous with respect to the Lebesgue measure.
The partial mass transport and Monge–Ampère obstacle problems were developed in the seminal work of Caffarelli and McCann [6], see also [16, 19] and the references given there.
Several papers introduced variational problems for measures. In [26] McCann formulated a variational principle for the energy
which allowed him to prove existence and uniqueness for a family of attracting gas models, and to generalize the Brunn-Minkowski inequality from sets to measures.
Another interesting energy
appears in the large deviation laws and log-gas interactions [29, 32]. This problem is also related to the classical obstacle problem [3]. Thanks to the quadratic potential every measure minimizing \(F[\cdot ]\) is confined in some ball. Furthermore, one can prove transport inequalities and bounds for the Wasserstein distance in terms of \(F[\rho ]\) [25].
There is a vast literature on interaction energies for probability measures governed by the Wasserstein metric [8, 10, 11, 20]. In particular, [13] contains an \(L^\infty \) estimate for the equilibrium measure and [14] a connection to obstacle problems.
The energy \(J[\rho ]\) appears in a number of physical considerations, for example in aggregation models where the movement of particles is advanced by the global potential \(U^\rho \). The corresponding gradient flow is \(\rho _t={\text {div}}(\rho \nabla U^\rho )\) [30], page 307. Another application appears in thermalization of granular media [12].
In [31] Savin considered the optimal transport of probability measures in the periodic setting for the energy \( \int |\nabla \rho |^2+d^2(\rho , \rho _0), \rho \in H^1([0,1]^n) \). The resulting obstacle problem takes the form
where \(\psi \) is the transport potential of \(\rho \rightarrow \rho _0\) with given initial periodic probability measure \(\rho _0\) with \(H^1\) density.
The aim of this paper is to study the free boundary of the obstacle problem (1.4) for the minimizers \(\rho \) of \(J[\rho ]\).
In [4], the authors consider a gradient flow of the interaction energy
and study the convergence of radially symmetric solutions under conditions imposed on K, namely radially symmetric solutions converge exponentially fast in some transport distance toward a spherical shell stationary state.
In [5], the minimizers of \(E_0\) are studied in the \(d_\infty \) topology. The main result there is a Hausdorff dimension estimate for the support of the minimizer.
In [9], the minimization problem for \(E_0\) under the constraint \(d_\infty (\mu , \mu _0)<\epsilon \) is studied. As a result the authors obtain an obstacle type problem for the potential of transport. They also prove that \(\text{ supp }\rho \) has finite perimeter and that the transport potential is \(C^{1,1}\) regular under some conditions on K.
In our paper the energy is different: it contains the additional transport term \(d^2(\rho , \rho _0)\). Moreover, the set of admissible measures is not constrained to a neighborhood of a given initial measure \(\rho _0\) in the \(d_\infty \) topology. Note also that the above papers do not study the singular set of the free boundary, whereas we estimate the dimension of the singular set of \(\text{ supp }\rho _0\cap \text{ supp }\rho \). This estimate is quite different from its counterpart for the classical obstacle problem.
1.1 Main results
The energy \(J[\rho ]\) has nonlocal character due to the presence of the logarithmic kernel. However, thanks to the Wasserstein distance \(\rho \) is forced to have compact support provided that \(\mathrm{{supp}}\rho _0\) is compact. Observe that if \(\rho \) has atoms then \(J[\rho ]=\infty \), since the logarithmic term is unbounded, see also the discussion on page 133 [2] for the pure optimal transport case for the energy without the logarithmic term.
Theorem A
If \(\rho _0\) has compact support then there is a probability measure \(\rho \) minimizing J such that \(\mathrm{{supp}}\rho \) is compact. Moreover, \(\rho \) cannot have atoms and hence there is a measure preserving transport map \(y=T(x)\) such that \(\rho _0\) is the push forward of \(\rho \).
The second part of the theorem follows from the standard theory of optimal transport [1, 2]. The chief difficulty in proving the first part is to show that there is a minimizing sequence of probability measures with uniformly bounded supports. In order to establish this we use Carleson’s estimate from below for the nonlocal term and a localization argument for the Fourier transforms of these measures. For other applications of Fourier transforms see [23] and references therein.
Next we want to analyze the character of equilibrium measures. We show that \(\rho \in L^\infty \). To see this we compute and explore the first variation of J. The weak form of the Euler-Lagrange equation implies that \(\hat{\rho }\), the Fourier transform of \(\rho \), is in \(L^2\).
Theorem B
Let \(\rho , \rho _0\) be as in Theorem A. Then \(\widehat{\rho }\in L^2(\mathbb R^2)\) and \(d\rho =f dx\) on \(\mathrm{{supp}}\rho \) where \(f\in L^{\infty }(\mathbb R^2)\). In particular, the transport map \(y=T(x)\) (as in Theorem A) is given by
where \(U^\rho =\rho *K\) is the potential of \(\rho \) and \(\nabla U^\rho \) is log-Lipschitz continuous.
The log-Lipschitz continuity of \(\nabla U^\rho \) follows from Judovič’s theorem [21]. In fact, from the Calderón–Zygmund estimates it follows that \(D^2U^\rho \in L^p_{loc}\) for every \(p>1\). The local mass balance condition for the optimal transport leads to a nonlocal Monge–Ampère equation
Equation (1.6) implies that \(\mathrm{{supp}}\rho \subset \mathrm{{supp}}\rho _0\). If we linearize (1.6) using a time discretization scheme, the resulting equation is \(\rho _t={\text {div}}(\rho \nabla U^\rho )\).
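The following formal computation is a sketch under stated assumptions rather than the argument of the paper: it assumes the minimality inequality obtained at the end of the proof of Theorem 4.4, that \(U^\rho \) is differentiable at \(x^*\), and that \(\rho , \rho _0\) have densities. Under these assumptions the first-order condition at \(x^*\) and the local mass balance suggest

$$\begin{aligned} &2U^\rho (x)+\frac{1}{2}|x-y^*|^2\ge 2U^\rho (x^*)+\frac{1}{2}|x^*-y^*|^2 \quad \text{for every } x \\ &\quad \Longrightarrow \quad 2\nabla U^\rho (x^*)+x^*-y^*=0, \quad \text{i.e. } y^*=x^*+2\nabla U^\rho (x^*), \\ &\rho _0(T(x))\det DT(x)=\rho (x) \quad \text{with } T(x)=x+2\nabla U^\rho (x) \\ &\quad \Longrightarrow \quad \rho _0(x+2\nabla U^\rho )\det (\mathrm{Id}+2D^2U^\rho )=\rho . \end{aligned}$$

Whether these expressions coincide with (1.5) and (1.6) is an assumption of this sketch. For the linearization, assume further (hypothetically) that the factor 2 is replaced by a small time step \(\tau \), that \(\rho _0\) is smooth, and that \(\rho _0, \rho \) are consecutive densities of a time discretization. Expanding to first order in \(\tau \) gives

$$\begin{aligned} \rho (x)&=\rho _0(x+\tau \nabla U^\rho )\det (\mathrm{Id}+\tau D^2U^\rho ) \\ &=\rho _0+\tau \nabla \rho _0\cdot \nabla U^\rho +\tau \rho _0\Delta U^\rho +O(\tau ^2) \\ &=\rho _0+\tau \,{\text {div}}(\rho _0\nabla U^\rho )+O(\tau ^2), \end{aligned}$$

so that the difference quotient \((\rho -\rho _0)/\tau \) formally approximates \({\text {div}}(\rho \nabla U^\rho )\).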
The analysis of the structure of the singular set in obstacle problems is a central problem of the regularity theory. Let \(\hbox {MD}(\mathrm{{supp}}\rho \cap B_r(x))\) be the infimum of distances between pairs of parallel planes such that \(\mathrm{{supp}}\rho \cap B_r(x)\) is contained in the strip determined by them [7]. Let
Observe that if \(n=2\) then (1.6) is equivalent to \(2\pi \rho _0[4\det D^2U^\rho +2 \Delta U^\rho +1]=-\Delta U^\rho \). From here we can deduce the equation
Consequently, the standard regularity theory for the Monge–Ampère equation (see [33]) implies that we can get higher regularity for \(\rho \) if \(\rho _0\) is sufficiently smooth.
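The two-dimensional identity above can be checked by a short computation. The sketch assumes (this form is not displayed in the text and is an assumption here) that the mass balance reads \(\rho _0\det (\mathrm{Id}+2D^2U^\rho )=\rho \), with \(\rho _0\) evaluated at the image point. Writing \(a, b, c\) for the entries of the symmetric \(2\times 2\) matrix \(D^2U^\rho \), we have

$$\begin{aligned} \det (\mathrm{Id}+2D^2U^\rho )&=(1+2a)(1+2c)-4b^2 \\ &=1+2(a+c)+4(ac-b^2) \\ &=1+2\Delta U^\rho +4\det D^2U^\rho . \end{aligned}$$

Substituting \(\rho =-\frac{1}{2\pi }\Delta U^\rho \) on the right hand side of the mass balance then gives \(2\pi \rho _0[4\det D^2U^\rho +2 \Delta U^\rho +1]=-\Delta U^\rho \).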
Theorem C
Let \(\omega ({R})\) be the modulus of continuity of the slab height (see (1.7)), \(B_i=B_{r_i}(x_i)\) a collection of disjoint balls included in \(B_R\) with \(x_i\in S\), where S is the singular set. Then for every \(\beta >n-1\) we have
Furthermore, if \(\omega ({R})= R^\sigma \), then there is \(\sigma '=\sigma '(n, \sigma )\) such that the singular set \(S\subset M_0\cup \bigcup _{i=1}^\infty M_i\) where \(\mathcal {H}^{n-1-\sigma '}(M_0)=0\) and \(M_i\) is contained in some \(C^1\) hypersurface such that the measure theoretic normal exists at each \(x\in S\cap M_i, i\ge 1\).
The paper is organized as follows: In Sect. 2 we recall some facts on the Wasserstein distance and Fourier transformation of measures. One of the key facts that we use is that the logarithmic term can be written as a weighted \(L^2\) norm of the Fourier transformation of \(\rho .\)
Section 3 contains the proof of Theorem A. The chief difficulty in the proof is to control the supports of the sequence of minimizing measures.
Section 4 contains some basic discussion of cyclic monotonicity and maximal Kantorovich potential. Then we derive the Euler-Lagrange equation. From here we infer that \(\rho \) has \(L^\infty \) density with respect to the Lebesgue measure. Theorem B follows from Theorem 4.4 and Corollary 4.6.
In Sect. 5 we study the regularity of free boundary and prove Theorem C.
The last two sections contain some final remarks and possible applications. First, in Sect. 6 we discuss the relation of \(J[\rho ]\) with the large deviations laws for the random matrices with interaction and provide a simple model with energy J. Finally, Sect. 7 is devoted to the nonlocal Monge–Ampère equation and its linearization \(\partial _t \rho ={\text {div}}(\rho \nabla U^\rho )\).
1.2 Notation
We will denote by \(\mathcal M(\mathbb R^n)\) the set of probability measures on \(\mathbb R^n\), and let \(\mu _{\# f}\) be the push forward of \(\mu \in \mathcal M(\mathbb R^{n})\) under a mapping f. \(d(\mu , \rho )\) denotes the 2-Wasserstein distance of \(\mu , \rho \in \mathcal M(\mathbb R^{n})\), \(B_r(x_0)\) the open ball of radius r centered at \(x_0\), K the kernels
$$\begin{aligned} K(x)=\left\{ \begin{array}{ll} \log \frac{1}{|x|} & \text{if } n=2, \\ |x|^{2-n} & \text{if } n\ge 3, \end{array}\right. \end{aligned}$$
\(U^\rho =\rho *K\) is the potential of \(\rho \in \mathcal M(\mathbb R^{n})\), \(\mathcal {H}^n\) the n dimensional Hausdorff measure, \(1_E\) the characteristic function of \(E\subset \mathbb R^n\). The restriction of \(\mu \in \mathcal M(\mathbb R^{n})\) on some \(E\subset \mathbb R^{n}\) will be denoted by , and \(\widehat{\mu }(\xi ) =\int e^{-2\pi i \langle x, \xi \rangle }d\mu (x)\) is the Fourier transform of \(\mu \in \mathcal M(\mathbb R^{n})\).
2 Set-up
Let \(f:\mathbb R^n\rightarrow \mathbb R^n\) be a map. For a Borel set \(E\subset \mathbb R^n\) the push forward of \(\mu \) under f is defined by \(\mu _{\# f}(E)=\mu (f^{-1}(E))\). For every joint probability measure \(\gamma \in \mathcal M(\mathbb R^n\times \mathbb R^n)\) we define the projections \(\pi _x:(x, y)\rightarrow x\), \(\pi _y:(x, y)\rightarrow y\).
We require \(\gamma \) to have prescribed marginals \(\rho \), \(\rho _0\in \mathcal M(\mathbb R^{n})\), i.e.
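For probability measures \(\rho , \rho _0\in \mathcal M(\mathbb R^n)\) we define their Wasserstein distance as follows
$$\begin{aligned} d(\rho , \rho _0)=\left( \inf _{\gamma }\frac{1}{2}\iint |x-y|^2d\gamma (x,y)\right) ^{\frac{1}{2}}, \end{aligned}$$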
where \(\gamma \)’s are transport plans such that \(\gamma _{\#\pi _x}=\rho \), \(\gamma _{\#\pi _y}=\rho _0\). We recall the following properties of the Wasserstein distance:
1) d is a distance,
2) \(d^2\) is convex, i.e.
$$\begin{aligned} d^2(tu+(1-t)v, w)\le td^2(u, w)+(1-t)d^2(v, w), \quad t\in [0, 1], u, v\in \mathcal M(\mathbb R^{n}), \end{aligned}$$
3) if \(u_k\rightarrow u, v_k\rightarrow v\) in \(L^1_{loc}\) as \(k\rightarrow \infty \) then
$$\begin{aligned} \lim _{k\rightarrow \infty }d(u_k, v_k)=d(u, v), \end{aligned}$$
4) if \(u_k\rightarrow u, v_k\rightarrow v\) weakly, i.e. \(\int u_k\phi \rightarrow \int u\phi , \int v_k\phi \rightarrow \int v\phi \) for every \(\phi \in C_0\), then
$$\begin{aligned} d(u, v)\le \liminf _{k\rightarrow \infty }d(u_k, v_k). \end{aligned}$$
See [34] for more details.
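Two elementary examples may help to fix the normalization. They use only the definition \(d^2(\rho , \rho _0)=\inf _\gamma \frac{1}{2}\iint |x-y|^2d\gamma \) from the Introduction; the points \(a, b\), the vector h and the translation \(\tau _h(x)=x+h\) are illustrative names, not notation from the paper. For Dirac masses the only transport plan is \(\delta _{(a, b)}\). If \(\rho \) is the push forward of \(\rho _0\) under \(\tau _h\) and \(\rho _0\) has finite second moment, then Jensen's inequality gives a lower bound for every plan \(\gamma \), and the plan induced by \(\tau _h\) gives the matching upper bound:

$$\begin{aligned} d^2(\delta _a, \delta _b)&=\frac{1}{2}|a-b|^2, \\ \frac{1}{2}|h|^2=\frac{1}{2}\Big |\iint (x-y)d\gamma (x,y)\Big |^2&\le \frac{1}{2}\iint |x-y|^2d\gamma (x,y), \\ d^2(\rho , \rho _0)&\le \frac{1}{2}\int |\tau _h(y)-y|^2d\rho _0(y)=\frac{1}{2}|h|^2. \end{aligned}$$

Hence \(d^2(\rho , \rho _0)=\frac{1}{2}|h|^2\) for translates, independently of the shape of \(\rho _0\).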
We also need the following definition of Wasserstein class:
Definition 2.1
Let \((\Omega , |\cdot |)\) be a Polish space (i.e. a complete separable metric space equipped with its Borel \(\sigma \)-algebra). The Wasserstein space of order 2 is defined as
$$\begin{aligned} P_2(\Omega )=\left\{ \mu \in \mathcal M(\Omega ) : \int _\Omega |x_0-x|^2d\mu (x)<\infty \right\} , \end{aligned}$$
where \(x_0\in \Omega \) is arbitrary. This space does not depend on the choice of \(x_0\). Thus d defines a finite distance on \(P_2\).
Remark 2.2
If \(\Omega \) is compact then so is \(P_2\). If \(\Omega \) is only locally compact then \(P_2(\Omega )\) is not locally compact, see [34]. This introduces several difficulties in the proof of the existence of a minimizer.
Remark 2.3
Recall that the Fourier transformation of the truncated kernel \(K_{r_0}=1_{B_{r_0}}K, n=2\) can be computed explicitly
where \(c_1>0\) is a universal constant, \(\mathcal B\) is the Bessel function of the first kind such that \(\mathcal B(0)=1, \mathcal B'(0)=0\) and \(\lim _{t\rightarrow + \infty }\mathcal B(t)=0\) [15].
If \(\mu \in \mathcal M(\mathbb R^2)\) has compact support then from the weak Parseval identity we have that
where \(K(x-y)=\log \frac{1}{|x-y|}\) and \(\widehat{\mu }, \widehat{K}\) are the Fourier transforms of \(\mu , K\) respectively, see [22] for the proof. This observation shows that the energy J is nonnegative for compactly supported \(\mu \in \mathcal M(\mathbb R^{2})\).
We say that \(\mu \in \mathcal M(\mathbb R^{n})\) has finite energy if \(I[\mu ]<\infty \) where \(I[\rho ]=\iint K(x-y)d\rho (x)d\rho (y)\). Then \(\mathcal M(\mathbb R^{n})\) with \(\mathcal I[\rho , \mu ]=\iint K(x-y)d\rho (x)d\mu (y)\) has Hilbert structure, [24] page 82, and
is a norm. It is remarkable that the standard mollifications \(\mu _k\) of \(\mu \) converge to \(\mu \) strongly, i.e. \(\lim _{k \rightarrow \infty }\Vert \mu -\mu _k\Vert =0\), see [24] Lemma 1.\(2'\) page 83.
3 Existence of minimizers
Proposition 3.1
Let \(\mu _0\in \mathcal M(\mathbb R^{2})\) and \(\mathrm{{supp}}\mu _0\subset B_{R_0}\) for some \(R_0>0\). Let \(\mu \in P_2(\mathbb R^2)\) and J be given by (1.1), then
(i) \(J[\mu ]>-\infty \) provided that \(J[\mu ]<+\infty \),
(ii) there is \(\varepsilon >0\) depending on \(R_0\) and \(\mu \) such that \(J[\mu _{\varepsilon }]<J[\mu ]\) provided that \(\mathrm{{supp}}\mu \not \subset {B_\varepsilon }, \mathrm{{supp}}\mu \cap B_\varepsilon \not =\emptyset \), where \(\mu _{\varepsilon }=1_{B_\varepsilon }\mu /\mu (B_{\varepsilon })\) is the normalized restriction of \(\mu \) to \(B_{\varepsilon }\),
(iii) if \(0\le J[\mu _k]\le C\) for some sequence \(\{\mu _k\}\subset P_2(\mathbb R^2)\) and \(\varepsilon _k\) are the corresponding numbers from (ii) then there is \(\varepsilon _0>0\) such that \(\varepsilon _k\le \varepsilon _0\) uniformly in k, where \(\varepsilon _0\) depends only on C and \(R_0\).
Proof
We split the proof into three steps:
Step 1: Second moment estimate:
Let \(\varepsilon >0\) be fixed. By Theorem 1 [28] there is transference plan \(\gamma \in \mathcal M(\mathbb R^2\times B_{R_0}) \) with marginals \(\mu , \mu _0\) such that \(d^2(\mu , \mu _0)=\frac{1}{2}\iint |x-y|^2\gamma .\) Set then
Moreover, the projections of . Hence
Since \(\gamma _\varepsilon \) has marginals \(\mu _\varepsilon , \mu _0\) then \(\frac{1}{2}\iint |x-y|^2\gamma _\varepsilon \ge d^2(\mu _\varepsilon , \mu _0)\). Consequently, this in combination with the last inequality yields
where we denote
provided that \(\varepsilon >{R_0}\). From Hölder’s inequality we have that
hence it gives
Step 2: A bound for the logarithmic term:
Now we want to estimate the logarithmic term from below using the method from Chapter 1.1 [29]. To do so we denote \(Q(x)=c_0|x|^2, w(x)=e^{-c_0|x|^2}\) and introduce the logarithmic energy with quadratic potential
It is convenient to introduce the notation \(K_w(x, y)=\log \frac{1}{|x-y|w(x)w(y)}\), with this we have
Observe that
because \(\frac{1}{2}(|x|+|y|)\le \sqrt{|x|^2+|y|^2}\le |x|+|y|\). Therefore for every large constant \(T_0>0\) there is \(\varepsilon \) such that if \(\max \{|x|, |y|\}\ge \varepsilon \) then \(K_w(x, y)\ge T_0\). This yields the following estimate for \(I_w\)
Thus after some simplification we get
Step 3: Energy comparison in \(B_\varepsilon \):
Combining (3.5) with (3.1) we get
The last three terms on the last line can be further estimated from below as follows
In particular from here and (2.3) we see that \(J[\mu ]>-\infty \) and hence (i) follows. Now if we choose
then from (3.6) it follows that
This implies \((\mu (B_\varepsilon ))^2(J[\mu ]-J[\mu _\varepsilon ])>1-(\mu (B_\varepsilon ))^2\), hence it is enough to take the minimization over \(\mathcal M (B_{\varepsilon })\).
It remains to check (iii). First we estimate
From (3.7) it follows that \(T_0\) can be chosen to be the same for every \(\mu _k\), say \(T_0>\hat{C}\), satisfying \(0\le J[\mu _k]\le C\) and the proof is complete. \(\square \)
Now we are ready to finish the proof of Theorem A.
Theorem 3.2
Let \(\rho _0\in \mathcal M(\mathbb R^{2})\) such that \(\mathrm{{supp}}\rho _0\subset B_{R_0}\) for some \(R_0>0\). Then there exists a minimizer \(\rho \in \mathcal M(\mathbb R^{2})\) of J. Moreover, the support of \(\rho \) is bounded.
Proof
First note that if we take the uniform measure \(\mu \) of some ball B having positive distance from \(B_{R_0}\) then \(J[\mu ]<+\infty \). Hence by Proposition 3.1 (i) we have that \(J[\mu ]>-\infty \). Thus if \(\mu _k\in P_2(\mathbb R^2)\) is a minimizing sequence then without loss of generality we can assume that \(J[\mu _k]\le C\) for some \(C>0\) uniformly in k. Moreover, from Proposition 3.1 (ii) it follows that there are positive numbers \(\varepsilon _k>0\) such that for the restriction measures \(\mu _{k, \varepsilon _k}\) we have
On the other hand it follows from (2.3) that \(J[\mu _{k, \varepsilon _k}]\ge 0\) because \(\mathrm{{supp}}\mu _{k, \varepsilon _k}\) is compact. Thus \(0\le J[\mu _{k, \varepsilon _k}]\le C\) uniformly in k and moreover \(J[\mu _{k, \varepsilon _k}]\rightarrow \inf _{\rho \in P_2(\mathbb R^2)} J[\rho ]\) thanks to (3.8). Consequently, applying Proposition 3.1 (iii), we can use the weak compactness of \(\mu _{k, \varepsilon _k}\) in \(\mathcal M(B_{\varepsilon _0})\) to extract a subsequence, still denoted \(\mu _{k, \varepsilon _k}\), converging weakly to some \(\rho \in \mathcal M(B_{\varepsilon _0})\). The logarithmic term is lower semicontinuous ([29], or [24] page 78), hence from the lower semicontinuity of d (see property 4) in Sect. 2) it follows that
and the desired result follows. \(\square \)
4 Euler-Lagrange equation
Definition 4.1
We say that a set \(S\subset \mathbb R^n\times \mathbb R^n\) is cyclically monotone if
holds whenever \(m\ge 2\) and \((x_i, y_i)\in S, 1\le i\le m\) with \(x_{m+1}=x_1\). The set \(x_1, x_2, \dots , x_m\) is called a cycle.
Cancelling the square terms from (4.1) we get
Let \(\gamma \) be an optimal transference plan with marginals \(\rho , \rho _0\). It is well known that the support of \(\gamma \) is cyclically monotone, see [1] Theorem 2.2.
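The cancellation of the square terms can be made explicit. The sketch below assumes that (4.1) is the standard cyclic monotonicity condition for the cost \(\frac{1}{2}|x-y|^2\), namely \(\sum _{i=1}^m|x_i-y_i|^2\le \sum _{i=1}^m|x_{i+1}-y_i|^2\); this form is an assumption, since the display is not reproduced here. Expanding each term and using \(x_{m+1}=x_1\), so that the sum of \(|x_{i+1}|^2-|x_i|^2\) telescopes to zero, we get

$$\begin{aligned} 0&\le \sum _{i=1}^m\left( |x_{i+1}-y_i|^2-|x_i-y_i|^2\right) \\ &=\sum _{i=1}^m\left( |x_{i+1}|^2-|x_i|^2\right) -2\sum _{i=1}^m\langle y_i, x_{i+1}-x_i\rangle \\ &=-2\sum _{i=1}^m\langle y_i, x_{i+1}-x_i\rangle . \end{aligned}$$

For \(m=2\) this reduces to \(\langle y_1, x_2-x_1\rangle +\langle y_2, x_1-x_2\rangle \le 0\), i.e. the usual monotonicity \(\langle y_1-y_2, x_1-x_2\rangle \ge 0\).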
Let \(S\subset \mathbb {R} ^{n}\times \mathbb {R} ^{n}\) be cyclically monotone. Set \(c\left( x,y\right) =\dfrac{1}{2}\left| x-y\right| ^{2}\) and introduce the function
where the supremum is taken over all cycles of finite length. It is easy to check that \(\psi \) defined in (4.3) satisfies \( \psi \left( x\right) \le 0 \) and the normalization condition \( \psi \left( x_{0}\right) =0. \)
If \(\gamma (x, y)\) is a transference plan then its support is contained in the c-superdifferential of the c-concave function \(\psi \) constructed above. \(\psi \) is called the maximal Kantorovich potential. Moreover, we have that if \((x', y')\in \mathrm{{supp}}\gamma \) then for every \(x\in \mathbb R^n\)
See Theorem 2.3 [1] for proof.
Remark 4.2
Recall that by Corollary 2.2 [1] if (CC) graphs are \(\rho \) negligible then the transference plan \(\gamma \) is unique and the transport map \(T=\nabla v\) for some convex potential v.
We want to show that in (4.4) we can take \(\psi =2U^\rho \), and \(\rho \) is absolutely continuous with respect to the Lebesgue measure.
Lemma 4.3
Let \(\rho \) be a minimizer, then \(U^\rho \rho \) is a signed Radon measure.
Proof
Let \(\xi \in C_0^\infty (B)\) be a cut-off function of some ball B. Let \(\{\rho _k\}_{k=1}^\infty \) be a sequence of mollifications of \(\rho \). Recall that by Remark 2.3 \(I[\rho _k]<\infty \), and \(\Vert \rho -\rho _k\Vert =I[\rho -\rho _k]\rightarrow 0\) as \(k\rightarrow \infty \). Thus
Note that [24] Lemma 1.\(2'\) page 83
as \(k\rightarrow \infty \). Since \(U^\rho \in H^1\) (see [22]) is superharmonic (hence bounded below in B, say by \(C_B\)) then from Fatou’s lemma we get that
where C depends only on the dimension. \(\square \)
Theorem 4.4
Let \(\rho \) be a minimizer. Suppose the infimum in \(d(\rho , \rho _0)\) is realized for a transference plan \(\gamma \) and \((x^*, y^*)\in \mathrm{{supp}}\gamma \). Then \(\rho \) has \(L^\infty \) density with respect to the Lebesgue measure, and for every \(x_0\) we have
Moreover, \(\nabla U^\rho \) is log-Lipschitz continuous.
Proof
Let \(\xi (x)\) be a cut-off function on \(B_\varepsilon (x^*)\). Introduce
Note that \(\gamma ^*_\varepsilon (x, y)\) is not a probability measure. Let \(\gamma _\varepsilon (x, y)=\tau _\#\gamma ^*_\varepsilon (x, y)\), where \(\tau : (x, y)\rightarrow (x-x^*+x_0, y)\) is the translation operator in x so that \(\tau (x^*, y)=(x_0, y)\), see Fig. 1. Letting
be the marginals in x, we can see that the marginals in y are
because \(\tau \) is measure preserving. Here \(\eta \ge 0\) is a continuous function with compact support. For the other marginal, we have
Observe that by (4.8) and the definition of \(\gamma _\varepsilon ^*\) we have
provided that t is small enough.
Consequently we can use \(\rho -t\varphi ^*+t\varphi _0\) against \(\rho \) and get from the convexity of \(d^2\) (see Sect. 2) the following estimate
For the nonlocal term we have
Then the energy comparison yields
Dividing by t and sending \(t\rightarrow 0\), \(t>0\) we get that
Since \(\gamma _\varepsilon \) is the push forward of \(\gamma _\varepsilon ^*\) under translation \(x\rightarrow x-x^*+x_0\) then we have from (4.9)
Taking \(x^*-x_0=\pm he_j\), where \(e_j\) is the unit direction of the jth coordinate axis, \(h>0\), and adding the resulting inequalities (4.10) we get
But \(\left| x+he_j-y\right| ^{2}+|x-he_j-y|^2-2|x-y|^2=2h^2\), hence (4.11) is equivalent to
Note that by Lemma 4.3 the left hand side of (4.12) is well defined.
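For completeness, the algebraic identity used in passing from (4.11) to (4.12) follows by expanding the two squares; the only notation beyond the text is \(x_j, y_j\) for the jth coordinates of x and y:

$$\begin{aligned} |x\pm he_j-y|^2&=|x-y|^2\pm 2h(x_j-y_j)+h^2, \\ |x+he_j-y|^2+|x-he_j-y|^2-2|x-y|^2&=2h^2. \end{aligned}$$

In particular, the second difference quotient of \(x\mapsto \frac{1}{2}|x-y|^2\) in each coordinate direction equals 1 for every \(h>0\), which is why no remainder term in h appears.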
Claim 4.5
\(\rho \) has \(L^2\) density.
Proof
Let \(\delta _hu=\delta (x, h, u)=\frac{1}{h^2}\sum _j(u(x+he_j)+u(x-he_j)-2u(x))\) be the discrete Laplacian. Then from (4.12) with a sequence of cut offs \(\xi _k\uparrow 1\) on \(B_{\varepsilon _0}\), using the dominated convergence theorem, since \(U^\rho \rho \) is a signed Radon measure (see Lemma 4.3 and (4.6)), and recalling that \(\rho \) has compact support, it follows that
Since \(\mathrm{{supp}}\rho \) is compact we can assume that K vanishes outside of \(B_{r_0}\) and consider the truncated kernel \(K_{r_0}=1_{B_{r_0}}K\). From the weak Parseval identity we get that
Letting \(h\rightarrow 0\) and applying Fatou’s lemma we get
Since the left hand side of the previous inequality does not depend on \(r_0\) we can let \(r_0\rightarrow \infty \) and applying Fatou’s lemma again we see that
Since the Fourier transform is an isometry on \(L^2\), the inverse Fourier transform \(\tilde{\rho }\) of \(\widehat{\rho }\) exists and \(\tilde{\rho }\in L^2\). But then \(\widehat{(\rho -\tilde{\rho })}=0\), and it follows that \(\rho \) has \(L^2\) density. The proof of the claim is complete. \(\square \)
Returning to the localized inequality (4.12) with \((x^*, y^*)\in \mathrm{{supp}}\gamma \) we get
Using the weak convergence of second order finite differences in \(L^2\) we finally obtain from (4.13) and \(\delta _hU^\rho \rightarrow \Delta U^\rho =-2\pi \rho \)
Consequently, the upper Lebesgue density of the measure \(\rho \) is bounded by some universal constant and hence \(d\rho =fdx\) for some \(f\in L^\infty (\mathbb R^n)\) [18]. Therefore from Judovič’s theorem [21] \(\nabla U^\rho \) is log-Lipschitz continuous. Moreover, by construction
Hence from (4.9) and the mean value theorem we get that
Thus \(2U^\rho (x_0)+\frac{1}{2}|x_0-y^*|^2\ge 2U^\rho (x^*)+ \frac{1}{2}|x^*-y^*|^2\). \(\square \)
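A heuristic second-order reading of this inequality indicates where the density bound comes from. It is only a sketch: it assumes \(n=2\) and that \(U^\rho \) is twice differentiable at \(x^*\), whereas the proof above works with finite differences and weak convergence. The auxiliary function \(\Phi \) and the explicit constant below are consequences of these assumptions, not statements taken from the paper. Since \(x^*\) minimizes \(\Phi (x)=2U^\rho (x)+\frac{1}{2}|x-y^*|^2\), we formally have

$$\begin{aligned} D^2\Phi (x^*)=2D^2U^\rho (x^*)+\mathrm{Id}\ge 0 \quad &\Longrightarrow \quad 2\Delta U^\rho (x^*)+2\ge 0, \\ \Delta U^\rho =-2\pi \rho \quad &\Longrightarrow \quad \rho (x^*)\le \frac{1}{2\pi }. \end{aligned}$$

The same condition \(\mathrm{Id}+2D^2U^\rho \ge 0\) is what makes the determinant in the mass balance nonnegative.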
Corollary 4.6
Let \(\rho \) be a minimizer of J, then \(U^\rho =\psi \) on \(\mathrm{{supp}}\rho \). Furthermore, \(\mathrm{{supp}}\rho \) has nonempty interior.
Proof
In view of (4.4) and (4.7), \(U^\rho \) and \(\psi \) have the same c-subdifferential on \(\mathrm{{supp}}\rho \). It follows that \(U^\rho =\psi \) there, and at a free boundary point with \(x^*=y^*\) we have \(\nabla U^\rho (x^*)=0\). The last claim follows from the log-Lipschitz continuity of \(\nabla U^\rho \). \(\square \)
5 Regularity of free boundary
Let \(x^*\in \mathrm{{supp}}\rho \), then from (4.7) we have for every x
Therefore \(U^\rho (x^*)\le U^\rho (x)\) if \(x\in B_{|x^*-y^*|}(y^*):=B\) and \(x^*\not =y^*\). Consequently \(U^\rho \) has a local minimum in \(\overline{B}\) at \(x^*\in \partial B\), and since \(U^\rho \) is superharmonic in \(\mathbb R^2\) it follows from Hopf’s lemma, applied to a ball with diameter \(\overline{x^*y^*}, \) that the normal derivative \(\partial _\nu U^\rho (x^*)<0\) where \(\nu =\frac{x^*-y^*}{|x^*-y^*|}\). Hence at the remaining free boundary points we must have \(x^*=y^*\) and hence \(\nabla \psi (x^*)=0\).
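The comparison \(U^\rho (x^*)\le U^\rho (x)\) can be read off in one line. The sketch assumes that (4.7) has the form obtained at the end of the proof of Theorem 4.4, with \((x^*, y^*)\in \mathrm{{supp}}\gamma \):

$$\begin{aligned} 2\left( U^\rho (x)-U^\rho (x^*)\right) \ge \frac{1}{2}\left( |x^*-y^*|^2-|x-y^*|^2\right) \ge 0 \quad \text{whenever } |x-y^*|\le |x^*-y^*|. \end{aligned}$$

Thus the inequality holds on the closed ball of radius \(|x^*-y^*|\) centered at \(y^*\), and \(x^*\) lies on the boundary of that ball.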
Definition 5.1
Let T be the transport map. We say that \(x\in \mathrm{{supp}}\rho \cap \mathrm{{supp}}\rho _0\) is a singular free boundary point if \(x=T(x), \nabla U^\rho (x)=0\) and
The set of singular points is denoted by S.
Lemma 5.2
Let 0 be a singular free boundary point and \(\rho _0\ge s>0\) on \(\mathrm{{supp}}\rho _0\). Then for every small \(\varepsilon >0\) there is \(R^*>0\) such that the set of singular points in \(B_R, R<R^*\) can be trapped between two parallel planes at distance \(\frac{\sqrt{8n+1}}{(sc_n)^{\frac{1}{2n}}} \varepsilon ^{\frac{1}{2n}}R\) where \(c_n=|B_1|\).
Proof
Let \(\mathcal K\) be the convex hull of the singular set in \(B_R\). Then there is \(x_0\in B_R\) and an ellipsoid E (John’s ellipsoid [17] page 139) so that
Let r be the smallest axis of E. By mass balance condition
By assumption 0 is a singular point, so we have \(\limsup \limits _{t\downarrow 0}\frac{1}{|B_t|}\int _{B_t} \rho (x)dx=0\). Thus for every \(\varepsilon >0\) small there is \(a_0\) such that
By assumption \(\rho _0>s>0\) then
while \( \int _{B_a}\rho \left( x\right) dx < \varepsilon a^{n}. \)
Consequently, combining (5.1)–(5.3) we get \( \varepsilon a^{n} > s r^{n}c_n\) or
It follows that (for small R and \(\varepsilon \)) there is a point \(A\in B_{\frac{r}{2n}} \left( x_0\right) \cap \{\rho _0>0 \}\) and \(B\in \{\rho >0\}\) so that \(|OB| \sim a\) and \(T^{-1}\left( A\right) =B\).
Let \(x_s\) be a singular point. Notice that \(x_{s}=T\left( x_{s}\right) \), i.e. the singular free boundary points are fixed points. From the monotonicity (4.2)
or \(\left( x_{s}-A\right) \left( x_{s}-B\right) \ge 0\). Let \(m=\dfrac{A+B}{2}\) be the midpoint of the segment AB, then
because
Hence we arrive at
From simple geometric considerations we have that (see Figure 2)
Note that \(\sin \alpha =\dfrac{\left| AC\right| }{\left| AB\right| }\le \dfrac{2R}{\left| AB\right| }\), hence it follows that
Therefore \(S\cap B_R\) is on one side of the hyperplane containing the intersection \(B_R\) and the ball with diameter AB, see Fig. 2. Hence
or, in view of (5.4), we get \(4R^{2}\ge \dfrac{r}{2n}\left[ r\left( \dfrac{sc_n}{\varepsilon }\right) ^{1/n}-R\right] \). From here
implying \(r \le \frac{\sqrt{8n+1}}{(sc_n)^{\frac{1}{2n}}} \varepsilon ^{\frac{1}{2n}}R\) and the proof is complete. \(\square \)
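Two elementary steps of the proof can be written out; both are sketches under the stated assumptions. First, with \(m=\frac{A+B}{2}\), the monotonicity inequality \(\left( x_{s}-A\right) \left( x_{s}-B\right) \ge 0\) says that \(x_s\) lies outside the open ball with diameter AB:

$$\begin{aligned} \langle x_s-A, x_s-B\rangle &=\Big \langle (x_s-m)-\frac{A-B}{2}, (x_s-m)+\frac{A-B}{2}\Big \rangle \\ &=|x_s-m|^2-\frac{|A-B|^2}{4}\ge 0. \end{aligned}$$

Second, for the final arithmetic write \(q=\left( \frac{sc_n}{\varepsilon }\right) ^{1/n}\) (an abbreviation introduced here) and assume \(r\le R\), which is plausible because the ellipsoid is contained in the convex hull of a subset of \(B_R\). Then

$$\begin{aligned} 4R^2\ge \frac{r}{2n}\left[ rq-R\right] \quad &\Longrightarrow \quad r^2q\le 8nR^2+rR\le (8n+1)R^2 \\ &\Longrightarrow \quad r\le \sqrt{8n+1}\,q^{-\frac{1}{2}}R=\frac{\sqrt{8n+1}}{(sc_n)^{\frac{1}{2n}}}\varepsilon ^{\frac{1}{2n}}R. \end{aligned}$$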
Lemma 5.3
Let \(\omega (R)\) be the height of the slab containing \(S\cap B_R\) (see (1.7)), \(B_i=B_{r_i}(x_i)\) a collection of disjoint balls included in \(B_R\) with \(x_i\in S\). Then for every \(\beta >n-1\) we have
Proof
Rotate the coordinate system such that \(x_n\) points in the direction of the normal of the parallel planes which are \(\omega ({R})\) apart and contain \(S\cap B_R\). Let \(\mathcal F_0\) be the collection of the balls satisfying \(R\omega (R)<r_i\le R\). If \(B_i\in \mathcal F_0\) then \(\text{ diam }\left( B_i\cap \{x_n=0\}\right) \ge \frac{1}{2}R\omega ({R})\). Therefore there are at most
such balls. Thus we have
and \(\{ B_i\}\setminus \mathcal F_0\) can be covered by balls \(\widehat{B}_{4R\omega ({R})}(y_j)\) such that \(y_j\in \{x_n=0\}\cap B_R\) and \(1\le j\le \frac{1}{\left( \omega ({R})\right) ^{n-1}}\). For each j we have \(S\cap \widehat{B}_{4R\omega ({R})}(y_j)\) is contained in the slab of width
Hence let \(\mathcal F_1\) be the collection of the balls \(B_i\) contained in \(\cup _j \widehat{B}_{4R\omega ({R})}(y_j)\) and satisfying \(R\left( \omega ({R})\right) ^2<r_i\le R\omega ({R}).\) Then every ball \(B_i\) in \(\mathcal F_1\) intersects \(\{x_n=0\}\) such that \(\text{ diam }(B_i\cap \{x_n=0\})\ge \frac{1}{2} R\left( \omega ({R})\right) ^2\) and the number of such balls \(B_i\) is at most
Consequently
Again, as above we can choose at most \(\frac{1}{\left( \omega ({R})\right) ^{n-1}}\) balls \(\widehat{B}_{R\left( \omega ({R})\right) ^2}(y_l), l\le \frac{1}{\left( \omega ({R})\right) ^{n-1}}\) that cover \(\{B_i\}\setminus (\mathcal F_0\cup \mathcal F_1)\). We define \(\mathcal F_m\) inductively such that \(R\left( \omega ({R})\right) ^{m}<r_i\le R\left( \omega ({R})\right) ^{m-1}\) for \(B_i\in \mathcal F_m\), then repeating the argument above we have that
Therefore
\(\square \)
Now we can finish the proof of Theorem C.
Theorem 5.4
Suppose \(\omega ({R})= R^\sigma \), then there is \(\sigma '>0\) depending only on \(n, \sigma \) such that \(S\subset M_0\cup \bigcup _{i=1}^\infty M_i\) where \(\mathcal {H}^{n-1-\sigma '}(M_0)=0\) and \(M_i\) is a \(C^1\) hypersurface such that the measure theoretic normal exists at each \(x\in S\cap M_i, i\ge 1\).
Proof
Let \(x\in S\) be such that there exists a unique normal in the measure theoretic sense, see Definition 5.6 [18]. Notice that at a point x where such a normal exists the set has an approximate tangent plane. Therefore the projections of \(B_r(x)\cap S\) onto two dimensional planes have diameter at least 2R. Thus we let \(M_0\) be the subset of S such that for \(x\in M_0\) there is a sequence \(R_k\rightarrow 0\) such that the projection of \(B_{R_k}(x)\) onto some two dimensional plane is of order \(R_k^{1+\sigma }\).
Now let \(B_{r_i}(x_i)\) be a Besicovitch type covering of \(B_R\cap M_0\). Let us cover \(B_{r_i}(x_i)\cap M_0\) with balls of radius \(r_i^{1+\frac{\sigma }{2}}\), then there are at most
such balls. Hence for \(\alpha >0\) we have
Now we choose \(\delta =\frac{\sigma }{4}\) and \(\beta :=n-1+\delta \) and set
We want to show that for this choice of \(\beta \) we get \(\alpha =n-1-\sigma '\) for some \(\sigma '>0\) depending on n and \(\sigma \). Indeed, we have
\(\square \)
6 Random matrices: an example
In this section we discuss a problem related to random matrices which leads to the obstacle problem (1.4). Let H be a Hermitian matrix, i.e. \(H_{ij}^\dagger =\bar{H}_{ji}\) (or \(H^\dagger =H\) for short), where \(\bar{H}_{ij}\) are the complex conjugates of the entries of the \(N\times N\) matrix H. One of the well-known random matrix ensembles is the Gaussian ensemble. The probability density of the random variables in the Gaussian ensemble is given by the formula
where \(\kappa >0\) and
is the trace of the squared matrix [27]. The dispersion is the same for every H in the ensemble.
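For a Hermitian matrix the trace of the square equals the sum of the squared moduli of all entries, so the Gaussian weight depends on H only through \(\sum _{i,j}|H_{ij}|^2\). The following is a minimal numerical check of that identity; the helper names (`random_hermitian`, `trace_of_square`, `sum_abs_squared`) and the unit-variance sampling are illustrative assumptions, not part of the paper.

```python
import random


def random_hermitian(n, seed=0):
    """Return an n x n Hermitian matrix as nested lists of complex numbers.

    The entry distribution (unit Gaussians) is an arbitrary choice made
    only for this illustration.
    """
    rng = random.Random(seed)
    h = [[0j] * n for _ in range(n)]
    for i in range(n):
        h[i][i] = complex(rng.gauss(0.0, 1.0), 0.0)  # diagonal is real
        for j in range(i + 1, n):
            z = complex(rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0))
            h[i][j] = z
            h[j][i] = z.conjugate()  # H_ji is the conjugate of H_ij
    return h


def trace_of_square(h):
    """Compute Tr(H^2) = sum_{i,k} H_ik * H_ki directly from the entries."""
    n = len(h)
    return sum(h[i][k] * h[k][i] for i in range(n) for k in range(n))


def sum_abs_squared(h):
    """Compute sum_{i,j} |H_ij|^2."""
    return sum(abs(z) ** 2 for row in h for z in row)
```

Calling `trace_of_square(random_hermitian(5))` returns a complex number with zero imaginary part whose real part agrees with `sum_abs_squared` up to rounding.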
The corresponding statistical sum is
\(Z_N\) can be rewritten in an equivalent form
where
and we set \(\kappa =Ng\) for convenience. If we assume that the particles (in equilibrium) have density \(\rho \), then approximating by Riemann sums we get that
As \(N\rightarrow \infty \) the main contribution comes from the minimum of the functional
with respect to the constraint \(\int _\mathbb R\rho =1\).
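For orientation, the passage from the discrete energy W to a continuum functional can be sketched as follows. The normalization of W used here (coefficient \(\kappa \) on the quadratic term, sum over ordered pairs \(i\ne j\)) and the symbol \(\rho _N\) for the empirical measure are assumptions made for illustration; the constants in the formulas of this section may differ.

```latex
% Assumed normalization (illustrative only):
W(x_1,\dots,x_N) = \kappa\sum_{i=1}^N x_i^2 - \sum_{i\ne j}\log|x_i-x_j|,
\qquad \kappa = Ng .
% With the empirical measure \rho_N = \frac{1}{N}\sum_i \delta_{x_i}:
W = N^2\Big( g\int_{\mathbb R} x^2\,d\rho_N(x)
      - \iint_{x\ne y}\log|x-y|\,d\rho_N(x)\,d\rho_N(y)\Big).
% Formal limit N \to \infty, \rho_N \to \rho:
F[\rho] = g\int_{\mathbb R} x^2\rho(x)\,dx
      - \iint \log|x-y|\,\rho(x)\rho(y)\,dx\,dy .
```

Under this normalization both terms scale like \(N^2\), which is why \(\kappa \) is taken proportional to N.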
If in W the quadratic term is replaced by \({-\frac{1}{2}|x_i-y_i|^2\gamma (x_i, y_i)}, g\sim N, H_0=\text{ diag }(y_1, \dots , y_N)\), then we get the model corresponding to the energy J.
Remark 6.1
Let \(n=1\), then the first variation of \(F[\rho ]\) gives
where \(\lambda \) is the Lagrange multiplier of the constraint \(\int _\mathbb R\rho =1\). Differentiating in x we get
The solution of this equation (given in terms of the Hilbert transform) has the form
and this is Wigner’s famous semicircle law [35], see also [32].
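The semicircle law can be checked numerically against a singular integral equation of this type. The sketch below assumes the differentiated equation is normalized as \(\text {p.v.}\int \frac{\rho (y)}{x-y}dy=gx\), in which case the unit-mass semicircle on \([-a,a]\) with \(a^2=2/g\) is the candidate solution. This normalization, the function names, and the quadrature are illustrative assumptions and not taken from the paper.

```python
import math


def semicircle(y, a):
    """Unit-mass semicircle density on [-a, a]."""
    if abs(y) >= a:
        return 0.0
    return 2.0 / (math.pi * a * a) * math.sqrt(a * a - y * y)


def pv_hilbert(x, a, n=20000):
    """Approximate the principal value integral of rho(y)/(x - y) over [-a, a].

    Valid for |x| < a. The singularity is removed by subtracting rho(x):
    the difference quotient is integrated with the midpoint rule, and the
    principal value of 1/(x - y) over [-a, a] is added in closed form,
    namely log((a + x)/(a - x)).
    """
    rho_x = semicircle(x, a)
    h = 2.0 * a / n
    total = 0.0
    for k in range(n):
        y = -a + (k + 0.5) * h
        if y != x:  # guard against an exact hit on the singular point
            total += (semicircle(y, a) - rho_x) / (x - y) * h
    return total + rho_x * math.log((a + x) / (a - x))


def check_semicircle(g, x):
    """Return (numerical value, g*x) under the assumed relation a^2 = 2/g."""
    a = math.sqrt(2.0 / g)
    return pv_hilbert(x, a), g * x
```

For example, `check_semicircle(0.5, 0.5)` compares the quadrature value with \(gx=0.25\); the two agree up to the quadrature error.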
For the problem with \(d^2\) we have \(2U^\rho +\frac{1}{2}|x-T(x)|^2=\lambda \), where \(T:x\rightarrow y\) is the transport map. Since by Theorem B, \(x-T(x)=-2\frac{dU^\rho }{dx}\), it follows that \(U^\rho +\left| \frac{d}{dx} U^\rho \right| ^2=\lambda /2\). Hence \(U^\rho \le \lambda /2\) on \(\mathrm{{supp}}\rho \) and
or equivalently \( \pm 2 \sqrt{\lambda /2-U^\rho }=x+C, \) where C is an arbitrary constant. Thus after normalization we get that
7 The nonlocal Monge–Ampère equation
In this section we use a finite step approximation to obtain a solution, at least formally, to the equation
From Corollary 4.6 we have
Consequently, the prescribed Jacobian equation is
Note that this is a nonlocal Monge–Ampère equation. By standard \(W^{2, p}\) estimates for the potential \(U^\rho \) it follows that \(\mathrm{{supp}}\rho _0\setminus \mathrm{{supp}}\rho \) has vanishing Lebesgue measure.
Let \(h>0\) be small and consider the perturbed energy
Linearizing the corresponding prescribed Jacobian equation
we get
Consequently
or after iterations \(\rho _0, \rho _1, \rho _2, \dots \) with step \(\frac{h}{2}\) we get
Therefore, sending \(h\rightarrow 0\) we obtain the equation \(\rho _t={\text {div}}(\rho \nabla U^\rho )\).
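Read as a continuity equation, the limiting equation (stated in the abstract as \(\rho _t={\text {div}}(\rho \nabla U^\rho )\)) transports mass with velocity \(-\nabla U^\rho \). A minimal particle sketch of this formal dynamics in the plane is given below. It replaces \(\rho \) by N equal point masses, drops the self-interaction term, and uses an explicit Euler step. All of these choices, and every function name, are assumptions made for illustration; this is not the finite step (minimizing movement) scheme of this section.

```python
import random


def velocities(pts):
    """Velocity v_i = -grad U(x_i) for U(x) = -(1/N) sum_j log|x - x_j|.

    The self-term j == i is omitted (an assumption of this sketch).
    """
    n = len(pts)
    vel = []
    for i, (xi, yi) in enumerate(pts):
        vx = vy = 0.0
        for j, (xj, yj) in enumerate(pts):
            if i == j:
                continue
            dx, dy = xi - xj, yi - yj
            r2 = dx * dx + dy * dy
            vx += dx / r2
            vy += dy / r2
        vel.append((vx / n, vy / n))
    return vel


def euler_step(pts, dt):
    """One explicit Euler step of dx_i/dt = v_i."""
    vel = velocities(pts)
    return [(x + dt * vx, y + dt * vy) for (x, y), (vx, vy) in zip(pts, vel)]


def second_moment(pts):
    """Mean of |x_i|^2 over the particles."""
    return sum(x * x + y * y for x, y in pts) / len(pts)


def center(pts):
    """Center of mass of the particles."""
    n = len(pts)
    return (sum(x for x, _ in pts) / n, sum(y for _, y in pts) / n)


def simulate(n=30, steps=50, dt=1e-3, seed=1):
    """Return (initial points, final points) for a random Gaussian start."""
    rng = random.Random(seed)
    start = [(rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)) for _ in range(n)]
    pts = start
    for _ in range(steps):
        pts = euler_step(pts, dt)
    return start, pts
```

Two features of the particle system serve as sanity checks: the pairwise forces are antisymmetric, so the center of mass is preserved, and a short symmetrization argument gives \(\frac{d}{dt}\frac{1}{N}\sum _i|x_i|^2=\frac{N-1}{N}\) for the continuous-time flow, so the particles spread at a fixed rate.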
References
Ambrosio, L.: Lecture notes on optimal transport problems. In: Mathematical aspects of evolving interfaces (Funchal, 2000), Lecture Notes in Math., vol. 1812, pp. 1–52. Springer, Berlin (2003). https://doi.org/10.1007/978-3-540-39189-0_1
Ambrosio, L., Gigli, N., Savaré, G.: Gradient flows in metric spaces and in the space of probability measures, 2nd ed., Lectures in Mathematics ETH Zürich, Birkhäuser Verlag, Basel (2008)
Armstrong, S.N., Serfaty, S., Zeitouni, O.: Remarks on a constrained optimization problem for the Ginibre ensemble. Potential Anal. 41(3), 945–958 (2014). https://doi.org/10.1007/s11118-014-9402-0
Balagué, D., Carrillo, J.A., Laurent, T., Raoul, G.: Nonlocal interactions by repulsive-attractive potentials: radial ins/stability. Phys. D 260, 5–25 (2013). https://doi.org/10.1016/j.physd.2012.10.002
Balagué, D., Carrillo, J.A., Laurent, T., Raoul, G.: Dimensionality of local minimizers of the interaction energy. Arch. Ration. Mech. Anal. 209(3), 1055–1088 (2013). https://doi.org/10.1007/s00205-013-0644-6
Caffarelli, L.A., McCann, R.J.: Free boundaries in optimal transport and Monge–Ampère obstacle problems. Ann. Math. 171(2), 673–730 (2010). https://doi.org/10.4007/annals.2010.171.673
Caffarelli, L.A.: The obstacle problem revisited. J. Fourier Anal. Appl. 4(4–5), 383–402 (1998). https://doi.org/10.1007/BF02498216
Carrillo, J.A., McCann, R.J., Villani, C.: Contractions in the 2-Wasserstein length space and thermalization of granular media. Arch. Ration. Mech. Anal. 179(2), 217–263 (2006). https://doi.org/10.1007/s00205-005-0386-1
Carrillo, J.A., Delgadino, M.G., Mellet, A.: Regularity of local minimizers of the interaction energy via obstacle problems. Comm. Math. Phys. 343(3), 747–781 (2016). https://doi.org/10.1007/s00220-016-2598-7
Carrillo, J.A., DiFrancesco, M., Figalli, A., Laurent, T., Slepcev, D.: Global-in-time weak measure solutions and finite-time aggregation for nonlocal interaction equations. Duke Math. J. 156(2), 229–271 (2011). https://doi.org/10.1215/00127094-2010-211
Canizo, J.A., Carrillo, J.A., Patacchini, F.S.: Existence of compactly supported global minimisers for the interaction energy. Arch. Ration. Mech. Anal. 217(3), 1197–1217 (2015). https://doi.org/10.1007/s00205-015-0852-3
Carrillo, J.A., McCann, R.J., Villani, C.: Contractions in the 2-Wasserstein length space and thermalization of granular media. Arch. Ration. Mech. Anal. 179(2), 217–263 (2006). https://doi.org/10.1007/s00205-005-0386-1
Carrillo, J.-A., Santambrogio, F.: L\(^\infty \) estimates for the JKO scheme in parabolic-elliptic Keller-Segel systems. Quart. Appl. Math. 76(3), 515–530 (2018). https://doi.org/10.1090/qam/1493
Carrillo, J.A., Delgadino, M.G., Mellet, A.: Regularity of local minimizers of the interaction energy via obstacle problems. Comm. Math. Phys. 343(3), 747–781 (2016). https://doi.org/10.1007/s00220-016-2598-7
Carleson, L.: Selected problems on exceptional sets, Van Nostrand Mathematical Studies, No. 13, D. Van Nostrand Co., Inc., Princeton, N.J.-Toronto, Ont.-London
De Philippis, G., Figalli, A.: The Monge–Ampère equation and its link to optimal transportation. Bull. Amer. Math. Soc. (N.S.) 51(4), 527–580 (2014). https://doi.org/10.1090/S0273-0979-2014-01459-4
de Guzmán, M.: Differentiation of integrals in \(\mathbb R^n\). Lecture Notes in Mathematics, vol. 481. Springer, Berlin and New York (1975)
Evans, L.C., Gariepy, R.F.: Measure theory and fine properties of functions, revised edn. Textbooks in Mathematics. CRC Press, Boca Raton, FL (2015)
Figalli, A.: The optimal partial transport problem. Arch. Ration. Mech. Anal. 195(2), 533–560 (2010). https://doi.org/10.1007/s00205-008-0212-7
Jordan, R., Kinderlehrer, D., Otto, F.: The variational formulation of the Fokker-Planck equation. SIAM J. Math. Anal. 29(1), 1–17 (1998). https://doi.org/10.1137/S0036141096303359
Judovič, V.I.: Non-stationary flows of an ideal incompressible fluid. Ž. Vyčisl. Mat. i Mat. Fiz. 3, 1032–1066 (1963)
Karakhanyan, A.L.: Remarks on the thin obstacle problem and constrained Ginibre ensembles. Comm. Part. Diff. Equat. 43(4), 616–627 (2018). https://doi.org/10.1080/03605302.2018.1446446
Kimura, M., van Meurs, P.: Regularity of the minimiser of one-dimensional interaction energies, ESAIM: COCV, Forthcoming article
Landkof, N.S.: Foundations of modern potential theory. Translated from the Russian by A. P. Doohovskoy. Die Grundlehren der mathematischen Wissenschaften, Band 180. Springer-Verlag, New York-Heidelberg (1972)
Ledoux, M., Popescu, I.: Mass transportation proofs of free functional inequalities, and free Poincaré inequalities. J. Funct. Anal. 257(4), 1175–1221 (2009). https://doi.org/10.1016/j.jfa.2009.03.011
McCann, R.J.: A convexity principle for interacting gases. Adv. Math. 128(1), 153–179 (1997). https://doi.org/10.1006/aima.1997.1634
Mehta, M.L.: Random matrices, 2nd edn. Academic Press Inc, Boston, MA (1991)
Rachev, S.T.: The Monge-Kantorovich problem on mass transfer and its applications in stochastics. Teor. Veroyatnost. i Primenen. 29(4), 625–653 (1984)
Saff, E.B., Totik, V.: Logarithmic potentials with external fields, Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences], vol. 316, Springer, Berlin, Appendix B by Thomas Bloom. (1997)
Santambrogio, F.: Optimal transport for applied mathematicians. Progress in Nonlinear differential equations and their applications. Calculus of variations, PDEs, and modeling, vol. 87. Springer, Cham (2015)
Savin, O.: A free boundary problem with optimal transportation. Comm. Pure Appl. Math. 57(1), 126–140 (2004). https://doi.org/10.1002/cpa.3041
Serfaty, S.: Coulomb gases and Ginzburg-Landau vortices. Zurich Lectures in Advanced Mathematics. European Mathematical Society (EMS), Zürich (2015)
Trudinger, N.S., Wang, X.J.: The Monge–Ampère equation and its geometric applications. In: Handbook of geometric analysis, No. 1, Adv. Lect. Math. (ALM), vol. 7, pp. 467–524. Int. Press, Somerville, MA (2008)
Villani, C.: Optimal transport, Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences], vol. 338. Springer, Berlin (2009)
Wigner, E.P.: On the distribution of the roots of certain symmetric matrices. Ann. Math. (2) 67, 325–327 (1958). https://doi.org/10.2307/1970008
Acknowledgements
The author wishes to express his thanks to Prof. Robert McCann for several helpful comments concerning the related literature.
Additional information
Communicated by Andrea Malchiodi.
The author was partially supported by EPSRC Grant No. EP/S03157X/1 Mean curvature measure of free boundary.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.