Abstract
We provide upper and lower bounds on the lowest free energy of a classical system at given one-particle density \(\rho (x)\). We study both the canonical and grand-canonical cases, assuming the particles interact with a pair potential which decays fast enough at infinity.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Density functional theory (DFT) is a powerful tool used in quantum physics and chemistry to model quantum electrons in atoms, molecules and solids [8, 27, 30, 65, 67, 73]. However, DFT is based on a rather general mathematical scheme and it can be applied to many other situations. This work is devoted to the rigorous study of classical DFT, which is used for finite or infinite systems of interacting classical particles.
Classical DFT is widely employed in materials science, biophysics, chemical engineering and civil engineering [109]. It has a much lower computational cost than the more precise molecular dynamics simulations, which are limited to small systems and short times [38, 56, 89]. Classical DFT is typically used at interfaces between liquid–gas, liquid–liquid (in fluid mixtures), crystal-liquid and crystal-gas phases at bulk coexistence. The density is then non constant in space and varies in the interfacial region between the two phases.
The physical theory of inhomogenous fluids goes essentially back to the 60s [19, 20, 58, 72, 100]. Functional methods and their applications to the theory of the structure of bulk fluids were described in [75, 99]. The realization that methods developed in the quantum case by Hohenberg–Kohn–Sham [46, 55] could be transferred to classical fluids arose in the middle of the 70s, in particular in the works of Ebner–Saam [28, 29, 95] and Yang et al. [110]. Several authors then developed approximate free-energy functionals to calculate the density profile and surface tension of the liquid–gas interface. The square-gradient approximation could later be derived rather systematically, following the important works of Hohenberg–Kohn–Sham on the gradient expansion of the uniform (quantum) electron gas. Deriving efficient functionals for the solid–liquid transition was harder and took longer [45, 85, 86]. Well-known references on classical DFT are the two reviews by Evans [31, 32]. Other important physical references on the subject include [1, 2, 33, 43, 44, 69, 84, 97].
Rigorous works on classical DFT are rather scarse. Most of the mathematical works are about proving that one can find an external potential V whose interacting equilibrium Gibbs measure has any desired given density \(\rho \). This is called the inverse or dual problem and justifies the use of density functional methods. In quantum DFT, V is called the Kohn–Sham potential and its existence is unclear in most situations. However, in the classical case, V is usually well defined.
The grand-canonical 1D hard-core gas was solved exactly in a celebrated work by Percus [76], who provided an exact expression of the external potential V. This was used and extended in later works [77,78,79, 106]. In two famous works [10, 11], Chayes, Chayes and Lieb proved in a quite general setting (in particular any space dimension d) the existence and uniqueness of the dual potential V at any positive temperature \(T>0\). At \(T=0\), the canonical model can be reformulated as a multi-marginal optimal transport problem [17, 18, 23, 74, 96], where V is usually called the Kantorovich potential. Its existence and properties are known in many cases [5, 22, 49] but uniqueness usually does not hold. The grand-canonical case was studied in the recent article [24]. Most of these works are based on compactness arguments and do not furnish any quantitative information on the shape of the potential V in terms of the given density \(\rho \). In the recent paper [47], a novel Banach inversion theorem was used to provide an explicit formula for V in terms of \(\rho \) in the form of a convergent series, under the assumption that \(\rho \) is small in \(L^\infty (\mathbb {R}^d)\). This is the equivalent of the Virial expansion for uniform systems.Footnote 1
In this work and the companion paper [48] we do not discuss the dual potential V and instead focus on more quantitative properties of the model depending on the shape of the density \(\rho \). The case of the three dimensional Coulomb interaction \(w(x)=|x|^{-1}\) or, more generally long range Riesz interactions \(|x|^{-s}\) with \(s<d\) has been the object of several recent works [61, 65]. Here we always assume that the interaction potential w decays fast enough at infinity and do not discuss more complicated long range potentials such as Coulomb.
Our main goal in this paper is to show universal local bounds on the free energy \(F_T[\rho ]\) at given density \(\rho \in L^1(\mathbb {R}^d)\). By local we mean that we only use terms in the form
The admissible values of p and q will depend on the temperature T as well as on the singularity of the interaction potential w at the origin, that is, how strong the particles repel each other when they get close. Such universal bounds are important in DFT. They can help to find the natural form of approximate functionals to be used for practical computations.Footnote 2 In addition, these bounds will be useful in our next work [48] where we study the local density approximation.
Deriving simple lower bounds is usually easy, under reasonable stability assumptions on the interaction potential w. Obtaining upper bounds can be much more difficult. They require constructing a good trial state, but the constraint that the density is given and must be exactly reproduced can generate important mathematical complications.
The simplest trial state is obtained by taking i.i.d. particles, that is, a factorized N-particle probability \((\rho /N)^{\otimes N}\) where \(N=\int _{\mathbb {R}^d}\rho \in \mathbb {N}\). Doing so provides an upper bound on the free energy in terms of mean-field theory, often called in this context the Kirkwood-Monroe functional [54]:
This only makes sense when the pair interaction potential w is locally integrable. If w is globally integrable, one can use Young’s inequality and estimate the first integral by the local energy \((\int _{\mathbb {R}^d}w_+/2)\int _{\mathbb {R}^d}\rho (x)^2\,\textrm{d}x\), where \(w_+{:}{=}\max (w,0)\) denotes the positive part. The simplest models of classical DFT use (1) as a basis.
In classical statistical mechanics, it is often convenient to consider potentials w diverging fast enough at the origin, which helps to stabilize the system [25, 26, 39, 83, 87, 94]. This divergence implies that the particles can never get too close to each other, and this requires that the trial state contains rather strong correlations. A factorized state is not appropriate and (1) is infinite. The simplest singular interaction is of course the hard-core potential \(w(x)=(+\infty ){\mathbb {1} }(|x|< r_0)\), which is simply infinite over a ball and vanishes outside.
In this paper we provide two different constructions of a correlated trial state, which give reasonable upper bounds on the classical free energy at given density, for singular interaction potentials at the origin. Our first method uses some ideas from harmonic analysis in the form of a Besicovitch-type covering lemma [21]. We cover space with cubes whose size is adapted to the local value of the density, and put essentially one particle per cube, with the constraint that the cubes are far enough from each other. This method works very well in the grand-canonical setting where the number of particles is allowed to fluctuate. In order to handle the canonical ensemble, a different construction is needed. We instead use techniques from optimal transport theory developed in [14], which give a rather good bound at zero temperature, \(T=0\). For \(T>0\) we couple this to the Besicovitch-type covering lemma and obtain an upper bound which is not as good as the grand-canonical one.
In [48] we will study the behavior of \(F_T[\rho ]\) in some particular regimes and the upper universal bounds derived here will be useful. Namely we will consider the thermodynamic limit where \(\rho \) is essentially constant over a large domain as well as the local density approximation when \(\rho \) varies slowly over big regions. Such regimes have been recently considered for the three dimensional Coulomb potential \(w(x)=|x|^{-1}\) in [63, 66], for more general Riesz potentials in [15, 16] and for a special class of positive-type interactions in [71]. The methods used in these works all rely on the assumption that the potential is positive-definite, and new ideas are necessary in the general (short-range) case.
2 Main Results
2.1 Free Energies at Given Density
This subsection is mainly devoted to precisely introducing models and notation used in the paper. Our main results are stated in the next subsections.
2.1.1 The Interaction Potential w
For convenience, we work in \(\mathbb {R}^d\) with a general dimension \(d\geqslant 1\). The physical cases of interest are of course \(d\in \{1,2,3\}\) but the proofs are the same for all d, except sometimes for \(d=1\). We consider systems of indistinguishable classical particles interacting through a short-range pair potential w. Throughout the paper, we work with an interaction satisfying the following properties.
Assumption 1
(on the short-range potential w) Let \( w:\mathbb {R}^d\rightarrow \mathbb {R}\cup \{+\infty \} \) be an even lower semi-continuous function satisfying the following properties, for some constant \(\kappa >0\):
-
(1)
w is stable, that is,
$$\begin{aligned} \sum \limits _{1 \leqslant j < k \leqslant N} w ( x_j - x_k ) \geqslant - \kappa N, \end{aligned}$$(2)for all \( N \in \mathbb {N}\) and \( x_1, \dotsc , x_N \in \mathbb {R}^d \);
-
(2)
w is upper regular, that is, there exist \( r_0 \geqslant 0 \), \( 0 \leqslant \alpha \leqslant \infty \) and \(s>d\) such that
$$\begin{aligned} w(x) \leqslant \kappa \left( {\mathbb {1} }(|x|<r_0)\left( \frac{r_0}{|x|}\right) ^\alpha + \frac{1}{1 + |x|^s}\right) . \end{aligned}$$(3)
The lower semi-continuity of w will be used later to ensure that the energy is lower semi-continuous as a function of the one-particle density (see Remark 3 below). In statistical mechanics, the stability condition (2) is used to ensure the existence of the thermodynamic limit [94]. On the other hand, upper bounds of the form (3) are sometimes used to get more information on the equilibrium states [93]. At infinity, we assume that our potential w is bounded above by \(|x|^{-s}\), which is integrable since \(s>d\). It could of course decay faster. On the other hand, the parameter \( \alpha \) determines the allowed repulsive strength of the interaction at the origin. If \( \alpha = 0 \), then w is everywhere bounded from above, and if \( 0< \alpha < d \), then w has at most an integrable singularity at the origin. In particular, the positive part \( w_+\) is integrable over the whole of \(\mathbb {R}^d\) (since we are interested in upper bounds, the negative part \(w_-\) will not play a role in this paper). In the case where \( \alpha \geqslant d \), w can have a non-integrable singularity at the origin. If \( \alpha = \infty \), then w can have a hard-core. Our convention is that \((r_0/|x|)^\alpha =(+\infty ){\mathbb {1} }(|x|<r_0)\) for \(\alpha =+\infty \). When \(\alpha <\infty \) we can always assume that \(r_0=1\), possibly after increasing \(\kappa \).
Most short range potentials of physical interest are covered by Assumption 1, including for instance the simple hard-core and the Lennard–Jones potential \(w(x)=a|x|^{-12}-b|x|^{-6}\).
2.1.2 The Canonical Free Energy
In this subsection we define the canonical free energy \(F_T[\rho ]\) at given density \(\rho \).
Suppose that we have N particles in \( \mathbb {R}^d \), distributed according to some Borel probability measure \( \mathbb {P}\) on \( \mathbb {R}^{dN} \). Since the particles are indistinguishable, we demand that the measure \( \mathbb {P}\) is symmetric, that is,
for any permutation \( \sigma \) of , and any Borel sets \(A_1,...,A_N\subset \mathbb {R}^d\). The one-body density of such a symmetric probability \( \mathbb {P}\) equals N times the first marginal of \(\mathbb {P}\), that is,
where the integration is over \( x_2,\dotsc ,x_N \). Equivalently, \(\rho _\mathbb {P}(A)=N\mathbb {P}(A\times (\mathbb {R}^d)^{N-1})\) for every Borel set A. Note the normalization convention \( \rho _{\mathbb {P}}(\mathbb {R}^d) = N \). For a non-symmetric probability \(\mathbb {P}\) we define \(\rho _\mathbb {P}\) as the sum of the N marginals.
Notice that any positive measure \(\rho \) on \(\mathbb {R}^d\) with \(\rho (\mathbb {R}^d)=N\in \mathbb {N}\) arises from at least one N-particle probability measure \(\mathbb {P}\). One can take for instance \(\mathbb {P}=(\rho /N)^{\otimes N}\) for independent and identically distributed particles.
The pairwise average interaction energy of the particles is given by
It could in principle be equal to \(+\infty \), but it always satisfies \(\mathcal {U}_{N} ( \mathbb {P} )\geqslant -\kappa N\) due to the stability condition on w in Assumption 1. When considering systems at positive temperature \(T>0\), it is necessary to also take the entropy of the system into account. It is given by
If \( \mathbb {P}\) is not absolutely continuous with respect to the Lebesgue measure on \( \mathbb {R}^{dN} \), we use the convention that \(\mathcal {S}_N ( \mathbb {P} )=-\infty \). The factor N! appears because the particles are indistinguishable. In fact, we should think that \(N!\,\mathbb {P}\) defines a probability measure over \((\mathbb {R}^d)^N/\mathfrak {S}_N\) where \(\mathfrak {S}_N\) is the permutation group. We need to make sure that \(\mathcal {S}_N(\mathbb {P})<+\infty \), which follows if we assume for instance that \(\rho _\mathbb {P}\) is absolutely continuous with \(\int _{\mathbb {R}^d}\rho _\mathbb {P}|\log \rho _\mathbb {P}|<\infty \). This is due to the well-known inequality (see, e.g., [24, Lemma 6.1])
The latter follows immediately from writing the relative entropy of \(\mathbb {P}\) with respect to \((\rho /N)^{\otimes N}\), which is non-negative, and using \((N/e)^N\leqslant N!\).
The total free energy of the system in the state \( \mathbb {P}\) at temperature \( T \geqslant 0 \) equals
It can be equal to \(+\infty \) but never to \(-\infty \) due to the stability of w and thanks to the inequality (5) if \(T>0\) and \(\int _{\mathbb {R}^d}\rho _\mathbb {P}|\log \rho _\mathbb {P}|<\infty \).
Throughout the paper, we will only consider systems with a given one-body density \(\rho \), which is absolutely continuous with respect to the Lebesgue measure. At \(T>0\) we also assume that \(\int _{\mathbb {R}^d}\rho |\log \rho |<\infty \). This allows us to consider the minimal energy of N-particle classical systems with density \( \rho \), given by
where the infimum is taken over N-particle states \( \mathbb {P}\) on \( \mathbb {R}^{dN} \) with one-particle density \( \rho _{\mathbb {P}} \) equal to \( \rho \). At \(T=0\), the entropy term disappears and we obtain
This is a multi-marginal optimal transport problem with symmetric cost \(\sum _{j<k}w(x_j-x_k)\) and with all the marginals of \(\mathbb {P}\) equal to \(\rho /N\) [17, 18, 23, 74, 96]. From the stability assumption on w and (5), we have
One of our goals will be to find simple conditions ensuring that \(F_T[\rho ]<\infty \). Before we turn to this question, we first introduce the grand-canonical problem.
Remark 2
(Symmetry) In the definition (7) we can freely remove the constraint that \(\mathbb {P}\) is symmetric. Since the interaction is a symmetric function and the entropy \(\mathcal {S}_N\) is concave, the minimum is the same as for symmetric \(\mathbb {P}\)’s. Recall that for a non-symmetric \(\mathbb {P}\), \(\rho _\mathbb {P}\) is by definition the sum of the N marginals.
Remark 3
(Lower Semi-continuity) The function \(\rho \mapsto F_T[\rho ]\) is lower semi-continuous for the strong topology. That is, we have
At \(T>0\) this is valid under the sole condition that w is measurable (since the limiting probability \(\mathbb {P}\) is necessary absolutely continuous) but at \(T=0\), this uses the lower semi-continuity of w. The details of the argument are provided later in the proof of Theorem 29, for the convenience of the reader.
Remark 4
(Convexity and duality) Using the concavity of the entropy \(\mathcal {S}_N\), one can verify that \(\rho \mapsto F_T[\rho ]\) is convex. This can be used to derive the dual formulation of \(F_T[\rho ]\) in terms of external potentials
see [11]. Our notation \({\widetilde{V}}\) is because the final physical dual potential is, rather, \(V{:}{=}{\widetilde{V}}-T\log \rho \). The existence of a maximizer \({\widetilde{V}}\) realizing the above supremum is proved in [11]. It is the unique potential (up to an additive constant) so that the corresponding Gibbs state has density \(\rho \), that is,
with Z a normalization constant. At \(T=0\), we have the similar formula
where
is the ground state energy in the potential V [49]. Although there usually exist dual potentials at \(T=0\), those are often not unique.
2.1.3 The Grand-Canonical Free Energy
In the grand-canonical picture, where the exact particle number of the system is not fixed, a state \( \mathbb {P}\) is a family of symmetric n-particle positive measures \( \mathbb {P}_n \) on \((\mathbb {R}^d)^n\), so that
Here \(\mathbb {P}_0\) is just a number, interpreted as the probability that there is no particle at all in the system. After replacing \(\mathbb {P}_n\) by \(\mathbb {P}_n/\mathbb {P}_n(\mathbb {R}^{dn})\), we can equivalently think that \(\mathbb {P}\) is a convex combination of canonical states. The entropy of \( \mathbb {P}\) is defined by
and the single particle density of the state \( \mathbb {P}\) is
The grand-canonical free energy of the state \( \mathbb {P}\) at temperature \( T \geqslant 0 \) is
where \( \mathcal {U}( \mathbb {P} ) \) denotes the interaction energy in the state \( \mathbb {P}\),
From the stability of w we have
so that, after summing over n,
By [24, Lemma 6.1] we have the universal entropy bound
This is because the entropy at fixed density \( \rho \) is maximized by the grand-canonical Poisson state
whose entropy is the right side of (15).
When keeping the one-particle density \( \rho = \rho _{\mathbb {P}} \in L^1 ( \mathbb {R}^d ) \) fixed, we denote the minimal grand-canonical free energy by
Using (15), we obtain
where \( \kappa \) is the stability constant of w in Assumption 1.
Remark 5
(Comparing \(F_T\) and \(G_T\)) Since a canonical trial state is automatically also admissible for the grand-canonical minimisation problem (17), we have the bound
for any density \( 0 \leqslant \rho \in L^1 ( \mathbb {R}^d ) \) with integer mass. Hence, any universal lower energy bound for the grand-canonical ensemble is also a lower bound for the canonical ensemble. A natural question to ask is under which condition we have \(F_T[\rho ] =G_T[\rho ]\) for a density \(\rho \) of integer mass. In general this is a difficult problem. See [24] for results and comments in this direction at \(T=0\).
If \(\int _{\mathbb {R}^d}\rho =N+t\) with \(t\in (0,1)\) and \(N\in \mathbb {N}\), we can write \(\rho =(1-t)\frac{N}{N+t}\rho +t\frac{N+1}{N+t}\rho \) and obtain after using the concavity of the entropy
This can be used to deduce an upper bound on \(G_T[\rho ]\), once an upper bound has been established in the canonical case. We will see, however, that it is usually much easier to directly prove upper bounds on \(G_T[\rho ]\) than on \(F_T[\rho ]\).
Remark 6
(Weak lower semi-continuity) The functional \(\rho \mapsto G_T[\rho ]\) is weakly lower semi-continuous and, in fact, a kind of lower continuous envelope of \(F_T[\rho ]\) (see [24, 65]). At \(T=0\) this uses the lower semi-continuity of w.
Remark 7
(Duality II) Like in the canonical case, we have the dual formulation
see [10, 11] and the more recent work [24, Sects. 4, 6].
2.2 Representability
Next we turn to the problem of representability. Namely, we are asking what kind of densities \(\rho \) can arise from N-particle probabilities with finite free energy. This depends on the shape of the interaction potential w. We only address this question for \(\rho \in L^1(\mathbb {R}^d)\) and do not look at general measures. The main result is that all densities are representable at zero temperature in the non-hard-core case (\(\alpha <\infty \)). At positive temperature, it is sufficient to assume in addition that \(\int _{\mathbb {R}^d}\rho |\log \rho |<\infty \).
Theorem 8
(Representability in the canonical case) Let \(\rho \in L^1(\mathbb {R}^d)\) with \(\int _{\mathbb {R}^d}\rho (x) \, \textrm{d}x \in \mathbb {N}\). There exists a symmetric probability measure \(\mathbb {P}\) on \((\mathbb {R}^d)^N\) of density \(\rho \) so that \(|x_j-x_k|\geqslant \delta >0\) \(\mathbb {P}\)—almost everywhere, for some \(\delta >0\).
If w satisfies Assumption 1 without hard-core (\(\alpha <\infty \)), we obtain \(F_0[\rho ]<\infty \). If furthermore \(\int _{\mathbb {R}^d}\rho |\log \rho |<\infty \), then \(\mathbb {P}\) can be assumed to have finite entropy and \(F_T[\rho ]<\infty \) for any \(T>0\).
The theorem follows from results in optimal transport theory and we quickly outline the proof here for the convenience of the reader. In this paper we will prove much more. We will in fact need some of these tools and more details will thus be provided later in the paper.
Proof
If \(\int _{\mathbb {R}^d}\rho =1\), we must take \(\mathbb {P}=\rho \) and end up with \(F_T[\rho ]=T\int \rho \log \rho \). In the rest of the proof we assume that \(\int _{\mathbb {R}^d}\rho \geqslant 2\).
For \(\rho \in L^1(\mathbb {R}^d)\), the existence of \(\mathbb {P}\) is proved in [14, Theorem 4.3]. The number \(\delta \) must be so that \(\int _{B(x,\delta )}\rho <1\) for any \(x\in \mathbb {R}^d\), where B(x, R) denotes the ball centered at x and of radius R. Such a \(\delta >0\) always exists when \(\rho \in L^1(\mathbb {R}^d)\). See Sect. 5.1 below for more details on the results from [14].
Next we prove that \(\mathcal {F}_0( \mathbb {P} )<\infty \). Since \(\alpha <\infty \) (no hard-core), we can assume \(r_0=1\). We then have \(w(x)\leqslant C_\delta |x|^{-s}\) for all \(|x|\geqslant \delta \), with the constant \(C_\delta =\kappa (1+\delta ^{s-\alpha })\), due to Assumption 1. Hence, on the support of \(\mathbb {P}\) we have
The maximum is bounded by \(C\delta ^{-s}\) independently of N due to [61, Lemma 9]. Integrating with respect to \(\mathbb {P}\) we have proved that \(\mathcal {F}_0( \mathbb {P} )\leqslant C_\delta \delta ^{-s}N\). This bound is not very explicit but it only depends on \(\delta \) and N. Of course, \(\delta \) itself depends on \(\rho \) in a rather indirect way.
The probability measure \(\mathbb {P}\) obtained by the optimal transport method of [14] is probably a singular measure, hence with an infinite entropy. In [9], it is explained how to regularize any given \(\mathbb {P}\) using a method called the Block approximation. This method works well for a compactly supported density, for which it easily implies \(F_T[\rho ]<\infty \). We quickly describe the method here and refer to Sect. 5.3 below for details. In short, we split the space into small cubes \(\{{\mathcal {C}}_j\}\) of size proportional to \(\delta \) and introduce the trial probability measure
That is, we take a convex combination of independent particles over small cubes with probability \(\mathbb {P}({\mathcal {C}}_{j_1}\times \cdots \times {\mathcal {C}}_{j_N})\). Choosing the cubes small enough, we can ensure that \(|x_j-x_k|\geqslant \delta /2\) on the support of \({\widetilde{\mathbb {P}}}\) and \(\int _{{\mathcal {C}}_j}\rho <1\). A computation gives \(\rho _{{\widetilde{\mathbb {P}}}}=\rho _{\mathbb {P}}=\rho \). The entropy can be estimated by
(see Lemma 26 below). Estimating the last sum is not an easy task for a general density. For a compactly supported density we can simply bound it by 1/e times the numbers of cubes intersecting the support of \(\rho \). Since the energy of \({\widetilde{\mathbb {P}}}\) is finite by the previous argument, we deduce that \(F_T[\rho ]<\infty \) for any \(\rho \) of compact support.
It thus remains to explain how to prove that \(F_T[\rho ]\) is finite for a density \(\rho \) of unbounded support. The idea is of course to truncate it. We choose two radii \(R_1<R_2\) so that
(using here \(\int \rho \geqslant 2\)) and we define for shortness \(\rho _1{:}{=}\rho {\mathbb {1} }_{B_{R_1}}\), \(\rho _2{:}{=}\rho {\mathbb {1} }_{B_{R_2}{\setminus } B_{R_1}}\) and \(\rho _3{:}{=}\rho {\mathbb {1} }_{\mathbb {R}^d{\setminus } B_{R_2}}\). We can write
where \(\int _{\mathbb {R}^d}(\rho _1+2\rho _2)=\int _{\mathbb {R}^d}(\rho _1+2\rho _3)=N\). From the convexity of \(F_T\) we obtain
The first density \(\rho _1+2\rho _2\) has compact support hence has a finite energy, as explained above. For the second density \(\rho _1+2\rho _3\) we use an uncorrelated trial state in the form \(\mathbb {P}_1\otimes _s (2\rho _3)\) where \(\mathbb {P}_1\) is also constructed as before, but with \(\rho \) replaced by \(\rho _1\) which has mass \(N-1\). Here \(\otimes _s\) means the symmetric tensor product. A calculation shows that
Thus the finiteness for densities of compact support implies the same for all densities. In fact, after optimizing over \(\mathbb {P}_1\) we have proved the bound
This concludes the proof of Theorem 8. \(\square \)
We have not considered here the hard-core potential, to which we will come back later in Sect. 2.4. Representability is much more delicate in this case. From the inequality (19), we immediately obtain the following.
Corollary 9
(Representability in the grand-canonical case) Let \(\rho \in L^1(\mathbb {R}^d)\). Then we have \(G_0[\rho ]<\infty \) if w has no hard-core (\(\alpha <\infty \)). If furthermore \(\int _{\mathbb {R}^d}\rho |\log \rho |<\infty \), then \(G_T[\rho ]<\infty \) for all \(T>0\).
2.3 Local Upper Bounds
Recall that we already have rather simple lower bounds in (9) and (18). The proof of Theorem 8 furnishes an upper bound on \(F_T[\rho ]\) but it depends on the smallest distance \(\delta \) between the particles in the system, which is itself a highly nonlinear and nonlocal function of \(\rho \). For non compactly-supported densities, the proof also involves the two radii \(R_1,R_2\) which depend on \(\rho \) as well.
Our goal here is to provide simple local upper bounds involving only integrals of the given density \(\rho \). We start in the next subsection by recalling the simple integrable case at the origin \(\alpha <d\), for which we can just choose i.i.d. particles. The case \(\alpha \geqslant d\) is much more complicated since particles cannot be allowed to get too close.
2.3.1 Upper Bound in the Weakly Repulsive Case \(\alpha <d\)
In the case where \( w_+ \) is integrable at the origin, it is easy to provide a simple upper bound.
Theorem 10
(Weakly repulsive case \(\alpha<\) d) Let w satisfy Assumption 1 with \( \alpha < d \). Let \( 0 \leqslant \rho \in L^1 ( \mathbb {R}^d ) \cap L^2 ( \mathbb {R}^d ) \) with integer mass \( \int \rho \in \mathbb {N}\). Let also \(T\geqslant 0\) and assume that \(\int _{\mathbb {R}^d}\rho |\log \rho |<\infty \) if \(T>0\). Then we have
![](http://media.springernature.com/lw400/springer-static/image/art%3A10.1007%2Fs10955-023-03086-7/MediaObjects/10955_2023_3086_Equ21_HTML.png)
In the grand-canonical case we have the exact same bound on \(G_T[\rho ]\), this time without any constraint on \(\int _{\mathbb {R}^d} \rho \) and with \(\rho \log \rho \) replaced by \(\rho (\log \rho -1)\) in the last integral.
As we have mentioned in the introduction, the functional appearing on the right side of the first line of (21) is the so-called Kirkwood–Monroe free energy [54], which is the simplest approximation of \(F_T[\rho ]\). It only makes sense for a locally integrable potential w. In addition to being an exact upper bound, the Kirkwood–Monroe free energy also provides the exact behavior of \(F_T[\rho ]\) in some regimes. This was studied in many works, including for instance [4, 36, 37, 41, 57] for the infinite gas at high density and [3, 6, 7, 50, 51, 53, 70, 90, 98] for trapped systems in the mean-field limit.
Proof
We denote \( N = \int _{\mathbb {R}^d} \rho \) and simply take as a trial state the pure tensor product \( \mathbb {P}{:}{=} ( \rho /N )^{\otimes N} \). The interaction energy satisfies
![](http://media.springernature.com/lw402/springer-static/image/art%3A10.1007%2Fs10955-023-03086-7/MediaObjects/10955_2023_3086_Equ22_HTML.png)
From the stability condition on w, we know that for any \(\eta \geqslant 0\) with \(\int \eta =1\),
Letting \(K\rightarrow \infty \), we find
This is how the stability is expressed in mean-field theory [62]. Since the double integral in (22) is non-negative, we can remove the 1/N for an upper bound. The entropy can itself be estimated by
showing that (21) holds. In the grand-canonical case we use instead the Poisson state in (16) and exactly obtain the mean-field energy on the right side of (21) with \(\rho \log \rho \) replaced by \(\rho (\log \rho -1)\) in the last integral. \(\square \)
2.3.2 Upper Bounds in the Strongly Repulsive Case \(\alpha \geqslant d\)
When \(\alpha \geqslant d\) the right side of (21) is infinite due to the non-integrability of w at the origin. We cannot use a simple uncorrelated probability \(\mathbb {P}\) as a trial state and it is necessary to correlate the particles in such a way that they never get too close to each other. The difficulty is to do this at fixed density, with a reasonable energy cost. Also, we expect the typical distance between the particles to depend on the local value of \(\rho \). If we imagine that there are \(\rho (x)\) particles per unit volume in a neighborhood of a point x, then the distance should essentially be proportional to \(\rho (x)^{-1/d}\). We thus expect a bound in terms of \(\rho (x)^{1+\alpha /d}\) for large densities. We can only fully solve this question in the grand-canonical case. In the canonical case we can only treat \(T=0\) in full. The following is our first main result.
Theorem 11
(Strongly repulsive case \(\alpha \geqslant \) d) Suppose that the interaction w satisfies Assumption 1 with \( d \leqslant \alpha < \infty \). Let \(T\geqslant 0\) and assume that for \( T > 0 \), we have \(\int _{\mathbb {R}^d}\rho |\log \rho |<\infty \).
\(\bullet \) In the grand-canonical ensemble, we have for any \( 0 \leqslant \rho \in L^1 ( \mathbb {R}^d ) \),
Here the constant C only depends on the dimension d and the powers \(\alpha ,s\) from Assumption 1.
\(\bullet \) In the canonical ensemble we have the same estimate on \(F_T[\rho ]\) for all \(T\geqslant 0\) in dimension \(d=1\) and on \(F_0[\rho ]\) at \(T=0\) for \(d\geqslant 2\), provided of course that \(\rho \) has an integer mass.
In the proof we provide an explicit value for the constant C in (23) but we do not display it here since it is by no means optimal and depends on the cases. The parameters \(\kappa \) and \(r_0\) can be used to track the origin of the different terms in our bound (23). The integrable part of the potential gives the \(\rho ^2\) term as it did in Theorem 10. The terms involving \(r_0^\alpha \) on the second line are solely due to the divergence of w at the origin. It is important that we get here the expected and optimal \(\rho ^{1+\alpha /d}\) due to the singularity. Finally, we have an additional term involving \(T\rho \) which is an error in the entropy due to our construction. We otherwise get the optimal \(T\rho \log \rho \).
In dimension \(d=1\), the proof of Theorem 11 is relatively easy, both in the canonical and grand-canonical cases. It is detailed for convenience in Sect. 3. The idea is to split the density \(\rho \) into successive intervals of mass 1/2 and then write \(\rho =(2\rho _\text {odd}+2\rho _\text {even})/2\) where \(\rho _\text {odd}\) is the density restricted to the odd intervals and \(\rho _\text {even}\) to the even ones. We then take a trial state of the form \((\mathbb {P}_\text {odd}+\mathbb {P}_\text {even})/2\), where \(\mathbb {P}_\text {odd}\) corresponds to placing exactly one particle per odd interval at density \(2\rho \) and \(\mathbb {P}_\text {even}\) is defined similarly. This way we have inserted some distance between the particles. It depends on the form of \(\rho \) in the opposite set of intervals. The interaction between the particles can then be easily controlled in terms of \(\rho ^{1+\alpha /d}\), as we explain in Sect. 3.
In higher dimensions, there seems to be no general way of splitting \(\mathbb {R}^d\) into disjoints sets containing a fixed mass of \(\rho \), so that each set has finitely many neighbors at a given distance (except perhaps for very special densities [40]). We can however carry over a similar argument as in the 1D case if we allow a covering with intersections. The Besicovitch covering lemma [21] allows us to work with cubes \(Q_j\) intersecting with finitely many other cubes, such that \(\int _{Q_j}\rho \) is any given number. We can also distribute the \(Q_j\) into a finite (universal) number of subcollections so that the cubes in each family are disjoint and not too close to each other. For each collection of disjoint cubes we then use a simple tensor product similar to the 1D case. The interaction is estimated using that the length of the cubes is related to \(\int _{Q_j}\rho ^{1+\alpha /d}\), leading to a bound involving only \(\int _{\mathbb {R}^d}\rho ^{1+\alpha /d}\). This proof was inspired by the presentation in the recent book [35] of a proof of the Lieb–Thirring and Cwikel–Lieb–Rozenblum inequalities from [91, 92, 108], thus in a completely different context. The difficulty here is that we have no information on the number of particles in each subcollection, due to the overlaps. This is the reason why the proof works well in the grand-canonical setting, but not in the canonical case. The details are given in Sect. 4.
To prove the result in the canonical case at \(T=0\) for \(d\geqslant 2\), we use a completely different method based on optimal transport tools from [14]. As we will explain in Sect. 5, the latter work can be used to construct a trial state \(\mathbb {P}\) with \(\rho _\mathbb {P}=\rho \) so that the distance between any two given particles on the support can be related to some average local value of the density around the particles. This is how we can obtain the bound (23) at \(T=0\) in the canonical case.
The next natural step is to smear this trial measure \(\mathbb {P}\) and use it at \(T>0\) but we could unfortunately not give an optimal bound on the entropy of the smearing. Our bound relies on the local radius \( R ( x ) \) of a density \( \rho \), which is thoroughly studied in Sect. 5.1 and is defined as follows. Let \( 0 \leqslant \rho \in L^1 ( \mathbb {R}^d ) \) with \( \int _{\mathbb {R}^d} \rho ( y ) \, \textrm{d}y > 1 \). For each \( x \in \mathbb {R}^d \), we define the local radius R(x) to be the largest number satisfying
This number is always bounded below for a given \(\rho \in L^1(\mathbb {R}^d)\) but behaves like |x| at infinity. If \(\rho \) has compact support, then R(x) is bounded on the support of \(\rho \).
Theorem 12
(Strongly repulsive case \(\alpha \geqslant \) d II) Suppose that the interaction w satisfies Assumption 1 with \( 2\leqslant d \leqslant \alpha < \infty \). Let \(T>0\) and \( 0 \leqslant \rho \in L^1 ( \mathbb {R}^d )\) of integer mass with \(\int _{\mathbb {R}^d}\rho |\log \rho |<\infty \). Then we have
where the constant C only depends on the dimension d and the powers \(\alpha ,s\) from Assumption 1.
The main difference compared to (23) is the additional term \(T\int \rho \log R^d\), which we conjecture should not be present. It is only affecting the bound in places where R is large on the support of \(\rho \), that is, where one cannot find a sufficient amount of mass at a finite distance of x. Another small difference is the additional term \(CT\int \rho ^2\) due to our way of estimating the entropy. The proof is detailed in Sect. 5.4 below.
The upper bounds in Theorems 11 and 12 will be very useful for our next work [48] where we study \(F_T[\rho ]\) and \(G_T[\rho ]\) for extended systems. The sub-optimal upper bound (25) in the canonical case will be sufficient in this context.
Remark 13
(Lower bounds) Even when w really behaves like \(|x|^{-\alpha }\) at the origin (for instance satisfies \(w(x)\geqslant c|x|^{-\alpha }\) for some \(c>0\)), a lower bound in the form (23) cannot hold in general. This is because the density can be large in regions where there is only one particle at a time, which does not create any divergence in the interaction. As an example, consider N points \(X_1,...,X_N\in \mathbb {R}^d\) and place around each point one particle in the state \(\chi _r{:}{=}|B_r|^{-1}{\mathbb {1} }_{B_r}\), with r small enough. The corresponding state is the (symmetrization of the) tensor product \(\mathbb {P}_r=\bigotimes _{j=1}^N\chi _r(\cdot -X_j)\). Assuming that w is continuous, its interaction energy behaves as
hence stays finite, whereas the entropy equals
On the other hand, the right side of (23) diverges much faster, like \(Nr^{-\alpha }\). This proves that a lower bound of the form (23) cannot hold for all possible densities.
Nevertheless, it is expected that the term \(\int \rho ^{1+\alpha /d}\) should appear when there are many particles in a small domain and is thus optimal in such situations. For instance, assuming \(w\geqslant c|x|^{-\alpha }\) for \(|x|\leqslant r_0\) and taking \(\rho =N|B_{r_0/2}|^{-1}{\mathbb {1} }_{B_{r_0/2}}\) (N particles at uniform density in the small ball), we see that
The first minimum is known to behave like \(N^{1+\alpha /d}r_0^{-\alpha }\) in the limit \(N\rightarrow \infty \) [61, Lemma 1], which is exactly proportional to \(\int \rho ^{1+\alpha /d}\). Thus in this case, the lower bound holds and the power \(1+\alpha /d\) is optimal.
2.4 The Hard-Core Case
We conclude this section with a discussion of the hard-core case, which is notoriously more difficult [11, Sect. 9]. We start with the question of representability of a given density and then turn to some upper bounds on the free energy.
2.4.1 Representability
Let \(r_0>0\) be a positive number and consider the hard-core potential \(w_{r_0}(x)=(+\infty ){\mathbb {1} }(|x|<r_0)\). Then we have for any N-particle probability measure \(\mathbb {P}\)
The set of \(\mathbb {P}\)’s such that \(\mathcal {U}_{N}(\mathbb {P})=0\) is convex and its extreme points are the symmetric tensor products of Dirac deltas located at distance \(\geqslant r_0\) from each other. It follows that the convex set of \(w_{r_0}\)–representable densities is the convex hull of the densities in the form
There is a similar result in the grand-canonical case. In spite of this simple characterization, it seems very hard, in general, to determine whether a given density belongs to this convex set or not.
In dimension \(d=1\), the problem can be solved exactly. Any extreme point (26) satisfies
since there is always at most one Dirac delta in any interval of length \(r_0\). This property pertains on the whole convex hull of \(w_{r_0}\)—representable densities. Conversely, any positive measure \(\rho \) with \(\rho (\mathbb {R})=N\) satisfying (27) can be written as a convex combination of Dirac deltas at distance \(\geqslant r_0\). To see this, assume for simplicity \(\rho \in L^1(\mathbb {R})\) and define as in [13] the non-decreasing function \(t\mapsto x(t)\) on (0, N) so that
To avoid any ambiguity when the support of \(\rho \) is not connected, we can choose x(t) to be the largest possible real number satisfying the above condition. The function \(t\mapsto x(t)\) is differentiable, except possibly on a countable set, with \(x'(t)=\rho (x(t))^{-1}\). When \(\rho >0\) almost surely, we have \(\lim _{t\rightarrow 0^+}x(t)=-\infty \) and \(\lim _{t\rightarrow N^-}x(t)=+\infty \). From the definition of x(t) we have
Indeed, if we integrate the right side against some continuous function f we find \(\int _0^N f(x(t))\,\textrm{d}t=\int _\mathbb {R}f(s)\rho (s)\,\textrm{d}s\) after changing variable \(s=x(t)\). Now we can also rewrite (28) as
By definition of x(t) we have
and therefore \(|x(t+k+1)-x(t+k)|\geqslant r_0\) when the condition (27) is satisfied. Hence (29) is the sought-after convex combination of delta’s located at distance \(\geqslant r_0\). The corresponding N-particle probability is
where
is the symmetrization operator. At positive temperature, the previous state can be regularized using the block approximation described in the proof of Theorem 8, provided that \(\int _\mathbb {R}\rho |\log \rho |<\infty \) and (27) holds with a strict inequality.
In dimensions \(d\geqslant 2\), the situation is much less clear. The condition (27) can be re-expressed in the form
where R(x) is the radius previously defined in (24). This can also be written in the form
This is definitely a necessary condition for a density to be \(w_{r_0}\)—representable, in dimension \(d\geqslant 1\). Otherwise we would be able to find an \(x\in \mathbb {R}^d\) and an \(R<r_0/2\) such that \(\int _{B(x,R)}\rho >1\). But then the probability that there are at least two particles in the ball B(x, R) cannot vanish for any \(\mathbb {P}\) of density \(\rho \) and those are at distance \(<r_0\). This was already mentioned in [11, Sect. 9].
For \(d\geqslant 2\) the condition (32) is definitely not sufficient for a density to be representable. A counter example arises naturally within the sphere packing problem. Recall that the d-dimensional sphere packing density
gives the maximal number of points per unit volume one can put while ensuring that they are at distance \(\geqslant 1\) to each other. Here \(\Omega \) is any fixed smooth domain and \(\Omega _\ell =\ell \Omega \). The packing density equals \(\rho _c(1)=1\) in dimension \(d=1\) and is otherwise only known in dimensions \( d\in \{ 2,3,8,24\} \), for which it is given by some special lattices [12, 107]. The sphere packing fraction is defined by
and represents the fraction of the volume occupied by the balls. This is simply \(v_c(1)=1\) in dimension \(d=1\) but is strictly less than 1 for \(d\geqslant 2\). Some volume has to be left unoccupied due to the impossibility to fill space with disjoint balls of fixed radius. It has been shown that \(v_c(d)\) tends to 0 exponentially fast in the limit \(d\rightarrow \infty \) but its exact behavior is still unknown [105]. Let us now consider a constant density \(\rho (x)=\rho _0{\mathbb {1} }_{\Omega _\ell }(x)\) over a large domain \(\Omega _\ell =\ell \Omega \) (for instance a ball). Then we have \(R(x)=(\rho _0|B_1|)^{-1/d}\) well inside \(\Omega _\ell \), whereas \(R(x)\geqslant (\rho _0|B_1|)^{-1/d}\) close to the boundary. This shows that for this density
In particular, the condition (32) is satisfied whenever \(\rho _0\leqslant r_0^{-d}\rho _c(d)/v_c(d)\). On the other hand, it is clear from the packing problem (rescaled by \(r_0\)) that when \(\rho _0>r_0^{-d}\rho _c(d)\) the density cannot be representable for \(\ell \) large enough. Otherwise we would be able to place \(N=\rho _0|\Omega _\ell |>r_0^{-d}\rho _c(d)|\Omega _\ell |\) points in \(\Omega _\ell \) at distance \(r_0\), which contradicts the definition of \(\rho _c(d)\). In conclusion, we have found that, in dimensions \(d\geqslant 2\), constant densities \(\rho _0{\mathbb {1} }_{\Omega _\ell }\) with
satisfy (32) but cannot be \(w_{r_0}\)—representable for \(\ell \gg 1\).
As a side remark, we mention that there are representable densities satisfying (32), with \(R_\rho \) as close as we want to \(r_0/2\). We can just take the sum of two Dirac deltas placed at distance \(R\geqslant r_0\) or a smooth approximation of it. This proves that there cannot exist a simple necessary and sufficient condition of hard core representability involving \(R_\rho \) only, in dimensions \(d\geqslant 2\). This is in stark contrast with the one-dimensional case.
There exists, however, a simple sufficient condition in a form that was conjectured in [11, p. 116]. In [14, Theorem 4.1] (see also Theorem 21 below), it is proved that any density satisfying
is \(w_{r_0}\)–representable. The same holds when \(T>0\) if one puts a strict inequality. It would be interesting to know if such a result is valid for \(R_\rho \geqslant c_d r_0\) with \(c_d<1\), depending on the dimension.
The conclusion of our discussion is that there seems to exist no simple characterization of hard core representability in dimensions \(d\geqslant 2\), involving averages of \(\rho \) over balls. There are necessary or sufficient conditions but they do not match.
2.4.2 Upper Bounds
Next we discuss upper bounds in the hard core case. Even if we do not completely understand when a density is hard-core representable, the energy is very easy to bound when it is the case. Let us assume that w satisfies Assumption 1 with \(\alpha =+\infty \) and that \(\rho \in L^1(\mathbb {R}^d)\) is w–representable. For simplicity we also assume that \(w=+\infty \) on \(B_{r_0}\). Then, for any optimizer \(\mathbb {P}\), we have \(|x_j-x_k|\geqslant r_0\) for \(j\ne k\), \(\mathbb {P}\)–almost surely. This implies
by [61, Lemma 9]. The constant C only depends on s and d. Upper bounds are easy once we know that the particles cannot get too close.
Constructing trial states with a good entropy is more difficult. Our proofs of Theorems 11 and 12 work in the hard-core case, but they require additional conditions, of the form
for a sufficiently small \(\varepsilon \). We do not state the corresponding results here and rather refer the reader to Remarks 16, 20, and 28 below. In the rest of this section we quickly discuss the grand-canonical 1D case which has been studied in a famous paper of Percus [76] and the situation where \(\rho \) is bounded uniformly.
2.4.3 The 1D Grand-Canonical Percus Formula
The grand-canonical inverse problem was completely solved by Percus in dimension \(d=1\) in [76] (see also [88]). Under the optimal assumption that \(R_\rho >r_0/2\), he proved that the grand-canonical Gibbs state with external potential
and hard-core \(w_{r_0}\) has the density \(\rho \). Since the potential \({\widetilde{V}}=V+T\log \rho \) solves the supremum in the dual formula (20), we obtain
for the hard core potential \(w_{r_0}\). This explicit expression shows us that, in one dimension, the nonlocality is solely due to the second logarithmic term, which involves the local average \(\int _{x-r_0}^x\rho \) over a window of length \(r_0\). This is further discussed in [88].
For a general potential w satisfying Assumption 1, we only obtain an upper bound and need to add \(C\kappa r_0^{-s}\int _\mathbb {R}\rho \) by (34). We can estimate the logarithm by assuming, for instance, that \(\int _{x-r_0}^x\rho \leqslant 1-\varepsilon \) for all \(x\in \mathbb {R}\).
To our knowledge the canonical problem was never solved in the manner of Percus. It would be interesting to derive an upper bound on \(F_T[\rho ]\) of the same form as the right side of (35).
2.4.4 Bound for Densities Uniformly Bounded by the Packing Density
In dimensions \(d\geqslant 2\) we have no simple criterion of representability, as we have seen. One simpler situation is when \(\rho \) is everywhere bounded above by the sphere packing density, which we have defined in (33). Then we can prove it is representable and furnish an explicit upper bound on its grand-canonical free energy.
Theorem 14
(Hard-core case with packing density bound) Assume that w satisfies Assumption 1 with \(\alpha =+\infty \). Let \(\rho _c(d)\) be the sphere packing density in (33) and \(v_c(d)=2^{-d}\rho _c(d)|B_1|\) be the volume fraction. Let \(\rho \in L^1 ( \mathbb {R}^d,\mathbb {R}_+ )\) be such that
for some \(\varepsilon \in (0,1)\). We also assume that \(\int _{\mathbb {R}^d}\rho |\log \rho |<\infty \) if \( T > 0 \). Then
with a constant C depending only on the dimension d and the power s from Assumption 1.
The idea of the proof is to first construct a trial state for a constant density \(\rho _0\approx (1-\varepsilon )^dr_0^{-d}\rho _c(d)\) by using a periodic sphere packing with a large period, uniformly averaged over translations (often called a “floating crystal” [64]). We then “geometrically localize” [60] this state to make it have density \(\rho \). The proof is detailed later in Sect. 6.
3 Proof of Theorem 11 in Dimension \(d=1\)
We start with the one-dimensional canonical case, for which the argument is relatively easy. We detail the proof for the convenience of the reader and because this will pave the way for the more complicated covering methods in higher dimensions. We only consider here the canonical case. The grand-canonical bound (23) follows using (19), but in the next section we will provide a direct proof in the grand-canonical case which also works in dimension \(d=1\).
Theorem 15
(\( d = 1 \)) Suppose the interaction w satisfies Assumption 1 with \( 1 \leqslant \alpha < \infty \). Let \( T \geqslant 0 \) and assume that \( \int _{\mathbb {R}} \rho |\log \rho | < \infty \) for \( T > 0 \). Then for any density \( 0 \leqslant \rho \in L^1 ( \mathbb {R} ) \) with \( \int _{\mathbb {R}} \rho \in \mathbb {N}\), we have
Proof
Denoting \( N = \int _{\mathbb {R}} \rho \), we can split the real numbers \( \mathbb {R}\) into two families \( ( L_j )_{j=1}^N \), \( ( L_j^{*} )_{j=1}^N \) of disjoint intervals in such a way that the mass of \( \rho \) in each of these intervals is exactly \( \int _{L_j} \rho = 1/2 \), and such that each \( L_j \) has neighboring intervals only among the \( L_j^{*} \), and vice versa (see Fig. ).
This allows us to write
a convex combination of two measures with mass equal to N. As trial states for each of these, we take the symmetric tensor products
where \( \Pi _s \) denotes the symmetrization operator in (31). Then the state
has one-body density equal to \( \rho _{\mathbb {P}} = \rho \). Using that the intervals \( ( L_j ) \) are all disjoint, we have for instance
and similarly for \( \mathbb {Q}^{*} \). By concavity of the entropy, we conclude that
To estimate the interaction energy in the state \( \mathbb {P}\), it suffices to provide an estimate for both \( \mathbb {Q}\) and \( \mathbb {Q}^{*} \). We write here the argument only for \( \mathbb {Q}\), since the argument for \( \mathbb {Q}^{*} \) is exactly the same. By Assumption 1 and the construction of \( \mathbb {Q}\), we immediately have
where \( w_2 ( x ) = ( 1 + |x|^s )^{-1} \) and \(\rho _{\mathbb {Q}}^{( 2 )}\) is the two-particle correlation function. For the contribution from the tail of the interaction, we have by Young’s inequality
![](http://media.springernature.com/lw317/springer-static/image/art%3A10.1007%2Fs10955-023-03086-7/MediaObjects/10955_2023_3086_Equ214_HTML.png)
From the core of w we get
The idea now is to use the intervals \( ( L_j^{*} ) \) to estimate the sum above. For each i we denote by \( \eta _i \) the minimal length of neighboring intervals,
![](http://media.springernature.com/lw191/springer-static/image/art%3A10.1007%2Fs10955-023-03086-7/MediaObjects/10955_2023_3086_Equ215_HTML.png)
where \( \ell _j^{*} {:}{=} |L_j^{*}| \) is the interval length, and we re-order the collection \( ( L_i ) \) such that \( \eta _1 \leqslant \cdots \leqslant \eta _N \). Fixing the index i, we now clearly have for \( j > i \),
in particular, \( \eta _i \) is smaller than the side length of any interval neighboring \( L_j \). Pick \( x_j \in \overline{L_i} \) and \( y_j \in \overline{L_j} \) such that \( \textrm{d}( L_i, L_j ) = |x_j-y_j| \), and let \( L_k^{*} \) be the neighboring interval of \( L_j \) facing \( y_j \), that is, \( \textrm{d}( y_j, L_k^{*} ) = 0 \). Defining
then \( |\widetilde{L}_j| = \eta _i / 2 \), and \( \eta _i /2 \leqslant |x_j-y| \leqslant |x_j-y_j| \) for all \( y \in \widetilde{L}_j \), so we can estimate
Now summing over j gives
By Hölder’s inequality, we have by construction of the intervals \( L_j^{*} \)
for any \( \alpha > 1 \), so in this case we conclude that
The same bound holds for the interaction energy of \( \mathbb {Q}^{*} \), but with the intervals \( L_i^{*} \) replaced by \( L_i \) at the end. This finishes the proof of the \( \alpha > 1 \) case in (36).
To finish the \( \alpha = 1 \) case, we note that applying Jensen’s inequality on the function \( t \mapsto t^2 ( \log ( 2\lambda t ) )_+ \) for \( \lambda > 0 \) yields
Hence, continuing from (37), we get
Since the corresponding bound also holds for \( \mathbb {Q}^{*} \), this concludes the proof. \(\square \)
Remark 16
(Hard-core case) In the case where w has a hard-core with range \( r_0 > 0 \), it follows from the proof above that
for any density \( \rho \in L^1 ( \mathbb {R} ) \) satisfying the (sub-optimal) condition \(\int _{x}^{x+r_0} \rho \leqslant \frac{1}{2}\) for all \( x \in \mathbb {R}\).
4 Proof of Theorem 11 in the Grand-Canonical Case
In the course of our proof we need to cover the support of our density using disjoint cubes separated by a distance depending on the local value of the density, in order to have a reasonable control of the interaction. We obtain such a covering by a variant of the Besicovitch lemma [21], which we first describe in this subsection. It is different from the standard formulation.
For simplicity we work with a compactly supported density \(\rho \) with \(\int _{\mathbb {R}^d}\rho >1\). For every \(x\in \mathbb {R}^d\), we define \(\ell (x)\) to be the largest number such that
where \({\mathcal {C}}=(-1/2,1/2)^d\) is the unit cube centered at the origin. It is convenient to work with cubes instead of balls. It is important that the chosen value of the integral in (39) is universal and only depends on the space dimension d. This value is motivated by the estimates which will follow, it could be any fixed number \(<1\) at this point. The number \(\ell (x)\) always exists since the full integral is larger than 1. The function \(x\mapsto \ell (x)\) is upper semi-continuous. To simplify our notation we denote by \({\mathcal {C}}(x){:}{=}x+\ell (x){\mathcal {C}}\) the cube centered at x of side length \(\ell (x)\). By Hölder’s inequality we get
and thus obtain the estimate
on the local length \(\ell (x)\). The standard Besicovitch covering lemma (as stated for instance in [21, 35]) implies for compactly supported densities that there exists a set of points \(x_j'^{(k)}\) with \(1\leqslant k\leqslant K'\leqslant 4^d+1\) and \(1\leqslant j\leqslant J_k\) such that
-
the cubes \(\big ({\mathcal {C}}(x_j'^{(k)})\big )_{\begin{array}{c} 1\leqslant k\leqslant K'\\ 1\leqslant j\leqslant J_k \end{array}}\) cover the support of \(\rho \) and each \(x\in \mathbb {R}^d\) is in at most \(2^d\) such cubes,
-
for every k, the cubes \(\big ({\mathcal {C}}(x'^{(k)}_j)\big )_{1\leqslant j\leqslant J_k}\) are all disjoint.
We need to obtain different families which satisfy additional properties, namely we require the cubes to have a safety distance to all the larger cubes within the same family, this distance being comparable to the side length of the cube in question. The precise statement is the following.
Lemma 17
(Besicovitch with minimal distance) Let \(\rho \) be a compactly supported density with \(\int _{\mathbb {R}^d}\rho >1\). Then there exists a set of points \(x_j^{(k)}\) with \(1\leqslant k\leqslant K\leqslant 3^d(4^d+1)\) and \(1\leqslant j\leqslant J_k<\infty \) such that
-
the cubes \(\big ({\mathcal {C}}(x_j^{(k)})\big )_{\begin{array}{c} 1\leqslant k\leqslant K\\ 1\leqslant j\leqslant J_k \end{array}}\) cover the support of \(\rho \) and each \(x\in \mathbb {R}^d\) is in at most \(2^d\) such cubes,
-
for every k, the cubes \(\big ({\mathcal {C}}(x^{(k)}_j)\big )_{1\leqslant j\leqslant J_k}\) in the kth collection satisfy
$$\begin{aligned} \textrm{d}\left( {\mathcal {C}}(x^{(k)}_j),{\mathcal {C}}(x^{(k)}_\ell )\right) \geqslant \frac{1}{2}\min \Big \{\ell (x^{(k)}_j),\ell (x^{(k)}_\ell )\Big \}. \end{aligned}$$
Proof
We start the proof by applying the standard Besicovitch covering lemma recalled above. We obtain K collections of disjoint cubes. To impose the minimal distance we separate each family into \(3^d\) subfamilies. Specifically, we use that the maximal number of disjoint cubes of side length \(\geqslant \ell \) intersecting a cube of side length \(2\ell \) is at most \(3^d\). Thus if we look at a given cube of side length \(\ell \), only \(3^d-1\) other bigger cubes can be at distance \(\leqslant \ell /2\). By induction we can thus always distribute all our cubes into \(3^d\) subfamilies, while ensuring the distance property for all the bigger cubes. \(\square \)
Using Lemma 17 we obtain the following partition of unity
which we are going to use to construct our trial state for the upper bound on \(G_T[\rho ]\). We split the proof into several steps. We start with the case \(\alpha >d\) and treat the special case \(\alpha =d\) at the very end.
Step 1: Less than one particle. If \(\int _{\mathbb {R}^d}\rho \leqslant 1\), we consider the probability \(\mathbb {P}=(\mathbb {P}_n)\) given by
which has density \(\rho \) and no interaction energy. Its free energy is thus just equal to the entropy term
The first term is negative and thus we obtain the desired inequality
Step 2: Compactly supported densities (\(\alpha >d\)). Next we consider the case of a compactly supported density \(\rho \) with \(\int _{\mathbb {R}^d}\rho >1\). Using the partition (41) we write
where we abbreviated \(Q_j^{(k)}={\mathcal {C}}(x^{(k)}_j)\) for simplicity. This is a (uniform) convex combination of the K densities \(\rho ^{(k)}=\sum _{j}\rho _j^{(k)}\). For fixed k, the \(\rho _j^{(k)}\) have disjoint supports with distance greater or equal to \(\min \{\ell (x_j^{(k)}),\ell (x_{j'}^{(k)})\}/2\). In addition, we have
This is the reason for our choice of the constant in (39). Our trial state is given by
where
is the symmetrized tensor product of the states in (42), which has density \(\rho ^{(k)}\). Using the concavity of the entropy, our upper bound is, thus,
We have
since \(K\leqslant 15^d\leqslant e^{3d}\) and \(\eta \geqslant 1\). Thus we obtain
Our next task is to estimate the interaction, for every fixed k. By Assumption 1 we have \(w\leqslant w_1+w_2\) with \(w_1(x)=\kappa (r_0/|x|)^{\alpha }{\mathbb {1} }(|x|<r_0)\) and \(w_2(x)=\kappa (1+|x|^s)^{-1}\). We first estimate the term involving the integrable potential \(w_2\) using Young’s inequality as
After summing over k this gives
Using for instance
and recalling that \(\eta \geqslant 1\) and \(K\leqslant 3^d(4^d+1)\), we obtain
Next we consider the more complicated term involving the singular part \(w_1=\kappa {\mathbb {1} }(|x|<r_0) (r_0/|x|)^{\alpha }\). To simplify our notation, we remove the superscript (k) and thus consider the collection \((\rho _j)_{j=1}^J\) of functions supported in the disjoint cubes \(Q_j\) with the safety distance. For every \(i\ne j\), using \(\int _{\mathbb {R}^d}\rho _j\leqslant 1\), we can estimate
Recall that when \(|Q_i|\leqslant |Q_j|\), the distance \(\textrm{d}(Q_i,Q_j)\) is at least equal to \(\ell _i/2\). We can order our J cubes so that the volume is increasing: \(|Q_1|\leqslant |Q_2|\leqslant \cdots \leqslant |Q_J|\). We need to estimate
where \({\mathcal {C}}=(-1/2,1/2)^d\) and for every i, we have denoted by \(Q'_{i,j}\) the cube centered at \((x_j-x_i)/\ell _i\), of volume \(|Q_j|/|Q_i|\geqslant 1\). To estimate the sum in j, we use the following lemma, which is based on the integrability at infinity of \(|x|^{-\alpha }\) and is similar to [61, Lemma 9].
Lemma 18
Let \({\mathcal {C}}=(-1/2,1/2)^d\) be the unit cube and consider any collection of non-intersecting cubes \(Q_j\) with the property that \(|Q_j|\geqslant 1\) and \(\textrm{d}({\mathcal {C}},Q_j)\geqslant \frac{1}{2}\). Then we have
The constant on the right of (44) is not at all optimal and is only displayed for concreteness.
Proof of Lemma 18
Let \(X_j\in {\mathcal {C}}\) and \(Y_j\in Q_j\) be so that \(\textrm{d}({\mathcal {C}},Q_j)=|X_j-Y_j|\geqslant \frac{1}{2}\). For any \(x\in B(X_j,1/8)\) and \(y\in B(Y_j,1/8)\) we have
Integrating over \(x'\in {\mathcal {C}}\cap B(X_j,1/8)\) and \(y'\in Q_j\cap B(Y_j,1/8)\) we obtain
The volume of the intersection of a ball of radius 1/8 centered at \(X_j\) in a cube of volume \(\geqslant 1\) and that of the other cube is bounded away from 0. It is in fact minimal when \(X_j, Y_j\) are located at a corner, yielding
Thus, we obtain
Summing over j using that the cubes are disjoint we obtain
as was claimed. \(\square \)
From the estimates (40) and (44), we deduce that
Using that \(\int _{\cup _iQ_i}\rho ^{1+\frac{\alpha }{d}} \leqslant \int _{\mathbb {R}^d} \rho ^{1+ \frac{\alpha }{d}}\) and summing over K, we obtain our final estimate
This is our final upper bound, with non-optimal constants only displayed for concreteness.
Step 3: General densities (\(\alpha >d\)). In order to be able to use the Besicovitch lemma, we restricted ourselves to compactly supported densities. We prove here that the exact same estimate holds for general densities. Let \(\rho \in (L^1\cap L^{1+\alpha /d})(\mathbb {R}^d,\mathbb {R}_+)\), \(\varepsilon \in (0,1)\) and write
with \(\mathcal {C}_L=(-L/2,L/2)^d\). Using the concavity of the entropy, we obtain
We choose L so large that
which allows us to use (43) for the second term on the right of (46). For the first term we just use Step 2. We find
By passing first to the limit \(L\rightarrow \infty \) and then \(\varepsilon \rightarrow 0\), we conclude that \(\rho \) satisfies the same estimate (45) as for compactly supported densities.
Step 4: Case \(\alpha =d\). The case when the core of the interaction behaves as \(w_1(x)=\kappa r_0^d |x|^{-d} {\mathbb {1} } ( |x|\leqslant r_0 )\) is similar to the previous situation with some small changes. The function is not integrable around the origin which requires to have a safety distance between particles in our trial state. However this interaction is also non-integrable without cutoff at infinity so we need to use that the core of our interaction is compactly supported on the ball of radius \(r_0\). The following alternative to Lemma 18 is going to be useful.
Lemma 19
Let \({\mathcal {C}}_0=(-\ell _0/2,\ell _0/2)^d\) and consider any collection of non-intersecting cubes \(Q_j\) with the property that \(|Q_j|\geqslant \ell _0^d\) and \(\textrm{d}({\mathcal {C}}_0,Q_j)\geqslant \frac{\ell _0}{2}\). Then we have
Proof
We assume \(\ell _0\leqslant 2r_0\) otherwise there is nothing to prove. Let \(X_j\in {\mathcal {C}}_0\) and \(Y_j\in Q_j\) be such that \(\textrm{d}({\mathcal {C}}_0,Q_j)=|X_j-Y_j|\geqslant \frac{\ell _0}{2}\). For any \(x\in B(X_j,\ell _{0}/8)\) and \(y\in B(Y_j,\ell _{0}/8)\) we have
Integrating over \(x'\in {\mathcal {C}}_0\cap B(X_j,\ell _{0}/8)\) and \(y'\in Q_j\cap B(Y_j,\ell _{0}/8)\) we obtain
Summing over all cubes we get
\(\square \)
Next we explain how to relate the right side of (47) with the density \(\rho \). Recall from (39) that
where \(\mathcal {C}(x)\) is the cube of side length \(\ell (x)\) centered at x. By Jensen’s inequality, we have for every convex function F
Applying this to
we obtain
The rest of the proof is similar to the case \(\alpha >d\), using (50) and Lemma 19. We omit the details. This concludes the proof of Theorem 11.\(\square \)
Remark 20
(Hard-core case) The previous proof can be used in the hard core case \(\alpha =\infty \), under the (sub-optimal) condition that \(\int _{x+r_0{\mathcal {C}}}\rho <\frac{1}{3^d(4^d+1)}\) for all x, where \({\mathcal {C}}=(-1/2,1/2)^d\). The interaction can be bounded by \(\kappa C N\) as we have seen in (34), leading to the bound
5 Proofs in the Canonical Case
5.1 The Local Radius R(x) in Optimal Transport
Here we explain how to construct canonical trial states using a result from optimal transport, in order to obtain bounds at zero temperature for a singular interaction (\(d\leqslant \alpha \leqslant \infty \)).
Consider any density \( \rho \) with \( \int \rho > 1 \), and recall the local radius R(x) from (24). Note that \( R ( x ) \) can never be zero because \( \rho \) as a measure does not have any point mass. The function R is connected to the Hardy–Littlewood maximal function \( M_\rho \), defined by
where \( |B_r| \) denotes the volume of a ball in \( \mathbb {R}^d \) of radius r. By definition of R it is clear that
so that we have the pointwise bound
Furthermore, using Hölder’s inequality gives for any \( p > 0 \),
implying for \( \rho \in L_{\textrm{loc}}^{1+\frac{p}{d}} ( \mathbb {R}^d ) \) the bound
It is also apparent that R is 1-Lipschitz-continuous, see e.g. [14, Theorem 4.1]. One might also remark that R always stays away from zero, i.e.
This is an immediate consequence of the facts that R is continuous and that, necessarily, \( \lim _{|x| \rightarrow \infty } R ( x ) = \infty \), because \( \rho \in L^1 ( \mathbb {R}^d ) \).
To obtain an upper bound on the canonical energy at a fixed density \( 0 \leqslant \rho \in L^1 ( \mathbb {R}^d ) \), it is convenient to have existence of states \( \mathbb {P}\) in which the distance between the particles is bounded from below in terms of the function R from (24). The following is a consequence of a result from [14].
Theorem 21
(Optimal transport state) Let \( 0 \leqslant \rho \in L^1 ( \mathbb {R}^d ) \) with \( N = \int _{\mathbb {R}^d} \rho \in \mathbb {N}\). There exists an N-particle state \( \mathbb {P}\) with density \( \rho _{\mathbb {P}} = \rho \) such that
\(\mathbb {P}\)—almost everywhere, where R is the function defined by (24), and \( R_{\rho } \) is its minimum in (54).
Proof
The proof is a simple application of [14, Theorem 4.3]. For any \( 0< \eta < 1 \) (we will choose \( \eta = 1/3 \) in a moment) and any \( x \in \mathbb {R}^d \), define a set
![](http://media.springernature.com/lw297/springer-static/image/art%3A10.1007%2Fs10955-023-03086-7/MediaObjects/10955_2023_3086_Equ216_HTML.png)
Then we have for any \( 0< t < 1 -\eta \), using the Lipschitz continuity of R,
![](http://media.springernature.com/lw476/springer-static/image/art%3A10.1007%2Fs10955-023-03086-7/MediaObjects/10955_2023_3086_Equ217_HTML.png)
We wish to choose t and \( \eta \) such that the measure of right hand side is equal to one (with respect to the measure \( \rho \)). First, requiring the two balls to have the same radius leads to the choice \( t = \frac{1-\eta }{2} \). Next, we choose \( \eta \) such that \( \frac{\eta }{t} = \frac{\eta }{1-t-\eta } = \frac{2 \eta }{1-\eta } = 1 \), which implies \( \eta = 1/3 \).
Now, defining an open and symmetric set \( D \subseteq \mathbb {R}^d \times \mathbb {R}^d \) by
![](http://media.springernature.com/lw383/springer-static/image/art%3A10.1007%2Fs10955-023-03086-7/MediaObjects/10955_2023_3086_Equ218_HTML.png)
then satisfies
Thus, by definition of R, we have \( \rho ( B ( x ) ) \leqslant \rho ( B ( x, R( x ) ) ) = 1 \), and since D is open and symmetric, [14, Theorem 4.3] asserts the existence of a \( \mathbb {P}\) with the claimed properties. Specifically, one can take \( \mathbb {P}\) to be the optimizer for the multi-marginal optimal transport problem associated to the cost
![](http://media.springernature.com/lw152/springer-static/image/art%3A10.1007%2Fs10955-023-03086-7/MediaObjects/10955_2023_3086_Equ219_HTML.png)
where A denotes the set containing the \((x_1,...,x_N)\) satisfying (55). \(\square \)
5.2 Proof of Theorem 11 in the Canonical Case at \(T=0\)
The existence of the state from Theorem 21 allows us to prove the last part of Theorem 11 about the canonical free energy at zero temperature. For convenience we state a proposition valid for any state \(\mathbb {P}\) for which the particles satisfy an inequality similar to (55). Along with Proposition 23 below (which covers the case \( \alpha = d \)), this immediately implies Theorem 11 in the canonical case.
Proposition 22
(Zero temperature energy bound, \( d< \alpha < \infty \)) Let w satisfy Assumption 1 with \( d< \alpha < \infty \). Let \(\mathbb {P}\) be any N-particle probability measure with one-body density \( \rho :=\rho _{\mathbb {P}} \) satisfying
\(\mathbb {P}\)—almost everywhere, for some \( 0 < \eta \leqslant 1 \). Then the interaction energy in the state \( \mathbb {P}\) is bounded by
with C a constant depending only on \(d,\alpha ,s\).
Proof of Proposition 22
In this proof we will not keep track of the exact value of the constants, since we will need the (unknown) one from the Hardy–Littlewood inequality. Hence C denotes here a generic constant depending only on \(d,\alpha ,s\). By the assumptions on w, we have
Let \(x= ( x_1, \dotsc , x_N) \) be in the support of \(\mathbb {P}\). After permutation we can assume that \( R ( x_1 ) \leqslant R ( x_2 ) \leqslant \cdots \leqslant R ( x_N ) \). We fix the index i and consider the points \( x_i - x_j \) in \( \mathbb {R}^d \) for \( j = i+1, \dotsc , N \). Because of (56), these points are all at a distance at least \( \eta ( R( x_i ) + R ( x_j ) ) \) from the origin, and
Hence we can place \( N - i \) disjoint balls in \( \mathbb {R}^d \) with radii \( \eta R ( x_j ) \), centered at the points \( x_i - x_j \), respectively. Inside each of these balls, we place a smaller ball of radius \( \frac{\eta }{2} R ( x_j ) \), centered at
Then \( x_i - x_j \) is the point on the boundary of \( B ( z_j, \frac{\eta }{2} R ( x_j ) ) \) which is the farthest from the origin (see Fig. ), so that
Note that the distance from \( B ( z_j, \frac{\eta }{2} R ( x_j ) ) \) to the origin is bounded from below by
Using this, along with the fact that all the balls are disjoint, we get the pointwise bound
for \( \mathbb {P}\)-a.e. \( x \in \mathbb {R}^{dN} \). We conclude that the contribution to the energy from the core of w can be bounded by
Similarly, we get for the contribution from the tail of w,
so
Finally, recalling from (52) that \( R ( x ) \) is bounded from below in terms of the maximal function of \( \rho \), we apply the Hölder and Hardy–Littlewood maximal inequalities to obtain for any power \( p > 0 \),
Using this on (60) and (61), and combining with (58), we obtain the claimed bound (57). \(\square \)
Proposition 23
(Special case \( \alpha = d \)) Let w be an interaction satisfying Assumption 1 with \( \alpha = d \), and \( 0 \leqslant \rho \in L^1 ( \mathbb {R}^d ) \) a density with \( \int \rho = N \). Then, for any N-particle probability measure \( \mathbb {P}\) with one-body density \( \rho _{\mathbb {P}} = \rho \) satisfying (56) for some \( 0 < \eta \leqslant 1 \), the interaction energy is bounded by
where the constants c and C depend only on the dimension d.
Proof
The proof goes along the same lines as the proof of Proposition 22. However, complications arise due to the fact that \( 1/|x|^d \) is not integrable at infinity, so we need to take into account the finite range \( r_0 \) of the core of w. Incidentally, this also forces us to avoid using the Hardy-Littlewood maximal inequality later in the proof. Following the proof of Proposition 22 up to (59) and noting that \( B ( z_j, \frac{\eta }{2} R ( x_j ) ) \subseteq B ( 0, |x_i-x_j| ) \), we have
This leads to the bound
where, in this case, we cannot use the Hardy–Littlewood maximal inequality on the first term. However, this can be circumvented using the fact that \( |x_i - x_j| \geqslant \eta ( R ( x_i ) + R ( x_j ) ) \) on the support of \( \mathbb {P}\), which is the content of Lemma 24 below. Using the lemma, we conclude
where the constant C depends only on the dimension d. \(\square \)
Lemma 24
Let \( 0 \leqslant \rho \in L^1 ( \mathbb {R}^d ) \) be any density with \( \int \rho > 1 \), and take any configuration of points \( x_1, \dotsc , x_M \in \mathbb {R}^d \) satisfying \( |x_i - x_j| \geqslant \eta ( R ( x_i ) + R ( x_j ) ) \) for \( i \ne j \), for some \( 0 < \eta \leqslant 1 \). Then we have the bounds
for any \( p > 0 \), and for any \( \lambda > 0 \),
Proof
We consider any configuration \( x_1, \dotsc , x_M \) as in the statement, and seek to provide a bound on the sum \( \sum _{i=1}^M \frac{1}{R ( x_i )^p} \). We order the points such that \( R ( x_1 ) \leqslant \cdots \leqslant R ( x_M ) \), and assume first for simplicity that all the balls \( B ( x_j, R ( x_j ) ) \) intersect the smallest ball \( B ( x_1, R ( x_1 ) ) \). The main idea of the following argument is to split the space \( \mathbb {R}^d \) into shells of exponentially increasing width, centered around \( x_1 \), and arguing that the number of points among \( x_2, \dotsc , x_M \) that can lie in each shell is universally bounded. To elaborate, take any \( \tau > 1 \) and consider for \( m \in \mathbb {N}_0 \) the spherical shell of points \( y \in \mathbb {R}^d \) satisfying
Note that if \( x_j \) lies in this shell, then by Lipschitz continuity of R,
immediately implying that
This means that the ball \( B ( x_j, \eta R ( x_j ) ) \) is contained in
Furthermore, by the assumption that \( B ( x_1, R ( x_1 ) ) \cap B ( x_j, R ( x_j ) ) \ne \emptyset \), we have that
Since the balls \( B ( x_j, \eta R ( x_j ) ) \) are all disjoint, we conclude that the number of \( x_j \)’s that can lie in the m’th shell around \( x_1 \) is bounded by the ratio of the volumes
![](http://media.springernature.com/lw481/springer-static/image/art%3A10.1007%2Fs10955-023-03086-7/MediaObjects/10955_2023_3086_Equ220_HTML.png)
Note also that no \( x_j \) can be placed inside the first shell (corresponding to \( |x_1- x_j| < \eta R ( x_1 ) \)), because we always have \( |x_1 - x_j| \geqslant \eta ( R ( x_1 ) + R ( x_j ) ) \) by assumption. Now, for any power \( p > 0 \), this allows us to bound, using (53),
To bound the sum involving the logarithm, we note first that for any \( \lambda > 0 \), applying Jensen’s inequality to the function \( t \mapsto t^2 ( \log \lambda t )_+ \) yields
Using this, we obtain by again summing over all the shells,
Finally, we generalize to the case where not all the balls \( B ( x_j, R ( x_j ) ) \) intersect the smallest ball \( B ( x_1, R ( x_1 ) ) \). We split the configuration \( ( x_j )_{1 \leqslant j \leqslant M} \) into clusters \( \left( {x_j^{( k )}}\right) _{1 \leqslant j \leqslant n_k} \) with \( 1 \leqslant k \leqslant K \), such that:
-
For any k, \( R ( x_1^{( k )} ) \leqslant \cdots \leqslant R ( x_{n_k}^{( k )} ) \).
-
\( B ( x_1^{( k )}, R ( x_1^{( k )} ) ) \cap B ( x_j^{( k )}, R ( x_j^{( k )} ) ) \ne \emptyset \) for any j, k.
-
The balls \( B ( x_1^{( k )}, R ( x_1^{( k )} ) ) \) are all pairwise disjoint for \( k = 1, \dotsc , K \).
Then, using (69) on each cluster, we get for instance
which concludes the proof of (64). (65) follows in the same way. \(\square \)
Remark 25
(Hard-core at zero temperature) As we have mentioned in Sect. 2.4, in the hard core case \(\alpha =+\infty \), we know from (34) that for any representable density \(\rho \), we have
where the constant C depends only on d and s. The problem is to determine when \(\rho \) is representable. Using Theorem 21, this is the case when for instance \(R_\rho =\min _{x} R(x)\geqslant r_0\).
5.3 The Block Approximation
While the state from Theorem 21 is useful for obtaining energy bounds at zero temperature, it might be singular with respect to the Lebesgue measure on \( \mathbb {R}^{dN} \), leaving it unsuitable to use for the positive temperature case, because the entropy in this case will be infinite. Here we describe a simple way of regularizing states, while keeping the one-body density fixed, which is a slight generalization to any partition of unity of the construction in [9]. Essentially, it works by cutting \( \mathbb {R}^d \) into “blocks” and then locally replacing the state by a pure tensor product.
Let \( \sum \chi _j = \mathbb {1}_{\mathbb {R}^d} \) be any partition of unity, and \( \mathbb {P}\) any N-particle state with density \( \rho \). The corresponding block approximation is defined by
where we denote
That is, \( \widetilde{\mathbb {P}} \) is a convex combination of tensor products of the normalized \( \frac{\rho \chi _i}{\int \rho \chi _i} \). One can easily show that \( \widetilde{\mathbb {P}} \) has one-body density \( \rho _{\widetilde{\mathbb {P}}} = \rho \). Furthermore, it is clear that \( \widetilde{\mathbb {P}} \) is a symmetric measure whenever \( \mathbb {P}\) is, so we can also write
where \( \Pi _s \) denotes the symmetrization operator in (31). In [9] the chosen partition of unity is just a tiling made of cubes, but in fact any partition works. Applying Jensen’s inequality yields the following.
Lemma 26
(Entropy of the block approximation) Suppose that the state \( \mathbb {P}\) and the partition of unity \( ( \chi _j ) \) are such that \( \chi _{i_1}, \dotsc , \chi _{i_N} \) all have disjoint supports whenever \( \mathbb {P}( \chi _{i_1} \otimes \cdots \otimes \chi _{i_N} ) \ne 0 \). Then we have
Remark 27
Since \( ( \chi _i ) \) is a partition of unity, the term above involving \( \chi _i \log \chi _i \) can always be estimated from above by zero. On the other hand, it is not clear that the sum in last term above is even finite for an arbitrary partition \( ( \chi _i ) \). However, it turns out to behave nicely in many situations. For instance, if \(\int \rho \chi _j\leqslant 1\) for all j, we can estimate it by 1/e times the number of terms in the partition of unity, which is typically finite when \(\rho \) has compact support.
Proof
The entropy of the block approximation can be estimated using Jensen’s inequality by
We have here used the symmetry of \(\mathbb {P}\) to remove the first \(\Pi _s\). It is important that the N! has disappeared in the logarithm. For any non-zero term, the supports of the \( \chi _{i_k} \) are all disjoint, hence only the case \(\sigma =\text {Id}\) remains in the sum. Using that
and plugging this into the previous expression, we conclude that (73) holds. \(\square \)
5.4 Proof of Theorem 12 in the Canonical Case at \(T>0\)
We assume first that the density \( \rho \) is compactly supported, and then remove this assumption at the end. Applying the Besicovitch covering lemma [21, 35] on the cover gives the existence of a (finite) set of points \( ( y_j ) \subseteq {{\,\textrm{supp}\,}}\rho \) satisfying that \( ( B_j ):= ( B ( y_j, \varepsilon R ( y_j ) ) ) \) covers the support of \( \rho \), and the multiplicity of the cover is universally bounded, i.e.,
where the constant \( C_d \) depends only on the dimension d, and thus not on \( \varepsilon \) or \( \rho \). This gives us a partition of unity \( ( \chi _j ) \) defined by \( \chi _j:= \frac{{\mathbb {1} }_{B_j}}{\varphi } \). One way of constructing the Besicovitch cover is to inductively maximize \( \varepsilon R ( y_j ) \) over the remaining volume \( y_j \in {{\,\textrm{supp}\,}}\rho {\setminus } \bigcup _{k=1}^{j-1} B_k \), supposing that \( y_1, \dotsc , y_{j-1} \) have already been chosen. This construction implies the bound on the distances
for all \( j \ne k \).
We now take the optimal transport state \( \mathbb {P}\) obtained from Theorem 21, and denote by \( m_j:= \int \rho \chi _j = \int _{B_j} \frac{\rho }{\varphi } \) the local mass of \( \rho \) with respect to the partition of unity \( ( \chi _j ) \). As a trial state for the free energy, we take the block approximation (72) of \( \mathbb {P}\) using the \( \chi _j \), i.e.,
We show that the support of \( \mathbb {P}_{\varepsilon } \) satisfies the condition (56) for some \( \eta \). For any point \( ( x_1, \dotsc , x_N ) \in {{\,\textrm{supp}\,}}\mathbb {P}_{\varepsilon } \), there must be a term in the sum above such that \( \mathbb {P}( \chi _{j_1} \times \cdots \times \chi _{j_N} ) \ne 0 \), and \( x_k \in B_{j_k} = B( y_{j_k}, \varepsilon R ( y_{j_k} ) ) \) for all k. In particular, since the support of \( \mathbb {P}\) satisfies (55), there exist \( z_1, \dotsc , z_N \) with \( z_k \in B_{j_k} \) and \( |z_k - z_{\ell }| \geqslant \frac{1}{3} ( R ( z_k ) + R ( z_{\ell } ) ) \) for any \( k \ne \ell \). By the Lipschitz continuity of R, \( x_k \in B_{j_k} \) implies that \( R ( y_{j_k} ) \leqslant \frac{1}{1-\varepsilon } R ( x_k ) \), so
Finally, this gives us the bound
This argument also shows that if \( \mathbb {P}( \chi _{j_1} \times \cdots \times \chi _{j_N} ) \ne 0 \), then the sets \( B_{j_k} \) are disjoint for \( k = 1, \dotsc , N \), provided that \( \varepsilon < \frac{1}{9} \).
Now, since \( \mathbb {P}_{\varepsilon } \) satisfies (75), it follows from Proposition 22 that the interaction energy (in case \( \alpha > d \)) is bounded by
and similarly for \( \alpha = d \), using Proposition 23. Thus, to show (25), it only remains to provide a bound on the entropy of the state \( \mathbb {P}_{\varepsilon } \). First, applying Lemma 26 immediately gives
Then, for any numbers \( s,t \geqslant 0 \), we can use the elementary bound
to conclude that
where the last inequality uses (74) and Lemma 24. This proves Theorem 12 for compactly supported densities. \(\square \)
Remark 28
(Hard-core case) In the hard core case \(\alpha =+\infty \), the above proof provides the bound
under the assumption that \(R_\rho =\min _x R(x)>r_0\), where C only depends on d and s. The main difference is the estimate on the distance between the particles in (75). We need to keep the maximum and use
Taking \(\varepsilon =\min (R_\rho /r_0-1,1)/100\) provides (76).
5.5 Removal of the Compactness Condition
To finish this section we describe how to extend a result holding for compactly supported densities to general integrable ones, using this time a compactness argument.
Theorem 29
Assume that w satisfies Assumption 1. If we have for some \(1\leqslant p\leqslant q<\infty \) with \(q\geqslant 2\) and some constants \(C_j\geqslant 0\)
for all \(\rho \in L^1\cap L^q\) of compact support, then the same holds with the same constants for all \(\rho \in L^1\cap L^q\). If \(T>0\) we assume in both cases that \(\int _{\mathbb {R}^d}\rho |\log \rho |<\infty \).
Proof
Let us first assume \(C_4=0\) for simplicity. Our proof uses that the energy \(\rho \mapsto F_T[\rho ]\) is lower semi-continuous for the strong topology of \(L^1\), as previously mentioned in Remark 3, that is,
The theorem then follows immediately by letting
the truncation of \(\rho \) over the ball of radius n. Note that \(\rho _n\leqslant (1+o(1))\rho \). The sequence \(\rho _n\) clearly satisfies the convergence properties of (78) and therefore the lower semi-continuity provides
where \(B'_n:=B_n\cap \big \{\rho \geqslant N^{-1}\int _{B_n}\rho \big \}\).
When \(C_4>0\) the proof is similar. We need to use that \((1+|x|)/C\leqslant R(x),R_n(x)\leqslant C(1+|x|)\) for some \(C>0\) (depending on \(\rho \)), where \(R_n(x)\) is the local radius of the truncated density \(\rho _n\), which converges locally to R. The uniform bounds on R and \(R_n\) imply that we must work under the assumptions that \(\int \rho (\log |x|)_+\) is finite (otherwise there is nothing to show). The limit follows from dominated convergence.
For the convenience of the reader, we conclude by quickly recalling the proof of the lower semi-continuity (78). We consider an arbitrary sequence \(\rho _n\) converging to \(\rho \) strongly in \(L^1\) and satisfying the bounds in (78). It is known that there exists an optimal \(\mathbb {P}_n\) for \(F_T[\rho _n]\) (but we could as well use a quasi-minimizer). From the upper bound we have \(F_T[\rho _n]\leqslant C\) for some constant C and therefore
The first term is non-negative from the stability property of w and the second is a relative entropy, hence is also non-negative. We have thus proved that
where the constant can depend on \(\rho ,N,T\) but not on n. On the other hand, we know that the sequence \((\mathbb {P}_n)\) is tight, that is,
where the right side is small due to the strong convergence in \(L^1\). After extraction of a subsequence, this implies \(\int F\,\textrm{d}\mathbb {P}_n\rightarrow \int F\,\textrm{d}\mathbb {P}\) for every \(F\in C^0_b\). Taking \(F(x_1,...,x_N)=\sum _{j=1}^Nf(x_j)\) with \(f\in C^0_b\), we find that \(\int f\rho _{\mathbb {P}_n}\rightarrow \int f\rho _{\mathbb {P}}\), that is, \(\rho _{\mathbb {P}}=\rho \). In addition, we have (by convexity)
Hence \(\mathbb {P}\) is admissible for \(F_T[\rho ]\), and absolutely continuous with respect to the Lebesgue measure if \(T>0\). We thus have
for every measurable function \(F\geqslant 0\) if \(T>0\) (using the absolute continuity of \(\mathbb {P}\)) and for every lower semi-continuous function \(F\geqslant 0\) if \(T=0\). This is satisfied for our interaction w by Assumption 1 and therefore we obtain as we wanted
Together with the entropy bound (79) when \(T>0\), this proves that
which is the claimed lower semi-continuity (78). \(\square \)
6 Proof of Theorem 14 in the Hard-Core Case
In this section we prove Theorem 14 concerning densities which are uniformly bounded in terms of the tight packing density \(\rho _c(d)\). We start by constructing a trial state with constant density by averaging a periodic tight packing over translations. Such a uniform average of a periodic lattice is often called a “floating crystal” [64, 65] in Physics and Chemistry. Finally, we estimate the entropic cost of “geometrically localizing” [60] this state to enforce the desired density.
Step 1: Constant density. We have assumed \(\rho \leqslant (1-\varepsilon )^dr_0^{-d}\rho _c(d)\). Let \(\eta >0\) be a fixed small number which will later be chosen in terms of \(\varepsilon \). From the definition of \(\rho _c(d)\) we can find a large cube \(C_\ell =(-\ell /2,\ell /2)^d\) and \(n=(1+2\eta )^{-d}r_0^{-d}\rho _c(d)\ell ^d\in \mathbb {N}\) points \(x_1^0,...,x_n^0\in C_\ell \) satisfying \(|x^0_j-x^0_k|\geqslant r_0(1+\eta )\) for all \(j\ne k\). We can also assume that no point is at a distance less than \(r_0\) to the boundary of \(C_\ell \). We are using here that the tight packing density for \(r_0(1+\eta )\) is \((1+\eta )^{-d}r_0^{-d}\rho _c(d)>(1+2\eta )^{-d}r_0^{-d}\rho _c(d)\) and that the limit (33) is the same for cubes and for balls.
Now, we replace each point \(x_j^0\) by a smeared measure
where \(\chi =|B_1|^{-1}{\mathbb {1} }_{B_1}\). The smearing radius \(\eta r_0/2\) has been chosen so that the supports of the \(\chi ^0_j\) remain at distance at least \(r_0\).
Finally, we consider \((2K+1)^d\) copies of our system (\(K\in \mathbb {N}\)), repeated in a periodic fashion so as to form a very large cube \(C_L=(-L/2,L/2)^d\) of side length \(L=(2K+1)\ell \). In other words, we define the \(N:=(2K+1)^dn\) points \(x_j^k:=x_j^0+kL\) with \(k\in \{-K,...,K\}^d\). The smeared measures \(\chi _j^k\) are defined similarly. The state
has the density \(\rho =\sum _{j,k}\chi _j^k\) and the finite entropy
(recall \(\Pi _s\) is the symmetrization operator in (31)). Finally, we average over translations of the big cube and define the trial state
which has the density
The latter is constant, equal to \(n/\ell ^d=(1+2\eta )^{-d}r_0^{-d}\rho _c(d)\) well inside the large cube. Note that, by concavity, the entropy of \({\widetilde{\mathbb {P}}}\) can be estimated by that of \(\mathbb {P}\).
Step 2: Geometric localization. We assume for the rest of the proof that \(\rho \) has a compact support and we choose K large enough so that \({\widetilde{\rho }}\) is constant on the support of \(\rho \). Our estimates will not depend on K. One can then deduce the bound for general densities by adapting the proof of Theorem 29, or by passing to the limit \(K\rightarrow \infty \) in the formulas (80)–(81) of the trial state.
We pick \(\eta \) so that \((1-\varepsilon )^d=(1+2\eta )^{-d}\), that is,
Then we have \(\rho \leqslant {\widetilde{\rho }}\) a.e. This enables us to consider the localization function
and the \(\theta \)–localized state \({\widetilde{\mathbb {P}}}_{|\theta }\), which has the desired density \(\theta \rho _{{\widetilde{\mathbb {P}}}}=\rho \).
We recall that the \(\theta \)—localization \(\mathbb {Q}_{|\theta }\) of a state \(\mathbb {Q}\) (with \(0\leqslant \theta \leqslant 1\)) is the unique state which has the correlation functions \(\rho ^{(k)}=\rho ^{(k)}_{\mathbb {Q}}\theta ^{\otimes k}\) for all k, see [34, 42, 60]. In our case we only need the definition for a tensor product since we have by linearity
For a symmetric tensor product \(\mathbb {Q}=\Pi _s(q_1\otimes \cdots \otimes q_N)\) with probabilities \(q_j\) of disjoint support, the \(\theta \)-localized state can be expressed as
We will need the following.
Lemma 30
(Entropy of localization of tensor products) Let \(\mathbb {Q}=\Pi _s(q_1\otimes \cdots \otimes q_N)\) be a symmetric tensor product, with \(q_1,...,q_N\) probability measures of disjoint supports. For any \(0\leqslant \theta \leqslant 1\), we have
In particular, we deduce
As a side remark we also note also that (82) provides
The additional terms are positive and therefore we recover the subadditivity of the entropy \(\mathcal {S}(\mathbb {Q})\leqslant \mathcal {S}(\mathbb {Q}_{|\theta })+\mathcal {S}(\mathbb {Q}_{|1-\theta })\) [42, Appendix A].
Proof
Each tensor product \((\theta q_{\sigma (1)})\otimes \cdots \otimes (\theta q_{\sigma (n)})\) appears exactly \((N-n)!\) times with the same weight in (81). We can thus write it in the better form
where now the terms all have disjoint supports. We obtain that the entropy equals
Note that the n! in the logarithm simplifies with the 1/n!. Expanding the logarithm and collecting the terms we obtain the claimed formula. \(\square \)
In our case, we deduce by concavity that
We estimate \(\chi _j^k\) in the logarithm by its supremum \(\Vert \chi _j^k\Vert _\infty =\frac{2^d}{(r_0\eta )^d|B_1|}\) and use that
We obtain
On the other hand, the energy bound (34) applies since we still have \(|x_j-x_k|\geqslant r_0\) on the support of the localized state \({\widetilde{\mathbb {P}}}_{|\theta }\). This concludes the proof of Theorem 14. \(\square \)
Data Availability
Data sharing not applicable to this article as no datasets were generated or analysed during the current study.
Notes
Recall that one can express the constant density \(\rho \) of an infinite gas as a convergent series in terms of the activity \(z=e^{\beta \mu }\), in the regime \(z\ll 1\) [94]. This corresponds to placing the system in the constant external potential \(V(x)=-\mu \). Since \(\rho \sim _{z\rightarrow 0} z\), the series is invertible and any small uniform density is therefore representable by such a uniform potential, with \(\mu \sim _{\rho \rightarrow 0}\beta ^{-1}\log \rho \).
References
Baus, M.: Statistical mechanical theories of freezing: an overview. J. Stat. Phys. 48, 1129–1146 (1987)
Baus, M., Lutsko, J.F.: Statistical mechanical theories of freezing: where do we stand? Physica A 176, 28–36 (1991)
Braun, W., Hepp, K.: The Vlasov dynamics and its fluctuations in the \(1/N\) limit of interacting classical particles. Commun. Math. Phys. 56, 101–113 (1977)
Buttà, P., Lebowitz, J.L.: Local mean field models of uniform to nonuniform density fluid–crystal transitions. J. Phys. Chem. B 109, 6849–6854 (2005)
Buttazzo, G., Champion, T., De Pascale, L.: Continuity and estimates for multimarginal optimal transportation problems with singular costs. Appl. Math. Optim. 78, 185–200 (2018)
Caglioti, E., Lions, P.-L., Marchioro, C., Pulvirenti, M.: A special class of stationary flows for two-dimensional Euler equations: a statistical mechanics description. Commun. Math. Phys. 143, 501–525 (1992)
Caglioti, E., Lions, P.-L., Marchioro, C., Pulvirenti, M.: A special class of stationary flows for two-dimensional Euler equations: a statistical mechanics description. II. Commun. Math. Phys. 174, 229–260 (1995)
É. Cancès, L.L., Liu, J., Friesecke, G. (eds.): Density Functional Theory, vol. 1 of Springer Series on Molecular Modeling and Simulation. Springer. In preparation
Carlier, G., Duval, V., Peyré, G., Schmitzer, B.: Convergence of entropic schemes for optimal transport and gradient flows. SIAM J. Math. Anal. 49, 1385–1418 (2017)
Chayes, J.T., Chayes, L.: On the validity of the inverse conjecture in classical density functional theory. J. Stat. Phys. 36, 471–488 (1984)
Chayes, J.T., Chayes, L., Lieb, E.H.: The inverse problem in classical statistical mechanics. Commun. Math. Phys. 93, 57–121 (1984)
Cohn, H.: A conceptual breakthrough in sphere packing. Notices Am. Math. Soc. 64, 102–115 (2017)
Colombo, M., De Pascale, L., Di Marino, S.: Multimarginal optimal transport maps for one-dimensional repulsive costs. Can. J. Math. 67, 350–368 (2015)
Colombo, M., Di Marino, S., Stra, F.: Continuity of multimarginal optimal transport with repulsive cost. SIAM J. Math. Anal. 51, 2903–2926 (2019)
Cotar, C., Petrache, M.: Equality of the jellium and uniform electron gas next-order asymptotic terms for Coulomb and Riesz potentials. Preprint at http://arxiv.org/abs/1707.07664 (2019)
Cotar, C., Petrache, M.: Next-order asymptotic expansion for \(N\)-marginal optimal transport with Coulomb and Riesz costs. Adv. Math. 344, 137–233 (2019)
Cotar, C., Friesecke, G., Klüppelberg, C.: Density functional theory and optimal transportation with Coulomb cost. Commun. Pure Appl. Math. 66, 548–599 (2013)
Cotar, C., Friesecke, G., Pass, B.: Infinite-body optimal transport with Coulomb cost. Calc. Var. Partial. Differ. Equ. 54, 717–742 (2015)
De Dominicis, C.: Variational formulations of equilibrium statistical mechanics. J. Math. Phys. 3, 983–1002 (1962)
De Dominicis, C., Martin, P.C.: Stationary entropy principle and renormalization in normal and superfluid systems. I. Algebraic formulation. J. Math. Phys. 5, 14–30 (1964)
de Guzmán, M.: Differentiation of Integrals in \(R^{n}\). Lecture Notes in Mathematics, vol. 481. Springer, Berlin (1975)
De Pascale, L.: Optimal transport with Coulomb cost. Approximation and duality. ESAIM Math. Model. Numer. Anal. 49, 1643–1657 (2015)
Di Marino, S., Gerolin, A., Nenna, L.: Optimal Transportation Theory with Repulsive Costs, vol. “Topological Optimization and Optimal Transport in the Applied Sciences” of Radon Series on Computational and Applied Mathematics, ch. 9, pp. 204–256. De Gruyter (2017)
Di Marino, S., Lewin, M., Nenna, L.: Grand-canonical optimal transport. ArXiv e-prints (2022)
Dobrušin, R.L.: Investigation of conditions for the asymptotic existence of the configuration integral of Gibbs’ distribution. Theory Probab. Appl. 9, 566–581 (1964)
Dobrušin, R.L., Minlos, R.A.: Existence and continuity of pressure in classical statistical physics. Teor. Verojatnost. i Primenen. 12, 595–618 (1967)
Dreizler, R., Gross, E.: Density Functional Theory. Springer, Berlin (1990)
Ebner, C., Punyanitya, C.: Density-functional theory of simple classical fluids. II. Localized excess electron states. Phys. Rev. A 19, 856–865 (1979)
Ebner, C., Saam, W.F., Stroud, D.: Density-functional theory of simple classical fluids. I. Surfaces. Phys. Rev. A 14, 2264–2273 (1976)
Engel, E., Dreizler, R.: Density Functional Theory: An Advanced Course. Springer, New York (2011)
Evans, R.: The nature of the liquid–vapour interface and other topics in the statistical mechanics of non-uniform, classical fluids. Adv. Phys. 28, 143–200 (1979)
Evans, R.: Density Functionals in the Theory of Nonuniform Fluids, pp. 85–176. Marcel Dekker Inc, New York (1992)
Evans, R., Oettel, M., Roth, R., Kahl, G.: New developments in classical density functional theory. J. Phys. Condens. Matter 28, 240401 (2016)
Fournais, S., Lewin, M., Solovej, J.P.: The semi-classical limit of large fermionic systems. Calc. Var. Partial Differ. Equ. 57, 1–42 (2018)
Frank, R., Laptev, A., Weidl, T.: Schrödinger Operators: Eigenvalues and Lieb–Thirring Inequalities, Cambridge Studies in Advanced Mathematics. Cambridge University Press, Cambridge (2022)
Gates, D.: Rigorous results in the mean-field theory of freezing. Ann. Phys. 71, 395–420 (1972)
Gates, D.J., Penrose, O.: The van der Waals limit for classical systems. I. A variational principle. Commun. Math. Phys. 15, 255–276 (1969)
Gelb, L.D., Gubbins, K.E., Radhakrishnan, R., Sliwinska-Bartkowiak, M.: Phase separation in confined systems. Rep. Prog. Phys. 62, 1573–1659 (1999)
Georgii, H.-O.: Gibbs Measures and Phase Transitions. De Gruyter Studies in Mathematics, vol. 9, 2nd edn. Walter de Gruyter & Co., Berlin (2011)
Gigante, G., Leopardi, P.: Diameter bounded equal measure partitions of Ahlfors regular metric measure spaces. Discrete Comput. Geom. 57, 419–430 (2017)
Grewe, N., Klein, W.: Rigorous derivation of the Kirkwood–Monroe equation for small activity. J. Math. Phys. 17, 699–703 (1976)
Hainzl, C., Lewin, M., Solovej, J.P.: The thermodynamic limit of quantum Coulomb systems. Part II. Applications. Adv. Math. 221, 488–546 (2009)
Hansen, J.-P., McDonald, I.R.: Theory of Simple Liquids. Elsevier, Berlin (1990)
Hansen, J.-P., McDonald, I.: Theory of Simple Liquids (With Applications to Soft Matter). Academic Press, Cambridge (2013)
Haymet, A., Oxtoby, D.W.: A molecular theory for the solid–liquid interface. J. Chem. Phys. 74, 2559–2565 (1981)
Hohenberg, P., Kohn, W.: Inhomogeneous electron gas. Phys. Rev. 136, B864–B871 (1964)
Jansen, S., Kuna, T., Tsagkarogiannis, D.: Virial inversion and density functionals. J. Funct. Anal. 284, 109731 (2022)
Jex, M., Lewin, M., Madsen, P. Classical Density Functional Theory: The Local Density Approximation, in preparation (2023)
Kellerer, H.G.: Duality theorems for marginal problems. Z. Wahrsch. Verw. Gebiete 67, 399–432 (1984)
Kiessling, M.K.H.: On the equilibrium statistical mechanics of isothermal classical self-gravitating matter. J. Stat. Phys. 55, 203–257 (1989)
Kiessling, M.K.-H.: Statistical mechanics of classical particles with logarithmic interactions. Commun. Pure. Appl. Math. 46, 27–56 (1993)
Kiessling, M.K.-H.: The Vlasov continuum limit for the classical microcanonical ensemble. Rev. Math. Phys. 21, 1145–1195 (2009)
Kiessling, M.K.-H., Percus, J.K.: Nonuniform van der Waals theory. J. Stat. Phys. 78, 1337–1376 (1995)
Kirkwood, J.G., Monroe, E.: Statistical mechanics of fusion. J. Chem. Phys. 9, 514–526 (1941)
Kohn, W., Sham, L.J.: Self-consistent equations including exchange and correlation effects. Phys. Rev. 140, A1133–A1138 (1965)
Landers, J., Gor, G.Y., Neimark, A.V.: Density functional theory methods for characterization of porous materials. Colloids Surf. A Physicochem. Eng. Asp. 437, 3–32 (2013)
Lebowitz, J.L., Penrose, O.: Rigorous treatment of the van der Waals-Maxwell theory of the liquid–vapor transition. J. Math. Phys. 7, 98–113 (1966)
Lebowitz, J.L., Percus, J.K.: Statistical thermodynamics of nonuniform fluids. J. Math. Phys. 4, 116–123 (1963)
Levy, M., Perdew, J.P.: Tight bound and convexity constraint on the exchange-correlation-energy functional in the low-density limit, and other formal tests of generalized-gradient approximations. Phys. Rev. B 48, 11638–11645 (1993)
Lewin, M.: Geometric methods for nonlinear many-body quantum systems. J. Funct. Anal. 260, 3535–3595 (2011)
Lewin, M.: Coulomb and Riesz gases: the known and the unknown. J. Math. Phys. 63, 061101 (2022)
Lewin, M., Nam, P.T., Rougerie, N.: The mean-field approximation and the non-linear Schrödinger functional for trapped Bose gases. Trans. Am. Math. Soc. 368, 6131–6157 (2016)
Lewin, M., Lieb, E.H., Seiringer, R.: Statistical mechanics of the uniform electron gas. J. Éc. Polytech. Math. 5, 79–116 (2018)
Lewin, M., Lieb, E.H., Seiringer, R.: Floating Wigner crystal with no boundary charge fluctuations. Phys. Rev. B 100, 035127 (2019)
Lewin, M., Lieb, E.H., Seiringer, R.: Chapter in the book “density functional theory—modeling, mathematical analysis, computational methods, and applications. In: Cancès, É., Friesecke, G. (eds.) Universal Functionals in Density Functional Theory. Springer, Berlin (2019)
Lewin, M., Lieb, E.H., Seiringer, R.: The local density approximation in density functional theory. Pure Appl. Anal. 2, 35–73 (2020)
Lieb, E.H.: Density functionals for Coulomb systems. Int. J. Quantum Chem. 24, 243–277 (1983)
Lieb, E.H., Oxford, S.: Improved lower bound on the indirect Coulomb energy. Int. J. Quantum Chem. 19, 427–439 (1980)
Löwen, H.: Density functional theory of inhomogeneous classical fluids: recent developments and new perspectives. J. Phys. Condens. Matter 14, 11897–11905 (2002)
Messer, J., Spohn, H.: Statistical mechanics of the isothermal Lane–Emden equation. J. Stat. Phys. 29, 561–578 (1982)
Mietzsch, N.: The validity of the local density approximation for smooth short range interaction potentials. J. Math. Phys. 61, 113503 (2020)
Morita, T., Hiroike, K.: A new approach to the theory of classical fluids. III: general treatment of classical systems. Progr. Theor. Phys. 25, 537–578 (1961)
Parr, R., Yang, W.: Density-Functional Theory of Atoms and Molecules. International Series of Monographs on Chemistry, Oxford University Press, Oxford (1994)
Pass, B.: Multi-marginal optimal transport: theory and applications. ESAIM Math. Model. Numer. Anal. 49, 1771–1790 (2015)
Percus, J.: Frontiers in Physics, p. 33. W.A. Benjamin, Inc, New York (1964)
Percus, J.K.: Equilibrium state of a classical fluid of hard rods in an external field. J. Stat. Phys. 15, 505–511 (1976)
Percus, J.K.: Nonuniform fluids in the grand canonical ensemble. Int. J. Quantum Chem. 22, 33–48 (1982)
Percus, J.K.: One-dimensional classical fluid with nearest-neighbor interaction in arbitrary external field. J. Stat. Phys. 28, 67–81 (1982)
Percus, J.K.: Nonuniform classical fluid mixture in one-dimensional space with next neighbor interactions. J. Stat. Phys. 89, 249–272 (1997)
Perdew, J.P.: Unified theory of exchange and correlation beyond the local density approximation. In: Ziesche, P., Eschrig, H. (eds.) Electronic Structure of Solids ’91, pp. 11–20. Akademie Verlag, Berlin (1991)
Perdew, J., Sun, J.: The Lieb-Oxford Lower Bounds on the Coulomb Energy, Their Importance to Electron Density Functional Theory, and a Conjectured Tight Bound on Exchange, pp. 165–178. EMS Press, Helsinki (2022)
Perdew, J.P., Burke, K., Ernzerhof, M.: Generalized gradient approximation made simple. Phys. Rev. Lett. 77, 3865–3868 (1996)
Petrenko, S.N., Rebenko, A.L.: Superstable criterion and superstable bounds for infinite range interaction. I. Two-body potentials. Methods Funct. Anal. Topol. 13, 50–61 (2007)
Plischke, M., Bergersen, B.: Equilibrium Statistical Physics. World Scientific Publishing Company, Singapore (2006)
Ramakrishnan, T., Yussouff, M.: Theory of the liquid–solid transition. Solid State Commun. 21, 389–392 (1977)
Ramakrishnan, T.V., Yussouff, M.: First-principles order-parameter theory of freezing. Phys. Rev. B 19, 2775–2794 (1979)
Rebenko, A.L.: A new proof of Ruelle’s superstability bounds. J. Stat. Phys. 91, 815–826 (1998)
Robledo, A., Varea, C.: On the relationship between the density functional formalism and the potential distribution theory for nonuniform fluids. J. Stat. Phys. 26, 513–525 (1981)
Roth, R.: Fundamental measure theory for hard-sphere mixtures: a review. J. Phys. Condens. Matter 22, 063102 (2010)
Rougerie, N.: De Finetti theorems, mean-field limits and Bose–Einstein condensation. ArXiv e-prints, (2015)
Rozenbljum, G.V.: The distribution of the eigenvalues of the first boundary value problem in unbounded domains. Dokl. Akad. Nauk SSSR 200, 1034–1036 (1971)
Rozenblum, G.V.: Distribution of the discrete spectrum of singular differential operators. Dokl. Akad. Nauk SSSR 202, 1012–1015 (1972)
Ruelle, D.: Superstable interactions in classical statistical mechanics. Commun. Math. Phys. 18, 127–159 (1970)
Ruelle, D.: Statistical Mechanics. Rigorous Results. World Scientific, Imperial College Press, Singapore, London (1999)
Saam, W.F., Ebner, C.: Density-functional theory of classical systems. Phys. Rev. A 15, 2566–2568 (1977)
Seidl, M., Di Marino, S., Gerolin, A., Nenna, L., Giesbertz, K.J.H., Gori-Giorgi, P.: The strictly-correlated electron functional for spherically symmetric systems revisited. ArXiv e-prints (2017)
Singh, Y.: Density-functional theory of freezing and properties of the ordered phase. Phys. Rep. 207, 351–444 (1991)
Spohn, H.: On the Vlasov hierarchy. Math. Methods Appl. Sci. 3, 445–455 (1981)
Stell, G.: Frontiers in Physics, p. 171. W.A. Benjamin INC, New York (1964)
Stillinger, F.H., Buff, F.P.: Equilibrium statistical mechanics of inhomogeneous fluids. J. Chem. Phys. 37, 1–12 (1962)
Sun, J., Perdew, J.P., Ruzsinszky, A.: Semilocal density functional obeying a strongly tightened bound for exchange. Proc. Nat. Acad. Sci. U.S.A. 112, 685–689 (2015)
Sun, J., Ruzsinszky, A., Perdew, J.P.: Strongly constrained and appropriately normed semilocal density functional. Phys. Rev. Lett. 115, 036402 (2015)
Sun, J., Remsing, R.C., Zhang, Y., Sun, Z., Ruzsinszky, A., Peng, H., Yang, Z., Paul, A., Waghmare, U., Wu, X., Klein, M.L., Perdew, J.P.: Accurate first-principles structures and energies of diversely bonded systems from an efficient density functional. Nat. Chem. 8, 831–836 (2016)
Tao, J., Perdew, J.P., Staroverov, V.N., Scuseria, G.E.: Climbing the density functional ladder: nonempirical meta-generalized gradient approximation designed for molecules and solids. Phys. Rev. Lett. 91, 146401 (2003)
Torquato, S., Stillinger, F.H.: New conjectural lower bounds on the optimal density of sphere packings. Exp. Math. 15, 307–331 (2006)
Vanderlick, T.K., Davis, H.T., Percus, J.K.: The statistical mechanics of inhomogeneous hard rod mixtures. J. Chem. Phys. 91, 7136–7145 (1989)
Viazovska, M.: Almost impossible \({E}_{8}\) and Leech lattices. EMS Mag. 121, 4–8 (2021)
Weidl, T.: On the Lieb–Thirring constants \(L_{\gamma ,1}\) for \(\gamma \ge 1/2\). Commun. Math. Phys. 178, 135–146 (1996)
Wu, J.: Density functional theory for chemical engineering: from capillarity to soft materials. AIChE J. 52, 1169–1193 (2006)
Yang, A.J.M., Fleming, P.D., Gibbs, J.H.: Molecular theory of surface tension. J. Chem. Phys. 64, 3732–3747 (1976)
Acknowledgements
ML thanks Rupert L. Frank for providing him with a preliminary version of the book [35]. This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (Grant Agreement MDFT No 725528 of ML). MJ also received financial support from the Ministry of Education, Youth and Sport of the Czech Republic under the Grant No. RVO 14000.
Funding
Open access publishing supported by the National Technical Library in Prague.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Communicated by Michael Kiessling.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Jex, M., Lewin, M. & Madsen, P.S. Classical Density Functional Theory: Representability and Universal Bounds. J Stat Phys 190, 73 (2023). https://doi.org/10.1007/s10955-023-03086-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10955-023-03086-7