Abstract
We prove a rigorous upper bound on the correlation energy of interacting fermions in the mean-field regime for a wide class of interaction potentials. Our result covers the Coulomb potential, and in this case we obtain the analogue of the Gell-Mann–Brueckner formula \(c_{1}\rho \log \left( \rho \right) +c_{2}\rho \) in the high density limit. We do this by refining the analysis of our bosonization method to deal with singular potentials, and to capture the exchange contribution which is absent in the purely bosonic picture.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Although interacting Fermi gases have been studied extensively from the beginning of quantum mechanics, their rigorous understanding remains one of the major issues of condensed matter physics. From first principles, a system of N fermions in \({\mathbb {R}}^{3}\) can be described by a Schrödinger equation in \({\mathbb {R}}^{3N}\), subject to the anti-symmetry condition between the variables due to Pauli’s exclusion principle. However, this fundamental theory becomes very complex when \(N\rightarrow \infty \), leading to the need of various approximations. Justifying these approximations is an important task of mathematical physics.
One of the most basic approximations for fermions is the Hartree–Fock (HF) theory. In HF theory, the particles are assumed to be independent, namely the HF energy is computed by restricting the consideration to Slater determinants. In spite of its simplicity, the HF theory is used very successfully in computational physics and chemistry to compute the ground state energy of atoms and molecules. The accuracy of the HF energy (in comparison to the full quantum energy) for large Coulomb systems was investigated in the 1990 s by Fefferman and Seco [12], Bach [1], and Graf and Solovej [15].
On the other hand, for the electron gas (e.g. jellium, a homogeneous electron gas moving in a background of uniform positive charge), the HF theory is essentially trivial in the high density limit since the HF energy only contains an exponentially small correction to the energy of the Fermi state, the ground state of the non-interacting gas [14]. Therefore, computing the correlation energy,Footnote 1 namely the correction to the HF energy, is a crucial task to understand the effect of the interaction. It was already noticed by Wigner in 1934 [23] and confirmed by Heisenberg in 1947 [17] that it would be very challenging to accomplish this task within perturbation theory due to the long-range property of the Coulomb potential. Nevertheless, a remarkable attempt in this direction was done by Macke in 1950 [18] when he used a partial resummation of the divergent series to predict the leading order contribution \(c_{1}\rho \log \left( \rho \right) \) of the correlation energy (with density \(\rho \rightarrow \infty \)).
A cornerstone in the correlation analysis of the electron gas is the random phase approximation (RPA) which was proposed by Bohm and Pines in the 1950 s [7,8,9, 19]. As an important consequence of the Bohm-Pines RPA theory, the electron gas could be decoupled into collective plasmon excitations and quasi-electrons that interacted via a screened Coulomb interaction. The latter fact justified the independent particle approach commonly used for many-body fermion systems. The justification of the RPA was a major question in condensed matter and nuclear physics in the late 1950 s and 1960 s. An important justification was given by Gell-Mann and Brueckner in 1957 [13] when they formally derived the RPA from a resummation of Feynman diagrams where each term separately diverges but the sum is convergent. More precisely, by considering the diagrams corresponding to the interaction of pairs of fermions, one from inside and one from outside the Fermi state, Gell-Mann and Brueckner were able to produce the leading order contribution \(c_{1}\rho \log \left( \rho \right) +c_{2}\rho \) of the correlation energy.
Soon after the achievement of Gell-Mann and Brueckner, Sawada [21] and Sawada–Brueckner–Fukuda–Brout [22] proposed an alternative approach to the RPA where the pairs of electrons are interpreted as bosons, leading to an effective Hamiltonian which is quadratic in terms of the bosonic creation and annihilation operators. Note that within the purely bosonic picture, quadratic Hamiltonians can be diagonalized by Bogolubov transformations [6], and hence their spectra can be computed explicitly. Therefore, the Hamiltonian approach in [21, 22] is conceptually more transparent than the resummation method in [13]. Unfortunately the analysis in [21, 22] only gives the contribution \(c_{1}\rho \log \left( \rho \right) \) of the correlation energy because the exchange contribution of order \(\rho \) is missed in the purely bosonic picture.
Recently, the bosonization argument in [21, 22] has been revisited and made rigorous in the mean-field regime with smooth interaction potentials [2,3,4,5, 10, 11, 16]. In principle, if the interaction is sufficiently weak, then the non-bosonizable terms of the interaction energy are negligible, and the quasi-bosonic Hamiltonian can be analyzed with great precision. In particular, the correlation energy has been successfully computed to the leading order [2, 3, 5, 10]. However, the boundedness of interaction potentials is crucial for all of these works, and extending the analysis to the electron gas remains a very interesting open question.
In the present paper, we will give the first rigorous upper bound to the correlation energy of the electron gas in the mean-field regime. Our bound is consistent with the Gell-Mann–Brueckner formula \(c_{1}\rho \log \left( \rho \right) +c_{2}\rho \) for jellium in the high density limit [13]. Although our trial state argument is inspired by the bosonization method in [21, 22], we are able to capture correctly the exchange contribution by carefully distinguishing the purely bosonic picture and the quasi-bosonic one. On the mathematical side, we will use the general method in our recent work [10], but several new estimates are needed to deal with the singularity of the potential. The matching lower bound in the mean-field regime, as well as the corresponding result in the thermodynamic limit, remain open, and we hope to be able to come back to these issues in the future.
On the technical side, the key idea of [10] is that while the bosonic property of fermionic pairs holds only in an average sense, this weak bosonic property is sufficient to extract correctly the correlation energy by implementing a quasi-bosonic Bogolubov transformation. The main contribution of the present paper is to show that this approach is also sufficient to extract the exchange correction to the purely bosonic computation. On the other hand, another bosonization method has been proposed in [2], where the bosonic property of fermionic pairs is strengthened by using suitable patches on the Fermi sphere for the quasi-bosonic creation and annihilation operators, making the comparison with the purely bosonic computation significantly easier. In fact, as explained in [5], the approach in [2] can be extended to give the leading order of the correlation energy upper bound for potentials satisfying \(\sum V_k^2|k| < \infty \). Although this condition only barely fails for the Coulomb potential, there is a huge difference to the Coulomb case. While for \(\sum V_k^2|k| < \infty \) the bosonic correlation contribution is of order \(k_F\) and the exchange correlation is of lower order \(o(k_F)\), for the Coulomb potential the exchange contribution raises to the order \(k_F\), whereas the bosonic correlation behaves as \(k_F \log (k_F)\), which makes the Coulomb case much more challenging (here \(k_F\) is the radius of the Fermi ball). In particular, the method in [2, 5] does not seem to capture the exchange contribution which is indeed important for the Coulomb potential.
1.1 Main result
Let \({\mathbb {T}}^{3}=\left[ 0,2\pi \right] ^{3}\) with periodic boundary conditions. Let \(V:{\mathbb {T}}^{3}\rightarrow {\mathbb {R}}\) be defined by
with Fourier coefficients satisfying
We implicitly assume that \({\hat{V}}_{0}=0\), or equivalently that the “background” has been subtracted.
For \(k_{F}>0\), let \(N=\left| B_{F}\right| \) be the number of integer points in the Fermi ball \(B_{F}={\overline{B}}\left( 0,k_{F}\right) \cap {\mathbb {Z}}^{3}\) and consider the mean-field Hamiltonian
on the fermionic space \({\mathcal {H}}_{N}=\bigwedge ^{N}{\mathfrak {h}}\) with \({\mathfrak {h}}=L^{2}\left( {\mathbb {T}}^{3}\right) \).Footnote 2 The leading order of the ground state energy of \(H_{N}\) is given by the Fermi state
It is straightforward to find (see e.g. [10, Eqs. (1.10) and (1.20)])
where for every \(k\in {\mathbb {Z}}_{*}^{3}\), we denoted the lune associated to k by
Our main result concerns the corrections to the ground state energy. For every \(k\in {\mathbb {Z}}_{*}^{3}\), define
We will prove the following:
Theorem 1.1
As \(k_{F}\rightarrow \infty \) it holds that
where
is the bosonic contribution and
is the exchange contribution, for a constant \(C>0\) depending only on \(\sum _{k\in {\mathbb {Z}}_{*}^{3}}{\hat{V}}_{k}^{2}\).
Some remarks on our result:
1. Consider the Coulomb potential, \({\hat{V}}_{k}=g\left| k\right| ^{-2}\) for a constant \(g>0\). Following the analysis of [15], we find that
where \(E_{\textrm{FS}}\) contains the kinetic energy of order \(k_{F}^{5}\), the direct interaction energy of order \(k_{F}^{5}\) and the exchange interaction energy of order \(k_{F}^{3}\). Furthermore, it is straightforward to adapt the proof in [14] to see that the difference between \(E_{\textrm{FS}}\) and the HF energy is exponentially small as \(k_{F}\rightarrow \infty \). Therefore our result really concerns the correlation energy, which we bound from above by
plus the error term of order
In fact, it is easy to verify (1.10) using \(\sum _{|k|\le k_F} {{\hat{V}}}_k^2 |k| \sim \log ( k_F)\) and \(\sum _{|k|\ge k_F} {{\hat{V}}}_k^2 \sim k_F^{-1}\). To see the leading order behavior \(E_{\textrm{corr},\textrm{ex}}\sim k_F\) in (1.9), one may use that \(\lambda _{k,p} \sim |k| \max \{|k|,k_F\} \) (in an average sense) and that \(|L_k| \sim k_F^2 \min \left\{ |k|,k_{F}\right\} \). Moreover, from the expansion
we have
and hence the asymptotic behavior \(E_{\textrm{corr},\textrm{bos}}\sim - k_F \log (k_F)\) in (1.9) follows similarly.
Note that the correlation energy \(E_{\textrm{corr},\textrm{bos}}+E_{\textrm{corr},\textrm{ex}}\) in Theorem 1.1 is exactly the mean-field analogue of the Gell-Mann–Brueckner formula \(c_{1}\rho \log \left( \rho \right) +c_{2}\rho \) for jellium in the thermodynamic limit [13]. Indeed, substituting \(k_{F}^{-1}{\hat{V}}_{k}\rightarrow 4\pi e^{2}\left| k\right| ^{-2}\) and \(\left( 2\pi \right) ^{3}\rightarrow \) the volume \(\Omega \), \(E_{\textrm{corr},\textrm{bos}}\) agrees with [22, Eq. (34)] which is equivalent with [13, Eq. (19)] (accounting also for spin). In the thermodynamic limit, the right-hand side of (1.12) always diverges, no matter if we have the mean-field scaling or not, but the full expression on the left-hand side converges in either case.
Furthermore, we also obtain the exchange contribution \(E_{\textrm{corr},\textrm{ex}}\), which is the analogue of [13, Eq. (9)], which is completely absent from the bosonic model of [22]. With the same substitutions as above, the exchange contribution takes the form
which agrees with [20, Eq. (9.14)] (noting that we take \(m=1/2\)).
2. If the potential satisfies \(\sum _{k\in {\mathbb {Z}}_{*}^{3}}{\hat{V}}_{k}^{2}\left| k\right| <\infty \), and so is less singular than the Coulomb potential, then the bosonic contribution \(E_{\textrm{corr},\textrm{bos}}\) is of order \(k_{F}\), while the exchange contribution is \(o\left( k_{F}\right) \). In this case, the upper bound
is already known; see [10, Remark 1 after Theorem 1.3] and [5, Appendix A]. Under the stronger condition \(\sum {\hat{V}}_{k}\left| k\right| <\infty \) the matching lower bound was established in [5, 10] (see also [2] and [3] for previous results on the upper and lower bounds, respectively, when \({\hat{V}}_{k}\) is finitely supported). In comparison, the Coulomb potential is much more challenging to analyze, since it leads to an additional logarithmic factor in the bosonic contribution, and lifts the exchange contribution to the order \(k_{F}\). On the mathematical side, working with the Coulomb potential thus requires a substantial refinement of the bosonization method compared to the existing works.
3. Although the case of the greatest physical interest is the Coulomb potential, our result covers a far greater class of singular potentials: Under the condition \(\sum _{k{\mathbb {Z}}_{*}^{3}}{\hat{V}}_{k}^{2}<\infty \), the error term \(\sqrt{\sum _{k\in {\mathbb {Z}}_{*}^{3}}{\hat{V}}_{k}^{2}\min \left\{ \left| k\right| ,k_{F}\right\} }\) is of order at most \(O\left( \sqrt{k_{F}}\right) \), and so Theorem 1.1 is always a meaningful result.
1.2 Overview of the proof
We will construct a trial state by applying a quasi-bosonic Bogolubov transformation to the Fermi state \(\psi _{\textrm{FS}}\). We will follow the general formulation of the bosonization method in [10]. We quickly recall this here for the reader’s convenience, after which we explain the new components of the proof and the structure of the rest of the paper.
1.2.1 Rewriting the Hamiltonian
We will use the second quantization formalism in which we associate to every plane wave state \(u_{p}\) of equation (1.4) the creation and annihilation operators \(c_{p}^{*}=a^{*}(u_p)\) and \(c_{p}=a(u_p)\) on the fermionic Fock space. They obey the canonical anti-commutation relations (CAR)
The Hamiltonian \(H_{N}\) of equation (1.3) can then be written as \( H_{N}=H_{\textrm{kin}}+k_{F}^{-1}H_{\textrm{int}} \) where
Note that the Fermi state \(\psi _{\textrm{FS}}\) obeys (\(B_{F}^{c}\) denoting the complement of \(B_{F}\) with respect to \({\mathbb {Z}}^{3}\))
and so it follows by the CAR that the kinetic energy of the Fermi state is
We define the localized kinetic operator \(H_{\textrm{kin}}^{\prime }\) by
where we for the last identity used the “particle-hole symmetry”
From the last identity of equation (1.19) it is clear that \(H_{\textrm{kin}}^{\prime }\) is non-negative.
We normal-order \(H_{\textrm{int}}\) with respect to \(\psi _{\textrm{FS}}\): Using the CAR and the fact that \(\sum _{p\in {\mathbb {Z}}^{3}}c_{p}^{*}c_{p}={\mathcal {N}}=N\) on \({\mathcal {H}}_{N}\), it factorizes as
Decomposing for every \(k\in {\mathbb {Z}}_{*}^{3}\)
we can write
Using the CAR again it is easy to compute that
whence (using also that \({\hat{V}}_{k}={\hat{V}}_{-k}\))
Note that the first sum is finite as \(\left| L_{k}\right| =N\) for \(\left| k\right| >2k_{F}\). It is easily verified that \(D_{k}\psi _{\textrm{FS}}=D_{k}^{*}\psi _{\textrm{FS}}=B_{k}\psi _{\textrm{FS}}=0\), so we deduce from this identity that
and we summarize the calculations above in the following:
Proposition 1.2
It holds that
where \(E_{\textrm{FS}}=\left\langle \psi _{\textrm{FS}},H_{N}\psi _{\textrm{FS}}\right\rangle \) and the cubic and quartic terms, \({\mathcal {C}}\) and \({\mathcal {Q}}\), are defined by
We will prove that the cubic and quartic terms are negligible, and so the main contribution to the correlation energy comes from the bosonizable terms
We will write these in terms of quasi-bosonic operators, which will lead us to define a quasi-bosonic Bogolubov transformation that serves to effectively diagonalize them.
1.2.2 The quasi-bosonic quadratic Hamiltonian
We define the excitation operators \(b_{k,p}^{*}\), \(b_{k,p}\), for \(k\in {\mathbb {Z}}_{*}^{3}\) and \(p\in L_{k}\), by
The name is due to the fact that \(b_{k,p}^{*}\) acts by annihilating a state with momentum \(p-k\in B_{F}\) and creating a state with momentum \(p\in B_{F}^{c}\), i.e. it excites the state \(p-k\) to the state p.
For the purpose of computations it is convenient to also introduce a basis-independent notation for the quasi-bosonic operators. Consider for \(k\in {\mathbb {Z}}_{*}^{3}\) the auxilliary space \(\ell ^{2}(L_{k})\), which we will consider only as a real vector space, with standard orthonormal basis \(\left( e_{p}\right) _{p\in L_{k}}\). For any \(k\in {\mathbb {Z}}_{*}^{3}\) and \(\varphi \in \ell ^{2}(L_{k})\) we define the generalized excitation operators \(b_{k}(\varphi )\) and \(b_{k}^{*}(\varphi )\) by
Note that the assignments \(\varphi \mapsto b_{k}(\varphi ),\,b_{k}^{*}(\varphi )\) are both linear (as we only consider \(\ell ^{2}(L_{k})\) as a real vector space). In this notation we simply have that \(b_{k}\left( e_{p}\right) =b_{k,p}\). A short calculation using the CAR shows that these operators are quasi-bosonic in the following sense:
Lemma 1.3
For any \(k,l\in {\mathbb {Z}}_{*}^{3}\), \(\varphi \in \ell ^{2}(L_{k})\) and \(\psi \in \ell ^{2}(L_l)\) it holds that
where the exchange correction \(\varepsilon _{k,l}\left( \varphi ;\psi \right) \) is given by
Note that in the purely bosonic picture the exchange correction is absent. In our quasi-bosonic case, these corrections are small but non-zero; it will be important to keep careful track of them as it is these that gives rise to the exchange contribution \(E_{\textrm{corr},\textrm{ex}}\).
For any operators A, B on \(\ell ^{2}(L_{k})\), we define the associated quadratic operators \(Q_{1}^{k}(A)\), \(Q_{2}^{k}(B)\) on \({\mathcal {H}}_{N}\) byFootnote 3
and
Defining the operator \(P_{k}\) on \(\ell ^{2}(L_{k})\) by
we can express the interaction part of the bosonizable terms as
The localized kinetic operator \(H_{\textrm{kin}}^{\prime }\) cannot be written exactly in a quadratic quasi-bosonic form, but due to the commutation relation
(see [10, Eq. (1.76)]) and the quasi-bosonicity of the \(b_{k,p}^{*}\) operators, it is sensible to consider it analogous to a quadratic operator of the form
where the operators \(h_{k}\) on \(\ell ^{2}(L_{k})\) are simply defined by \(h_{k}e_{p}=\lambda _{k,p}e_{p}\). In all we thus consider the bosonizable terms as being analogous to a quasi-bosonic quadratic operator as
1.2.3 The quasi-bosonic Bogolubov transformation
If the quadratic Hamiltonian on the right-hand side of equation (1.36) was exactly bosonic, it could be diagonalized by a Bogolubov transformation. Motivated by this we define such a transformation in the quasi-bosonic setting, while keeping careful track of the additional terms arising from the exchange correction.
Let \(K_{k}:\ell ^{2}(L_{k})\rightarrow \ell ^{2}(L_{k})\), \(k\in {\mathbb {Z}}_{*}^{3}\), be a collection of symmetric operators satisfying
Then we define the associated quasi-bosonic Bogolubov kernel \({\mathcal {K}}\) on \({\mathcal {H}}_{N}\) by
It is obvious from the second equation that \({\mathcal {K}}\) is skew-symmetric; \({\mathcal {K}}\) thus generates a unitary transformation \(e^{{\mathcal {K}}}:{\mathcal {H}}_{N}\rightarrow {\mathcal {H}}_{N}\) - the quasi-bosonic Bogolubov transformation.
We consider the case \(\sum _{k\in {\mathbb {Z}}_{*}^{3}}\left\| K_{k}\right\| _{\textrm{HS}}^{2}<\infty \), in which case \({\mathcal {K}}\) is not only well-defined but even bounded as an operator on \({\mathcal {H}}_{N}\), as we will prove in the next section.
We choose the operators \(\left( K_{k}\right) \) such that \(e^{{\mathcal {K}}}\) would diagonalize the right-hand side of equation (1.36) if it was exactly bosonic. As explained in [10, Sect. 3] the diagonalizing kernel is
Keeping careful track of the quasi-bosonic corrections, the action of \(e^{{\mathcal {K}}}\) on the bosonizable terms are as follows:
Theorem 1.4
Let \(H_{\textrm{eff}}\) be as in (1.27). Assume \(\sum _{k\in {\mathbb {Z}}_{*}^{3}}{\hat{V}}_{k}^{2}<\infty \). Then \(e^{{\mathcal {K}}}\) is well-defined and
where for any symmetric operators \(A_{k},B_{k}:\ell ^{2}(L_{k})\rightarrow \ell ^{2}(L_{k})\) we define
and for \(t\in \left[ 0,1\right] \) the operators \(A_{k}(t),B_{k}(t):\ell ^{2}(L_{k})\rightarrow \ell ^{2}(L_{k})\) are given by
This result is essentially the same as [10, Proposition 5.7], except that we now do not introduce a momentum cut-off and assume only that \(\sum _{k\in {\mathbb {Z}}_{*}^{3}}{\hat{V}}_{k}^{2}<\infty \). For the readers convenience, we include in Appendix A the proof of the identity of Theorem 1.4 - that the condition \(\sum _{k\in {\mathbb {Z}}_{*}^{3}}{\hat{V}}_{k}^{2}<\infty \) is sufficient to define \(e^{{\mathcal {K}}}\) is proved in the next section.
1.2.4 Outline of the paper
Now we come to the main part of the paper. We will choose as our trial state \(\Psi =e^{-{\mathcal {K}}}\psi _{\textrm{FS}}\). As mentioned the cubic and quartic terms are negligible, so the energy of our trial state energy is by Theorem 1.4, to leading order,
The main task will thus be to extract the exchange contribution \(E_{\textrm{corr},\textrm{ex}}\) from this last term. The outline of the paper is as follows:
In Sect. 2 we show that \(e^{{\mathcal {K}}}\) is well-defined by proving that \({\mathcal {K}}\) is bounded under the condition \(\sum _{k\in {\mathbb {Z}}_{*}^{3}}{\hat{V}}_{k}^{2}<\infty \). We do this by employing a type of higher-order fermionic estimate, resulting in a bound of the form
which will also be crucial in allowing us to control \({\mathcal {N}}_{E}\) later.
In Sect. 3 we establish various bounds on the one-body operators \(K_{k}\), \(A_{k}(t)\) and \(B_{k}(t)\). This is conceptually similar to the one-body analysis in our previous paper [10], but we must refine several estimates in order to establish control using only the assumption that \(\sum _{k\in {\mathbb {Z}}_{*}^{3}}{\hat{V}}_{k}^{2}<\infty \).
In Sect. 4 comes the main new work: We engage in a detailed study of the exchange terms \(\mathcal {{\mathcal {E}}}_{k}^{1}(A_k)\) and \(\mathcal {{\mathcal {E}}}_{k}^{2}(B_k)\) so that we can extract \(E_{\textrm{corr},\textrm{ex}}\) from the last term of equation (1.40), first in the form
and then analyze this expression further to obtain the leading order of this, which is precisely \(E_{\textrm{corr},\textrm{ex}}\) as given in Theorem 1.1.
Finally in Sect. 5 we control the non-bosonizable cubic and quartic terms, and bound the number operator \({\mathcal {N}}_{E}\) and its powers by a Gronwall argument. We end the paper by concluding Theorem 1.1.
2 The Bogolubov Kernel
We consider the kernel \({\mathcal {K}}\) defined by (1.38). We prove the following:
Proposition 2.1
Let \(K_{l}:\ell ^{2}(L_l)\rightarrow \ell ^{2}(L_l)\), \(l\in {\mathbb {Z}}_{*}^{3}\), be a collection of symmetric operators. Then provided \(\sum _{l\in {\mathbb {Z}}_{*}^{3}}\left\| K_{l}\right\| _{\textrm{HS}}^{2}<\infty \), the expression
defines a bounded operator \({\mathcal {K}}:{\mathcal {H}}_{N}\rightarrow {\mathcal {H}}_{N}\), and for any \(\Psi ,\Phi \in {\mathcal {H}}_{N}\) we have
Note that \({\mathcal {N}}_{E}=\sum _{p\in B_{F}^c} c_{p}^*c_{p}=\sum _{p\in B_{F}}c_{p}c_{p}^{*}\le \left| B_{F}\right| =N\) on \({\mathcal {H}}_{N}\). Moreover, it was shown in [10] (see also Theorem 3.1) that the kernels in (1.39) satisfy \(\left\| K_{k}\right\| _{\textrm{HS}}\le C{\hat{V}}_{k}\), and hence the boundedness of \({\mathcal {K}}\) follows from the assumption \(\sum _{k\in {\mathbb {Z}}_*^3} {{\hat{V}}}_k^2<\infty \). Let us write
and focus on the boundedness of \({\mathcal {K}}\). Since
for any \(\Psi ,\Phi \in {\mathcal {H}}_{N}\) we may estimate by the Cauchy–Schwarz inequality
The operator appearing under the root can be written as
Let us estimate the following general expression, with some coefficients \(A_{p,q,r}\),
2.1 A higher order fermionic estimate
Note that the Cauchy–Schwarz inequality trivially implies that
but this is non-optimal for fermionic states. The “standard fermionic estimate” states that
which can be proved by appealing to the CAR as follows:
One can imagine generalizing this to quadratic expressions of the form \(\sum _{p,q}A_{p,q}c_{p}c_{q}\), but this fails since the CAR only yields a commutation relation for such expressions, and not an anticommutation relation. However, for cubic expressions, such as \(\sum _{p,q,r}A_{p,q,r}c_{p}^{*}c_{q}c_{r}\), the CAR does yield an anticommutation relation, allowing the trick to be applied. The anticommutator is of course not constant, but rather a combination of quadratic, linear and constant expressions, but this still yields a reduction in “number operator order”, which will be crucial for our estimation of \(e^{{\mathcal {K}}}{\mathcal {N}}_{E}^{m}e^{-{\mathcal {K}}}\) later on. We will need the following basic anticommutator:
Lemma 2.2
For any \(p,p'\in B_{F}^{c}\) and \(q,q',r,r'\in B_{F}\) it holds that
We can now conclude the desired bound:
Proposition 2.3
Let \(A_{p,q,r}\in {\mathbb {C}}\) for \(p\in B_{F}^{c}\) and \(q,r\in B_{F}\) with \(\sum _{p\in B_{F}^{c}}\sum _{q,r\in B_{F}}\left| A_{p,q,r}\right| ^{2}<\infty \) be given. Then for any \(\Psi \in {\mathcal {H}}_{N}\)
Proof
As in the proof of the standard fermionic estimate (2.8), we have
Hence, by the identity of Lemma 2.2, we bound the left-hand side by
We estimate the different types of expressions appearing above. Firstly, by the standard fermionic estimate (2.8),
and likewise for the other two terms on the first line of equation (2.9). For the terms on the second line we similarly estimate
The terms on the third line of equation (2.9) all factorize in a manifestly non-positive fashion, and so can be dropped, while for the fourth line
Lastly, the terms on the fifth line are seen to simply be constant and easily bounded by \(\sum _{p\in B_{F}^{c}}\sum _{q,r\in B_{F}}\left| A_{p,q,r}\right| ^{2}\), whence the proposition follows. \(\quad \square \)
We can now conclude the following bound for \(\tilde{{\mathcal {K}}}\), which in turn implies Proposition 2.1.
Proposition 2.4
For any \(\Psi ,\Phi \in {\mathcal {H}}_{N}\) it holds that
Proof
By (2.3) and (2.4), combined with the estimate of Proposition 2.3, we can bound
The sum inside the first square root is exactly equal to \(\sum _{l\in {\mathbb {Z}}_{*}^{3}}\left\| K_{l}\right\| _{\textrm{HS}}^{2}\).
3 Analysis of the One-Body Operators
In this section we analyze the operators \(K_{k}\), \(A_{k}(t)\) and \(B_{k}(t)\) which appear in Theorem 1.4, obtaining the following:
Theorem 3.1
For any \(k\in {\mathbb {Z}}_{*}^{3}\) it holds that
Moreover, for all \(p,q\in L_{k}\) and \(t\in \left[ 0,1\right] \),
for a constant \(C>0\) independent of all relevant quantities.
The analysis of this section is similar to that of [10, Sect. 7], but compared to that section, the estimates of this section are considerably more precise: We quantify the error of the upper bound on \(\left\langle e_{p},\left( -K_{k}\right) e_{q}\right\rangle \), obtain elementwise estimates for \(A_{k}(t)\) and \(B_{k}(t)\) (rather than only estimates for the norm \(\left\| \cdot \right\| _{\infty ,2}\) as in [10]), and determine the leading term of the operator \(\int _{0}^{1}B_{k}(t)dt\) which will be needed to extract the exchange contribution in the next section.
3.1 Matrix element estimates for K-quantities
To ease the notation we will abstract the problem slightly: Instead of \(\ell ^{2}(L_{k})\) we consider a general n-dimensional Hilbert space \(\left( V,\left\langle \cdot ,\cdot \right\rangle \right) \), let \(h:V\rightarrow V\) be a positive self-adjoint operator on V with eigenbasis \(\left( x_{i}\right) _{i=1}^{n}\) and eigenvalues \(\left( \lambda _{i}\right) _{i=1}^{n}\), and let \(v\in V\) be any vector such that \(\left\langle x_{i},v\right\rangle \ge 0\) for all \(1\le i\le n\), and let \(P_{w}\left( \cdot \right) =\left\langle w,\cdot \right\rangle w\) be the projection onto \(w\in V\). Theorem 3.1 will then be obtained at the end by insertion of the particular operators \(h_{k}\) and \(P_{k}\).
We define \(K:V\rightarrow V\) by
As \(\big (h^{2}+2P_{h^{\frac{1}{2}}v}\big )^{\frac{1}{2}} \ge h\) we see that \(K\le 0\). In [10, Sect. 7.2] we proved the following result.
Proposition 3.2
For all \(1\le i,j\le n\) it holds that
Below it will be more convenient to consider the hyperbolic functions \(\sinh \left( -2K\right) \) and \(\cosh \left( -2K\right) \) rather than \(e^{-2K}\) and \(e^{2K}\). The previous proposition implies the following for these operators:
Corollary 3.3
For any \(1\le i,j\le n\) it holds that
Proof
These bounds follow from Proposition 3.2 and the identities
\(\quad \square \)
Now we extend our elementwise estimates to more general operators. These estimates are similar to those of Proposition 7.10 of [10], but more precise. First we consider K itself:
Proposition 3.4
For any \(1\le i,j\le n\) it holds that
Proof
From the identity
which follows by the Mercator series, we thus have that \(-K=\frac{1}{2}\sum _{m=1}^{\infty }\frac{1}{m}\left( 1-e^{2K}\right) ^{m}\). Noting that Proposition 3.2 in particular implies that \(\left\langle x_{i},\left( 1-e^{2K}\right) x_{j}\right\rangle \ge 0\) for all \(1\le i,j\le n\), whence also \(\left\langle x_{i},\left( 1-e^{2K}\right) ^{m}x_{j}\right\rangle \ge 0\) for any \(m\in {\mathbb {N}}\), we may estimate
which is the lower bound. This similarly implies that \(\left\langle x_{i},\left( -K\right) ^{m}x_{j}\right\rangle \ge 0\) for all \(1\le i,j\le n\), \(m\in {\mathbb {N}}\), so the upper bound now also follows from Proposition 3.2 by noting that
The proof of Proposition 3.4 is complete. \(\quad \square \)
The fact that \(\left\langle x_{i},\left( -K\right) ^{m}x_{j}\right\rangle \ge 0\) for all \(1\le i,j\le n\), \(m\in {\mathbb {N}}\), has the important consequence that for any such i and j, the functions
are non-negative and convex for \(t\in \left[ 0,\infty \right) \), as follows by considering the Taylor expansions of the operators involved. This allows us to extend the bounds of Corollary 3.3 to arbitrary \(t\in \left[ 0,1\right] \):
Proposition 3.5
For all \(1\le i,j\le n\) and \(t\in \left[ 0,1\right] \) it holds that
Proof
By the noted convexity we immediately conclude the upper bounds
and by non-negativity of \(\left\langle x_{i},\left( \sinh \left( -tK\right) +tK\right) x_{j}\right\rangle \) and Proposition 3.4, the lower bound
Lastly we can apply the non-negativity of the hyperbolic operators to conclude the bound for \(e^{tK}-1\) as
\(\quad \square \)
3.2 Matrix element estimates for A(t) and B(t)
We now consider operators \(A(t),B(t):V\rightarrow V\) defined by
for \(t\in \left[ 0,1\right] \). We decompose these as
with
We begin by estimating the \(e^{tK}P_{v}e^{tK}\) terms:
Proposition 3.6
For all \(1\le i,j\le n\) and \(t\in \left[ 0,1\right] \) it holds that
Proof
Writing
we see that
Now, by Proposition 3.5 we can for any \(1\le i\le n\) estimate
whence the claim follows. \(\quad \square \)
Note that for \(\left\langle x_{i},e^{tK}P_{v}e^{tK}x_{j}\right\rangle \) this in particular implies the bound
We now consider \(A_{h}(t)\) and \(B_{h}(t)\):
Proposition 3.7
For all \(1\le i,j\le n\) and \(t\in \left[ 0,1\right] \) it holds that
Proof
The estimates of Proposition 3.5 imply that
and
The latter estimate only relied on the inequality
which is also true for \(C_{K}(t)\), so the terms \(C_{K}(t)h\,C_{K}(t)\), \(C_{K}(t)h\,S_{K}(t)\) and \(S_{K}(t)h\,C_{K}(t)\) also obey this estimate. It thus only remains to bound \(tP_{v}-\left\{ h,S_{K}(t)\right\} \). From Proposition 3.5 we see that
whence
\(\square \)
Combining equation (3.16) and Proposition 3.7 we conclude the following:
Proposition 3.8
For all \(1\le i,j\le n\) and \(t\in \left[ 0,1\right] \) it holds that
3.2.1 Analysis of \(\left\{ K,B(t)\right\} \) and \(\int _{0}^{1}B(t)dt\)
We end by estimating \(\left\{ K,B(t)\right\} \) and \(\int _{0}^{1}B(t)dt\), the latter of which will be needed for the analysis of the exchange contribution in the next section.
Proposition 3.9
For all \(1\le i,j\le n\) and \(t\in \left[ 0,1\right] \) it holds that
Proof
Using the Propositions 3.4 and 3.8 we see that
This estimate is also valid for \(\left| \left\langle x_{i},B(t)Kx_{j}\right\rangle \right| \) whence the claim follows. \(\quad \square \)
Proposition 3.10
For all \(1\le i,j\le n\) it holds that
Proof
Noting that \(\frac{1}{2}\left\langle x_{i},v\right\rangle \left\langle v,x_{j}\right\rangle =\frac{1}{2}\left\langle x_{i},P_{v}x_{j}\right\rangle \) and that
we can estimate using the Propositions 3.6 and 3.7 that
\(\square \)
3.2.2 Insertion of the particular operators \(h_{k}\) and \(P_{k}\)
Recall that the particular operators we must consider are \(h_{k},P_{k}:\ell ^{2}(L_{k})\rightarrow \ell ^{2}(L_{k})\) defined by
For these we have that
In [10] the following estimates for sums of the form \(\sum _{p\in L_{k}}\lambda _{k,p}^{\beta }\) were proved:
Proposition 3.11
For any \(k\in {\mathbb {Z}}_{*}^{3}\) and \(\beta \in \left[ -1,0\right] \) it holds that
for a constant \(C_{\beta }>0\) independent of k and \(k_{F}\).
In particular, it holds that
so \(\left\langle v_{k},h_{k}^{-1}v_{k}\right\rangle \le C{\hat{V}}_{k}.\) Additionally,
Inserting these quantities into the statements of the Propositions 3.4, 3.8 and 3.9 yields Theorem 3.1, noting also that by Proposition 3.4
4 Analysis of the Exchange Terms
In this section we analyze the exchange terms, by which we mean the quantities of the expression
which appears in Theorem 1.4. The name is apt as these enter our calculations due to the presence of the exchange correction \(\varepsilon _{k,l}\left( p;q\right) \) of the quasi-bosonic commutation relations (see Lemma 1.3). To be precise, we will consider in this section the operators \(\varepsilon _{k}(\left\{ K_{k},B_{k}(t)\right\} )\), \({\mathcal {E}}_{k}^{1}(A_k(t))\) and \({\mathcal {E}}_{k}^{2}(B_k(t))\), and the effect of the integration will be handled in the next section. The main result of this section is the following estimates for them:
Theorem 4.1
For any \(\Psi \in {\mathcal {H}}_{N}\) and \(t\in \left[ 0,1\right] \) it holds that
for a constant \(C>0\) depending only on \(\sum _{k\in {\mathbb {Z}}_{*}^{3}}{\hat{V}}_{k}^{2}\).
The constant terms in the final estimate of the theorem give the exchange contribution
It is not generally negligible for singular potentials V, and the leading behavior is given by by
Proposition 4.2
It holds that
for a constant \(C>0\) depending only on \(\sum _{k\in {\mathbb {Z}}_{*}^{3}}{\hat{V}}_{k}^{2}\), where
4.1 Analysis of \(\varepsilon _{k}\) terms
Let us first consider terms of the form \(\sum _{k\in {\mathbb {Z}}_{*}^{3}}\varepsilon _{k}(A_{k})\), where we recall that
When summing over \(k\in {\mathbb {Z}}_{*}^{3}\), we can split the sum into two parts and interchange the summations as follows:
Recalling that \({\mathcal {N}}_{E}=\sum _{p\in B_{F}^{c}}c_{p}^{*}c_{p}=\sum _{q\in B_{F}}c_{q}c_{q}^{*}\) on \({\mathcal {H}}_{N}\), we can then immediately conclude that
By the estimates of the previous section we thus obtain the first estimate of Theorem 4.1:
Proposition 4.3
For any \(\Psi \in {\mathcal {H}}_{N}\) and \(t\in \left[ 0,1\right] \) it holds that
for a constant \(C>0\) depending only on \(\sum _{k\in {\mathbb {Z}}_{*}^{3}}{\hat{V}}_{k}^{2}\).
Proof
By Theorem 3.1 we have that
for a constant \(C>0\) independent of all quantities, so
As \(\Vert {\hat{V}}\Vert _{\infty }^{2}\le \Vert {\hat{V}}\Vert _{2}^{2}=\sum _{k\in {\mathbb {Z}}_{*}^{3}}{\hat{V}}_{k}^{2}\) the claim follows. \(\quad \square \)
4.2 Analysis of \({\mathcal {E}}_{k}^{1}\) terms
We consider terms of the form
Recalling that
we see that \({\mathcal {E}}_{k}^{1}(A_k)\) splits into two sums as
The two sums on the right-hand side have the same “schematic form”: They can be written as
where the index set is either the lune \(S_{k}=L_{k}\) or the corresponding hole states \(S_{k}=L_{k}-k,\) and depending on this index set the variables \(p_{1},p_{2},p_{3},p_{4}\) are given by
Note that in either case \(p_{1}\), \(p_{3}\) only depend on p and k, while \(p_{2}\), \(p_{4}\) depend only on p and l. Additionally, \(p_{1}\) is always an element of \(L_{k}\) and \(p_{4}\) is always an element of \(L_{-l}\).
Since \(b_{k,p}=c_{p-k}^{*}c_{p}={\tilde{c}}_{p-k}{\tilde{c}}_{p}\) it is easily seen that \(\left[ b,{\tilde{c}}\right] =0\), so in normal-ordering (with respect to \(\psi _{\textrm{FS}}\)) the summand of equation (4.11) we find
To bound a sum of the form \(\sum _{k\in {\mathbb {Z}}_{*}^{3}}{\mathcal {E}}_{1}^{k}(A_k)\) it thus suffices to estimate the two schematic forms
4.2.1 Preliminary estimates
We prepare for the estimation of these schematic forms by deriving some auxilliary bounds for the operators involved. Recall that for any \(k\in {\mathbb {Z}}_{*}^{3}\) and \(\varphi \in \ell ^{2}(L_{k})\),
Denote \({\mathcal {N}}_{k}=\sum _{p\in L_{k}}b_{k,p}^{*}b_{k,p}\). We can bound both \(b_{k}(\varphi )\) and \(b_{k}^{*}(\varphi )\) as follows:
Proposition 4.4
For any \(k\in {\mathbb {Z}}_{*}^{3}\), \(\varphi \in \ell ^{2}(L_{k})\) and \(\Psi \in {\mathcal {H}}_{N}\) it holds that
Proof
By the triangle and Cauchy-Schwarz inequalities we immediately obtain
and the bound for \(\left\| b_{k}^{*}(\varphi )\Psi \right\| \) now follows from (4.16) and the fact that
\(\square \)
It is straightforward to see that \({\mathcal {N}}_{k}\le {\mathcal {N}}_{E}\). Moreover, by rearranging the summations,
on \({\mathcal {H}}_{N}\). We also note that for any \(\Psi \in {\mathcal {H}}_{N}\) and \(p\in {\mathbb {Z}}^{3}\)
as follows by the inequality (considering \(p\in B_{F}^{c}\) for definiteness)
and the fact that \(\left[ {\tilde{c}}_{p}^{*}c_{p},{\mathcal {N}}_{k}\right] =0=\left[ {\tilde{c}}_{p}^{*}c_{p},{\mathcal {N}}_{E}\right] \). Similarly
To analyze the commutator term \(\left[ b_{-l}\left( K_{-l}e_{p_{4}}\right) ,{\tilde{c}}_{p_{3}}^{*}\right] \) we calculate a general identity: For any \(l\in {\mathbb {Z}}_{*}^{3}\), \(\psi \in \ell ^{2}(L_l)\) and \(p\in {\mathbb {Z}}^{3}\)
so for our particular commutator we obtain
It will be crucial to our estimates that the prefactors obey the following:
Proposition 4.5
For any \(k,l\in {\mathbb {Z}}_{*}^{3}\) and \(p\in S_{k}\cap S_{l}\) it holds that
Proof
Recall that \(p_{1},p_{2},p_{3},p_{4}\) are given by
From this we see that for any \(p\in S_{k}\cap S_{l}\)
where the assumption that \(p\in S_{k}\cap S_{l}\) enters to ensure that \(1_{B_{F}}(p-k)=1=1_{B_{F}}(p-l)\) or \(1_{B_{F}^{c}}(p+k)=1=1_{B_{F}^{c}}(p+l)\), respectively. Importantly this also implies that, when combined with such an indicator function, we also have the identity
The claim now follows by applying these identities to the estimates
which are given by Theorem 3.1. \(\quad \square \)
Below we will only use the simpler bound
but for the \({\mathcal {E}}_{k}^{2}\) terms the more general ones will be needed.
4.2.2 Estimation of \(\sum _{k\in {\mathbb {Z}}_{*}^{3}}{\mathcal {E}}_{k}^{1}(A_k(t))\)
Now the main estimate of this subsection:
Proposition 4.6
For any collection of symmetric operators \((A_k)\) and \(\Psi \in {\mathcal {H}}_{N}\) it holds that
Proof
Using the triangle and Cauchy-Schwarz inequalities and Proposition 4.4 we estimate
and the first bound now follows by recalling that \(\left\| K_{l}\right\| _{\textrm{HS}}^{2}\le C{\hat{V}}_{l}\). For the second we have by the equations (4.23) and (4.28) that
where we used \(\left\| A_{k}e_{p_{1}}\right\| \lambda _{k,p_{1}}^{-\frac{1}{2}}=\Vert A_{k}h_{k}^{-\frac{1}{2}}e_{p_{1}}\Vert \). The claim follows by \(\sum _{p\in L_{l}}\lambda _{l,p}^{-1}\le Ck_{F}\). \(\square \)
The bound on \(\sum _{k\in {\mathbb {Z}}_{*}^{3}}{\mathcal {E}}_{k}^{1}(A_k(t))\) of Theorem 4.1 now follows by our matrix element estimates:
Proposition 4.7
For any \(\Psi \in {\mathcal {H}}_{N}\) and \(t\in \left[ 0,1\right] \) it holds that
for a constant \(C>0\) depending only on \(\sum _{k\in {\mathbb {Z}}_{*}^{3}}{\hat{V}}_{k}^{2}\).
Proof
By Theorem 3.1 we have
Combining with \(\left| L_{k}\right| \le C\min \left\{ k_{F}^{2}\left| k\right| ,k_{F}^{3}\right\} \) since \(\sum _{q\in L_{k}}\lambda _{k,q}^{-1}\le Ck_{F}\), we get
Inserting these estimates into Proposition 4.6 yields the claim. \(\quad \square \)
4.3 Analysis of \({\mathcal {E}}_{k}^{2}\) terms
Now we come to the terms
We will analyze these similarly to the \({\mathcal {E}}_{k}^{1}(A_k)\) terms. Noting that
we find that \({\mathcal {E}}_{k}^{2}(B_{k})\) splits into two sums as
and again these share a common schematic form, namely
where the momenta are now
Again \(p_{1}\), \(p_{3}\) only depend on p and k while \(p_{2}\), \(p_{4}\) only depend on p and l.
We normal order the summand: As
and simply
the summand decomposes into 8 schematic forms as
Of these it should be noted that only the last one is proportional to a constant (i.e. does not contain any creation or annihilation operators). As the rest annihilate \(\psi _{\textrm{FS}}\), it follows that (when summed) the constant term yields precisely \(\left\langle \psi _{\textrm{FS}},{\mathcal {E}}_{k}^{2}(B_{k})\psi _{\textrm{FS}}\right\rangle \), whence bounding the other terms amounts to estimating the operator
as in the statement of Theorem 4.1.
4.3.1 Estimation of the top terms
We begin by bounding the “top” terms
By the quasi-bosonic commutation relations, the commutator term reduces to
where we used that \(p_{1}=p_{4}\) and \(p_{2}=p_{3}\) when \(k=l\). Now, the exchange correction of the second sum splits as
which are both of the schematic form \(\sum _{q\in S_{k}^{\prime }\cap S_{l}^{\prime }}\left\langle B_{k}e_{p_{1}},e_{q_{1}}\right\rangle \left\langle e_{q_{4}},K_{l}e_{p_{4}}\right\rangle {\tilde{c}}_{q_{2}}^{*}{\tilde{c}}_{q_{3}}\).
To estimate \(\sum _{k,l\in {\mathbb {Z}}_{*}^{3}}\sum _{p\in S_{k}\cap S_{l}}{\tilde{c}}_{p_{2}}^{*}\varepsilon _{k,l}\left( B_{k}e_{p_{1}};K_{l}e_{p_{4}}\right) {\tilde{c}}_{p_{3}}\) it thus suffices to consider
The estimates for the top terms are as follows:
Proposition 4.8
For any collection of symmetric operators \((B_k)\) and \(\Psi \in {\mathcal {H}}_{N}\) it holds that
for a constant \(C>0\) depending only on \(\sum _{k\in {\mathbb {Z}}_{*}^{3}}{\hat{V}}_{k}^{2}\).
Proof
The first term we can estimate as in Proposition 4.6 by
and obviously \(\Vert {\mathcal {N}}_{E}^{\frac{1}{2}}{\mathcal {N}}_{k}^{\frac{1}{2}}\Psi \Vert \le \Vert {\mathcal {N}}_{E}\Psi \Vert \Vert {\mathcal {N}}_{E}^{\frac{3}{2}}\Psi \Vert \). For the commutator term we have
By the matrix element estimate for \(K_{k}\) of Theorem 3.1 we have for any \(p\in L_{k}\) that
since \(\sum _{q\in L_{k}}\lambda _{k,q}^{-1}\le Ck_{F}\). Consequently
and clearly
Finally
whence the claim follows as \(\left\| K_{l}\right\| _{\textrm{HS}}\le C{\hat{V}}_{l}\). \(\quad \square \)
4.3.2 Estimation of the single commutator terms
For the single commutator terms
we note that by equation (4.22), the commutator \(\left[ b_{l}\left( K_{l}e_{p_{4}}\right) ,{\tilde{c}}_{p_{3}}^{*}\right] \) is given by
The prefactors again obey an estimate as in Proposition 4.5:
Proposition 4.9
For any \(k,l\in {\mathbb {Z}}_{*}^{3}\) and \(p\in S_{k}\cap S_{l}\) it holds that
The proof is essentially the same as that of Proposition 4.5 (indeed, this proposition can be obtained directly from the former by appropriate substition, but some care must be used since the \(p_{i}\)’s differ in their definition).
For the single commutator terms we again only need the simpler bound
but the full one will be needed for the double commutator terms below. Now the estimate:
Proposition 4.10
For any collection of symmetric operators \((B_k)\) and \(\Psi \in {\mathcal {H}}_{N}\) it holds that
for a constant \(C>0\) depending only on \(\sum _{k\in {\mathbb {Z}}_{*}^{3}}{\hat{V}}_{k}^{2}\).
Proof
As in the second estimate of Proposition 4.6 we have
By equation (4.22) it holds that
so the second term can be bounded as
\(\square \)
4.3.3 Estimation of the double commutator terms
Finally we have the double commutator terms
An identity for the iterated commutators is obtained by applying the identity of equation (4.22) to itself: For any \(k,l\in {\mathbb {Z}}_{*}^{3}\), \(\varphi \in \ell ^{2}(L_{k})\), \(\psi \in \ell ^{2}(L_l)\) and \(p\in {\mathbb {Z}}_{*}^{3}\)
The estimates are the following:
Proposition 4.11
For any collection of symmetric operators \((B_k)\) and \(\Psi \in {\mathcal {H}}_{N}\) it holds that
are all bounded by
for a constant \(C>0\) depending only on \(\sum _{k\in {\mathbb {Z}}_{*}^{3}}{\hat{V}}_{k}^{2}\).
Proof
For these estimates we consider only the case \(S_{k}=L_{k}\) for the sake of clarity, i.e. we let
the case \(S_{k}=L_{k}-k\) can be handled by similar manipulations.
Using the identity of equation (4.57) we start by estimating (by the bound of Proposition 4.9)
where we used \(\sum _{k\in {\mathbb {Z}}_{*}^{3}}1_{L_{l}}(p_{3}+l)\lambda _{l,p_{3}+l}^{-1}\le \sum _{q\in L_{l}}\lambda _{l,q}^{-1}\le Ck_{F}\). From (4.57) we have
so the second term can be similarly estimated as
Finally, from (4.51) and (4.54) we see that \(\left[ b_{l}\left( K_{l}e_{p_{4}}\right) ,{\tilde{c}}_{p_{3}}^{*}\right] ^{*}\left[ b_{k}\left( B_{k}e_{p_{1}}\right) ,{\tilde{c}}_{p_{2}}^{*}\right] \) is equal to
so we estimate
\(\square \)
The \({\mathcal {E}}_{k}^{2}\) bound of Theorem 4.1 now follows:
Proposition 4.12
For any \(\Psi \in {\mathcal {H}}_{N}\) and \(t\in \left[ 0,1\right] \) it holds that
for a constant \(C>0\) depending only on \(\sum _{k\in {\mathbb {Z}}_{*}^{3}}{\hat{V}}_{k}^{2}\).
Proof
Clearly
for any \(B_{k}\), and as our estimate for \(B_{k}(t)\) in Theorem 3.1 is the same as that for \(A_{k}(t)\), the bounds
follow exactly as those of Proposition 4.7. Insertion into the Propositions 4.8, 4.10 and 4.11 yields the claim. \(\square \)
4.4 Analysis of the exchange contribution
Finally we determine the leading order of the exchange contribution. To begin we derive a general formula for a quantity of the form \(\left\langle \psi _{\textrm{FS}},{\mathcal {E}}_{k}^{2}(B_{k})\psi _{\textrm{FS}}\right\rangle \): We can write
where, using equation (4.22) in the form
the terms A and B are given by
and similarly
Although non-obvious, there holds the identity \(A=B\). To see this we rewrite both terms: First, for A, we note that the presence of the \(\delta _{p+q,k+l}\) makes the \(L_{l}\) of the summation \(p,q\in L_{k}\cap L_{l}\) redundant: For any \(p,q\in B_{F}^{c}\) there holds the equivalence
by the trivial identities
so A can be written as
A similar observation applies to B: For any \(p,q\in B_{F}\) we likewise have
so
where we lastly used that the kernels \(K_{k}\) obey
In all we thus have the identity
Our matrix element estimates of the last section now yield the following:
Proposition
(4.2) It holds that
for a constant \(C>0\) depending only on \(\sum _{k\in {\mathbb {Z}}_{*}^{3}}{\hat{V}}_{k}^{2}\), where
Proof
Since all the one-body operators are real-valued we can drop the \(\textrm{Re}\left( \cdot \right) \) and apply the above identity for
Now, note that \(E_{\textrm{corr},\textrm{ex}}\) can be written as
since, much as in Proposition 4.5, the \(\delta _{p+q,k+l}\) implies the following identity for the denominators:
In conclusion we thus see that
We estimate A and B. By the matrix element estimates of Theorem 3.1 we have that (using our freedom to replace \(\lambda _{l,p}+\lambda _{l,q}\) by \(\lambda _{k,p}+\lambda _{k,q}\))
where we applied the inequality \(\sum _{q\in L_{k}}\lambda _{k,q}^{-1}\le Ck_{F}\) and also used that Proposition 3.11 implies that
for a \(C>0\) independent of all quantities. By Cauchy-Schwarz we can further estimate
for the bound of the statement. By similar estimation also
and the claim follows likewise. \(\quad \square \)
5 Estimation of the Non-Bosonizable Terms and Gronwall Estimates
In this section we perform the final work which will allow us to conclude Theorem 1.1.
The main content of this section lies in the estimation of the non-bosonizable terms, by which we mean the cubic and quartic terms
The cubic terms \({\mathcal {C}}\) will not present a big obstacle to us: As was first noted in [2] (in their formulation), the expectation value of these in fact vanish identically with respect to the type of trial state we will consider. The bulk of the work will thus be to estimate the quartic terms. We prove the following bounds:
Theorem 5.1
It holds that \({\mathcal {Q}}=G+{\mathcal {Q}}_{\textrm{LR}}+{\mathcal {Q}}_{\textrm{SR}}\) where for any \(\Psi \in {\mathcal {H}}_{N}\)
and \(e^{{\mathcal {K}}}{\mathcal {Q}}_{\textrm{SR}}e^{-{\mathcal {K}}}={\mathcal {Q}}_{\textrm{SR}}+\int _{0}^{1}e^{t{\mathcal {K}}}\left( 2\,\textrm{Re}\left( {\mathcal {G}}\right) \right) e^{-t{\mathcal {K}}}dt\) for an operator \({\mathcal {G}}\) obeying
\(C>0\) being a constant independent of all quantities.
With these all the general bounds are established. As all our error estimates are with respect to \({\mathcal {N}}_{E}\) and powers thereof, it then only remains to control the effect which the transformation \(e^{{\mathcal {K}}}\) has on these. By a standard Gronwall-type argument this control will follow from the estimate of Proposition 2.4, and we then end the paper by concluding Theorem 1.1.
5.1 Analysis of the cubic terms
Expanding the \(\textrm{Re}\left( \cdot \right) \), the cubic terms are
The operators \(B_{k}\) can be written simply as \(B_{k}=\sum _{p\in L_{k}}b_{k,p}\) in terms of the excitation operators \(b_{k,p}=c_{p-k}^{*}c_{p}\), whence it is easily seen that
As a consequence, \(B_{k}\) maps the eigenspace \(\left\{ {\mathcal {N}}_{E}=M\right\} \) into \(\left\{ {\mathcal {N}}_{E}=M-1\right\} \) and \(B_{k}^{*}\) maps \(\left\{ {\mathcal {N}}_{E}=M\right\} \) into \(\left\{ {\mathcal {N}}_{E}=M+1\right\} \). Meanwhile, the operators \(D_{k}\) preserve the eigenspaces: Writing \(D_{k}=D_{1,k}+D_{2,k}\) for
these annihilate and create one hole or excitation, respectively, whence \(\left[ {\mathcal {N}}_{E},D_{k}\right] =0=\left[ {\mathcal {N}}_{E},D_{k}^{*}\right] \).
It follows that \({\mathcal {C}}\) maps the eigenspace \(\left\{ {\mathcal {N}}_{E}=M\right\} \) into \(\left\{ {\mathcal {N}}_{E}=M-1\right\} \oplus \left\{ {\mathcal {N}}_{E}=M+1\right\} \). Decomposing \({\mathcal {H}}_{N}\) orthogonally as \({\mathcal {H}}_{N}={\mathcal {H}}_{N}^{\textrm{even}}\oplus {\mathcal {H}}_{N}^{\textrm{odd}}\) for
we thus see that \({\mathcal {C}}\) maps each subspace into the other. On the other hand, since our transformation kernel \({\mathcal {K}}\) is of the form
we note that \({\mathcal {K}}\) maps each \(\left\{ {\mathcal {N}}_{E}=M\right\} \) into \(\left\{ {\mathcal {N}}_{E}=M-2\right\} \oplus \left\{ {\mathcal {N}}_{E}=M+2\right\} \), hence \({\mathcal {K}}\) preserves \({\mathcal {H}}_{N}^{\textrm{even}}\) and \({\mathcal {H}}_{N}^{\textrm{odd}}\), and so too does the transformation \(e^{-{\mathcal {K}}}\). As any eigenstate \(\Psi \in {\mathcal {H}}_{N}\) of \({\mathcal {N}}_{E}\) is contained in either \({\mathcal {H}}_{N}^{\textrm{even}}\) or \({\mathcal {H}}_{N}^{\textrm{odd}}\), and these are orthogonal, we conclude the following:
Proposition 5.2
For any eigenstate \(\Psi \) of \({\mathcal {N}}_{E}\) it holds that
5.2 Analysis of the quartic terms
Now we consider the quartic terms
We begin by rewriting these: Recalling the decomposition \(D_{k}=D_{1,k}+D_{2,k}\) above, we calculate
and similarly
For any \(k\in {\mathbb {Z}}_{*}^{3}\) we can likewise write \(\sum _{p\in L_{k}}\left( c_{p}^{*}c_{p}+c_{p-k}c_{p-k}^{*}\right) \) in the form
Noting that \(D_{1,k}=0\) for \(\left| k\right| >2k_{F}\), as then \(B_{F}\cap \left( B_{F}+k\right) =\emptyset \), we thus obtain the decomposition
where G is the one-body operator
the long-range terms \({\mathcal {Q}}_{\textrm{LR}}\) are given by
and the short-range terms \({\mathcal {Q}}_{\textrm{SR}}\) are
5.2.1 Estimation of G and \({\mathcal {Q}}_{\textrm{LR}}\)
G and the long-range terms are easily controlled: First, interchanging the summations we can write G as
from which it is obvious that G obeys
This implies the following:
Proposition 5.3
For any \(\Psi \in {\mathcal {H}}_{N}\) it holds that
for a constant \(C>0\) independent of all quantities.
Proof
For any \(p\in {\mathbb {Z}}^{3}\) we estimate by Cauchy-Schwarz
where we lastly used that \(k\mapsto \min \left\{ \left| k\right| ,k_{F}\right\} ^{-1}\) is radially decreasing and that \(\left( B_{F}+p\right) \cap {\mathbb {Z}}_{*}^{3}\) contains at most \(\left| B_{F}\right| \) points. As it is well-known that \(\sum _{k\in {\overline{B}}\left( 0,R\right) \backslash \left\{ 0\right\} }\left| k\right| ^{-1}\le CR^{2}\) as \(R\rightarrow \infty \) the bound follows. \(\quad \square \)
\({\mathcal {Q}}_{\textrm{LR}}\) can be handled in a similar manner:
Proposition 5.4
For any \(\Psi \in {\mathcal {H}}_{N}\) it holds that
for a constant \(C>0\) independent of all quantities.
Proof
Consider the first term in the parenthesis of (5.13): For any \(k\in {\mathbb {Z}}_{*}^{3}\) we can estimate
As e.g.
the terms \(D_{1,k}^{*}D_{2,k}\) and \(D_{2,k}^{*}D_{1,k}\) can be handled similarly, whence
where \(\sum _{k\in {\overline{B}}\left( 0,2k_{F}\right) \cap {\mathbb {Z}}_{*}^{3}}{\hat{V}}_{k}\) was bounded as in equation (5.17). \(\quad \square \)
5.2.2 Analysis of \({\mathcal {Q}}_{\textrm{SR}}\)
Lastly we come to
Recall that the transformation \({\mathcal {K}}\) can be written as \({\mathcal {K}}=\tilde{{\mathcal {K}}}-\tilde{{\mathcal {K}}}^{*}\) for
To determine \(e^{{\mathcal {K}}}{\mathcal {Q}}_{\textrm{SR}}e^{-{\mathcal {K}}}\) we will need the commutator \(\left[ {\mathcal {K}},{\mathcal {Q}}_{\textrm{SR}}\right] =2\,\textrm{Re}\left( \left[ \tilde{{\mathcal {K}}},{\mathcal {Q}}_{\textrm{SR}}\right] \right) \). Noting that for any \(p\in B_{F}^{c}\) and \(l\in {\mathbb {Z}}_{*}^{3}\), \(q\in L_{l}\), we have
we deduce (with the help of Lemma A.1) that
Using this we conclude the following:
Proposition 5.5
It holds that \(e^{{\mathcal {K}}}{\mathcal {Q}}_{\textrm{SR}}e^{-{\mathcal {K}}}={\mathcal {Q}}_{\textrm{SR}}+\int _{0}^{1}e^{t{\mathcal {K}}}\left( 2\,\textrm{Re}\left( {\mathcal {G}}\right) \right) e^{-t{\mathcal {K}}}dt\) for
Proof
By the fundamental theorem of calculus
and as noted \(\left[ {\mathcal {K}},{\mathcal {Q}}_{\textrm{SR}}\right] =2\,\textrm{Re}\left( \left[ \tilde{{\mathcal {K}}},{\mathcal {Q}}_{\textrm{SR}}\right] \right) \). Using equation (5.22) we compute that \({\mathcal {G}}:=\left[ \tilde{{\mathcal {K}}},{\mathcal {Q}}_{\textrm{SR}}\right] \) is given by
where we for the third inequality substituted \(p\rightarrow q\) and \(k\rightarrow -k\) in the second sum. By the identity of equation (4.22) the anti-commutator is given by
which is inserted into the previous equation for the claim. \(\quad \square \)
We bound the \({\mathcal {G}}\) operator as follows:
Proposition 5.6
For any \(\Psi \in {\mathcal {H}}_{N}\) it holds that
for a constant \(C>0\) depending only on \(\sum _{k\in {\mathbb {Z}}_{*}^{3}}{\hat{V}}_{k}^{2}\).
Proof
Using Proposition 4.4 we estimate the sum of the first term of \({\mathcal {G}}\) as
Now, the \(\left\| K_{k}\right\| _{\textrm{HS}}\) estimate of Theorem 3.1 and Cauchy-Schwarz lets us estimate
and
for a constant \(C>0\) independent of all quantities, so in all the first term of \({\mathcal {G}}\) obeys
Similarly, for the second term (using simply that \(\left\| {\tilde{c}}_{p-l}\right\| _{\textrm{Op}}=1\) at the beginning)
\(\square \)
5.3 Gronwall estimates
We now establish control over the operators \(e^{{\mathcal {K}}}{\mathcal {N}}_{E}^{m}e^{-{\mathcal {K}}}\) for \(m=1,2,3\). Consider first the mapping \(t\mapsto e^{t{\mathcal {K}}}{\mathcal {N}}_{E}e^{-t{\mathcal {K}}}\): Noting that for any \(\Psi \in {\mathcal {H}}_{N}\)
Gronwall’s lemma implies that to bound \(e^{t{\mathcal {K}}}\left( {\mathcal {N}}_{E}+1\right) e^{-t{\mathcal {K}}}\) it suffices to control \(\left[ {\mathcal {K}},{\mathcal {N}}_{E}\right] \) with respect to \({\mathcal {N}}_{E}+1\) itself. We determine the commutator: As \({\mathcal {K}}=\mathcal {{\tilde{K}}}-\mathcal {{\tilde{K}}}^{*}\) for
and \(\left[ b_{l,p},{\mathcal {N}}_{E}\right] =b_{l,p}\) it holds that \(\left[ \tilde{{\mathcal {K}}},{\mathcal {N}}_{E}\right] =2\,\tilde{{\mathcal {K}}}\), whence
The estimate of Proposition 2.4 immediately yields that
for a constant \(C>0\) depending only on \(\sum _{k\in {\mathbb {Z}}_{*}^{3}}{\hat{V}}_{k}^{2}\), whence by Gronwall’s lemma
This proves the bound for \({\mathcal {N}}_{E}\); for \({\mathcal {N}}_{E}^{2}\) we will as in [10] apply the following lemma:
Lemma 5.7
Let A, B, Z be given with \(A>0\), \(Z\ge 0\) and \(\left[ A,Z\right] =0\). Then if \(\pm \left[ A,\left[ A,B\right] \right] \le Z\) it holds that
The estimates are as follows:
Proposition 5.8
For any \(\Psi \in {\mathcal {H}}_{N}\) and \(\left| t\right| \le 1\) it holds that
for a constant \(C>0\) depending only on \(\sum _{k\in {\mathbb {Z}}_{*}^{3}}{\hat{V}}_{k}^{2}\).
Proof
The case of \(m=1\) was proved above. For \(m=2\) it suffices to control \(\left[ {\mathcal {K}},{\mathcal {N}}_{E}^{2}\right] \) in terms of \({\mathcal {N}}_{E}^{2}+1\); by the identity \(\left\{ A,B\right\} =A^{\frac{1}{2}}BA^{\frac{1}{2}}+[A^{\frac{1}{2}},[A^{\frac{1}{2}},B]]\) we can write
and note that the commutator \(\left[ \tilde{{\mathcal {K}}},{\mathcal {N}}_{E}\right] =2\,\tilde{{\mathcal {K}}}\) also implies that
so by Lemma 5.7 and equation (5.33)
Similarly, for \({\mathcal {N}}_{E}^{3}\),
implies that
hence the \(m=3\) bound. \(\quad \square \)
5.3.1 Conclusion of Theorem 1.1
We can now conclude:
Theorem
(1.1) It holds that
for a constant \(C>0\) depending only on \(\sum _{k\in {\mathbb {Z}}_{*}^{3}}{\hat{V}}_{k}^{2}\).
Proof
By the variational principle applied to the trial state \(e^{-{\mathcal {K}}}\psi _{\textrm{FS}}\) we have by Proposition 1.2 and the Theorems 1.4, 3.1 and 5.1 that
where we also used that
and that \(\left\langle \psi _{\textrm{FS}},e^{{\mathcal {K}}}{\mathcal {C}}e^{-{\mathcal {K}}}\psi _{\textrm{FS}}\right\rangle =0\) by Proposition 5.2. The errors \(\epsilon _{1}\), \(\epsilon _{2}\) and \(\epsilon _{3}\) obey
by Proposition 4.2,
by Theorem 4.1, and
by Theorem 5.1, where we for the last error terms also used that
as follows by Proposition 5.8. \(\quad \square \)
Notes
This name comes from the fact that Slater determinants are the least correlated wave functions under Pauli’s exclusion principle.
We consider spinless particles for simplicity. Including the spin only requires slight modifications of the analysis.
Note that these definitions differ slightly from those of [10]. The main change is the definition of \(Q_{1}^{k}(A)\); this operator is what was denoted \({\tilde{Q}}_{1}^{k}(A)\) in that paper.
References
Bach, V.: Error bound for the Hartree–Fock energy of atoms and molecules. Commun. Math. Phys. 147, 527–548 (1992)
Benedikter, N., Nam, P.T., Porta, M., Schlein, B., Seiringer, R.: Optimal upper bound for the correlation energy of a fermi gas in the mean-field regime. Commun. Math. Phys. 374, 2097–2150 (2020)
Benedikter, N., Nam, P.T., Porta, M., Schlein, B., Seiringer, R.: Correlation energy of a weakly interacting Fermi gas. Invent. Math. 225, 885–979 (2021)
Benedikter, N., Nam, P.T., Porta, M., Schlein, B., Seiringer, R.: Bosonization of fermionic many-body dynamics. Ann. Henri Poincaré 23, 1725–1764 (2022)
Benedikter, N., Porta, M., Schlein, B., Seiringer, R.: Correlation energy of a weakly interacting fermi gas with large interaction potential. Preprint 2021, arXiv:2106.13185
Bogolubov, N.N.: On the theory of superfluidity. J. Phys. (USSR), 11, p. 23 (1947)
Bohm, D., Pines, D.: A collective description of electron interactions. I. Magnetic interactions. Phys. Rev. 82, 625–634 (1951)
Bohm, D., Pines, D.: A collective description of electron interactions: II. Collective vs. Individual particle aspects of the interactions. Phys. Rev. 85, 338–353, (1952)
Bohm, D., Pines, D.: A collective description of electron interactions: III. Coulomb interactions in a degenerate electron gas. Phys. Rev. 92 609–625, (1953)
Christiansen, M.R., Nam, P.T., Hainzl, C.: The random phase approximation for interacting fermi gases in the mean-field regime. Preprint 2021, arXiv:2106.11161
Christiansen, M.R., Nam, P.T., Hainzl, C.: On the Effective quasi-bosonic Hamiltonian of the electron gas: collective excitations and plasmon modes. Lett. Math. Phys. 112, Art. 114, (2022)
Fefferman, C.L., Seco, L.A.: On the energy of a large atom. Bull. Am. Math. Soc. 23, 525–530 (1990)
Gell-Mann, M., Brueckner, K.A.: Correlation energy of an electron gas at high density. Phys. Rev. 106, 364 (1957)
Gontier, D., Hainzl, C., Lewin, M.: Lower bound on the Hartree–Fock energy of the electron gas. Phys. Rev. A 99, 052501 (2019)
Graf, G.M., Solovej, J.P.: A correlation estimate with applications to quantum systems with Coulomb interactions. Rev. Math. Phys 06, 977–997 (1994)
Hainzl, C., Porta, M., Rexze, F.: On the correlation energy of interacting fermionic systems in the mean-field regime. Commun. Math. Phys. 524, 374–485 (2020)
Heisenberg, W.: Zur Theorie der Supraleitung. Zeitschrift für Naturforschung A 2(4), 185–201 (1947)
Macke, W.: Über die Wechselwirkungen im Fermi-Gas. Polarisationserscheinungen, Correlationsenergie, Elektronenkondensation. Zeitschrift für Naturforschung A, 5 (4), pp. 192–208, (1950)
Pines, D.: A collective description of electron interactions: IV. Electron interaction in metals. Phys. Rev. 92, 626 (1953)
Raimes, S.: Many Electron theory. North-Holland Publishing Company (1972)
Sawada, K.: Correlation energy of an electron gas at high density. Phys. Rev. 106, 372 (1957)
Sawada, K., Brueckner, K.A., Fukuda, N., Brout, R.: Correlation energy of an electron gas at high density: plasma oscillations. Phys. Rev. 108, 507 (1957)
Wigner, E.: On the interaction of electrons in metals. Phys. Rev. 46(11), 1002–1011 (1934)
Acknowledgements
We would like to thank the editor and the referees for helpful suggestions. MRC and PTN acknowledge the support from the Deutsche Forschungsgemeinschaft (DFG project Nr. 426365943).
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by L. Erdos.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Diagonalization of the Bosonizable Terms
Diagonalization of the Bosonizable Terms
In this section we derive the identity of Theorem 1.4. This is to a degree equivalent with the contents of Section 5 of [10], but for the reader’s convenience, and since the notation used in the papers differ, we include a brief derivation in this appendix.
To determine the action of \(e^{{\mathcal {K}}}\), we must first compute several commutators involving \({\mathcal {K}}\). To simplify the calculations we will make repeated use of the following result ( [10, Lemma 3.2]).
Lemma A.1
Let \(\left( V,\left\langle \cdot ,\cdot \right\rangle \right) \) be an n-dimensional Hilbert space and let \(q:V\times V\rightarrow W\) be a sesquilinear mapping into a vector space W. Let \(\left( e_{i}\right) _{i=1}^{N}\) be an orthonormal basis for V. Then for any linear operators \(S,T:V\rightarrow V\) it holds that
The lemma is easily proved by orthonormal expansion. In our case, where we regard \(\ell ^{2}(L_{k})\) as real vector spaces, sesquilinearity is simply bilinearity. Moreover, the operators \(K_{k}\) satisfy
where \(I_{k}:\ell ^{2}(L_{k})\rightarrow \ell ^{2}\left( L_{-k}\right) \) denotes the unitary mapping defined by \(I_{k}e_{p}=e_{-p}\), \(p\in L_{k}\). Thus Lemma A.1 allows us to move operators from one argument to another (when summed), as e.g.
We start by computing the commutator of \({\mathcal {K}}\) with an excitation operator:
Proposition A.2
For any \(k\in {\mathbb {Z}}_{*}^{3}\) and \(\varphi \in \ell ^{2}(L_{k})\) it holds that
where
Proof
It suffices to determine \(\left[ {\mathcal {K}},b_{k}(\varphi )\right] \). Using Lemmas A.1 and 1.3 we calculate that
where in the last identity we recognized \(K_{-k}\sum _{q\in L_{-k}}\left\langle \varphi ,e_{-q}\right\rangle e_{q}=K_{-k}I_{k}\varphi =I_{k}K_{k}\varphi \). \(\square \)
Using this relation we can now determine the commutators with \(Q_{1}^{k}\) terms:
Proposition A.3
For any \(k\in {\mathbb {Z}}_{*}^{3}\) and symmetric operators \(A_{\pm k}:\ell ^{2}(L_{\pm k})\rightarrow \ell ^{2}(L_{\pm k})\) such that \(I_{k}A_{k}=A_{-k}I_{k}\), it holds that
where
Proof
Using Proposition A.2 (and Lemma A.1 together with symmetry of \(A_{k}\)) we find that
The assumption that \(I_{k}A_{k}=A_{-k}I_{k}\) yields \( Q_{2}^{k}(A_{k}K_{k})=Q_{2}^{-k}(K_{-k}A_{-k}). \) Summing over both k and \(-k\), we obtain the desired identity. \(\quad \square \)
To state the commutator of \({\mathcal {K}}\) with \(Q_{2}^{k}\) terms we note the identity
where we introduced the convenient notation
The commutator is then given as follows:
Proposition A.4
For any \(k\in {\mathbb {Z}}_{*}^{3}\) and symmetric operators \(B_{\pm k}:\ell ^{2}(L_{\pm k})\rightarrow \ell ^{2}(L_{\pm k})\) such that \(I_{k}B_{k}=B_{-k}I_{k}\), it holds that
where
Proof
Writing \(Q_{2}^{k}(B_{k})=2\,\textrm{Re}\left( \sum _{p\in L_{k}}b_{k}(B_k e_p)b_{-k}\left( e_{-p}\right) \right) \) and using Proposition A.2 we get
For (I), the first term on the right-hand side, using (A.5) we find that
Summing over k and \(-k\) and using \({\mathcal {E}}_{k}^{2}(B_{k})=\sum _{p\in L_{k}}\left\{ b_{k}(B_k e_p),{\mathcal {E}}_{-k}\left( e_{-p}\right) \right\} \) for (II), we obtain the desired identity. \(\square \)
Finally we calculate the commutator with \(H_{\textrm{kin}}^{\prime }\):
Proposition A.5
It holds that
Proof
By equation (1.34) we have
so using that \(I_{k}h_{k}=h_{-k}I_{k}\) we find
\(\square \)
Now we can now determine the action of \(e^{{\mathcal {K}}}\) on quadratic operators:
Proposition A.6
For any \(k\in {\mathbb {Z}}_{*}^{3}\) and symmetric operators \(T_{\pm k}:\ell ^{2}(L_{\pm k})\rightarrow \ell ^{2}(L_{\pm k})\) such that \(I_{k}T_{k}=T_{-k}I_{k}\) it holds that
and
where for \(t\in \left[ 0,1\right] \),
Proof
We prove the first identity, the second following by a similar argument. Note that the operators \(A_{k}(t)=T_k^1(t)\), \(B_k(t)=T_k^2(t)\) satisfy
By Propositions A.3 and A.4 we get
The second term on the right-hand side vanishes due to (A.10). Specifying also the initial conditions in (A.10) we conclude by the fundamental theorem of calculus,
where we also used that by the assumptions on \(A_{k}(t)\) and \(B_{k}(t)\)
The proof of Proposition A.6 is complete. \(\square \)
From this we can also easily deduce the action of \(e^{{\mathcal {K}}}\) on \(H_{\textrm{kin}}^{\prime }\):
Proposition A.7
It holds that
where for \(t\in \left[ 0,1\right] \),
Proof
By the Propositions A.3 and A.5 we see that
whence by the fundamental theorem of calculus
Applying Proposition A.6 now yields the claim. \(\square \)
We are now equipped to conclude Theorem 1.4. By the two previous propositions, we see that
where the operators \(A_{k}(t),B_{k}(t):\ell ^{2}(L_{k})\rightarrow \ell ^{2}(L_{k})\) are given by
Now we choose \(K_{k}\) such that \(B_{k}(1)=0\). This amounts to the diagonalization condition
of which the solution is given in (1.39). Since (A.17) is fulfilled, it follows that also \(A_{k}(1)=e^{-K_{k}}h_{k}e^{-K_{k}}-h_{k}\), and so the identity in Theorem 1.4 follows provided we can show that
To establish this final identity we will use the following integral representation of the square root of a one-dimensional perturbation, first used in [2]:
Lemma A.8
Let \(A:V\rightarrow V\) be a positive self-adjoint operator. Then for any \(w\in V\) and \(g\in {\mathbb {R}}\) such that \(A+gP_{w}>0\) it holds that
The trace identity (A.18) now follows (note that this is essentially Proposition 7.6 of [10]):
Proposition A.9
Let \(F\left( x\right) =\log \left( 1+x\right) -x.\) For any \(k\in {\mathbb {Z}}_{*}^{3}\) it holds that
Proof
By cyclicity of the trace and Lemma A.8, \(\textrm{tr}\left( e^{-K_{k}}h_{k}e^{-K_{k}}-h_{k}\right) \) is equal to
The claim follows by inserting the definition of \(h_{k}\) and \(v_{k}\), and noting also that
where we used the integral identity \(\int _{0}^{\infty }a/(a^2+t^2)dt=\pi /2\) for every \(a>0\). \(\quad \square \)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Christiansen, M.R., Hainzl, C. & Nam, P.T. The Gell-Mann–Brueckner Formula for the Correlation Energy of the Electron Gas: A Rigorous Upper Bound in the Mean-Field Regime. Commun. Math. Phys. 401, 1469–1529 (2023). https://doi.org/10.1007/s00220-023-04672-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00220-023-04672-2