Spectral approximation scheme for a hybrid, spin-density Kohn–Sham density-functional theory in an external (nonuniform) magnetic field and a collinear exchange-correlation energy

Melgaard, M.; Syrjanen, V. J. J.

doi:10.1007/s10910-023-01557-6

Spectral approximation scheme for a hybrid, spin-density Kohn–Sham density-functional theory in an external (nonuniform) magnetic field and a collinear exchange-correlation energy

Original Paper
Open access
Published: 08 January 2024

Volume 62, pages 711–760, (2024)
Cite this article

Download PDF

You have full access to this open access article

Journal of Mathematical Chemistry Aims and scope Submit manuscript

Spectral approximation scheme for a hybrid, spin-density Kohn–Sham density-functional theory in an external (nonuniform) magnetic field and a collinear exchange-correlation energy

Download PDF

M. Melgaard¹ &
V. J. J. Syrjanen¹

868 Accesses
Explore all metrics

Abstract

We provide a mathematical justification of a spectral approximation scheme known as spectral binning for the Kohn–Sham spin density-functional theory in the presence of an external (nonuniform) magnetic field and a collinear exchange-correlation energy term. We use an extended density-only formulation for modeling the magnetic system. No current densities enter the description in this formulation, but the particle density is split into different spin components. By restricting the exchange-correlation energy functional to be of a collinear LSDA form, we prove a series of results which enable us to mathematically justify the spectral binning scheme using the method of Gamma-convergence, in conjunction with auxiliary steps involving recasting the electrostatic potentials, justifying the spectral approximation by making a spectral decomposition of the Hamiltonian and “linearizing" the latter Hamiltonian.

A Variational Framework for Spectral Approximations of Kohn–Sham Density Functional Theory

Article 18 February 2016

Non-analytic Spin-Density Functionals

Kohn–Sham theory in the presence of magnetic field

Article 02 September 2014

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Among the various methods proposed to simplify the numerical simulation of solutions of the non-relativistic Schrödinger equation, the significance of density-functional theory (DFT) in condensed-matter physics and quantum chemistry cannot be overstated [2, 12, 15, 33]. Its theoretical foundations are based upon the seminal work of Hohenberg and Kohn (HK) [13] wherein the HK Theorem was established, stating that the particle density, $\rho $, determines the external potential up to a constant, and an HK variational principle presented. This led to replacement of the many-body Schrödinger problem with a minimization problem involving a universal density functional. Further advances were made in Lieb’s work [20] which developed a more mathematically tractable formulation based on convex analysis. According to the HK theorem, $\rho $ can be treated as the fundamental unknown in a many-electron system without magnetic field. Since $\rho $ only depends on three spatial coordinates, numerical simulations of large molecules and even solids became possible when Kohn and Sham [17] presented an implementation of DFT based upon the idea of replacing the physical system of interacting electrons with a fictitious system (the Kohn–Sham system) of non-interacting electrons, having the same ground state electron density as the physical system. In the Kohn–Sham system, an electron moves in an effective potential consisting of three terms: the external potential induced by the attraction between the electron and the nucleus, the Hartree potential which describes the repulsion between electron and electron, and the exchange-correlation (abbreviated xc) potential which is generated by certain non-classical Coulomb interactions. The latter is not known analytically and, therefore, approximate methods are needed. As part of their work, Kohn and Sham introduced a local density approximation (LDA) for the xc potential. Since then, many other approximations have been proposed such as the generalized gradient approximation (GGA), and the meta-generalized gradient approximation (meta-GGA) (see, e.g., [24]). By solving numerically the Kohn–Sham equation, we can determine the ground state electron density of the original physical system and, via the ground state, other interesting quantities.

Understanding magnetic properties is absolutely crucial in modern material science (atomistic spin-dynamics, complex oxides, saturation magnetization etc.). A magnetic field has two effects on a system of electrons: (i) it tends to align their spins, and (ii) it alters their translational motion. The first effect appears when one adds a term of the form $-e\hbar m^{-1} \varvec{s} \cdot {\varvec{B}}$ to the Hamiltonian, while the second, diamagnetic effect arises from the usual kinetic energy $(2m)^{-1} | {{\textbf{p}}}|^{2}$ being replaced by $(2m)^{-1} | {{\textbf{p}}}-(e/c) {{\mathcal {A}}}|^{2}$. Here ${{\textbf{p}}}$ is the momentum operator, ${{\mathcal {A}}}$ is the vector potential, ${\varvec{B}}=\nabla \times {{\mathcal {A}}}$ is the magnetic field associated with ${{\mathcal {A}}}$, and $\varvec{s}$ is the angular momentum vector. Within the numerical practice, one approach is to apply a perturbation method to compute the variations of the characteristic parameters of, say, a molecule, with respect to the outside perturbation. It is interesting to go beyond and consider the full minimization problem of the perturbed energy. In Hartree-Fock theory one typically only takes into account the effect (ii), whereas in non-relativistic DFT it is common to include the spin-dependent term and to ignore (ii) and to study the minimization of the resulting nonlinear functional, which depends upon two densities, one for spin “up” electrons and the other for spin “down” electrons. Each density satisfies a normalisation constraint which can be interpreted as the total number of spin “up” or “down” electrons. The resulting spin-polarized density-functional theory (SDFT) was first developed by von Barth and Hedin in a very general setting [32] (see also [14, 21]). Magnetism of itinerant electrons in solid state materials can be described well by SDFT. This powerful tool underpins quantitative theoretical determination of spin magnetic moments, and it enables one to understand the mechanisms, which lead to the occurrence of magnetism in solid state materials [16, 36, 37].

Most SDFT applications use a restricted version, where local magnetization is constrained along a fixed direction, so-called collinear spin-polarized DFT. In systems with a collinear magnetization (and in which spin-orbit coupling is neglected), spin-up and spin-down states are orthogonal, and hence the density matrix is diagonal. This results in two Kohn–Sham equations, with one Kohn–Sham equation for the majority ($\uparrow $) spin channel, and one for the minority ($\downarrow $) spin channel. The majority and minority spin channels are connected only through the spin dependence of the exchange-correlation part of the effective one-particle potential. Since the two KS equations can be solved independently, the computational effort for a collinear calculation seems to be just twice the effort for a non-magnetic calculation. However, most magnetic calculations are computationally considerably more demanding since the quantities in question (magnetic moments, energy differences between various magnetic configurations) require much higher accuracy than what is needed for nonmagnetic systems. Systems that can be described by SDFT are all kinds of magnetic materials that assume a collinear magnetic order, e.g. ferromagnetic, antiferromagnetic or ferrimagnetic states [3, 8].

We study a new hybrid spin-density Kohn–Sham DFT in the presence of an external (nonuniform) magnetic field ${\varvec{B}}$ for which the energy minimization problem is given by

$$\begin{aligned} {{\mathcal {E}}}^\textrm{hy} ({\varvec{\gamma }}):= & {} \dfrac{1}{2} {\text {Tr}}\,\left( - \Delta \gamma ^{\uparrow \uparrow } \right) + \dfrac{1}{2} {\text {Tr}}\,\left( - \Delta \gamma ^{\downarrow \downarrow } \right) \nonumber \\{} & {} + {{\mathcal {J}}}(\rho _{{\varvec{\gamma }}}) + \int _{{{\mathbb {R}}}^3} {\text {tr}}\,_{\mathbb {C}^2} \left( {\varvec{U}} {\varvec{R}}_{{\varvec{\gamma }}} \right) \, d {{\textbf{r}}}+{{\mathcal {E}}}_{\textrm{xc}}^\textrm{cLSDA}(\rho _{{\varvec{\gamma }}}^{\uparrow \uparrow }, \rho _{{\varvec{\gamma }}}^{\downarrow \downarrow } ), \end{aligned}$$

(1.1)

where (the reader finds the definitions of ${\text {Tr}}\,( \cdot )$, ${\text {tr}}\,(\cdot )$ and ${{\mathcal {J}}}(\cdot )$ in Sect. 2) ${\varvec{\gamma }}$ is the $2 \times 2$ spin density matrix, V is the electron-nuclei potential,

$$\begin{aligned} {\varvec{U}}=\begin{pmatrix} V-\mu _{{\varvec{B}}} {\varvec{B}}_z &{}-\mu _{{\varvec{B}}} {\varvec{B}}_x+i \mu _{{\varvec{B}}} {\varvec{B}}_y \\ -\mu _{{\varvec{B}}} {\varvec{B}}_x-i \mu _{{\varvec{B}}} {\varvec{B}}_y&{}V+\mu _{{\varvec{B}}} {\varvec{B}}_z. \end{pmatrix} \end{aligned}$$

(1.2)

Moreover, the collinear, local-spin density approximation (LSDA) introduced by von Barth and Hedin [32] consists in writing

$$\begin{aligned} {{\mathcal {E}}}_{\textrm{xc}}^\textrm{LSDA}(\rho ^{\uparrow \uparrow }, \rho ^{\downarrow \downarrow }):= \dfrac{1}{2} \left[ {{\mathcal {E}}}_{\textrm{xc}}^{\textrm{LDA}} (2 \rho ^{\uparrow \uparrow }) + {{\mathcal {E}}}^{\textrm{LDA}}_{\textrm{xc}} (2 \rho ^{\downarrow \downarrow }) \right] , \end{aligned}$$

(1.3)

where $\rho ^{\alpha \alpha }$, $\alpha =\uparrow ,\downarrow $, are the two eigenvalues of the $2 \times 2$ density matrix, and $E_{\textrm{xc}}^\textrm{LDA}$ is the standard exchange-correlation functional in the non-polarized case, which we can write as [17]

$$\begin{aligned} {{\mathcal {E}}}_{\textrm{xc}}^\textrm{LDA} (\rho ) = \int _{{{\mathbb {R}}}^3} h(\rho ({{\textbf{r}}})) \, d^{3} {{\textbf{r}}}. \end{aligned}$$

(1.4)

The conditions on the function h is given in Assumption 2.1. We justify a spectral approximation scheme, known as spectral binning, developed by Wang et al. [35] for the standard KSLDA model, for the collinear Local Spin Density Approximation (LSDA) Kohn–Sham DFT in the presence of an external magnetic field, even allowing a nonuniform magnetic field. In order to transform the problem into one that can be solved numerically, we discretize the system. In the case of the spatial components, the kth level of discretization will be achieved by using the first k basis functions of the space. For the spectral components on the other hand, we will be using a different type of discretization known as spectral binning. The idea is that at the kth level of discretization, we make a step function with k intervals, and each interval will count the number of eigenvalues in that interval, or “bin”, which is where the name comes from. Since the number of eigenvalues is the same, we ensure that our discretized function remains in the minimization space. To give a mathematical justification of our hybrid, Kohn–Sham collinear LSDA DFT (1.1), we will show that as we increase the levels of iterations, the minimizers of the discretized equations will have Gamma-convergence [6]. This is desirable because the idea of $\Gamma $-convergence is that “the limit of the minimizers is the minimizer of the limit”. In other words, suppose we have a sequence of functionals, and each functional has it own minimizing function. If the functionals have $\Gamma $-convergence, what we say is that the functionals converge to a limit functional in the sense that minimizing functions will converge to a limit function, and furthermore this limit function will minimize the limit functional. In future, one can numerically search for the rates of convergence. We show convergence with the $\Gamma $-convergence method: we first show lower semi-continuity, and then show the existence of a recovery sequence. However, all of the above precludes that the non-discretized minimizing function can be given a spectral approximation. Therefore we must first prove this. If the Hamiltonian under consideration was linear, from spectral theory we would know that a spectral decomposition for the minimizer exists, and therefore we can build a spectral binning function. Unfortunately the Hamiltonian we use is not linear with respect to ${\varvec{\gamma }}$. But what we can do is “linearize” the system with respect to ${\varvec{\gamma }}$ by introducing dummy variables (which we either minimize or maximize over). The idea is that whatever is on the inside of the minimization/maximization will be linear with respect to ${\varvec{\gamma }}$. If we can furthermore show that we can bring the ${\varvec{\gamma }}$ minimization inside the dummy variable variations, we will have a new minimization problem on the inside which will be linear with respect to ${\varvec{\gamma }}$, and therefore open to being approximated by a spectral approximation. The approximation scheme and methodology is described in full in Sect. 3 after a summary of DFT in magnetic fields leading to the model studied herein. The specifics of the $\Gamma $-convergence method summarized above is described in Sect. 4, wherein we also state our main result, Theorem 4.1. All proofs are provided in Sect. 5.

Wang et al. [35] shows that the solution of the Kohn–Sham equations within the KSLDA theory (non-magnetic case) can be approximated by its spectral representation, and as one adds more terms to the spectral representation, it $\Gamma $-converges to the true solution. This is very useful when one does numerical calculations, such as in [25] where the authors apply this method to studying crystaline structures. A key idea in [35] is that, by reformulating the Kohn–Sham equations into a linear form, one can show that any ground state solution is in fact a Borel function, thus restricting the space of admissible minimizing functions. The first few spectral binning benchmark problem examples exhibit excellent accuracy and convergence characteristics [34]. For the numerical scheme developed for the hybrid model herein, it remains to be seen how well it captures magnetic properties, where the direction of the local magnetization is not constrained to a particular axis but can vary over the space [22] (this type of behaviour is known and has been observed in, for example, bulk $\gamma $-Fe, geometrically spin-frustrated lattices as jarosites, halogen salts of erbium, and surfaces; see [4] and references therein), or spin-orbit coupling is included [26].

2 Density functional theory in magnetic fields

2.1 Quantum mechanics

In accordance with the Schrödinger-Pauli equation, the electronic Hamiltonian for a molecular system consisting of N electrons and K nuclei (N and K are fixed throughout this paper) is given by

$$\begin{aligned} H(V,{{\mathcal {A}}})=\sum _{n=1}^N \frac{1}{2}\left( {\varvec{\sigma }}_n \cdot \left( -i\nabla _n+\frac{1}{c}{{\mathcal {A}}}({{\textbf{r}}}_n)\right) ^2\right) +\sum _{1\le m,n \le N}\frac{1}{|{{\textbf{r}}}_m-{{\textbf{r}}}_n|}+\sum _{n=1}^N V({{\textbf{r}}}_n), \end{aligned}$$

where ${{\mathcal {A}}}: {{\mathbb {R}}}^{3} \rightarrow {{\mathbb {R}}}^{3}$ is the magnetic vector potential (recall that ${\text {curl}}\,{{\mathcal {A}}}={\varvec{B}}$ is the magnetic field), ${\varvec{\sigma }}_{n}$ contains the Pauli matrices acting on the $n^\textrm{th}$ spin, viz.

$$\begin{aligned} {\varvec{\sigma }}_{n} = ( \sigma _{xn}, \sigma _{yn}, \sigma _{zn} ) = \left( \begin{pmatrix} 0 &{} 1 \\ 1 &{} 0 \end{pmatrix}, \begin{pmatrix} 0 &{} -i \\ i &{} 0 \end{pmatrix}, \begin{pmatrix} 1 &{} 0 \\ 0 &{} 1 \end{pmatrix} \right) , \end{aligned}$$

(2.1)

and $V({\varvec{r}}) = - \sum _{k=1}^{K} Z_{k} / | {\varvec{r}} -{\varvec{R}}_{k} |$, where ${\varvec{R}}_{k}$ denotes the location of the kth nucleus. The N-particle Hilbert space is $\varvec{{{\mathcal {L}}}}^{2} = \varvec{{{\mathcal {L}}}}_{N}^{2} = \bigotimes _{n=1}^{N} L^{2}({{\mathbb {R}}}^{3} \times \mathbb {C}^{2}) = \bigotimes _{n=1}^{N} L^{2}({{\mathbb {R}}}^{3}; \mathbb {C}^{2})$ and the antisymmetric analogue is $\varvec{{{\mathcal {L}}}}_\textrm{a}^{2}:= \varvec{{{\mathcal {L}}}}_{\textrm{a},N}^{2}:= \bigwedge _{n=1}^{N} L^{2}({{\mathbb {R}}}^{3}; \mathbb {C}^{2})$. Likewise, for Sobolev spaces $\varvec{{{\mathcal {H}}}}^{t}:=\varvec{{{\mathcal {H}}}}_{N}^{t}:= H^{t}({{\mathbb {R}}}^{3N} \times (\mathbb {C}^{2})^{N})$ and its antisymmetric analogue $\varvec{{{\mathcal {H}}}}_\textrm{a}^{t}:= \varvec{{{\mathcal {H}}}}_{\textrm{a}, N}^{t}:= \varvec{{{\mathcal {H}}}}^{t} \cap \varvec{{{\mathcal {L}}}}_\textrm{a}^{2}$. We will only need $t=1,2$ below. The operator $H(V,{{\mathcal {A}}})$ is linear and its form domain is $\varvec{{{\mathcal {H}}}}_\textrm{a}^{1}$ and any $\Psi \in \varvec{{{\mathcal {H}}}}_\textrm{a}^{1}$ has $2^N$ components and it satisfies

$$\begin{aligned} \left\{ \begin{array}{l} \Psi \left( {{\textbf{r}}}_{p(1)},s_{p(1)},{{\textbf{r}}}_{p(2)},s_{p(2)},\ldots ,{{\textbf{r}}}_{p(N)},s_{p(N)}\right) =\varepsilon (p)\Psi ({{\textbf{r}}}_1,s_1,{{\textbf{r}}}_2,s_2,\ldots ,{{\textbf{r}}}_N,s_N)\\ \\ \sum _{n=1}^N\sum _{s_1,\ldots ,s_N \in \{\uparrow ,\downarrow \}}\int _{{{\mathbb {R}}}^{3N}} \left| \nabla _n \Psi ( {{\textbf{r}}}_1,s_1,\ldots , {{\textbf{r}}}_{N}, s_{N})\right| ^2 \, d{{\textbf{r}}}_1\ldots d {{\textbf{r}}}_N<\infty . \end{array}\right. \end{aligned}$$

The one-particle Hilbert space is ${{\mathcal {H}}}=L^2({{\mathbb {R}}}^3;\mathbb {C}^2):=\left\{ \, {\varvec{\phi }}=(\phi ^{\uparrow },\phi ^{\downarrow }) \in (L^2({{\mathbb {R}}}^3))^{2} \,: \,\right. \left. \Vert {\varvec{\phi }} \Vert _{{{\mathcal {H}}}}<\infty \,\right\} $ with inner product

$$\begin{aligned} \langle {\varvec{\phi }},{\varvec{\psi }} \rangle _{{{\mathcal {H}}}}=\int _{{{\mathbb {R}}}^3} \left( \overline{\phi ^{\uparrow }({{\textbf{r}}})}\psi ^{\uparrow }({{\textbf{r}}})+\overline{\phi ^{\downarrow }({{\textbf{r}}})} \psi ^{\downarrow }({{\textbf{r}}})\right) \, d {{\textbf{r}}}. \end{aligned}$$

The set of N-particle density matrices describing the pure states is

$$\begin{aligned} {{\mathcal {P}}}_N:=\left\{ \,\Gamma =\, | {\Psi } \rangle \langle {\Psi }| \,:\;\Psi \in \bigwedge _{n=1}^{N} H^1({{\mathbb {R}}}^3;\mathbb {C}^2),\;\Vert \Psi \Vert _{L^{2}({{\mathbb {R}}}^{3N})}=1\,\right\} \end{aligned}$$

and the set of N-particle density matrices describing the mixed states, denoted by ${{\mathcal {M}}}_N$, is the convex hull $\textrm{CH} ({{\mathcal {P}}}_{N})$ of ${{\mathcal {P}}}_N$. Our main object of interest is the ground state energy defined as

$$\begin{aligned} E_\textrm{QM}(V, {{\mathcal {A}}})=\min _{\Psi \in \varvec{{{\mathcal {H}}}}_\textrm{a}^{1}, \Vert \Psi \Vert _{L^2}=1} \langle {\Psi },{H(V, {{\mathcal {A}}}) \Psi } \rangle \end{aligned}$$

(2.2)

or, equivalently, $E_\textrm{QM}(V,{{\mathcal {A}}})=\min _{\Gamma \in {{\mathcal {P}}}_N} {\text {Tr}}\,\big (H(V, {{\mathcal {A}}})\Gamma \big )$. Since $H(V, {{\mathcal {A}}})$ is linear and since ${{\mathcal {M}}}_N$ is the convex hull of ${{\mathcal {P}}}_N$, we have $E_\textrm{QM}(V, {{\mathcal {A}}})=\min _{\Gamma \in {{\mathcal {M}}}_N} {\text {Tr}}\,(H(V, {{\mathcal {A}}})\Gamma )$. A calculation yields

$$\begin{aligned} {\text {Tr}}\,\big (H(V,{{\mathcal {A}}})\Gamma \big )= & {} {\text {Tr}}\,(H (0,0) \Gamma ) \\{} & {} +\int \left( V({{\textbf{r}}})+\frac{1}{2}\dfrac{|{{\mathcal {A}}}(r)|^2}{c^2}\right) \rho ({{\textbf{r}}}) \, d {{\textbf{r}}}\\{} & {} +\int {{\mathcal {A}}}({{\textbf{r}}})\cdot {{\textbf{j}}}_{\rho }({{\textbf{r}}}) \, d {{\textbf{r}}}-\mu _{{\varvec{B}}} \int _{{{\mathbb {R}}}^3} {\varvec{B}} ({{\textbf{r}}}) \cdot {\varvec{m}}_{\Gamma } ({{\textbf{r}}}) \, d {{\textbf{r}}}, \end{aligned}$$

where the (total) electronic density is given by

$$\begin{aligned} \rho _{\Gamma }({{\textbf{r}}}):= N \sum _{s \in \{\uparrow ,\downarrow \}} \int _{{{\mathbb {R}}}_{\{\uparrow ,\downarrow \}}^{3(N-1)}} \Gamma ({{\textbf{r}}},s; {{\textbf{x}}}_2,\ldots ,{{\textbf{x}}}_N; {{\textbf{r}}},s; {{\textbf{x}}}_2,\ldots ,{{\textbf{x}}}_N) \, d {{\textbf{x}}}_2 \ldots d {{\textbf{x}}}_N, \end{aligned}$$

${\varvec{j}}_{\rho }$ is the paramagnetic current [9, Sect. 7.4] and ${\varvec{m}}_{\Gamma }$ is the spin density [32].

2.2 Spin density functional theory (SDFT)

As explained in the introduction, a magnetic field has two effects on a system of electrons and we usually study them separately under the following names:

Spin density functional theory (SDTF) for spin effects: ${{\mathcal {A}}}=0$ and ${\varvec{B}}\ne 0$.
Current DFT (CDFT) for orbital effects: ${{\mathcal {A}}}\ne 0$ and ${\varvec{B}}=0$.

In this paper we study the former which arises from the Pauli operator

$$\begin{aligned} H(V,{\varvec{B}}):= & {} \sum _{n=1}^{n} \left( -\frac{1}{2} \Delta _{n} + V({{\textbf{r}}}_{n}) \right) {\varvec{I}}_{2} - \mu \sum _{n=1}^{N} {\varvec{B}}({{\textbf{r}}}_{n}) \cdot {\varvec{\sigma }}_{n}\nonumber \\ {}{} & {} + \sum _{1 \le m,n \le N} \dfrac{1}{|{{\textbf{r}}}_m - {{\textbf{r}}}_n |} {\varvec{I}}_{2} \end{aligned}$$

(2.3)

acting in ${\varvec{{{\mathcal {L}}}}}_\textrm{a}^{2}$; hence no current densities enter but the particle density will be split into different spin components. (For CDFT and other density-variables in DFT with magnetic fields, we refer to [18, 23, 30]). We are thus interested in the SDFT (${{\mathcal {A}}}=0$, ${\varvec{B}} \ne 0$) energy expressed by

$$\begin{aligned} {\text {Tr}}\,\big (H(V,{\varvec{B}})\Gamma \big )={\text {Tr}}\,(H(0,0) \Gamma )+\int V({{\textbf{r}}})\rho ({{\textbf{r}}})\, d {{\textbf{r}}}-\mu _{{\varvec{B}}} \int {\varvec{B}}_{\Gamma } ({{\textbf{r}}}) \cdot {\varvec{m}}({{\textbf{r}}}) \, d {{\textbf{r}}}. \nonumber \\ \end{aligned}$$

(2.4)

For $\Gamma \in {{\mathcal {M}}}_{N}$ with Schwarz kernel $\Gamma ( {{\textbf{r}}}_{1}, s_{1}, \ldots , {{\textbf{r}}}_{N}, s_{N}; {{\textbf{r}}}^{\prime }, s_{1}^{\prime }, \ldots , {{\textbf{r}}}_{N}^{\prime }, s_{N}^{\prime } )$ we introduce the matrix of spin-polarized electronic densities (the spin density matrix)

$$\begin{aligned} {\varvec{R}}_{\Gamma }({{\textbf{r}}}):=\begin{pmatrix} \rho _{\Gamma }^{\uparrow \uparrow }({{\textbf{r}}}) &{}\rho _{\Gamma }^{\uparrow \downarrow }({{\textbf{r}}})\\ \rho _{\Gamma }^{\downarrow \uparrow }({{\textbf{r}}})&{}\rho _{\Gamma }^{\uparrow \uparrow }({{\textbf{r}}}) \end{pmatrix}, \end{aligned}$$

where, for $\alpha , \beta \in \{ \uparrow ,\downarrow \}$, we let

$$\begin{aligned} \rho _{\Gamma }^{\alpha \beta } ({{\textbf{r}}}):= & {} N \; \sum _{(s_2, \ldots , s_N) \in \{ \uparrow , \downarrow \}^{N-1}}\\ {}{} & {} \times \int _{{{\mathbb {R}}}^{3(N-1)}} \Gamma ({{\textbf{r}}}, \alpha , {{\textbf{r}}}_2, s_2, \ldots , {{\textbf{r}}}_N, s_N; {{\textbf{r}}}, \beta , {{\textbf{r}}}_2, s_2, \ldots , {{\textbf{r}}}_N, s_N) \, d {{\textbf{r}}}_2 \ldots d {{\textbf{r}}}_N. \end{aligned}$$

Then the spin angular momentum density ${\varvec{m}}_{\Gamma }$, is given by $ {\varvec{m}}_{\Gamma } = ( \rho _{\Gamma }^{\uparrow \downarrow }+ \rho _{\Gamma }^{\downarrow \uparrow }, i \rho _{\Gamma }^{\downarrow \uparrow }- i\rho _{\Gamma }^{\uparrow \downarrow }, \rho _{\Gamma }^{\uparrow \uparrow }- \rho _{\Gamma }^{\downarrow \downarrow } )^\textrm{T}$. With ${\varvec{B}}=({\varvec{B}}_x,{\varvec{B}}_y,{\varvec{B}}_z)$, we have

$$\begin{aligned} {\varvec{B}} \cdot {\varvec{m}}_{\Gamma } = {\varvec{B}}_x \left( \rho _{\Gamma }^{\uparrow \downarrow } + \rho _{\Gamma }^{\downarrow \uparrow } \right) + i {\varvec{B}}_y \left( \rho _{\Gamma }^{\downarrow \uparrow } - \rho _{\Gamma }^{\uparrow \downarrow }\right) + {\varvec{B}}_z \left( \rho _{\Gamma }^{\uparrow \uparrow } - \rho _{\Gamma }^{\downarrow \downarrow }\right) . \end{aligned}$$

We denote the $2 \times 2$ matrix trace by ${\text {tr}}\,_{\mathbb {C}^2}$. Since ${\varvec{m}}_{\Gamma } = {\text {tr}}\,_{\mathbb {C}^2} \left( \varvec{\sigma } \cdot {\varvec{R}}_{\Gamma } \right) $, and $\rho _{\Gamma } = \rho _{\Gamma }^{\uparrow \uparrow } + \rho _{\Gamma }^{\downarrow \downarrow }$ is the total electronic density, we have that ${\text {tr}}\,_{\mathbb {C}^2} \left( {\varvec{U}} {\varvec{R}}_{\Gamma } \right) = V \rho _{\Gamma } - \mu {\varvec{B}} \cdot {\varvec{m}}_{\Gamma }$, where ${\varvec{U}}$ is given in (1.2). Then we can recast Eq. (2.4) as ${\text {Tr}}\,\big (H(V,{\varvec{B}})\Gamma \big )={\text {Tr}}\,(H (0,{\varvec{0}}) \Gamma )+\int _{{{\mathbb {R}}}^{3}} {\text {tr}}\,_{\mathbb {C}^2}\left[ {\varvec{U}} {\varvec{R}}_{\Gamma }\right] \, d{{\textbf{r}}}$. When one puts ${\varvec{B}} = \varvec{0}$, one obtains the usual potential energy density V multiplied by the electron density $\rho _{\Gamma }$, which appears in spin-unpolarized DFT. Similar to standard DFT we apply the constrained search method (see the second equality below), attributed to Levy [19], Valone [31], and Lieb [20], yielding

$$\begin{aligned} E(V,{\varvec{B}})= & {} \min _{\Gamma \in {{\mathcal {M}}}_N} {\text {Tr}}\,\big (H(V,{\varvec{B}})\Gamma \big ) \nonumber \\ {}= & {} \min _{{\varvec{R}}_{\Gamma } \in {{\mathcal {J}}}_N \cdot ({{\mathcal {M}}}_N)} \left\{ \,F({\varvec{R}}_{\Gamma })+\int {\text {tr}}\,_{\mathbb {C}^2}\left[ {\varvec{U}} {\varvec{R}}_{\Gamma } \right] \,\right\} \end{aligned}$$

(2.5)

with $ F({\varvec{R}}_{\Gamma }): = \inf _{\Gamma \in {{\mathcal {M}}}_N,\Gamma \rightarrow {\varvec{R}}_{\Gamma } } {\text {Tr}}\,[H(0,0)\Gamma ]$ and the admissible set of spin density matrices given by ${{\mathcal {J}}}_N ({{\mathcal {M}}}_N):=\left\{ \, {\varvec{R}}_{\Gamma }\;: \; \Gamma \in {{\mathcal {M}}}_N\,\right\} $. Note that (2.5) is the minimum of a functional which depends only on the $2 \times 2$ spin-density matrix ${\varvec{R}}_{\Gamma }$; the latter explains the name SDFT. In comparison with the linear problem (2.2), which suffers from the curse of dimensionality,^{Footnote 1} the minimization problem (2.5) is formulated on a space of lower dimension (which is good) but, on the other hand, the problem has become nonlinear. We know that ${{\mathcal {J}}}_N({{\mathcal {M}}}_N)$ consists of all ${\varvec{R}}_{\Gamma } \in {{\mathcal {M}}}_{2\times 2}\big (L^1({{\mathbb {R}}}^3)\big )$ for which $\varvec{R}_{\Gamma }$ is Hermitian, nonnegative, $\int _{{{\mathbb {R}}}^3} {\text {tr}}\,_{\mathbb {C}^2}[ {\varvec{R}}_{\Gamma } ] \, d{{\textbf{r}}}=N$, and $\sqrt{{\varvec{R}}_{\Gamma }} \in {{\mathcal {M}}}_{2\times 2}$ with $\sqrt{\cdot }$ meant in the Hermitian sense [10]. One of the challenges of DFT is that the functional F cannot be easily evaluated. Kohn and Sham [17] made the problem practically accessible by approximating F by studying a system of non-interacting electrons. Following their approach, we introduce, for $\Gamma \in {{\mathcal {M}}}_N$, the one-particle density matrix

$$\begin{aligned} {\varvec{\gamma }}_{\Gamma }({{\textbf{r}}},{{\textbf{r}}}^\prime )=\begin{pmatrix} \gamma _{\Gamma }^{\uparrow \uparrow }&{}\gamma _{\Gamma }^{\uparrow \downarrow }\\ \gamma _{\Gamma }^{\downarrow \uparrow }&{}\gamma _{\Gamma }^{\downarrow \downarrow } \end{pmatrix}({{\textbf{r}}},{{\textbf{r}}}^\prime ), \end{aligned}$$

where, for $\alpha ,\beta \in \left\{ \uparrow ,\downarrow \right\} ^2$,

$$\begin{aligned} \gamma _{\Gamma }^{\alpha \beta }({{\textbf{r}}}, {{\textbf{r}}}^{\prime }):= & {} N \; \sum _{s_2,\ldots ,s_N\in \{\uparrow ,\downarrow \}^{N-1}}\int _{{{\mathbb {R}}}^{3(N-1)}}\\ {}{} & {} \times \Gamma \left( {{\textbf{r}}}, \alpha , {{\textbf{r}}}_2, s_2,\ldots ,{{\textbf{r}}}_N, s_N; {{\textbf{r}}}^{\prime }, \beta ,{{\textbf{r}}}_2, s_2,\ldots ,{{\textbf{r}}}_N, s_N\right) \, d {{\textbf{r}}}_2 \ldots d {{\textbf{r}}}_N. \end{aligned}$$

Then ${\varvec{R}}_{\Gamma }({{\textbf{r}}})={\varvec{\gamma }}_{\Gamma }({{\textbf{r}}},{{\textbf{r}}})$ and the set of mixed state one-particle density matrices is ${\varvec{{{\mathcal {C}}}}}_N:= \{ {\varvec{\gamma }}_{\Gamma }: \quad \Gamma \in {{\mathcal {M}}}_N \}$, and, identifying the kernel ${\varvec{\gamma }}({{\textbf{r}}}, {{\textbf{r}}}')$ with the corresponding operator of ${\mathcal {S}}(L^2({{\mathbb {R}}}^3; \mathbb {C}^2))$, the space of self-adjoint, bounded operators on $L^2({{\mathbb {R}}}^3; \mathbb {C}^2)$, Coleman [5] proved that

$$\begin{aligned} {\varvec{{{\mathcal {C}}}}}_N = \left\{ {\varvec{\gamma }} \in {\mathcal {S}}(L^2({{\mathbb {R}}}^3; \mathbb {C}^2)): \quad 0 \le {\varvec{\gamma }} \le 1, \quad {\text {Tr}}\,( {\varvec{\gamma }}) = N, \quad {\text {Tr}}\,( - \Delta {\varvec{\gamma }} ) < \infty \right\} , \nonumber \\ \end{aligned}$$

(2.6)

where ${\text {Tr}}\,( - \Delta {\varvec{\gamma }} ):= {\text {Tr}}\,( - \Delta \gamma ^{\uparrow \uparrow } ) + {\text {Tr}}\,( - \Delta \gamma ^{\downarrow \downarrow } )$. In a similar way, we can define, for $\lambda > 0$,

$$\begin{aligned} {\varvec{{{\mathcal {C}}}}}_\lambda := \left\{ {\varvec{\gamma }} \in {\mathcal {S}}(L^2({{\mathbb {R}}}^3; \mathbb {C}^2)), \quad 0 \le {\varvec{\gamma }} \le 1, \quad {\text {Tr}}\,( {\varvec{\gamma }}) = \lambda , \quad {\text {Tr}}\,( - \Delta {\varvec{\gamma }}) < \infty \right\} . \nonumber \\ \end{aligned}$$

(2.7)

By invoking the spectral theory for compact self-adjoint operators, we can write the components $\gamma ^{\alpha \beta }$ of any ${\varvec{\gamma }} \in {\varvec{{{\mathcal {C}}}}}_{\lambda }$ in the form

$$\begin{aligned}&\gamma ^{\alpha \beta } ({{\textbf{r}}}, {{\textbf{r}}}') = \sum _{k=1}^\infty n_k \xi _k^\alpha ({{\textbf{r}}}) \overline{\xi _k^\beta ({{\textbf{r}}}')}, \ 0 \le n_k \le 1, \ \sum _{k=1}^\infty n_k = \lambda , \nonumber \\&{\varvec{\xi }}_k = \begin{pmatrix} \xi _k^\uparrow \\ \xi _k^\downarrow \end{pmatrix} \in L^2({{\mathbb {R}}}^3; \mathbb {C}^2), \qquad \langle {{\varvec{\xi }}_k},{ {\varvec{\xi }}_l} \rangle = \delta _{kl}, \nonumber \\&{\text {Tr}}\,( - \Delta {\varvec{\gamma }}) := \sum _{k=1}^\infty n_k \Vert \nabla {\varvec{\xi }}_k \Vert _{L^2}^2 = {\text {Tr}}\,(- \Delta \gamma ^{\uparrow \uparrow }) + {\text {Tr}}\,(- \Delta \gamma ^{\downarrow \downarrow }) < \infty . \end{aligned}$$

(2.8)

Since ${\varvec{\gamma }}_{\Gamma } ( {{\textbf{r}}}, {{\textbf{r}}}) = {\varvec{R}}_{\Gamma }( {{\textbf{r}}})$, we write $ {\varvec{R}}_{{\varvec{\gamma }}} ({{\textbf{r}}}):= {\varvec{\gamma }}({{\textbf{r}}}, {{\textbf{r}}})$ for ${\varvec{\gamma }} \in {\varvec{{{\mathcal {C}}}}}_{N}$. We finally introduce ${\varvec{{{\mathcal {D}}}}}_\lambda := \left\{ {\varvec{R}} \in {{\mathcal {M}}}_{2 \times 2}(L^1({{\mathbb {R}}}^3)): \quad \exists {\varvec{\gamma }} \in {\varvec{{{\mathcal {C}}}}}_\lambda \text{ s.t. } {\varvec{R}} = {\varvec{R}}_{{\varvec{\gamma }}} \right\} $.

2.2.1 Extended Kohn–Sham approach

Following Kohn and Sham [17] we decompose $F({\varvec{R}})$ as $F({\varvec{R}})=T_\textrm{KS}({\varvec{R}})+{{\mathcal {J}}}(\rho _{{\varvec{R}}})+{{\mathcal {E}}}_\textrm{xc}({\varvec{R}})$ for all ${\varvec{R}} \in {\varvec{{{\mathcal {D}}}}}_{\lambda }$, where $T_\textrm{KS}({\varvec{R}})$ is the Kohn–Sham kinetic energy of a non-interacting system given by

$$\begin{aligned} T_\textrm{KS}({\varvec{R}}):=\inf _{{\varvec{\gamma }} \in {\varvec{{{\mathcal {C}}}}}_{\lambda }, {\varvec{R}}_{{\varvec{\gamma }}} ={\varvec{R}}} \left\{ \,\frac{1}{2}{\text {Tr}}\,(-\Delta {\varvec{\gamma }} )\,\right\} , \qquad \forall {\varvec{R}} \in {\varvec{{{\mathcal {D}}}}}_{\lambda }. \end{aligned}$$

The second term $J(\rho _{{\varvec{R}}})$ is the direct Coulomb energy given by

$$\begin{aligned} {{\mathcal {J}}}(\rho _{{\varvec{R}}}) =\frac{1}{2}\int _{{{\mathbb {R}}}^3}\int _{{{\mathbb {R}}}^3}\dfrac{\rho _{{\varvec{R}}} ({{\textbf{r}}}) \rho _{{\varvec{R}}}({{\textbf{r}}}^\prime )}{|{{\textbf{r}}}-{{\textbf{r}}}^{\prime } |} \, d {{\textbf{r}}}\, d {{\textbf{r}}}^\prime , \end{aligned}$$

and the third term ${{\mathcal {E}}}_\textrm{xc}({\varvec{R}})$ is the exchange-correlation energy given by ${{\mathcal {E}}}_\textrm{xc}({\varvec{R}}):=F({\varvec{R}})- T_\textrm{KS}({\varvec{R}}) - J({\varvec{R}})$ for all $ {\varvec{R}} \in {\varvec{{{\mathcal {D}}}}}_{\lambda }$.

2.2.2 Local spin density approximation

Next we address the question on how to choose ${{\mathcal {E}}}_\textrm{xc}({\varvec{R}})$. The local-spin density approximation (in short, LSDA) introduced by von Barth and Hedin [32] consists in the approximation

$$\begin{aligned} {{\mathcal {E}}}_{\textrm{xc}}({\varvec{R}}) \approx {{\mathcal {E}}}_{\textrm{xc}}^\textrm{LSDA}(\rho ^+, \rho ^-), \end{aligned}$$

(2.9)

where ${{\mathcal {E}}}_{\textrm{xc}}^\textrm{LSDA}(\rho ^+, \rho ^-)$ is given in (1.3), $\rho ^{+}$ and $\rho ^-$ are the two eigenvalues of the $2 \times 2$ matrix ${\varvec{R}}$, and ${{\mathcal {E}}}_{\textrm{xc}}^\textrm{LDA}$ is the standard LDA of the exchange-correlation functional in the non-polarized case [17], which takes the form (1.4), where the conditions imposed on the function h is summarized in Assumption 2.1 [35].

Assumption 2.1

(P1)
Smoothness condition. The function $h: {{\mathbb {R}}}^+ \cup \{0\} \rightarrow {{\mathbb {R}}}$ satisfies $h \in C^1 ({{\mathbb {R}}}^+ \cup \{0\})$.
(P2)
Curvative condition. The function h is concave in ${{\mathbb {R}}}^+$.
(P3)
Zero density condition: $h(0)=0$.
(P4)
Non-positivity condition: $h(t) \le 0$ for all $t \in {{\mathbb {R}}}^+$.
(P5)
Decay condition: For $t> 0$ the function h satisfies $h^{\prime } (t) \le 0$.
(P6)
Growth condition: For $t>0$, $C_1 |t|^{4/3} + C_2 \le |h(t)| \le C_3 |t|^{4/3} + C_4$ for some real constants $C_1> 0, \ C_2 \le 0, \ C_3 > 0$ and $C_4 \ge 0$.

We can extend h, by reflection, to a function from ${{\mathbb {R}}}$ to ${{\mathbb {R}}}$, setting $h(t):=h(-t)$ for $t<0$. This extended function, again denoted by h, is continuous in ${{\mathbb {R}}}$ due to the property (P3). Since h(t) is continuous in ${{\mathbb {R}}}$ and since $|h(t)| \le C_3 |t|^{4/3} + C_4$, from the upper bound in (P6), Fatou’s Lemma implies that ${{\mathcal {E}}}_\mathrm{{xc}}(\rho _{\gamma })$ is continuous in $L^{4/3}({{\mathbb {R}}}^3)$. Essentially all LDA xc terms used in practice are covered by Assumption 2.1 (see, e.g., [24]).

Taking into account the approximations in (2.9) and (1.4) in the minimization problem in (2.5) while switching to the one-particle density matrix formulation, we arrive at the minimization problem

$$\begin{aligned} I_\lambda := \inf \left\{ {{\mathcal {E}}}({\varvec{\gamma }}) \,: \, {\varvec{\gamma }} \in {\varvec{{{\mathcal {C}}}}}_\lambda \right\} , \end{aligned}$$

(2.10)

where

$$\begin{aligned} {{\mathcal {E}}}({\varvec{\gamma }}){} & {} : = \dfrac{1}{2} {\text {Tr}}\,\left( - \Delta \gamma ^{\uparrow \uparrow } \right) + \dfrac{1}{2} {\text {Tr}}\,\left( - \Delta \gamma ^{\downarrow \downarrow } \right) + {{\mathcal {J}}}(\rho _{{\varvec{\gamma }}}) \nonumber \\{} & {} \quad + \int _{{{\mathbb {R}}}^3} {\text {tr}}\,_{\mathbb {C}^2} \left( {\varvec{U}} {\varvec{R}}_{{\varvec{\gamma }}} \right) \, d {{\textbf{r}}}+{{\mathcal {E}}}_{\textrm{xc}}^\textrm{LSDA}(\rho _{{\varvec{\gamma }}^+}, \rho _{{\varvec{\gamma }}^-}). \end{aligned}$$

(2.11)

which takes into account an external field ${\varvec{B}} = ({\varvec{B}}_x, {\varvec{B}}_y, {\varvec{B}}_z)$; see (1.2). In the present work we replace the exchange-correlation term by its collinear version given by ${{\mathcal {E}}}_\textrm{xc}^{ cLSDA} (\rho _{{\varvec{\gamma }}}^{\uparrow \uparrow }, \rho _{{\varvec{\gamma }}}^{\downarrow \downarrow }):= {{\mathcal {E}}}_\textrm{xc}^\textrm{LSDA} (\rho _{{\varvec{\gamma }}}^{\uparrow \uparrow }$, $\rho _{{\varvec{\gamma }}}^{\downarrow \downarrow })$. For the resulting functional ${{\mathcal {E}}}^\textrm{hy} ({\varvec{\gamma }}):= {{\mathcal {E}}}({\varvec{\gamma }})$, see (2.11) where the last term ${{\mathcal {E}}}_{\textrm{xc}}^\textrm{LSDA}(\rho _{{\varvec{\gamma }}^+}, \rho _{{\varvec{\gamma }}^-})$ is replaced by ${{\mathcal {E}}}_\textrm{xc}^{ cLSDA} (\rho _{{\varvec{\gamma }}}^{\uparrow \uparrow }, \rho _{{\varvec{\gamma }}}^{\downarrow \downarrow })$, we justify the spectral binning approximation scheme.

3 Approximation scheme and methodology

In the spin DFT setting, we will be minimizing over density matrices, which we denote by ${\varvec{\gamma }}$. Since we will replace ${{\mathbb {R}}}^{3}$ by an open, bounded and simply-connected set $\Omega \subset {{\mathbb {R}}}^{3}$ (a domain), we first set the scene for density matrices defined on $\Omega $. Let ${{\mathcal {S}}}({{\mathcal {H}}})$ denote the set of self-adjoint, bounded operators on a Hilbert space ${{\mathcal {H}}}$. Using $\mathbb {C}^2$ to denote the $2 \times 2$ matrix structure we write

$$\begin{aligned} {\varvec{\gamma }} \in {{\mathcal {S}}}(L^2(\Omega ; \mathbb {C}^2)), \text{ with } \text{ kernel } {\varvec{\gamma }} ({{\textbf{r}}},{{\textbf{r}}}^\prime ) = \begin{pmatrix} \gamma ^{\uparrow \uparrow }({{\textbf{r}}},{{\textbf{r}}}^\prime ) &{} \gamma ^{\uparrow \downarrow }({{\textbf{r}}},{{\textbf{r}}}^\prime ) \\ \gamma ^{\downarrow \uparrow }({{\textbf{r}}},{{\textbf{r}}}^\prime ) &{} \gamma ^{\downarrow \downarrow }({{\textbf{r}}},{{\textbf{r}}}^\prime ) \end{pmatrix}. \end{aligned}$$

For $j \in {{\mathbb {N}}}$, we write an orthonormal basis of $L^2(\Omega ; \mathbb {C}^2)$ as $({\varvec{\xi }}_j)_{j=1}^\infty $, where ${\varvec{\xi }}_j({\varvec{r}}) = ( \xi _j^\uparrow ({\varvec{r}}), \xi _j^{\downarrow }({\varvec{r}}) )^\textrm{T}$ $\in L^2(\Omega ; \mathbb {C}^2)$. With the bra-ket notation, and using the basis $({\varvec{\xi }}_j)$, we may write ${\varvec{\gamma }}$ using $(\alpha _j)$ corresponding to the basis as ${\varvec{\gamma }}({{\textbf{r}}},{{\textbf{r}}}^\prime ) = \sum _{j=1}^\infty \alpha _j \, | {{\varvec{\xi }}_j({{\textbf{r}}})} \rangle \langle {{\varvec{\xi }}_j({{\textbf{r}}}^\prime )}| \,$. We define

$$\begin{aligned} {\varvec{{{\mathcal {C}}}}} = \big \lbrace {\varvec{\gamma }}\in {{\mathcal {S}}}\left( L^2 \left( \Omega ; \mathbb {C}^2\right) \right) : {\text {Tr}}\,({\varvec{\gamma }})< \infty , \, \text{ and } \, {\text {Tr}}\,(-\Delta {\varvec{\gamma }}) < \infty \big \rbrace , \end{aligned}$$

(3.1)

where ${\varvec{{{\mathcal {C}}}}}$ has the corresponding norm $\left\Vert {\varvec{\gamma }}\right\Vert _{{\varvec{{{\mathcal {C}}}}}}:= {\text {Tr}}\,(|{\varvec{\gamma }}|) + {\text {Tr}}\,(|\nabla |{\varvec{\gamma }}|\nabla |)$.

Recalling that $0 \le {\varvec{\gamma }}\le 1$ is equivalent to saying that, for the basis expansion of ${\varvec{\gamma }}$ we have $0 \le \alpha _i \le 1$ for all i, we will define a subspace ${\varvec{{{\mathcal {C}}}}}_N \subset {\varvec{{{\mathcal {C}}}}}$ by

$$\begin{aligned} {\varvec{{{\mathcal {C}}}}}_N=\left\{ {\varvec{\gamma }}\in {\varvec{{{\mathcal {C}}}}} \,: \, 0 \le {\varvec{\gamma }}\le 1, {\text {Tr}}\,({\varvec{\gamma }})=N \right\} . \end{aligned}$$

(3.2)

The only difference between the latter space and the space ${\varvec{{{\mathcal {C}}}}}_{N}$ found in (2.6) is that we have replaced ${{\mathbb {R}}}^{3}$ by $\Omega $ in the definitions in (3.1) and (3.2). Minimizing over ${\varvec{{{\mathcal {C}}}}}_N$ with respect to the ${\varvec{{{\mathcal {C}}}}}$ norm will still give a solution in ${\varvec{{{\mathcal {C}}}}}_N$ [11]. It is clear that we can define the $2 \times 2$ spin density and the total electronic density in the usual way. We note here that using the basis vector representation, we have that $\rho _{{\varvec{\gamma }}}({{\textbf{r}}})=\rho _{{\varvec{\gamma }}}({{\textbf{r}}})^{\uparrow \uparrow }+\rho _{{\varvec{\gamma }}}^{\downarrow \downarrow }({{\textbf{r}}})=\sum _{j=1}^\infty \alpha _j |{\varvec{\xi }}_j({{\textbf{r}}})|^2$.

3.1 Reformulation of problem on a bounded domain

It is convenient to introduce the notation

$$\begin{aligned} E_{ \mathrm ext}(\rho _{ {\varvec{\gamma }}}) = \int _{\Omega } V({{\textbf{r}}}) \rho _{{\varvec{\gamma }}} \, d{{\textbf{r}}}, \qquad E_\mathrm{{ZZ}}= \frac{1}{2} \sum _{i=1}^K \sum _{\begin{array}{c} j\ne i\\ j=1 \end{array}}^K \frac{Z_i Z_j}{| {{\textbf{R}}}_i-{{\textbf{R}}}_j|}. \end{aligned}$$

(3.3)

Our first aim is to reformulate the three terms ${{\mathcal {J}}}$, $E_\textrm{ext}$ and $E_\textrm{ZZ}$ appearing in ${{\mathcal {E}}}(\cdot )$, viz.

$$\begin{aligned} {{\mathcal {E}}}({\varvec{\gamma }})= & {} \dfrac{1}{2} {\text {Tr}}\,\left( - \Delta \gamma ^{\uparrow \uparrow } \right) + \dfrac{1}{2} {\text {Tr}}\,\left( - \Delta \gamma ^{\downarrow \downarrow } \right) \nonumber \\{} & {} + {{\mathcal {J}}}(\rho _{ {\varvec{\gamma }}})+E_{ \mathrm ZZ} + E_{ \mathrm ext}(\rho _{ {\varvec{\gamma }}}) - \mu \int _{\Omega } {\varvec{B}} \cdot {\varvec{m}}_{{\varvec{\gamma }}} d {{\textbf{r}}}+{{\mathcal {E}}}_{ \mathrm xc}^\textrm{cLSDA}(\rho _{{\varvec{\gamma }}}^{\uparrow \uparrow }, \rho _{{\varvec{\gamma }}}^{\downarrow \downarrow }). \nonumber \\ \end{aligned}$$

(3.4)

We will explicitly assume, given K fixed, that there exists an $\epsilon >0$ depending on K only so that for all $i \ne j$, we have

$$\begin{aligned} | {{\textbf{R}}}_i - {{\textbf{R}}}_j|> \epsilon . \end{aligned}$$

(3.5)

This essentially follows from the physical condition that the ith and jth positive nuclei at coordinates ${{\textbf{R}}}_i$ and ${{\textbf{R}}}_j$ repel each other. (We note however that this condition does not automatically translate to an infinite collection of nuclei). Writing ${\underline{{{\textbf{R}}}}} =({{\textbf{R}}}_1, \ldots , {{\textbf{R}}}_K)$, we introduce a mollifier function $f_{{{\textbf{R}}}_i}({{\textbf{r}}})$ and the “potential” function

$$\begin{aligned} {\mathfrak {f}}({{\textbf{r}}},{\underline{{{\textbf{R}}}}}) = \sum _{i=1}^K Z_i f_{{{\textbf{R}}}_i}({{\textbf{r}}}). \end{aligned}$$

(3.6)

Then, using Lemma 5.1 and Proposition 5.2 we find that we can write the electrostatic terms as

$$\begin{aligned}{} & {} {{\mathcal {J}}}(\rho _{{\varvec{\gamma }}}) + E_\mathrm{{ext}}(\rho _{{\varvec{\gamma }}}) + E_\mathrm{{ZZ}} \nonumber \\{} & {} \quad = \sup _{\phi \in W_0^{1,2}(\Omega )} \left\{ -\frac{1}{8\pi } \int _{\Omega } |\nabla \phi ({{\textbf{r}}})|^2 \, d{{\textbf{r}}}+ \int _{\Omega } \left( {\mathfrak {f}}({{\textbf{r}}},{\underline{{{\textbf{R}}}}}) + \rho _{{\varvec{\gamma }}}({{\textbf{r}}}) \right) \phi ({{\textbf{r}}}) d{{\textbf{r}}}\right\} + C_\mathrm{{self}}. \nonumber \\ \end{aligned}$$

(3.7)

By $C_\textrm{self}$ we mean a constant that is independent of $\rho _{{\varvec{\gamma }}}$ and $\phi $. It is worth noting that in [35] no justification is given for its analogue of (3.7). Although in [27, p. 259] a very brief outline is given, there is no formal statement or proof of an analogue to Proposition 5.2. We would like to stress that Proposition 5.2 and Lemma 5.1 and their proofs are new. In particular, the repulsion assumption (3.5) is not found elsewhere.

Let us define the $2 \times 2$ matrix-valued, linear operator ${\mathfrak {B}}$ which acts on ${{\mathcal {S}}}(L^2(\Omega ); \mathbb {C}^2)$ by

$$\begin{aligned} {\mathfrak {B}}:= & {} \mu {\varvec{B}}_x({{\textbf{r}}})\delta ({{\textbf{r}}},{{\textbf{r}}}^\prime ) \begin{pmatrix} 0&{}1\\ 1 &{} 0 \end{pmatrix} + \mu i{\varvec{B}}_y({{\textbf{r}}})\delta ({{\textbf{r}}},{{\textbf{r}}}^\prime ) \begin{pmatrix} 0&{}-1\\ 1 &{} 0 \end{pmatrix} \nonumber \\ {}{} & {} + \mu {\varvec{B}}_z({{\textbf{r}}})\delta ({{\textbf{r}}},{{\textbf{r}}}^\prime ) \begin{pmatrix} 1&{}0\\ 0 &{} -1 \end{pmatrix}, \end{aligned}$$

(3.8)

where $\delta $ is the Dirac delta distribution. Then for ${\varvec{\gamma }}\in {\varvec{{{\mathcal {C}}}}}_N$ we have that

$$\begin{aligned} {\text {Tr}}\,({\mathfrak {B}} {\varvec{\gamma }})= & {} \mu \int _{\Omega } {\varvec{B}}_x \left( \rho _{{\varvec{\gamma }}}^{\uparrow \downarrow } + \rho _{{\varvec{\gamma }}}^{\downarrow \uparrow } \right) + i {\varvec{B}}_y \left( \rho _{{\varvec{\gamma }}}^{\downarrow \uparrow } - \rho _{{\varvec{\gamma }}}^{\uparrow \downarrow } \right) \nonumber \\ {}{} & {} + {\varvec{B}}_z \left( \rho _{{\varvec{\gamma }}}^{\uparrow \uparrow } - \rho _{{\varvec{\gamma }}}^{\downarrow \downarrow }\right) \, d{{\textbf{r}}}. \end{aligned}$$

(3.9)

Our strategy for reformulating the exchange-correlation energy is to construct its dual functional, taking advantage of the concavity property (P2) of the xc term ${{\mathcal {E}}}_\textrm{xc}$ (we suppress the superscript for simplicity) in Assumption 2.1. Let

$$\begin{aligned} {\mathfrak {G}}_\mathrm{{xc}} (\rho ):= - {{\mathcal {E}}}_\mathrm{{xc}} (\rho ). \end{aligned}$$

(3.10)

From the properties (P2) and (P6) of the exchange-correlation function h, the functional ${\mathfrak {G}}_\mathrm{{xc}}$ is convex and continuous in $L^{4/3} (\Omega )$. As usual the Legendre-Fenchel dual ${\mathfrak {G}}_\mathrm{{xc}}^{\star }$ of ${\mathfrak {G}}_\mathrm{{xc}}$ is defined as

$$\begin{aligned} {\mathfrak {G}}_\mathrm{{xc}}^{\star } (u):= \sup _{\rho \in L^{4/3} (\Omega )} \{ \langle {u},{\rho } \rangle - {\mathfrak {G}}_\mathrm{{xc}} (\rho ) \}, \end{aligned}$$

(3.11)

where ${\mathfrak {G}}_\mathrm{{xc}}^{\star } (u)$ is expressed explicitly in (5.17) via the Legendre transform of $-h(t)$ with h being the function which enters in the xc energy, see (1.4). In Lemma 5.3 we show that

$$\begin{aligned} {\mathfrak {G}}_\mathrm{{xc}} (\rho _{{\varvec{\gamma }}}) = \sup _{u \in L^4 (\Omega )} \{ \langle {\rho _{{\varvec{\gamma }}}},{u} \rangle - {\mathfrak {G}}_\mathrm{{xc}}^{\star } (u) \}. \end{aligned}$$

(3.12)

As a direct consequence of (3.12) we can rewrite the ${{\mathcal {E}}}_\mathrm{{xc}}$ functional as

$$\begin{aligned} {{\mathcal {E}}}_\mathrm{{xc}} (\rho _{{\varvec{\gamma }}}) = \inf _{u \in L^4 (\Omega )} \{ - \langle {\rho _{{\varvec{\gamma }}}},{u} \rangle + {\mathfrak {G}}_\mathrm{{xc}}^{\star } (u) \}. \end{aligned}$$

In our case, we rewrite ${{\mathcal {E}}}_\textrm{xc}^\textrm{cLSDA}$ using duality as above, see Sect. 5.2, and our minimization problem thus becomes

$$\begin{aligned} {{\textsf {E}}}({\varvec{B}})= & {} \inf _{{\varvec{\gamma }}\in {\varvec{{{\mathcal {C}}}}}_{N}} \inf _{u \in L^4(\Omega )} \inf _{v \in L^4(\Omega )} \sup _{ \phi \in W_0^{1,2} (\Omega )} \bigg \{ \dfrac{1}{2} {\text {Tr}}\,\left( - \Delta \gamma ^{\uparrow \uparrow } \right) + \dfrac{1}{2} {\text {Tr}}\,\left( - \Delta \gamma ^{\downarrow \downarrow } \right) \\{} & {} \left. + \left\{ -\frac{1}{8\pi } \int _{\Omega } |\nabla \phi ({{\textbf{r}}})|^2 \, d{{\textbf{r}}}+ \int _{\Omega } \left( {\mathfrak {f}}({{\textbf{r}}},{\underline{{{\textbf{R}}}}}) + \rho _{{\varvec{\gamma }}}({{\textbf{r}}}) \right) \phi ({{\textbf{r}}}) \, d{{\textbf{r}}}\right\} + C_\textrm{self} \right. \\{} & {} \left. + \left( -\left\langle u,\rho _{{\varvec{\gamma }}}^{\uparrow \uparrow } \right\rangle +{\mathfrak {G}}_\mathrm{{xc}}^*(u) \right) + \left( -\left\langle v,\rho _{{\varvec{\gamma }}}^{\downarrow \downarrow } \right\rangle +{\mathfrak {G}}_\mathrm{{xc}}^*(v) \right) + {\text {Tr}}\,({\mathfrak {B}} {\varvec{\gamma }}) \right\} . \\ \end{aligned}$$

3.2 Spectral theory

We introduce an unbounded local operator $\Phi ({{\textbf{r}}},{{\textbf{r}}}')=\phi ({{\textbf{r}}}) \ \delta ({{\textbf{r}}},{{\textbf{r}}}')$ for $\phi \in W_0^{1,2} (\Omega )$ as in (3.7) and with $\delta $ representing the Dirac delta distribution so that

$$\begin{aligned} {\text {Tr}}\,(\Phi {\varvec{\gamma }}) = \int _{\Omega } \phi ({{\textbf{r}}}) \, \rho _{{\varvec{\gamma }}} ({{\textbf{r}}}) \ d{{\textbf{r}}}. \end{aligned}$$

(3.13)

It is convenient to rephrase the ground state energy as

$$\begin{aligned} {{\textsf {E}}}({\varvec{B}}) = \inf _{{\varvec{\gamma }}\in {\varvec{{{\mathcal {C}}}}}_{N}} \inf _{u \in L^4(\Omega )} \inf _{v \in L^4(\Omega )} \sup _{ \phi \in W_0^{1,2} (\Omega )} L(u,v,\phi ,{\varvec{B}}, {\varvec{\gamma }}) \end{aligned}$$

(3.14)

where the Lagrangian $L \,: \, L^4 (\Omega ) \times L^4 (\Omega ) \times W_0^{1,2} (\Omega ) \times L^2(\Omega )^3 \times {\varvec{{{\mathcal {C}}}}}_N \rightarrow {{\mathbb {R}}}$ is given by

$$\begin{aligned} L(u,v,\phi ,{\varvec{B}}, {\varvec{\gamma }}):= & {} \dfrac{1}{2} {\text {Tr}}\,\left( - \Delta \gamma ^{\uparrow \uparrow } \right) + \dfrac{1}{2} {\text {Tr}}\,\left( - \Delta \gamma ^{\downarrow \downarrow } \right) \nonumber \\{} & {} -\frac{1}{8\pi } \int _{\Omega } |\nabla \phi ({{\textbf{r}}})|^2 \, d{{\textbf{r}}}+ \int _{\Omega } {\mathfrak {f}}({{\textbf{r}}},{{\textbf{R}}}) \phi ({{\textbf{r}}}) d{{\textbf{r}}}+C_\textrm{self} +{\mathfrak {G}}_\mathrm{{xc}}^*(u) \nonumber \\{} & {} +{\mathfrak {G}}_\mathrm{{xc}}^*(v) + {\text {Tr}}\,(\Phi {\varvec{\gamma }}) - {\text {Tr}}\,(U {\varvec{\gamma }}) - {\text {Tr}}\,(V {\varvec{\gamma }}) -{\text {Tr}}\,({\mathfrak {B}} {\varvec{\gamma }}) \nonumber \\= & {} {\text {Tr}}\,( {\varvec{H}}(u,v,\phi ,{\varvec{B}}){\varvec{\gamma }}) + \int _{\Omega } \left( - \frac{1}{8\pi } |\nabla \phi ({{\textbf{r}}})|^2 + {\mathfrak {f}}({{\textbf{r}}},{{\textbf{R}}}) \phi ({{\textbf{r}}}) \right) d{{\textbf{r}}}\nonumber \\{} & {} +C_\textrm{self} +{\mathfrak {G}}_\mathrm{{xc}}^*(u)+{\mathfrak {G}}_\mathrm{{xc}}^*(v), \end{aligned}$$

(3.15)

with the $2 \times 2$ Hamiltonian ${\varvec{H}}(u,v,\phi ,{\varvec{B}}) = - \dfrac{1}{2} \Delta \otimes \varvec{I}_{2} + \Phi - U -V - {\mathfrak {B}}$ with $\varvec{I}_{2}$ being the $2 \times 2$ matrix-valued identity operator and $\Phi $, U, V, and ${\mathfrak {B}}$ are defined above. Since we can exchange the order of the infimum, the ground-state energy also equals

$$\begin{aligned} {{\textsf {E}}}({\varvec{B}}) =\inf _{u \in L^4 (\Omega )} \inf _{v \in L^4(\Omega )} \inf _{{{\varvec{\gamma }}}\in {\varvec{{{\mathcal {C}}}}}_N} \sup _{\phi \in W_0^{1,2} (\Omega )} L(u,v,\phi , {\varvec{B}},{\varvec{\gamma }}). \end{aligned}$$

(3.16)

Next we identify sufficient properties of $L(u,v,\cdot ,{\varvec{B}},\cdot )$ that allow us to exchange the order of the infimum over ${\varvec{\gamma }}\in {\varvec{{{\mathcal {C}}}}}_N$ and the supremum over $\phi \in W_0^{1,2} (\Omega )$. In particular, we note that $L(u,v,\phi ,{\varvec{B}},{\varvec{\gamma }})$ is linear with respect to ${\varvec{\gamma }}$. This explains the first equality in

$$\begin{aligned} {{\textsf {E}}}({\varvec{B}})= & {} \inf _{{\varvec{\gamma }}\in {\varvec{{{\mathcal {C}}}}}_N} \inf _{u \in L^4(\Omega )} \inf _{v \in L^4(\Omega )} \sup _{\phi \in W_0^{1,2}(\Omega )}\;L(u,v,\phi ,{\varvec{B}}, {\varvec{\gamma }}) \\ {}= & {} \inf _{u \in L^4(\Omega )} \inf _{v \in L^4(\Omega )} \inf _{{\varvec{\gamma }}\in {\varvec{{{\mathcal {C}}}}}_N} \sup _{\phi \in W_0^{1,2}(\Omega )}\; L(u,v,\phi , {\varvec{B}},{\varvec{\gamma }}), \end{aligned}$$

and it is well-known that we can swap the places of infimum which explains the second equality. In Sect. 5.3 we establish Theorem 5.6 which allows us to interchange $\inf _{{\varvec{\gamma }}}$ and $\sup _\phi $ above. Proposition 2.2 in Ekeland [7, Chapter 6] underpins the theorem. This results in

$$\begin{aligned} {{\textsf {E}}}({\varvec{B}}) = \inf _{u \in L^4(\Omega )} \inf _{v \in L^4(\Omega )} \sup _{\phi \in W_0^{1,2}(\Omega )} \inf _{{\varvec{\gamma }}\in {\varvec{{{\mathcal {C}}}}}_N} L(u,v,\phi , {\varvec{B}},{\varvec{\gamma }}). \end{aligned}$$

(3.17)

Hence minimization with respect to ${\varvec{\gamma }}$ can be executed while keeping $\phi $, u and v constant. This makes it possible to use spectral approximation. Moreover, the Lagrangian L depends linearly on ${\varvec{\gamma }}$ and the latter only enters in the trace of ${\varvec{H}}(u,v,\phi ,{\varvec{B}}){\varvec{\gamma }}$. This motivates the definition of the band energy as $E_\mathrm{{band}} (u,v,\phi ,{\varvec{B}}):= {\text {Tr}}\,({\varvec{H}}(u,v,\phi ,{\varvec{B}}){{\varvec{\gamma }}})$. Utilizing spectral theory, one can form a basis of $L^2(\Omega ; \mathbb {C}^2)$ using a countable family consisting of the orthonormal eigenvectors ${\varvec{\xi }}_k$ of ${\varvec{H}}(u,v,\phi ,{\varvec{B}})$. We recall that they can be written as ${\varvec{\xi }}_k = ( \xi _k^\uparrow , \xi _k^\downarrow )^\textrm{T}$. In Theorem 5.7, we prove that, for every $u,v\in L^4 (\Omega )$, ${\varvec{B}} \in L^2(\Omega )$ and every $\phi \in W_0^{1,2} (\Omega )$ a minimizer of the band energy $E_\mathrm{{band}} (u,v,\phi ,{\varvec{B}},\cdot )$ in ${\varvec{{{\mathcal {C}}}}}_N$ commutes with the Hamiltonian ${\varvec{H}}(u,v,\phi ,{\varvec{B}})$. In particular, a minimizer ${\varvec{\gamma }} \in {\varvec{{{\mathcal {C}}}}}_{N}$ of $E_\mathrm{{band}} (u,v,\phi ,{\varvec{B}},\cdot )$ can be written as a spectral function, i.e., ${\text {argmin}}\,_{{\varvec{\gamma }}\in {\varvec{{{\mathcal {C}}}}}_{N}} E_\mathrm{{band}} (u,v,\phi ,{\varvec{B}},{\varvec{\gamma }}) = g( {\varvec{H}}(u,v,\phi ,{\varvec{B}})$, wherein g is the Borel function given in (5.37). Since the minimizer is of the form of a Borel function, it suffices to minimize over Borel functions only. In other words, we will minimize over the space

$$\begin{aligned} {\varvec{{\widehat{{{\mathcal {C}}}}}}}_N = \lbrace {{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}_N \,: \, {{\varvec{\gamma }}}=g( {\varvec{H}}(u,v,\phi ,{\varvec{B}}) ), \, \, g \text{ is } \text{ a } \text{ Borel } \text{ function } \rbrace \end{aligned}$$

(3.18)

and the minimization problem becomes

$$\begin{aligned} \inf _{{\varvec{\gamma }}\in {\varvec{{{\mathcal {C}}}}}_{N}} E_\mathrm{{band}} (u,v,\phi ,{\varvec{B}},{\varvec{\gamma }}) = \inf _{{\varvec{\gamma }}\in {\varvec{{\widehat{{{\mathcal {C}}}}}}}_N} E_\mathrm{{band}} (u,v,\phi ,{\varvec{B}},{\varvec{\gamma }}). \end{aligned}$$

(3.19)

3.3 Spatial discretization

At this stage it is necessary to introduce a spatial discretization of the function spaces $W_{0}^{1,2}(\Omega )$, $L^{4}(\Omega )$ and $L^{2}(\Omega )^3$ by restriction to finite-dimensional subspaces. Specifically, for $j \in {{\mathbb {N}}}$, let $W_0^{1,2} (\Omega )_j$ be from a family of finite-dimensional exhausting subspaces of $W_0^{1,2} (\Omega )$ spanned by the basis $\{e_1, \ldots , e_j\}$. This can be for example a subspace that corresponds to a finite element discretization. We let $L^4 (\Omega )_j$ be from a family of finite-dimensional exhausting subspaces of $L^4 (\Omega )$ spanned by the basis $\{ d_1, \ldots , d_j \}$. These can be for example piecewise constant functions. We also have $\{{\mathfrak {d}}_1, \ldots , {\mathfrak {d}}_j \}$ as the basis of $(L^2(\Omega )_j)^{3}$ for the analogous discretization of $L^2(\Omega )^3$. The restriction of the electrostatic field to $W_0^{1,2} (\Omega )_j$ is of the form

$$\begin{aligned} \phi _j ({{\textbf{r}}}) = \sum _{a=1}^j \varphi _a e_a ({{\textbf{r}}}). \end{aligned}$$

(3.20)

Accordingly, the restriction of the nuclear charge distribution is

$$\begin{aligned} {\mathfrak {f}}_j ({{\textbf{r}}},{{\textbf{R}}}) =\sum _{a=1}^j {{\mathfrak {f}}_a^{\varvec{{{\textbf{R}}}}} } e_a ({{\textbf{r}}}) \end{aligned}$$

(3.21)

and the restricted dual density potential is

$$\begin{aligned} u_j ({{\textbf{r}}}) =\sum _{a=1}^j {\mathfrak {u}}_a d_a ({{\textbf{r}}}), \qquad v_j ({{\textbf{r}}}) =\sum _{a=1}^j {\mathfrak {v}}_a d_a ({{\textbf{r}}}). \end{aligned}$$

(3.22)

In (3.20)-(3.22) the coefficients $\{ \varphi _a \}_{a=1}^{j}$, $\{ {\mathfrak {f}}_a^{\varvec{{{\textbf{R}}}}} \}_{a=1}^{j}$, $\{ {\mathfrak {u}}_a \}_{a=1}^j$ and $\{ {\mathfrak {v}}_a \}_{a=1}^{j}$ are sets of real numbers. For the magnetic field term, we define the discretized magnetic field as ${\varvec{B}}_j:= ( {\varvec{B}}_{j,x}, {\varvec{B}}_{j,y}, {\varvec{B}}_{j,z} )^\textrm{T}$, with ${\varvec{B}}_{j,\nu }({{\textbf{r}}}) = \sum _{a=1}^j {{\mathcal {B}}}_{\nu ,a} {\mathfrak {d}}_a({{\textbf{r}}})$, $\nu = x, y, z$. For the notation, we note that ${\mathfrak {B}}$ and $\mathfrak {B^j}$, which will be defined in (3.24), are operators. Next, ${\varvec{B}}$ and ${\varvec{B}}_{j}$ are vectors, ${\varvec{B}}_{x}$ and ${\varvec{B}}_{j,x}$ are vector components, and $\lbrace {{\mathcal {B}}}_{x,a} \rbrace _{a=1}^j$ are real coefficients. Moreover, we define the discrete counterparts to ${\varvec{\gamma }}$, $\rho _{{\varvec{\gamma }}}$, $\rho _{{\varvec{\gamma }}}^{\uparrow \uparrow }$ and $\rho _{{\varvec{\gamma }}}^{\downarrow \downarrow }$; we refer to Sect. 5.5 for details. Next we introduce

$$\begin{aligned} {\varvec{{{\mathcal {C}}}}}_N^{j}= \left\{ {{\varvec{\gamma }}}: {{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}_N \cap S\left( W_0^{1,2}(\Omega ;\mathbb {C}^{2})_j \right) \right\} . \end{aligned}$$

(3.23)

The discrete Lagrangians $L^j$, obtained by restriction of the functional L in (3.15) to $L^4 ( \Omega )_j \times L^4 ( \Omega )_j \times W_0^{1,2} ( \Omega )_j \times L^2(\Omega )_{j}^{3} \times {\varvec{{{\mathcal {C}}}}}_N^{j}$ follow as

$$\begin{aligned} L(u,v,\phi , {\varvec{B}},{{\varvec{\gamma }}})^j= & {} {\text {Tr}}\,( {\varvec{H}}^j(u,v,\phi ,{\varvec{B}}) {\varvec{\gamma }}_j) \\{} & {} +\sum _{a_1=1}^j \sum _{a_2=1}^j \left( -\frac{1}{8\pi } \int _{\Omega } \phi _{a_1} A_{a_1,a_2} \phi _{a_2} + {\mathfrak {f}}_{a_1}^R {\mathcal {M}}_{a_1,a_2} \phi _{a_2} \right) \\ {}{} & {} +{\mathfrak {G}}_\mathrm{{xc}}^*(u)+{\mathfrak {G}}_\mathrm{{xc}}^*(v). \end{aligned}$$

Here ${\varvec{H}}^j(u,v,\phi ,{\varvec{B}})$ denotes the restriction of the operators ${\varvec{H}}^j(u,v,\phi ,{\varvec{B}})$ to the finite-dimensional subspaces $L^4(\Omega )_j$, $W_0^{1,2} (\Omega )_j$ and $L^2(\Omega )_{j}^{3}$, and we define

$$\begin{aligned} {\varvec{H}}^j&:= \frac{1}{2}A^{j} + \Phi ^j -U^j -V^j -{\mathfrak {B}}^j ,\\ A_{a_1,a_2}&:= \int _{\Omega } \nabla {\varvec{\xi }}_{a_1}({{\textbf{r}}}) \cdot \nabla {\varvec{\xi }}_{a_2}({{\textbf{r}}}) \, d{{\textbf{r}}}, \qquad {\mathcal {M}}_{a_1,a_2} := \int _{\Omega } {\varvec{\xi }}_{a_1}({{\textbf{r}}}) \cdot {\varvec{\xi }}_{a_2}({{\textbf{r}}}) \, d{{\textbf{r}}}, \\ \Phi ^j_{a_1,a_2}&:= \int _{\Omega } \left( \sum _{a=1}^j \phi _a e_a({{\textbf{r}}}) \right) {\varvec{\xi }}_{a_1}({{\textbf{r}}}) \cdot {\varvec{\xi }}_{a_2}({{\textbf{r}}}) \, d{{\textbf{r}}},\\ U^j_{a_1,a_2}&:= \int _{\Omega } \left( \sum _{a=1}^j {{\mathfrak {u}}}_a d_a({{\textbf{r}}}) \right) \xi _{a_1}^{\uparrow }({{\textbf{r}}}) \xi _{a_2}^{\uparrow }({{\textbf{r}}}) \, d{{\textbf{r}}},\\ V^j_{a_1,a_2}&:= \int _{\Omega } \left( \sum _{a=1}^j {\mathfrak v}_a d_a({{\textbf{r}}}) \right) \xi _{a_1}^{\downarrow }({{\textbf{r}}}) \xi _{a_2}^{\downarrow }({{\textbf{r}}}) \, d{{\textbf{r}}}, \end{aligned}$$

where

$$\begin{aligned}&A^j = \sum _{a_1=1}^j \sum _{a_2=1}^j A_{a_1,a_2}^j, \qquad \Phi ^j = \sum _{a_1=1}^j \sum _{a_2=1}^j \Phi _{a_1,a_2}^j , \\&U^j = \sum _{a_1=1}^j \sum _{a_2=1}^j U_{a_1,a_2}^j, \qquad V^j = \sum _{a_1=1}^j \sum _{a_2=1}^j V_{a_1,a_2}^j . \end{aligned}$$

We also define the discrete magnetic operator term as

$$\begin{aligned} {\mathfrak {B}}^j =&\mu \int _{\Omega } \left( \sum _{a=1}^j {{\mathcal {B}}}_{x,a} {\mathfrak {d}}_a({{\textbf{r}}}) \right) \sum _{a_1=1}^j \sum _{a_2=1}^j {\varvec{\xi }}_{a_1}({{\textbf{r}}}) \begin{pmatrix} 0 &{} 1 \\ 1 &{} 0 \end{pmatrix} {\varvec{\xi }}_{a_2} dr\nonumber \\&+\mu \int _{\Omega } \left( \sum _{a=1}^j i {{\mathcal {B}}}_{y,a} {\mathfrak {d}}_a({{\textbf{r}}}) \right) \sum _{a_1=1}^j \sum _{a_2=1}^j {\varvec{\xi }}_{a_1}({{\textbf{r}}}) \begin{pmatrix} 0 &{} -1 \\ 1 &{} 0 \end{pmatrix} {\varvec{\xi }}_{a_2} dr\nonumber \\&+\mu \int _{\Omega } \left( \sum _{a=1}^j {{\mathcal {B}}}_{z,a} {\mathfrak {d}}_a({{\textbf{r}}}) \right) \sum _{a_1=1}^j \sum _{a_2=1}^j {\varvec{\xi }}_{a_1}({{\textbf{r}}}) \begin{pmatrix} 1 &{} 0 \\ 0 &{} -1 \end{pmatrix} {\varvec{\xi }}_{a_2} dr. \end{aligned}$$

(3.24)

Formally, A and ${{\mathcal {M}}}$ also depend on j as they are restrictions of operators to the span of $\{\xi _1,\ldots , \xi _j\}$. We omit this dependence here for simplicity of notation. The discrete band energy is given by

$$\begin{aligned}{} & {} E_\mathrm{{ band}}^j \,: \, L^4 (\Omega )_j \times L^4 (\Omega )_j \times W_0^{1,2} (\Omega )_j \times L^2(\Omega )_{j}^{3} \times {\varvec{{{\mathcal {C}}}}}_N^{j} \rightarrow {{\mathbb {R}}}, \end{aligned}$$

(3.25)

$$\begin{aligned}{} & {} E_\mathrm{{band}}^j (u,v,\phi , {\varvec{B}},{\varvec{\gamma }}):= {\text {Tr}}\,( {\varvec{H}}^j (u,v,\phi ,{\varvec{B}}) \ {\varvec{\gamma }}_j). \end{aligned}$$

(3.26)

In the following we write ${\varvec{H}}^j \equiv {\varvec{H}}^j (u,v,\phi , {\varvec{B}})$ for simplicity. Similarly to (3.23) we need to introduce the family of discrete constraint sets

$$\begin{aligned} {\varvec{{{\mathcal {C}}}}}_N^{{\varvec{H}}^j}:= \{ {\varvec{\gamma }}\in {\varvec{{{\mathcal {C}}}}}_N^{j} \,: \, {\varvec{\gamma }}=g( {\varvec{H}}^j) \text{ for } \text{ a } \text{ Borel } \text{ function } g \text{ over } {{\mathbb {R}}}, \ 0 \le g \le 1 \}. \nonumber \\ \end{aligned}$$

(3.27)

Likewise, the corresponding discrete energies ${{\textsf {E}}}_j$ follow as

$$\begin{aligned} {{\textsf {E}}}_j =\inf _{u \in L^4 (\Omega )_j} \inf _{v \in L^4 (\Omega )_j} \sup _{ \phi \in W_0^{1,2} (\Omega )_j} \inf _{{{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}_N^{{\varvec{H}}^j}} L^j (u,v,\phi , {\varvec{B}},{\varvec{\gamma }}). \end{aligned}$$

(3.28)

Theorem 5.6 (see Sect. 5.3) justifies the interchange of $\inf _{{\varvec{\gamma }}}$ and $\sup _\phi $ above. In Sect. 5.5 our choices of $\Phi _{a_{1}, a_{2}}^{j}$, ${\mathfrak {B}}^{j}$ etc., are provided.

3.4 Spectral binning

We now will give the discretization for the minimizers following Wang et al. [35]. As the original ${\varvec{H}}$ was a self-adjoint $2 \times 2$ matrix operator, so is ${\varvec{H}}^j$. Hence, we can make a spectral discretization of it, in the form ${\varvec{H}}^j = \int _{{\text {spec}}\,({\varvec{H}}^j)} \lambda \, d {\varvec{P}}^j(\lambda )$, where ${\varvec{P}}^j$ is the resolution of the identity, and ${\text {spec}}\,({\varvec{H}}^j)$ is the spectrum. For ${\varvec{\gamma }}_j$ there also exist bounded Borel functions $g^j \,: \, {{\mathbb {R}}}\rightarrow {{\mathbb {R}}}$ so that $ {\varvec{\gamma }}_j= \int _{{\text {spec}}\,({\varvec{H}}^j)} g^j(\lambda ) \, d {\varvec{P}}^j(\lambda ) =: g^j({\varvec{H}}^j)$. Then

$$\begin{aligned} E^j(g^j):= & {} {\text {Tr}}\,({\varvec{H}}^j {\varvec{\gamma }}_j) = \sum _{a=1}^\infty \int _{{\text {spec}}\,({\varvec{H}}^j)} g^j(\lambda ) \lambda \, d \mu _{{\varvec{\xi }}_a,{\varvec{\xi }}_a}^j (\lambda ), \end{aligned}$$

(3.29)

$$\begin{aligned} N^j(g^j):= & {} {\text {Tr}}\,({\varvec{\gamma }}_j) =\sum _{a=1}^\infty \int _{{\text {spec}}\,({\varvec{H}}^j)} g^j(\lambda ) \, d \mu _{{\varvec{\xi }}_a,{\varvec{\xi }}_a}^j (\lambda ) \end{aligned}$$

(3.30)

with the spectral measure

$$\begin{aligned} \mu _{{\varvec{\xi }}_a,{\varvec{\xi }}_a}^j (\lambda ):= \left\langle {\varvec{\xi }}_a, {\varvec{P}}^j(\lambda ) {\varvec{\xi }}_a \right\rangle . \end{aligned}$$

(3.31)

We let $\mathscr {B}$ denote the space of real-valued, bounded Borel functions defined on ${{\mathbb {R}}}$. If, for every a, (3.29)–(3.30) and (3.31) are known, the determination of ${\varvec{\gamma }}^{j}$ for fixed $(\phi ,u,v,{\varvec{B}})$ amounts to minimizing

$$\begin{aligned} \left\{ \begin{array}{l} E^j(g^j) \text{ over } g^{j} \in \mathscr {B}, \\ \text{ subject } \text{ to } \text{ the } \text{ constraints } 0 \le g^j \le 1, N^j(g^j)=N. \\ \end{array} \right. \end{aligned}$$

Minimizing over all Borel functions is not numerically feasible and therefore we need to restrict to a finite-dimensional subspace $\mathscr {B}_{k}$ of $\mathscr {B}$. Let $\lbrace s_q\rbrace _{q=1}^k$ be a basis of $\mathscr {B}_{k}$ such that the minimization problem above is replaced by

$$\begin{aligned} \left\{ \begin{array}{l} E^j(g^j) \text{ over } g^{j} \in \mathscr {B}_{k} \qquad \text{ i.e. } g^{j} =\sum _{q=1}^{k} c_{q} s_{q} \\ \text{ subject } \text{ to } \text{ the } \text{ constraints } 0 \le g^j \le 1, N^j(g^j)=N. \\ \end{array} \right. \end{aligned}$$

With this in mind, we introduce the subset

$$\begin{aligned} {\varvec{{{\mathcal {C}}}}}_{N,k}^{{\varvec{H}}^j}:= \left\{ {{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}_{N}^{{\varvec{H}}^j} \,: \, {{\varvec{\gamma }}} = \sum _{q=1}^k c_q s_q({\varvec{H}}^j) \right\} . \end{aligned}$$

With this definition, the discrete band energy will become

$$\begin{aligned} E^j(g^j)= & {} E \left( \sum _{q=1}^k c_q s_q\right) = {\text {Tr}}\,( {\varvec{H}}^j {{\varvec{\gamma }}}) =\sum _{q=1}^\infty \int _{{\text {spec}}\,({\varvec{H}}^j)} \lambda \sum _{q=1}^k c_q s_q(\lambda ) \, d \mu _{{\varvec{\xi }}_1,{\varvec{\xi }}_2}^{j} (\lambda ) \nonumber \\= & {} \sum _{q=1}^\infty c_q \left( \int _{{\text {spec}}\,({\varvec{H}}^j)} \lambda \sum _{q=1}^k s_q(\lambda ) \, d \mu _{{\varvec{\xi }}_1,{\varvec{\xi }}_2}^{j} (\lambda ) \right) =: \sum _{q=1}^k c_q w_{q}^{k,j}. \end{aligned}$$

(3.32)

Similarly the number of electrons will be

$$\begin{aligned} N^j(g^j)= & {} N^j \left( \sum _{q=1}^k c_q s_q\right) =\sum _{q=1}^\infty \int _{{\text {spec}}\,({\varvec{H}}^j)} \sum _{q=1}^k c_q s_q(\lambda ) \, d \mu _{{\varvec{\xi }}_1,{\varvec{\xi }}_2}^j(\lambda ) \nonumber \\= & {} \sum _{q=1}^\infty c_q \left( \int _{{\text {spec}}\,({\varvec{H}}^j)} \sum _{q=1}^k s_q(\lambda ) \, d \mu _{{\varvec{\xi }}_1,{\varvec{\xi }}_2}^j(\lambda ) \right) =: \sum _{q=1}^k c_q n_{q}^{k,j}. \end{aligned}$$

(3.33)

Therefore, with $c= ( c_1, \dots , c_k ) \in {{\mathbb {R}}}^k$ the minimization becomes

$$\begin{aligned} \min _{c \in {{\mathbb {R}}}^k} \left\{ E^j \left( \sum _{q=1}^k c_q s_q \right) : 0 \le c_q \le 1, \qquad \sum _{q=1}^k c_q n_q^{k,j} =N \right\} . \end{aligned}$$

We proceed to specifying the basis $\lbrace s_q \rbrace _{q=1}^{k}$ of $\mathscr {B}_{k}$. Let ${{\mathcal {I}}}=[\lambda _{l}, \lambda _{r}] \subset {{\mathbb {R}}}$ be a fixed interval and recall that $\lambda _\textrm{min}^{j}$, resp. $\lambda _\textrm{max}^{j}$, is the smallest, resp. largest, eigenvalue of the operator ${\varvec{H}}^{j}$. By introducing the ‘bins’ $\{ t_{q} \}_{q=0}^{k}$ chosen such that

$$\begin{aligned} t_0=\lambda _{l} \le \lambda _\mathrm{{min}}^{j} \le t_1 \le \dots \le t_{k-1}< \lambda _{r} \le t_k < \lambda _\mathrm{{max}}^{j}, \end{aligned}$$

(3.34)

we can partition the interval ${{\mathcal {I}}}$ into k subintervals with endpoints $\lbrace t_q \rbrace _{q=0}^k$. With such a partition, we define the piece-wise constant functions

$$\begin{aligned} s_{t_q}(\lambda ) = {\left\{ \begin{array}{ll} 1, &{} \text{ if } \, t_{q-1} \le \lambda \le t_q, \\ 0, &{} \text{ otherwise, } \end{array}\right. } \end{aligned}$$

(3.35)

and we choose $\mathscr {B}_{k}$ to be the span of $\{ s_{t_{q}} \}_{q=1}^{k}$; note that $\overline{{\text {span}}\,_{q=1, \ldots , k} s_{t_{q}} } = L^{2}({{\mathcal {I}}})$. For a density operator ${\varvec{\gamma }}_{k}^{j} \in {{\mathcal {C}}}_{N,k}^{{\varvec{H}}^{j}}$ we thus obtain

$$\begin{aligned} {\varvec{\gamma }}_{k}^{j} = \int _{{\text {spec}}\,{{\varvec{H}}^{j}}} \sum _{q=1}^{k} c_{q} s_{t_{q}} (\lambda ) \, dP^{j} (\lambda ) \end{aligned}$$

via the functional calculus with respect to the basis $\{ s_{t_{q}} \}_{q=1}^{k}$. The band energy corresponding to this ${\varvec{\gamma }}_{k}^{j}$ is given by

$$\begin{aligned} E^j( {\varvec{\gamma }}_{k}^{j} )= & {} E ^{j} \left( \sum _{q=1}^{k} c_q s_{t_{q}} \right) = {\text {Tr}}\,({\varvec{H}}^j {{\varvec{\gamma }}}_{k}^{j} ) \nonumber \\= & {} \sum _{q=1}^{k} c_q \left\{ \sum _{a=1}^{\infty } \int _{{\text {spec}}\,({\varvec{H}}^j)} \lambda s_{t_{q}} (\lambda ) \, d \mu _{{\varvec{\xi }}_a,{\varvec{\xi }}_a}^{j} (\lambda ) \right\} =: \sum _{q=1}^{k} c_{q} w_{q}^{k,j}. \end{aligned}$$

(3.36)

Similarly the number of electrons will be

$$\begin{aligned} N^j( {\varvec{\gamma }}_{k}^{j} )= & {} N^j \left( \sum _{q=1}^k c_q s_{t_{q}} ({\varvec{H}}^{j} ) \right) \nonumber \\= & {} \sum _{q=1}^{k} c_{q} \left\{ \sum _{a=1}^{\infty } \int _{{\text {spec}}\,({\varvec{H}}^j)} s_{t_{q}} (\lambda ) \, d \mu _{{\varvec{\xi }}_a,{\varvec{\xi }}_a}^j(\lambda ) \right\} =: \sum _{q=1}^{k} c_{q} n_{q}^{k,j}. \end{aligned}$$

(3.37)

Note that $n_{q}^{k,j}$ can be understood as the number of eigenvalues in the qth ‘bin’ $(t_{q-1}, t_{q})$ which explains the terminology “Spectral binning" introduced in [35]. Similarly, $w_{q}^{k,j}$ is interpreted as the band energy in each bin. In this way, we minimize

$$\begin{aligned} \min _{\varvec{c} \in {{\mathbb {R}}}^{k}} \sum _{q=1}^k c_q w_{q}^{k,j}, \text{ subject } \text{ to } \left\{ \begin{array}{l} 0 \le c_{q} \le 1, \\ \sum _{q=1}^k c_q n_{q}^{k,j}=N. \end{array} \right. \end{aligned}$$

If we are solving for this numerically, we need to find the values of $w_{q}^{k,j}$ and $n_{q}^{k,j}$. Sylvester’s Theorem [28] enables us to compute $n_{q}^{k,j}$ but we are unable to do the same to find $w_{q}^{k,j}$. However, we will use the $n_{q}^{k,j}$ to make an approximation. We note that for an interval $(t_{q-1},t_q)$, its centre of mass $m_q^{k,j}$ will be given by

$$\begin{aligned} m_q^{k,j}&:= \frac{w^{k,j}_{q}}{n^{k,j}_{q}} = \frac{1}{n_{q}^{k,j}} \sum _{i=1}^\infty \left( \int _{{\text {spec}}\,{({\varvec{H}}^{j})}} \lambda s_{t_q}(\lambda ) d \mu _{{\varvec{\xi }}_i, {\varvec{\xi }}_i}^j (\lambda ) \right) . \end{aligned}$$

However, a reasonable approximation for the centre of mass is the centre of interval, i.e.,

$$\begin{aligned} m_q^{k,j} \approx m_q := \frac{t_q + t_{q-1}}{2} . \end{aligned}$$

(3.38)

In particular, $w^{k,j}_q = m_q^{k,j} n^{k,j}_q \approx m_q n^{k,j}_q$. Hence, we will define the approximate trace as

$$\begin{aligned} {\widetilde{{\text {Tr}}\,}}_k( {\varvec{H}}^{j} {\varvec{\gamma }}^{j}) := \sum _{q=1}^k c_q m_q n^{k,j}_q, \end{aligned}$$

(3.39)

where in comparison to ${\text {Tr}}\,( {\varvec{H}}^j {\varvec{\gamma }}^{j} )$ we have replaced $w^{k,j}_q$ with $m_q n^{k,j}_q$. The discrete spectral band energy is therefore given by

$$\begin{aligned} E_{ \textrm{band}}^{k_j,j} := {\widetilde{{\text {Tr}}\,}}_{k_j}({\varvec{H}}^j {{\varvec{\gamma }}}). \end{aligned}$$

(3.40)

4 Main result and strategy of its proof

Our main theorem states that in the limit of the number of spatial discretizations $j \rightarrow \infty $, and consequently in the limit of the number of spectral discretizations $k_j \rightarrow 0$, the family of ground state energies of the spatially and spectrally discrete energy functional converges to the full ground state energy given by (assuming it is attained)

$$\begin{aligned} I_\lambda := \inf \left\{ {{\mathcal {E}}}^\textrm{hy} ({\varvec{\gamma }}) \,: \, {\varvec{\gamma }} \in {\varvec{{{\mathcal {C}}}}}_\lambda \right\} , \end{aligned}$$

(4.1)

where ${{\mathcal {E}}}^\textrm{hy}(\cdot )$ is given in (1.1). Before we state the theorem we describe the strategy for proving it and, in the process, we introduce the mathematical objects used in its formulation.

4.1 Strategy

We prove convergence with respect to spectral and spatial discretization using the method of $\Gamma $-convergence. In order to apply the latter method, we rewrite the extended energy functional in (3.15) by introducing the indicator function $I_{M}$ associated with a set M as

$$\begin{aligned} I_M (x):= {\left\{ \begin{array}{ll} 0 &{} \text{ if } x \in M, \\ +\infty &{} \text{ otherwise. } \end{array}\right. } \end{aligned}$$

(4.2)

Then, resuming our discussion about (3.17) above,

$$\begin{aligned} {{\textsf {E}}}({\varvec{B}})= & {} \inf _{u \in L^4(\Omega )} \inf _{v \in L^4(\Omega )} \sup _{\phi \in W_0^{1,2}(\Omega )} \inf _{{{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}_N } L(u,v,\phi ,{\varvec{B}}, {{\varvec{\gamma }}}) \nonumber \\= & {} \inf _{u \in L^4(\Omega )} \inf _{v \in L^4(\Omega )} \Biggl \lbrace C_\mathrm{{self}} +{\mathfrak {G}}_\mathrm{{xc}}^{*} (u)+{\mathfrak {G}}_\mathrm{{xc}}^{*} (v)\nonumber \\{} & {} + \sup _{\phi \in W_0^{1,2}(\Omega )} \biggl [ -\frac{1}{8\pi } \int _{\Omega } |\nabla \phi ({{\textbf{r}}})|^2 \, d{{\textbf{r}}}+ \int _{\Omega } {\mathfrak {f}}({{\textbf{r}}},{\underline{{{\textbf{R}}}}}) \phi ({{\textbf{r}}}) \, d{{\textbf{r}}}\nonumber \\{} & {} + \inf _{{{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}} \lbrace E_\mathrm{{band}}(u,v,\phi , {\varvec{B}},{{\varvec{\gamma }}}) +I_{{\varvec{{{\mathcal {C}}}}}_N^{{\varvec{H}}}} ({{\varvec{\gamma }}}) \rbrace \biggr ] \Biggr \rbrace , \end{aligned}$$

(4.3)

where (4.3) is obtained by rearranging the terms according to their dependence on u, v, $\phi $ and ${\varvec{\gamma }}$. Note that the minimization over ${\varvec{{{\mathcal {C}}}}}_N$ is replaced by the minimization over ${\varvec{{{\mathcal {C}}}}}_N^{{\varvec{H}}(u,v,\phi ,{\varvec{B}})}$. This ensures the existence of a spectral function and is justified by (3.19) and (5.38) from the proof of Theorem 5.6. The overall idea is to show convergence with respect to both spectral and spatial discretization using three nested $\Gamma $-convergence proofs.

4.1.1 Limit functionals

In order to clarify the steps of our proof, we introduce the following functionals

$$\begin{aligned}&S_1(\cdot ) \,: \, L^4(\Omega ) \rightarrow {{\mathbb {R}}},&\qquad S_2(\cdot ,v) \,: \, L^4(\Omega ) \rightarrow {{\mathbb {R}}}, \\&S_3(u,v,\cdot ) \,: \, W_0^{1,2}(\Omega ) \rightarrow {{\mathbb {R}}},&\qquad S_4(u,v,\phi ,{\varvec{B}},\cdot ) \,: \, {{\mathcal {S}}}(L^4(\Omega ;\mathbb {C}^2)) \rightarrow {{\mathbb {R}}}\end{aligned}$$

defined by

$$\begin{aligned} S_1(v)&:= C_\mathrm{{self}} +{\mathfrak {G}}_\mathrm{{xc}}^*(v) + \inf _{u \in L^4(\Omega )} S_2(u,v) \end{aligned}$$

(4.4)

$$\begin{aligned} S_2(u,v)&:= {\mathfrak {G}}_\mathrm{{xc}}^*(u) + \sup _{\phi \in W_0^{1,2}} S_3(u,v,\phi ) \end{aligned}$$

(4.5)

$$\begin{aligned} S_3(u,v,\phi )&:= -\frac{1}{8\pi } \int _{\Omega } |\nabla \phi ({{\textbf{r}}})|^2 \, d{{\textbf{r}}}+ \int _{\Omega } {\mathfrak {f}}({{\textbf{r}}},{\underline{{{\textbf{R}}}}}) \phi ({{\textbf{r}}}) d{{\textbf{r}}}\nonumber \\ {}&\quad + \inf _{{{\varvec{\gamma }}} \in {{\mathcal {S}}}(L^4(\Omega ; \mathbb {C}^2))} S_4(u,v,\phi ,{\varvec{B}},{{\varvec{\gamma }}}) \end{aligned}$$

(4.6)

$$\begin{aligned} S_4(u,v,\phi ,{\varvec{B}},{{\varvec{\gamma }}})&:= E_\mathrm{{band}}(u,v,\phi ,{\varvec{B}},{{\varvec{\gamma }}}) +I_{{\varvec{{{\mathcal {C}}}}}_N^{{\varvec{H}} }}({{\varvec{\gamma }}}) . \end{aligned}$$

(4.7)

Resuming from (4.3) we thus consider the minimization problem

$$\begin{aligned} {{\textsf {E}}}({\varvec{B}})= \inf _{v \in L^4 (\Omega )} S_1(v). \end{aligned}$$

(4.8)

4.1.2 Functionals involving spectral and spatial approximations

Let j refer to the dimension of the finite element spaces $W_0^{1,2} (\Omega )_j$ and $L^4(\Omega )_j$, as in Sect. 3.3, and let $k_j$ refer to the number of subdivisions in the binning approximation, as in Sect. 3.4; recall that $k_j \rightarrow \infty $ as $j \rightarrow \infty $. Next we introduce the approximated constrained set of density operators

$$\begin{aligned} {{\mathcal {C}}}_{N,k}^{{\varvec{H}}^j}:= \left\{ {{\varvec{\gamma }}} \in {{\mathcal {C}}}_{N}^{{\varvec{H}}^j} \,: \, {{\varvec{\gamma }}} = \sum _{q=1}^k c_q s_q({\varvec{H}}^j) \right\} , \end{aligned}$$

(4.9)

with piecewise constant functions $\left\{ s_{t_q} (\lambda )\ \right\} _{q=1}^{k_j}$ defined in (3.35) and the discrete band energies $E_\mathrm{{band}}^{j,k_{j}} (u,v,\phi , {\varvec{B}}, \cdot ) \,: \, {\varvec{{{\mathcal {C}}}}} \rightarrow {{\mathbb {R}}}$

$$\begin{aligned}&E_\mathrm{{band}}^{j,k_j} (u,v,\phi ,{\varvec{B}},\gamma ) := {\widetilde{{\text {Tr}}\,}}_{k_j}( {\varvec{H}}^j {{\varvec{\gamma }}}) = \sum _{q=1}^{k_{j}} c_q m_q n^{k_{j},j}_q, \end{aligned}$$

(4.10)

where ${\widetilde{{\text {Tr}}\,}}_{k_j} (\cdot )$ is the approximation of the trace operator described in (3.39). Analogously to (4.4)– (4.7) we introduce, for $j \in {{\mathbb {N}}}$, functionals

$$\begin{aligned}&S_1^{j,k_j}(\cdot ) \,: \, L^4(\Omega ) \rightarrow {{\mathbb {R}}}\cup \lbrace + \infty \rbrace , \qquad&\qquad S_2^{j,k_j}(\cdot ,v) \,: \, L^4(\Omega ) \rightarrow {{\mathbb {R}}}\cup \lbrace + \infty \rbrace ,\\&S_3^{j,k_j}(u,v,\cdot )\,: \, W_0^{1,2}(\Omega ) \rightarrow {{\mathbb {R}}}\cup \lbrace - \infty \rbrace , \qquad&\qquad S_4^{j,k_j}(u,v,\phi ,{\varvec{B}},\cdot ) \,: \, {{\mathcal {C}}}_{N,k}^{{\varvec{H}}^j} \rightarrow {{\mathbb {R}}}\end{aligned}$$

defined by

$$\begin{aligned} S_1^{j,k_j}(v):= C_\mathrm{{self}} +{\mathfrak {G}}_\mathrm{{xc}}^*(v) + \inf _{v \in L^4(\Omega )} S_2^{j,k_j}(u,v) +I_{L^4(\Omega )_j}(v) \end{aligned}$$

(4.11)

and

$$\begin{aligned} S_2^{j,k_j}(u,v):= & {} {\mathfrak {G}}_\mathrm{{xc}}^*(v) + \sup _{\phi } S_3^{j,k_j}(u,v,\phi )+I_{L^4(\Omega )_j}(u) \end{aligned}$$

(4.12)

$$\begin{aligned} S_3^{j,k_j}(u,v,\phi ):= & {} -\frac{1}{8\pi } \int _{\Omega } |\nabla \phi ({{\textbf{r}}})|^2 d{{\textbf{r}}}+ \int _{\Omega } {\mathfrak {f}}({{\textbf{r}}},{\underline{{{\textbf{R}}}}}) \phi ({{\textbf{r}}}) d{{\textbf{r}}}\nonumber \\{} & {} +\inf _{{{\varvec{\gamma }}} \in {{\mathcal {S}}}(L^4(\Omega ; \mathbb {C}^2))} S_4^{j,k_j}(u,v,\phi , {\varvec{B}},{{\varvec{\gamma }}}) - I_{W_0^{1,2}(\Omega )_j}(\phi ) \end{aligned}$$

(4.13)

$$\begin{aligned} S_4^{j,k_j} (u,v,\phi ,{\varvec{B}},{\varvec{\gamma }}):= & {} E_\mathrm{{band}}^{j,k_j}(u,v,\phi ,{\varvec{B}},{{\varvec{\gamma }}}) +I_{{\varvec{{{\mathcal {C}}}}}_{N, k_j}^{{\varvec{H}}^j}}({{\varvec{\gamma }}}). \end{aligned}$$

(4.14)

This is in conjunction with the family of energies

$$\begin{aligned} {{\textsf {E}}}_{j,k_{j}} ({\varvec{B}} ) =\inf _{v\in L^4 (\Omega )} S_1^{j,k_j}(v). \end{aligned}$$

(4.15)

It is worth to point out here that the convergence of the discrete magnetic field ${\varvec{B}}_j$ to the true magnetic field ${\varvec{B}}$ takes place in the indicator function $I_{{\varvec{{{\mathcal {C}}}}}_{N, k_j}^{{\varvec{H}}^j}}$, which is why there is no ${\varvec{B}}_j$ dependence in $S_1^{j,k_j}$, $S_2^{j,k_j}$ and $S_3^{j,k_j}$. We justify inserting $-I_{W_0^{1,2}(\Omega )_j}(\phi )$, in addition to $I_{L^4(\Omega )_j}(u)$ and $I_{L^4(\Omega )_j}(v)$, since we are finding the minimizer over these restricted spaces.

Since $n^{k,j}_q$ is the number of eigenvalues in the qth interval, we have that

$$\begin{aligned} n^{k,j}_q= & {} \int _{\sigma (H^j)} s_q(\lambda ) d \mu _{ {\varvec{\xi }}_1, {\varvec{\xi }}_2}^j(\lambda ) = \int _{t_{q-1}}^{t_q} s_q(\lambda ) d \mu _{ {\varvec{\xi }}_1, {\varvec{\xi }}_2}^j(\lambda ) \\= & {} \mu _{ {\varvec{\xi }}_1, {\varvec{\xi }}_2}^j(t_{q}) -\mu _{ {\varvec{\xi }}_1,{\varvec{\xi }}_2}^j(t_{q-1}), \end{aligned}$$

and, therefore,

$$\begin{aligned} {\widetilde{{\text {Tr}}\,}}_{k_j}( {\varvec{H}}^j {{\varvec{\gamma }}})= \sum _{q=1}^{k_j} c_q m_q \left( \mu _{ {\varvec{\xi }}_1, {\varvec{\xi }}_2}^j (t_{q}) -\mu _{ {\varvec{\xi }}_1, {\varvec{\xi }}_2}^j (t_{q-1}) \right) . \end{aligned}$$

Collecting (3.33) and (3.40) for ${{\varvec{\gamma }}}_{j,k_{j}} \in {\varvec{{{\mathcal {C}}}}}_{N, k_j}^{{\varvec{H}}^j (u_j,v_j, \phi _j,{\varvec{B}}_j)}$, that is, the spectral approximation of ${{\varvec{\gamma }}}_j \in {\varvec{{{\mathcal {C}}}}}_N^{{\varvec{H}}^j (u,v,\phi ,{\varvec{B}})}$, the approximate trace operator depending on $k_j \in {{\mathbb {N}}}$ is given by

$$\begin{aligned} {\widetilde{{\text {Tr}}\,}}_{k_j} ( {\varvec{H}}^j (u_j,v_{j}, \phi _j, {\varvec{B}}_j) {{\varvec{\gamma }}}_{j,k_{j}})&= \sum _{i=1}^{\infty } \sum _{q=1}^{k_j} c_q m_q \int _{t_{q - 1}}^{t_q} s_q (\lambda ) \ d \mu _{ {\varvec{\xi }}_i, {\varvec{\xi }}_i} (\lambda ) \nonumber \\&= \sum _{i=1}^{\infty } \sum _{q=1}^{k_j} c_q m_q (\mu _{ {\varvec{\xi }}_i, {\varvec{\xi }}_i} (t_q) - \mu _{ {\varvec{\xi }}_i, {\varvec{\xi }}_i} (t_{q - 1})), \end{aligned}$$

(4.16)

with the centers of mass $m_q =\dfrac{t_q + t_{q-1}}{2}$ for $\ 1 \le q \le k_j$, as defined in (3.38).

We can finally formulate the main theorem.

Theorem 4.1

Let Assumption 2.1 hold, and let the interval $[\lambda _{l}, \lambda _{r}]$ be partitioned by $\{ t_{q} \}_{q=0}^{k}$; see (3.34). Then, in the limit of the number of spatial discretization $j \rightarrow \infty $, and consequently in the limit of the number of spectral discretizations $k_j \rightarrow 0$, the family of ground state energies of the spatially and spectrally discrete energy functional converges to the full ground state energy in (4.1), that is,

$$\begin{aligned} \lim \limits _{j \rightarrow \infty }\inf _{v \in L^4(\Omega )} S_{1}^{j, k_j}(v) = \inf _{v \in L^4(\Omega )} S_{1}(v) = {{\textsf {E}}}({\varvec{B}}). \end{aligned}$$

In particular, the theorem implies that

$$\begin{aligned} \inf _{ v \in L^{4}(\Omega )} S_{1}(v) = \inf _{ v \in L^{4}(\Omega )} \inf _{ u \in L^{4}(\Omega )} \sup _{\phi \in W_{0}^{1,2}(\Omega )} \inf _{{\varvec{\gamma }}\in {\varvec{{{\mathcal {C}}}}}_{N}} L(u, v, \phi , {\varvec{B}}, {\varvec{\gamma }}) = {{\textsf {E}}}({\varvec{B}}).\nonumber \\ \end{aligned}$$

(4.17)

5 Proofs

5.1 Recasting of electrostatic energies

To prove the equality

$$\begin{aligned}{} & {} {{\mathcal {J}}}(\rho _{{\varvec{\gamma }}}) + E_\mathrm{{ext}}(\rho _{{\varvec{\gamma }}}) + E_\mathrm{{ZZ}} \\ {}{} & {} \quad = \sup _{\phi \in W_0^{1,2}(\Omega )} \left\{ -\frac{1}{8\pi } \int _{\Omega } |\nabla \phi ({{\textbf{r}}})|^2 \, d{{\textbf{r}}}+ \int _{\Omega } \left( {\mathfrak {f}}({{\textbf{r}}},{\underline{{{\textbf{R}}}}} ) + \rho _{{{\varvec{\gamma }}}}({{\textbf{r}}}) \right) \phi ({{\textbf{r}}}) \, d{{\textbf{r}}}\right\} \\ {}{} & {} \quad + C_\mathrm{{self}}, \end{aligned}$$

we need the following lemma in which we transform $E_\mathrm{{ZZ}}$ in (3.3) to an integral form. Here the ${{\textbf{R}}}_i$, $Z_i$ are spatial coordinates and nuclear charge, respectively, of the ith nuclei. Bear in mind also the definition of ${\mathfrak {f}}({{\textbf{r}}},{\underline{{{\textbf{R}}}}})$ in (3.6).

Lemma 5.1

Let $f_{{\varvec{R}}_i}({{\textbf{r}}})$ be a regularised and bounded function with support on a small ball around ${{\textbf{R}}}_i$, so that $\int _{\Omega } f_{{\varvec{R}}_i}({{\textbf{r}}}) \, d{{\textbf{r}}}=1$. Let $\epsilon >0$ be defined as in (3.5), and in particular let us choose the $f_{{\varvec{R}}_i}$ so that for all $1 \le i \le K$, ${\text {diam}}\,(\textrm{supp}\,f_{{\varvec{R}}_i} ) < \epsilon /3$. Then

$$\begin{aligned} E_\mathrm{{ZZ}}= \frac{1}{2} \int _{\Omega } \int _{\Omega } \frac{{\mathfrak {f}}({{\textbf{r}}},{\underline{{{\textbf{R}}}}}) {\mathfrak {f}}({{\textbf{r}}}^{\prime }, {\underline{{{\textbf{R}}}}})}{| {{\textbf{r}}}-{{\textbf{r}}}^\prime |} \, d{{\textbf{r}}}\, d{{\textbf{r}}}^\prime +C_\mathrm{{self}}, \end{aligned}$$

(5.1)

where $C_\mathrm{{self}}$ depends only on dimension, minimum distance between the nuclear coordinates ${{\textbf{R}}}_i$, maximum nuclear charge, and choice of the functions $f_{{\varvec{R}}_i}$.

We use the previous lemma in the proof of the following proposition.

Proposition 5.2

Let ${{\mathcal {J}}}$, $E_\mathrm{{ext}}$, $E_\mathrm{{ZZ}}$ be defined as above. Then

$$\begin{aligned}&{{\mathcal {J}}}(\rho _{{\varvec{\gamma }}}) + E_\mathrm{{ext}}(\rho _{{\varvec{\gamma }}}) + E_\mathrm{{ZZ}} \nonumber \\&\quad = \sup _{\phi \in W_0^{1,2}(\Omega )} \left\{ -C_d \int _{\Omega } |\nabla \phi ({{\textbf{r}}})|^2 \, d{{\textbf{r}}}+ \int _{\Omega } \left( {\mathfrak {f}}({{\textbf{r}}},{\underline{{{\textbf{R}}}}}) + \rho _{{\varvec{\gamma }}}({{\textbf{r}}}) \right) \phi ({{\textbf{r}}}) \, d{{\textbf{r}}}\right\} + C_\mathrm{{self}}, \end{aligned}$$

(5.2)

where $C_\mathrm{{self}}$ depends only on dimension, minimum distance between the nuclear coordinates ${{\textbf{R}}}_i$, maximum nuclear charge, and choice of the functions $f_{{\varvec{R}}_i}$.

We first give a proof of Lemma 5.1.

Proof of Lemma 5.1

Assume ${{\textbf{r}}}\in \textrm{supp}\,f_{{\varvec{R}}_i}$ and ${{\textbf{r}}}^\prime \in \textrm{supp}\,f_{{\varvec{R}}_j}$, for $i \ne j$. Then,

$$\begin{aligned}&\left| \frac{1}{| {{\textbf{r}}}-{{\textbf{r}}}^\prime |} - \frac{1}{| {{\textbf{R}}}_i-{{\textbf{R}}}_j|} \right| = \left| \frac{| {{\textbf{R}}}_i-{{\textbf{R}}}_j| - |( {{\textbf{r}}}-{{\textbf{R}}}_i)+( {{\textbf{R}}}_j-{{\textbf{r}}}^\prime ) +( {{\textbf{R}}}_i-{{\textbf{R}}}_j)|}{| {{\textbf{r}}}-{{\textbf{r}}}^\prime || {{\textbf{R}}}_i-{{\textbf{R}}}_j|} \right| . \end{aligned}$$

By our choice of the supports of $f_{{\varvec{R}}_i}$ for $1 \le i \le M$, we know that $| {{\textbf{R}}}_i-{{\textbf{R}}}_j|> \epsilon $ and $| {{\textbf{r}}}_i-{{\textbf{r}}}_j|> \epsilon /3$. In addition, both $| {{\textbf{r}}}-{{\textbf{R}}}_i|$ and $| {{\textbf{r}}}- {{\textbf{R}}}_j|$ are bounded above by $\epsilon /3$. Hence,

$$\begin{aligned} \frac{1}{| {{\textbf{r}}}-{{\textbf{r}}}^\prime | \, | {{\textbf{R}}}_i-{{\textbf{R}}}_j|} < \frac{1}{\epsilon ^2/3} \end{aligned}$$

and $| {{\textbf{R}}}_i-{{\textbf{R}}}_j| - |( {{\textbf{r}}}-{{\textbf{R}}}_i)+( {{\textbf{R}}}_j-{{\textbf{r}}}^\prime ) +({{\textbf{R}}}_i-{{\textbf{R}}}_j)| < 2\epsilon / 3$. Thus, we have

$$\begin{aligned} \left| \frac{1}{|{{\textbf{r}}}-{{\textbf{r}}}^\prime |} - \frac{1}{| {{\textbf{R}}}_i-{{\textbf{R}}}_j|} \right|< \frac{2}{\epsilon }<C_0({{\textbf{R}}})< + \infty , \end{aligned}$$

(5.3)

where $C_0$ is a constant that only depends on the minimum distance between the nuclei points of ${{\textbf{R}}}$. Thus for $i \ne j$,

$$\begin{aligned} \frac{1}{| {{\textbf{r}}}-{{\textbf{r}}}^\prime |} = \frac{1}{| {{\textbf{R}}}_i-{{\textbf{R}}}_j|} + g_{i,j}({{\textbf{r}}},{{\textbf{r}}}^\prime ), \end{aligned}$$

(5.4)

where $|g_{i,j}| < C_0.$ Hence,

$$\begin{aligned} \frac{1}{2}\int _{\Omega } \int _{\Omega } \frac{{\mathfrak {f}}({{\textbf{r}}}, {\underline{{{\textbf{R}}}}}) {\mathfrak {f}}({{\textbf{r}}}^\prime , {\underline{{{\textbf{R}}}}})}{| {{\textbf{r}}}-{{\textbf{r}}}^\prime |} \, d{{\textbf{r}}}d{{\textbf{r}}}^\prime&= \frac{1}{2} \sum _{i,j =1}^K Z_i Z_j \int _{\Omega } \int _{\Omega } \frac{ f_{{\varvec{R}}_i}({{\textbf{r}}}) f_{{\varvec{R}}_j}({{\textbf{r}}}^\prime )}{| {{\textbf{r}}}-{{\textbf{r}}}^\prime |} \, d{{\textbf{r}}}d{{\textbf{r}}}^\prime \nonumber \\&=\frac{1}{2} \sum _{\begin{array}{c} i, j =1 \\ i\ne j \end{array}}^K Z_i Z_j \int _{\Omega } \int _{\Omega } \frac{ f_{{\varvec{R}}_i}({{\textbf{r}}}) f_{{\varvec{R}}_j}({{\textbf{r}}}^\prime )}{| {{\textbf{R}}}_i-{{\textbf{R}}}_j|} \, d{{\textbf{r}}}d{{\textbf{r}}}^\prime \\&\quad {+} \frac{1}{2} \sum _{\begin{array}{c} i, j {=}1 \\ i\ne j \end{array}}^K Z_i Z_j \int _{\Omega } \int _{\Omega } f_{{\varvec{R}}_i}({{\textbf{r}}}) f_{{\varvec{R}}_j}({{\textbf{r}}}^\prime ) g_{i,j}({{\textbf{r}}},{{\textbf{r}}}^\prime ) \, d{{\textbf{r}}}d{{\textbf{r}}}^\prime \\&\quad + \frac{1}{2}\sum _{i=1}^K |Z_i|^2 \int _{\Omega } \int _{\Omega } \frac{ f_{{\varvec{R}}_i} ({{\textbf{r}}}) f_{{\varvec{R}}_i}({{\textbf{r}}}^\prime )}{|{{\textbf{r}}}-{{\textbf{r}}}^\prime |} \, d{{\textbf{r}}}d{{\textbf{r}}}^\prime . \end{aligned}$$

In the final line, the first term is $E_\mathrm{{ZZ}}$. The second term is a constant $C_1$, where

$$\begin{aligned} \frac{1}{2} \sum _{\begin{array}{c} i, j =1 \\ i\ne j \end{array}}^K Z_i Z_j \int _{\Omega } \int _{\Omega } f_{{\varvec{R}}_i} ({{\textbf{r}}}) f_{{\varvec{R}}_j} ({{\textbf{r}}}^\prime ) {g_{i,j}({{\textbf{r}}},{{\textbf{r}}}^\prime )} \, d{{\textbf{r}}}d{{\textbf{r}}}^\prime = C_1 \le K \left| \max _{1 \le i \le K} \lbrace |Z_i|\rbrace \right| ^2 C_0. \end{aligned}$$

The constant $C_1$ depends on the minimum distance between nuclei and the maximum charge of the system. For the third term, by linear transformations of ${{\textbf{R}}}_i$ and $f_{{\varvec{R}}_i}$ to the origin, it suffices to show that we can bound the integral

$$\begin{aligned} \int _{B_{\epsilon /3}(0)} \int _{B_{\epsilon /3}(0)} \frac{f({{\textbf{r}}}) f({{\textbf{r}}}^\prime )}{|{{\textbf{r}}}-{{\textbf{r}}}^\prime |} \, d{{\textbf{r}}}d{{\textbf{r}}}^\prime . \end{aligned}$$

(5.5)

For a fixed ${{\textbf{r}}}^\prime $, we have

$$\begin{aligned} \int _{B_{\epsilon /3}(0)} \frac{f({{\textbf{r}}})}{| {{\textbf{r}}}-{{\textbf{r}}}^\prime |} \, d{{\textbf{r}}}\le \int _{B_{2\epsilon /3}( {{\textbf{r}}}^\prime )} \frac{f( {{\textbf{r}}}-{{\textbf{r}}}^\prime )}{|{{\textbf{r}}}|} \, d{{\textbf{r}}}= \epsilon C_2(f) < + \infty , \end{aligned}$$

where the constant $C_2$ depends only on the choice of the function f and the dimension (in this case 3), and is finite due to smoothness of f. Hence it follows that

$$\begin{aligned} \frac{1}{2}\sum _{i=1}^M |Z_i|^2 \int _{\Omega } \int _{\Omega } \frac{ f_{{\varvec{R}}_i}({{\textbf{r}}}) f_{{\varvec{R}}_j} ({{\textbf{r}}}^\prime )}{|{{\textbf{r}}}-{{\textbf{r}}}^\prime |} \, d{{\textbf{r}}}d{{\textbf{r}}}^\prime =: C_3 \le (\epsilon )^2 M \left| \max _{1 \le i \le M} \lbrace |Z_i|\rbrace \right| ^2 C_2 \end{aligned}$$

with $C_3$, as $C_1$, depending on the function f and the maximum charge of the $Z_i$, but in addition depending on the dimension (3 in this case). Writing $C_\mathrm{{self}} = C_1 + C_3$, we finally arrive at (5.1). $\square $

We now can give a proof of Proposition 5.2.

Proof of Proposition 5.2

We consider the Poisson equation (in dimension $d=3$),

$$\begin{aligned} -\frac{1}{4\pi } \nabla ^{2} \Phi ({{\textbf{r}}},{\underline{{{\textbf{R}}}}}) = \rho _{{\varvec{\gamma }}}({{\textbf{r}}}) + {\mathfrak {f}}({{\textbf{r}}},{\underline{{{\textbf{R}}}}}). \end{aligned}$$

(5.6)

By considering the Green’s function, we find that the unique solution of (5.6) is given by

$$\begin{aligned} \Phi ({{\textbf{r}}},{\underline{{{\textbf{R}}}}}) = \int \frac{\rho _{{\varvec{\gamma }}}({{\textbf{r}}}^\prime ) +{\mathfrak {f}}({{\textbf{r}}}^\prime ,{\underline{{{\textbf{R}}}}})}{|{{\textbf{r}}}-{{\textbf{r}}}^\prime |} \, d {{\textbf{r}}}^\prime . \end{aligned}$$

(5.7)

We have, by comparing to the end of the proof of Lemma 5.1, that

$$\begin{aligned} \int _{\Omega } \frac{{\mathfrak {f}}({{\textbf{r}}}^\prime ,{\underline{{{\textbf{R}}}}})}{|{{\textbf{r}}}^\prime -{{\textbf{r}}}|} \, d{{\textbf{r}}}^\prime = \int \sum _{i=1}^M \frac{Z_i f_{{\varvec{R}}_i} ({{\textbf{r}}}^\prime )}{|{{\textbf{r}}}^\prime -{{\textbf{r}}}|} \, d{{\textbf{r}}}^\prime = \sum _{i=1}^M \frac{Z_i}{|{{\textbf{R}}}_i-{{\textbf{r}}}|} + C_1 = V ({{\textbf{r}}},{\underline{{{\textbf{R}}}}}) + C_1 . \end{aligned}$$

(5.8)

Putting this into the Green’s function we arrive at

$$\begin{aligned} \Phi ({{\textbf{r}}},{\underline{{{\textbf{R}}}}}) =\int \frac{\rho _{{\varvec{\gamma }}}({{\textbf{r}}}^\prime )}{|{{\textbf{r}}}-{{\textbf{r}}}^\prime |} \, d{{\textbf{r}}}^\prime +V + C_1. \end{aligned}$$

(5.9)

Hence,

$$\begin{aligned} \int \left( \rho _{{\varvec{\gamma }}}({{\textbf{r}}}) + {\mathfrak {f}}({{\textbf{r}}},{\underline{{{\textbf{R}}}}})\right) \Phi ({{\textbf{r}}},{\underline{{{\textbf{R}}}}}) \, d{{\textbf{r}}}= & {} \int \int \frac{\rho _{{\varvec{\gamma }}}({{\textbf{r}}}) \rho _{{\varvec{\gamma }}}({{\textbf{r}}}^\prime )}{|{{\textbf{r}}}-{{\textbf{r}}}^\prime |} \, d{{\textbf{r}}}^\prime d{{\textbf{r}}}\\ {}{} & {} + \int \rho _{{\varvec{\gamma }}}({{\textbf{r}}}) V ({{\textbf{r}}}) \, d{{\textbf{r}}}+ \int \int \frac{{\mathfrak {f}}({{\textbf{r}}},{\underline{{{\textbf{R}}}}}) \rho _{{\varvec{\gamma }}}({{\textbf{r}}}^\prime )}{| {{\textbf{r}}}-{{\textbf{r}}}^\prime |} \, d{{\textbf{r}}}^\prime d{{\textbf{r}}}\\ {}{} & {} + \int {\mathfrak {f}}({{\textbf{r}}},{{\textbf{R}}}) V ({{\textbf{r}}}) \, d{{\textbf{r}}}+ C_1 \int (\rho _{{\varvec{\gamma }}}+{\mathfrak {f}}({{\textbf{r}}},{\underline{{{\textbf{R}}}}})) \, d{{\textbf{r}}}. \end{aligned}$$

Applying (5.8), we have

$$\begin{aligned} \int (\rho _{{\varvec{\gamma }}} +{\mathfrak {f}}) \Phi \, d{{\textbf{r}}}&= \int \int \frac{\rho _{{\varvec{\gamma }}}( {{\textbf{r}}}) \rho _{{\varvec{\gamma }}}({{\textbf{r}}}^\prime )}{| {{\textbf{r}}}-{{\textbf{r}}}^\prime |} \, d{{\textbf{r}}}^\prime d{{\textbf{r}}}+ 2 \int \rho _{{\varvec{\gamma }}} V \, d{{\textbf{r}}}+ \int C_1 \rho _{{\varvec{\gamma }}} d{{\textbf{r}}}\nonumber \\&\quad +\int \int \frac{{\mathfrak {f}}({{\textbf{r}}},{\underline{{{\textbf{R}}}}}) {\mathfrak {f}}({{\textbf{r}}}^\prime , {\underline{{{\textbf{R}}}}})}{| {{\textbf{r}}}-{{\textbf{r}}}^\prime |} \, d{{\textbf{r}}}^\prime d{{\textbf{r}}}+ C_1 \int \rho _{{\varvec{\gamma }}} d{{\textbf{r}}}\nonumber \\&= 2 \left( {{\mathcal {J}}}+ E_\mathrm{{ext}} + E_\mathrm{{ZZ}} -C_\mathrm{{self}} \right) + C. \end{aligned}$$

(5.10)

The constant $ C =2 C_1 \int \rho _{{\varvec{\gamma }}} d{{\textbf{r}}}= 2C_1 N$ is independent of $R_{{\varvec{\gamma }}}$, and in the final line we used the result from Lemma 5.1. Going back to the Poisson equation, we rearrange (5.6) to arrive at

$$\begin{aligned} \frac{1}{8 \pi } \nabla ^2 \Phi + (\rho _{{\varvec{\gamma }}} + {\mathfrak {f}}) = \frac{1}{2} (\rho _{{\varvec{\gamma }}} +{\mathfrak {f}}). \end{aligned}$$

(5.11)

Taking the integrals of both sides and using (5.10), we get

$$\begin{aligned} -\frac{1}{8\pi } \int |\nabla \Phi |^2 \, d{{\textbf{r}}}+ \int (\rho _{{\varvec{\gamma }}} + {\mathfrak {f}})\Phi \, d{{\textbf{r}}}= {{\mathcal {J}}}+ E_\mathrm{{ext}}+ E_\mathrm{{ZZ}} +C. \end{aligned}$$

(5.12)

We will now show that the unique solution $\Phi $ of the Poisson equation will also give the maximal value of the following maximization expression, that is to say,

$$\begin{aligned} \sup _{\phi \in W_0^{1,2} (\Omega )} \left\{ -\frac{1}{8\pi } \int |\nabla \phi |^2 \, d{{\textbf{r}}}+ \int (\rho _{{\varvec{\gamma }}} + {\mathfrak {f}})\phi \, d{{\textbf{r}}}\right\}= & {} -\frac{1}{8\pi } \int |\nabla \Phi |^2 \, d{{\textbf{r}}}\nonumber \\ {}{} & {} + \int (\rho _{{\varvec{\gamma }}} + {\mathfrak {f}})\Phi \, d{{\textbf{r}}}.\nonumber \\ \end{aligned}$$

(5.13)

We define

$$\begin{aligned} I(\phi ):=-\frac{1}{8\pi } \int |\nabla \phi |^2 \, d{{\textbf{r}}}+ \int (\rho _{{\varvec{\gamma }}} + {\mathfrak {f}})\phi \, d{{\textbf{r}}}. \end{aligned}$$

(5.14)

Since we are taking a supremum over the $\phi $, it suffices to show that for all $\phi \in W_0^{1,2}$ we have $I(\Phi ) \ge I(\phi )$. Taking the integral over the zero function, we have

$$\begin{aligned} 0&= \left( - \frac{1}{4 \pi } \nabla ^2 \Phi - (\rho _{{\varvec{\gamma }}}+{\mathfrak {f}}) \right) \cdot (\phi - \Phi )\\&=\frac{1}{4\pi } \int \nabla \Phi \cdot \nabla \phi \, d{{\textbf{r}}}-\frac{1}{4\pi }\int |\nabla \Phi |^2 \, d{{\textbf{r}}}+ \int (\rho _{{\varvec{\gamma }}} +{\mathfrak {f}}) \Phi \, d{{\textbf{r}}}-\int (\rho +{\mathfrak {f}})\phi \, d{{\textbf{r}}}. \end{aligned}$$

We use the basic inequality $ab \le \frac{1}{2} (a^2+b^2)$ to get

$$\begin{aligned} 0&\le \frac{1}{8\pi } \int |\nabla \Phi |^2 \, d{{\textbf{r}}}+ \frac{1}{8\pi } \int |\nabla \phi |^2 \, d{{\textbf{r}}}-\frac{1}{4\pi } \int |\nabla \Phi |^2 \, d{{\textbf{r}}}\nonumber \\&\quad +\int (\rho _{{\varvec{\gamma }}}+ {\mathfrak {f}}) \Phi \, d{{\textbf{r}}}-\int (\rho _{{\varvec{\gamma }}} +{\mathfrak {f}})\phi \nonumber \\&=-\frac{1}{8\pi } \int |\nabla \Phi |^2 \, d{{\textbf{r}}}+\int (\rho _{{\varvec{\gamma }}}+{\mathfrak {f}})\Phi \, d{{\textbf{r}}}+ \frac{1}{8\pi } \int |\nabla \phi |^2\nonumber \\&\quad -\int (\rho _{{\varvec{\gamma }}}+{\mathfrak {f}}) \phi \, d{{\textbf{r}}}= I(\Phi ) -I(\phi ). \end{aligned}$$

(5.15)

This proves $I(\Phi ) \ge I(\phi )$, and hence the desired result follows. $\square $

5.2 Recasting of XC energy

As a consequence of (3.11), ${\mathfrak {G}}_\mathrm{{xc}}^{\star }$ is introduced as the pointwise supremum of a family of continuous affine functions. By (P2), ${\mathfrak {G}}_\mathrm{{xc}}$ is convex. By (P2) and (P6), ${\mathfrak {G}}_\mathrm{{ xc}}$ is continuous in $L^{4/3}(\Omega )$. Important properties of ${\mathfrak {G}}_\mathrm{{xc}}^{\star }$ are summarized in Lemma 5.3 below, wherein $\langle {u},{v} \rangle $ denotes the dual product of $u \in L^4 (\Omega ), \ v \in L^{4/3} (\Omega )$. It is defined by

$$\begin{aligned} \langle {u},{v} \rangle := \int _{\Omega } u({{\textbf{r}}}) v({{\textbf{r}}}) \, d{{\textbf{r}}}. \end{aligned}$$

(5.16)

Lemma 5.3

The functional ${\mathfrak {G}}_\mathrm{{xc}}^{\star }$ defined in (3.11) is convex and lower semi-continuous on $L^4(\Omega )$ and satisfies

$$\begin{aligned} {\mathfrak {G}}_\mathrm{{xc}}^{\star } (u) = \int _{\Omega } h^{\star } \left( u ({{\textbf{r}}}) \right) \, d{{\textbf{r}}}. \end{aligned}$$

(5.17)

The function $h^{\star } (x) \,: \, {{\mathbb {R}}}\rightarrow {{\mathbb {R}}}$ is the Legendre transform of $-h(t)$ and fulfills the growth conditions

$$\begin{aligned} C_5 |x|^4 + C_7 \le h^{\star } (x) \le C_6 |x|^4 + C_8 \end{aligned}$$

(5.18)

for real constants $C_5>0, \ C_6 >0, \ C_7$ and $C_8$. For ${\mathfrak {G}}_\mathrm{{xc}} (\rho _{{\varvec{\gamma }}})=-{{\mathcal {E}}}_\mathrm{{xc}} (\rho _{{\varvec{\gamma }}})$ it holds that

$$\begin{aligned} {\mathfrak {G}}_\mathrm{{xc}} (\rho _{{\varvec{\gamma }}}) = \sup _{u \in L^4 (\Omega )} \{ \langle {\rho _{{\varvec{\gamma }}}},{u} \rangle - {\mathfrak {G}}_\mathrm{{xc}}^{\star } (u)\}. \end{aligned}$$

(5.19)

Proof

From [7], we know that a dual function formulated as in (3.11) is both convex and lower semi-continuous, (5.17) also follows from [7, Proposition 1.2 Chapter IV]. The growth conditions (5.18) follow from applying (P6) to the Legendre transform of h(t). Since ${\mathfrak {G}}_\mathrm{{xc}}$ is also convex and lower semi-continuous, we have from [7, Proposition 4.1 Chapter I] that ${\mathfrak {G}}_\mathrm{{xc}}^{\star \star } = {\mathfrak {G}}_\mathrm{{xc}}$. Hence equality (5.19) follows. $\square $

As in [35], we will use the dual function method, twice in fact, for two different terms, namely $E_\mathrm{{xc}}^\textrm{LDA}(2\rho ^{\uparrow \uparrow })$ and $E_\mathrm{{xc}}^\textrm{LDA} (2\rho ^{\downarrow \downarrow })$. This means that we can write

$$\begin{aligned} E_\mathrm{{xc}}^\mathrm{{cLSDA}}= \inf _{u \in L^4(\Omega )} \left( -\left\langle u,\rho _{{\varvec{\gamma }}}^{\uparrow \uparrow } \right\rangle + \frac{1}{2} {\mathfrak {G}}_\mathrm{{xc}}^*(u) \right) + \inf _{v \in L^4(\Omega )} \left( -\left\langle v,\rho _{{\varvec{\gamma }}}^{\downarrow \downarrow } \right\rangle + \frac{1}{2} {\mathfrak {G}}_\mathrm{{xc}}^*(v) \right) . \end{aligned}$$

Therefore, we define the operators U and V so that

$$\begin{aligned} U:= u({{\textbf{r}}}) \delta ({{\textbf{r}}},{{\textbf{r}}}') \begin{pmatrix} 1 &{} 0 \\ 0 &{} 0 \end{pmatrix}, \qquad \qquad V:= u({{\textbf{r}}}) \delta ({{\textbf{r}}},{{\textbf{r}}}') \begin{pmatrix} 0 &{} 0 \\ 0 &{} 1 \end{pmatrix}. \end{aligned}$$

(5.20)

Then we have that

$$\begin{aligned} -{\text {Tr}}\,(U {\varvec{\gamma }})= & {} - \int _{\Omega } u({{\textbf{r}}}) \rho _{{\varvec{\gamma }}}^{\uparrow \uparrow }({{\textbf{r}}}) d {{\textbf{r}}}= -\left\langle u,\rho _{{\varvec{\gamma }}}^{\uparrow \uparrow } \right\rangle , \\ -{\text {Tr}}\,(V {\varvec{\gamma }})= & {} - \int _{\Omega } v({{\textbf{r}}}) \rho _{{\varvec{\gamma }}}^{\downarrow \downarrow }({{\textbf{r}}}) d {{\textbf{r}}}= -\left\langle v,\rho _{{\varvec{\gamma }}}^{\downarrow \downarrow } \right\rangle \end{aligned}$$

The operators U and V are linear with respect to ${\varvec{\gamma }}$.

5.3 Interchange of infimum and supremum

First we identify sufficient properties of $L(u,v, \cdot ,{\varvec{B}},\cdot )$ that allow us to exchange the order of the infimum over ${\varvec{\gamma }}\in {\varvec{{{\mathcal {C}}}}}_N$ and the supremum over $\phi \in W_0^{1,2} (\Omega )$.

Lemma 5.4

For all $u,v \in L^4(\Omega )$, $\phi \in W_0^{1,2}(\Omega )$, the functional $L(u,v,\phi ,{\varvec{B}},\cdot )$ is convex and lower semi-continuous with respect to ${\varvec{\gamma }}$. In addition, for any fixed $u,v \in L^4(\Omega )$, ${\varvec{B}} \in L^2(\Omega )$, and $\phi \in W_0^{1,2}(\Omega )$,

$$\begin{aligned} \lim _{\left\Vert {\varvec{\gamma }}\right\Vert _{{\varvec{{{\mathcal {C}}}}}} \rightarrow \infty } L(u,v,\phi ,{\varvec{B}},{\varvec{\gamma }}) = + \infty . \end{aligned}$$

(5.21)

Proof

All of the terms involving ${\varvec{\gamma }}$ in $L(u,v,\phi ,{\varvec{B}},{\varvec{\gamma }})$ are linear, and so the convexity is immediate from that. Note in particular that because the exchange correlation energy was a function of $\rho _{{\varvec{\gamma }}}^{\uparrow \uparrow } $ and $ \rho _{{\varvec{\gamma }}}^{\downarrow \downarrow }$, the resulting inner products which come up are linear with respect to ${\varvec{\gamma }}$. For semi-continuity, we note that $\frac{1}{2}{\text {Tr}}\,\left( -\Delta {\varvec{\gamma }}\right) $ is lower semi-continuous in ${\varvec{{{\mathcal {C}}}}}_N$. Indeed, $\frac{1}{2}{\text {Tr}}\,\left( -\Delta {\varvec{\gamma }}\right) $ is lower semi-continuous if, for all ${\varvec{\gamma }}_0 \in {\varvec{{{\mathcal {C}}}}}_N$, and for all real $\epsilon >0$ we can find a neighbourhood ${\varvec{{{\mathcal {N}}}}}_{0}$ containing ${\varvec{\gamma }}_0$ so that for all ${\varvec{\gamma }}\in {\varvec{{{\mathcal {N}}}}}_x$, we have that $\frac{1}{2}{\text {Tr}}\,\left( -\Delta {\varvec{\gamma }}\right) \le \frac{1}{2}{\text {Tr}}\,\left( -\Delta {\varvec{\gamma }}_0 \right) +\epsilon $.

For a given ${\varvec{\gamma }}_0$ and $\epsilon >0$, consider the ${\varvec{{{\mathcal {C}}}}}$-ball centered on ${\varvec{\gamma }}_0$ with radius $\epsilon /2$. Therefore, for all ${\varvec{\gamma }}$ in this ball, $\left\Vert {\varvec{\gamma }}_0-{\varvec{\gamma }}\right\Vert _{{\varvec{{{\mathcal {C}}}}}}< \epsilon /2$. With $(\alpha _{0,k})$ denoting the basis expansion of ${\varvec{\gamma }}_0$ and $(\alpha _k)$ denoting the basis expansion of ${\varvec{\gamma }}$, we thus have

$$\begin{aligned} \epsilon /2 > \left\Vert {\varvec{\gamma }}_0-{\varvec{\gamma }}\right\Vert _{{\varvec{{{\mathcal {C}}}}}} \ge {\text {Tr}}\,(|\nabla |{\varvec{\gamma }}_0-{\varvec{\gamma }}|\nabla |) = \sum _{k=1}^\infty |\alpha _{0,k}-\alpha _k| \langle {\nabla {\varvec{\xi }}_k}| {\nabla {\varvec{\xi }}_k} \rangle \,. \end{aligned}$$

Therefore, it follows that

$$\begin{aligned} \frac{1}{2}{\text {Tr}}\,(-\Delta {\varvec{\gamma }}) \le \frac{1}{2}\sum _{k=1}^\infty \left( |\alpha _{0,k}|+|\alpha _{0,k}-\alpha _k|\right) \langle {\nabla {\varvec{\xi }}_k}| {\nabla {\varvec{\xi }}_k} \rangle \, \le \frac{1}{2}{\text {Tr}}\,(-\Delta {\varvec{\gamma }}_0) + \epsilon , \end{aligned}$$

whence lower semi-continuity holds. Moving onto the density terms, we have

$$\begin{aligned} \left\Vert \nabla \sqrt{\rho _{{\varvec{\gamma }}}^{\mu \mu }}\right\Vert _{L^2}^2 \le {\text {Tr}}\,(- \Delta {{\varvec{\gamma }}}^{\mu \mu }), \end{aligned}$$

(5.22)

where we write $ {\text {Tr}}\,(- \Delta {{\varvec{\gamma }}}^{\mu \mu }):= \sum _{k=1}^\infty \alpha _k |\nabla \xi _k^{\mu }|^2$; slightly abusing notation. Invoking the Gagliardo-Nirenberg-Sobolev embedding theorem, we also have $\left\Vert \rho _{{\varvec{\gamma }}}^{\mu \mu }\right\Vert _{L^3} \le C {\text {Tr}}\,(- \Delta {{\varvec{\gamma }}}^{\mu \mu })$. Since we are working on a bounded domain $\Omega $, we get

$$\begin{aligned} \left\Vert \rho _{{\varvec{\gamma }}}^{\mu \mu }\right\Vert _{L^2} \le C \left\Vert \rho _{{\varvec{\gamma }}}^{\mu \mu }\right\Vert _{L^3} \le C {\text {Tr}}\,(- \Delta {{\varvec{\gamma }}}^{\mu \mu }) \le C {\text {Tr}}\,(- \Delta {{\varvec{\gamma }}}) \le C \left\Vert {\varvec{\gamma }}\right\Vert _{{\varvec{{{\mathcal {C}}}}}}, \end{aligned}$$

(5.23)

where the constant C changes from left to the right above. Hence, we have that if ${{\varvec{\gamma }}}_i \rightarrow {{\varvec{\gamma }}}$ in ${\varvec{{{\mathcal {C}}}}}$ as $i \rightarrow \infty $, then $\rho _{{{\varvec{\gamma }}}_i}^{\mu \mu } \rightarrow \rho _{{\varvec{\gamma }}}^{\mu \mu }$ in $L^{2}(\Omega )$. Hence by applying Hölder’s inequality with $p=q=2$ we have continuity, and therefore lower semi-continuity, for the terms

$$\begin{aligned} \int _\Omega \rho _{{\varvec{\gamma }}}({{\textbf{r}}}) \phi ({{\textbf{r}}}) d {{\textbf{r}}}, \quad - \int _\Omega u({{\textbf{r}}}) \rho _{{\varvec{\gamma }}}^{\uparrow \uparrow } ({{\textbf{r}}}) d {{\textbf{r}}}, \quad - \int _\Omega v({{\textbf{r}}}) \rho _{{\varvec{\gamma }}}^{\downarrow \downarrow } ({{\textbf{r}}}) d {{\textbf{r}}}, \end{aligned}$$

as well as for the magnetic field terms:

$$\begin{aligned} -{\text {Tr}}\,({\mathfrak {B}} {\varvec{\gamma }})= & {} -\mu \int _{\Omega } {{\textbf{B}}}_{x} \left( \rho _{{\varvec{\gamma }}}^{\uparrow \downarrow } + \rho _{{\varvec{\gamma }}}^{\downarrow \uparrow } \right) \, d{{\textbf{r}}}-\mu \int _{\Omega } i {{\textbf{B}}}_y \left( \rho _{{\varvec{\gamma }}}^{\downarrow \uparrow } - \rho _{{\varvec{\gamma }}}^{\uparrow \downarrow } \right) \, d{{\textbf{r}}}\\ {}{} & {} - \mu \int _\Omega {\varvec{B}}_z (\rho _{{\varvec{\gamma }}}^{\uparrow \uparrow } - \rho _{{\varvec{\gamma }}}^{\downarrow \downarrow }) d {{\textbf{r}}}. \end{aligned}$$

Let us concretely demonstrate the continuity of the terms by considering as an example the term $\int _\Omega \rho _{{\varvec{\gamma }}} \phi d {{\textbf{r}}}$. Using Hölder’s inequality and inequality (5.23) we arrive at

$$\begin{aligned} \left| \int _\Omega \rho _{{\varvec{\gamma }}_i} \phi d {{\textbf{r}}}- \int _\Omega \rho _{{\varvec{\gamma }}} \phi d {{\textbf{r}}}\right|&= \left| \int _\Omega \rho _{{\varvec{\gamma }}_i-{\varvec{\gamma }}} \phi d {{\textbf{r}}}\right| \\ {}&=\int _\Omega \left| \phi (\rho _{{\varvec{\gamma }}_i-{\varvec{\gamma }}}^{\uparrow \uparrow } +\rho _{{\varvec{\gamma }}_i-{\varvec{\gamma }}}^{\downarrow \downarrow }) \right| d {{\textbf{r}}}\\&\le \left\Vert \phi \right\Vert _{L^2}\left( \left\Vert \rho _{{\varvec{\gamma }}_i-{\varvec{\gamma }}}^{\uparrow \uparrow }\right\Vert _{L^2} + \left\Vert \rho _{{\varvec{\gamma }}_i-{\varvec{\gamma }}}^{\downarrow \downarrow }\right\Vert _{L^2} \right) \\ {}&\le \left\Vert \phi \right\Vert _{L^2} 2 C \left\Vert {\varvec{\gamma }}_i-{\varvec{\gamma }}\right\Vert _{{\varvec{{{\mathcal {C}}}}}}, \end{aligned}$$

meaning that ${\varvec{\gamma }}\mapsto \int _\Omega \rho _{{\varvec{\gamma }}} \phi d {{\textbf{r}}}$ is continuous with respect to $\Vert \cdot \Vert _{{\varvec{{{\mathcal {C}}}}}}$. To show coercivity, we will apply Hölder’s inequality to the terms in L. We write L as

$$\begin{aligned} L(u,v,\phi , {\varvec{B}},{{\varvec{\gamma }}})= & {} -\frac{1}{2}{\text {Tr}}\,( \Delta {{\varvec{\gamma }}}) + {\text {Tr}}\,(\Phi {{\varvec{\gamma }}}) - {\text {Tr}}\,(U {{\varvec{\gamma }}})- {\text {Tr}}\,(V {{\varvec{\gamma }}}) - {\text {Tr}}\,({\mathfrak {B}} {{\varvec{\gamma }}}) \nonumber \\ {}{} & {} \quad + C_{u,v,\phi }, \end{aligned}$$

(5.24)

where $C_{u,v,\phi } = \int _{\Omega } \left( -C_d |\nabla \phi |^2 +{\mathfrak {f}}\phi \right) d {{\textbf{r}}}+C_\mathrm{{self}} + {\mathfrak {G}}_\mathrm{{xc}}^*(u) + {\mathfrak {G}}_\mathrm{{xc}}^*(v)$. By applying the triangle and Hölder ($p=2=q$) inequalities to ${\text {Tr}}\,({\mathfrak {B}} {\varvec{\gamma }})$, we have

$$\begin{aligned} |{\text {Tr}}\,({\mathfrak {B}} {\varvec{\gamma }})|&\le \mu \left\Vert {{\textbf{B}}}_x\right\Vert _{L^2(\Omega )}\left\Vert \rho _{{\varvec{\gamma }}}^{\uparrow \downarrow } + \rho _{{\varvec{\gamma }}}^{\downarrow \uparrow }\right\Vert _{L^2(\Omega )} + \mu \left\Vert {{\textbf{B}}}_y\right\Vert _{L^2(\Omega )}\left\Vert \rho _{{\varvec{\gamma }}}^{\uparrow \downarrow } - \rho _{{\varvec{\gamma }}}^{\downarrow \uparrow }\right\Vert _{L^2(\Omega )} \nonumber \\&+ \mu \left\Vert {{\textbf{B}}}_z\right\Vert _{L^2(\Omega )}\left\Vert \rho _{{\varvec{\gamma }}}^{\uparrow \uparrow } - \rho _{{\varvec{\gamma }}}^{\downarrow \downarrow }\right\Vert _{L^2(\Omega )}. \end{aligned}$$

(5.25)

Next, we note that

$$\begin{aligned} \left\Vert \rho _{{\varvec{\gamma }}}^{\uparrow \uparrow } -\rho _{{\varvec{\gamma }}}^{\downarrow \downarrow }\right\Vert _{L^2} \le \left\Vert \rho _{{\varvec{\gamma }}}^{\uparrow \uparrow } +\rho _{{\varvec{\gamma }}}^{\downarrow \downarrow }\right\Vert _{L^2} = \left\Vert \rho _{{\varvec{\gamma }}}\right\Vert _{L^2}. \end{aligned}$$

(5.26)

Furthermore, by considering their eigenvalue representation, and with $\textrm{Re}$ and $\textrm{Im}$ being the real and imaginary parts respectively, we have that

$$\begin{aligned}&\rho _{{\varvec{\gamma }}}^{\uparrow \downarrow } + \rho _{{\varvec{\gamma }}}^{\downarrow \uparrow } = \sum _{j=1}^\infty \alpha _j \left( \xi _j^\uparrow \overline{\xi _j^\downarrow } + \xi _j^\downarrow \overline{\xi _j^\uparrow }\right) = \sum _{j=1}^\infty \alpha _j 2 \textrm{Re} (\xi _j^\uparrow \overline{\xi _j^\downarrow }),\\&\rho _{{\varvec{\gamma }}}^{\uparrow \downarrow } - \rho _{{\varvec{\gamma }}}^{\downarrow \uparrow } = \sum _{j=1}^\infty \alpha _j \left( \xi _j^\uparrow \overline{\xi _j^\downarrow } - \xi _j^\downarrow \overline{\xi _j^\uparrow }\right) = \sum _{j=1}^\infty \alpha _j 2 \textrm{Im} (\xi _j^\uparrow \overline{\xi _j^\downarrow }),\\&\rho _{{\varvec{\gamma }}}^{\uparrow \uparrow } + \rho _{{\varvec{\gamma }}}^{\downarrow \downarrow } = \sum _{j=1}^\infty \alpha _j \left( \xi _j^\uparrow \overline{\xi _j^\uparrow } + \xi _j^\downarrow \overline{\xi _j^\downarrow }\right) = \sum _{j=1}^\infty \alpha _j \left( |\xi _j^\uparrow |^2 + |\xi _j^\downarrow |^2 \right) . \end{aligned}$$

By considering the quadratic expansion, we have that

$$\begin{aligned} 2 \textrm{Re} (\xi _j^\uparrow \overline{\xi _j^\downarrow }) \le 2 |\xi _j^\uparrow | |\xi _j^\downarrow | \le |\xi _j^\uparrow |^2 + |\xi _j^\downarrow |^2, \qquad 2 \textrm{Im} (\xi _j^\uparrow \overline{\xi _j^\downarrow }) \le 2 |\xi _j^\uparrow | |\xi _j^\downarrow | \le |\xi _j^\uparrow |^2 + |\xi _j^\downarrow |^2. \end{aligned}$$

Therefore it follows that

$$\begin{aligned}&|\rho _{{\varvec{\gamma }}}^{\uparrow \downarrow } + \rho _{{\varvec{\gamma }}}^{\downarrow \uparrow }| \le |\rho _{{\varvec{\gamma }}}^{\uparrow \uparrow } + \rho _{{\varvec{\gamma }}}^{\downarrow \downarrow }|= |\rho _{{\varvec{\gamma }}}| , \end{aligned}$$

(5.27)

$$\begin{aligned}&|\rho _{{\varvec{\gamma }}}^{\uparrow \downarrow } - \rho _{{\varvec{\gamma }}}^{\downarrow \uparrow }| \le |\rho _{{\varvec{\gamma }}}^{\uparrow \uparrow } + \rho _{{\varvec{\gamma }}}^{\downarrow \downarrow }| = |\rho _{{\varvec{\gamma }}}| . \end{aligned}$$

(5.28)

By inserting inequalities (5.26)-(5.28) into (5.25), we arrive at

$$\begin{aligned} | {\text {Tr}}\,({\varvec{B}})| \le \mu \left( \left\Vert {\varvec{B}}_x\right\Vert _{L^2(\Omega )}+ \left\Vert {\varvec{B}}_y\right\Vert _{L^2(\Omega )}+ \left\Vert {\varvec{B}}_z\right\Vert _{L^2(\Omega )} \right) \left\Vert \rho _{{\varvec{\gamma }}}\right\Vert _{L^2(\Omega )}. \end{aligned}$$

(5.29)

Therefore, when we apply Hölder’s inequality with $p=q=2$ for operators $\Phi $, U and V to (5.24), and use inequality (5.29), we get

$$\begin{aligned} L(u,v,\phi ,{\varvec{B}},{{\varvec{\gamma }}})&\ge -\frac{1}{2}{\text {Tr}}\,( \Delta {{\varvec{\gamma }}}) + C_{u,v,\phi } \end{aligned}$$

(5.30)

$$\begin{aligned}&-\left( \left\Vert u\right\Vert _{L^2(\Omega )} + \left\Vert v\right\Vert _{L^2(\Omega )}+ \left\Vert \phi \right\Vert _{L^2(\Omega )} \right) \left\Vert \rho _{{\varvec{\gamma }}}\right\Vert _{L^2(\Omega )} \nonumber \\&- \mu \left( \left\Vert {{\textbf{B}}}_x\right\Vert _{L^2(\Omega )} + \left\Vert {{\textbf{B}}}_y\right\Vert _{L^2(\Omega )} + \left\Vert {{\textbf{B}}}_{z}\right\Vert _{L^2(\Omega )} \right) \left\Vert \rho _{{\varvec{\gamma }}}\right\Vert _{L^2(\Omega )}. \end{aligned}$$

(5.31)

Next, by interpolation, we have that $\left\Vert \rho _{{\varvec{\gamma }}}\right\Vert _{L^2(\Omega )} \le \left\Vert \rho _{{\varvec{\gamma }}}\right\Vert _{L^{1}(\Omega )}^{1/4} \left\Vert \rho _{{\varvec{\gamma }}}\right\Vert _{L^3(\Omega )}^{3/4}$, and by the Gagliardo-Nirenberg-Sobolev inequality, there exists a constant C independent of $\rho _{{\varvec{\gamma }}}$ so that $\left\Vert \rho _{{\varvec{\gamma }}}\right\Vert _{L^3(\Omega )} = \left\Vert \sqrt{\rho _{{\varvec{\gamma }}}}\right\Vert _{L^6(\Omega )}^{2} \le C \left\Vert \nabla \sqrt{\rho _{{\varvec{\gamma }}}}\right\Vert _{L^2(\Omega )}^{2}$. We note that $\left\Vert \rho _{{\varvec{\gamma }}}\right\Vert _{L^1(\Omega )} = N$, and, therefore,

$$\begin{aligned} \left\Vert \rho _{{\varvec{\gamma }}}\right\Vert _{L^2(\Omega )} \le \left\Vert \rho _{{\varvec{\gamma }}}\right\Vert _{L^{1}(\Omega )}^{1/4} \left\Vert \rho _{{\varvec{\gamma }}}\right\Vert _{L^3(\Omega )}^{3/4} \le N^{1/4} C \left\Vert \nabla \sqrt{\rho _{{\varvec{\gamma }}}}\right\Vert _{L^2(\Omega )}^{3/2}. \end{aligned}$$

Hence, we arrive at

$$\begin{aligned} L(u,v,\phi , {\varvec{B}},{{\varvec{\gamma }}}) \ge&\frac{1}{2}{\text {Tr}}\,(-\Delta {{\varvec{\gamma }}}) +C_{u,v,\phi } - C_1 \left\Vert \nabla \sqrt{\rho _{{\varvec{\gamma }}}}\right\Vert _{L^2(\Omega )}^{3/2}, \end{aligned}$$

where, on the right-hand side,

$$\begin{aligned}{} & {} C_1 = CN^{1/4} \left( \left\Vert u\right\Vert _{L^2(\Omega )} + \left\Vert v\right\Vert _{L^2(\Omega )}+ \left\Vert \phi \right\Vert _{L^2(\Omega )}+ \mu \left[ \left\Vert {{\textbf{B}}}_x\right\Vert _{L^2(\Omega )} \right. \right. \\{} & {} \quad \left. \left. + \left\Vert {{\textbf{B}}}_y\right\Vert _{L^2(\Omega )} +\left\Vert {{\textbf{B}}}_{z}\right\Vert _{L^2(\Omega )} \right] \right) \end{aligned}$$

depends on $N,u,v,\phi ,{{\textbf{B}}}$, but is independent of ${{\varvec{\gamma }}}$. By writing ${\text {Tr}}\,(-\Delta {{\varvec{\gamma }}})= {\text {Tr}}\,(|\nabla |{{\varvec{\gamma }}}|\nabla |)$ and utilizing $\frac{1}{2}\left( {\text {Tr}}\,({{\varvec{\gamma }}})-N \right) =0$, we get

$$\begin{aligned} L(u,v,\phi ,{\varvec{B}}, {\varvec{\gamma }})&\ge \frac{1}{2}{\text {Tr}}\,(\nabla | {{\varvec{\gamma }}}| \nabla ) + \frac{1}{2} {\text {Tr}}\,({{\varvec{\gamma }}}) -N/2 +C_{u,v,\phi } -C \left\Vert \nabla \sqrt{\rho _{{\varvec{\gamma }}}}\right\Vert _{L^2(\Omega )}^{3/2}. \end{aligned}$$

We recall the definition of $\left\Vert \cdot \right\Vert _{{\varvec{{{\mathcal {C}}}}}} = {\text {Tr}}\,(|\cdot |) +{\text {Tr}}\,(|\nabla |\cdot |\nabla |)$. Using the inequality (5.22), and that for f, g positive we have $\big |\nabla \sqrt{f+g}\big |\le \big |\nabla \sqrt{f}\big | +\big |\nabla \sqrt{g}\big |$, and we thus obtain

$$\begin{aligned} \left\Vert \nabla \sqrt{\rho _{{\varvec{\gamma }}}}\right\Vert _{L^2}^2\le & {} 2\left\Vert \nabla \sqrt{\rho _{{\varvec{\gamma }}}^{\uparrow \uparrow }}\right\Vert _{L^2}^2 + 2\left\Vert \nabla \sqrt{\rho _{{\varvec{\gamma }}}^{\downarrow \downarrow }}\right\Vert _{L^2}^2\\\le & {} {\text {Tr}}\,(- \Delta {{\varvec{\gamma }}}^{\uparrow \uparrow }) + {\text {Tr}}\,(- \Delta {{\varvec{\gamma }}}^{\downarrow \downarrow }) \le {\text {Tr}}\,(- \Delta {{\varvec{\gamma }}}). \end{aligned}$$

As a consequence,

$$\begin{aligned} \left\Vert \nabla \sqrt{\rho _{{\varvec{\gamma }}}}\right\Vert _{L^2(\Omega )}^2 \le {\text {Tr}}\,(-\Delta {{\varvec{\gamma }}})={\text {Tr}}\,(|\nabla |{{\varvec{\gamma }}}|\nabla |), \end{aligned}$$

(5.32)

and it follows that

$$\begin{aligned} L(u,v,\phi , {\varvec{B}},{\varvec{\gamma }})&\ge \frac{1}{2}\left\Vert {{\varvec{\gamma }}}\right\Vert _{{\varvec{{{\mathcal {C}}}}}} -N/2 +C_{u,v,\phi } -C {\text {Tr}}\,(|\nabla |{{\varvec{\gamma }}}|\nabla |)^{3/4}\\&\ge \left\Vert {{\varvec{\gamma }}}\right\Vert _{{\varvec{{{\mathcal {C}}}}}}^{3/4}\left( \frac{1}{2}\left\Vert {{\varvec{\gamma }}}\right\Vert _{{\varvec{{{\mathcal {C}}}}}}^{1/4} -C \right) -N/2 +C_{u,v,\phi }. \end{aligned}$$

Since

$$\begin{aligned} \lim _{\left\Vert {{\varvec{\gamma }}}\right\Vert _{{\varvec{{{\mathcal {C}}}}}} \rightarrow \infty } \left\Vert {{\varvec{\gamma }}}\right\Vert _{{\varvec{{{\mathcal {C}}}}}}^{3/4}\left( \frac{1}{2}\left\Vert {{\varvec{\gamma }}}\right\Vert _{{\varvec{{{\mathcal {C}}}}}}^{1/4} -C \right) -N/2 +C_{u,v,\phi } =+ \infty , \end{aligned}$$

it follows that for fixed u, v and $\phi $ that $\lim _{\left\Vert {{\varvec{\gamma }}}\right\Vert _{{\varvec{{{\mathcal {C}}}}}} \rightarrow \infty } L(u,v,\phi ,{\varvec{B}},{{\varvec{\gamma }}}) = +\infty $, as desired. $\square $

Next we establish concavity.

Lemma 5.5

For all $u,v \in L^4(\Omega )$, ${{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}_N$, the functional $L(u,v,\cdot ,{\varvec{B}},{{\varvec{\gamma }}})$ is concave and upper semi-continuous with respect to $\phi $. In addition, for any fixed $u,v \in L^4(\Omega )$, ${\varvec{B}} \in L^2(\Omega )$, and ${{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}_N$,

$$\begin{aligned} \lim _{\left\Vert \phi \right\Vert \rightarrow \infty } L(u,v,\phi ,{\varvec{B}},{{\varvec{\gamma }}}) = - \infty . \end{aligned}$$

(5.33)

Proof

We write $L(u,v,\phi ,{\varvec{B}},{{\varvec{\gamma }}})$ as

$$\begin{aligned} L(u,v,\phi ,{\varvec{B}},{{\varvec{\gamma }}})={\text {Tr}}\,(\Phi {{\varvec{\gamma }}}) -\frac{1}{8\pi } \int _{\Omega } |\nabla \phi ({{\textbf{r}}})|^2 \,d{{\textbf{r}}}+ \int _{\Omega } {\mathfrak {f}}({{\textbf{r}}},{\underline{{{\textbf{R}}}}}) \phi ({{\textbf{r}}}) d{{\textbf{r}}}+C_{u,v,{\varvec{B}},{{\varvec{\gamma }}}}, \end{aligned}$$

where

$$\begin{aligned} C_{u,v,{\varvec{B}},{{\varvec{\gamma }}}} =&{\text {Tr}}\,(-\Delta {{\varvec{\gamma }}}) +{\mathfrak {G}}_\mathrm{{xc}}^*(u)+{\mathfrak {G}}_\mathrm{{xc}}^*(v) - {\text {Tr}}\,(U {{\varvec{\gamma }}}) \\&- {\text {Tr}}\,(V {{\varvec{\gamma }}}) -{\text {Tr}}\,({\mathfrak {B}} {{\varvec{\gamma }}}) +C_\mathrm{{self}}. \end{aligned}$$

Recalling (3.13), ${\text {Tr}}\,(\Phi {{\varvec{\gamma }}})$ and also $\int _{\Omega } {\mathfrak {f}}({{\textbf{r}}},{\underline{{{\textbf{R}}}}}) \phi ({{\textbf{r}}}) d{{\textbf{r}}}$ are linear with respect to $\phi $, they are also concave. Since the second term is a negative quadratic, it is also concave with respect to $\phi $.

Next, we show that $L(u,v,\phi ,{\varvec{B}},{{\varvec{\gamma }}})$ is upper semi-continuous with respect to $\phi $. Again, for fixed ${{\varvec{\gamma }}}$ and b, we have that ${\text {Tr}}\,(\Phi {{\varvec{\gamma }}})$ and $\int _{\Omega } {\mathfrak {f}}({{\textbf{r}}},{\underline{{{\textbf{R}}}}}) \phi ({{\textbf{r}}}) \, d{{\textbf{r}}}$ are continuous with respect to $\phi $. The quadratic term is also upper semi-continuous, see for example [6, Proposition 2.1].

All that remains is to show that, for fixed $u,v,{\varvec{B}}$ and ${{\varvec{\gamma }}}$, as $\left\Vert \phi \right\Vert _{W_0^{1,2}}$ tends to infinity, $L(u,v,\phi ,{\varvec{B}},{{\varvec{\gamma }}})$ will tend to negative infinity. By applying Hölder’s inequality to the linear terms with $p=2=q$, we arrive at

$$\begin{aligned} -L(u,v,\phi ,{\varvec{B}},{{\varvec{\gamma }}})&=-{\text {Tr}}\,(\Phi {{\varvec{\gamma }}}) +\frac{1}{8\pi } \int _{\Omega } |\nabla \phi ({{\textbf{r}}})|^2 \, d{{\textbf{r}}}\\ {}&\quad - \int _{\Omega } {\mathfrak {f}}({{\textbf{r}}},{\underline{{{\textbf{R}}}}}) \phi ({{\textbf{r}}}) \, d{{\textbf{r}}}- C_{u,v,{\varvec{B}},{{\varvec{\gamma }}}}\\&\ge \frac{1}{8\pi }\left\Vert \nabla \phi \right\Vert _{L^2(\Omega )}^2- \left\Vert \phi \right\Vert _{L^2(\Omega )}\left\Vert \rho _{{\varvec{\gamma }}} +{\mathfrak {f}}({{\textbf{r}}},{\underline{{{\textbf{R}}}}})\right\Vert _{L^2(\Omega )} -C_{u,v,{\varvec{B}},{{\varvec{\gamma }}}}\\&\ge C_\mathrm{{poin}}\left\Vert \phi \right\Vert _{W_0^{1,2}(\Omega )}^2 {-} \left\Vert \phi \right\Vert _{W_0^{1,2}(\Omega )}\left\Vert \rho _{{\varvec{\gamma }}} {+}{\mathfrak {f}}({{\textbf{r}}},{\underline{{{\textbf{R}}}}})\right\Vert _{L^2(\Omega )} - C_{u,v,{\varvec{B}},{{\varvec{\gamma }}}}, \end{aligned}$$

where for the final inequality we applied the Poincaré inequality, which is where the constant $C_\mathrm{{ poin}}$ comes from. Next, by Young’s inequality,

$$\begin{aligned} ab \le \epsilon a^2+ \frac{b^2}{4 \epsilon }, \end{aligned}$$

with $a=\left\Vert \phi \right\Vert _{W_0^{1,2}(\Omega )} $ and $b=\left\Vert \rho _{{\varvec{\gamma }}} + {\mathfrak {f}}({{\textbf{r}}},{{\textbf{R}}})\right\Vert _{L^2(\Omega )}$, we have that

$$\begin{aligned} -L(u,v,\phi , {\varvec{B}},{{\varvec{\gamma }}})&\ge C_\mathrm{{poin}}\left\Vert \phi \right\Vert _{W_0^{1,2}(\Omega )}^2 - \epsilon \left\Vert \phi \right\Vert _{W_0^{1,2}(\Omega )}^2 \nonumber \\ {}&\quad - \frac{1}{4 \epsilon }\left\Vert \rho _{{\varvec{\gamma }}} +{\mathfrak {f}}({{\textbf{r}}},{\underline{{{\textbf{R}}}}})\right\Vert _{L^2(\Omega )}^2 - C_{u,v,{\varvec{B}},{{\varvec{\gamma }}}} \nonumber \\&\ge \left( C_\mathrm{{poin}}- \epsilon \right) \left\Vert \phi \right\Vert _{W_0^{1,2}(\Omega )}^2 -\frac{1}{4 \epsilon } \left\Vert \rho _{{\varvec{\gamma }}} + {\mathfrak {f}}({{\textbf{r}}},{\underline{{{\textbf{R}}}}})\right\Vert _{L^2(\Omega )}^2 - C_{u,v,{\varvec{B}},{{\varvec{\gamma }}}}. \end{aligned}$$

(5.34)

Let us choose $\epsilon $ small enough so that $C_\mathrm{{poin}}-\epsilon >0$. This means that

$$\begin{aligned} \lim _{\left\Vert \phi \right\Vert _{W_0^{1,2}(\Omega ) } \rightarrow \infty } \left( C_\mathrm{{poin}}- \epsilon \right) \left\Vert \phi \right\Vert _{W_0^{1,2}(\Omega )}^2 = + \infty . \end{aligned}$$

Then,

$$\begin{aligned} L(u,v,\phi ,{\varvec{B}},{{\varvec{\gamma }}}) \le -\left( C_\mathrm{{poin}}- \epsilon \right) \left\Vert \phi \right\Vert _{W_0^{1,2}(\Omega )}^2 +\frac{1}{4 \epsilon } \left\Vert \rho _{{\varvec{\gamma }}} + {\mathfrak {f}}({{\textbf{r}}},{\underline{{{\textbf{R}}}}})\right\Vert _{L^2(\Omega )}^2 + C_{u,v,{\varvec{B}},{{\varvec{\gamma }}}}, \end{aligned}$$

and, therefore,

$$\begin{aligned} \lim _{\left\Vert \phi \right\Vert _{W_0^{1,2}(\Omega )} \rightarrow \infty } L(u,v,\phi ,{\varvec{B}},{{\varvec{\gamma }}})&\le - \lim _{\left\Vert \phi \right\Vert _{W_0^{1,2}(\Omega )} \rightarrow \infty } \left( C_\mathrm{{poin}}- \epsilon \right) \left\Vert \phi \right\Vert _{W_0^{1,2}(\Omega )}^2 \\ \nonumber&\quad +\frac{1}{4 \epsilon } \left\Vert \rho _{{\varvec{\gamma }}} + {\mathfrak {f}}({{\textbf{r}}},{{\textbf{R}}})\right\Vert _{L^2(\Omega )}^2 + C_{u,v,{\varvec{B}},{{\varvec{\gamma }}}} = - \infty . \end{aligned}$$

Hence, $\lim _{\left\Vert \phi \right\Vert _{W_0^{1,2}(\Omega )} \rightarrow \infty } L(u,v,\phi ,{\varvec{B}},{{\varvec{\gamma }}})=-\infty $. $\square $

Having prepared the auxiliary results above, we are now ready to show that it is possible to exchange the orders of the infimum and supremum when computing the energy.

Theorem 5.6

With the assumptions as before,

$$\begin{aligned} {{\textsf {E}}}({\varvec{B}})= & {} \inf _{{{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}_N} \inf _{u \in L^4(\Omega )} \inf _{v \in L^4(\Omega )} \sup _{\phi \in W_0^{1,2}(\Omega )} L(u,v,\phi ,{\varvec{B}}, {{\varvec{\gamma }}}) \\= & {} \inf _{u \in L^4(\Omega )} \inf _{v \in L^4(\Omega )} \inf _{{{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}_N} \sup _{\phi \in W_0^{1,2}(\Omega )} L(u,v,\phi ,{\varvec{B}}, {{\varvec{\gamma }}}) \\= & {} \inf _{u \in L^4 (\Omega )} \inf _{v \in L^4(\Omega )} \sup _{\phi \in W_0^{1,2} (\Omega )} \inf _{{{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}_{N}} \Big \{ {\text {Tr}}\,( {\varvec{H}}(u,v,\phi ,{\varvec{B}}) {{\varvec{\gamma }}}) \\{} & {} + \int _{\Omega } \left( -C_d | \nabla \phi ({{\textbf{r}}})|^2 + {\mathfrak {f}}({{\textbf{r}}},{\underline{{{\textbf{R}}}}}) \phi ({{\textbf{r}}}) \right) d {{\textbf{r}}}+ {\mathfrak {G}}_\mathrm{{xc}}^{*} (u)+{\mathfrak {G}}_\mathrm{{xc}}^{*} (v) \Big \}. \end{aligned}$$

Proof

For fixed $u,v \in L^4 (\Omega )$ we want to apply [7, Proposition 2.2] in the functional $(\phi ,{{\varvec{\gamma }}}) \mapsto L(u,v,\phi ,{\varvec{B}},{{\varvec{\gamma }}})$. The convergence in ${{\varvec{\gamma }}}$ is with respect to the $\text{ weak}^{*}$-topology in ${\varvec{{{\mathcal {C}}}}}$. Since $W_0^{1,2} (\Omega )$ and ${\varvec{{{\mathcal {C}}}}}_N$ are convex, non-empty sets, [7, Proposition 2.2] is valid provided the following conditions are met for arbitrary $u,v \in L^4 (\Omega )$:

(p1) For all $\phi \in W_0^{1,2} (\Omega )$, $L(u,v,\phi ,{\varvec{B}},\cdot )$ is convex and lower semi-continuous.

(p2) For all ${{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}_N$, $L(u,v,\cdot ,{\varvec{B}},{{\varvec{\gamma }}})$ is concave and upper semi-continuous.

(p3) There exists a $\phi _0 \in W_0^{1,2} (\Omega )$ such that

$$\begin{aligned} \lim _{\Vert {{\varvec{\gamma }}}\Vert _{{\varvec{{{\mathcal {C}}}}}} \rightarrow \infty } L(u,v,\phi _0,{\varvec{B}},{{\varvec{\gamma }}}) = + \infty . \end{aligned}$$

(5.35)

(p4) There exists ${{\varvec{\gamma }}}_0 \in {\varvec{{{\mathcal {C}}}}}_N$ such that $\displaystyle \lim _{\Vert \phi \Vert _{W^{1,2} (\Omega )} \rightarrow 0} L(u,v,\phi ,{\varvec{B}},{{\varvec{\gamma }}}_0)=-\infty $.

The properties (p1) and (p3) follow from Lemma 5.4, Lemma 5.5 implies the properties (p2) and (p4). Hence [7, Proposition 2.2] applies. This ensures that there exists at least one saddle point $(\phi , {{\varvec{\gamma }}})$ of $L(u,v,\cdot ,{\varvec{B}},\cdot )$ for every $u,v \in L^4 (\Omega )$ and that the order of infimum in ${{\varvec{\gamma }}}$ and supremum in $\phi $ can be exchanged without affecting the ground-state energy. $\square $

5.4 Commutativity of band energy

Theorem 5.7

With the assumptions stated above, we have that, for every $u,v\in L^4 (\Omega )$, ${\varvec{B}} \in L^2(\Omega )$ and every $\phi \in W_0^{1,2} (\Omega )$ a minimizer of the band energy $E_\mathrm{{band}} (u,v,\phi ,{\varvec{B}},\cdot )$ in ${\varvec{{{\mathcal {C}}}}}_N$ commutes with the Hamiltonian ${\varvec{H}}(u,v,\phi ,{\varvec{B}})$.

Proof

For fixed u, v, and $\phi $, the operator ${\varvec{H}}(u,v,\phi ,{\varvec{B}})$ on $L^2(\Omega ; \mathbb {C}^2)$ is self-adjoint, unbounded, and semi-bounded below. Moreover, since ${\varvec{H}}(u,v,\phi ,{\varvec{B}})$ is bounded below, there exists a smallest eigenvalue, say $\lambda _1$. Let $\lambda _k$ and ${\varvec{\xi }}_k$ denote the $k^{th}$ eigenvalue and eigenvector, respectively, where the eigenvalues are in increasing order. It then follows that

$$\begin{aligned} \inf _{{{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}_N} E_\mathrm{{band}}(u,v,\phi ,{\varvec{B}},{{\varvec{\gamma }}})&= \inf _{{{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}_N} {\text {Tr}}\,({\varvec{H}}(u,v,\phi ,{\varvec{B}}){{\varvec{\gamma }}})\\&= \inf _{{{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}_N} \sum _{i=1}^\infty \left\langle {\varvec{H}}(u,v,\phi ,{\varvec{B}}){{\varvec{\gamma }}} {\varvec{\xi }}_i, {\varvec{\xi }}_i \right\rangle . \end{aligned}$$

By self-adjointness we have

$$\begin{aligned}&\inf _{{{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}_N} \sum _{i=1}^\infty \left\langle {\varvec{H}}(u,v,\phi ,{\varvec{B}}){{\varvec{\gamma }}} {\varvec{\xi }}_i, {\varvec{\xi }}_i \right\rangle = \inf _{{{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}_N} \sum _{i=1}^\infty \left\langle {{\varvec{\gamma }}} {\varvec{\xi }}_i, {\varvec{H}}(u,v,\phi ,{\varvec{B}}) {\varvec{\xi }}_i \right\rangle . \end{aligned}$$

Since ${\varvec{\xi }}_i$ are eigenvectors of ${\varvec{H}}$, we have

$$\begin{aligned}&\inf _{{{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}_N} \sum _{i=1}^\infty \left\langle {{\varvec{\gamma }}} {\varvec{\xi }}_i, {\varvec{H}}(u,v,\phi ,{\varvec{B}}) {\varvec{\xi }}_i \right\rangle = \inf _{{{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}_N} \sum _{i=1}^\infty \left\langle {{\varvec{\gamma }}} {\varvec{\xi }}_i, \lambda _i {\varvec{\xi }}_i \right\rangle . \end{aligned}$$

If we wish to minimize the expression $\inf _{{{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}_N} \sum _{i=1}^\infty \lambda _i \left\langle {{\varvec{\gamma }}} {\varvec{\xi }}_i, {\varvec{\xi }}_i \right\rangle $, we remember that ${{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}_N$ requires that ${\text {Tr}}\,({{\varvec{\gamma }}}) = N$. Therefore, the infimum over ${{\varvec{\gamma }}}$ corresponds to taking the N eigenvectors that have the correspondingly lowest eigenvalues. Hence,

$$\begin{aligned} \inf _{{{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}_N} \sum _{i=1}^\infty \lambda _i \left\langle {{\varvec{\gamma }}} {\varvec{\xi }}_i, {\varvec{\xi }}_i \right\rangle = \sum _{i=1}^N \lambda _k. \end{aligned}$$

(5.36)

Therefore, it follows that there exists a Borel function g so that

$$\begin{aligned} g(\lambda )= {\left\{ \begin{array}{ll} 1, &{} \lambda \le \lambda _N \\ 0, &{} \text{ otherwise } \end{array}\right. }. \end{aligned}$$

(5.37)

Since the minimizer is of the form of a Borel function, it suffices to minimize over Borel functions only. In other words, we will minimize over the restricted space

$$\begin{aligned} {\varvec{{\widehat{{{\mathcal {C}}}}}}}_N = \lbrace {{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}_N \,: \, {{\varvec{\gamma }}}=g( {\varvec{H}}(u,v,\phi ,{\varvec{B}}) ), \, g \text{ is } \text{ a } \text{ Borel } \text{ function } \rbrace . \end{aligned}$$

(5.38)

Operators $\hat{{\varvec{\gamma }}}$ in $\hat{{\varvec{{{\mathcal {C}}}}}}_N$ have a spectral decomposition, and therefore $E_\mathrm{{band}} (u,v,\phi ,{\varvec{B}},\hat{{\varvec{\gamma }}})$ commutes with the Hamiltonian ${\varvec{H}}(u,v,\phi ,{\varvec{B}})$. Because we have shown that minimizing over ${\varvec{{{\mathcal {C}}}}}_N$ reduces to minimizing over $\hat{{\varvec{{{\mathcal {C}}}}}}_N$, it follows that $E_\mathrm{{band}} (u,v,\phi ,{\varvec{B}},{\varvec{\gamma }})$ will also commute with the Hamiltonian ${\varvec{H}}(u,v,\phi ,{\varvec{B}})$ for ${\varvec{\gamma }} \in {\varvec{{{\mathcal {C}}}}}_N $. $\square $

5.5 Spatial discretization

We define the discrete counterparts to ${\varvec{\gamma }}$, $\rho _{{\varvec{\gamma }}}$, $\rho _{{\varvec{\gamma }}}^{\uparrow \uparrow }$ and $\rho _{{\varvec{\gamma }}}^{\downarrow \downarrow }$. First, however, we define the basis vectors ${\varvec{\xi }}_a \in W_0^{1,2}(\Omega ; \mathbb {C}^2)_j$, for $1 \le a \le j$, by ${\varvec{\xi }}_a = ( \xi _a^\uparrow , \xi _a^\downarrow )^\textrm{T}$. We have

$$\begin{aligned} {\varvec{\gamma }}_j = \begin{pmatrix} \gamma _j^{\uparrow \uparrow } &{} \gamma _j^{\uparrow \downarrow }\\ \gamma _j^{\downarrow \uparrow } &{} \gamma _j^{\downarrow \downarrow } \end{pmatrix}, \end{aligned}$$

where

$$\begin{aligned} \gamma _j^{\uparrow \uparrow }( {{\textbf{r}}}, {{\textbf{r}}}^\prime ) = \sum _{a_1=1}^j \sum _{a_2=1}^j \gamma _{j, a_1, a_2}^{\uparrow \uparrow } \xi _{a_1}^{\uparrow }({{\textbf{r}}}) \overline{\xi _{a_2}^{ \uparrow }({{\textbf{r}}}^\prime )} = \sum _{a_1=1}^j \sum _{a_2=1}^j \gamma _{j, a_1, a_2}^{ \uparrow \uparrow } {\varvec{\xi }}_{a_1}({{\textbf{r}}}) \cdot \begin{pmatrix} 1 &{} 0\\ 0 &{} 0 \end{pmatrix} {\varvec{\xi }}_{a_2}({{\textbf{r}}}^\prime ), \\ \gamma _j^{\uparrow \downarrow }( {{\textbf{r}}}, {{\textbf{r}}}^\prime ) = \sum _{a_1=1}^j \sum _{a_2=1}^j \gamma _{j, a_1, a_2}^{\uparrow \downarrow } \xi _{a_1}^{\uparrow }({{\textbf{r}}})\overline{\xi _{a_2}^{ \downarrow }({{\textbf{r}}}^\prime )} = \sum _{a_1=1}^j \sum _{a_2=1}^j \gamma _{j, a_1, a_2}^{ \uparrow \downarrow } {\varvec{\xi }}_{a_1}({{\textbf{r}}}) \cdot \begin{pmatrix} 0 &{} 1\\ 0 &{} 0 \end{pmatrix} {\varvec{\xi }}_{a_2}( {{\textbf{r}}}^\prime ), \end{aligned}$$

and, analogously, for $\gamma _j^{\downarrow \uparrow }( {{\textbf{r}}}, {{\textbf{r}}}^\prime )$ and $ \gamma _j^{\downarrow \downarrow }( {{\textbf{r}}}, {{\textbf{r}}}^\prime )$. For $\rho _{{\varvec{\gamma }}}$, we recall that

$$\begin{aligned} \rho _{{\varvec{\gamma }}}= & {} {{\varvec{\gamma }}}^{\uparrow \uparrow }({{\textbf{r}}},{{\textbf{r}}}) + {{\varvec{\gamma }}}^{\downarrow \downarrow }({{\textbf{r}}},{{\textbf{r}}}) =\sum _{k=1}^\infty \alpha _k \left( \xi ^{\uparrow }_k({{\textbf{r}}}) \overline{\xi ^{\uparrow }}_k({{\textbf{r}}}) + \xi ^{\downarrow }_k({{\textbf{r}}}) \overline{\xi ^{\downarrow }}_k({{\textbf{r}}}) \right) \\= & {} \sum _{k=1}^\infty \alpha _k {\varvec{\xi }}_k({{\textbf{r}}}) \cdot {\xi }_k ({{\textbf{r}}}), \end{aligned}$$

where the second equality comes from the spectral definition of ${{\varvec{\gamma }}}$. Therefore, we will define

$$\begin{aligned} \rho _{j} = \sum _{a_1=1}^j \sum _{a_2=1}^j \rho _{j, a_1, a_2} \left( \xi _{a_1}^{\uparrow }({{\textbf{r}}})\overline{\xi _{a_2}^{\uparrow }({{\textbf{r}}})} +\xi _{a_1}^{ \downarrow }({{\textbf{r}}})\overline{\xi _{a_2}^{\downarrow }({{\textbf{r}}})}\right) = \sum _{a_1=1}^j \sum _{a_2=1}^j \rho _{j, a_1, a_2} {\varvec{\xi }}_{a_1} ({{\textbf{r}}}) \cdot {\varvec{\xi }}_{a_2} ({{\textbf{r}}}). \end{aligned}$$

Note that $\alpha _k$ is not necessarily equal to $\rho _{j,a_1,a_2}$, since we still want $\rho _j$ to integrate to the total density, N. We justify our choices of $\Phi _{a_1,a_2}^j$ and ${\mathfrak {B}}^j$. The others follow in an analogous way. We initially defined $\Phi $ in order that

$$\begin{aligned} {\text {Tr}}\,(\Phi {{\varvec{\gamma }}})= & {} \int _{\Omega } \phi ({{\textbf{r}}}) \rho _{{{\varvec{\gamma }}}}({{\textbf{r}}}) \, d{{\textbf{r}}}=\int _{\Omega } \phi ({{\textbf{r}}}) \left( \rho ^{\uparrow \uparrow }({{\textbf{r}}}) + \rho ^{\downarrow \downarrow }({{\textbf{r}}})\right) \, d{{\textbf{r}}}\\ {}= & {} \sum _{k=1}^\infty \alpha _k \int _{\Omega } \phi ({{\textbf{r}}}) {\varvec{\xi }}_{k}({{\textbf{r}}}) \cdot {\varvec{\xi }}_{k}({{\textbf{r}}}) \, d{{\textbf{r}}}, \end{aligned}$$

where the second equality comes from our definition of $\rho _{{\varvec{\gamma }}}$. Since we have

$$\begin{aligned} \phi ^j({{\textbf{r}}}) = \left( \sum _{k=1}^j \phi _{k} e_k({{\textbf{r}}}) \right) , \qquad \rho _{{\varvec{\gamma }}}^j({{\textbf{r}}}) = \sum _{k_1=1}^j \sum _{k_2=1}^j \rho _{j,k_1,k_2} {\varvec{\xi }}_{k_1}({{\textbf{r}}}) \cdot {\varvec{\xi }}_{k_2}({{\textbf{r}}}), \end{aligned}$$

it then follows that

$$\begin{aligned} {\text {Tr}}\,(\Phi ^j {{\varvec{\gamma }}}_j )&= \sum _{k_1=1}^j \sum _{k_2=1}^j \rho _{j,k_1,k_2} \int _{\Omega } \left( \sum _{k=1}^j \phi _{k} e_k({{\textbf{r}}}) \right) {\varvec{\xi }}_{k_1}({{\textbf{r}}}) \cdot {\varvec{\xi }}_{k_2}({{\textbf{r}}}) \, d{{\textbf{r}}}\\ {}&= \sum _{k_1=1}^j \sum _{k_2=1}^j \rho _{j,k_1,k_2} \Phi _{a_1,a_2}^j , \end{aligned}$$

where the final equality comes from our definition of $\Phi _{a_1, a_2}^j.$ Let us now take a look at the other example, ${\mathfrak {B}}^j$. In particular we look at the ${{\textbf{B}}}_z$ component. We recall by definition of the trace that

$$\begin{aligned} \mu \int _{\Omega } {\varvec{B}}_z({{\textbf{r}}}) \left[ \rho ^{\uparrow \uparrow }({{\textbf{r}}}) -\rho ^{\downarrow \downarrow }({{\textbf{r}}})\right] \, d{{\textbf{r}}}= \mu \int _{\Omega } {\varvec{B}}_{z}({{\textbf{r}}}) \left[ \sum _{k=1}^\infty \alpha _k {\varvec{\xi }}_k({{\textbf{r}}}) \begin{pmatrix} 1 &{} 0\\ 0 &{}-1 \end{pmatrix} {\varvec{\xi }}_k({{\textbf{r}}}) \right] \, d{{\textbf{r}}}. \end{aligned}$$

Therefore, with $ {\varvec{B}}_{j,z}({{\textbf{r}}}) = \sum _{k=1}^j {{\mathcal {B}}}_{z,k} {\mathfrak {d}}_k({{\textbf{r}}})$ and

$$\begin{aligned} (\rho ^{\uparrow \uparrow }({{\textbf{r}}}) -\rho ^{\downarrow \downarrow }({{\textbf{r}}}))^j = \sum _{k_1=1}^j \sum _{k_2=1}^j \rho _{j,k_1,k_2} {\varvec{\xi }}_{k_1}({{\textbf{r}}}) \begin{pmatrix} 1 &{} 0\\ 0 &{}-1 \end{pmatrix} {\varvec{\xi }}_{k_2}({{\textbf{r}}}), \end{aligned}$$

we arrive at

$$\begin{aligned}&\mu \int _{\Omega } \left( \sum _{k=1}^j {{\mathcal {B}}}_{z,k} {\mathfrak {d}}_k ({{\textbf{r}}}) \right) \left[ \sum _{k_1=1}^j \sum _{k_2=1}^j \rho _{j,k_1,k_2} {\varvec{\xi }}_{k_1} ({{\textbf{r}}}) \begin{pmatrix} 1 &{} 0\\ 0 &{}-1 \end{pmatrix} {\varvec{\xi }}_{k_2} ({{\textbf{r}}}) \right] \, d{{\textbf{r}}}\\&= \mu \sum _{k_1=1}^j \sum _{k_2=1}^j \rho _{j,k_1,k_2} \int _{\Omega } \sum _{k=1}^j {{\mathcal {B}}}_{z,k} {\mathfrak {d}}_k({{\textbf{r}}}) {\varvec{\xi }}_{k_1} ({{\textbf{r}}}) \begin{pmatrix} 1 &{} 0\\ 0 &{}-1 \end{pmatrix} {\varvec{\xi }}_{k_2}({{\textbf{r}}}) \, d{{\textbf{r}}}. \end{aligned}$$

Repeating a similar process with ${{\textbf{B}}}_x$ and ${{\textbf{B}}}_y$, we obtain

$$\begin{aligned} {\text {Tr}}\,({\mathfrak {B}}^j {{\varvec{\gamma }}_j} )= & {} \mu \sum _{k_1=1}^j \sum _{k_2=1}^j \rho _{j,k_1,k_2} \int _{\Omega } \sum _{k=1}^j {{\mathcal {B}}}_{x,k} {\mathfrak {d}}_k ({{\textbf{r}}}) {\varvec{\xi }}_{k_1} ({{\textbf{r}}}) \begin{pmatrix} 0 &{} 1\\ 1 &{} 0 \end{pmatrix} {\varvec{\xi }}_{k_2} ({{\textbf{r}}}) \, d{{\textbf{r}}}\\{} & {} + \mu \sum _{k_1=1}^j \sum _{k_2=1}^j \rho _{j,k_1,k_2} \int _{\Omega } \sum _{k=1}^j i{{\mathcal {B}}}_{y,k} {\mathfrak {d}}_k({{\textbf{r}}}) {\varvec{\xi }}_{k_1}({{\textbf{r}}}) \begin{pmatrix} 0 &{} -1\\ 1 &{}0 \end{pmatrix} {\varvec{\xi }}_{k_2} ({{\textbf{r}}}) \, d{{\textbf{r}}}\\{} & {} +\mu \sum _{k_1=1}^j \sum _{k_2=1}^j \rho _{j,k_1,k_2} \int _{\Omega } \sum _{k=1}^j {{\mathcal {B}}}_{z,k} {\mathfrak {d}}_k({{\textbf{r}}}) {\varvec{\xi }}_{k_1} ({{\textbf{r}}}) \begin{pmatrix} 1 &{} 0\\ 0 &{}-1 \end{pmatrix} {\varvec{\xi }}_{k_2} ({{\textbf{r}}}) \, d{{\textbf{r}}}\\ {}= & {} \sum _{k_1=1}^j \sum _{k_2=1}^j \rho _{j,k_1,k_2} {\mathfrak {B}}^j. \end{aligned}$$

5.6 Proof of main Theorem 4.1

We first establish the convergence of the exact band energies ${\text {Tr}}\,( {\varvec{H}}^j (u_j,v_{j}, \phi _j, {\varvec{B}}_j) {{\varvec{\gamma }}}_j)$. Secondly, we validate the convergence of the approximate trace operators.

5.6.1 Convergence of exact band energies

We first establish $\Gamma $-convergence of the exact band energies ${\text {Tr}}\,({\varvec{H}}^j (u_j,v_j, \phi _j, {\varvec{B}}_j) {\varvec{\gamma }}_j)$. For given sequences $(u_j)_{j\in {{\mathbb {N}}}}, (v_j)_{j\in {{\mathbb {N}}}} \subset L^4(\Omega )$, $({\varvec{B}}_j)_{j \in {{\mathbb {N}}}} \subset L^2(\Omega )$, $(\phi _j)_{j\in {{\mathbb {N}}}} \subset W_0^{1,2} (\Omega )$ with $u_j \rightarrow u, v_{j} \rightarrow v$ in $L^4 (\Omega )$, ${\varvec{B}}_j \rightarrow {\varvec{B}}$ in $L^2(\Omega )^3$, and $\phi _j \rightarrow \phi $ in $W_0^{1,2} (\Omega )$ for $j \rightarrow \infty $ we are going to establish the $\Gamma $-convergence result

$$\begin{aligned} {\text {Tr}}\,( {\varvec{H}}^j (u_j,v_{j}, {\varvec{B}}_j, \phi _j) {{\varvec{\gamma }}}_j) + I_{{\varvec{{{\mathcal {C}}}}}_{N,k_j}^{{\varvec{H}}^j (u_j,v_{j}, {\varvec{B}}_j, \phi _j)}} ({{\varvec{\gamma }}}_j)\overset{\Gamma }{\rightarrow } & {} E_\mathrm{{band}} (u,v,\phi ,{\varvec{B}},{{\varvec{\gamma }}}) \nonumber \\ {}{} & {} + I_{{\varvec{{{\mathcal {C}}}}}_{N}^{{\varvec{H}}(u,v,\phi ,{\varvec{B}})}} ({{\varvec{\gamma }}}) \end{aligned}$$

(5.39)

with respect to the $\text{ weak}^{\star }$-topology in ${\varvec{{{\mathcal {C}}}}}$ as $j \rightarrow \infty $. Recalling the properties of $\Gamma $-convergence, we need to verify: (i) The lower semi-continuity of

$$\begin{aligned} {{\varvec{\gamma }}} \mapsto {\text {Tr}}\,( {\varvec{H}}^j (u_j,v_{j},\phi _j, {\varvec{B}}_j) {{\varvec{\gamma }}}_j) + I_{{\varvec{{{\mathcal {C}}}}}_{N,k_j}^{{\varvec{H}}^j(u_j,v_{j}, \phi _j, {\varvec{B}}_j )}} ({{\varvec{\gamma }}}); \end{aligned}$$

(5.40)

(ii) The existence of a recovery sequence.

Part (i) is shown in Lemma 5.8, and Part (ii) is shown in Lemma 5.9.

Lemma 5.8

Let $u_j \rightharpoonup u$ and $v_j \rightharpoonup v$, both in $L^4 (\Omega ), \ \phi _j \rightharpoonup \phi $ in $W_0^{1,2} (\Omega )$ and ${\varvec{B}}_j \rightharpoonup {\varvec{B}}$ in its usual space $L^2(\Omega )^3$, as $j \rightarrow \infty $, be four given sequences. Then, for arbitrary $({{\varvec{\gamma }}}_j)_{j \in {{\mathbb {N}}}} \subset {\varvec{{{\mathcal {C}}}}}$ with ${{\varvec{\gamma }}}_j \rightharpoonup ^\star {{\varvec{\gamma }}}$ in ${\varvec{{{\mathcal {C}}}}}$,

$$\begin{aligned} \liminf _{j \rightarrow \infty } \{ {\text {Tr}}\,( {\varvec{H^j}} (u_j,v_{j}, \phi _j,{\varvec{B}}_j) {{\varvec{\gamma }}}_j) + I_{{\varvec{{{\mathcal {C}}}}}_{N,k_j}^{{\varvec{H}}^j (u_j,v_{j}, \phi _j,{\varvec{B}}_j)}} ({{\varvec{\gamma }}}_j) \}\ge & {} E_\mathrm{{band}} (u, v, \phi ,{\varvec{B}},{{\varvec{\gamma }}})\nonumber \\ {}{} & {} + I_{{\varvec{{{\mathcal {C}}}}}_{N}^{{\varvec{H}}(u,v,\phi ,{\varvec{B}})}} ({{\varvec{\gamma }}}). \nonumber \\ \end{aligned}$$

(5.41)

We call the band energies ${\text {Tr}}\,( {\varvec{H}}^j (u_j,v_{j}, \phi _j, {\varvec{B}}_j) {{\varvec{\gamma }}}_j)$ exact because no spectral discretization of the trace operator enters.

Proof

We will consider four disjoint and exhaustive cases: 1. Let ${{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}_N^{{\varvec{H}}(u,v,\phi ,{\varvec{B}})}$ and $({{\varvec{\gamma }}}_j)_{j \in {{\mathbb {N}}}} \subset {\varvec{{{\mathcal {C}}}}}$ be a sequence with ${{\varvec{\gamma }}}_j \overset{\star }{\rightarrow }\ {{\varvec{\gamma }}}$ such that there exists a $j_1 \in {{\mathbb {N}}}$ such that ${{\varvec{\gamma }}}_j \in {\varvec{{{\mathcal {C}}}}}_{N,k_j}^{{\varvec{H}}^j(u_j,v_{j},\phi _j,{\varvec{B}}_j)}$ for all $j \ge j_1$. By the lower semi-continuity, we have $\lim _{j \rightarrow \infty } \inf {\text {Tr}}\,(-\Delta {{\varvec{\gamma }}}_j) \ge {\text {Tr}}\,(-{{\varvec{\gamma }}} )$ and by the compact embedding of $W_0^{1,2} (\Omega )$ in $L^2 (\Omega ), \ {{\varvec{\gamma }}}_j \rightarrow {{\varvec{\gamma }}}$ in ${\varvec{{{\mathcal {C}}}}}$ implies that $\rho _{{{\varvec{\gamma }}}_j}^{\mu \mu } \rightarrow \rho _{{{\varvec{\gamma }}}}^{\mu \mu }$ in $L^2 (\Omega )$. This yields for $\Phi _j, \ U_j, V_j$ corresponding to $\phi _j, u_j, v_j$, defined as in (3.20) and (3.22),

$$\begin{aligned} \lim _{j\rightarrow \infty } {\text {Tr}}\,( (\Phi _j - U_j-V_j) {\varvec{\gamma }}_j )= & {} \lim _{j \rightarrow \infty } \int _{\Omega } ( \phi _j ({{\textbf{r}}}) - u_j ({{\textbf{r}}})- v_j ({{\textbf{r}}}) ) \ \rho _{{{\varvec{\gamma }}}_j}({{\textbf{r}}}) \ d {{\textbf{r}}}\\= & {} \int _{\Omega } (\phi ({{\textbf{r}}}) - u({{\textbf{r}}}) - v({{\textbf{r}}})) \ \rho _{{\varvec{\gamma }}} ({{\textbf{r}}}) \ d{{\textbf{r}}}= {\text {Tr}}\,( (\Phi - U-V) {\varvec{\gamma }}). \end{aligned}$$

The second equality is true because of the weak convergence of $( \phi _j)$ and $( u_j )$, and because

$$\begin{aligned}{} & {} \lim _{j \rightarrow \infty } \left[ \int _{\Omega } (\phi _j({{\textbf{r}}}) - u_j({{\textbf{r}}})- v_j({{\textbf{r}}})) \rho _{{{\varvec{\gamma }}}_j}({{\textbf{r}}}) \ d {{\textbf{r}}}- \int _{\Omega } (\phi _j({{\textbf{r}}}) - u_j({{\textbf{r}}})- v_j({{\textbf{r}}})) \rho _{{{\varvec{\gamma }}}}({{\textbf{r}}}) \ d {{\textbf{r}}}\right] \\{} & {} \quad = \lim _{j \rightarrow \infty } \int _{\Omega } (\phi _j({{\textbf{r}}}) - u_j({{\textbf{r}}})- v_j({{\textbf{r}}}))(\rho _{{{\varvec{\gamma }}}_j}({{\textbf{r}}}) - \rho _{{{\varvec{\gamma }}}}({{\textbf{r}}})) \ d {{\textbf{r}}}\\{} & {} \quad \le \left\Vert \phi _j({{\textbf{r}}}) - u_j({{\textbf{r}}})- v_j({{\textbf{r}}})\right\Vert _{L^2(\Omega )} \left\Vert \rho _{{{\varvec{\gamma }}}_j}({{\textbf{r}}}) - \rho _{{{\varvec{\gamma }}}}({{\textbf{r}}})\right\Vert _{L^2(\Omega )} =0, \end{aligned}$$

where we used Hölder’s inequality and the $L^2$-convergence of $(\rho _{{{\varvec{\gamma }}}_j})_{j}$. The magnetic term also follows easily,

$$\begin{aligned}{} & {} \lim _{j \rightarrow \infty } {\text {Tr}}\,( {\mathfrak {B}}^{j} {{\varvec{\gamma }}}_j ) = \lim _{j \rightarrow \infty } \mu \bigg [ \int _{\Omega } {\varvec{B}}_{j,x} ({{\textbf{r}}}) (\rho _{{{\varvec{\gamma }}}_j}^{\uparrow \downarrow }({{\textbf{r}}})+\rho _{{{\varvec{\gamma }}}_j}^{\downarrow \uparrow }({{\textbf{r}}})) \ d {{\textbf{r}}}\\{} & {} \qquad + \int _{\Omega } i {\varvec{B}}_{j,y}({{\textbf{r}}}) (\rho _{{{\varvec{\gamma }}}_j}^{\downarrow \uparrow }({{\textbf{r}}})-\rho _{{{\varvec{\gamma }}}_j}^{\uparrow \downarrow }({{\textbf{r}}})) \ d {{\textbf{r}}}+ \int _{\Omega } {\varvec{B}}_{j,z}({{\textbf{r}}}) (\rho _{{{\varvec{\gamma }}}_j}^{\uparrow \uparrow }({{\textbf{r}}})-\rho _{{{\varvec{\gamma }}}_j}^{\downarrow \downarrow }({{\textbf{r}}})) \ d {{\textbf{r}}}\bigg ] \\{} & {} \quad = \lim _{j \rightarrow \infty } \mu \bigg [ \int _{\Omega } {\varvec{B}}_{j,x}({{\textbf{r}}}) (\rho _{{{\varvec{\gamma }}}}^{\uparrow \downarrow }({{\textbf{r}}}){+}\rho _{{{\varvec{\gamma }}}}^{\downarrow \uparrow }({{\textbf{r}}})) \ d {{\textbf{r}}}{+} \int _{\Omega } i {\varvec{B}}_{j,y}({{\textbf{r}}}) (\rho _{{{\varvec{\gamma }}}}^{\downarrow \uparrow }({{\textbf{r}}}){-}\rho _{{{\varvec{\gamma }}}}^{\uparrow \downarrow }({{\textbf{r}}})) \ d {{\textbf{r}}}\\{} & {} \qquad + \int _{\Omega } {\varvec{B}}_{j,z}({{\textbf{r}}}) (\rho _{{{\varvec{\gamma }}}}^{\uparrow \uparrow }({{\textbf{r}}})-\rho _{{{\varvec{\gamma }}}}^{\downarrow \downarrow }({{\textbf{r}}})) \ d {{\textbf{r}}}\bigg ] +0 = {\text {Tr}}\,({\mathfrak {B}}{{\varvec{\gamma }}}). \end{aligned}$$

Hence, $\lim _{j\rightarrow \infty } {\text {Tr}}\,( {\varvec{H}}^j (u_j,v_{j},\phi _j, {\varvec{B}}_j ) {{\varvec{\gamma }}}_j) \ge {\text {Tr}}\,({\varvec{H}}(u, v,\phi ,{\varvec{B}}) {{\varvec{\gamma }}})$. This coincides with (5.41) for the case considered here, completing this case.

2. Let ${{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}_N^{{\varvec{H}}(u,v,\phi ,{\varvec{B}}_j)}$ and $({{\varvec{\gamma }}}_j)_{j \in {{\mathbb {N}}}} \subset {\varvec{{{\mathcal {C}}}}}$ be a sequence such that there exists an index $j_2 \in {{\mathbb {N}}}$ so that ${{\varvec{\gamma }}}_j \not \in {\varvec{{{\mathcal {C}}}}}_{N, k_j}^{{\varvec{H}}^j (u_j, v_{j}, \phi _j,{\varvec{B}}_j)}$ for all $j \ge j_2$. Then in this case, (5.41) holds because as $j \rightarrow \infty $, ${\varvec{\gamma }}_j$ will not be in the space ${\varvec{{{\mathcal {C}}}}}_{N,k_j}^{{\varvec{H}}^j(u_j, v_{j}, \phi _j,{\varvec{B}}_j)}$, and therefore

$$\begin{aligned} \lim _{j \rightarrow \infty } \inf \left\{ {\text {Tr}}\,( {\varvec{H}}^j (u_j,v_{j}, \phi _j,{\varvec{B}}_j) {\varvec{\gamma }}_j) + I_{{\varvec{{{\mathcal {C}}}}}_{N,k_j}^{{\varvec{H}}^j(u_j, v_{j}, \phi _j,{\varvec{B}}_j)}} ({\varvec{\gamma }}_j) \right\} = + \infty . \end{aligned}$$

(5.42)

3. Let ${\varvec{\gamma }}\not \in {\varvec{{{\mathcal {C}}}}}_N$ and $({{\varvec{\gamma }}}_j)_{j\in {{\mathbb {N}}}} \subset {\varvec{{{\mathcal {C}}}}}$ be a sequence such that there exists an index $j_3 \in {{\mathbb {N}}}$ so that ${{\varvec{\gamma }}}_j \not \in {\varvec{{{\mathcal {C}}}}}_{N,k_j}^{{\varvec{H}}^j (u_j,v_{j}, \phi _j,{\varvec{B}}_j)}$ for all $j \ge j_3$. In this case (5.41) is satisfied trivially because both the left-hand side and the right-hand side are equal to infinity, and therefore each other.

4. Finally, we consider the case where ${{\varvec{\gamma }}} \not \in {\varvec{{{\mathcal {C}}}}}_N^{{\varvec{H}}(u,v,\phi ,{\varvec{B}})}$, and show that there does not exist a sequence ${{\varvec{\gamma }}}_j \overset{\star }{\rightharpoonup }\ {{\varvec{\gamma }}}$ and $j_4 \in {{\mathbb {N}}}$ so that ${{\varvec{\gamma }}}_j \in {\varvec{{{\mathcal {C}}}}}_{N,k_j}^{{\varvec{H}}^j (u_j,v_j,\phi _j, {\varvec{B}}_j)}$ for all $j \ge j_4$. Let $({\varvec{\xi }}_i)_{i \in {{\mathbb {N}}}} \subset W_0^{1,2} (\Omega ; \mathbb {C}^2)$ be the eigenvectors of ${\varvec{H}}(u,v,\phi ,{\varvec{B}})$. As in the proof of Theorem 5.6, these $({\varvec{\xi }}_i)_{i \in {{\mathbb {N}}}}$ form an orthonormal basis of $L^2(\Omega ;\mathbb {C}^2)$. Similarly, for each $j \in {{\mathbb {N}}}$ let $( {\varvec{\xi }}_i^j)_{i \in {{\mathbb {N}}}} \subset W_0^{1,2} (\Omega ;\mathbb {C}^2)$ be the eigenvectors of ${\varvec{H}}^j (u_j, v_j, \phi _j, {\varvec{B}}_j)$. By the spatial discretization of the Hamiltonian, we know that for every $i \in {{\mathbb {N}}}$, we have

$$\begin{aligned} \lim _{j\rightarrow \infty } \Vert {\varvec{\xi }}_i^j - {\varvec{\xi }}_i\Vert _{{L^{2}(\Omega ; \mathbb {C}^2})} =0. \end{aligned}$$

(5.43)

However, since in this case we assume that ${{\varvec{\gamma }}} \not \in {\varvec{{{\mathcal {C}}}}}_N^{{\varvec{H}}(u,v,\phi ,{\varvec{B}})}$, there must exist at least one eigenvector of ${\varvec{H}} (u, v, \phi ,{\varvec{B}})$ that is not an eigenvector of ${{\varvec{\gamma }}}$. Without loss of generality suppose ${\varvec{\xi }}_1$ is this eigenvector. Therefore, when we take the eigenvector representation of ${{\varvec{\gamma }}}$, ${{\varvec{\gamma }}} {\varvec{\xi }}_1 =\sum _{q=1}^{\infty } c_{1q} {\varvec{\xi }}_q$, there must exist some index $p \in {{\mathbb {N}}}, \ p \ne 1$, with $c_{1p} \ne 0$. By orthogonality of the eigenvectors, we have that

$$\begin{aligned} \langle {{\varvec{\gamma }}{\varvec{\xi }}_1},{ {\varvec{\xi }}_p} \rangle&=\sum _{q=1}^{\infty } \langle { c_{1q} {\varvec{\xi }}_q},{ {\varvec{\xi }}_p} \rangle c_{1p} =c_{1p}. \end{aligned}$$

However, by the weak-star convergence of ${\varvec{\gamma }}_j \rightharpoonup ^\star {\varvec{\gamma }}$, we know that $\langle {{{\varvec{\gamma }}} {\varvec{\xi }}_1},{ {\varvec{\xi }}_p} \rangle = \lim _{j \rightarrow \infty } \langle {{{\varvec{\gamma }}}_j {\varvec{\xi }}_1},{ {\varvec{\xi }}_p} \rangle .$ For a contradiction, suppose that there exists an index $j_4 \in {{\mathbb {N}}}$ so that ${{\varvec{\gamma }}}_j \in {\varvec{{{\mathcal {C}}}}}_{N,k_j}^{{\varvec{H}}^j (u_j,v_j,\phi _j, {\varvec{B}}_j )}$ for all $j \ge j_4$. It then follows that

$$\begin{aligned} c_{1p}&= \lim _{j \rightarrow \infty } \langle {{{\varvec{\gamma }}}_j {\varvec{\xi }}_1},{ {\varvec{\xi }}_p} \rangle =\lim _{j\rightarrow \infty } \langle {g_j ( {\varvec{H}}^j (u_j,v_{j}, \phi _j, {\varvec{B}}_j)) {\varvec{\xi }}_1},{ {\varvec{\xi }}_p} \rangle , \end{aligned}$$

(5.44)

where we know that the Borel functions $g_j$ in (5.44) do exist, because ${{\varvec{\gamma }}}_j \in {\varvec{{{\mathcal {C}}}}}_{N,k_j}^{{\varvec{H}}^j (u_j,v_{j},\phi _j,{\varvec{B}}_j)}$ for large j. Therefore, for an arbitrary $p \ne 1$, we have that

$$\begin{aligned} c_{1,p}&= \lim _{j \rightarrow \infty } \langle {g_j ( {\varvec{H}}^j (u_j,v_j,\phi _j, {\varvec{B}}_j)) {\varvec{\xi }}_1},{ {\varvec{\xi }}_p} \rangle \\ {}&= \lim _{j\rightarrow \infty } \langle {g_j( {\varvec{H}}^j (u_j,v_j,\phi _j, {\varvec{B}}_j))\left( {\varvec{\xi }}_1 - {\varvec{\xi }}_1^j + {\varvec{\xi }}_1^j\right) },{ {\varvec{\xi }}_p} \rangle \\&= \lim _{j\rightarrow \infty } \langle {g_j( {\varvec{H}}^j (u_j,v_j,\phi _j,{\varvec{B}}_j))},{( {\varvec{\xi }}_1 - {\varvec{\xi }}_1^j) {\varvec{\xi }}_p} \rangle \\ {}&\quad + \lim _{j\rightarrow \infty } \langle {g_j( {\varvec{H}}^j (u_j,v_j,\phi _j,{\varvec{B}}_j )) {\varvec{\xi }}_1^j},{ {\varvec{\xi }}_p} \rangle \\&= \lim _{j \rightarrow \infty } g_j (\lambda _1^j) \langle { {\varvec{\xi }}_1^j},{ {\varvec{\xi }}_p} \rangle =0, \end{aligned}$$

where the final line comes from the orthogonality of the ${\varvec{\xi }}_j$, and in the penultimate line we used equations (5.43) and (5.44). Therefore we arrive at the contraction that $c_{1,p}$ is both zero and non-zero. Therefore, if ${{\varvec{\gamma }}} \not \in {\varvec{{{\mathcal {C}}}}}_N^{{\varvec{H}}(u,v,\phi ,{\varvec{B}})}$, we cannot have $({{\varvec{\gamma }}}_j)_{j \in {{\mathbb {N}}}}$ with ${{\varvec{\gamma }}}_j \in {\varvec{{{\mathcal {C}}}}}_{N,k_j}^{{\varvec{H}}^i(u_j, v_j,\phi _j, {\varvec{B}}_j)}$. Therefore, this final case cannot occur, and inequality (5.41) holds trivially. $\square $

Lemma 5.9

Let $u_j \rightarrow u$ and $v_j \rightarrow v$ in $L^4 (\Omega )$, ${\varvec{B}}_{j} \rightarrow {\varvec{B}}$ in $L^2(\Omega )^3$, $\phi _j \rightarrow \phi $ in $W_0^{1,2} (\Omega )$ for $j \rightarrow \infty $ be four given sequences. Then there exists a recovery sequence $({{\varvec{\gamma }}}_j)_{j \in {{\mathbb {N}}}} \subset {\varvec{{{\mathcal {C}}}}}, \ {{\varvec{\gamma }}}_j \overset{\star }{\rightarrow }\ {{\varvec{\gamma }}}$, such that

$$\begin{aligned} \lim _{j \rightarrow \infty } \sup {\text {Tr}}\,( {\varvec{H}}^j (u_j, v_{j},\phi _j,{\varvec{B}}_j) {{\varvec{\gamma }}}_j ) + I_{{\varvec{{{\mathcal {C}}}}}_{N,k_j}^{{\varvec{H}}^i(u_j,v_{j},\phi _j,{\varvec{B}}_j)}} ({{\varvec{\gamma }}}_j)\le & {} E_\mathrm{{band}} (u,v, \phi ,{\varvec{B}},{{\varvec{\gamma }}}) \nonumber \\ {}{} & {} + I_{{\varvec{{{\mathcal {C}}}}}_{N}^{{\varvec{H}}(u,v,\phi ,{\varvec{B}})}} \end{aligned}$$

(5.45)

and, consequently, in view of Lemma 5.8

$$\begin{aligned}&\left( {{\varvec{\gamma }}}_j\mapsto {\text {Tr}}\,( {\varvec{H}}^j (u_j,v_{j},\phi _j{\varvec{B}}_j){{\varvec{\gamma }}}_j) + I_{{\varvec{{{\mathcal {C}}}}}_{N,k_j}^{{\varvec{H}}^j (u_j,v_{j},\phi _j,{\varvec{B}}_j)}} ({{\varvec{\gamma }}}_j) \right) \nonumber \\&\overset{\Gamma }{\rightarrow }\ \left( {\varvec{\gamma }} \mapsto E_\mathrm{{band}} (u,v,\phi ,{\varvec{B}}, {\varvec{\gamma }}) + I_{{\varvec{{{\mathcal {C}}}}}_{N}^{{\varvec{H}} (u,v,\phi ,{\varvec{B}})}} ( {\varvec{\gamma }}) \right) \end{aligned}$$

(5.46)

with respect to the $\text{ weak}^{\star }$-topology in ${\varvec{{{\mathcal {C}}}}}$ in the limit $j \rightarrow \infty $.

We omit the proof of Lemma 5.9 because, with obvious modifications, it is similar to the proof of [35, Lemma 4.1] (Details are provided in [29]). However, we provide an outline:

Proof

(Outline of the proof of Lemma 5.9) We consider two disjoint cases. The first case, where we assume ${{\varvec{\gamma }}} \not \in {\varvec{{{\mathcal {C}}}}}_N^{{\varvec{H}} (u,v,\phi ,{\varvec{B}})}$ is standard and we omit it here. For the second case, we suppose that ${{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}_N^{{\varvec{H}} (u,v,\phi ,{\varvec{B}})}$. Then, ${{\varvec{\gamma }}}=\sum _{i=1}^{\infty } \alpha _i \, | { {\varvec{\xi }}_i} \rangle \langle { {\varvec{\xi }}_i}| \, =\sum _{i=1}^{\infty } \alpha _i {\varvec{\xi }}_i \otimes {\varvec{\xi }}_i$.

As in Lemma 5.8, $( {\varvec{\xi }}_i)_{i \in {{\mathbb {N}}}}$ are the eigenvectors of ${\varvec{H}} (u,v,\phi ,{\varvec{B}})$. For each $j \in {{\mathbb {N}}}$, let $( {\varvec{\xi }}_i^j)_{i \in {{\mathbb {N}}}}$ be the corresponding eigenvectors of ${\varvec{H}}^j (u_j,v_{j}, \phi _j,{\varvec{B}}_j)$. The idea is to construct a recovery sequence in three steps. For the first step, we construct a sequence of finite-rank operators $({{\varvec{\gamma }}}_j)_{j \in {{\mathbb {N}}}}$ from the eigenvectors of ${\varvec{H}}^j (u_j,v_{j}, \phi _j,{\varvec{B}}_j)$. In the second step, we construct for each ${{\varvec{\gamma }}}_j$ a sequence $({{\varvec{\gamma }}}_{k,j})_{k \in {{\mathbb {N}}}}$ by the spectral binning approximation. For the third and final step, we choose a diagonal sequence $({{\varvec{\gamma }}}_{k_j,j})_{j \in {{\mathbb {N}}}}$, which will yield the recovery sequence for this case. $\square $

Now that we have the $\Gamma $-convergence result, we can show that we have convergence at the first level of approximation. However, also we need the following lemma.

Lemma 5.10

For every $\phi _j \rightharpoonup \phi $ in $W_0^{1,2}(\Omega )$, ${\varvec{B}}_j \rightharpoonup {\varvec{B}}$ in $L^2(\Omega )^3$ and every $u_j \rightharpoonup u$, $v_j \rightharpoonup v $ in $L^4(\Omega )$, equi-coercivity with respect to the weak-$\star $ topology in ${\varvec{{{\mathcal {C}}}}}$ holds for the family

$$\begin{aligned} \left\{ {\text {Tr}}\,( {\varvec{H}}^j(u_j, v_{j}, \phi _j, {\varvec{B}}_j){{\varvec{\gamma }}}) + I_{{\varvec{{{\mathcal {C}}}}}_{N, k_j}^{{\varvec{H}}^j(u_j, v_{j}, \phi _j,{\varvec{B}}_j)}} ({{\varvec{\gamma }}}) \right\} _{j \in {{\mathbb {N}}}}. \end{aligned}$$

Proof

We give an outline of the proof here, for the detailed proof we refer to [29]. Similar to the proof of lower semi-continuity in Lemma 5.4, we show that

$$\begin{aligned} {\text {Tr}}\,( {\varvec{H}}^j(u_j, v_{j}, \phi _j,{\varvec{B}}_j){{\varvec{\gamma }}}) \ge \frac{1}{2} {\text {Tr}}\,( - \Delta {{\varvec{\gamma }}}) - C_{12} \Vert \nabla \sqrt{\rho _{{\varvec{\gamma }}}}\Vert _{L^2(\Omega )}^{\frac{3}{2}} \end{aligned}$$

(5.47)

where

$$\begin{aligned} C_{12}= & {} C_{11} \sup _{j \in {{\mathbb {N}}}} \Big \{ \Vert \phi _j\Vert _{L^2(\Omega )} + \Vert u_j\Vert _{L^2(\Omega )} +\Vert v_j\Vert _{L^2(\Omega )} + \Vert {{\textbf{B}}}_{j,x}\Vert _{L^2(\Omega )} \\ {}{} & {} + \Vert {{\textbf{B}}}_{j,y}\Vert _{L^2(\Omega )}+ \Vert {{\textbf{B}}}_{j,z}\Vert _{L^2(\Omega )} \Big \} N^{\frac{1}{4}}, \end{aligned}$$

and the constant $C_{11}$ results from the Gagliardo–Nirenberg–Sobolev inequality. Note that $C_{12}$ is well defined and bounded, because weakly convergent series are bounded. Since ${\text {Tr}}\,(-\Delta {{\varvec{\gamma }}}) \ge \Vert \nabla \sqrt{\rho _{{\varvec{\gamma }}}}\Vert _{L^2(\Omega )}^2$, the kinetic energy is the dominating $\gamma $ term in the inequality. Therefore, for any $t \in {\mathbb {R}}$ the level sets

$$\begin{aligned} \left\{ {{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}: {\text {Tr}}\,\left( {\varvec{H}}^j(u_j, v_{j}, \phi _j, {\varvec{B}}_j){{\varvec{\gamma }}} \right) + I_{{\varvec{{{\mathcal {C}}}}}_{N, k_j}^{{\varvec{H}}^j(u_j, v_{j} \phi _j,{\varvec{B}}_j)}} ({{\varvec{\gamma }}}) \le t \right\} \end{aligned}$$

(5.48)

are bounded, indeed, $t \ge (1/2) \Vert {{\varvec{\gamma }}}\Vert _{{\varvec{{{\mathcal {C}}}}}} - C_{12} \Vert \nabla \sqrt{\rho _{{\varvec{\gamma }}}}\Vert _{L^2(\Omega )}^{\frac{3}{2}} - (N/2)$, where we used that $\left\Vert \cdot \right\Vert _{{\varvec{{{\mathcal {C}}}}}} \ge {\text {Tr}}\,(-\Delta \cdot )$. By the results in [6, Prop 7.7] this shows that for every j and $k_j$, the level sets (5.48) are precompact and hence equi-coercive. $\square $

We now show that we have convergence in the first level, that is to say, spatial discretisation.

Lemma 5.11

If $\phi _j \rightharpoonup \phi $ in $W_0^{1,2}(\Omega )$, ${\varvec{B}}_j \rightharpoonup {\varvec{B}}$ in $L^2(\Omega )^3$, and $u_j \rightharpoonup u, v_j \rightharpoonup v$ in $L^4(\Omega )$ then

$$\begin{aligned}&\lim \limits _{j \rightarrow \infty }\inf \limits _{{{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}} \left\{ {\text {Tr}}\,\left( {\varvec{H}}^j(u_j, v_{j},\phi _j, {\varvec{B}}_j){{\varvec{\gamma }}} \right) + I_{{\varvec{{{\mathcal {C}}}}}_{N, k_j}^{{\varvec{H}}^j(u_j, v_{j} , \phi _j, {\varvec{B}}_j)}}({{\varvec{\gamma }}}) \right\} \\&= \inf _{{{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}} \left\{ E_\mathrm{{band}}(u,v, \phi ,{\varvec{B}}, {{\varvec{\gamma }}}) + I_{{\varvec{{{\mathcal {C}}}}}_N^{{\varvec{H}}(u,v,\phi ,{\varvec{B}})}}({{\varvec{\gamma }}}) \right\} . \end{aligned}$$

Proof

This comes directly applying [6, Thm 7.8], because we have shown equi-coercivity in Lemma 5.10 and the existence of a recovery sequence in Lemma 5.9. $\square $

5.6.2 Convergence of spectral binning operator $E_\mathrm{{band}}^{j,k_j}$

Next we establish $\Gamma $-convergence of $E_\mathrm{{band}}^{j,k_j}$ with approximation of the trace operator. In the previous section we showed convergence of the exact trace operator. We will now record the convergence result with the spectral binning operator $E_\mathrm{{band}}^{j,k_j}$, which was defined in (3.40).

Lemma 5.12

Let $u_j \rightharpoonup u$ and $v_j \rightharpoonup v$ in $L^4(\Omega )$, ${\varvec{B}}_j \rightharpoonup {\varvec{B}}$ in $L^2(\Omega )^3$, $\phi _j \rightharpoonup \phi $ in $W_0^{1,2}(\Omega )$ as $j \rightarrow \infty $ and ${{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}_{N, k_j}^{{\varvec{H}}^j(u_j, v_{j}, \phi _j)}$ for all $j \in {\mathbb {N}}$. Then

$$\begin{aligned} \lim \limits _{j \rightarrow \infty } \left| {\widetilde{{\text {Tr}}\,}}_{k_{j}} ( {\varvec{H}}^j (u_{j},v_{j}, \phi _{j},{\varvec{B}}_j) {{\varvec{\gamma }}}_{j, k_j}) - {\text {Tr}}\,( {\varvec{H}}^j (u_{j}, v_{j}, \phi _{j}, {\varvec{B}}_j) {{\varvec{\gamma }}}_{j,k_j}) \right| = 0.\nonumber \\ \end{aligned}$$

(5.49)

With obvious modifications, its proof is similar to the proof of [35, Lemma 4.5]; the proof is written out in [29].

Having the convergence of ${\widetilde{{\text {Tr}}\,}} (\cdot )$ to ${\text {Tr}}\,(\cdot )$ under control, we easily establish the announced $\Gamma $-convergence result. Utilizing Lemma 5.9 and Lemma 5.12, and using the same recovery sequence as the one constructed in Lemma 5.9, the proof is straightforward and thus we omit it here; details are provided in [29].

Lemma 5.13

For every $\phi _j \rightharpoonup \phi $ in $W_0^{1,2}(\Omega )$ every $u_j \rightharpoonup u, v_j \rightharpoonup v $ in $L^4(\Omega )$, ${\varvec{B}}_j \rightharpoonup {\varvec{B}}$ in $L^2(\Omega )^3$, and any $({{\varvec{\gamma }}}_{j} )_{j}$ in ${\varvec{{{\mathcal {C}}}}}$ with $ {{\varvec{\gamma }}}_{j} \overset{\star }{ \rightharpoonup }\ {{\varvec{\gamma }}}$, as $j \rightarrow \infty $, one has

The next result for ${\widetilde{{\text {Tr}}\,}}(\cdot )$ is analogous to Lemma 5.10.

Lemma 5.14

If $u_j \rightharpoonup u, v_j \rightharpoonup v$ in $L^4(\Omega )$, ${\varvec{B}}_j \rightharpoonup {\varvec{B}}$ in $L^2(\Omega )^3$, and $\phi _j \rightharpoonup \phi $ in $W_0^{1,2}(\Omega )$, then for every ${{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}$, equi-coercivity on ${\varvec{{{\mathcal {C}}}}}$ holds for the family of functionals

$$\begin{aligned} \left\{ {\widetilde{{\text {Tr}}\,}}_{k_{j}} \left( {\varvec{H}}^j(u_j,v_{j}, \phi _j, {\varvec{B}}_j) {{\varvec{\gamma }}} \right) + I_{{\varvec{{{\mathcal {C}}}}}_{N, k_j}^{{\varvec{H}}^j(u_j, v_{j}, \phi _j, {\varvec{B}}_j)}} ({{\varvec{\gamma }}}) \right\} . \end{aligned}$$

We use Lemma 5.10, especially (5.47), Lemma 5.12, in conjunction with standard arguments found in [6] to prove the latter result; the proof is similar to [35, Lemma 4.7]. Details are found in [29].

We now obtain the next level of convergence from Lemma 5.10 and Lemma 5.12:

Lemma 5.15

If $\phi _j \rightharpoonup \phi $ in $W_0^{1,2}(\Omega )$, ${\varvec{B}}_j \rightharpoonup {\varvec{B}}$ in $L^2(\Omega )^3$, $v_j \rightharpoonup u$ in $L^4(\Omega )$ and $u_j \rightharpoonup u$ in $L^4(\Omega )$, then

$$\begin{aligned}{} & {} \lim \limits _{j \rightarrow \infty } \inf _{{{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}} \left\{ {\widetilde{{\text {Tr}}\,}}\left( {\varvec{H}}^j(u_j,v_j, \phi _j, {\varvec{B}}_j){{\varvec{\gamma }}} \right) + I_{{\varvec{{{\mathcal {C}}}}}_{N, k_j}^{{\varvec{H}}^j(u_j, v_j,\phi _j,{\varvec{B}}_j)}} \right\} \nonumber \\{} & {} \quad = \inf _{{{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}} \left\{ {\text {Tr}}\,\left( {\varvec{H}}(u,v,\phi ,{\varvec{B}}){{\varvec{\gamma }}} \right) + I_{{\varvec{{{\mathcal {C}}}}}_N^{{\varvec{H}}(u,v,\phi ,{\varvec{B}})}} ({{\varvec{\gamma }}})\right\} . \end{aligned}$$

(5.50)

5.6.3 Convergence of $S_3^{j,k_j}$.

Next we show $\Gamma $-convergence of $S_3^{j,k_j}$ from (4.13).

Lemma 5.16

If $u_j \rightharpoonup u, v_j \rightharpoonup v$ in $L^4(\Omega )$ then, for $j \rightarrow \infty $, with respect to $\phi $ in the weak topology of $W_0^{1,2}(\Omega )$.

Proof

From Lemma 5.15 for every $u,v \in L^4(\Omega )$ and all $u_j \rightharpoonup u,$ $v_j \rightharpoonup v$ in $L^4(\Omega )$, ${\varvec{B}}_j \rightharpoonup {\varvec{B}}$ in $L^2(\Omega )^3$, and $\phi \in W_0^{1,2}(\Omega )$,

$$\begin{aligned}{} & {} \lim \limits _{j\rightarrow \infty } \inf _{{{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}} \left\{ {E_\mathrm{{band}}}_{j,k_j}(u_j,v_j, \phi ,{\varvec{B}}_j, {{\varvec{\gamma }}}) + I_{{\varvec{{{\mathcal {C}}}}}_{N, k_j}^{{\varvec{H}}^j(u_j, v_j, \phi , {\varvec{B}}_j)}}({{\varvec{\gamma }}}) \right\} \nonumber \\{} & {} \quad = \inf _{{{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}} \left\{ E_\mathrm{{band}}(u, v,\phi , {\varvec{B}}, {{\varvec{\gamma }}}) + I_{{\varvec{{{\mathcal {C}}}}}_N^{{\varvec{H}} (u,v,\phi ,{\varvec{B}})}}({{\varvec{\gamma }}}) \right\} , \end{aligned}$$

(5.51)

which covers the ${{\varvec{\gamma }}}$ term in $S_3$; the dependency on ${{\varvec{\gamma }}}$ is given explicitly in (4.6). It is straightforward to treat the remaining terms, whence we omit the details; see [29] for details. $\square $

We now show that these functionals are equi-coercive.

Lemma 5.17

If $u_j \rightharpoonup u, v_j \rightharpoonup v$ in $L^4(\Omega )$ then the family of functionals $ \{ \phi \mapsto -S_3^{j,k_j}(u_j, v_j, \phi ) \}_{j \in {\mathbb {N}}}$ is equi-coersive with respect to the weak topology in $W_0^{1,2}(\Omega )$.

Proof

By Hölder’s inequality, we have that

$$\begin{aligned} - S_3^{j,k_j}(u_j,v_j, \phi )= & {} \int _{\Omega } \left( C_s |\nabla \phi ({{\textbf{r}}})|^2 - {\mathfrak {f}}({{\textbf{r}}}, \{ {\overline{R}}_1, \ldots , {\overline{R}}_M \})\phi ({{\textbf{r}}})\right) \, d{{\textbf{r}}}\nonumber \\{} & {} - \inf _{{{\varvec{\gamma }}} \in {\varvec{{{\mathcal {C}}}}}} \left\{ {\widetilde{{\text {Tr}}\,}}\left( {\varvec{H}}^j(u_j,v_j, \phi , {\varvec{B}}_j){{\varvec{\gamma }}} \right) + I_{{\varvec{{{\mathcal {C}}}}}_{N, k_j}^{{\varvec{H}}^j(u_j,v_j,\phi , {\varvec{B}}_j)}} \right\} \nonumber \\ {}{} & {} + I_{W_0^{1,2}(\Omega )_j}(\phi ) \nonumber \\\ge & {} C_s \Vert \nabla \phi \Vert _{L^2(\Omega )}^2 - \Vert {\mathfrak {f}}({{\textbf{r}}}, \underline{{\varvec{R}}}) \Vert _{L^2(\Omega )} \cdot \Vert \phi \Vert _{L^2(\Omega )} \nonumber \\ {}{} & {} - {\text {Tr}}\,( {\varvec{H}}(u,v, \phi , {\varvec{B}})\tilde{{{\varvec{\gamma }}}}) + \varepsilon _{k_j}, \end{aligned}$$

(5.52)

where we introduce the term $\varepsilon _{k_j}$, given by ${\widetilde{{\text {Tr}}\,}}\left( {\varvec{H}}^j(u_j,v_j, \phi ,{\varvec{B}}_j)\tilde{{{\varvec{\gamma }}}_j} \right) = {\text {Tr}}\,{\varvec{H}}^j (u_j,v_j,\phi ,{\varvec{B}}_j) \tilde{{{\varvec{\gamma }}}}_j - \varepsilon _{k_j}$. Due to Lemma 5.15, $\varepsilon _{k_j} \rightarrow 0$ as $ j \rightarrow \infty $. In (5.52) we also introduced $\tilde{{{\varvec{\gamma }}}}_j \in {\varvec{{{\mathcal {C}}}}}_{N, k_j}^{{\varvec{H}}^j(u_j,v_j, \phi ,{\varvec{B}}_j)}$ and $\tilde{{{\varvec{\gamma }}}} \in {\varvec{{{\mathcal {C}}}}}_{N, k}^{{\varvec{H}}(u,v, \phi ,{\varvec{B}})}$ as the minimizers of the left and right sides of (5.50), respectively.

By expanding the ${\text {Tr}}\,( {\varvec{H}} \tilde{{{\varvec{\gamma }}}})$ as above, it follows that

$$\begin{aligned} -S_3^{j,k_j} \ge C_{13}\Vert \phi \Vert _{L^2(\Omega )}^2 - C_{14}\Vert \phi \Vert _{L^2(\Omega )} + C_{15}, \end{aligned}$$

(5.53)

with a constant $C_{13} > 0$ originating from Poincaré’s inequality, and with further constants $C_{14}:= \Vert {\mathfrak {f}}({{\textbf{r}}}, \underline{{\varvec{R}}}) \Vert _{L^2(\Omega )} + \sup _{j \in {\mathbb {N}}} \Vert \rho _{{\tilde{{{\varvec{\gamma }}}}}_j}\Vert _{L^2(\Omega )}$,

$$\begin{aligned} C_{15}:= & {} \sup _{j \in {\mathbb {N}}} \left\{ - \left( \Vert u_j\Vert _{L^2(\Omega )} +\Vert v_j\Vert _{L^2(\Omega )} + \Vert {{\textbf{B}}}_{z,j} \Vert _{L^2(\Omega )} \right) \Vert \rho _{{\tilde{{{\varvec{\gamma }}}}}_j}\Vert _{L^2(\Omega )} \right. \\{} & {} + \left. \frac{1}{2}{\text {Tr}}\,(-\Delta \tilde{{{\varvec{\gamma }}}}_j) + \varepsilon _{k_j} \right\} . \end{aligned}$$

Inequality (5.53) shows that $-S_3^{j,k_j}(u_j,v_j, \phi )$ is an equi-coercive functional, and similar to Lemma 5.10, $-S_3^{j,k_j}(u_j,v_j, \phi )$ is equi-coercive to with respect to the weak topology in $W_0^{1,2}(\Omega )$. $\square $

To conclude this section, we establish convergence for this level of our model by using standard arguments [6] in conjunction with Lemmas 5.16 and 5.17.

Lemma 5.18

If $u_j \rightharpoonup u$ and $v_j \rightharpoonup v$ in $L^4(\Omega )$ then

$$\begin{aligned} \lim _{j \rightarrow \infty }\sup _{\phi \in W_0^{1,2}(\Omega )} S_3^{j,k_j}(u_j,v_j, \phi ) = \sup _{\phi \in W_0^{1,2}(\Omega )} S_3(u,v, \phi ). \end{aligned}$$

(5.54)

5.6.4 Convergence of $S_{i}^{j,k_j}$, $i=1, 2$

We proceed to establishing $\Gamma $-convergence of the operators $S_{i}^{j,k_j}$, $i=1, 2$. We need the following lemma for this purpose.

Lemma 5.19

$$\begin{aligned} \lim \limits _{j \rightarrow \infty } {\mathfrak {G}}_\mathrm{{xc}}^*(u_j) = {\mathfrak {G}}_\mathrm{{xc}}^*(u) \end{aligned}$$

(5.55)

Proof

From the definition of ${\mathfrak {G}}_\mathrm{{xc}}^*$, the sequence $\left( {\mathfrak {G}}_\mathrm{{xc}}^*(u_j) \right) $ is bounded above. The reverse Fatou Lemma, together with $u_j$ being the projections onto the space $L^4(\Omega )_j$, then yields

$$\begin{aligned} \limsup \limits _{j \rightarrow \infty } {\mathfrak {G}}_\mathrm{{xc}}^*(u_j)= \limsup \limits _{j \rightarrow \infty } \int _\Omega h^*( u_j ({{\textbf{r}}}) ) \, d {{\textbf{r}}}= {\mathfrak {G}}_\mathrm{{xc}}^*(u). \end{aligned}$$

Finally, we know that ${\mathfrak {G}}_\mathrm{{xc}}^*$ is lower semi-continuous, and therefore we conclude (5.55). $\square $

Establishing $\Gamma $-convergence of $S_{i}^{j,k_j}$ now follows easily if we apply Lemma 5.18, in conjunction with weakly lower semi-continuity of ${\mathfrak {G}}_\mathrm{{xc}}^*(u)$, together with Lemma 5.19.

Lemma 5.20

For $v_j \rightharpoonup v$, the family $\{ S_2^{j,k_j}(u,v) \}_{j\in {\mathbb {N}}}$ converges in the $\Gamma $-sense, i.e, for $j \rightarrow \infty $, with respect to the weak topology in $L^4(\Omega )$.

For the next result, we provide details of the proof.

Lemma 5.21

For $v_j \rightharpoonup v$ in $L^4(\Omega )$, the family $\{ S_2^{j,k_j}(u,v) \}_{j \in {\mathbb {N}}}$ is equi-coersive with respect to the weak topology in $L^4(\Omega )$.

Proof

In view of Lemma 5.17 and the bounds from (5.18) in Lemma 5.3, there exist real constants $C_{16} > 0$ and $C_{17}$ such that

$$\begin{aligned} {\mathfrak {G}}_\mathrm{{xc}}^*(u) \ge C_{16} \Vert u\Vert _{L^4(\Omega )}^4 - C_{17}|\Omega |. \end{aligned}$$

(5.56)

The estimate (5.56) implies natural bounds from below on the functional $S_2^{j,k_j}$,

$$\begin{aligned} S_2^{j,k_j}(u,v) \ge {\mathfrak {G}}_\mathrm{{xc}}^*(u) + N \lambda _{l}(u,v, {\tilde{\phi }}) \ge {\mathfrak {G}}_\mathrm{{xc}}^*(u) + N \left( \lambda _1^{{\varvec{H}}^j(u,v, {\tilde{\phi }},{\varvec{B}})} + C_j \right) , \nonumber \\ \end{aligned}$$

(5.57)

where ${\tilde{\phi }} = 0$ is a test function in $W_0^{1,2}(\Omega )$, $\lambda _{l}$ denotes the lower bound of the binning interval $[\lambda _{l}, \lambda _{r}]$ for ${\varvec{H}}^j(u,v,{\tilde{\phi }},{\varvec{B}})$, and $\lambda _1^{{\varvec{H}}^j(u,v, {\tilde{\phi }},{\varvec{B}})}$ denotes the lowest eigenvalue of ${\varvec{H}}^j(u,v,{\tilde{\phi }},{\varvec{B}})$. Let $\lambda _{l} = \lambda _1^{{\varvec{H}}^j(u,v,{\tilde{\phi }},{\varvec{B}})} + C_j$. We know that $\sup _j |C_j|$ is uniformly bounded, because $\lambda _{l}$ is only a functional of ${\tilde{\phi }}$, u and v, and independent of spatial discretization. If ${\varvec{\xi }}_1^{{\varvec{H}}^j(u,v, {\tilde{\phi }},{\varvec{B}})}$ denotes the normalized eigenvector of ${\varvec{H}}^j(u,v, {\tilde{\phi }}, {\varvec{B}})$ we can derive a lower bound of $\lambda _1^{{\varvec{H}}^j(u,v,{\tilde{\phi }},{\varvec{B}})}$ by the ellipticity of the underlying variational problem

$$\begin{aligned} \lambda _1^{{\varvec{H}}^j(u,v, {\tilde{\phi }},{\varvec{B}})} = \langle { {\varvec{H}}^j(u,v,{\tilde{\phi }}, {\varvec{B}}) {\varvec{\xi }}_1^{{\varvec{H}}^j(u,v, {\tilde{\phi }}, {\varvec{B}})}},{ {\varvec{\xi }}_1^{{\varvec{H}}^j(u,v,{\tilde{\phi }},{\varvec{B}})}} \rangle \ge - \Vert u\Vert _{L^2(\Omega )}. \nonumber \\ \end{aligned}$$

(5.58)

Using (5.56)–(5.58), we bound $T^{j,k_j}(u)$ below by a coersive functional which is independent of j and $k_j$:

$$\begin{aligned} S_2^{j,k_j}(u,v) \ge {\mathfrak {G}}_\mathrm{{xc}}^*(u) - N \Vert u\Vert _{L^2(\Omega )} \ge C_{16}\Vert u\Vert _{L^4(\Omega )}^{4} - N \Vert u\Vert _{L^4(\Omega )}^2. \end{aligned}$$

In the limit $\Vert u\Vert _{L^4(\Omega )} \rightarrow \infty $ the term $C_{16}\Vert u\Vert _{L^4(\Omega )}^4$ dominates, hence we have $T^{j,k_j} \rightarrow \infty $. $\square $

Lemma 5.22

If $v_j \rightharpoonup v$ in $L^4(\Omega )$, then $\lim _{j \rightarrow \infty } \inf _{v \in L^4(\Omega ) }S_2^{j,k}(u,v_j) = \inf _{v \in L^4(\Omega ) }S_2(u,v)$.

Proof

We apply [6] using Lemmas 5.19 and 5.21. $\square $

We now state and prove the analogous results for $S_1$.

Lemma 5.23

The family of functionals $\{ S_1^{j,k_j}(v) \}_{j\in {\mathbb {N}}}$ converges in the $\Gamma $-sense, i.e, for $j \rightarrow \infty $, with respect to the weak topology in $L^4(\Omega )$.

Proof

We first consider the $\liminf $ condition. We recall from (4.4) that $ S_1(v) = C_\mathrm{{self}} +{\mathfrak {G}}_\mathrm{{xc}}^*(v) + \inf _{v} S_2(u,v)$. The $\liminf $ condition holds for $\inf _{v} S_2(u,v)$ by Lemma 5.22, and holds for ${\mathfrak {G}}_\mathrm{{xc}}^*(v)$ by the same arguments as in the proof of Lemma 5.21.

The $\limsup $ condition follows for the same reasons too. $\square $

Our second result follows in a similar fashion.

Lemma 5.24

The family of functionals $\{ S_1^{j,k_j}(v) \}_{j \in {\mathbb {N}}}$ is equi-coersive with respect to the weak topology in $L^4(\Omega )$.

Proof

The lower bound (5.57) on $S_2^{j,k_j}$ implies

$$\begin{aligned}{} & {} S_1^{j,k_j}(v) \ge C_\mathrm{{self}} +{\mathfrak {G}}_\mathrm{{xc}}^*(v) + \inf _{u \in L^4(\Omega )} \left\{ {\mathfrak {G}}_\mathrm{{xc}}^*(u) + N \left( \lambda _1^{{\varvec{H}}^j(u,v, {\tilde{\phi }},{\varvec{B}})} + C_j \right) \right. \nonumber \\{} & {} \left. \quad +I_{L^4(\Omega )_j}(u) \right\} , \end{aligned}$$

(5.59)

The only term depending on v inside the $\inf _u$ is then $ \lambda _1^{{\varvec{H}}^j(u,v, {\tilde{\phi }},{\varvec{B}})}$, and as in (5.58), we have the lower bound $\lambda _1^{{\varvec{H}}^j(u,v, {\tilde{\phi }},{\varvec{B}})} \ge - \left\Vert v\right\Vert _{L^2(\Omega )}$. However, using (5.56) with v as the argument gives us $ {\mathfrak {G}}_\mathrm{{xc}}^*(v) \ge C_{18} \left\Vert v\right\Vert _{L^4(\Omega )}^{4} - C_{19}|\Omega |$. Therefore, the dominating term in (5.59) will be greater than $ C_{18} \left\Vert v\right\Vert _{L^4(\Omega )}^{4}$, thus giving us equi-coercivity. $\square $

5.6.5 Main result

Finally we are ready to prove the main theorem.

Proof

(Proof of Theorem 4.1) We apply [6, Thm 7.8] with Lemmas 5.23 and 5.24. $\square $

6 Conclusion

We present a mathematical analysis of DFT for modeling magnetic systems using an extended density-only formulation. No current densities enter the description in this formulation, but the particle density is split into different spin components. By restricting the exchange-correlation energy functional to be of an LSDA form (but further constrained to facilitate the analysis), we prove a series of results which enable us to establish our main result; a spectral (binning) approximation scheme. We prove the main result using the method of $\Gamma $-convergence. Auxiliary steps involves recasting the electrostatic potentials and justifying the spectral approximation by making a spectral decomposition of the Hamiltonian. In turn, we justify the spectral decomposition by “linearizing" the Hamiltonian. It is worth to note that, if we could by-pass the linearizing step of the proof, this would open up the possibility of using the general LSDA term as well as other xc functionals, such as GGAs. We leave this challenge for a future work.

Data availibility

Data sharing not applicable to this article as no datasets were generated or analysed during the current study.

Notes

This depends on the way the integrals are evaluated numerically. For other approaches, e.g., numerical tensor methods (Tensor-train decompostion and MALS) this might be different.

References

R.A. Adams, Arch. Ration. Mech. Anal. 29, 390 (1968)
Article Google Scholar
A.D. Becke, J. Chem. Phys. 140(18), 18A301 (2014)
Article ADS PubMed Google Scholar
G. Bihlmayer, in Handbook of Materials Modeling, ed. by W. Andreoni, S. Yip (Springer, Cham, 2018), p. 1
I.W. Bulik, G. Scalmani, M.J. Frisch, G.E. Scuseria, Phys. Rev. B 87, 035117 (2013)
Article ADS Google Scholar
A.J. Coleman, Rev. Mod. Phys. 35, 668 (1963)
Article ADS Google Scholar
G. Dal Maso, An Introduction to Gamma-Convergence (Birkhäuser, Basel, 1993)
Book Google Scholar
I. Ekeland, R. Témam, Convex Analysis and Variational Problems (SIAM, Berlin, 1999)
Book Google Scholar
O. Eriksson et al., Atomistic Spin Dynamics (Oxford University Press, Oxford, 2017)
Book Google Scholar
G.F. Giuliani, G. Vignale, Quantum Theory of the Electron Liquid (Cambridge University Press, Cambridge, 2005)
Book Google Scholar
D. Gontier, Phys. Rev. Lett. 111, 153001 (2013)
Article ADS PubMed Google Scholar
D. Gontier, Nonlinearity 28(1), 57 (2015)
Article ADS MathSciNet Google Scholar
S. Grimme, P.R. Schreiner, Angew. Chem. Int. Ed. 57, 4170 (2018)
Article CAS Google Scholar
P. Hohenberg, W. Kohn, Phys. Rev. 136(3B), B884 (1964)
Article Google Scholar
C.R. Jacob, M. Reiher, Int. J. Quant. Chem. 112(23), 3661 (2012)
Article CAS Google Scholar
R.O. Jones, Rev. Mod. Phys. 87(3), 897 (2015)
Article ADS Google Scholar
J. Kübler, Theory of Itinerant Electron Magnetism (Clarendon Press, Oxford, 2000)
Google Scholar
W. Kohn, L.J. Sham, Phys. Rev. A 140, 1133 (1965)
Article ADS Google Scholar
A. Laestadius, M. Benedicks, Int. J. Quant. Chem 114(12), 782–795 (2012)
Article Google Scholar
M. Levy, Proc. Natl Acad. Sci. U.S.A. 76(12), 6062 (1979)
Article ADS PubMed PubMed Central CAS Google Scholar
E.H. Lieb, Int. J. Quant. Chem. 24, 242 (1983)
Article Google Scholar
M.M. Pant, A.K. Rajagopal, Solid State Commun. 10, 1157 (1972)
Article ADS CAS Google Scholar
L. Petit, D. Paudyal, Y. Mudryk, K.A. Gschneidner, V.K. Pecharsky, M. Lüders, Z. Szotek, R. Banerjee, J.B. Staunton, Phys. Rev. Lett. 115, 207201 (2015)
Article ADS PubMed CAS Google Scholar
M. Penz, E.I. Tellgren, M.A. Csirik, M. Ruggenthaler, A. Laestadius, ACS Phys. Chem. Au 3(6), 492–511 (2023). https://doi.org/10.1021/acsphyschemau.3c00006
Article PubMed PubMed Central CAS Google Scholar
J.P. Perdew, S. Kurth, in A Primer in Density Functional Theory, ed. by C. Fiolhais, F. Nogueira, M. Marques, vol. 620. Springer Lecture Notes in Physics (Springer, Berlin, 2003), pp. 1–55
M. Ponga, K. Bhattacharya, M. Ortiz, J. Comput. Phys. 407, 109249 (2020)
Article MathSciNet CAS Google Scholar
P. Saha, M.B. Sahariah, J. Phys. D Appl. Phys. 54, 455301 (2021)
Article ADS CAS Google Scholar
P. Suryanarayana, V. Gavini, T. Blesgen, K. Bhattacharya, M. Ortiz, J. Mech. Phys. Solids 58, 256 (2010)
Article ADS MathSciNet CAS Google Scholar
J.J. Sylvester, Philos. Mag. 4, 138 (1852)
Article Google Scholar
V.J.J. Syrjanen, Rigorous mathematical results for electronic structure models. Ph.D. Dissertation, University of Sussex (2022)
E.I. Tellgren, S. Kvaal, E. Sagvolden, U. Ekström, A.M. Teale, T. Helgaker, Phys. Rev. A 86, 062506 (2012)
Article ADS Google Scholar
S.M. Valone, J. Chem. Phys. 73, 1344 (1980)
Article ADS MathSciNet Google Scholar
U. von Barth, L. Hedin, J. Phys. C 5, 1629 (1972)
Article ADS Google Scholar
P. Verma, D.G. Truhlar, Trends Chem. 2, 302 (2020)
Article CAS Google Scholar
X.C. Wang, A variational framework for spectral discretization of the density matrix in Kohn Sham density functional theory. Ph.D. Dissertation, California Institute of Technology (2015)
X.-C. Wang, T. Blesgen, K. Bhattacharya, M. Ortiz, Arch. Ration. Mech. Anal. 221, 1035 (2016)
Article MathSciNet Google Scholar
M. Zbiri, M. Johnson, H. Schober, S. Rols, N. Qureshi, S. Clarke, R. Mittal, Collection SFN 12, 77 (2011)
Article Google Scholar
R. Zeller, in Computational Nanoscience: Do It Yourself!, ed. by J. Grotendorst, S. Blügel, D. Marx (John von Neumann Institute for Computing, Jülich, NIC Series, vol. 31, 2006), pp. 419–445

Download references

Acknowledgements

The first author acknowledges financial support from the Dr Perry James Browne Research Centre on Mathematics and its Applications. The second author is grateful to the University of Sussex for the award of a Ph.D. studentship. The authors also thank the anonymous referees for their constructive feedback and in particular for pointing out a few minor errors.

Funding

Not applicable.

Author information

Authors and Affiliations

Department of Mathematics, School of Mathematical and Physical Sciences, University of Sussex, BN1 9QH, Brighton, UK
M. Melgaard & V. J. J. Syrjanen

Authors

M. Melgaard
View author publications
You can also search for this author in PubMed Google Scholar
V. J. J. Syrjanen
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

This is a joint work in all respects.

Corresponding author

Correspondence to M. Melgaard.

Ethics declarations

Conflict of interest

The authors declare that they have no competing interests.

Ethical approval

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Melgaard, M., Syrjanen, V.J.J. Spectral approximation scheme for a hybrid, spin-density Kohn–Sham density-functional theory in an external (nonuniform) magnetic field and a collinear exchange-correlation energy. J Math Chem 62, 711–760 (2024). https://doi.org/10.1007/s10910-023-01557-6

Download citation

Received: 12 June 2023
Accepted: 17 November 2023
Published: 08 January 2024
Issue Date: March 2024
DOI: https://doi.org/10.1007/s10910-023-01557-6

Keywords

Mathematics Subject Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Spectral approximation scheme for a hybrid, spin-density Kohn–Sham density-functional theory in an external (nonuniform) magnetic field and a collinear exchange-correlation energy

Abstract

Similar content being viewed by others

A Variational Framework for Spectral Approximations of Kohn–Sham Density Functional Theory

Non-analytic Spin-Density Functionals

Kohn–Sham theory in the presence of magnetic field

1 Introduction

2 Density functional theory in magnetic fields

2.1 Quantum mechanics

2.2 Spin density functional theory (SDFT)

2.2.1 Extended Kohn–Sham approach

2.2.2 Local spin density approximation

Assumption 2.1

3 Approximation scheme and methodology

3.1 Reformulation of problem on a bounded domain

3.2 Spectral theory

3.3 Spatial discretization

3.4 Spectral binning

4 Main result and strategy of its proof

4.1 Strategy

4.1.1 Limit functionals

4.1.2 Functionals involving spectral and spatial approximations

Theorem 4.1

5 Proofs

5.1 Recasting of electrostatic energies

Lemma 5.1

Proposition 5.2

Proof of Lemma 5.1

Proof of Proposition 5.2

5.2 Recasting of XC energy

Lemma 5.3

Proof

5.3 Interchange of infimum and supremum

Lemma 5.4

Proof

Lemma 5.5

Proof

Theorem 5.6

Proof

5.4 Commutativity of band energy

Theorem 5.7

Proof

5.5 Spatial discretization

5.6 Proof of main Theorem 4.1

5.6.1 Convergence of exact band energies

Lemma 5.8

Proof

Lemma 5.9

Proof

Lemma 5.10

Proof

Lemma 5.11

Proof

5.6.2 Convergence of spectral binning operator \(E_\mathrm{{band}}^{j,k_j}\)

Lemma 5.12

Lemma 5.13

Lemma 5.14

Lemma 5.15

5.6.3 Convergence of \(S_3^{j,k_j}\).

Lemma 5.16

Proof

Lemma 5.17

Proof

Lemma 5.18

5.6.4 Convergence of \(S_{i}^{j,k_j}\), \(i=1, 2\)

Lemma 5.19

Proof

Lemma 5.20

Lemma 5.21

Proof

Lemma 5.22

Proof

Lemma 5.23

Proof

Lemma 5.24

Proof

5.6.5 Main result

Proof

6 Conclusion

Data availibility