Abstract
Schrödinger operators with periodic (possibly complex-valued) potentials and discrete periodic operators (possibly with complex-valued entries) are considered, and in both cases the computational spectral problem is investigated: namely, under what conditions can a ‘one-size-fits-all’ algorithm for computing their spectra be devised? It is shown that for periodic banded matrices this can be done, as well as for Schrödinger operators with periodic potentials that are sufficiently smooth. In both cases implementable algorithms are provided, along with examples. For certain Schrödinger operators whose potentials may diverge at a single point (but are otherwise well-behaved) it is shown that there does not exist such an algorithm, though it is shown that the computation is possible if one allows for two successive limits.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
1 Introduction and main results
We study the computational spectral problem for periodic discrete operators, acting in \(\ell ^2({\mathbb {Z}})\), as well as Schrödinger operators with periodic potentials acting in \(L^2({\mathbb {R}}^d)\). We show that it is possible to compute their respective spectra as limits of finite-dimensional approximations. However, in the Schrödinger case this becomes impossible if the potential is allowed to be discontinuous at a single point (but otherwise it is smooth). More precisely, we prove:
Theorem 1.1
(Periodic banded matrices) Let \(B(\ell ^2({\mathbb {Z}}))\) denote the set of bounded operators on \(\ell ^2({\mathbb {Z}})\) and let \(\{e_i\}_{i\in {\mathbb {Z}}}\) be a basis for \(\ell ^2({\mathbb {Z}})\). Let \(\Omega ^{\mathrm {per}}\subset B(\ell ^2({\mathbb {Z}}))\) be the class of banded, periodic operators with respect to the basis \(\{e_i\}_{i\in {\mathbb {Z}}}\) and \(\Omega ^{\mathrm {per}}_{N,b}\) the subset of matrices with period N and bandwidth b. Then
-
(i)
there exists an algorithm that can compute the spectrum \(\sigma (A)\) of any \(A\in \Omega ^{\mathrm {per}}\) as the limit of a sequence computable approximations;
-
(ii)
there exists an algorithm that can compute the spectrum \(\sigma (A)\) of any \(A\in \Omega ^{\mathrm {per}}_{N,b}\) with guaranteed error bounds.
Theorem 1.2
(Schrödinger: good case) For \(d\in {\mathbb {N}}\) define the class of potentials
and given \(M>0\) and \(p>d\) define the class
Then
-
(i)
there exists an algorithm that can compute the spectrum \(\sigma (H)\) of any Schrödinger operator \(H=-\Delta +V\) with \(V\in \Omega ^{\mathrm {Sch}}\) as the limit of a sequence computable approximations;
-
(ii)
moreover, for \(V\in \Omega ^{\mathrm {Sch}}_{p,M}\) this algorithm yields spectral inclusion with guaranteed error bounds;
-
(iii)
if, in addition to belonging to \(\Omega ^{\mathrm {Sch}}_{p,M}\) the potential V is also real-valued and of bounded variation in the sense of Hardy and Krause, then \(\sigma (H)\) can be computed with full error control.
Theorem 1.3
(Schrödinger: bad case) Let \(x_0\in [0,1]\) and let
Then there does not exist an algorithm that can compute the spectrum \(\sigma (H)\) of any Schrödinger operator \(H=-\Delta +V\) with \(V\in \Omega ^{\mathrm {Sch}}_{x_0}\) as the limit of a sequence computable approximations. However, there does exist an algorithm that can compute \(\sigma (H)\) by taking two successive limits.
Below, in Sect. 2, we give precise definitions of what an ‘algorithm’ is,what information is available to it, how it computes, and hence what we mean when we say ‘computable’. Informally, Theorems 1.1 and 1.2 imply that these computations can be performed numerically. In fact, we provide actual algorithms which can access the matrix entries (in Theorem 1.1) and pointwise evaluations of the potential (in Theorem 1.2). This should be contrasted with Theorem 1.3 which implies that it is impossible to devise an algorithm that can compute the spectrum of any Schrödinger operator with a potential belonging to \(\Omega ^{\mathrm {Sch}}_{x_0}\). We prove this by contradiction: assuming the existence of such an algorithm, we explicitly construct a potential \(V\in \Omega ^{\mathrm {Sch}}_{x_0}\) for which this algorithm would fail in its attempt to compute the spectrum.
These statements are nontrivial. The existence of a ‘one-size-fits-all’ algorithm as in Theorems 1.1 and 1.2 is not obvious: while there are techniques for computing the spectrum of a given operator in each of the above cases—these are indeed well-studied problems—here we prove the existence of a single algorithm (which can be coded) that can handle any input from the given class, without any additional a priori information. Moreover, in the cases of \(\Omega ^{\mathrm {per}}_{N,b}\) and \(\Omega ^{\mathrm {Sch}}_{p,M}\) there are even guaranteed error bounds. In particular, even when only finitely many values of the potential are known to the algorithm, it can still produce an approximation, which contains the original spectrum up to a known error. Conversely, the non-existence result of Theorem 1.3 is also not obvious: we prove that regardless of what operations are allowed, as long as the algorithm can only read a finite amount of information at each iteration, there will necessarily be a potential for which the computation will fail. In the spirit of the Solvability Complexity Theory (see a brief discussion below in Subsection 1.1, with more details in Sect. 2) we show that if one allows for two successive limits (which cannot be collapsed to a single limit), the computation is possible. Note that although the class of potentials \(\Omega ^{\mathrm {Sch}}_{x_0}\) allows for a blowup near \(x_0\), all potentials in this class are integrable over compact sets, and so \(x_0\) is still a regular point for the differential equation [10, footnote, p.67]; elsewhere the potentials are even more well-behaved.
1.1 The solvability complexity index hierarchy
Our exploration of the spectral computational problem—for both discrete and Schrödinger operators—continues a line of research initiated by Hansen in [21] and then further expanded in [4, 5]. This sequence of papers established the so-called Solvability Complexity Index (SCI) Hierarchy, which is a classification of the computational complexity of problems that cannot be computed in finite time, only approximated. This theory is motivated by the following general questions:
What are the limitations of computers, and can they be quantified? Do there always exist algorithms to approximate any problem?
Consider, for instance, the problem of computing the spectrum \(\sigma (A)\) of an operator \(A\in B(\ell ^2({\mathbb {N}}))\) (\(B(\ell ^2({\mathbb {N}}))\) is the set of bounded linear operators on \(\ell ^2({\mathbb {N}})\)). Any such A can be represented as an infinite matrix. A computer, however, will never have access to the entire matrix: it will always only ‘see’ finitely many entries of this matrix, and will only be able to perform finitely many operations, leading to some approximation \(\sigma _n(A)\). In this level of abstraction, can anything be said about the distance of \(\sigma _n(A)\) from \(\sigma (A)\)? Can we always devise an approximation procedure that will guarantee the convergence of \(\sigma _n(A)\) to \(\sigma (A)\) for any \(A\in B(\ell ^2({\mathbb {N}}))\)?
The papers [4, 21] prove the following remarkable result: there will always be some \(M\in B(\ell ^2({\mathbb {N}}))\) for which \(\sigma _n(M)\) will not converge to \(\sigma (M)\) in any metric. This does not depend on any specific properties of the approximation procedure, as long as it is a procedure that mimics a computer: it can only read a finite amount of information and it works in a consistent manner. However, it is shown that if one allows several successive limits this problem becomes tractable: there does exist an algorithm \(\sigma _{n,m,k}\) such that
and these limits cannot be exchanged and cannot be collapsed. The parameters \(n,m,k\in {\mathbb {N}}\) are related to how the matrix representation of A is read by the computer and to the size of some thresholds arising in the computation of singular values of the resulting finite sections. This algorithm is given explicitly.
In this case, we say that the above problem has an SCI value of 3, since 3 successive limits are required. It turns out that if we were to repeat the above discussion with \(B(\ell ^2({\mathbb {N}}))\) replaced by the set of selfadjoint operators in \(B(\ell ^2({\mathbb {N}}))\) then the index would be 2, and if it were replaced by the set of compact operators in \(B(\ell ^2({\mathbb {N}}))\) then the index would be 1. Of course, this raises the following fundamental question: what does it mean for an algorithm to require several limits? Naturally, a numerical analyst might answer this question differently to, say, a theoretical computer scientist. However the mathematical fact remains: some computations cannot be carried out using a single-limit approximation procedure.
The precise definitions relevant to this theory (for instance, a definition of what an algorithm is) are provided in the detailed discussion in Sect. 2. This classification can even be refined further, to account for algorithms that can perform a computation with known error bounds. The following classes of problems can be defined:
- \(\Delta _k:\):
-
For \(k\ge 2\), \(\Delta _k\) is the class of problems that require at most \(k-1\) successive limits to solve, i.e. the class of problems that have an \({\text {SCI}}\) value of at most \(k-1\). Problems in \(\Delta _1\) can be solved in one limit with known error bounds.
- \(\Sigma _k:\):
-
For all \(k\in {\mathbb {N}}\), \(\Sigma _k\subset \Delta _{k+1}\) is the class of problems in \(\Delta _{k+1}\) that can be approximated from “below” with known error bounds.
- \(\Pi _k:\):
-
For all \(k\in {\mathbb {N}}\), \(\Pi _k\subset \Delta _{k+1}\) is the class of problems in \(\Delta _{k+1}\) that can be approximated from “above” with known error bounds.
By an approximation from “above” (resp. “below”) we mean that the output of the algorithm is a superset (resp. subset) of the object we are computing (this clearly requires that this object and its approximations belong to a certain topological space). It can also be shown that for \(k\in \{1,2,3\}\) we have \(\Delta _k=\Sigma _k\cap \Pi _k\).
Previous results: In [4] the spectral computational problems for both \(B(\ell ^2({\mathbb {N}}))\) and for Schrödinger operators were addressed. Some of the results shown there includeFootnote 1
New results: Theorems 1.1, 1.2 and 1.3 respectively imply that
Remark 1.4
We point out that showing that a problem belongs to \(\Pi _1,\,\Sigma _1\) or \(\Delta _1\) is significant, as it shows that the computation can be done with certain guaranteed error bounds.
We finally note that recent years have seen a flurry of activity in this direction with many results classifying various problems within the SCI Hierarchy. We point out [11, 12] where some of the theory of spectral computations has been further developed; [32] where this has been applied to certain classes of unbounded operators; [2] where solutions of PDEs were considered; [6, 7] where we considered resonance problems; and [13] where the authors give further examples of how to perform certain spectral computations with error bounds.
1.2 Periodic operators
Schrödinger equations with periodic potentials have been the subject of study since the earliest days of quantum mechanics, perhaps most famously for the Bethe-Sommerfeld Conjecture [37], which states that the number of gaps in the essential spectrum is finite in dimensions \(\ge 2\). After more than seventy years this conjecture was finally proved in complete generality for Schrödinger operators in \({{\mathbb {R}}}^d\) by Parnovski [27]. In dimensions 2 and 3 the conjecture had already been proved by Popov and Skriganov [28] and Skriganov [36] respectively, and in dimension 4 by Helffer and Mohamed [22].
Beyond these results in mathematical analysis, since the 1990s interest in periodic problems has grown rapidly in the applied analysis and computational mathematics literature, partly driven by models of photonic crystals. These models are typically based on time-harmonic Maxwell equations or upon second order elliptic equations with periodic coefficients. Figotin and Kuchment [16,17,18] give particularly thorough analyses of some of these models, showing that already in these cases with piecewise constant coefficients the associated operators may possess an arbitrarily large number of spectral gaps.
For a periodic problem with some particular coefficients, often the first question of interest is whether it has any spectral gaps at all. Numerical methods may be used to obtain some preliminary evidence, and are almost always based on the Floquet-Bloch decomposition. The fact that the coefficients are often only piecewise continuous requires substantial effort to be given to adaptive meshing, see Giani and Graham [19], although the continuous variation of the quasi-momentum over the Brillouin zone means that some of the effort can be recycled from one quasi-momentum to the next. Despite the substantial computational cost, Floquet-Bloch techniques can sometimes be used to go beyond preliminary evidence, and have been combined with interval arithmetic to obtain algorithms which yield computer-assisted proofs of existence of spectral gaps for a wide class of problems with coefficients expressed in terms of elementary functions, see Hoang, Plum and Wieners [23]. The literature for periodic problems which are not self-adjoint is much less extensive. For the ODE case, the work of Rofe-Beketov [31] is generally the starting point for any research on this topic.
The discrete case encompasses numerous different directions of research. Typical examples are Toeplitz and Laurent operators [20] or Jacobi operators [38]. Another direction that has gained a great deal of attention in the last two decades is that of periodic discrete Schrödinger operators and generalizations thereof. A particular type known as almost Mathieu operator has been shown to exhibit rich spectral behavior (e.g. the spectrum can be a set of non-integer Hausdorff dimension, cf. [1, 24]). Beyond this, the literature pertaining to operators that are either not tri-diagonal or not self-adjoint is limited. The best starting point would be the book of Trefethen and Embree [39].
1.3 Organization of the paper
In Sect. 2 we give a brief introduction to the main ideas of the \({\text {SCI}}\) theory. Sections 3, 4 and 5 are dedicated to the proofs of Theorems 1.1, 1.2 and 1.3, respectively. Finally, in Sect. 6 we provide some numerical examples.
2 The solvability complexity index hierarchy
The Solvability Complexity Index (\({\text {SCI}}\)) and the \({\text {SCI}}\) Hierarchy provide a unified approach for understanding just how “difficult” it is to approximate infinite-dimensional problems (such as computing spectra) starting from finite-dimensional approximations. We start by setting the scene with a concrete example before providing precise abstract definitions.
2.1 Informal discussion and examples
Consider the set \(\Omega = B(\ell ^2({\mathbb {N}}))\) of all bounded operators on \(\ell ^2({\mathbb {N}})\). Let \(\{e_i\}_{i\in {\mathbb {N}}}\) be the canonical basis. Then any element \(A\in \Omega \) is represented by an infinite matrix. Denote by \(\Lambda \) the set of all entries in this matrix. Then one could ask:
For any \(A\in \Omega \), is it possible to compute its spectrum \(\sigma (A)\) as the limit of a sequence of computations \(\Gamma _n\), where each \(\Gamma _n\) has access to only finitely many elements of \(\Lambda \) and can only perform finitely many arithmetic computations?
Needless to say, the whole point here is that the algorithms \(\Gamma _n\) are not tailored for this specific element A: they are meant to be able to handle any element \(A\in \Omega \). The convergence of \(\Gamma _n(A)\) to \(\sigma (A)\) is made precise by realizing them as elements of the metric space \({\mathcal {M}}=(\mathrm {cl}({\mathbb {C}}), d)\) which comprises all closed subsets of \({\mathbb {C}}\) endowed with an appropriate metric d (such as the Hausdorff metric).
In [21], Hansen showed that it is possible to compute \(\sigma (A)\) for any \(A\in \Omega \) as above. However, rather than having algorithms \(\Gamma _n\) with a single index \(n\in {\mathbb {N}}\), three indices were required, satisfying \(\sigma (A)=\lim _{n_3\rightarrow \infty }\lim _{n_2\rightarrow \infty }\lim _{n_1\rightarrow \infty }\Gamma _{n_1,n_2,n_3}(A)\). The algorithms \(\Gamma _{n_1,n_2,n_3}\) are given explicitly, and can be implemented numerically (though this raises a philosophical question about what it means to take successive limits numerically). In [4] it was proved that this is optimal: this computation cannot be performed with fewer than 3 limits, and hence we say that this problem has an \({\text {SCI}}\) value of 3.
The \({\text {SCI}}\) value strongly depends on \(\Omega \): intuitively, if \(\Omega \) contains fewer elements, then devising a ‘one-size-fits-all’ algorithm should be easier. Indeed, if one considers \(\Omega ^{\mathrm {sa}}:=\{A\in \Omega \,|\,A\text { is selfadjoint}\}\) then the \({\text {SCI}}\) value reduces to 2 and for \(\Omega ^{\mathrm {cpt}}:=\{A\in \Omega \,|\,A\text { is compact}\}\) it further reduces to 1.
The classification into \({\text {SCI}}\) values can be further refined into a classification that takes into account error bounds. This is the so-called SCI Hierarchy which we describe below.
2.2 Definitions
We formalize the foregoing example with precise definitions:
Definition 2.1
(Computational problem) A computational problem is a quadruple \((\Omega ,\Lambda ,\Xi ,{\mathcal {M}})\), where
-
(i)
\(\Omega \) is a set, called the primary set,
-
(ii)
\(\Lambda \) is a set of complex-valued functions on \(\Omega \), called the evaluation set,
-
(iii)
\({\mathcal {M}}\) is a metric space,
-
(iv)
\(\Xi :\Omega \rightarrow {\mathcal {M}}\) is a map, called the problem function.
Remark 2.2
In this paper it is often clear what \({\mathcal {M}}, \Lambda \) and \(\Xi \) are, and the important element is the primary set \(\Omega \). In this case we may abuse notation and refer to \(\Omega \) alone as the computational problem.
Definition 2.3
(Arithmetic algorithm) Let \((\Omega ,\Lambda ,\Xi ,{\mathcal {M}})\) be a computational problem. An arithmetic algorithm is a map \(\Gamma :\Omega \rightarrow {\mathcal {M}}\) such that for each \(T\in \Omega \) there exists a finite subset \(\Lambda _\Gamma (T)\subset \Lambda \) such that
-
(i)
the action of \(\Gamma \) on T depends only on \(\{f(T)\}_{f\in \Lambda _\Gamma (T)}\),
-
(ii)
for every \(S\in \Omega \) with \(f(T)=f(S)\) for all \(f\in \Lambda _\Gamma (T)\) one has \(\Lambda _\Gamma (S)=\Lambda _\Gamma (T)\),
-
(iii)
the action of \(\Gamma \) on T consists of performing only finitely many arithmetic operations on \(\{f(T)\}_{f\in \Lambda _\Gamma (T)}\).
Definition 2.4
(Tower of arithmetic algorithms) Let \((\Omega ,\Lambda ,\Xi ,{\mathcal {M}})\) be a computational problem. A tower of algorithms of height k for \(\Xi \) is a family \(\Gamma _{n_1,n_2,\dots ,n_k}:\Omega \rightarrow {\mathcal {M}}\) of arithmetic algorithms such that for all \(T\in \Omega \)
Definition 2.5
(SCI) A computational problem \((\Omega ,\Lambda ,\Xi ,{\mathcal {M}})\) is said to have a Solvability Complexity Index (\({\text {SCI}}\)) of k if k is the smallest integer for which there exists a tower of algorithms of height k for \(\Xi \). If a computational problem has solvability complexity index k, we write
If there exists a family \(\{\Gamma _n\}_{n\in {\mathbb {N}}}\) of arithmetic algorithms and \(N_1\in {\mathbb {N}}\) such that \(\Xi =\Gamma _{N_1}\) then we define \({\text {SCI}}(\Omega ,\Lambda ,\Xi ,{\mathcal {M}})=0\).
Definition 2.6
(The SCI Hierarchy) The \({\text {SCI}}\) Hierarchy is a hierarchy \(\{\Delta _k\}_{k\in {{\mathbb {N}}_0}}\) of classes of computational problems \((\Omega ,\Lambda ,\Xi ,{\mathcal {M}})\), where each \(\Delta _k\) is defined as the collection of all computational problems satisfying:
with the special class \(\Delta _1\) defined as the class of all computational problems in \(\Delta _2\) with a convergence rate:
Hence we have that \(\Delta _0\subset \Delta _1\subset \Delta _2\subset \cdots \)
When the metric space \({\mathcal {M}}\) has certain ordering properties, one can define further classes that take into account convergence from below/above and associated error bounds. In order to not burden the reader with unnecessary definitions, we provide the definition that is relevant to the cases where \({\mathcal {M}}\) is the space of closed (and bounded) subsets of \({\mathbb {R}}^d\) together with the Attouch-Wets (Hausdorff) distance (definitions of which can be found in Appendix A). These are the cases of relevance to us. A more comprehensive and abstract definition can be found in [4].
Definition 2.7
(The SCI Hierarchy (Attouch-Wets/Hausdorff metric)) Consider the setup in Definition 2.6 assuming further that \({\mathcal {M}}=(\mathrm {cl}({\mathbb {R}}^d),d)\) where \(d=d_{\mathrm {AW}}\) or \(d=d_{\mathrm {H}}\). Then for \(k\in {\mathbb {N}}\) we can define the following subsets of \(\Delta _{k+1}\):
It can be shown that \(\Delta _k=\Sigma _k\cap \Pi _k\) for \(k\in \{1,2,3\}\), see Fig. 1. We refer to [4] for a detailed treatise.
3 Banded periodic matrices
We begin by studying periodic banded matrices, which can be regarded as a natural simplification of periodic Schrödinger operators (indeed, our treatment covers the case of discrete Schrödinger operators). In fact, our method of proof in either case is very similar, but turns out to be considerably simpler in the discrete case and therefore provides a good starting point for our analysis.
In this section we prove Theorem 1.1. We do this by defining an explicit algorithm and show that its output converges to the desired spectrum. The two computational problems we consider only differ in the primary set \(\Omega \), and are as follows:
where \(\{e_i\}_{i\in {\mathbb {Z}}}\) denotes the canonical basis of \(\ell ^2({\mathbb {Z}})\) and \(d_\mathrm {H}\) denotes the Hausdorff distance. We remind the reader that \(\Omega ^{\mathrm {per}}_{N,b}\) is the class of operators on \(\ell ^2({\mathbb {Z}})\) whose canonical matrix representation has bandwidth b (i.e. \(A_{ij}=0\,\,\forall |i-j|>b\)) and whose matrix entries repeat periodically along the diagonals with period N (here \(N,b\in {\mathbb {N}}\)). Clearly, every \(A\in \Omega ^{\mathrm {per}}_{N,b}\) defines a bounded operator on \(\ell ^2({\mathbb {Z}})\). Note that \(\Omega ^{\mathrm {per}} = \bigcup _{N,b\in {\mathbb {N}}}\Omega ^{\mathrm {per}}_{N,b}\) is the class of operators on \(\ell ^2({\mathbb {Z}})\) whose canonical matrix representation is banded and whose matrix entries repeat periodically along the diagonals. In the language of the Solvability Complexity Index, the three parts of Theorem 1.1 can be expressed as follows:
-
Part (i) amounts to proving that the computational problem for \(\Omega ^{\mathrm {per}}\) has an \({\text {SCI}}\) value of 1 (or, equivalently, it belongs to \(\Delta _2\)).
-
Part (ii) amounts to showing that the computational problem for \(\Omega ^{\mathrm {per}}_{N,b}\) belongs to \(\Delta _1\), i.e. it can be approximated with explicit error bounds; this is restated as Theorem 3.3 below.
Remark 3.1
We note that the Hausdorff distance is only defined for non-empty sets, and it is finite only if the sets are bounded. Hence it is important to observe that for any \(A\in \Omega ^{\mathrm {per}}\), the set \(\sigma (A)\) is both non-empty and bounded. Indeed, boundedness of the spectrum follows immediately from boundedness of A, while non-emptyness follows from the Floquet-Bloch theory described in Sect. 3.2. We discuss the metrics used in this paper in Appendix A.
Example 3.2
The class \(\Omega _{N,1}^{\mathrm {per}}\) contains all Jacobi-type matrices of the form
3.1 Proof of Theorem 1.1(i)
To prove Theorem 1.1(i) we assume to be known Theorem 1.1(ii). Theorem 1.1(ii) can be restated in the language of the SCI Hierarchy as follows:
Theorem 3.3
For any fixed \(N,b\in {\mathbb {N}}\) the computational problem for \(\Omega ^{\mathrm {per}}_{N,b}\) can be solved in one limit with explicit error bounds, i.e. \(\big (\Omega ^{\mathrm {per}}_{N,b},\Lambda ,\Xi ,{\mathcal {M}}\big ) \in \Delta _1\).
The proof of this theorem is contained in Subsection 3.4 below, after some preparatory work. First, however, we prove Theorem 1.1(i):
Proof
(Proof of Theorem 1.1(i)) By Theorem 3.3, for every \(N\in {\mathbb {N}}\) there exists a family of algorithms \(\{\Gamma ^{(N)}_n\}_{n\in {\mathbb {N}}}\), such that \(\Gamma ^{(N)}_n(B)\rightarrow \sigma (B)\) as \(n\rightarrow +\infty \) for any \(B\in \Omega ^{\mathrm {per}}_{N,b}\). Now, let \(A=(a_{ij})_{i,j\in {\mathbb {Z}}}\in \Omega ^{\mathrm {per}}\) and define a new family \(\{\Gamma _n\}_{n\in {\mathbb {N}}}\) by the following pseudocode.
To clarify the meaning of \(d_i\) we note that in Example 3.2 one would have \(d_i = (\dots ,0,c_i,a_i,b_i,0,\dots )\). Loosely speaking, Pseudocode 1 first takes a finite section of A, searches it for periodic repetitions, and then defines a matrix \(B_n\in \Omega ^{\mathrm {per}}_{N,b}\) by periodic extension, to which \(\Gamma ^{(N)}_n\) can be applied. Because A is banded and periodic, this routine will eventually find its period: there exists \(n_0\in {\mathbb {N}}\) such that for all \(n>n_0\), N (as defined in the routine) is equal to the period of A and \(B_n\equiv A\). Hence, for \(n>n_0\) we have \(\Gamma _n(A) = \Gamma _n^{(N)}(B_n) = \Gamma _n^{(N)}(A)\rightarrow \sigma (A)\) as \(n\rightarrow +\infty \), by the properties of \(\Gamma _n^{(N)}\).
Finally, note that every line of Pseudocode 1 can be executed with finitely many algebraic operations on the matrix elements of A. \(\square \)
The following subsections are devoted to the proof of Theorem 3.3. The proof is constructive, i.e. we will provide an explicit algorithm that computes the spectrum of any given operator \(A\in \Omega ^{\mathrm {per}}_{N,b}\) with explicit error bounds.
3.2 Floquet-Bloch transform
Let N be as in the statement of Theorem 3.3. Given a vector \(x=(x_n)_{n\in {\mathbb {Z}}}\in \ell ^2({\mathbb {Z}})\) and given \(\theta \in [0,2\pi ]\), define
We also introduce the symbol \(\ell ^2_{\text {per}}(N)\) to denote the space of all N-periodic sequences \((y_k)_{k\in {\mathbb {Z}}}\), together with the norm \(\Vert y\Vert _{\ell ^2_{\text {per}}(N)}^2 = \sum _{k=0}^{N-1} |y_k|^2\). Note that \(\ell ^2_{\text {per}}(N)\) is canonically isomorphic to the Euclidean space \({\mathbb {R}}^N\). The following lemma is easily proved by direct computation.
Lemma 3.4
(Properties of \({\mathcal {U}}_\theta \)) The map \({\mathcal {U}}_\theta \) defined in (3.3) has the following properties.
-
(i)
For any \(x\in \ell ^2({\mathbb {Z}})\), \({\mathcal {U}}_\theta x\) is N-periodic, that is \({\mathcal {U}}_\theta :\ell ^2({\mathbb {Z}})\rightarrow \ell ^2_{\text {per}}(N)\);
-
(ii)
The map
$$\begin{aligned} {\mathcal {U}}: \ell ^2({\mathbb {Z}})&\rightarrow \int _{[0,2\pi ]}^\oplus \ell ^2_{\mathrm {per}}(N)\, d\theta \\ x&\mapsto ({\mathcal {U}}_\theta x)_{\theta \in [0,2\pi ]} \end{aligned}$$is unitary;
-
(iii)
The inverse \({\mathcal {U}}^{-1}\) is given by
$$\begin{aligned} ({\mathcal {U}}^{-1}y)_n = (2\pi )^{-\frac{1}{2}} \int _0^{2\pi } y_n(\theta )e^{\i n \frac{\theta }{N}} \,d\theta . \end{aligned}$$
Proof
This is standard, and the proof is omitted. \(\square \)
3.3 Transform and properties of \(A\in \Omega ^{\mathrm {per}}_{N,b}\)
The N-periodicity of \(A\in \Omega ^{\mathrm {per}}_{N,b}\) along diagonals is equivalent to the identity
Lemma 3.5
For any \(A\in \Omega ^{\mathrm {per}}_{N,b}\), \(y\in \ell ^2_{ \mathrm {per}}(N)\) and \(\theta \in [0,2\pi ]\), define \((A(\theta )y)_n = \sum _{j\in {\mathbb {Z}}}e^{\i \theta \frac{j-n}{N}} A_{nj}y_j\). Then one has
Proof
A straightforward calculation (which we leave to the reader) shows that \(({\mathcal {U}}_\theta Ax)_n = (A(\theta ){\mathcal {U}}_\theta x)_n\) for \(n\in {\mathbb {Z}}\) and \(x\in \ell ^2({\mathbb {Z}})\). The assertion now follows from the invertibility of \({\mathcal {U}}\). \(\square \)
Remark 3.6
Observe that \((A(\theta )y)_n=(A(\theta )y)_{n+kN}\) for any \(k\in {\mathbb {Z}}\) so that \(A(\theta )y\in \ell ^2_{\mathrm {per}}(N)\). Hence, \(A(\theta )\) is an operator \( \ell ^2_{\mathrm {per}}(N)\rightarrow \ell ^2_{\mathrm {per}}(N)\), which can be expressed as an \(N\times N\) matrix. Note, however, that the numbers \(e^{-\i \theta \frac{n-j}{N}} A_{nj}\) are not the matrix elements of \(A(\theta )\) with respect to any basis. Indeed, \(\ell ^2_{\text {per}}(N)\) is finite-dimensional, while \(e^{-\i \theta \frac{n-j}{N}} A_{nj}\) (\(n,j\in {\mathbb {Z}}\)) are infinitely many numbers.
As noted earlier, we have \(\ell ^2_{\text {per}}(N) \cong {\mathbb {R}}^N\). The identification can be made via the basis
i.e. \((e_n^{\text {per}})_j = \delta _{(j\text { mod }N),\,n}\) for \(n\in \{0,\dots ,N-1\}\) (Kronecker symbol). In this basis, the matrix elements of \(A(\theta )\) become
Note that the sum in the last line is actually finite, because A is banded. Indeed, if the band width of A is less than the period, then the sum over \(j'\) in (3.5) contains only one term.
Example 3.7
If A is a matrix with \(N=1\), i.e. A is a Laurent operator, then formula (3.5) yields a scalar function of \(\theta \) given by
Writing \(z:=e^{\i \theta }\), we see that \(A(\theta ) = \sum _{j\in {\mathbb {Z}}} z^j A_{0, j}\) is given by the symbol of the Laurent operator. We thus recover the classical result that the spectrum of a Laurent operator is given by the image of the unit circle under its symbol (cf. [39, Th. 7.1]).
Example 3.8
If A is tri-diagonal and \(N=5\) (cf. (3.2)), the formula above gives
Next, we establish some elementary facts about the spectrum of a periodic operator. By standard results about the Floquet-Bloch transform, we have
for all \(A\in \Omega ^{\mathrm {per}}_{N,b}\). Thus, an algorithm may be devised by determining the zeros of the map \(z\mapsto \det (A(\theta )-zI)\), \(\theta \in [0,2\pi ]\). To this end, note that by definition of the determinant, one has
where the coefficient functions \(p_n(\theta )\) are polynomials in the matrix entries \(A(\theta )_{mn}\) and hence analytic and periodic in \(\theta \). Hence they are bounded:
Moreover, \(p_N(\theta )\equiv 1\).
Lemma 3.9
Let \(A\in \Omega ^{\mathrm {per}}_{N,b}\) and \(R>0\). For any \(z,w\in {\mathbb {C}}\) with \(|z|,|w|\le R\) and \(\theta ,\vartheta \in [0,2\pi ]\) one has
where we note that \(\Vert A\Vert _{\infty }=\max \{|A_{ij}|\,|\,i,j\in {\mathbb {Z}}\}\) can be computed in finitely many steps.
Proof
Denote \(B_R:=\{z\in {\mathbb {C}}\,|\,|z|\le R\}\). From the mean value theorem it follows that for any differentiable function \(f:B_R\times [0,2\pi ]\rightarrow {\mathbb {C}}\) one has
Hence to prove the claim it is enough to bound \(\Vert \nabla f\Vert _{L^\infty (B_R)\times [0,2\pi ]}\) for \(f(z,\theta ) = \det (A(\theta )-zI)\). This follows from the Jacobi formula: for any square matrix M one has
where \({\text {cof}}(M)\) denotes the cofactor matrix of M. Hence,
Using Hadamard’s inequality to bound the cofactor matrix, we obtain the bounds
where the last two lines follow from the explicit formula (3.5). The bounds (3.6) and (3.7) imply
and the claim follows. \(\square \)
3.4 Proof of Theorem 1.1(ii)
We can finally prove Theorem 1.1(ii) which was restated equivalently as Theorem 3.3. First, we define the family of algorithms \(\{\Gamma _n\}_{n\in {\mathbb {N}}}\), where each of them maps \(\Gamma _n:\Omega ^{\mathrm {per}}_{N,b}\rightarrow {\mathcal {M}}\) (we recall that \({\mathcal {M}}\) is the space of all compact subsets of \({\mathbb {C}}\) endowed with the Hausdorff metric). It is easy to see that for any \(A\in \Omega ^{\mathrm {per}}_{N,b}\) one has \(\Vert A\Vert _{\ell ^2\rightarrow \ell ^2}\le R_A:=\sum _{j=1}^N\sum _{k=-b}^b |A_{jk}|\) (this follows from Young’s inequality), a quantity which can be computed in finitely many steps. Therefore, if we denote \(B_{R_A}:=\{z\in {\mathbb {C}}\,|\,|z|\le R_A\}\), the a priori inclusion \(\sigma (A)\subset B_{R_A}\) holds true for any \(A\in \Omega ^{\mathrm {per}}_{N,b}\).
Definition 3.10
(N, b-Periodic Matrix Algorithm) Let \(A\in \Omega ^{\mathrm {per}}_{N,b}\) and for \(n\in {\mathbb {N}}\), let \(\Theta _n = \big (\theta _1^{(n)},\dots ,\theta _n^{(n)}\big )\) be a linear spacing of \([0,2\pi ]\) and let \(L_n:=\frac{1}{n}({\mathbb {Z}}+\i {\mathbb {Z}}) \cap B_{R_A}\) be a finite lattice with spacing \(n^{-1}\). Then we define
Remark 3.11
We emphasize that (3.8) can be computed in finitely many arithmetic operations on the matrix elements of A. Indeed, computing the radius \(R_A\) consists of a finite sequence of multiplications and additions, as does the computation of each of the determinants \(\det (A(\theta _i^{\scriptscriptstyle (n\scriptscriptstyle )})-z)\) for z, \(\theta _i^{\scriptscriptstyle (n\scriptscriptstyle )}\) in the finite sets \(L_n\), \(\Theta _n\).
Proof of Theorem 3.3 (equiv. Theorem 1.1(ii))
The proof has two steps.
Step 1: \(\underline{\sigma (A)}\) \(\underline{ \text {is approximated from above by} }\) \(\underline{\Gamma _n(A)}\). For any set \(K\subset {\mathbb {C}}\) we denote by \(B_\delta (K)\) the \(\delta \)-neighborhood of K.
Let \(z\in \sigma (A)\). Then \(|z|\le \Vert A\Vert _{\ell ^2\rightarrow \ell ^2} \le R_A\) and there exists \(\theta \in [0,2\pi ]\) such that \(\det (A(\theta )-zI)=0\). Choose \(z_n\in L_n\) such that \(|z-z_n| \le n^{-1}\) and \(\theta _n\in \Theta _n\) such that \(|\theta -\theta _n| \le n^{-1}\). Applying Lemma 3.9 we obtain the bound
This inequality implies that \(z_n\in \Gamma _n(A)\) as soon as \(2n^{-1} N^{\frac{N}{2}+2}\big ((2b+1)\Vert A\Vert _{\infty } + R_A\big )^N (2b+1)^2 \Vert A\Vert _{\infty } \le n^{-\frac{1}{2}}\), or equivalently,
Note that the right-hand side of (3.9) is computable in finitely many arithmetic operations if N and b are known a priori. Since \(|z-z_n| \le n^{-1}\) by construction, this shows that \(\sigma (A)\subset B_{\frac{1}{n}}(\Gamma _n(A))\) for \(n > (2 N^{\frac{N}{2}+2}\big ((2b+1)\Vert A\Vert _{\infty } + R_A\big )^N (2b+1)^2 \Vert A\Vert _{\infty })^2\).
Step 2: \(\underline{\sigma (A)}\) \(\underline{\text {is approximated from below by}}\) \(\underline{\Gamma _n(A)}\). Next we prove that \(\Gamma _n(A)\subset B_{\delta }(\sigma (A))\) for n large enough. We first note that, since \(\det (zI-A(\theta ))\) is a polynomial in z, it can be factored to take the form
where \(z_i(\theta )\) are the zeros of \(z\mapsto \det (zI-A(\theta ))\) (note that for a characteristic polynomial the coefficient of the leading order term is always 1). From (3.10) we obtain the bound
Let \(z_n\in \Gamma _n(A)\) be an arbitrary sequence. Then, by definition, \(|\det (z_nI-A(\theta _n))|\le n^{-\frac{1}{2}}\) for some \(\theta _n\in \Theta _n\). From (3.11) we conclude that
and thus \(z_n\in B_{n^{-\nicefrac {1}{2N}}}(\sigma (A))\) for all \(n\in {\mathbb {N}}\). This concludes step 2.
Together, steps 1 and 2 imply that for any given \(\delta >0\) one has both \(\Gamma _n(A)\subset B_\delta (\sigma (A))\) and \(\sigma (A)\subset B_\delta (\Gamma _n(A))\) provided that
Since the right-hand side of (3.12) is computable from N, b and the matrix elements of A in finitely many arithmetic operations, we conclude that the computational problem is in \(\Delta _1\). \(\square \)
Example 3.12
The algorithm from Definition 3.10 can easily be implemented in Matlab. An example calculation with bandwidth \(b=1, N=5\) and A of the form (3.2) with
yields the following output in the complex plane. Note that the output is a set of points on a discrete grid in the complex plane and hence the output looks ‘fat’. As n is taken larger this set of points dwindles to just those points that lie in an ever decreasing neighborhood of the true spectrum.
The Matlab implementation that produced Fig. 2 is available online at https://github.com/frank-roesler/TriSpec.
4 Schrödinger operators with periodic potentials
In this section we prove Theorem 1.2 regarding the spectrum of Schrödinger operators \(H=-\Delta +V\) with periodic potentials \(V:{\mathbb {R}}^d\rightarrow {\mathbb {C}}\). Again, this is done by defining an explicit algorithm. We shall consider three computational problems which only differ in their primary set \(\Omega \), and are as follows (primary sets are defined immediately below):
where \(-\Delta +V\) is meant to be defined on \(L^2({\mathbb {R}}^d)\) with domain \(H^2({\mathbb {R}}^d)\), and \(d_{\text {AW}}\) denotes the Attouch-Wets metric, which is a generalization of the Hausdorff metric for the case of sets which may be unbounded (see Appendix A for a brief discussion). Note that the spectrum \(\sigma (-\Delta +V)\) is always non-empty in this case, so taking this metric makes sense. For \(p>d\) and \(M>0\), the primary sets are defined as follows:
Note that by Morrey’s inequality, every \(V\in \Omega ^{\mathrm {Sch}}_p\) is continuous, and so the evaluation set \(\Lambda \) which comprises point evaluations of V, is well-defined. In the language of the Solvability Complexity Index, the three parts of Theorem 1.2 can be expressed as follows:
-
Part (i) amounts to proving that the computational problem for \(\Omega ^{\mathrm {Sch}}\) has an \({\text {SCI}}\) value of 1 (or, equivalently, it belongs to \(\Delta _2\)).
-
Part (ii) amounts to showing that the computational problem for \(\Omega ^{\mathrm {Sch}}_{p,M}\) belongs to \(\Pi _1\), i.e. it can be approximated from above with explicit error bounds.
-
Part (iii) amounts to showing that the computational problem for potentials that are real-valued and in \(\Omega ^{\mathrm {Sch}}_{p,M}\) belongs to \(\Delta _1\), i.e. it can be approximated with explicit error bounds.
The proof of Theorem 1.2 is contained in Subsection 4.3, and it relies on the following weaker theorem:
Theorem 4.1
The computational problem for \(\Omega ^{\mathrm {Sch}}_{p}\) can be solved in one limit: \({\text {SCI}}(\Omega ^{\mathrm {Sch}}_p) = 1\) (equivalently, \(\Omega ^{\mathrm {Sch}}_p\in \Delta _2\)).
Note that this theorem is evidently weaker than Theorem 1.2(i) as the class of potentials \(\Omega ^{\mathrm {Sch}}_p\) considered here is a strict subset of the class \(\Omega ^{\mathrm {Sch}}=\bigcup _{p>d}\Omega ^{\mathrm {Sch}}_p\) considered in Theorem 1.2(i). The proof is constructive, i.e. we provide an explicit algorithm that computes the spectrum of any given operator with \(V\in \Omega ^{\mathrm {Sch}}_p\). Note that this problem is fundamentally different from the discrete problem (3.1) where we could directly access the matrix elements of the (discrete) operator. Instead, the evaluation set \(\Lambda \) gives access to the point values of the potential. Hence our task is to construct a sequence of algorithms \(\{\Gamma _n\}_{n\in {\mathbb {N}}}\), such that each \(\Gamma _n\) computes its output from finitely many point evaluations of V using finitely many algebraic operations.
The proof of Theorem 4.1 is contained in Subsection 4.2. Prior to that, Subsection 4.1 contains several technical estimates needed to prove convergence of the algorithm.
4.1 Estimating and approximating the potential
This section uses well-known facts about the Floquet-Bloch transform, which we gather in Appendix B. The critical step is to compute approximations to the spectrum \(\sigma (-\Delta +V)\) using only finitely many pointwise evaluations of V. This is done using the Floquet-Bloch transform in conjunction with the Birman-Schwinger principle. The approximate potential is defined in (4.8) and the critical error bound is stated in Lemma 4.6.
4.1.1 A Birman-Schwinger principle for \(H(\theta )\)
It shall be convenient for us to characterize the eigenvalues using the well-known Birman-Schwinger principle [8, 33]. This principle requires a decomposition, which we choose as follows. Let \(p>d,\; V\in \Omega _p^{\mathrm {Sch}}, \;\theta \in [0,2\pi ]^d\) and let \(H(\theta )\) be the corresponding Floquet-Bloch operator as in (B.1). Expanding the operator square, \(H(\theta )\) can be written as
Let us choose the following decomposition of \(H(\theta )\). We define
Next we derive an operator identity that characterizes \(\lambda \in \sigma (H(\theta ))\setminus \sigma (H_0)\) as those points for which \(1\in \sigma ({\mathcal {K}}(\lambda ))\), for a certain compact operator \({\mathcal {K}}(\lambda )\). Clearly one has \(H(\theta ) = H_0+B(\theta )\). The auxiliary constant 1, which is added in \(H_0\) and subtracted again in \(B(\theta )\) was chosen for convenience, so that \(H_0\) becomes a positive invertible operator. Note that \(B(\theta )\) is relatively compact with respect to \(H_0\). For \(\lambda \notin \sigma (H_0)\) one has
where I denotes the identity operator on \(L^2((0,1)^d)\) It follows that \(\lambda - H_0 - B(\theta )\) is invertible if and only if \(I - H_0^{-\nicefrac 12}B(\theta )(\lambda - H_0)^{-1}H_0^{\nicefrac 12}\) is invertible, and in that case
This identity implies that
4.1.2 Schatten class estimates
We now study the analytic operator valued function
We choose a Fourier basis for \(L^2((0,1)^d)\), that is, we choose a numbering \({\mathbb {N}}\ni j\mapsto k_j\in 2\pi {\mathbb {Z}}^d\) such that \(|k_j|\) is monotonically increasing with j and set
We note that \(e_j\in {\text {dom}}(H(\theta ))\) for all \(\theta \) and \(\Vert e_j\Vert _{L^2((0,1)^d)}=1\) for all \(j\in {\mathbb {N}}\). In this basis, the operators \(H_0^{\nicefrac 12}\), \(H_0\) and \(\theta \cdot \nabla \) are all diagonal and one has
Therefore, we have
Now the following lemma is easily proved.
Lemma 4.2
(Schatten bound for K) For every \(s>d\) the operator \(K(\lambda ,\theta )\) belongs to the Schatten class \({\mathcal {C}}_s\) and one has
where \(C_\lambda := \sup _{j\in {\mathbb {N}}}\big | 1-\frac{\lambda -1}{|k_j|^2} \big |^{-1}\) and \(\Vert \cdot \Vert \) denotes the \(L^2\) operator norm.
Proof
Let \(\lambda \in {\mathbb {C}}\setminus \sigma (H_0)\) and note that \(C_\lambda <+\infty \) by our choice of \(\lambda \). Observe that simple geometric considerations lead to the bound
Then one has
Hence the characteristic numbers of \(H_0^{-\frac{1}{2}}(-2\i \theta \cdot \nabla ) H_0^{\frac{1}{2}}(\lambda - H_0)^{-1}\) are bounded by \(\frac{2}{\pi }C_\lambda |\theta | j^{-\frac{1}{d}}\) and thus one has
and
Next we turn to the potential term in \(K(\lambda ,\theta )\), that is, the operator \(H_0^{-\nicefrac 12}\big (|\theta |^2 - 1 + V \big ) H_0^{\nicefrac 12}(\lambda - H_0)^{-1}\). This is easily treated by the ideal property of \({\mathcal {C}}_s\), since the operator \(|\theta |^2 - 1 + V\) is bounded. Indeed, we have for every \(s>d\)
where the last line follows from a similar calculation to (4.4) and the fact that \(\Vert H_0^{-\nicefrac 12}\Vert =1\) (this follows from the matrix representation (4.2) and the fact that \(k_1=0\)). \(\square \)
Lemma 4.3
(Lipschitz continuity of K) For every \(s>d\) and \(\lambda ,\mu \in {\mathbb {C}}\setminus \sigma (H_0)\), \(\theta ,\vartheta \in [0,2\pi ]^d\) one has
where \(C_\lambda \) is defined as in Lemma 4.2 and \(c_V = \frac{1}{2} + \pi d^{\frac{1}{2}} + (4\pi d^{\frac{1}{2}})^{-1} \Vert V-1\Vert _\infty \).
Proof
First, note that
Using (4.7) together with the resolvent identity for \((\lambda -H_0)^{-1}\) one obtains
Taking norms in \({\mathcal {C}}_s\) and using the estimates from the proof of Lemma 4.2 (and in particular (4.5)) we obtain the bound
Finally, applying Lemma 4.2 and using \(|\theta +\vartheta |\le 4\pi d^{\frac{1}{2}}\) we obtain
This concludes the proof. \(\square \)
While Lemma 4.3 gives precise information about the dependence of the Lipschitz constant of K on all parameters, it will be useful for us to have a bound which is less precise but more explicit (and manifestly computable).
Corollary 4.4
Let \(s>d\) and \(\delta , R>0\). Then for any \(\theta ,\vartheta \in [0,2\pi ]^d\) and \(\lambda ,\mu \in {\mathbb {C}}\) such that \(|\lambda -(1+|k_j|^2)|, |\mu -(1+|k_j|^2)|>\delta \) for all j and \(|\lambda |,|\mu |<R\) one has
Proof
This follows immediately from Lemma 4.3 via rather crude estimates, noting that \(C_\lambda \le R\delta ^{-1}\) and \(\Vert V\Vert _\infty \le \frac{3p-d}{p-d}\Vert V\Vert _{W^{1,p}((0,1)^d)}\) (cf. [9, Ch. 9.3]). \(\square \)
4.1.3 Approximation of the potential
Next we study the matrix representation of the potential V in the Fourier basis \(\{e_j\}_{j\in {\mathbb {N}}}\) and its approximations. First, we observe that in the Fourier basis \(\{e_j\}_{j\in {\mathbb {N}}}\) one has
where \({\hat{V}}_k\) denote the Fourier coefficients of V. Indeed, a direct calculation gives
Now, we want to build a computable, finite size approximation of the matrix \((V_{jm}) = (\langle {e_j , Ve_m}\rangle )\). We start by approximating the Fourier coefficients \({\hat{V}}_k\).
Lemma 4.5
Let \(n\in {\mathbb {N}}\) and define the lattice \(I_n:=\{\frac{m}{n}\,|\,m=0,\dots ,n-1\}^d\subset (0,1)^d\). For every \(f\in W^{1,p}((0,1)^d)\), \(p>d\), one has
Proof
The proof is a standard application of Morrey’s inequality (cf. (28) in the proof of [9, Th. 9.12]), which implies that \(|f(x) - f(\xi )| \le \frac{2n^{-1+\nicefrac dp}}{1-\nicefrac dp}\Vert \nabla f\Vert _{L^p((0,1)^d)}\) for x in the cube \((0,\frac{1}{n})^d+\xi \). Details are left to the reader. \(\square \)
Let us introduce the approximate Fourier coefficients for \(k\in 2\pi {\mathbb {Z}}^d\) and \(n\in {\mathbb {N}}\),
Note that the \({\hat{V}}_k^{\text {appr},n}\) can be computed in finitely many operations from the information provided in \(\Lambda \). Lemma 4.5 applied to the function \(f(x)=V(x)e^{\i k\cdot x}\) leads to the error estimate
Next, we define the approximate potential matrix
We remark that the approximated potential matrix cannot be computed in finitely many algebraic operations from the point values of V, because it has infinitely many entries. This issue will be addressed next.
Lemma 4.6
(Main Error Bound) For \(N\in {\mathbb {N}}\) let \({\mathcal {H}}_N=\mathrm {Span}\{e_1,\dots ,e_N\}\) and let \(P_N:L^2((0,1)^d)\rightarrow {\mathcal {H}}_N\) be the orthogonal projection. Moreover, define
Then for every \(s>d\) one has
where we recall that \(C_\lambda = \sup _{j\in {\mathbb {N}}}\big | 1-\frac{\lambda -1}{|k_j|^2} \big |^{-1}\) and \(C_{s,d}^1 ,\,C_{s,p,d}^2, \,C_{s,d}^3\) are explicit constants independent of \(n,N,\lambda ,\theta \).
Proof
Again, we denote by \(\Vert \cdot \Vert \) the \(L^2({\mathbb {R}}^d)\) operator norm in this proof. We first treat the \(\theta \cdot \nabla \) term. By Eqs. (4.2)–(4.3) we have
and by (4.4) we can estimate the above as
To estimate the next term, we denote \(W_n:=|\theta |^2 - 1 + V^{\mathrm {appr},n}\) and \(W:=|\theta |^2 - 1 + V\) for brevity, and note that the diagonal operators \(H_0^{-\frac{1}{2}}\) and \((\lambda - H_0)^{-1}\) commute with \(P_N\). Thus we have \(P_NH_0^{-\frac{1}{2}} W_n H_0^{\frac{1}{2}}(\lambda - H_0)^{-1}P_N = H_0^{-\frac{1}{2}}P_N W_n P_N H_0^{\frac{1}{2}}(\lambda - H_0)^{-1}\). Then we calculate
Let us first consider the second term on the right-hand side of (4.11).
where the fourth line follows from (4.9), with a similar calculation as in (4.6). To simplify notation, we collect all constants independent of \(\lambda ,n,N\) into one and write
Next we turn to the first term on the right-hand side of (4.11). We add and subtract \(P_N W\) and use the triangle inequality to obtain
Next we note that, by (4.2)–(4.3), \(\Vert H_0^{-\frac{1}{2}}\Vert = 1\) and \(\Vert H_0^{\frac{1}{2}}(\lambda - H_0)^{-1}\Vert \le 2 C_\lambda \). Therefore
Finally, we employ (4.2)–(4.3) again to estimate the finite section error in (4.13). A straightforward calculation shows that
Using these bounds in (4.13), we finally obtain the error estimate
Combining (4.10), (4.12) and (4.14) yields the assertion with constants
\(\square \)
4.2 Proof of Theorem 4.1
In this section we construct an algorithm \(\Gamma _N(V)\), which approximates \(\sigma (-\Delta +V)\) as \(N\rightarrow \infty \). The output of \(\Gamma _N\) consists of two parts \(\Gamma _n^z\) and \({\tilde{\Gamma }}_n^z\), which we call algorithmic components. We first recall a result on Lipschitz continuity of perturbation determinants.
Theorem 4.7
( [35, Thm. 6.5]) For \(m\in {\mathbb {N}}\), denote by \(\det _m\) the perturbation determinant on \({\mathcal {C}}_m\) (cf. [14, Sec. XI.9]). Then there exists a constant \(c_m\) such that
for all \(A,B\in {\mathcal {C}}_m\). Moreover, one has \(c_m\le e(2+\log (m))\).
Next, we define
Definition 4.8
(First algorithmic component) Let \(p>d\), \(V\in \Omega _p^{\mathrm {Sch}}\) and \(z_0\in {\mathbb {C}}\). For \(N\in {\mathbb {N}}\) let \(\Theta _N = \big (\theta _1^{(N)},\dots ,\theta _N^{(N)}\big )\) be a linearly spaced lattice in \([0,2\pi ]^d\) and let \(L_N:=\frac{1}{N}({\mathbb {Z}}+\i {\mathbb {Z}}) \cap Q_{z_0}\), where \(Q_{z_0}:=\left\{ z\in {\mathbb {C}}\,\big |\, |{\text {Im}}(z-z_0)|,|{\text {Re}}(z-z_0)|\le \frac{1}{2} \right\} \). Then we let \(n(N):=N^{\lceil \alpha \rceil }\), with \(\alpha =\frac{1+2d^{-1}-p^{-1}}{1-p^{-1}d}\),Footnote 2 and define
Note that for every \(N\in {\mathbb {N}}\), \(\Gamma _N^{z_0}(V)\) can be computed from the information in \(\Lambda \) using finitely many algebraic operations (recall in particular the approximated Fourier coefficients (4.8)). Therefore, every \(\Gamma _N^{z_0}\) defines an arithmetic algorithm in the sense of Definition 2.3.
Proposition 4.9
(Convergence of first algorithmic component) Let \(z_0\in {\mathbb {C}}\) and let \(V\in \Omega _{p}^{\mathrm {Sch}}\). The following statements hold.
-
(i)
For any sequence \(z_N\in \Gamma ^{z_0}_N(V)\) with \(z_N\rightarrow z\in Q_{z_0}\setminus \bigcup _j(1+|k_j|^2)\) one has \(z\in \sigma (-\Delta +V)\).
-
(ii)
For any \(z\in \left( \sigma (-\Delta +V)\cap Q_{z_0}\right) \setminus \bigcup _j(1+|k_j|^2)\) there exists a sequence \(z_N\in \Gamma ^{z_0}_N(V)\) with \(z_N\rightarrow z\) as \(N\rightarrow +\infty \).
-
(iii)
Let \(V\in \Omega _{p,M}^{\mathrm {Sch}}\). For any given \(\varepsilon ,\delta > 0\) one has \(\left( \sigma (-\Delta +V)\cap Q_{z_0}\right) \setminus B_\delta \big (\bigcup _j(1+|k_j|^2)\big ) \subset B_\varepsilon (\Gamma _N^{z_0}(V))\) as soon as \(N>\max \{ \varepsilon ^{-1},\, N_{\delta ,z_0} \}\), where
$$\begin{aligned} N_{\delta ,z_0} = \Big [ C_{\delta ,z_0}^\text {Lip} + G(|z_0|+1)\delta ^{-1} \Big ]^{\frac{2}{\nicefrac {1}{p}+\nicefrac {1}{d}}} \end{aligned}$$(4.16)and \(C_{\delta ,z_0}^\text {Lip}\), G are explicit constants, which can be computed in finitely many operations from \(z_0,\delta ,s,p,d\) and the a priori bound M for \(\Vert V\Vert _{W^{1,p}}\).
We defer the proof of Proposition 4.9 to Appendix C. The algorithmic component \(\Gamma _N^{z_0}\) defined in (4.15) on its own is not sufficient to prove Theorem 4.1, because it does not approximate any eigenvalues lying in the set \(\bigcup _j(1+|k_j|^2)\). This is ultimately due to the decomposition (4.1) into \(H_0\) and \(B(\theta )\). In order to solve this issue, we define a second algorithmic component based on a different decomposition. We define
Then, obviously, \({\tilde{H}}_0 + {\tilde{B}}(\theta )=H(\theta )\) and by the same calculation as in Sect. 4.1.1,
Moreover, we have \(\sigma ({\tilde{H}}_0) = \bigcup _j(2+|k_j|^2)\) and
Lemma 4.10
One has
Proof
The assertion is equivalent to the equation \(2 + (2\pi )^2\sum _{i=1}^d{\tilde{n}}_i^2 = 1 + (2\pi )^2\sum _{i=1}^d n_i^2\) having no solutions for any integers \(n_i,{\tilde{n}}_i\), which becomes \(1 = (2\pi )^2\sum _{i=1}^d (n_i^2 - {\tilde{n}}_i^2)\). Clearly the right-hand side of the last equation is always irrational or zero, while the left hand side is always nonzero rational. Thus no solution exists. \(\square \)
Subsection 4.1 and Proposition 4.9 carry over trivially to the decomposition \({\tilde{H}}_0,\,{\tilde{B}}(\theta )\). Analogously to (4.15) we define the new algorithm
Definition 4.11
(Second algorithmic component) Let \(p>d\), \(V\in \Omega _p^{\mathrm {Sch}}\) and \(z_0\in {\mathbb {C}}\). For \(N\in {\mathbb {N}}\) let \(\Theta _N = \big (\theta _1^{(N)},\dots ,\theta _N^{(N)}\big )\) be a linearly spaced lattice in \([0,2\pi ]^d\) and let \(L_N:=\frac{1}{N}({\mathbb {Z}}+\i {\mathbb {Z}}) \cap Q_{z_0}\), where \(Q_{z_0}:=\left\{ z\in {\mathbb {C}}\,\big |\, |{\text {Im}}(z-z_0)|,|{\text {Re}}(z-z_0)|\le \frac{1}{2} \right\} \). Then we choose \(n(N):=N^{\lceil \alpha \rceil }\) (with \(\alpha \) as in Definition 4.8) and define
where now
From the proofs in Subsection 4.1 and Proposition 4.9 we immediately obtain the following convergence result.
Proposition 4.12
(Convergence of second algorithmic component) Let \(z_0\in {\mathbb {C}}\) and let \(V\in \Omega _{p}^{\mathrm {Sch}}\). The following statements hold.
-
(i)
For any sequence \(z_N\in {\tilde{\Gamma }}^{z_0}_N(V)\) with \(z_N\rightarrow z\in Q_{z_0}\setminus \bigcup _j(2+|k_j|^2)\) one has \(z\in \sigma (-\Delta +V)\).
-
(ii)
For any \(z\in \left( \sigma (-\Delta +V)\cap Q_{z_0}\right) \setminus \bigcup _j(2+|k_j|^2)\) there exists a sequence \(z_N\in {\tilde{\Gamma }}^{z_0}_N(V)\) with \(z_N\rightarrow z\) as \(N\rightarrow +\infty \).
-
(iii)
Let \(V\in \Omega _{p,M}^{\mathrm {Sch}}\). For any given \(\varepsilon ,\delta >0\) one has \(\left( \sigma (-\Delta +V)\cap Q_{z_0}\right) \setminus B_\delta \big (\bigcup _j(2+|k_j|^2)\big ) \subset B_\varepsilon ({\tilde{\Gamma }}_N^{z_0}(V))\) as soon as \(N>\max \{ \varepsilon ^{-1},\, N_{\delta ,z_0} \}\), where \(N_{\delta ,z_0}\) was defined in (4.16).
Armed with the algorithmic components (4.15) and (4.18), we are finally able to define the main algorithm:
Definition 4.13
(Main Algorithm) Let \(p>d\), \(V\in \Omega _p^{\mathrm {Sch}}\), and choose a numbering \(\{Z_j\}_{j\in {\mathbb {N}}}\) of \({\mathbb {Z}}+\i {\mathbb {Z}}\) such that \(|Z_i|\le |Z_j|\) for \(i\le j\). For \(N\in {\mathbb {N}}\) define
Combining Propositions 4.9 and 4.12, we can complete the proof of Theorem 4.1:
Proof
(Proof of Theorem 4.1) Let \(p>d\). For any \(V\in \Omega _p^{\mathrm {Sch}}\) we need to show that
Indeed, by Propositions 4.9 and 4.12, the following holds
-
(a)
For any sequence \(z_N\in \Gamma _N(V)\) with \(z_N\rightarrow z\in {\mathbb {C}}\) one has \(z\in \sigma (-\Delta +V)\).
-
(b)
For any \(z\in \sigma (-\Delta +V)\) there exists a sequence \(z_N\in \Gamma _N(V)\) with \(z_N\rightarrow z\) as \(N\rightarrow +\infty \).
Attouch-Wets convergence now follows from a standard argument, cf. [32, Prop. 2.8]. \(\square \)
4.3 Proof of Theorem 1.2
We can finally prove Theorem 1.2.
Proof
Proof of part (i) The a priori knowledge of p can be removed by slightly modifying the algorithmic components (4.15) and (4.18). An algorithm that achieves \({\text {SCI}}=1\) is obtained by modifying the determinant threshold in (4.15) and (4.18). If the cutoff \(N^{-(\frac{1}{2d} - \frac{1}{2p})}\) is replaced by \(\log (N)^{-1}\) and the discretization width \(n(N)=N^{\lceil \alpha \rceil }\) is replaced by \(n(N):=e^N\), the proof of Propositions 4.9 and 4.12 is valid independently of the value of p (cf. Eq. (C.3)).
Proof of part (ii). Fix \(R>0\). According to our choice of numbering \(\{Z_i\}_{i\in {\mathbb {N}}}\) we have \(|Z_i|>R\) for all \(i>4R^2\) and thus \(B_R(0)\subset \bigcup _{i\le 4R^2} Q_{Z_i}\). Let \(\delta _R = \frac{1}{2} \min \bigl \{|x-y|\,\big |\,x\in \sigma (H_0),\,y\in \sigma ({\tilde{H}}_0),\,|x|,|y|\le R\bigr \}\), which is greater than 0 by Lemma 4.10. This choice implies that removing \(\delta _R\)-neighborhoods of the free spectra does not actually restrict our domain of computation:
for all \(R>0\). Now let \(\varepsilon >0\). By Propositions 4.9(iii) and 4.12(iii) we have
as soon as \(N > \max \{ 4R^2,\, \varepsilon ^{-1},\, N_{R}\}\), where \(N_R = \max \{N_{\delta _R,Z_i}\,|\, i\le 4R^2 \}\) (recall \(N_{\delta ,z}\) from (4.16)). We recall that
-
the lower bound \(4R^2\) ensures that \(B_R(0)\) is covered by the search regions \(Q_{Z_i}\);
-
the lower bound \(\varepsilon ^{-1}\) ensures that for every \(z\in \sigma (-\Delta +V)\) there exists \(z_N\in L_N\) with \(|z-z_N|<\varepsilon \);
-
the lower bound \(N_{R}\) ensures that \(z_N\in \Gamma _N(V)\).
To conclude the proof of part (ii) (i.e., that \(\Omega _{p,M}^{\mathrm {Sch}}\in \Pi _1\)), we need a set \(X_k\) as in Definition 2.7. To this end, for \(V\in \Omega _{p,M}^{\mathrm {Sch}}\) and \(k\in {\mathbb {N}}\), choose \(N(k):=\max \{4k^2, 2^k, N_k\}\) and define
Then by the definition of the Attouch-Wets distance \(d_{\text {AW}}\) (cf. Definition A.2) one has
where the third line follows from the definition of \(X_k\). Moreover, by (4.19) (with \(\varepsilon =2^{-k}\) and \(R=k\)) we have
for all \(k\in {\mathbb {N}}\). These facts, together with step (i) imply \(\Omega _{p,M}^{\mathrm {Sch}}\in \Pi _1\).
Proof of part (iii). A previous result (cf. [4, Th. 8.3]) implies that \(\{V\in \Omega ^{\mathrm {Sch}}_{p,M}\,|\, V\text { real-valued}\}\in \Sigma _1\). Combined with Theorem 1.2(ii) and [4, Prop. 6.15] we obtain
i.e. if the potential is periodic and real-valued, the spectrum can be computed with full error control. \(\square \)
Remark 4.14
In view of Theorems 3.3 and 4.1 it is natural to ask whether one might have \(\Omega _{p,M}^{\mathrm {Sch}}\in \Sigma _1\) (and therefore \(\Omega _{p,M}^{\mathrm {Sch}}\in \Delta _1\)). This is indeed a nontrivial open problem. Recalling the proof of Theorem 3.3 (in particular (3.11)), the error bound for the approximation “from below” used the fact that \(\det (zI-A(\theta ))\) is a polynomial. In the proof of Theorem 4.1 on the other hand, the function \(\det _{\lceil p\rceil }(I-K(z,\theta ))\), whose zeros must be approximated, is only known to be analytic. Obtaining \(\Sigma _1\) classification amounts to obtaining explicit upper bounds on the width of the zeros of this analytic function. These cannot be deduced in any straightforward way from the values of the potential V.
5 Sometimes two limits are necessary
In Sect. 2 we described a complicated construction called towers of algorithms, where more than one successive limit is required to correctly perform certain computations. In this section we exhibit this phenomenon first hand: we prove Theorem 1.3 which is rephrased in the language of \({\text {SCI}}\) as Theorem 5.1 below. This theorem shows that there exists a class of potentials (which are less smooth, yet can be evaluated at any point) for which there do not exist algorithms that can approximate the associated spectral problem in a single limit. That is, \({\text {SCI}}>1\). We are able, though, to construct an arithmetic algorithm which converges by taking two successive limits. That is, \({\text {SCI}}=2\). This class contains potentials which are allowed to have a singularity at a single point \(x_0\in (0,1)\), but are otherwise smooth:
We emphasize that \(\Omega _{x_0}^{\mathrm {Sch}}\) contains only real-valued functions and that the pointwise evaluation \(V\mapsto V(x)\) is well-defined for all \(x\in {\mathbb {R}}\). Moreover, we remark that by [29, Th. XIII.96] the Schrödinger operator \(-\Delta +V\) is well-defined and selfadjoint with domain \(H^2({\mathbb {R}})\). We shall therefore consider the computational problem
and shall prove
Theorem 5.1
The computational problem (5.1) has \({\text {SCI}}=2\) (equivalently, it belongs to \(\Delta _3\setminus \Delta _2\)).
The proof of Theorem 5.1 has two parts. To prove \({\text {SCI}}\le 2\) we construct an explicit tower of algorithms that computes the spectrum in two limits (cf. Definition 2.4). The proof of \({\text {SCI}}> 1\) is by contradiction. We assume the existence of a sequence \(\{\Gamma _n\}_{n\in {\mathbb {N}}}\) with \(\Gamma _n(V)\rightarrow \sigma (-\partial _x^2+V)\) for every \(V\in \Omega _{x_0}^{\mathrm {Sch}}\) and via a diagonal process construct a potential \(V\in \Omega _{x_0}^{\mathrm {Sch}}\) such that \(\Gamma _n(V)\nrightarrow \sigma (-\partial _x^2+V)\), yielding the desired contradiction.
5.1 Lemmas
We first collect some technical lemmas that will be necessary for the proof.
Lemma 5.2
Denote by W(H) the numerical range of an operator H. Let \(V\in \Omega _{x_0}^{\mathrm {Sch}}\). There exists \(\delta >0\) such that \(\inf W(-\partial _x^2+V) \ge -\frac{1}{4}\) whenever \(\Vert V\Vert _{L^1([0,1])}<\delta \).
Proof
Choose a partition of unity \(\{\chi _i\}_{i\in {\mathbb {Z}}}\) such that \({\text {supp}}(\chi _i)\subset [i-1,i+1]\) and \(\sup _{i\in {\mathbb {Z}}}\Vert \chi _i\Vert _{W^{1,\infty }}<+\infty \). Then for any \(\phi \in C_0^\infty ({\mathbb {R}})\) we have
where the last line follows from periodicity of V. Moreover, we have
where the second line follows from the Sobolev embedding \(W^{1,1}\hookrightarrow L^\infty \) (cf. [9, Th. 8.8]) and the third follows by the product rule and Hölder’s inequality. Summing (5.2) over i we obtain
where \(C' = 2C\sup _{i\in {\mathbb {Z}}}\Vert \chi _i\Vert _{W^{1,\infty }}\) is finite by choice of the functions \(\chi _i\). Turning to the numerical range, we have for \(\phi \in C_0^\infty ({\mathbb {R}})\)
where the second line follows from (5.3). Finally, let \(\delta :=(4C')^{-1}\). Then if \(\Vert V\Vert _{L^1([0,1])}<\delta \) one has
and the proof is complete. \(\square \)
Lemma 5.3
For \(h>0\), \(q_1,q_2\in [0,1]\) with \(0<q_1<q_2<1\), define the periodic step function
Then for every \({\tilde{V}}\in \Omega _{x_0}^{\mathrm {Sch}}\) with \({\tilde{V}}\le 0\) one has
Proof
Define the sequence of test functions
We note that \(\Vert \phi _n\Vert _{L^2({\mathbb {R}})}\rightarrow 1\) as \(n\rightarrow +\infty \). Then if \({\tilde{V}}\in \Omega _{x_0}^{\mathrm {Sch}}\) with \({\tilde{V}}\le 0\) we have
The assertion follows by letting \(n\rightarrow +\infty \). \(\square \)
The next lemma is needed to construct the tower of algorithms that will compute the spectrum of elements in \(\Omega ^{\mathrm {Sch}}_{x_0}\).
Lemma 5.4
Let \(V\in \Omega _{x_0}^{\mathrm {Sch}}\) and for \(n\in {\mathbb {N}}\) let \(\rho _n\in W^{1,\infty }({\mathbb {R}})\) be a periodic function such that
then one has \(\sigma (-\partial _x^2+\rho _n V) \rightarrow \sigma (-\partial _x^2+V)\) in Attouch-Wets distance.
Proof
A Neumann series argument shows that
where \(z\notin {\mathbb {R}}\) and the right hand side is defined whenever \(\Vert (\rho _n-1)V(-\partial _x^2 + V - z)^{-1}\Vert _{L^2\rightarrow L^2}<1\). We will prove that the right hand side of (5.5) converges to 0 in the norm resolvent sense. To this end let \(u\in L^2({\mathbb {R}})\) and compute
where Hölder’s inequality was used in the third line, periodicity of \((\rho _n-1)V\) was used in the fourth line and the Sobolev embedding \(H^1({\mathbb {R}})\hookrightarrow L^\infty ({\mathbb {R}})\) was used in the fifth line. Combining Eqs. (5.5) and (5.6) we find that
for all \(n>N\) large enough such that \(\Vert (\rho _n-1)V\Vert _{L^2([0,1])} < \Vert (-\partial _x^2 + V - z)^{-1}\Vert _{L^2\rightarrow H^1}^{-1}\). Since \(V\in L^2([0,1])\) we know that \(\Vert (\rho _n-1)V\Vert _{L^2([0,1])}\rightarrow 0\), hence such N must exist. For \(n>N\) the geometric series gives
with \(q_n = \Vert (\rho _n-1)V\Vert _{L^2([0,1])} \Vert (-\partial _x^2 + V - z)^{-1}\Vert _{L^2\rightarrow H^1}\), which immediately implies norm resolvent convergence. Finally, an application of [30, Thms. VIII.23-24] yields the desired spectral convergence. \(\square \)
5.2 Proof of Theorem 5.1
We are now ready to prove Theorem 5.1.
Step 1: \(\underline{\text {Construction of an adversarial potential} }\) \(\underline{V}\). Assume for contradiction that there exists a sequence of algorithms \(\{\Gamma _n\}_{n\in {\mathbb {N}}}\) such that for any \(V\in \Omega _{x_0}^{\mathrm {Sch}}\), \(\Gamma _n(V)\rightarrow \sigma (-\partial _x^2+V)\) as \(n\rightarrow +\infty \). We now describe a process that defines an “adversarial” potential V for which the sequence \(\{\Gamma _n\}_{n\in {\mathbb {N}}}\) will necessarily fail.
We begin with an example construction that illustrates how a potential can be obtained that “fools” \(\Gamma _n\) for a single n. This construction is then iterated below to obtain a potential whose spectrum is not approximated by the entire sequence \(\{\Gamma _n\}_{n\in {\mathbb {N}}}\). For the sake of definiteness we make the arbitrary choice \(x_0:=\frac{1}{4}\), though the proof does not depend on this choice. Moreover, we note that by Hölder’s inequality one has \(\Vert f\Vert _{L^1([0,1])} \le \Vert f\Vert _{L^2([0,1])}\) for all \(f\in L^2([0,1])\).
With the notation from Lemma 5.3, let \(V_1\) be the periodic square well potential \(V_{h_1,0,\nicefrac 12}\) with \(h_1>0\) chosen such that \(\inf W(-\partial _x^2+V_1)\le -1\). Because \(-\partial _x^2+V\) is selfadjoint, this implies that \(\inf \sigma (-\partial _x^2+V_1)\le -1\). Our assumptions about \(\Gamma _n\) imply that there exists \(n_1\in {\mathbb {N}}\) such that \(\inf {\text {Re}}(\Gamma _n(V))<-\frac{1}{2}\) for all \(n\ge n_1\). By the definition of an algorithm (Definition 2.3), \(\Gamma _{n_1}(V)\) depends only on finitely many elements of \(\Lambda \), i.e. finitely many point values \(V(x_i)\), \(i\in \{1,\dots ,m_{n_1}\}\). We may assume without loss of generality that the sets \(\{x_1,\dots ,x_{m_n}\}\) are growing with n.
For later reference, let \(\rho \) be a smooth, compactly supported function on \({\mathbb {R}}\) such that
Next, define a new potential \(\tilde{V_1}\) by “thinning out” \(V_1\) around the points \(x_i\). More concretely, let \(\delta >0\) (to be determined later) and define \(l_1 := \min \big \{|x_i-x_j|\,\big |\, i,j\in \{0,\dots ,m_{n_1}\},\,x_i\ne x_j\big \}\) (note that the point \(x_0\) appears in the definition of \(l_1\)) and let
Then by construction we have \(\Vert \tilde{V_1}\Vert _{L^2([0,1])} \le \sum _{i=1}^{m_{n_1}} h_1 \sqrt{\delta l_1} \le m_{n_1} h_1 \sqrt{\delta }\). Next, apply Lemma 5.2 and choose \(\delta \) such that \(\inf W(-\partial _x^2+\tilde{V_1}) \ge -\frac{1}{4}\) (and hence \(\inf \sigma (-\partial _x^2+\tilde{V_1}) \ge -\frac{1}{4}\)). Note that we have \(\tilde{V_1}(x_i)=V_1(x_i)\) for all \(i\in \{0,\dots ,m_{n_1}\}\). Hence by consistency of algorithms we have \(\Gamma _{n_1}(\tilde{V_1}) = \Gamma _{n_1}(V_1)\). We have constructed a smooth potential such that
and thus \(d_{\mathrm {AW}}\big (\Gamma _{n_1}(\tilde{V_1}), \sigma (-\partial _x^2+\tilde{V_1})\big ) \ge \frac{1}{4}\). We remark that \({\tilde{V}}_1\equiv 0\) in a neighbourhood of \(x_0\).
The constructions outlined above define an iterative process that yields a sequence of smooth potentials \(\{V_k\}_{k\in {\mathbb {N}}}\) and \(\{{\tilde{V}}_k\}_{k\in {\mathbb {N}}}\) (see Fig. 3). We outline the details below. Fix \(\eta >0\) to be determined later and initialize \({\tilde{V}}_0 \equiv 0\).
-
Let \(2\le k\in {\mathbb {N}}\) and suppose that \({\tilde{V}}_{k-1}\) has already been defined.
-
Choose an interval \(I=(x_0-\varepsilon ,x_0+\varepsilon )\) on which \({\tilde{V}}_k\equiv 0\) and let \(V_{k}:={\tilde{V}}_{k-1} + V_{h_{k},x_0-\varepsilon ,x_0+\varepsilon }\), where \(h_{k}\) is chosen such that \(\inf W(-\partial _x^2+V_{k})\le -1\) (cf. Lemma 5.3).
-
Choose \(n_{k}\) large enough such that \(\inf {\text {Re}}(\Gamma _{n_{k}}(V_{k}))<-\frac{1}{2}\). Then \(\Gamma _{n_{k}}(V_{k})\) depends only on finitely many point values \(x_1,\dots ,x_{m_{n_{k}}}\).
-
Let \(l_k := \min \big \{|x_i-x_j|\,\big |\, i,j\in \{0,\dots ,m_{n_{k}}\},\,x_i\ne x_j\big \}\) and define a “thinned out” potential \({\tilde{V}}_{k}\) by
$$\begin{aligned} \tilde{V}_{k}(x) := {\tilde{V}}_{k-1}(x) + V_{h_{k},x_0-\varepsilon ,x_0+\varepsilon }(x)\sum _{i=1}^{m_{n_{k}}}\rho \left( \frac{x}{\delta l_k} - x_i\right) . \end{aligned}$$Then \(\Vert {\tilde{V}}_{k}\Vert _{L^2([0,1])}\le \Vert {\tilde{V}}_{k-1}\Vert _{L^2([0,1])} + m_{n_{k}} h_{k} \sqrt{\delta }\).
-
Choose \(0<\delta <\left( \frac{\eta }{2^{k}m_{n_{k}}h_{k}}\right) ^2\) such that \(\inf W(-\partial _x^2+\tilde{V}_{k})>-\frac{1}{4}\).
-
It follows that \(\Gamma _{n_{k}}(\tilde{V}_{k}) = \Gamma _{n_{k}}(V_{k})\) and thus \(d_{\mathrm {AW}}\big (\Gamma _{n_{k}}(\tilde{V}_{k}), \sigma (-\partial _x^2+\tilde{V}_{k})\big ) \ge \frac{1}{4}\).
-
Moreover, we have by construction
$$\begin{aligned} \Vert {\tilde{V}}_{k}\Vert _{L^2([0,1])} \le \eta \sum _{j=0}^k 2^{-j} \end{aligned}$$(5.7)and there exists an interval \((x_0-\varepsilon ',x_0+\varepsilon ')\) on which \({\tilde{V}}_{k}\equiv 0\).
This process defines a sequence of potentials \(\{{\tilde{V}}_k\}_{k\in {\mathbb {N}}}\) such that for a subsequence \(\{n_k\}_{k\in {\mathbb {N}}}\) one has
Next we show that the sequence \({\tilde{V}}_k\) converges pointwise to a function \(V^\eta \in L^2([0,1])\), which is smooth on \([0,1]\setminus \{x_0\}\). By construction, for every \(\varepsilon >0\), sequence \(\{{\tilde{V}}_k|_{[0,1]\setminus (x_0-\varepsilon ,x_0+\varepsilon )}\}_{k\in {\mathbb {N}}}\) is eventually constant. Combined with the fact that \({\tilde{V}}_k(x_0)=0\) for all k, this implies that \(\{{\tilde{V}}_k\}_{k\in {\mathbb {N}}}\) converges pointwise to a function \(V^\eta \) on [0, 1]. Because for every \(\varepsilon >0\) there exists \(k\in {\mathbb {N}}\) such that \(V^\eta |_{[0,1]\setminus (x_0-\varepsilon ,x_0+\varepsilon )} = {\tilde{V}}_k|_{[0,1]\setminus (x_0-\varepsilon ,x_0+\varepsilon )}\), we have that \(V^\eta \) is smooth on \([0,1]\setminus \{x_0\}\). Finally, by (5.7) for any \(\varepsilon >0\) there exists \(k\in {\mathbb {N}}\) such that
Letting \(\varepsilon \rightarrow 0\) we conclude by monotone convergence that \(V^\eta \in L^2([0,1])\) and consequently \(V^\eta \in \Omega _{x_0}^{\mathrm {Sch}}\). Moreover, the inequality \(\Vert V^\eta \Vert _{L^2([0,1])}\le 2\eta \) allows us to use Lemma 5.2 and choose \(\eta >0\) small enough that
To conclude the proof we note that for any \(k\in {\mathbb {N}}\) we have \(V^\eta (x_i)={\tilde{V}}_{k}(x_i)\) for all \(i\in \{1,\dots ,m_{n_k}\}\) and by consistency of algorithms we have
and thus \(\inf {\text {Re}}(\Gamma _{n_k}(V^\eta )) \le -\frac{1}{2}\). Combining eqs. (5.8), (5.9) and (5.10) we conclude that
Consequently, the sequence \(\Gamma _{n}(V^\eta )\) cannot converge to \(\sigma (-\partial _x^2+V^\eta )\) and the desired contradiction follows, proving that \({\text {SCI}}(\Omega _{x_0}^{\mathrm {Sch}})\ge 2\).
Step 2: \(\underline{\mathrm{Construction \,of\, \,tower\, of\, algorithms.}}\) We conclude the proof of Theorem 5.1 by showing \({\text {SCI}}(\Omega _{x_0}^{\mathrm {Sch}})\le 2\). To this end, choose a function \(\rho _n\) as in (5.4) change this, if we change the \(\rho _n\) above, whose values are explicitly computable (e.g. piecewise linear) and define the mapping
where \(\Gamma _m\) denotes the algorithm from Definition 4.13. Note that \(\Gamma _m(\rho _n V)\) is well-defined because \(\rho _n V\in W^{1,\infty }({\mathbb {R}})\) for every \(n\in {\mathbb {N}}\). Applying Theorem 1.2 and Lemma 5.4 we immediately find
where all limits are taken in Attouch-Wets distance. This completes the proof. \(\square \)
Note that the two limits in Step 2 above cannot be swapped, because it is unclear how \(\Gamma _m(\rho _n V)\) behaves when \(\rho _n V\) converges to a non-smooth function.
6 Numerical results
To illustrate our abstract results, we implemented a version of the algorithm from Definition 4.13 in one dimension in Matlab. In this section we show the results of this implementation and compare them against known abstract and numerical results.
In order to obtain an implementation with adequate performance, we fixed a box Q in the complex plane and then computed the quantity
where the numbers N, n, C were treated as independent parameters. Moreover, the spectral shift, which was chosen to be 1 in (4.1) and 2 in (4.17) can be fixed to be any point \(z_0\) outside the box Q. The routine is illustrated in Pseudocode 2.
The actual Matlab implementation of Pseudocode 2 is available on https://github.com/frank-roesler/PeriodicSpectra.
6.1 Implementation in one dimension. The Mathieu equation
We consider the Mathieu equation
where \(\mu \in {\mathbb {C}}\) is a constant and \(\lambda \) denotes the spectral parameter. This equation was first studied in [25] in the context of vibrating membranes and has been studied extensively since (see [26, Ch. 5] for a discussion). Figure 4 shows the output of our implementation for various values of \(\mu \).
In the case of a real-valued potential (top panel of Fig. 4) our algorithm produces the expected band-gap structure, with one gap showing around \(\lambda =10\) and another around \(\lambda =40\). In the case of purely imaginary \(\mu \), the theory of \(\mathcal {PT}\)-symmetric operators (i.e. operators that are invariant under simultaneous parity and time reversal) can be used to prove abstract results about the possible shape of the spectrum [34]. A comparison between the middle panel of Fig. 4 and [34, Fig. 2] shows agreement between the theoretical results and the output of our algorithm. Finally, the bottom panel in Fig. 4 shows the output when \(\mu \) has both a real and an imaginary part and the \(\mathcal {PT}\) symmetry is broken.
To further validate our results, let us focus on the \(\mathcal {PT}\) symmetric case (\(\mu =10\i \)) and compare them to existing results available in one dimension. Let \(\phi _{1,\lambda }\), \(\phi _{2,\lambda }\) be two classical solutions of (6.1) on (0, 1) with initial conditions \(\phi _{1,\lambda }(0)=1\), \(\phi _{1,\lambda }'(0)=0\) and \(\phi _{2,\lambda }(0)=0\), \(\phi _{2,\lambda }'(0)=1\). Then, the Hill Discriminant is defined by
and one can show (cf. [15]) that \(\lambda \) is in the spectrum of (6.1) if and only if \(-1\le D(\lambda )\le 1\). Figure 5 shows the points in \({\mathbb {C}}\) satisfying a softened version of this discriminant inequality, computed from a Runge-Kutta approximation of \(\phi _{1,\lambda }\), \(\phi _{2,\lambda }\). A comparison between Figs. 5 and 4 shows good agreement.
Finally, we note that our method is naturally immune to the common problem of spectral pollution. By definition, spectral pollution occurs when there exist sequences \(z_n\in \Gamma _n(V)\) such that \(\{z_n\}_{n\in {\mathbb {N}}}\) has an accumulation point outside \(\sigma (-\Delta +V)\). This effect appears in the approximation of Mathieu’s equation if a naive finite difference scheme is used on a truncated domain (see Fig. 6). For arbitrarily fine discretization and arbitrarily large truncated domain, the method is sensitive to small perturbations of the domain and can yield a set which is far from the correct spectrum.
6.2 Implementation in two dimensions
In principle, the algorithm given by Pseudocode 2 can be implemented in any space dimension. However, in d dimensions the size of the matrix \(V^{\text {appr},n}\) grows like \(n^d\) and the computation quickly becomes unfeasible.
In the case \(d=2\) an algorithm based on Pseudocode 2 can still be implemented, with certain modifications. A full search of the lattice \(L_N\) is not feasible anymore. Instead we implemented an algorithm that finds zeros of D successively: Having computed a zero \(z_0\), increment \(\theta \) and choose a number of points \(\{w_1,\dots ,w_k\}\) and for each \(i\in \{1,\dots ,k\}\) minimize D using \(w_i\) as starting point. Increment \(\theta \) again and continue. A Matlab implementation of this idea is on https://github.com/frank-roesler/PeriodicSpectra2d.
Figure 7 shows the output of our algorithm for the potential
Note that V has a finite Fourier expansion. This makes the matrix \(V^{\text {appr},n}\) sparse and speeds up the computation. More generic potentials can only be computed with significantly reduced accuracy.
Notes
\(^{*}\)These known BV bounds mean that for any \(R>0\) one has a priori knowledge of the total variation of V on the ball \({B_{R}}\) of radius R centered at the origin.
The exponent \(\alpha \) is chosen such that \(N^{1+\nicefrac {1}{d}}/n^{1-\nicefrac dp}\le N^{\nicefrac 1p - \nicefrac 1d}\) (cf. Lemma (4.6)).
References
Avila, A., Jitomirskaya, S.: The Ten Martini Problem. Ann. Math., 303–342 (2009)
Becker, S., Hansen, A.C.: Computing solutions of Schrödinger equations on unbounded domains-On the brink of numerical algorithms. arXiv: 2010.16347 e-prints (2020)
Beer, G.: Topologies on closed and closed convex sets. Mathematics and its Applications, vol. 268. Kluwer Academic Publishers Group, Dordrecht (1993)
Ben-Artzi, J., Colbrook, M.J., Hansen, A.C., Nevanlinna, O., Seidel, M.: Computing spectra—On the solvability complexity index hierarchy and towers of algorithms. arXiv: 1508.03280 e-prints (2015)
Ben-Artzi, J., Hansen, A.C., Nevanlinna, O., Seidel, M.: New barriers in complexity theory: on the solvability complexity index and the towers of algorithms. Comptes Rendus Math. 353(10), 931–936 (2015)
Ben-Artzi, J., Marletta, M., Rösler, F.: Computing scattering resonances. arXiv e-prints (2020)
Ben-Artzi, J., Marletta, M., Rösler, F.: Computing the sound of the sea in a seashell. Found. Comput. Math. (2020)
Birman, M.V.: On the spectrum of singular boundary-value problems. Math. Sb. 55(97), 125–174 (1961)
Brezis, H.: Functional analysis, Sobolev spaces and partial differential equations. Universitext, Springer, New York (2011)
Coddington, E.A., Levinson, N.: Theory of ordinary differential equations. McGraw-Hill Book Company Inc, New York-Toronto-London (1955)
Colbrook, M.J.: On the computation of geometric features of spectra of linear operators on Hilbert spaces. arXiv: 1908.09598 e-prints (2019)
Colbrook, M.J., Hansen, A.C.: The foundations of spectral computations via the Solvability Complexity Index hierarchy: part I. arXiv: 1908.09592 e-print (2019)
Colbrook, M.J., Roman, B., Hansen, A.C.: How to compute spectra with error control. Phys. Rev. Lett. 122(25), 250201 (2019)
Dunford, N., Schwartz, J.T.: Linear operators. Part II. Wiley Classics Library. John Wiley & Sons, Inc., New York (1988). Spectral theory. Selfadjoint operators in Hilbert space, with the assistance of Bade, W.G., and Bartle, R.G., Reprint of the 1963 original. A Wiley-Interscience Publication
Eastham, M.S.P.: The spectral theory of periodic differential equations. Texts in Mathematics (Edinburgh). Scottish Academic Press, Edinburgh; Hafner Press, New York (1973)
Figotin, A., Kuchment, P.: Band-gap structure of the spectrum of periodic Maxwell operators. J. Stat. Phys. 74(1–2), 447–455 (1994)
Figotin, A., Kuchment, P.: Band-gap structure of spectra of periodic dielectric and acoustic media. I. Scalar model. SIAM J. Appl. Math. 56(1), 68–88 (1996)
Figotin, A., Kuchment, P.: Band-gap structure of spectra of periodic dielectric and acoustic media. II. Two-dimensional photonic crystals. SIAM J. Appl. Math. 56(6), 1561–1620 (1996)
Giani, S., Graham, I.G.: Adaptive finite element methods for computing band gaps in photonic crystals. Numer. Math. 121(1), 31–64 (2012)
Gohberg, I., Goldberg, S., Kaashoek, M. A.: Laurent and Toeplitz Operators. In Basic Classes of Linear Operators, pages 135–170. Birkhäuser Basel, Basel, (2003)
Hansen, A.C.: On the solvability complexity index, the \(n\)-pseudospectrum and approximations of spectra of operators. J. Am. Math. Soc. 24(1), 81–124 (2011)
Helffer, B., Mohamed, A.: Asymptotic of the density of states for the Schrödinger operator with periodic electric potential. Duke Math. J. 92(1), 1–60 (1998)
Hoang, V., Plum, M., Wieners, C.: A computer-assisted proof for photonic band gaps. Z. Angew. Math. Phys. 60(6), 1035–1052 (2009)
Last, Y.: Spectral theory of sturm-liouville operators on infinite intervals: a review of recent developments. Sturm-Liouville Theory, pp. 99–120 (2005)
Mathieu, É.: Mémoire sur le mouvement vibratoire d’une membrane de forme elliptique. J. Math. Pures Appl. 13, 137–203 (1868)
Morse, P.M., Feshbach, H.: Methods of theoretical physics. Am. J. Phys. 22(6), 410–413 (1954)
Parnovski, L.: Bethe-Sommerfeld conjecture. In: Ann. Henri Poincaré, vol. 9, no. 3, pp. 457–508 (2008)
Popov, V.N., Skriganov, M.M.: Remark on the spectrum structure of the two-dimensional Schrödinger operator with the periodic potential. Zap. Nauchnykh Sem. POMI 109, 131–133 (1981)
Reed, M., Simon, B.: Methods of modern mathematical physics. IV. Analysis of operators. Academic Press [Harcourt Brace Jovanovich, Publishers], New York, London (1978)
Reed, M.,Simon, B.: Methods of modern mathematical physics. I. Functional Analysis. Academic Press, Inc. [Harcourt Brace Jovanovich, Publishers], New York, 2nd edn. (1980)
Rofe-Beketov, F.S.: On the spectrum of non-selfadjoint differential operators with periodic coefficients. Dokl. Akad. Nauk SSSR 152, 1312–1315 (1963)
Rösler, F.: On the solvability complexity index for unbounded selfadjoint and schrödinger operators. Integral Equ. Oper. Theory 91(6), 54 (2019)
Schwinger, J.: On the bound states of a given potential. Proc. Nat. Acad. Sci. U.S.A. 47, 122–129 (1961)
Shin, K.C.: On the shape of spectra for non-self-adjoint periodic Schrödinger operators. J. Phys. A 37(34), 8287–8291 (2004)
Simon, B.: Notes on infinite determinants of Hilbert space operators. Adv. Math. 24(3), 244–273 (1977)
Skriganov, M.M.: The spectrum band structure of the three-dimensional Schrödinger operator with periodic potential. Invent. Math. 80(1), 107–121 (1985)
Sommerfeld, A., Bethe, H.: Elektronentheorie der Metalle. In: Aufbau Der Zusammenhängenden Materie. Volume 19 of Heidelberger Taschenbücher, pp. 333–622. Springer, Berlin Heidelberg, Berlin, Heidelberg (1933)
Teschl, G.: Jacobi operators and completely integrable nonlinear lattices. Mathematical surveys and monographs . American Mathematical Society (2000)
Trefethen, L.N., Embree, M.: Spectra and pseudospectra. The behavior of nonnormal matrices and operators. Princeton University Press, Princeton, NJ (2005)
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
JBA acknowledges support from an Engineering and Physical Sciences Research Council Fellowship EP/N020154/1. MM acknowledges support from an Engineering and Physical Sciences Research Council Grant EP/T000902/1. FR acknowledges support from the European Union’s Horizon 2020 Research and Innovation Programme under the Marie Skłodowska-Curie grant agreement No 885904.
The authors thank the anonymous referees for their detailed comments and remarks which helped improve both the quality and presentation of this paper.
Appendices
Appendix A. The Hausdorff and Attouch-Wets distances
Since the Attouch-Wets distance is not well-known, we provide the basic definition here, and use the opportunity to remind the reader of the Hausdorff distance as well. For further information we refer to [3].
Definition A.1
(Hausdorff distance for subsets of \({\mathbb {R}}^d\)) Let \(A,B\subset {\mathbb {R}}^d\) be two non-empty bounded sets. Their Hausdorff distance is
Definition A.2
(Attouch-Wets distance for subsets of \({\mathbb {R}}^d\)) Let \(A,B\subset {\mathbb {R}}^d\) be two non-empty (possibly unbounded) sets. Their Attouch-Wets distance is
Note that if A, B are bounded, then \(d_{\mathrm {AW}}\) and \(d_\mathrm {H}\) are equivalent. Furthermore, it can be shown (cf. [3, Ch. 3]) that
Appendix B. Floquet-Bloch transform
The Floquet-Bloch transform for Schrödinger operators with periodic potentials is well-studied. The following lemma is a collection of results in [29] (below, \({\mathcal {S}}({\mathbb {R}}^d)\) denotes the Schwartz space of rapidly decaying functions).
Lemma B.1
( [29, Ch. XIII.16]) For \(f\in {\mathcal {S}}({\mathbb {R}}^d)\) and \(\theta \in [0,2\pi ]^d\), define the map
Then the following hold.
-
(i)
\({\mathcal {U}}_\theta \) extends uniquely to a bounded operator on \(L^2({\mathbb {R}}^d)\);
-
(ii)
For any \(f\in L^2({\mathbb {R}}^d)\), \({\mathcal {U}}_\theta f\) is 1-periodic;
-
(iii)
The map
$$\begin{aligned} {\mathcal {U}}: L^2({\mathbb {R}}^d)&\rightarrow \int _{[0,2\pi ]^d}^\oplus L^2((0,1)^d)\, d\theta \\ f&\mapsto ({\mathcal {U}}_\theta f)_{\theta \in [0,2\pi ]^d} \end{aligned}$$is unitary;
-
(iv)
The inverse \({\mathcal {U}}^{-1}\) is given by
$$\begin{aligned} ({\mathcal {U}}^{-1}g)(x+n) = (2\pi )^{-\frac{d}{2}} \int _{[0,2\pi ]^d} g(x,\theta )e^{-\i n \theta } \,d\theta ; \end{aligned}$$ -
(v)
For \(H=-\Delta +V\), \(V\in \Omega _p\) one has
$$\begin{aligned} {\mathcal {U}}H {\mathcal {U}}^{-1}&= \int _{[0,2\pi ]^d}^\oplus H(\theta )\,d\theta , \end{aligned}$$where
$$\begin{aligned} H(\theta )&= -(\nabla +\i \theta )^2 + V , \nonumber \\ {\text {dom}}(H(\theta ))&= H^2_{\mathrm {per}}\big ((0,1)^d\big ) \qquad \forall \theta \in [0,2\pi ]^d. \end{aligned}$$(B.1)Moreover, the map \(\theta \mapsto H(\theta )\) is analytic and
$$\begin{aligned} \sigma (H) = \bigcup _{\theta \in [0,2\pi ]^d} \sigma (H(\theta )). \end{aligned}$$(B.2)
The spectral identity (B.2) follows from the unitarity of \({\mathcal {U}}\) and a straightforward calculation, noting that \(\bigcup _{\theta \in [0,2\pi ]^d} \sigma (H(\theta ))\) is closed by analyticity and periodicity in \(\theta \).
Appendix C. Proof of proposition 4.9
(i) Let \(z_N\in \Gamma ^{z_0}_N(V)\) and assume that \(z_N\rightarrow z\) for some \(z\in {\mathbb {C}}\). We need to show that \(z\in \sigma (-\Delta +V)\). Since \(z_N\in \Gamma _N^{z_0}(V)\), we have \(\det \nolimits _{\lceil p\rceil }\big (I-P_NK^{\mathrm {appr}}_{n(N)}(z,\theta _{i_N})P_N\big )\rightarrow 0\) for some sequence \(\{\theta _{i_N}\}_{N\in {\mathbb {N}}}\subset [0,2\pi ]^d\). Then there exists a convergent subsequence (again denoted by \(\theta _{i_{N}}\)) converging to some some \(\theta \in [0,2\pi ]^d\). We first note that due to Theorem 4.7 we have the N-independent determinant error bound
(note that \(\det _{\lceil p\rceil }(I-K(z_{N},\theta _{i_{N}}))\) is well-defined by Lemma 4.2 because \(p>d\)). We note that the exponential factor in the last line is uniformly bounded in N by some explicit constant \(c_{\text {exp}}\) (cf. Lemmas 4.2 and 4.6). Using Lemma 4.6 with our choice \(n(N)=N^{\lceil \alpha \rceil }\) and \(s=p\), we obtain
Note that each term on the right hand side has a negative power of N and therefore tends to 0 as \(N\rightarrow +\infty \). Note further that \(C_{z_N}=\sup _{j\in {\mathbb {N}}}\big | 1-\frac{z_N-1}{|k_j|^2} \big |^{-1}\) remains bounded as \(N\rightarrow +\infty \) by our assumption that \(z\notin \bigcup _j(1+|k_j|^2)\). Therefore we have
To conclude the proof of (i), if we note that continuity in \((z,\theta )\) and periodicity in \(\theta \) imply
Hence we have \(1\in \sigma (K(z,\theta ))\) and thus \(z\in \sigma (-\Delta +V)\).
(ii) Conversely, denote \(H:=-\Delta +V\) and let \(z\in \sigma (H)\cap Q_{z_0}\setminus \bigcup _j(1+|k_j|^2)\). We need to show that there exists a sequence \(z_N\in \Gamma _N^{z_0}(V)\) such that \(z_N\rightarrow z\). In fact, let \(z_N\in L_N\) be any sequence with \(|z-z_N|<\frac{1}{N}\) (such a sequence exists by the definition of \(L_N\)). Since \(z\in \sigma (H)\) there exists \(\theta \in [0,2\pi ]^d\) with \(z\in \sigma (H(\theta ))\), cf. Lemma B.1. Consequently
Consider some sequence \(\theta _N\in \Theta _N\) with \(|\theta _N-\theta |<\frac{1}{N}\), the existence of which is guaranteed by the definition of \(\Theta _N\). By Lemma 4.3 and Theorem 4.7 there exists \(C>0\) with
Thus, using (C.2) and the main error estimate (C.1), we obtain
To keep the notation simple, we collect all constants independent of N into a single constant \(G=G(p,d,V)\) (note that \(|\theta _N|\le 2\pi \sqrt{d}\) for all N). This gives
where the last line holds for N large enough (note that \(\frac{1}{2d} - \frac{1}{2p}>0\) by our assumption \(p>d\) and that \(C_{z_N}\) remains bounded as \(N\rightarrow +\infty \) by our assumptions on z). Comparing (C.3) to (4.15), we see that \(z_N\in \Gamma _N^{z_0}(V)\) for N large enough.
(iii) The proof is similar to that of (ii). Let \(z\in \left( \sigma (-\Delta +V)\cap Q_{z_0}\right) \setminus B_\delta \big (\bigcup _j(1+|k_j|^2)\big )\) be an arbitrary point and let \(z_N\in L_N\) be any sequence with \(|z-z_N|<\frac{1}{N}\). Since \(z\in \sigma (H)\) there exists \(\theta \in [0,2\pi ]^d\) with \(z\in \sigma (H(\theta ))\), cf. Lemma B.1. Consequently
Consider some sequence \(\theta _N\in \Theta _N\) with \(|\theta _N-\theta |<\frac{1}{N}\), the existence of which is guaranteed by the definition of \(\Theta _N\). By Corollary 4.4 and Theorem 4.7 this implies
where \(C_{\delta ,z_0}^\text {Lip} = c_{\text {exp}}\delta ^{-2}48(|z_0|+1)^2\frac{sd}{s-d}\frac{3p-d}{p-d}\). Turning to the finite approximation \(K^{\mathrm {appr}}_n\), this gives
where G denotes the explicit and computable constant introduced in step (ii) above and we have used the bound \(C_{z_N}\le (|z_0|+1)\delta ^{-1}\). The condition \(z_N\in \Gamma _N(V)\) is thus implied by
which immediately implies the assertion, noting that \(|z-z_N|<\frac{1}{N}<\varepsilon \) as soon as \(N>\varepsilon ^{-1}\).
\(\square \)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Ben-Artzi, J., Marletta, M. & Rösler, F. Universal algorithms for computing spectra of periodic operators. Numer. Math. 150, 719–767 (2022). https://doi.org/10.1007/s00211-021-01265-w
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00211-021-01265-w