On the Computation of Geometric Features of Spectra of Linear Operators on Hilbert Spaces

Colbrook, Matthew J.

doi:10.1007/s10208-022-09598-0


Open access
Published: 01 December 2022


Foundations of Computational Mathematics


Abstract

Computing spectra is a central problem in computational mathematics with an abundance of applications throughout the sciences. However, in many applications gaining an approximation of the spectrum is not enough. Often it is vital to determine geometric features of spectra such as Lebesgue measure, capacity or fractal dimensions, different types of spectral radii and numerical ranges, or to detect gaps in essential spectra and the corresponding failure of the finite section method. Despite new results on computing spectra and the substantial interest in these geometric problems, there remain no general methods able to compute such geometric features of spectra of infinite-dimensional operators. We provide the first algorithms for the computation of many of these long-standing problems (including the above). As demonstrated with computational examples, the new algorithms yield a library of new methods. Recent progress in computational spectral problems in infinite dimensions has led to the solvability complexity index (SCI) hierarchy, which classifies the difficulty of computational problems. These results reveal that infinite-dimensional spectral problems yield an intricate infinite classification theory determining which spectral problems can be solved and with which type of algorithm. This is very much related to S. Smale’s comprehensive program on the foundations of computational mathematics initiated in the 1980s. We classify the computation of geometric features of spectra in the SCI hierarchy, allowing us to precisely determine the boundaries of what computers can achieve (in any model of computation) and prove that our algorithms are optimal. We also provide a new universal technique for establishing lower bounds in the SCI hierarchy, which both greatly simplifies previous SCI arguments and allows new, formerly unattainable, classifications.


1 Introduction

This paper resolves open computational spectral problems related to geometric features of spectra of operators. In other words, we consider the following problem:

Are there algorithms that, given a bounded^{Footnote 1} operator $A\in {\mathcal {B}}(l^2({\mathbb {N}}))$, approximate key geometric features (e.g. spectral gaps, notions of size and capacity, measures, and topological features such as fractal dimensions) of the set $\textrm{Sp}(A)$ from a matrix representation of A?

To answer this question, we use the newly established solvability complexity index (SCI) hierarchy [18, 51, 91], a classification tool that determines the boundaries of what is computationally possible. Classifying spectral problems and providing a library of optimal algorithms^{Footnote 2} remains largely uncharted territory in the foundations of computational mathematics. In exploring this territory, there will, necessarily, have to be many different types of algorithms, as different structures on the various classes of operators and different spectral properties require different techniques.

A famous example of the above question is the almost Mathieu operator on $l^2({\mathbb {Z}})$ (see Sect. 4.4):

$$\begin{aligned} (H_{\alpha }x)_n=x_{n-1}+x_{n+1}+2\lambda \cos (2\pi n\alpha )x_n, \end{aligned}$$

which induces the Hofstadter butterfly [92]. The almost Mathieu operator plays an important role in physics [104], arising in the study of the quantum Hall effect [160], and has become a laboratory for exploring the spectral properties of ergodic Schrödinger operators [95]. When $\alpha $ is irrational, the Lebesgue measure of the spectrum is $4\left| 1-\left| \lambda \right| \right| $. This formula was conjectured based on the numerical work of Aubry and André [8] and became one of B. Simon’s problems for the twenty-first century [146]. It was later proven by Avila and Krikorian [11]. Similarly, M. Kac’s “Ten Martini Problem”, that the spectrum is a Cantor set for all irrational $\alpha $ and $\lambda >0$, was conjectured by Azbel [13] and also became one of B. Simon’s problems. This problem attracted a host of numerical and analytical work (see the summary in [104]), before being proven by Avila and Jitomirskaya [9]. In both of these examples, we see a crucial interplay between computation, conjecture, and mathematical proof. The above geometric features of spectra play an important role in the physics of the underlying quantum system [90, 99, 100, 147]. The almost Mathieu operator is by no means unique in this regard, and there is a growing literature on computational studies of geometric features of spectra in diverse areas of physics [14, 68, 83, 94, 103, 106, 110, 120, 125, 133, 138, 139, 156, 161].
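As a purely illustrative sketch (not one of the algorithms of this paper), the following Python snippet builds a finite section of $H_{\alpha }$ on the sites $n=-N,\ldots ,N$ and checks that its eigenvalues obey the elementary bound $\Vert H_{\alpha }\Vert \le 2+2|\lambda |$. The function name `almost_mathieu_section`, the choice of $\alpha $ as the inverse golden ratio, the value of $\lambda $ and the truncation size are all assumptions made for the example.

```python
import numpy as np

def almost_mathieu_section(alpha, lam, N):
    """Finite section of H_alpha on l^2(Z), restricted to sites n = -N..N.

    Hypothetical helper for illustration only.
    """
    n = np.arange(-N, N + 1)
    # Diagonal potential 2*lambda*cos(2*pi*n*alpha)
    H = np.diag(2.0 * lam * np.cos(2.0 * np.pi * n * alpha))
    # Nearest-neighbour hopping terms x_{n-1} + x_{n+1}
    off = np.ones(2 * N)
    H += np.diag(off, 1) + np.diag(off, -1)
    return H

alpha = (np.sqrt(5.0) - 1.0) / 2.0  # assumed irrational frequency
lam = 0.5                           # assumed coupling constant
H = almost_mathieu_section(alpha, lam, N=200)

eigs = np.linalg.eigvalsh(H)        # H is real symmetric
bound = 2.0 + 2.0 * abs(lam)        # elementary norm bound for H_alpha
print(eigs.min(), eigs.max(), bound)
```

Eigenvalues of finite sections can include spurious points inside spectral gaps, so this output should be read as exploratory only and not as a rigorous approximation of the spectrum or of its Lebesgue measure.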

However, there is a current lack of rigorous computational theory and convergence analysis, and no known algorithms can tackle general cases. Moreover, the foundations of computation (i.e. what is and what is not computationally possible) for computing geometric features of spectra are almost entirely unexplored. We solve these open problems and others by providing algorithms that compute geometric features of spectra and by classifying the computational problems in the SCI hierarchy.

1.1 The SCI Hierarchy

The SCI hierarchy has recently been used to resolve the problem of computing spectra of general bounded operators in infinite dimensions [18, 91] and is now being used to explore the foundations of computation in many diverse areas of mathematics [2, 15, 16, 19,20,21,22,23, 30, 52, 53, 55, 57, 59, 60, 64, 140, 141, 166].^{Footnote 3} Whilst for some classes of operators one can compute spectra with error control [54, 60, 64], a potentially surprising consequence is that, for general operators, one needs several successive limits to compute the spectrum. Since traditional approaches are dominated by techniques based on one limit, this explains why many computational spectral problems remain unsolved and opens the door to an infinite classification theory. Moreover, this phenomenon is not just restricted to spectral problems but is shared by other areas of computational mathematics. An example is S. Smale’s problem of root-finding of polynomials with rational maps [149], which also requires several successive limits as established by McMullen [115, 116] and Doyle and McMullen [70]. These results can be expressed in terms of the SCI hierarchy [18], which generalises Smale’s seminal work [148, 150] with Blum et al. [28, 29, 66], and his program on the foundations of scientific computing and the existence of algorithms. Many other problems in the foundations of computations, such as the work by Weinberger [167], can also be viewed in the context of the SCI hierarchy.

The SCI hierarchy is further motivated by computer-assisted proofs. Computer-assisted proofs are rapidly becoming an essential part of modern mathematics [86] and, perhaps surprisingly, non-computable problems can be used in computer-assisted proofs. Examples include the recent proof of Kepler’s conjecture (Hilbert’s 18th problem) [87, 88] on optimal packings of 3-spheres, led by T. Hales, and the Dirac–Schwinger conjecture on the asymptotic behaviour of ground states of certain Schrödinger operators, proven in a series of papers by Fefferman and Seco [72,73,74,75,76,77,78,79,80]. Both of these proofs rely on computing non-computable problems. This apparent paradox can be explained by the SCI hierarchy (the $\Sigma ^A_1$ and $\Pi _1^A$ classes described below become available for computer-assisted proofs); Hales, Fefferman and Seco implicitly prove $\Sigma ^A_1$ classifications in the SCI hierarchy in their papers. Some of the problems we consider also lie in $\Sigma ^A_1\cup \Pi _1^A$, meaning that they can be used for computer-assisted proofs.

1.2 The Problems Addressed in this Paper

The algorithms we provide are sharp in the SCI hierarchy, meaning that they realise the boundaries of what computers can achieve. Table 1 provides a summary of the main SCI classifications of this paper. The main theorems are contained in Sect. 3, including further motivations and classifications for different classes of operators. We provide resolutions to the following problems:

(i)
Computing spectral radii, essential spectral radii, polynomial operator norms and capacity of spectra. The spectral radius is perhaps the most basic geometric property of spectra and arises in stability analysis. We show that computing the spectral radius is high up in the SCI hierarchy for non-normal operators. In fact, it has the same classification in the SCI hierarchy for general bounded operators as that of computing the spectrum itself. Classifications are given for different types of operators (e.g. known column decay, control on resolvent norms) and also for the essential spectral radius. In many cases, the problem of computing polynomial operator norms is easier in the sense of the SCI hierarchy. We also consider the problem of computing the logarithmic capacity of the spectrum, following the work of Halmos [89], which has applications in orthogonal polynomials, approximation theory and when studying the convergence of Krylov methods (see, for example, the work of Nevanlinna [121,122,123] and Miekkala and Nevanlinna [117]).
(ii)
Computing essential numerical ranges, gaps in essential spectra, and determining whether spectral pollution occurs on sets. We provide classification results for the essential numerical range, which also hold in the case of unbounded operators. In connection with computing spectra, there has been a substantial effort in studying the finite section method and locating gaps in essential spectra of operators (see the discussion in Sect. 3.4). When using the finite section method to approximate spectra of self-adjoint operators, spurious eigenvalues, known as spectral pollution, can occur anywhere within these gaps. Paradoxically, we show that determining if spectral pollution occurs on a given set is strictly harder in the sense of the SCI hierarchy than computing the spectrum itself. Hence, computing a failure flag for the finite section method is, in a certain sense, strictly harder than solving the original problem for which it was designed. Moreover, we establish the SCI of detecting gaps in essential spectra of self-adjoint operators, a problem that arises in areas such as perturbation theory and defect models.
(iii)
Computing Lebesgue measure of spectra and pseudospectra, and determining if the spectrum is Lebesgue null. An important property of the spectrum is its Lebesgue measure, with recent progress in the field of Schrödinger operators with random or almost periodic potentials [9, 11, 12, 17, 135]. If the spectrum of an operator is Lebesgue null, then this implies the absence of absolutely continuous spectra,^{Footnote 4} which is related to transport properties if the operator represents a Hamiltonian. Whilst results are known for specific one-dimensional examples such as the almost Mathieu operator [11] or the Fibonacci Hamiltonian [154], very little is known in the general case or higher dimensions. This is reflected by the difficulty of performing rigorous numerical studies, despite many examples studied in the physics literature (see the references in [10, 24, 147]). We provide the first algorithms for computing the Lebesgue measure of spectra and pseudospectra, and determining whether the spectrum is Lebesgue null, for many different classes of operators.
(iv)
Computing fractal dimensions of spectra. Fractal dimensions of spectra are important in many applications. For example, in quantum mechanics, they lead to upper bounds on the spreading of wavepackets and are related to time-dependent quantities associated with wave functions [90, 99, 100]. Fractal spectra appear in a wide variety of contexts, such as exciting new results in multilayer materials (e.g. bilayer graphene) [68, 83, 94, 133], strained materials [120, 139] or quasicrystals [14, 103, 106, 156]. Another well-studied area where fractal spectral properties appear is optics [125, 138], following the analytical and numerical work of Berry and coauthors [25,26,27]. Despite the physical importance of fractal dimensions, analytical results are known only for a limited number of specific models. Moreover, there are currently no algorithms for computing fractal dimensions of spectra for general operators, or even tridiagonal self-adjoint operators. We provide the first algorithms for computing the box-counting and Hausdorff dimensions of spectra for many different classes of operators.

1.3 Contributions to the SCI Hierarchy Itself

Our final contribution is a new tool to prove lower bounds (impossibility results) in the SCI hierarchy. This is crucial for some of the classifications of the above problems and holds regardless of the model of computation. We show that for a certain special class of combinatorial problems, the SCI hierarchy is equivalent to the Baire hierarchy from descriptive set theory. (This equivalence does not hold in general.) By embedding these combinatorial problems into spectral problems,^{Footnote 5} we provide the first technique for dealing with problems that have SCI greater than three and also greatly simplify the proofs of results lower down in the SCI hierarchy. However, it should be stressed that this is not a paper on descriptive set theory or mathematical logic. Our discussion is entirely self-contained and written for a wide audience from a primarily computational background.

1.4 Outline of Paper

In Sect. 2, we provide a brief summary of the SCI hierarchy and define the classes of operators for the interpretation of Table 1 and the main results. A detailed discussion of the SCI hierarchy is delayed until Sect. 5.1. In Sect. 3, we summarise our main results on the classification of computational spectral problems. Computational examples are then given in Sect. 4. For example, we provide numerical evidence that a portion of the spectrum of the graphical Laplacian on an infinite Penrose tile is Lebesgue null and fractal, with a fractal dimension of approximately 0.8, and that the whole spectrum has a logarithmic capacity of approximately 2.26. Mathematical preliminaries, including definitions of the SCI hierarchy and the new tool to provide lower bounds in the SCI hierarchy, are presented in Sect. 5. Proofs of our results are given in Sects. 6–9. To make the paper self-contained, we include a short appendix on the results/algorithms of [64], which are used in some of our proofs. Pseudocode for many of the new algorithms is provided in “Appendix B”.

Table 1 Summary of the main results for the readable information $\Lambda _1$ consisting of matrix values


2 Essentials of the SCI Hierarchy and Preliminary Definitions

2.1 A Brief Introduction to the SCI Hierarchy

2.1.1 Description of the SCI Hierarchy

First, we define a computational problem. The basic objects of a computational problem are:

$\Omega $, called the domain,
$\Lambda $, a set of complex-valued functions on $\Omega $, called the evaluation set,
$({\mathcal {M}},d)$, a metric space,
$\Xi :\Omega \rightarrow {\mathcal {M}}$ the problem function.

The set $\Omega $ is the set of objects that give rise to our computational problems, the goal being to compute the problem function $\Xi : \Omega \rightarrow {\mathcal {M}}$. The set $\Lambda $ is the collection of functions that provide us with the information we are allowed to read as input to the algorithm. This leads to the following definition:

Definition 2.1

(Computational problem) Given a domain $\Omega $; an evaluation set $\Lambda $, such that for any $A_1, A_2 \in \Omega $, $A_1 = A_2$ if and only if $f(A_1) = f(A_2)$ for all $f \in \Lambda $; a metric space ${\mathcal {M}}$; and a problem function $\Xi :\Omega \rightarrow {\mathcal {M}}$, we call the collection $\{\Xi ,\Omega ,{\mathcal {M}},\Lambda \}$ a computational problem.

The definition of a computational problem is deliberately general. The SCI of a computational problem is the smallest number of successive limits needed to compute the solution to the problem. We call a corresponding suitably indexed family of algorithms a ‘tower of algorithms’. In addition, we will use finer notions of error control. For example, consider the case that $({\mathcal {M}},d)$ is the space of non-empty compact subsets of ${\mathbb {C}}$, equipped with the Hausdorff metric. Then, the SCI hierarchy [18, 51] can be described as follows.

The SCI hierarchy Given a collection ${\mathcal {C}}$ of computational problems,

(i)
$\Delta ^{\alpha }_0 = \Pi ^{\alpha }_0 = \Sigma ^{\alpha }_0$ is the set of problems that can be computed in finite time (the SCI $=0$). In other words, $\exists $ an algorithm $\Gamma $ such that $\Gamma (A)=\Xi (A), \forall A\in \Omega $.
(ii)
$\Delta ^{\alpha }_1$ is the set of problems that can be computed using one limit (the SCI $=1$) with control of the error, i.e. $\exists $ a sequence of algorithms $\{\Gamma _n\}$ such that $d(\Gamma _n(A), \Xi (A)) \le 2^{-n}, \, \forall A \in \Omega $.
(iii)
$\Sigma ^{\alpha }_1$: We have $\Delta ^{\alpha }_1 \subset \Sigma ^{\alpha }_1 \subset \Delta ^{\alpha }_2 $ and $\Sigma ^{\alpha }_1$ is the set of problems for which $\exists $ a sequence of algorithms $\{\Gamma _n\}$ such that $\forall A \in \Omega $ we have $\Gamma _n(A) \rightarrow \Xi (A)$ as $n \rightarrow \infty $. Moreover, $\sup _{z\in \Gamma _n(A)}\textrm{dist}(z,\Xi (A))\le 2^{-n}$, where $\textrm{dist}(x,S)$ denotes the Euclidean distance of x to S.
(iv)
$\Pi ^{\alpha }_1$: We have $\Delta ^{\alpha }_1 \subset \Pi ^{\alpha }_1 \subset \Delta ^{\alpha }_2 $ and $\Pi ^{\alpha }_1$ is the set of problems for which $\exists $ a sequence of algorithms $\{\Gamma _n\}$ such that $\forall A \in \Omega $ we have $\Gamma _n(A) \rightarrow \Xi (A)$ as $n \rightarrow \infty $. Moreover, $\sup _{z\in \Xi (A)}\textrm{dist}(z,\Gamma _n(A))\le 2^{-n}$.
(v)
$\Delta ^{\alpha }_2$ is the set of problems that can be computed using one limit (SCI $=1$) without error control, i.e. $\exists $ a sequence of algorithms $\{\Gamma _n\}$ such that $\lim _{n\rightarrow \infty }\Gamma _n(A) = \Xi (A), \, \forall A \in \Omega $.
(vi)
$\Delta ^{\alpha }_{m+1}$, for $m \in {\mathbb {N}}$, is the set of problems that can be computed by using m successive limits (SCI $\le m$), i.e. $\exists $ a family of algorithms $\{\Gamma _{n_m, \ldots , n_1}\}$ such that
$$\begin{aligned} \lim _{n_m \rightarrow \infty }\cdots \lim _{n_1\rightarrow \infty }\Gamma _{n_m,\ldots , n_1}(A) = \Xi (A), \quad \, \forall A \in \Omega . \end{aligned}$$
(vii)
$\Sigma ^{\alpha }_{m}$ is the set of problems in $\Delta ^{\alpha }_{m+1}$ such that, letting $\Gamma _{n_m}(A)=\lim _{n_{m-1} \rightarrow \infty }\cdots \lim _{n_1\rightarrow \infty }\Gamma _{n_m,\ldots , n_1}(A)$, $\sup _{z\in \Gamma _{n_m}(A)}\textrm{dist}(z,\Xi (A))\le 2^{-n_m}$. In other words, computing the mth limit is a $\Sigma ^{\alpha }_1$ problem.
(viii)
$\Pi ^{\alpha }_{m}$ is the set of problems in $\Delta ^{\alpha }_{m+1}$ such that $\sup _{z\in \Xi (A)}\textrm{dist}(z,\Gamma _{n_m}(A))\le 2^{-n_m}$. In other words, computing the mth limit is a $\Pi ^{\alpha }_1$ problem.
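The two one-sided error conditions in the $\Sigma ^{\alpha }_1$ and $\Pi ^{\alpha }_1$ classes can be made concrete with finite sets. The following minimal Python sketch uses hypothetical finite sets `target`, `inner` and `outer` in place of $\Xi (A)$ and $\Gamma _n(A)$; the function names are assumptions made for illustration.

```python
def one_sided(X, Y):
    """sup over x in X of dist(x, Y), for finite sets of complex numbers."""
    return max(min(abs(x - y) for y in Y) for x in X)

def hausdorff(X, Y):
    """Hausdorff distance between two finite non-empty sets."""
    return max(one_sided(X, Y), one_sided(Y, X))

target = [0.0, 1.0, 2.0]       # stands in for Xi(A)
inner = [0.0, 1.0]             # output contained in the target
outer = [0.0, 1.0, 2.0, 3.0]   # output containing the target

# Sigma_1-type control: every output point is close to the target.
sigma_error_inner = one_sided(inner, target)
# Pi_1-type control: every target point is close to the output.
pi_error_outer = one_sided(target, outer)

print(sigma_error_inner, pi_error_outer)
print(hausdorff(inner, target), hausdorff(outer, target))
```

In this toy setting `inner` satisfies the $\Sigma _1$-type bound with zero error although it misses the point 2, and `outer` satisfies the $\Pi _1$-type bound with zero error although it contains the extra point 3. Neither is close to `target` in the full Hausdorff metric, which is the stronger control demanded by $\Delta ^{\alpha }_1$.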

Schematically, the SCI hierarchy can be viewed in the following way:

A visual demonstration of these classes is shown in Fig. 1. For the description for decision problems, see Sect. 5.1. The $\Sigma _1^{\alpha }$ and $\Pi _1^{\alpha }$ classes become crucial in computer-assisted proofs (see below).

Remark 2.2

(Computability, not complexity) It is important to note that (despite its name) the SCI hierarchy is a hierarchy for classifying computability, not complexity. Most computational spectral problems of interest are $\notin \Delta _1$ in the SCI hierarchy, and complexity theory only makes sense for problems in $\Delta _1$. Hence, it is impossible to build a complexity theory for most infinite-dimensional spectral problems. The scientific community computes with non-computable problems ($\notin \Delta _1$) on a daily basis (e.g. in quantum mechanics). This also happens in high-profile computer-assisted proofs (see below). $\square $

2.1.2 The Model of Computation $\alpha $

The $\alpha $ in the superscript indicates the model of computation, which is described in Sect. 5.1. For $\alpha = G$, the underlying algorithm is general (see Definition 5.1) and can use any tools at its disposal. The reader may think of a Blum–Shub–Smale (BSS) machine or a Turing machine with access to any oracle, although a general algorithm is even more powerful. For $\alpha = A$, by contrast, only arithmetic operations and comparisons are allowed. In particular, if rational inputs are considered, the algorithm is a Turing machine, and in the case of real inputs, a BSS machine. Hence, a result of the form

$$\begin{aligned} \notin \Delta _k^G \text { is stronger than } \notin \Delta _k^A. \end{aligned}$$

Indeed, a $\notin \Delta _k^G$ result is universal and holds for any model of computation. Moreover,

$$\begin{aligned} \in \Delta _k^A \text { is stronger than } \in \Delta _k^G, \end{aligned}$$

and similarly for the $\Pi _k$ and $\Sigma _k$ classes. In this paper, we prove lower bounds for $\alpha = G$ and upper bounds for $\alpha = A$, thus obtaining the strongest results. Remark 5.12 discusses further how the model of computation is of less importance in infinite dimensions.

2.1.3 Computer-Assisted Proofs

The class $\Delta _1^A$ consists precisely of those problems that are computable according to Turing’s definition of computability (i.e. there exists an algorithm that, for any $\epsilon > 0$, produces an $\epsilon $-accurate output). However, most infinite-dimensional spectral problems are $\notin \Delta _1^A$. The simplest example is the problem of computing spectra of infinite diagonal matrices. Very few interesting infinite-dimensional spectral problems are actually in $\Delta _1^A$, and most of the literature on spectral computations provides algorithms that yield $\Delta _2^A$ classification results. Such algorithms converge, but may not provide error control. In many cases, error control is impossible.
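The diagonal example can be sketched in a few lines of Python. The snippet below is an illustration under simplifying assumptions: the operators `a_zero` and `make_a_spike` are hypothetical, and the "algorithm" `gamma_n` is assumed to read only the first n diagonal entries (a general algorithm may choose adaptively which finitely many entries to read, but the same idea applies).

```python
def gamma_n(diag_entry, n):
    """Return the first n diagonal entries: all that is read after n evaluations."""
    return [diag_entry(j) for j in range(1, n + 1)]

def a_zero(j):
    """Diagonal operator with every entry 0, so the spectrum is {0}."""
    return 0.0

def make_a_spike(k):
    """Diagonal operator with a single entry 1 at position k: spectrum {0, 1}."""
    return lambda j: 1.0 if j == k else 0.0

n = 50
A = a_zero
B = make_a_spike(n + 1)  # differs from A only beyond what has been read

out_A = gamma_n(A, n)
out_B = gamma_n(B, n)
print(out_A == out_B)    # identical information gives identical output
```

The two spectra $\{0\}$ and $\{0,1\}$ are at Hausdorff distance 1, yet the outputs coincide, so by the triangle inequality the output cannot be within $2^{-n}$ of both spectra once $n\ge 2$. On the other hand, each value returned by `gamma_n` is a diagonal entry and hence lies in the spectrum, which is the kind of one-sided guarantee that the $\Sigma _1$ class formalises.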

Problems not in $\Delta _1^A$ are computed daily in the sciences, where suggestive numerical simulations or evidence based on experiments are often accepted in place of rigorous error bounds. Computer-assisted proofs cannot rely on such evidence, since only $100\%$ rigour is accepted. Nevertheless, there are many examples of famous conjectures that have been proven using computational problems that do not lie in $\Delta _1^A$. For example, the proof of Kepler’s conjecture (Hilbert’s 18th problem) [87, 88] relies on decision problems that are not in $\Delta _1^A$ [15]. Another example is C. Fefferman and L. Seco’s proof of the Dirac–Schwinger conjecture on the asymptotics of ground states of certain Schrödinger operators [72,73,74,75,76,77,78,79,80]. The reason for this apparent paradox is that the $\Sigma ^A_1$ and $\Pi ^A_1$ classes are larger than $\Delta ^A_1$, but can still be used in computer-assisted proofs. Both of the above examples implicitly prove $\Sigma ^A_1$ classifications. For example, suppose we have a computational spectral problem that lies in $\Sigma ^A_1$. This means that there is an algorithm that will converge and never provide incorrect output, up to a user-specified error bound. Thus, conjectures about operators never having spectra in a certain area (a common problem in stability analysis, for example) could be disproved by a computer-assisted proof. Recent results using computer-assisted proofs in spectral theory include [33, 111].

2.2 Evaluation Sets and Domains

Throughout this paper, unless otherwise specified, A will be a bounded operator acting on the canonical Hilbert space $l^2({\mathbb {N}})$ (we define $\Omega _{\textrm{B}}:={\mathcal {B}}(l^2({\mathbb {N}}))$), and realised as a matrix with respect to the canonical basis. However, the results of this paper extend to general separable Hilbert spaces ${\mathcal {H}}$ through a choice of orthonormal basis $e_1,e_2,\ldots $ if one can compute the matrix values of the operators with respect to this basis (see the discussion of the evaluation sets below). For example, we can treat operators naturally defined on lattices such as ${\mathbb {Z}}^d$, or more generally on graphs. Such operators are abundant in mathematical physics. Below we give the evaluation sets and classes of operators treated in this paper. For convenience, this information is summarised in Tables 2 and 3.

Table 2 Summary of evaluation sets used in this paper


Table 3 Summary of classes of operators treated in this paper


2.2.1 Evaluation Sets

We consider two natural sets of information that our algorithms can read. The first, $\Lambda _1$, provides the entries of the matrix representation of A with respect to the canonical basis $\{e_i\}_{i\in {\mathbb {N}}}$:

$$\begin{aligned} \Lambda _1=\{f^1_{i,j}: A\mapsto \langle Ae_j,e_i\rangle | i,j\in {\mathbb {N}}\}. \end{aligned}$$

The second, $\Lambda _2$, appends $\Lambda _1$ with the entries of the matrix representations of $A^*A$ and $AA^*$ with respect to the canonical basis $\{e_i\}_{i\in {\mathbb {N}}}$:

$$\begin{aligned} \Lambda _2=\Lambda _1\cup \{f^2_{i,j}: A\mapsto \langle Ae_j,Ae_i\rangle , f^3_{i,j}: A\mapsto \langle A^*e_j,A^*e_i\rangle | i,j\in {\mathbb {N}}\}. \end{aligned}$$

We include $\Lambda _2$ since it is natural for problems posed in variational form, and can often be evaluated through numerical integration. When considering classes with functions f (and $\{c_n\}$) and g as in (2.1) and (2.2) below, we will add these to the relevant evaluation set (evaluating g at rational points) and with an abuse of notation still use the notation $\Lambda _i$. A small selection of the problems also require additional information, such as when testing if a set intersects a spectral set, but any changes to $\Lambda _i$ will be pointed out where appropriate.

2.2.2 Classes of Operators

Let $\Omega _\textrm{N}$ denote the class of normal operators in $\Omega _{\textrm{B}}$, $\Omega _{\textrm{SA}}$ denote the class of self-adjoint operators in $\Omega _\textrm{N}$, and $\Omega _\textrm{D}$ denote the class of self-adjoint diagonal operators in $\Omega _{\textrm{SA}}$. For $f:{\mathbb {N}}\rightarrow {\mathbb {N}}$, $f(n)\ge n+1$ define

$$\begin{aligned} D_{f,n}(A):=\max \left\{ {\left\| (I-P_{f(n)})AP_n\right\| },{\left\| P_nA(I-P_{f(n)})\right\| }\right\} , \end{aligned}$$

(2.1)

where $P_m$ is the orthogonal projection onto $\textrm{span}\{e_1,\ldots , e_m\}$. Given such an f, we assume access to an estimate $D_{f,n}(A) \le c_n(A)\in {\mathbb {Q}}_{\ge 0}$, where $c_n \rightarrow 0$ as $n \rightarrow \infty $. We let $\Omega _f$ denote the class of bounded operators with known function f and $\{c_n\}$.^{Footnote 6} As a special case, if we know our matrix is sparse with finitely many nonzero entries in each column and row (and we know the positions of the nonzero entries) then we know an f with $c_n=0$. Let $g:{\mathbb {R}}_{+}\rightarrow {\mathbb {R}}_{+}$ be a strictly increasing, continuous function that vanishes only at 0 with $\lim _{x\rightarrow \infty }g(x)=\infty $. Let $\Omega _g$ be the class of bounded operators with

$$\begin{aligned} {g(\textrm{dist}(z,\textrm{Sp}(A)))}\le \left\| R(z,A)\right\| ^{-1}, \end{aligned}$$

(2.2)

for $z\in {\mathbb {C}}$, where $R(z,A)=(A-zI)^{-1}$. A simple compactness argument shows that such a g always exists for any given $A\in \Omega _{\textrm{B}}$. However, the classification of spectral problems in the SCI hierarchy generally depends on whether one knows an estimate for g or not. For example, in the self-adjoint and normal cases, $g(x) = x$ is the trivial choice of g. Operators with $g(x) = x$ are known as $G_1$ and include the well-studied class of hyponormal operators (operators with $A^*A-AA^*\ge 0$) [136]. A common assumption is that

$$\begin{aligned} \Vert R(z,A)\Vert \le \frac{C}{\textrm{dist}(z,\textrm{Sp}(A))} \quad \forall z\notin \textrm{Sp}(A), \end{aligned}$$

for some constant C, which is equivalent to $A\in \Omega _g$ with $g(x)=x/C$. For example, if A is similar to a normal operator with a similarity transformation S that has bounded condition number $\kappa (S)$, we can take $C=\kappa (S)$. Other examples with nonlinear g include perturbations of self-adjoint operators [84, e.g. Theorem 7.7.1]. More generally, one can view the function g as a measure of stability of the spectrum of A through the formula

$$\begin{aligned} \textrm{Sp}_{\epsilon }(A):=\textrm{Sp}(A)\cup \{z\notin \textrm{Sp}(A):\left\| R(z,A)\right\| \ge 1/\epsilon \}=\bigcup _{B\in \Omega _{\textrm{B}},\left\| B\right\| \le \epsilon }\textrm{Sp}(A+B), \end{aligned}$$

(2.3)

where $\textrm{Sp}_{\epsilon }(A)$ denotes the ($\epsilon $-)pseudospectrum of A [162]. The function g is held fixed for a given class $\Omega _g$ and a smaller g leads to a larger class of operators $\Omega _g$.
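The quantity $D_{f,n}(A)$ in (2.1) can be illustrated on a finite matrix standing in for a large truncation of A. The following Python sketch is an assumption-laden example: the helper `dispersion`, the pentadiagonal test matrix and the two candidate functions f are hypothetical choices, and a finite matrix can only mimic the infinite-dimensional definition.

```python
import numpy as np

def dispersion(A, f, n):
    """D_{f,n}(A) for a finite matrix A used as a stand-in for the operator."""
    m = f(n)
    lower = A[m:, :n]   # block (I - P_{f(n)}) A P_n
    right = A[:n, m:]   # block P_n A (I - P_{f(n)})
    return max(np.linalg.norm(lower, 2), np.linalg.norm(right, 2))

# Pentadiagonal matrix: nonzero entries only where |i - j| <= 2.
N = 12
A = sum(np.diag(np.ones(N - abs(k)), k) for k in range(-2, 3))

d_narrow = dispersion(A, lambda n: n + 1, 5)  # f too small for bandwidth 2
d_wide = dispersion(A, lambda n: n + 2, 5)    # f matches the bandwidth

print(d_narrow, d_wide)
```

For this banded example, $f(n)=n+2$ captures every nonzero entry in the first n columns and rows, so one may take $c_n=0$, in line with the sparse special case described above. With $f(n)=n+1$ a single entry of modulus 1 is missed.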

3 Main Results: The Foundations of Computing Geometric Features of Spectra

Our results classify computing geometric features of spectra in the SCI hierarchy. In other words, we are concerned with the foundations of computation for geometric features of spectra. There are two aspects of this classification: proving impossibility results (lower bounds), where we make use of the tools developed in Sect. 5 and Theorem 5.19, and proving upper bounds through the construction of algorithms. This ensures that our algorithms realise the boundary of what computers can achieve in spectral computations. We have included routines for some of the main algorithms in “Appendix B” and computational examples in Sect. 4.

Remark 3.1

(Bounding the operator norm) The proofs of lower bounds make clear that all classifications still hold if we replace the respective sub-class $\Omega \subset \Omega _{\textrm{B}}$ by the restriction to operators in $\Omega $ having operator norm at most $M\in {\mathbb {R}}_{>0}$, adding such a value M (constant function) to the evaluation set $\Lambda $.$\square $

Remark 3.2

(Computing the resolvent norm) Some of the algorithms are built on the local approximation of the functions (or similar functions) defined by

$$\begin{aligned} \gamma _{n}(z;A)=\min \{\sigma _{\textrm{inf}}((A-zI){|_{P_n{\mathcal {H}}}}),\sigma _{\textrm{inf}}((A^*-{\bar{z}}I){|_{P_n{\mathcal {H}}}})\}, \end{aligned}$$

where $\sigma _{\textrm{inf}}$ denotes the smallest singular value or injection modulus:

$$\begin{aligned} \sigma _{\textrm{inf}}(T)=\inf \{\Vert Tv\Vert :\Vert v\Vert =1\}. \end{aligned}$$

The functions $\gamma _{n}$ converge to $\Vert R(z,A)\Vert ^{-1}$, the reciprocal of the resolvent norm, uniformly on compact subsets of ${\mathbb {C}}$ from above as $n\rightarrow \infty $. This idea was crucial in [60, 64] to compute spectra with $\Sigma _1^A$ error control for a large class of operators. A theme of some of our proofs, especially those concerning Lebesgue measure and fractal dimensions, is the extension of these ideas to compute geometric properties of the spectrum.$\square $
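A minimal Python sketch of $\gamma _n$ is given below, under the assumption that the operator is banded so that a rectangular truncation with $f(n)$ rows captures $(A-zI)P_n$ exactly (the case $c_n=0$). The test operator `entry` (zero diagonal, ones on the first off-diagonals, acting on $l^2({\mathbb {N}})$, with spectrum $[-2,2]$), the helper name `gamma_n` and the choice $f(n)=n+1$ are assumptions for illustration and are not taken from the routines in “Appendix B”.

```python
import numpy as np

def entry(i, j):
    """Matrix entry <A e_j, e_i> of a hypothetical tridiagonal operator (1-based)."""
    return 1.0 if abs(i - j) == 1 else 0.0

def gamma_n(z, n, entry, f):
    """Smallest singular values of the f(n)-by-n truncations of A - zI and A* - conj(z)I."""
    m = f(n)
    rows, cols = range(1, m + 1), range(1, n + 1)
    B = np.array([[entry(i, j) for j in cols] for i in rows], dtype=complex)
    # Entries of the adjoint: (A*)_{ij} = conj(a_{ji})
    Bstar = np.array([[np.conj(entry(j, i)) for j in cols] for i in rows],
                     dtype=complex)
    E = np.eye(m, n)  # rectangular identity, the truncation of I
    s1 = np.linalg.svd(B - z * E, compute_uv=False).min()
    s2 = np.linalg.svd(Bstar - np.conj(z) * E, compute_uv=False).min()
    return min(s1, s2)

f = lambda n: n + 1          # enough rows for a tridiagonal operator
g5 = gamma_n(3.0, 5, entry, f)
g40 = gamma_n(3.0, 40, entry, f)
print(g5, g40)
```

Since this test operator is self-adjoint with spectrum $[-2,2]$, the limit at $z=3$ is $\textrm{dist}(3,\textrm{Sp}(A))=1$, and the computed values should decrease towards 1 from above as n grows.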

3.1 Spectral Radii

We begin with a very simple geometric feature of the spectrum. The spectral radius, r(A), of a bounded operator A is the supremum of the absolute values of members of the spectrum, which is attained. Spectral radii commonly appear in applications involving stability analysis. We set $\Xi _r(A):=r(A)$ and make the following initial observations:

(i)
One can easily show that the problem of computing the operator norm of any $A\in \Omega _{\textrm{B}}$ lies in $\Sigma _1^A$. Hence, since $r(A)\le \left\| A\right\| $, we can easily get an upper bound for $\Xi _r(A)$ in one limit. Of course, if A is not normal, this upper bound may not agree with $\Xi _r(A)$.
(ii)
If an operator lies in $\Omega _g$ with $g(x)=x$, then the convex hull of the spectrum is equal to the closure of the numerical range (recall that the numerical range is $\{\langle Ax,x\rangle :\Vert x\Vert =1\}$) [131]. Such operators are known as convexoid and the problem of computing $\Xi _r(A)$ for such operators lies in $\Sigma _1^A$.
(iii)
In light of Gelfand’s famous formula $\Xi _r(A)=\lim _{n\rightarrow \infty }\Vert A^n\Vert ^{\frac{1}{n}}$, one might expect that the computation of $\Xi _r(A)$ is strictly easier in the sense of the SCI hierarchy than that of the spectrum.
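To make observations (i) and (iii) concrete, the sketch below compares the operator norm with the Gelfand quantities $\Vert A^n\Vert ^{1/n}$ for a small non-normal matrix. This is only a finite-dimensional illustration with a matrix chosen here (not from the text); in finite dimensions the limit is unproblematic, whereas the classifications concern infinite-dimensional operators accessed through finitely many matrix entries.

```python
import numpy as np

def gelfand_estimate(A, n):
    """Return ||A^n||^(1/n), which tends to the spectral radius as n grows."""
    return np.linalg.norm(np.linalg.matrix_power(A, n), 2) ** (1.0 / n)

# Non-normal example matrix with spectral radius 0.5 (illustrative choice).
A = np.array([[0.5, 1.0],
              [0.0, 0.5]])

norm_A = np.linalg.norm(A, 2)  # the upper bound r(A) <= ||A|| from observation (i)
estimates = [gelfand_estimate(A, n) for n in (1, 10, 50, 200)]
```

The norm overestimates the spectral radius here, and the Gelfand quantities approach 0.5 only slowly from above.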

The following shows that the intuition in (iii) is misguided in general and only occurs if an operator is convexoid as in (ii). Computing $\Xi _r(A)$ is just as hard as computing the spectrum for the class $\Omega _{\textrm{B}}$. Controlling the resolvent via a function g as in (2.2) makes the problem easier in the sense of SCI hierarchy than the general class $\Omega _{\textrm{B}}$, but is not sufficient to reduce the SCI of the problem to 1.

Theorem 3.3

Let $g:{\mathbb {R}}_{+}\rightarrow {\mathbb {R}}_{+}$ be a strictly increasing, continuous function that vanishes only at 0 with $\lim _{x\rightarrow \infty }g(x)=\infty $. In addition, suppose that $g(x)\le (1-\delta )x$ for some $\delta \in (0,1)$. Then:

$$\begin{aligned}&\Delta ^G_1 \not \ni \{\Xi _r,\Omega _\textrm{D},\Lambda _1\} \in \Sigma ^A_1,\quad{} & {} \Delta ^G_1 \not \ni \{\Xi _r,\Omega _\textrm{N},\Lambda _1\} \in \Sigma ^A_1,\quad{} & {} \Delta ^G_1 \not \ni \{\Xi _r,\Omega _f\cap \Omega _g,\Lambda _1\} \in \Sigma ^A_1,\\&\Delta ^G_2 \not \ni \{\Xi _r,\Omega _g,\Lambda _1\} \in \Sigma ^A_2,\quad{} & {} \Delta ^G_2 \not \ni \{\Xi _r,\Omega _f,\Lambda _1\} \in \Pi ^A_2,\quad{} & {} \Delta ^G_3 \not \ni \{\Xi _r,\Omega _{\textrm{B}},\Lambda _1\} \in \Pi ^A_3. \end{aligned}$$

When considering the evaluation set $\Lambda _2$, the only changes are the following classifications:

$$\begin{aligned}&\Delta ^G_1 \not \ni \{\Xi _r,\Omega _g,\Lambda _2\} \in \Sigma ^A_1,\quad{} & {} \Delta ^G_2 \not \ni \{\Xi _r,\Omega _{\textrm{B}},\Lambda _2\} \in \Pi ^A_2. \end{aligned}$$

Remark 3.4

The $\Pi _2^A$ algorithm for $\{\Xi _r,\Omega _f\}$ does not need a null sequence $\{c_n\}$ bounding the dispersion, $D_{f,n}(A)\le c_n$, to be sharp in the SCI hierarchy since this is absorbed in the first limit.$\square $

Remark 3.5

The proofs of the lower bounds in Theorem 3.3 for $\Omega _g$ require g with the stated additional property and $\delta >0$. In particular, the lower bound does not cover the smaller class of $G_1$ operators.$\square $

3.2 Essential Spectral Radii

Next, we consider the essential spectral radius. Define the essential spectrum of $A\in \Omega _{\textrm{B}}$ as

$$\begin{aligned} \textrm{Sp}_{\textrm{ess}}(A)=\bigcap _{B\in \Omega _K}\textrm{Sp}(A+B), \end{aligned}$$

where $\Omega _K$ denotes the class of compact operators. The essential spectral radius, $\Xi _{er}(A)$, is simply the supremum of the absolute values over $\textrm{Sp}_{\textrm{ess}}(A)$.

Theorem 3.6

We have the following classifications for $i=1,2$:

$$\begin{aligned}&\Delta ^G_2 \not \ni \{\Xi _{er},\Omega _\textrm{D},\Lambda _i\} \in \Pi ^A_2,\quad{} & {} \Delta ^G_2 \not \ni \{\Xi _{er},\Omega _\textrm{N},\Lambda _i\} \in \Pi ^A_2,\quad{} & {} \Delta ^G_2 \not \ni \{\Xi _{er},\Omega _f,\Lambda _i\} \in \Pi ^A_2. \end{aligned}$$

Whereas, for general operators,

$$\begin{aligned} \Delta ^G_3 \not \ni \{\Xi _{er},\Omega _{\textrm{B}},\Lambda _1\} \in \Pi ^A_3,\quad \Delta ^G_2 \not \ni \{\Xi _{er},\Omega _{\textrm{B}},\Lambda _2\} \in \Pi ^A_2. \end{aligned}$$

3.3 Capacity and Polynomial Operator Norms

Given a polynomial p of degree at least two,^{Footnote 7} we consider the problem of computing $\Xi _{r,p}=\Vert p(A)\Vert $ and the capacity of the spectrum defined by

$$\begin{aligned} \Xi _{cap}(A)= & {} \inf _{\text {monic polynomial }p}\Vert p(A)\Vert ^{\frac{1}{\textrm{deg}(p)}}\\= & {} \lim _{d\rightarrow \infty }\inf \left\{ \Vert p(A)\Vert ^{\frac{1}{d}}:\text {monic polynomial }p,\textrm{deg}(p)=d\right\} . \end{aligned}$$

A theorem of Halmos shows that this definition of capacity agrees with the usual potential-theoretic definition of capacity of the set $\textrm{Sp}(A)$ [89]. Roughly speaking, the capacity measures the ability of $\textrm{Sp}(A)$ to hold electrical charge. We will also see some other measures of size in Sects. 3.5 and 3.6. The capacity of the spectrum is of particular interest in Krylov methods where, for instance, it is related to the speed of convergence^{Footnote 8} [117, 119, 121,122,123]. The capacity is also an important object in local spectral theory [1, 105, 119], and related work [48, 124] includes methods for computing the polynomially convex hull of an operator. The following theorem provides the relevant SCI classifications.

Theorem 3.7

We have the following classifications for $i=1,2$ and ${{\hat{\Omega }}}=\Omega _\textrm{D}$ or $\Omega _f$:

$$\begin{aligned}&\Delta ^G_1 \not \ni \{\Xi _{r,p},{{\hat{\Omega }}},\Lambda _i\} \in \Sigma ^A_1,\quad{} & {} \Delta ^G_2 \not \ni \{\Xi _{cap},{{\hat{\Omega }}},\Lambda _i\} \in \Pi ^A_2. \end{aligned}$$

For ${{\tilde{\Omega }}}=\Omega _\textrm{N},\Omega _g$ or $\Omega _{\textrm{B}}$,

$$\begin{aligned}&\Delta ^G_2 \not \ni \{\Xi _{r,p},{{\tilde{\Omega }}},\Lambda _1\} \in \Sigma ^A_2,\quad{} & {} \Delta ^G_3 \not \ni \{\Xi _{cap},{{\tilde{\Omega }}},\Lambda _1\} \in \Pi ^A_3\\&\Delta ^G_1 \not \ni \{\Xi _{r,p},{{\tilde{\Omega }}},\Lambda _2\} \in \Sigma ^A_1,\quad{} & {} \Delta ^G_2 \not \ni \{\Xi _{cap},{{\tilde{\Omega }}},\Lambda _2\} \in \Pi ^A_2. \end{aligned}$$

The proof shows these problems have the same classifications for $\Omega _\textrm{SA}$ as $\Omega _\textrm{N}$. Somewhat surprising is the result that the computation of $\Vert p(A)\Vert $ requires two successive limits for self-adjoint operators. The proof shows that one reason for this is spectral pollution associated with finite section methods.
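The definition of $\Xi _{cap}$ as an infimum over monic polynomials can be illustrated with a simple upper bound. The sketch below assumes a diagonal self-adjoint operator whose entries are dense in $[-1,1]$ (an example chosen here), represented by a finite sample of its diagonal, and evaluates $\Vert p(A)\Vert ^{1/d}$ for the monic Chebyshev polynomial of degree d. The function name is hypothetical, and this is not the tower of algorithms from the proof of Theorem 3.7.

```python
import numpy as np
from numpy.polynomial import chebyshev as C

def monic_chebyshev_norm_root(points, d):
    """||p(A)||^(1/d) for A = diag(points), p the monic Chebyshev polynomial of degree d >= 1."""
    coeffs = np.zeros(d + 1)
    coeffs[d] = 1.0                                       # T_d in the Chebyshev basis
    values = C.chebval(points, coeffs) / 2.0 ** (d - 1)   # T_d has leading coefficient 2^(d-1)
    return np.max(np.abs(values)) ** (1.0 / d)            # norm of a diagonal matrix

points = np.linspace(-1.0, 1.0, 2001)  # sampled diagonal entries, endpoints included
bounds = {d: monic_chebyshev_norm_root(points, d) for d in (2, 10, 50)}
```

Each value is an upper bound for the capacity of $[-1,1]$, which equals 1/2, and the bounds should decrease towards 1/2 as d grows.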

3.4 Essential Numerical Range, Gaps in Essential Spectra and Detecting Failure of Finite Section

We now consider geometric features of spectra that are related to the finite section method, the most intensely studied computational method of approximating spectra [35, 36, 40, 41].^{Footnote 9} The basic form of the finite section method approximates the spectrum of A by $\textrm{Sp}(P_nA|_{P_n{\mathcal {H}}})$, where $\{P_n\}$ is a sequence of finite-dimensional projections converging strongly to the identity as $n\rightarrow \infty $. The computation is often done with finite element, finite difference or spectral methods by discretising the operator on a suitable finite-dimensional space [31, 32, 47, 50, 102, 108, 137, 168]. Even when A is self-adjoint, spurious eigenvalues that have nothing to do with $\textrm{Sp}(A)$ can accumulate anywhere within gaps of the essential spectrum as $n\rightarrow \infty $.^{Footnote 10} This is known as spectral pollution. More precisely, the essential numerical range of $A\in \Omega _{\textrm{B}}$ is defined as

$$\begin{aligned} W_e(A)=\bigcap _{B\in \Omega _K}\overline{W(A+B)}, \end{aligned}$$

(3.1)

where $W(A)=\{\langle Ax,x\rangle :\Vert x\Vert =1\}$ is the usual numerical range.^{Footnote 11} We recall the following two theorems.

Theorem 3.8

(Pokrzywa [132]) Let $A \in {\mathcal {B}}({\mathcal {H}})$ and let $\{P_n\}$ be a sequence of finite-dimensional projections converging strongly to the identity. Suppose that $S \subset W_e(A).$ Then there exists a sequence $\{Q_n\}$ of finite-dimensional projections such that $P_n < Q_n$ (so $Q_n \rightarrow I$ strongly) and

$$\begin{aligned} d_{\textrm{H}}(\textrm{Sp}(P_nA |_{P_n{\mathcal {H}}}) \cup S, \textrm{Sp}(Q_nA |_{Q_n{\mathcal {H}}})) \rightarrow 0, \quad \text {as }n \rightarrow \infty , \end{aligned}$$

where $d_{\textrm{H}}$ denotes the Hausdorff distance.

Theorem 3.9

(Pokrzywa [132]) Let $A \in {\mathcal {B}}({\mathcal {H}})$ and let $\{P_n\}$ be a sequence of finite-dimensional projections converging strongly to the identity. If $\lambda \notin W_e(A)$, then $\lambda \in \textrm{Sp}(A)$ if and only if

$$\begin{aligned} \textrm{dist}(\lambda ,\textrm{Sp}( P_nA|_{P_n{\mathcal {H}}})) \rightarrow 0, \quad \text {as } n \rightarrow \infty . \end{aligned}$$

Theorems 3.8 and 3.9 show that spectral pollution is confined to the essential numerical range and can be arbitrarily bad in $W_e(A)\backslash \textrm{Sp}(A)$.^{Footnote 12} For self-adjoint operators, the gaps in the essential spectrum correspond exactly to $W_e(A)\backslash \textrm{Sp}(A)$. As a result, there has been considerable attention towards methods that detect gaps in essential spectra and eigenvalues within these gaps [31, 49, 108, 144], as well as studying the precise nature of spectral pollution [107, 112, 113, 137].
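A minimal example of spectral pollution can be produced with a few lines of Python. The sketch assumes the self-adjoint operator on $l^2({\mathbb {N}})$ that swaps $e_{2k-1}$ and $e_{2k}$ (an illustrative choice made here, not an example from the text). Its spectrum is $\{-1,1\}$, so $(-1,1)$ is a gap in the essential spectrum, while $W_e(A)=[-1,1]$.

```python
import numpy as np

def finite_section(n):
    """n x n finite section of the operator swapping e_{2k-1} and e_{2k}."""
    A = np.zeros((n, n))
    for k in range(0, n - 1, 2):          # only complete 2x2 blocks fit in the section
        A[k, k + 1] = A[k + 1, k] = 1.0
    return A

even_eigs = np.linalg.eigvalsh(finite_section(8))  # sections of even size
odd_eigs = np.linalg.eigvalsh(finite_section(9))   # odd size: last basis vector is cut off
```

For even sizes the eigenvalues are $\pm 1$, whereas every odd-sized section has a spurious eigenvalue at 0, which lies in $W_e(A)\backslash \textrm{Sp}(A)$.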

A consequence of the main result of this section, Theorem 3.10, is that detecting these gaps is strictly harder in the sense of the SCI hierarchy than computing the spectrum for self-adjoint operators (which was classified in [18, 60, 64]). We define the problem function $\Xi _{we}(A)=W_e(A)$. For a given non-empty open set U in ${\mathbb {F}}$ (with ${\mathbb {F}}$ being ${\mathbb {C}}$ or ${\mathbb {R}}$), let $\Xi _{poll}^{{\mathbb {F}}}$ be the decision problem

$$\begin{aligned} \Xi _{poll}^{{\mathbb {F}}}(A,U)={\left\{ \begin{array}{ll} 1,\quad \text { if }{\overline{U}}\cap (W_e(A)\backslash \textrm{Sp}(A))\ne \emptyset \\ 0,\quad \text { otherwise.} \end{array}\right. } \end{aligned}$$

$\Xi _{poll}^{{\mathbb {F}}}$ decides whether spectral pollution can occur on the closed set ${\overline{U}}$. For the self-adjoint case and ${\mathbb {F}}={\mathbb {R}}$, this is equivalent to asking whether there exists a point in the open set U that also lies in a gap of the essential spectrum. To incorporate U into $\Lambda _i$, we allow access to a countable number of open balls $\{U_m\}_{m\in {\mathbb {N}}}$ whose union is U. If ${\mathbb {F}}={\mathbb {R}}$, then each $U_m$ is of the form $(a_m,b_m)$ with $a_m,b_m\in {\mathbb {Q}}\cup \{\pm \infty \}$. If ${\mathbb {F}}={\mathbb {C}}$, then each $U_m$ is equal to $D_{r_m}(z_m)$ (the open ball of radius $r_m$ centred at $z_m$) with $r_m\in {\mathbb {Q}}_{+}\cup \{\infty \}$ and $z_m\in {\mathbb {Q}}+i{\mathbb {Q}}$. We add pointwise evaluations of the relevant sequences $\{(a_m,b_m)\}$ or $\{(r_m,z_m)\}$ to $\Lambda _i$.

Theorem 3.10

(Computation of essential numerical range and whether spectral pollution can occur on a set) Let $\Omega =\Omega _\textrm{N},\Omega _\textrm{SA}$ or $\Omega _{\textrm{B}}$ and let $i=1,2$. Then

$$\begin{aligned} \Delta ^G_2 \not \ni \{\Xi _{we},\Omega ,\Lambda _i\} \in \Pi ^A_2. \end{aligned}$$

Furthermore, for $i=1,2$ the following classifications hold, valid also if we restrict to the case $U=U_1$ or to $U=U_1={\mathbb {F}}$:

$$\begin{aligned} \Delta ^G_3 \not \ni \{\Xi _{poll}^{{\mathbb {R}}},\Omega _\textrm{SA},\Lambda _i\} \in \Sigma ^A_3, \quad{} & {} \Delta ^G_3 \not \ni \{\Xi _{poll}^{{\mathbb {C}}},\Omega _{\textrm{B}},\Lambda _i\} \in \Sigma ^A_3. \end{aligned}$$

Remark 3.11

(Computing spectra is easier than algorithmically determining whether spectral pollution can occur on a set) One can show that $\{\textrm{Sp}(\cdot ),\Omega _\textrm{SA},\Lambda _1\}\in \Sigma _2^A$ and $\{\textrm{Sp}(\cdot ),\Omega _\textrm{SA},\Lambda _2\}\in \Sigma _1^A$. Hence determining $\Xi _{poll}^{\mathbb {R}}$ is strictly harder than the spectral computational problem and requires two additional successive limits if $\Lambda =\Lambda _2$. Even in the general case, $\{\textrm{Sp}(\cdot ),\Omega _{\textrm{B}},\Lambda _2\}\in \Pi _2^A$ and hence the spectral problem is strictly easier in the sense of SCI hierarchy. The proofs also make clear that we get the same classification of $\Xi _{poll}^{{\mathbb {F}}}$ for other classes such as $\Omega _\textrm{N}$, $\Omega _g$ etc.$\square $

Remark 3.12

(Unbounded operators) In Sect. 7.1, we show that computing the essential numerical range for closed unbounded operators T on $l^2({\mathbb {N}})$ (under the condition that the linear span of the canonical basis forms a core of T) also lies in $\Pi _2^A$. The definition of the essential numerical range for such operators was recently given in [34]. This paper showed that $W_e(T)$ consists precisely of the essential spectrum of T together with all possible spectral pollution that may arise by applying projection methods to approximate the spectrum of T, thus generalising Theorems 3.8 and 3.9. A computational example is given in Sect. 4.2.$\square $

3.5 Lebesgue Measure of Spectra

A basic property of the set $\textrm{Sp}(A)$, also connected to physical applications, is its Lebesgue measure. Well-studied operators such as the almost Mathieu operator at critical coupling [11] or the Fibonacci Hamiltonian [154] have spectra with Lebesgue measure zero. Following [8], there have been many further numerical studies [157,158,159]. For further examples of operators with numerical approximations of the Lebesgue measure, see the references in [10, 24, 147]. Numerical studies typically look at periodic approximates [134], and computing the Lebesgue measure of periodic approximates of tridiagonal operators lies in $\Delta _1^A$. The tools we develop are more general and do not assume such structure. Verification of our algorithms for the almost Mathieu operator is presented in Sect. 4.4.

The Lebesgue measure on ${\mathbb {C}}$ will be denoted by $\textrm{Leb}$. When considering classes of self-adjoint operators, we use the Lebesgue measure on ${\mathbb {R}}$ denoted by $\textrm{Leb}_{{\mathbb {R}}}$. We also define

$$\begin{aligned} \widehat{\textrm{Sp}}_{\epsilon }(A)=\{z\in {\mathbb {C}}:\Vert R(z,A)\Vert ^{-1}<\epsilon \}, \end{aligned}$$

whose closure is $\textrm{Sp}_\epsilon (A)$. For a class $\Omega \subset \Omega _{\textrm{B}}$, there are three questions we answer in this section:

(1)
Given $A\in \Omega $, can we compute $\textrm{Leb}(\textrm{Sp}(A))$?
(2)
Given $A\in \Omega $ and $\epsilon >0$, can we compute $\textrm{Leb}(\widehat{\textrm{Sp}}_{\epsilon }(A))$?^{Footnote 13}
(3)
Given $A\in \Omega $, can we determine whether $\textrm{Leb}(\textrm{Sp}(A))=0$?

For the first two questions, we consider the metric space $([0,\infty ),d)$ with the Euclidean metric. For question three, we consider the discrete metric on $\{0,1\}$, where 1 is interpreted as “Yes”, and 0 as “No”. We denote the above problem functions by $\Xi _1^{L},\Xi _2^{L}$ and $\Xi _3^{L}$, respectively. In analogy to computing spectra and pseudospectra, $\Xi _2^L$ is the easiest to compute and can be done in one limit for a large class of operators. It also follows from the dominated convergence theorem that

$$\begin{aligned} \lim _{\epsilon \downarrow 0}\textrm{Leb}(\widehat{\textrm{Sp}}_{\epsilon }(A))=\textrm{Leb}(\textrm{Sp}(A)). \end{aligned}$$

(3.3)
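For a self-adjoint diagonal operator the limit in (3.3) can be explored directly, since then $\Vert R(z,A)\Vert ^{-1}=\textrm{dist}(z,\textrm{Sp}(A))$ and the real slice of $\widehat{\textrm{Sp}}_{\epsilon }(A)$ is the open $\epsilon $-neighbourhood of the closure of the diagonal entries. The sketch below works with $\textrm{Leb}_{{\mathbb {R}}}$ and with finitely many diagonal entries, both of which are simplifying assumptions made here; the function name is hypothetical.

```python
import numpy as np

def leb_eps_neighbourhood(points, eps):
    """Lebesgue measure on R of the union of the intervals (a - eps, a + eps)."""
    a = np.sort(np.asarray(points, dtype=float))
    total, left, right = 0.0, a[0] - eps, a[0] + eps
    for x in a[1:]:
        if x - eps <= right:       # overlaps the current connected component
            right = x + eps
        else:                      # gap found: close the current component
            total += right - left
            left, right = x - eps, x + eps
    return total + (right - left)

# Diagonal entries 1/j: the closure {0, 1, 1/2, ...} has Lebesgue measure zero.
entries = 1.0 / np.arange(1, 2001)
measures = [leb_eps_neighbourhood(entries, eps) for eps in (1e-2, 1e-3, 1e-4)]
```

The computed measures shrink as $\epsilon $ decreases, but only slowly, which hints at why a second limit is needed to recover $\textrm{Leb}(\textrm{Sp}(A))$ itself.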

Theorem 3.13

(Lebesgue measure of spectra) Given the above set-up, we have the following classifications

$$\begin{aligned} \Delta ^G_2 \not \ni \{\Xi _1^L,\Omega _f,\Lambda _i\} \in \Pi ^A_2, \quad \Delta ^G_2 \not \ni \{\Xi _1^L,\Omega _\textrm{D},\Lambda _i\} \in \Pi ^A_2 \quad i=1,2, \end{aligned}$$

and for $\Omega =\Omega _{\textrm{B}},\Omega _\textrm{SA}$, $\Omega _\textrm{N}$ or $\Omega _g$,

$$\begin{aligned} \Delta ^G_3 \not \ni \{\Xi _1^L,\Omega ,\Lambda _1 \} \in \Pi ^A_3,\quad \Delta ^G_2 \not \ni \{\Xi _1^L,\Omega ,\Lambda _2 \} \in \Pi ^A_2. \end{aligned}$$

The algorithm constructed in the proof of Theorem 3.13 is local and can be adapted to find the Lebesgue measure of $\textrm{Sp}(A)$ intersected with any compact interval or cube in one or two dimensions, respectively. Moreover, when considering $\Omega _f$, we do not need the sequence $\{c_n\}$, and the algorithm can be restricted to ${\mathbb {R}}$, where it converges to $\textrm{Leb}_{{\mathbb {R}}}(\textrm{Sp}(A)\cap {\mathbb {R}})$. Our results also hold when considering bounded diagonal operators (dropping the restriction of self-adjointness) and using $\textrm{Leb}$ instead of $\textrm{Leb}_{{\mathbb {R}}}$.

We now turn to the SCI classification of $\textrm{Leb}(\widehat{\textrm{Sp}}_{\epsilon }(A))$, which is useful since it provides a route to computing $\textrm{Leb}(\textrm{Sp}(A))$ for any $A\in \Omega _{\textrm{B}}$ via (3.3). This is a similar state of affairs to the computation of the spectrum itself—one can approximate the spectrum via pseudospectra.

Theorem 3.14

(Lebesgue measure of pseudospectra) Given the above set-up, we have the following classifications

$$\begin{aligned} \Delta ^G_1 \not \ni \{\Xi _2^L,\Omega _f,\Lambda _i\} \in \Sigma ^A_1, \quad \Delta ^G_1 \not \ni \{\Xi _2^L,\Omega _\textrm{D},\Lambda _i\} \in \Sigma ^A_1 \quad i=1,2, \end{aligned}$$

and for $\Omega =\Omega _{\textrm{B}},\Omega _\textrm{SA}$, $\Omega _\textrm{N}$ or $\Omega _g$,

$$\begin{aligned} \Delta ^G_2 \not \ni \{\Xi _2^L,\Omega ,\Lambda _1 \} \in \Sigma ^A_2,\quad \Delta ^G_1 \not \ni \{\Xi _2^L,\Omega ,\Lambda _2 \} \in \Sigma ^A_1. \end{aligned}$$

Why is $\Xi _2^L$ easier to compute than $\Xi _1^L$? Heuristically, the pseudospectrum is less refined than the spectrum, making the measure easier to approximate. Another viewpoint is the continuity points of the maps $\Xi _1^L$ and $\Xi _2^L$. For simplicity, consider these maps restricted to $\Omega _\textrm{D}$ and equip these diagonal operators with the operator norm topology. The following shows that $\Xi _2^L$ is more stable than $\Xi _1^L$, explaining why it is easier to approximate. Again, this is the same state of affairs as comparing $\textrm{Sp}(A)$ and $\textrm{Sp}_{\epsilon }(A)$ as sets.

Proposition 3.15

In the above set-up, the following hold:

(1)
$\Xi _1^L$ is continuous at $A\in \Omega _\textrm{D}$ if and only if $\textrm{Leb}_{{\mathbb {R}}}(\textrm{Sp}(A))=0$.
(2)
$\Xi _2^L$ is continuous at all $A\in \Omega _\textrm{D}$.

Finally, when computing $\Xi _3^L$, we let $({\mathcal {M}},d)$ be the set $\{0,1\}$ endowed with the discrete topology and consider the problem function

$$\begin{aligned} \Xi _3^L(A)= {\left\{ \begin{array}{ll} 0,\quad \text { if }\textrm{Leb}(\textrm{Sp}(A))>0\\ 1,\quad \text { otherwise.} \end{array}\right. } \end{aligned}$$

It is straightforward to build a family of algorithms that converge in three successive limits for this problem using the algorithm constructed in Theorem 3.13 and its monotonicity. The next theorem shows that this is optimal, even for the set of diagonal self-adjoint bounded operators. This demonstrates how hard it is to solve decision problems about the spectrum with finite amounts of information, particularly when the problems involve an object that ignores countable sets, such as the Lebesgue measure.

Theorem 3.16

(Is the spectrum Lebesgue null?) Given the above set-up, we have the following classifications

$$\begin{aligned} \Delta ^G_3\not \ni \{\Xi _3^L,\Omega _f,\Lambda _i\}\in \Pi ^A_3,\quad \Delta ^G_3\not \ni \{\Xi _3^L,\Omega _\textrm{D},\Lambda _i\}\in \Pi ^A_3, \quad i=1,2, \end{aligned}$$

and for $\Omega =\Omega _{\textrm{B}},\Omega _\textrm{SA}$, $\Omega _\textrm{N}$ or $\Omega _g$,

$$\begin{aligned} \Delta ^G_4\not \ni \{\Xi _3^L,\Omega ,\Lambda _1\}\in \Pi ^A_4, \quad \Delta ^G_3\not \ni \{\Xi _3^L,\Omega ,\Lambda _2\}\in \Pi ^A_3. \end{aligned}$$

Remark 3.17

These are the first examples of computational spectral problems that require four successive limits to compute in the SCI hierarchy. To prove this, we need some tools from descriptive set theory in Sect. 5. Note that we prove the lower bounds for general algorithms, so regardless of the model of computation.$\square $

3.6 Fractal Dimensions of Spectra

When considering operators from physical models, such as Schrödinger operators in quantum mechanics, fractal dimensions of spectra are related to important phenomena, such as the spreading of an initially localised wavepacket [101]. Further applications and numerical studies are already discussed in Sect. 1. However, estimating the fractal dimension is extremely difficult. This can be explained by the SCI hierarchy: we have $\textrm{SCI}>1$ even for computing the box-counting dimension, the most basic definition of fractal dimension. The Hausdorff dimension is even worse and has $\text {SCI}\ge 3$. In this section, we exclusively treat self-adjoint operators and hence seek fractal dimensions of $\textrm{Sp}(A)\subset {\mathbb {R}}$.^{Footnote 14}

Box-Counting Dimension: Let F be a bounded set in ${\mathbb {R}}$ and let $N_{\delta }(F)$ be the number of closed intervals of length $\delta >0$ required to cover F. We define the upper and lower box-counting dimensions as

$$\begin{aligned} \overline{\textrm{dim}}_B(F)=\limsup _{\delta \downarrow {}0}\frac{\log (N_{\delta }(F))}{\log (1/\delta )},\quad \underline{\textrm{dim}}_B(F)=\liminf _{\delta \downarrow {}0}\frac{\log (N_{\delta }(F))}{\log (1/\delta )}. \end{aligned}$$

When $\overline{\textrm{dim}}_B(F)=\underline{\textrm{dim}}_B(F)$, we can replace the $\liminf $ and $\limsup $ by $\lim $, and the common value is the box-counting dimension $\textrm{dim}_B(F)$, an example of a fractal dimension. A possible drawback of the box-counting dimension is its lack of countable stability. For example, $\textrm{dim}_B(\{0,1,1/2,1/3,\ldots \})=1/2$. Let $\Omega _{f}^{BD}$ be the class of self-adjoint operators in $\Omega _f$ (see (2.1)) whose upper and lower box-counting dimensions of the spectrum agree. Let $\Omega _{\textrm{SA}}^{BD}$ be the class of self-adjoint operators whose upper and lower box-counting dimensions of the spectrum agree, and denote by $\Omega _{\textrm{D}}^{BD}$ the class of diagonal operators in $\Omega _{\textrm{SA}}^{BD}$.
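The value $\textrm{dim}_B(\{0,1,1/2,1/3,\ldots \})=1/2$ can be checked numerically. The sketch below computes $N_{\delta }$ with a greedy left-to-right sweep, which is optimal for subsets of ${\mathbb {R}}$, and estimates the dimension from the slope of $\log N_{\delta }$ between two scales. The function names and the choice of scales are assumptions made for this illustration. Points $1/k\le \delta $ all lie in the first interval $[0,\delta ]$, so a finite subset suffices at each scale.

```python
import numpy as np

def covering_number(points, delta):
    """Minimal number of closed intervals of length delta covering a finite set."""
    pts = np.sort(np.asarray(points, dtype=float))
    count, i = 0, 0
    while i < len(pts):
        count += 1
        # jump past every point covered by [pts[i], pts[i] + delta]
        i = np.searchsorted(pts, pts[i] + delta, side="right")
    return count

def harmonic_set(delta):
    """Finite subset of {0, 1, 1/2, ...} with the same covering number at scale delta."""
    k = np.arange(1, int(np.ceil(1.0 / delta)) + 2)
    return np.concatenate(([0.0], 1.0 / k))

d_coarse, d_fine = 1e-4, 1e-6
count_coarse = covering_number(harmonic_set(d_coarse), d_coarse)
count_fine = covering_number(harmonic_set(d_fine), d_fine)
slope = (np.log(count_fine) - np.log(count_coarse)) / (np.log(1 / d_fine) - np.log(1 / d_coarse))
```

The slope should be close to 1/2. The raw ratio $\log (N_{\delta })/\log (1/\delta )$ converges much more slowly, which reflects the difficulty of relating the amount of data seen to the resolution obtained.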

Hausdorff Dimension: A more complicated, yet robust notion of fractal dimension is related to the Hausdorff measure [71, 114]. Let $F\subset {\mathbb {R}}^n$ be a bounded Borel set and let ${\mathcal {C}}_{\delta }(F)$ denote the class of (countable) $\delta $-covers^{Footnote 15} of F. One first defines the quantities (for $d\ge 0$)

$$\begin{aligned} {\mathcal {H}}^{d}_{\delta }(F)=\inf \left\{ \sum _i\textrm{diam}(U_i)^d:\{U_i\}\in {\mathcal {C}}_{\delta }(F)\right\} ,\quad {\mathcal {H}}^{d}(F)=\lim _{\delta \downarrow 0}{\mathcal {H}}^{d}_{\delta }(F). \end{aligned}$$

There is a unique $d'=\textrm{dim}_{H}(F)\ge 0$, the Hausdorff dimension of F, such that ${\mathcal {H}}^{d}(F)=0$ for $d>d'$ and ${\mathcal {H}}^{d}(F)=\infty $ for $d<d'$. One can prove that

$$\begin{aligned} \textrm{dim}_{H}(F)\le \underline{\textrm{dim}}_B(F)\le \overline{\textrm{dim}}_B(F). \end{aligned}$$

With these definitions in hand, we can now present the main theorem of this section.

Theorem 3.18

(Fractal dimensions of spectra) Let $\Xi _B(A)=\textrm{dim}_B(\textrm{Sp}(A))$ and $\Xi _H(A)=\textrm{dim}_H(\textrm{Sp}(A))$. Then for $i=1,2$,

$$\begin{aligned}&\Delta ^G_2\not \ni \{\Xi _B,\Omega _{f}^{BD},\Lambda _i\}\in \Pi ^A_2, \quad{} & {} \Delta ^G_2\not \ni \{\Xi _B,\Omega _{\textrm{D}}^{BD},\Lambda _i\}\in \Pi ^A_2\\&\Delta ^G_3\not \ni \{\Xi _H,\Omega _f\cap \Omega _\textrm{SA},\Lambda _i\}\in \Sigma ^A_3, \quad{} & {} \Delta ^G_3\not \ni \{\Xi _H,\Omega _\textrm{D},\Lambda _i\}\in \Sigma ^A_3, \end{aligned}$$

whereas

$$\begin{aligned}&\Delta ^G_3\not \ni \{\Xi _B,\Omega _{\textrm{SA}}^{BD},\Lambda _1\}\in \Pi ^A_3,\quad{} & {} \Delta ^G_2\not \ni \{\Xi _B,\Omega _{\textrm{SA}}^{BD},\Lambda _2\}\in \Pi ^A_2\\&\Delta ^G_4\not \ni \{\Xi _H,\Omega _\textrm{SA},\Lambda _1\}\in \Sigma ^A_4,\quad{} & {} \Delta ^G_3\not \ni \{\Xi _H,\Omega _\textrm{SA},\Lambda _2\}\in \Sigma ^A_3. \end{aligned}$$

Remark 3.19

(When $\underline{\textrm{dim}}_B(\textrm{Sp}(A))\ne \overline{\textrm{dim}}_B(\textrm{Sp}(A))$) The algorithms for $\Xi _B$ also converge without the assumption that the upper and lower box-counting dimensions of $\textrm{Sp}(A)$ agree, to a quantity $\Gamma (A)$ with

$$\begin{aligned} \underline{\textrm{dim}}_B(\textrm{Sp}(A))\le \Gamma (A)\le \overline{\textrm{dim}}_B(\textrm{Sp}(A)). \end{aligned}$$

One of the properties that makes the Hausdorff dimension harder to compute than the box-counting dimension is its countable stability, which implies that if F is countable then $\textrm{dim}_H(F)=0$.$\square $

Remark 3.20

Some of our results have interpretations for real bounded sequences. Given such a sequence $\{a_i\}_{i\in {\mathbb {N}}}\subset {\mathbb {R}}$, we can ask the same questions about $\overline{\{a_1,a_2,\ldots \}}$ as we have asked about the spectrum. We can embed these problems as spectral problems for the class $\Omega _\textrm{D}$ of bounded self-adjoint diagonal operators by simply considering diagonal operators with entries $\{a_1,a_2,\ldots \}$. Theorems 3.13, 3.16 and 3.18 immediately then give the classifications. With regard to fractal dimensions, the key problem is to try and relate the amount of data that has been seen to the resolution obtained from the data (as highlighted in the computational example below). Once we have the framework of the SCI, we can immediately see why the problem is so difficult—the computational problem requires three successive limits for the Hausdorff dimension.$\square $

Finally, the following lemma is used in the construction of the tower of algorithms for computing the Hausdorff dimension but is interesting in its own right so is listed here.

Lemma 3.21

Let $(a,b)\subset {\mathbb {R}}$ be a finite open interval and let $A\in \Omega _f\cap \Omega _\textrm{SA}$. Then determining whether $\textrm{Sp}(A)\cap (a,b)\ne \emptyset $ using $\Lambda _i$ is a problem with $\textrm{SCI}_A=1$. Furthermore, we can design an algorithm that halts if and only if the answer is “Yes”, that is, the problem lies in $\Sigma ^A_1$. Similarly, the problem lies in $\Sigma _2^A$ when considering $\Omega _\textrm{SA}$ with $\Lambda _1$ (or $\Sigma _1^A$ when we allow access to $\Lambda _2$).
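One simple way to organise such a halting test, sketched here under simplifying assumptions and not taken from “Appendix B”, uses the functions $\gamma _n$ of Remark 3.2. For self-adjoint A and real c we have $\textrm{dist}(c,\textrm{Sp}(A))=\Vert R(c,A)\Vert ^{-1}\le \gamma _n(c;A)$, so a value of $\gamma _n$ at the midpoint of $(a,b)$ that is below the half-width certifies that the interval meets the spectrum. The example operator (tridiagonal, zeros on the diagonal and ones on the off-diagonals, with spectrum $[-2,2]$), the function names and the use of floating-point singular values are all assumptions of this sketch.

```python
import numpy as np

def free_jacobi_rect(n):
    """(n+1) x n truncation of the tridiagonal operator with zero diagonal
    and unit off-diagonals on l^2(N)."""
    A = np.zeros((n + 1, n))
    idx = np.arange(n)
    A[idx + 1, idx] = 1.0        # subdiagonal, including the extra row
    A[idx[:-1], idx[1:]] = 1.0   # superdiagonal
    return A

def gamma_n(A_rect, z):
    """Smallest singular value of (A - z)|P_n H; A is self-adjoint and z is real."""
    rows, n = A_rect.shape
    return np.linalg.svd(A_rect - z * np.eye(rows, n), compute_uv=False)[-1]

def first_certificate(a, b, n_max, rect=free_jacobi_rect):
    """Return the first n <= n_max certifying Sp(A) meets (a, b), or None if none is found."""
    c, h = 0.5 * (a + b), 0.5 * (b - a)
    for n in range(2, n_max + 1):
        if gamma_n(rect(n), c) < h:   # dist(c, Sp(A)) <= gamma_n < h
            return n
    return None

hit = first_certificate(0.5, 1.5, 100)    # interval inside [-2, 2]
miss = first_certificate(2.5, 3.5, 60)    # interval disjoint from [-2, 2]
```

A returned integer is a certificate for “Yes”, while `None` only means that no certificate was found up to `n_max`, mirroring the one-sided nature of a $\Sigma _1^A$ classification.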

4 Computational Examples

In this section, we demonstrate that the SCI-sharp algorithms constructed in this paper can be efficiently implemented for large-scale computations. Moreover, the algorithms have desirable convergence properties, converging monotonically or being eventually constant, as captured by the $\Sigma /\Pi $ classification. Generically, this monotonicity holds in all of the successive limits, and not just the final limit; many of the towers of algorithms undergo oscillation phenomena where each subsequent limit is monotone but in the opposite sense/direction than the limit beforehand. We can take advantage of this when analysing the algorithms numerically. The algorithms also highlight suitable information that lowers the SCI classification to $\Sigma _1/\Pi _1$. Other advantages of the algorithms based on approximating the resolvent norm include locality, numerical stability and speed/parallelisation. In the examples that follow, we remind the reader what each parameter $n_k$ intuitively does in the relevant algorithm. Simplified routines for many of the algorithms can be found in “Appendix B”. Finally, we point the reader to Remark 5.12: all of the algorithms can be implemented rigorously using arithmetic operations over the rationals or with methods such as interval arithmetic.

4.1 Spectral Radius

We begin with the spectral radius and consider the upper-triangular non-normal operator on $l^2({\mathbb {Z}})$ defined by its action on the canonical basis via

$$\begin{aligned} Ae_j=e_{j-2}+i^{j}e_{j-1}. \end{aligned}$$

In this case, the operator norm of A is 2 and the approximation of the spectrum by finite section is $\{0\}$. Hence, to compute the spectral radius, one must resort to the techniques used in our algorithms based on rectangular truncations. Recall that the SCI classification for computing the spectral radius of such operators (where the dispersion is known) is $\Pi _2^A$ (see Theorem 3.3 for further classifications). The first parameter, $n_1$, controls the size of the rectangular truncation^{Footnote 16} (as well as the grid resolution), whereas the second, $n_2$, controls the resolvent norm cut-off ($\epsilon =1/n_2$).
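The failure of finite section for this operator is easy to reproduce. The sketch below builds the finite section on $\textrm{span}\{e_{-m},\ldots ,e_m\}$ (the symmetric choice of index window and the function name are assumptions made here). The resulting matrix is strictly upper triangular, so its only eigenvalue is 0 regardless of m, while its norm is already close to 2.

```python
import numpy as np

def finite_section(m):
    """Finite section of A e_j = e_{j-2} + i^j e_{j-1} on span{e_{-m}, ..., e_m}."""
    js = np.arange(-m, m + 1)
    n = len(js)
    A = np.zeros((n, n), dtype=complex)
    powers_of_i = np.array([1, 1j, -1, -1j])   # exact values of i^j, indexed by j mod 4
    for col, j in enumerate(js):
        if col >= 2:
            A[col - 2, col] = 1.0
        if col >= 1:
            A[col - 1, col] = powers_of_i[j % 4]
    return A

A = finite_section(100)
section_eigenvalues = np.diag(A)       # triangular matrix: eigenvalues are the diagonal entries
section_norm = np.linalg.norm(A, 2)    # spectral norm of the finite section
```

The diagonal is identically zero, so the finite sections carry no information about the spectral radius, which motivates the use of rectangular truncations.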

Figure 2 (left) shows the output of $\Gamma _{n_2,n_1}(A)$ for computing the spectral radius. We see the expected monotonicity; $\Gamma _{n_2,n_1}(A)$ is increasing in $n_1$ but decreasing in $n_2$. It appears that $\lim _{n_1\rightarrow \infty }\Gamma _{10^2,n_1}(A)\approx \lim _{n_1\rightarrow \infty }\Gamma _{10^3,n_1}(A)\approx 1.4149$. The fact that these two values for different $n_2$ are similar suggests that we have reached convergence. Of course, the proof that the problem does not lie in $\Delta _2^G$ shows that we can never apply a choice of subsequences to gain convergence in one limit over the whole class $\Omega _f$. Nevertheless, the approximate value of 1.4149 is confirmed in Fig. 2 (right) where we have shown pseudospectra, computed using the algorithm in [64].

4.2 Essential Numerical Range

To demonstrate the algorithm for computing the essential numerical range, we first consider the Laurent operator $A_0$ acting on $l^2({\mathbb {Z}})$ with the symbol

$$\begin{aligned} a(t)=\frac{t^4+t^{-1}}{2}. \end{aligned}$$

In this case, $\textrm{Sp}(A_0)=\textrm{Sp}_{\textrm{ess}}(A_0)=\{a(z):|z|=1\}$. We consider the operator $A=A_0+E$ where the compact perturbation E is given by

$$\begin{aligned} Ee_j=-\frac{3i}{1+|j|}e_{j-1}. \end{aligned}$$

Recall that the SCI classification for computing the essential numerical range is $\Pi _2^A$ (see Theorem 3.10). The first parameter, $n_1$, controls the size of the truncation, whereas the second, $n_2$, controls how far along the matrix the truncations $(I-P_{n_2})P_{n_1+n_2}A|_{P_{n_1+n_2}(I-P_{n_2}){\mathcal {H}}}$ are taken with respect to the canonical basis.

Figure 3 (left) shows the output of the algorithm $\Gamma _{n_2,n_1}(A)$ to compute the essential numerical range for $n_2=20{,}000$ and $n_1=500$. We show the boundary $\partial \Gamma _{n_2,n_1}(A)$ since the essential numerical range is convex. In this example, $W_\textrm{e}(A)$ is the convex hull of $\textrm{Sp}_{\textrm{ess}}(A_0)$, which allows us to verify the output of the algorithm. We also show 200 eigenvalues of finite section (computed using extended precision to avoid numerical instabilities associated with non-normal truncations), the majority of which are due to truncation and provide an example of spectral pollution. This is confirmed when we compare to the pseudospectrum, also shown in Fig. 3 (right), computed using the algorithm in [64]. However, eigenvalues outside $W_\textrm{e}(A)$ correspond to true eigenvalues of A (see Theorem 3.9).
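The verification against the convex hull of the symbol curve can be mimicked in a few lines. The sketch below is not the routine from “Appendix B”: it forms the compression of $A=A_0+E$ to $\textrm{span}\{e_{n_2+1},\ldots ,e_{n_2+n_1}\}$ directly from the matrix entries and compares the support function of its numerical range, obtained from the largest eigenvalue of a Hermitian part, with that of the sampled symbol curve. The function names, sample sizes and the smaller value of $n_1$ are assumptions made for illustration.

```python
import numpy as np

def tail_compression(n1, n2):
    """Compression of A = A_0 + E to span{e_{n2+1}, ..., e_{n2+n1}}."""
    cols = n2 + 1 + np.arange(n1)                   # global column indices k
    M = 0.5 * np.eye(n1, k=-4, dtype=complex)       # A_0 e_k has the component e_{k+4}/2
    upper = 0.5 - 3j / (1.0 + np.abs(cols[1:]))     # coefficient of e_{k-1} in (A_0 + E) e_k
    M[np.arange(n1 - 1), np.arange(1, n1)] = upper
    return M

def support_function(M, phi):
    """max of Re(e^{-i phi} w) over w in the numerical range of M."""
    H = 0.5 * (np.exp(-1j * phi) * M + np.exp(1j * phi) * M.conj().T)
    return np.linalg.eigvalsh(H)[-1]

def symbol_support(phi, samples=4000):
    """Same quantity for the convex hull of the sampled curve a(e^{i theta})."""
    t = np.exp(1j * np.linspace(0.0, 2.0 * np.pi, samples, endpoint=False))
    return np.max(np.real(np.exp(-1j * phi) * 0.5 * (t**4 + 1.0 / t)))

M = tail_compression(200, 20000)
phis = np.linspace(0.0, 2.0 * np.pi, 24, endpoint=False)
gaps = [symbol_support(p) - support_function(M, p) for p in phis]
```

The gaps should be small and essentially non-negative, since the numerical range of the compression sits inside the convex hull of the symbol curve up to the size of the tail of E and approaches it as $n_1$ grows.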

The algorithm can also be extended to unbounded operators, as outlined in Sect. 7.1. For example, we consider the complex Schrödinger operator

$$\begin{aligned} T=-\frac{d^2}{dx^2}+(2i+1)\cos (x). \end{aligned}$$

(4.1)

By using a Gabor basis, we can represent T as a closed operator on $l^2({\mathbb {N}})$ such that the linear span of the canonical basis (corresponding to the Gabor basis) forms a core. This allows us to use Corollary 7.5, where we can compute the matrix elements (corresponding to inner products with the basis functions) with error control using quadrature. Figure 4 shows the output for $n_2=10^4$ and various $n_1$. We see the expected monotonicity as $n_1$ increases and the output for $n_1=2000$ has converged to visible accuracy in the plot.

4.3 Capacity

We now consider a transport Hamiltonian on a Penrose tile for which few analytical results are known. Quasicrystals were discovered in 1982 by Shechtman [145] who was awarded the Nobel prize in 2011 for his discovery. Over the past 30 years, there has been considerable interest in their often exotic properties [67, 151]. The Penrose tile is the standard two-dimensional model [69, 165], and a finite portion of the tiling is shown in Fig. 5 (left). However, unlike one-dimensional models, very little is known about the spectral properties of two-dimensional quasicrystals. Let G be the graph consisting of the vertices, V(G), of the Penrose tiling and E(G) the set of edges. If there is an edge connecting two vertices x and y, we write $x \sim y$. The (negative) Laplacian, H, acts on $\psi \in l^2(V(G)) \cong l^{2}({\mathbb {N}})$ by

$$\begin{aligned} (H\psi )(x) = \sum _{y\sim x} \left( \psi (y)-\psi (x)\right) . \end{aligned}$$

(4.2)

By choosing a suitable ordering of the vertices, we can represent H as an operator acting on $l^2({\mathbb {N}})$ of bounded dispersion with $f(n)-n\sim {\mathcal {O}}(\sqrt{n})$. Recall that the SCI classification for computing the capacity of the spectrum of such operators is $\Pi _2^A$ (see Theorem 3.7 for further classifications). The first parameter, $n_1$, controls the size of the truncation used to test if intervals intersect the spectrum via Lemma 3.21, whereas the second, $n_2$, controls the spacings of the interval coverings (which have width $2^{-n_2}$). In this example, we used the conformal mapping method of [109] to accurately and rapidly compute the capacity of finite unions of intervals in ${\mathbb {R}}$ (see also Remark 6.4).

Figure 5 (right) shows the output of $\Gamma _{n_2,n_1}(H)$, and we see the expected monotonicity; the output is increasing in $n_1$ but decreasing in $n_2$. By comparing the outputs for $n_1=10^4$ and $n_1=10^5$, it appears we have convergence up to around $n_2=8$. This suggests an upper bound (since the output is non-increasing in $n_2$) of approximately 2.26 for the capacity of $\textrm{Sp}(H)$ ($\textrm{Sp}(H)$ is shown in Fig. 6).

4.4 Lebesgue Measure

First, we consider the almost Mathieu operator, which is related to a wealth of mathematical and physical problems such as the Ten Martini Problem [9]. The operator acts on $l^2({\mathbb {Z}})$ via

$$\begin{aligned} (H_{\alpha }x)_n=x_{n-1}+x_{n+1}+2\lambda \cos (2\pi n\alpha )x_n. \end{aligned}$$

(4.3)

The choice of $\lambda =1$ was studied in Hofstadter’s classic paper [92], giving rise to the famous Hofstadter butterfly. In this case, the Hamiltonian represents a crystal electron in a uniform magnetic field and the spectrum can be interpreted as the allowed energies of the system. For irrational $\alpha $, we have [11]

$$\begin{aligned} \textrm{Leb}_{{\mathbb {R}}}(\textrm{Sp}(H_{\alpha }))=4\left| 1-\left| \lambda \right| \right| \end{aligned}$$

(4.4)

and we consider the case $\alpha =(\sqrt{5}-1)/2$. Recall that the SCI classification for computing the Lebesgue measure of the spectrum of such operators (where the dispersion is known) is $\Pi _2^A$, whereas the SCI classification of computing the Lebesgue measure of the pseudospectrum is $\Sigma _1^A$ (see Theorems 3.13, 3.14 and 3.16 for the further classifications). For computing the Lebesgue measure of the spectrum, the first parameter, $n_1$, controls the size of the truncation used to compute the approximation of the resolvent norm, whereas the second, $n_2$, controls the grid refinement (the spacings are $2^{-n_2}$). For the pseudospectrum, $n_1$ controls the size of the truncations and the grid spacings.

Figure 6 (left) shows the output of the algorithms computing $\textrm{Leb}_{{\mathbb {R}}}(\textrm{Sp}(H_{\alpha }))$ (LebSpec) and also $\textrm{Leb}_{{\mathbb {R}}}(\textrm{Sp}_{\epsilon }(H_{\alpha }))$ (LebPseudoSpec) for a range of values of $\epsilon $. We chose values of $n_1=10^4$ and a grid spacing of 1/128 ($n_2=7$). One can clearly see that the estimates for $\textrm{Leb}_{{\mathbb {R}}}(\textrm{Sp}_{\epsilon }(H_{\alpha }))$ are decreasing to the true value of $\textrm{Leb}_{{\mathbb {R}}}(\textrm{Sp}(H_{\alpha }))$, which is well approximated by LebSpec.
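The two-parameter structure can be sketched as follows. This is a crude illustration and not the LebSpec routine (which relies on DistSpec from “Appendix A”). It assumes a rectangular truncation of $H_\alpha $ to columns $-n,\ldots ,n$ and rows $-n-1,\ldots ,n+1$, and it flags a grid cell when the smallest singular value of the shifted truncation is at most the grid spacing. The threshold and all function names are assumptions made for illustration.

```python
import numpy as np

def almost_mathieu_rect(n, lam, alpha):
    """Rectangular section of H_alpha: columns are sites -n..n, rows are sites -n-1..n+1."""
    sites = np.arange(-n, n + 1)
    m = sites.size
    M = np.zeros((m + 2, m))
    for c, s in enumerate(sites):
        M[c, c] = 1.0                                       # row of site s-1
        M[c + 1, c] = 2.0 * lam * np.cos(2.0 * np.pi * s * alpha)  # row of site s
        M[c + 2, c] = 1.0                                   # row of site s+1
    return M

def smallest_singular_value(M, z):
    """Smallest singular value of the section of H_alpha - z."""
    S = M.copy()
    cols = np.arange(M.shape[1])
    S[cols + 1, cols] -= z
    # singular values are returned in descending order
    return np.linalg.svd(S, compute_uv=False)[-1]

def lebesgue_estimate(n, lam, alpha, spacing, lo=-5.0, hi=5.0):
    """Total length of the grid cells flagged as close to the spectrum."""
    M = almost_mathieu_rect(n, lam, alpha)
    centers = np.arange(lo + spacing / 2.0, hi, spacing)
    flagged = [smallest_singular_value(M, z) <= spacing for z in centers]
    return spacing * sum(flagged)

golden = (np.sqrt(5.0) - 1.0) / 2.0
free_case = lebesgue_estimate(n=100, lam=0.0, alpha=golden, spacing=1.0 / 8.0)
```

For $\lambda =0$ the operator is the free discrete Laplacian with spectrum $[-2,2]$, so the estimate should be close to 4, in agreement with (4.4). Because the operator is self-adjoint and tridiagonal, the smallest singular value of this section is never below the distance from z to the spectrum, so cells far from the spectrum are not flagged.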

Next, we consider the operator H in (4.2), for which the Lebesgue measure of $\textrm{Sp}(H)$ is unknown. We set $n_1=10^5$ and look at the average estimated error of the output via DistSpec (see “Appendix A”). This was of the order $10^{-3}$, so we consider grid refinements of spacing $1/32, 1/64,\ldots ,1/1024$ corresponding to $n_2=5, 6,\ldots ,10$. Figure 6 (right) shows the output as a cumulative Lebesgue measure, that is, an estimate of $\textrm{Leb}_{{\mathbb {R}}}(\textrm{Sp}(H)\cap (-\infty ,x])$ for a given x, along with the computed spectrum (for a grid spacing of $10^{-5}$). The figure provides strong evidence that the part of the spectrum closest to 0 is resolved by the algorithm and has Lebesgue measure zero. We shall see more evidence for this in Sect. 4.5.

4.5 Fractal Dimension

For this example, we again consider the operator H in (4.2), for which the fractal dimension of $\textrm{Sp}(H)$ is unknown. In Fig. 7, we plot $N_{1/n_2}({\tilde{\Gamma }}_{10^5}(H)\cap [-3,\infty ))$ against $n_2$ (recall that $N_\delta (F)$ is the number of closed intervals of length $\delta > 0$ required to cover F). This corresponds to a rectangular truncation with $n_1=10^5$ columns. Recall that ${\tilde{\Gamma }}_{n}$ denotes the algorithm that converges to the spectrum with error control, in particular avoiding spectral pollution (see “Appendix A”). We also show a linear fit of slope 0.8. The error control provided by the algorithm ${\tilde{\Gamma }}_{n}$ allows us to deduce the region where the fit holds, corresponding to a reliable resolution of the spectrum (this is at least as large as the region shown in the plot). In other words, we can ensure that $n_2$ is not too large so that the spacings of the coverings are not smaller than the numerically resolved spectrum. As expected, when $n_2$ is too large we see the effect of the grid spacing and the unresolved spectrum (by choosing larger $n_1$, we can take $n_2$ larger). The figure suggests that the spectrum above $-3$ is fractal with box-counting dimension $\approx 0.8$ and hence has Lebesgue measure zero, in agreement with the findings in Fig. 6.
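The covering count $N_\delta (F)$ and the slope fit can be sketched for a finite set of real points. The sketch below is an illustration only: it uses a greedy left-to-right covering (which gives the minimal number of closed intervals for points on the real line) and, as an assumed test set, left endpoints of the middle-thirds Cantor construction rather than a computed spectrum. All function names are hypothetical.

```python
import numpy as np

def covering_number(points, delta):
    """Minimal number of closed intervals of length delta covering a finite set of reals."""
    pts = np.sort(np.asarray(points, dtype=float))
    count, i = 0, 0
    while i < pts.size:
        count += 1
        right_end = pts[i] + delta
        # jump past every point covered by [pts[i], pts[i] + delta]
        i = np.searchsorted(pts, right_end, side="right")
    return count

def box_counting_slope(points, deltas):
    """Least-squares slope of log N_delta against log(1/delta)."""
    counts = [covering_number(points, d) for d in deltas]
    slope, _ = np.polyfit(np.log(1.0 / np.asarray(deltas)), np.log(counts), 1)
    return slope

def cantor_left_endpoints(level):
    """Left endpoints of the 2^level intervals in the middle-thirds construction."""
    pts = np.array([0.0])
    for l in range(1, level + 1):
        pts = np.concatenate([pts, pts + 2.0 * 3.0 ** (-l)])
    return pts

cantor = cantor_left_endpoints(10)
deltas = [3.0 ** (-j) for j in range(1, 7)]
slope = box_counting_slope(cantor, deltas)
```

For this test set the fitted slope should be close to $\log 2/\log 3\approx 0.63$, provided the covering lengths stay above the resolution $3^{-10}$ of the point set. This mirrors the restriction on $n_2$ discussed above.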

Figure 7 also shows what happens when one performs the same experiment but with a finite section replacing ${\tilde{\Gamma }}_{n}$ (now using a square $10^5\times 10^5$ truncation). There are two noticeable features. First, for small $n_2$, using a finite section produces an overestimate of the size of the covering and the corresponding slope of the graph due to spectral pollution. In other words, finite section prevents us from detecting the fractal spectrum. Second, the covering estimate via finite section breaks down at smaller $n_2$ and it is impossible to predict suitable values of $n_2$ so that the spacings of the coverings do not go beyond the resolution of the computed spectrum. Together, these issues highlight why the finite section method is unsuitable in general^{Footnote 17} for approximating fractal dimensions and why the new algorithms in this paper (which are proven to converge) are needed.

5 Mathematical Preliminaries and Combinatorial Problems in the SCI Hierarchy

In this section, we begin by providing formal definitions of the SCI hierarchy. We then link the SCI hierarchy, in a certain specific case, to the Baire hierarchy on a suitable topological space. As well as being interesting in its own right, this provides a useful method of obtaining canonical problems high up in the SCI hierarchy. In particular, the results we prove hold for towers of general algorithms (see Definition 5.1) without the restrictions of arithmetic operations or notions of recursivity etc. This will be used extensively in the proofs of lower bounds for spectral problems that have $\textrm{SCI}>2$, where we typically reduce the problems discussed here to the given spectral problem. It should be stressed that such links to existing hierarchies only exist in special cases when $\Omega $ and ${\mathcal {M}}$ are particularly well-behaved. Even when such a link does exist, the induced topology on $\Omega $ is often too complicated, unnatural or strong to be useful from a computational viewpoint. We also take the view that, for problems of scientific interest, the mappings $\Lambda $ and metric space ${\mathcal {M}}$ are often given to us a priori from the corresponding applications and are typically not compatible with topological viewpoints of computation.

5.1 The SCI Hierarchy

We begin by defining the solvability complexity index (SCI) hierarchy, allowing us to show that our algorithms realise the boundary of what computers can achieve. We have already presented the definition of a computational problem $\{\Xi ,\Omega ,{\mathcal {M}},\Lambda \}$ in §2.1. Recall that the goal is to find algorithms that approximate the function $\Xi $. More generally, the main pillar of our framework is the concept of a tower of algorithms, which is needed to describe problems that need several successive limits in the computation. However, first one needs the definition of a general algorithm.

Definition 5.1

(General Algorithm) Given a computational problem $\{\Xi ,\Omega ,{\mathcal {M}},\Lambda \}$, a general algorithm is a mapping $\Gamma :\Omega \rightarrow {\mathcal {M}}$ such that for each $A\in \Omega $

(i)
there exists a (non-empty) finite subset of evaluations $\Lambda _\Gamma (A) \subset \Lambda $,
(ii)
the action of $\,\Gamma $ on A only depends on $\{A_f\}_{f \in \Lambda _\Gamma (A)}$ where $A_f:= f(A),$
(iii)
for every $B\in \Omega $ such that $B_f=A_f$ for every $f\in \Lambda _\Gamma (A)$, it holds that $\Lambda _\Gamma (B)=\Lambda _\Gamma (A)$.

The definition of a general algorithm is more general than the definition of a Turing machine [164] or a BSS machine [28]. A general algorithm has no restrictions on the operations allowed. The only restriction is that it can only take a finite amount of information, though it is allowed to adaptively choose the finite amount of information it reads depending on the input. Condition (iii) ensures that the algorithm consistently reads the information. With a definition of a general algorithm, we can define the concept of towers of algorithms.
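Conditions (i) to (iii) can be made concrete with a toy example on $\Omega =\{0,1\}^{{\mathbb {N}}}$ with pointwise evaluations. The sketch below is an illustration under our own assumptions, not part of the formal framework: the wrapper `RecordingInput` and the routine `first_one_within` are hypothetical names. The routine reads entries adaptively and stops at the first 1, and the wrapper records which evaluations were read, playing the role of $\Lambda _\Gamma (A)$.

```python
class RecordingInput:
    """Wraps a sequence a: N -> {0, 1} and records which entries are read."""

    def __init__(self, a):
        self._a = a
        self.queried = []

    def __call__(self, j):
        self.queried.append(j)
        return self._a(j)


def first_one_within(a, n):
    """Smallest index j <= n with a(j) == 1, or 0 if there is none.

    Only finitely many entries are read, and which ones depends on the input.
    """
    for j in range(1, n + 1):
        if a(j) == 1:
            return j
    return 0


def run(a, n):
    """Return the output together with the list of evaluations that were used."""
    recorded = RecordingInput(a)
    return first_one_within(recorded, n), recorded.queried


# Two inputs that agree on the entries 1, 2, 3 but differ at entry 7.
seq_a = lambda j: 1 if j == 3 else 0
seq_b = lambda j: 1 if j in (3, 7) else 0
out_a, read_a = run(seq_a, 10)
out_b, read_b = run(seq_b, 10)
```

Since the two inputs agree on every entry that is read for the first one, the same entries are read for the second and the outputs coincide, which is the consistency required by condition (iii).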

Definition 5.2

(Tower of Algorithms) Given a computational problem $\{\Xi ,\Omega ,{\mathcal {M}},\Lambda \}$, a tower of algorithms of height k for $\{\Xi ,\Omega ,{\mathcal {M}},\Lambda \}$ is a family of sequences of functions

$$\begin{aligned} \Gamma _{n_k}:\Omega \rightarrow {\mathcal {M}},\ \Gamma _{n_k, n_{k-1}}:\Omega \rightarrow {\mathcal {M}},\quad \dots \quad ,\ \Gamma _{n_k, \ldots , n_1}:\Omega \rightarrow {\mathcal {M}}, \end{aligned}$$

where $n_k,\ldots ,n_1 \in {\mathbb {N}}$ and the functions $\Gamma _{n_k, \ldots , n_1}$ at the lowest level of the tower are general algorithms in the sense of Definition 5.1. Moreover, for every $A \in \Omega $,

$$\begin{aligned} \Xi (A)= \lim _{n_k \rightarrow \infty } \Gamma _{n_k}(A), \quad \Gamma _{n_k, \ldots , n_{j+1}}(A)= \lim _{n_j \rightarrow \infty } \Gamma _{n_k, \ldots , n_j}(A) \quad j=k-1,\dots ,1. \end{aligned}$$
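As a toy illustration of a tower of height two (our own example, chosen for simplicity, with hypothetical function names), take $\Omega =\{0,1\}^{{\mathbb {N}}}$, ${\mathcal {M}}=\{0,1\}$ with the discrete metric, and let $\Xi (a)=1$ if and only if a has infinitely many entries equal to 1. The base algorithm below reads only the entries $a_{n_2+1},\ldots ,a_{n_1}$.

```python
from math import isqrt

def gamma(n2, n1, a):
    """Base algorithm: 1 if some entry a(j) with n2 < j <= n1 equals 1, else 0.

    For fixed n2 the output is non-decreasing in n1, and its limit is 1 exactly
    when a has a 1 beyond position n2. That limit is non-increasing in n2, and
    its limit is 1 exactly when a has infinitely many ones.
    """
    return int(any(a(j) == 1 for j in range(n2 + 1, n1 + 1)))

# Illustrative inputs (assumptions for this sketch).
squares = lambda j: 1 if isqrt(j) ** 2 == j else 0   # infinitely many ones
finite = lambda j: 1 if j <= 5 else 0                # finitely many ones

inner_limit_squares = gamma(10, 200, squares)   # a 1 appears beyond position 10
inner_limit_finite = gamma(10, 10_000, finite)  # no 1 beyond position 10
```

No finite computation determines $\Xi (a)$ here; the code only evaluates the base level $\Gamma _{n_2,n_1}$, and both limits are taken in the mathematical sense.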

In addition to a general tower of algorithms, we focus on arithmetic towers.

Definition 5.3

(Arithmetic Tower) Given a computational problem $\{\Xi ,\Omega ,{\mathcal {M}},\Lambda \}$, where $\Lambda $ is countable, we define the following: An arithmetic tower of algorithms of height k for $\{\Xi ,\Omega ,{\mathcal {M}},\Lambda \}$ is a tower of algorithms where the lowest functions $\Gamma = \Gamma _{n_k, \ldots , n_1}:\Omega \rightarrow {\mathcal {M}}$ satisfy the following: For each $A\in \Omega $ the mapping $(n_k, \ldots , n_1) \mapsto \Gamma _{n_k, \ldots , n_1}(A) = \Gamma _{n_k, \ldots , n_1}(\{A_f\}_{f \in \Lambda })$ is recursive, and $\Gamma _{n_k, \ldots , n_1}(A)$ is a finite string of complex numbers that can be identified with an element in ${\mathcal {M}}$. For arithmetic towers, we let $\alpha = A$.

Remark 5.4

By recursive we mean the following. If $f(A) \in {\mathbb {Q}}$ (or ${\mathbb {Q}}+i{\mathbb {Q}}$) for all $f \in \Lambda $, $A \in \Omega $, and $\Lambda $ is countable, then $\Gamma _{n_k, \ldots , n_1}(\{A_f\}_{f \in \Lambda })$ can be executed by a Turing machine [164] that takes $(n_k, \ldots , n_1)$ as input and that has an oracle tape consisting of $\{A_f\}_{f \in \Lambda }$. If $f(A) \in {\mathbb {R}}$ (or ${\mathbb {C}}$) for all $f \in \Lambda $, then $\Gamma _{n_k, \ldots , n_1}(\{A_f\}_{f \in \Lambda })$ can be executed by a BSS machine [28] that takes $(n_k, \ldots , n_1)$ as input and that has an oracle that can access any $A_f$ for $f \in \Lambda $.$\square $

Given the definitions above we can now define the key concept, namely the solvability complexity index:

Definition 5.5

(Solvability Complexity Index) A computational problem $\{\Xi ,\Omega ,{\mathcal {M}},\Lambda \}$ is said to have solvability complexity index $\textrm{SCI}(\Xi ,\Omega ,{\mathcal {M}},\Lambda )_{\alpha } = k$, with respect to a tower of algorithms of type $\alpha $, if k is the smallest integer for which there exists a tower of algorithms of type $\alpha $ of height k. If no such tower exists, then $\textrm{SCI}(\Xi ,\Omega ,{\mathcal {M}},\Lambda )_{\alpha } = \infty .$ If there exists a tower $\{\Gamma _n\}_{n\in {\mathbb {N}}}$ of type $\alpha $ and height one such that $\Xi = \Gamma _{n_1}$ for some $n_1 < \infty $, then we define $\textrm{SCI}(\Xi ,\Omega ,{\mathcal {M}},\Lambda )_{\alpha } = 0$. The type $\alpha $ may be General, or Arithmetic, denoted, respectively, G and A. We may sometimes write $\textrm{SCI}(\Xi ,\Omega )_{\alpha }$ to simplify notation when ${\mathcal {M}}$ and $\Lambda $ are obvious.

We will let $\textrm{SCI}(\Xi ,\Omega )_{\textrm{A}}$ and $\textrm{SCI}(\Xi ,\Omega )_{\textrm{G}}$ denote the SCI with respect to an arithmetic tower and a general tower, respectively. Note that a general tower means just a tower of algorithms as in Definition 5.2, where there are no restrictions on the mathematical operations. Thus, clearly $\textrm{SCI}(\Xi ,\Omega )_{\textrm{A}} \ge \textrm{SCI}(\Xi ,\Omega )_{\textrm{G}}$. The definition of the SCI immediately induces the SCI hierarchy:

Definition 5.6

(The Solvability Complexity Index Hierarchy) Consider a collection ${\mathcal {C}}$ of computational problems and let ${\mathcal {T}}$ be the collection of all towers of algorithms of type $\alpha $ for the computational problems in ${\mathcal {C}}$. Define

$$\begin{aligned} \begin{aligned} \Delta ^{\alpha }_0&:= \{\{\Xi ,\Omega \} \in {\mathcal {C}} \ \vert \ \textrm{SCI}(\Xi ,\Omega )_{\alpha } = 0\}\\ \Delta ^{\alpha }_{m+1}&:= \{\{\Xi ,\Omega \} \in {\mathcal {C}} \ \vert \ \textrm{SCI}(\Xi ,\Omega )_{\alpha } \le m\}, \qquad \quad m \in {\mathbb {N}}, \end{aligned} \end{aligned}$$

as well as

$$\begin{aligned} \Delta ^{\alpha }_{1}:= \{\{\Xi ,\Omega \} \in {\mathcal {C}} \ \vert \ \exists \ \{\Gamma _n\}_{n\in {\mathbb {N}}} \in {\mathcal {T}}\text { s.t. } \forall A \ d(\Gamma _n(A),\Xi (A)) \le 2^{-n}\}. \end{aligned}$$

When there is additional structure on the metric space, such as in the spectral case when one considers the Attouch–Wets or the Hausdorff metric, one can extend the SCI hierarchy. For non-empty closed sets, we consider the Attouch–Wets metric defined by

$$\begin{aligned} d_{\textrm{AW}}(C_1,C_2)=\sum _{n=1}^{\infty } 2^{-n}\min \left\{ {1,\underset{\left| x\right| \le n}{\sup }\left| \textrm{dist}(x,C_1)-\textrm{dist}(x,C_2)\right| }\right\} , \end{aligned}$$

(5.1)

for $C_1,C_2\in \textrm{Cl}({\mathbb {C}}),$ where $\textrm{Cl}({\mathbb {C}})$ denotes the set of closed non-empty subsets of ${\mathbb {C}}$. This generalises the familiar Hausdorff metric to unbounded closed sets and corresponds to locally uniform convergence on compact subsets of ${\mathbb {C}}$.
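For finite point sets, the series (5.1) can be approximated numerically. The sketch below is only an illustration under stated assumptions: the series is truncated after `terms` summands, and the supremum over $\left| x\right| \le n$ is replaced by a maximum over a square grid restricted to the disc, which gives a lower bound for each supremum. The function names and grid parameters are hypothetical.

```python
import numpy as np

def dist_to_set(x, C):
    """Distance from each point of the array x to the finite set C in the complex plane."""
    return np.min(np.abs(x[:, None] - C[None, :]), axis=1)

def attouch_wets(C1, C2, terms=12, pts_per_unit=4):
    """Truncated, grid-based approximation of the Attouch-Wets distance (5.1)."""
    C1 = np.asarray(C1, dtype=complex)
    C2 = np.asarray(C2, dtype=complex)
    total = 0.0
    for n in range(1, terms + 1):
        # grid on [-n, n]^2 (it contains the origin), restricted to |x| <= n
        t = np.linspace(-n, n, 2 * n * pts_per_unit + 1)
        X, Y = np.meshgrid(t, t)
        x = (X + 1j * Y).ravel()
        x = x[np.abs(x) <= n]
        gap = np.max(np.abs(dist_to_set(x, C1) - dist_to_set(x, C2)))
        total += 2.0 ** (-n) * min(1.0, gap)
    return total

same = attouch_wets([0.0, 1.0 + 1.0j], [0.0, 1.0 + 1.0j])
unit_apart = attouch_wets([0.0], [1.0])
```

For the singletons $\{0\}$ and $\{1\}$, every supremum equals 1 (attained at $x=0$), so the truncated sum is $1-2^{-12}$ with the default number of terms.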

Definition 5.7

(The SCI Hierarchy (Attouch–Wets/Hausdorff metric)) Given the set-up in Definition 5.6, and suppose in addition that $({\mathcal {M}},d)$ has the Attouch–Wets or the Hausdorff metric induced by another metric space $({\mathcal {M}}^{\prime },d')$, define, for $m \in {\mathbb {N}}$,

$$\begin{aligned} \Sigma ^{\alpha }_0&= \Pi ^{\alpha }_0 = \Delta ^{\alpha }_0,\\ \Sigma ^{\alpha }_{1}&= \{\{\Xi ,\Omega \} \in \Delta _{2}^{\alpha } \ \vert \ \exists \ \{\Gamma _{n}\} \in {\mathcal {T}}, \ \{X_{n}(A)\}\subset {\mathcal {M}} \text{ s.t. } \ \Gamma _{n}(A) \mathop {\subset }_{{\mathcal {M}}^{\prime }} X_n(A),\\ {}&\quad \quad \quad \quad \quad \quad \lim _{n\rightarrow \infty }\Gamma _{n}(A)=\Xi (A),\ \ d(X_{n}(A),\Xi (A))\le 2^{-n} \ \ \forall A \in \Omega \}, \\ \Pi ^{\alpha }_{1}&= \{\{\Xi ,\Omega \} \in \Delta _{2}^{\alpha } \ \vert \ \exists \ \{\Gamma _{n}\} \in {\mathcal {T}}, \ \{X_{n}(A)\}\subset {\mathcal {M}} \text{ s.t. } \ \Xi (A) \mathop {\subset }_{{\mathcal {M}}^{\prime }} X_{n}(A),\\ {}&\quad \quad \quad \quad \quad \quad \lim _{n\rightarrow \infty }\Gamma _{n}(A)=\Xi (A),\ \ d(X_{n}(A),\Gamma _n(A))\le 2^{-n} \ \ \forall A \in \Omega \}, \end{aligned}$$

where $\mathop {\subset }_{{\mathcal {M}}^{\prime }}$ means inclusion in the metric space ${\mathcal {M}}^{\prime }$, and $\{X_{n}(A)\}$ is a sequence where $X_n(A) \in {\mathcal {M}}$ depends on A. Moreover,

$$\begin{aligned} \begin{aligned} \Sigma ^{\alpha }_{m+1}&= \{\{\Xi ,\Omega \} \in \Delta _{m+2}^{\alpha } \ \vert \ \exists \ \{\Gamma _{n_{m+1},\ldots ,n_1}\}\in {\mathcal {T}},\\ {}&\quad \quad \{X_{n_{m+1}}(A)\}\subset {\mathcal {M}} \text{ s.t. } \ \Gamma _{n_{m+1}}(A) \mathop {\subset }_{{\mathcal {M}}^{\prime }} X_{n_{m+1}}(A),\\ {}&\quad \quad \lim _{n_{m+1}\rightarrow \infty }\Gamma _{n_{m+1}}(A)=\Xi (A),\ \ d(X_{n_{m+1}}(A),\Xi (A))\le 2^{-n_{m+1}} \ \ \forall A \in \Omega \}, \\ \Pi ^{\alpha }_{m+1}&= \{\{\Xi ,\Omega \} \in \Delta _{m+2}^{\alpha } \ \vert \ \exists \ \{\Gamma _{n_{m+1},\ldots ,n_1}\}\in {\mathcal {T}}, \\ {}&\quad \quad \{X_{n_{m+1}}(A)\}\subset {\mathcal {M}} \text{ s.t. } \ \Xi (A) \mathop {\subset }_{{\mathcal {M}}^{\prime }} X_{n_{m+1}}(A),\\ {}&\quad \quad \lim _{n_{m+1}\rightarrow \infty }\Gamma _{n_{m+1}}(A) =\Xi (A),\ \ d(X_{n_{m+1}}(A),\Gamma _{n_{m+1}}(A))\le 2^{-n_{m+1}} \ \ \forall A \in \Omega \}, \end{aligned} \end{aligned}$$

where d can be either $d_{\textrm{H}}$ or $d_{\textrm{AW}}$.

Note that to build a $\Sigma _1$ algorithm, it is enough (by taking subsequences of n) to construct $\Gamma _n(A)$ such that $\Gamma _{n}(A) \subset {\mathcal {N}}_{E_n(A)}(\Xi (A))$ with some computable $E_n(A)$ that converges to zero. The same idea can be applied to the real line with the usual metric, or $\{0,1\}$ with the discrete metric (we interpret 1 as “Yes”).

Definition 5.8

(The SCI Hierarchy (totally ordered set)) Given the set-up in Definition 5.6 and suppose in addition that ${\mathcal {M}}$ is a totally ordered set. Define

$$\begin{aligned} \begin{aligned} \Sigma ^{\alpha }_0&= \Pi ^{\alpha }_0 = \Delta ^{\alpha }_0,\\ \Sigma ^{\alpha }_{1}&= \{\{\Xi ,\Omega \} \in \Delta _{2}^{\alpha } \ \vert \ \exists \ \{\Gamma _{n}\} \in {\mathcal {T}} \text { s.t. } \Gamma _{n}(A) \nearrow \Xi (A) \ \, \forall A \in \Omega \}, \\ \Pi ^{\alpha }_{1}&= \{\{\Xi ,\Omega \} \in \Delta _{2}^{\alpha } \ \vert \ \exists \ \{\Gamma _{n}\} \in {\mathcal {T}} \text { s.t. } \Gamma _{n}(A) \searrow \Xi (A) \ \, \forall A \in \Omega \}, \end{aligned} \end{aligned}$$

where $\nearrow $ and $\searrow $ denote convergence from below and above, respectively, as well as, for $m \in {\mathbb {N}}$,

$$\begin{aligned} {\begin{matrix} \Sigma ^{\alpha }_{m+1} &{}= \{\{\Xi ,\Omega \} \in \Delta _{m+2}^{\alpha } \ \vert \ \exists \ \{\Gamma _{n_{m+1}, \hdots , n_1}\} \in \mathcal {T} \text { s.t. }\Gamma _{n_{m+1}}(A) \nearrow \Xi (A) \ \, \forall A \in \Omega \}, \\ \Pi ^{\alpha }_{m+1} &{}= \{\{\Xi ,\Omega \} \in \Delta _{m+2}^{\alpha } \ \vert \ \exists \ \{\Gamma _{n_{m+1}, \hdots , n_1}\} \in \mathcal {T} \text { s.t. }\Gamma _{n_{m+1}}(A) \searrow \Xi (A) \ \, \forall A \in \Omega \}. \end{matrix}} \end{aligned}$$

Remark 5.9

($\Delta ^{\alpha }_1\subsetneq \Sigma ^{\alpha }_1 \subsetneq \Delta ^{\alpha }_2$) Note that the inclusions are strict. For example, if $\Omega _K$ consists of the set of compact infinite matrices acting on $l^2({\mathbb {N}})$ and $\Xi (A)=\textrm{Sp}(A)$ (the spectrum of A) then $\{\Xi , \Omega _K\} \in \Delta ^{\alpha }_2$ but not in $ \Sigma _1^\alpha \cup \Pi _1^\alpha $ for $\alpha $ representing either towers of arithmetical or general type (see [18] for a proof). Moreover, as was demonstrated in [64], if $\tilde{\Omega }$ is the set of discrete Schrödinger operators on $l^2({\mathbb {Z}})$, then $\{\Xi , {{\tilde{\Omega }}}\} \in \Sigma ^{\alpha }_1$ but not in $\Delta ^{\alpha }_1$.$\square $

Suppose we are given a computational problem $\{\Xi , \Omega , {\mathcal {M}}, \Lambda \}$, and that $\Lambda = \{f_j\}_{j \in \beta }$, where $\beta $ is some index set that can be finite or infinite. Obtaining $f_j$ may be a computational task on its own, which is exactly the problem in most areas of computational mathematics. In particular, for $A \in \Omega $, $f_j(A)$ could be the number $e^{\frac{\pi }{j} i }$ for example. Hence, we cannot access $f_j(A)$, but rather $f_{j,n}(A)$ where $f_{j,n}(A) \rightarrow f_{j}(A)$ as $n \rightarrow \infty $. Or, just as for problems that are high up in the SCI hierarchy, it could be that we need several successive limits, in particular one may need mappings $f_{j,n_m,\ldots , n_1}: \Omega \rightarrow {\mathbb {D}} + i{\mathbb {D}}$, where ${\mathbb {D}}$ denotes the dyadic rational numbers, such that

$$\begin{aligned} \lim _{n_m \rightarrow \infty } \ldots \lim _{n_1 \rightarrow \infty } \Vert f_{j,n_m,\ldots , n_1}(A) - f_j(A)\Vert _{\infty } = 0 \quad \forall j\in \beta ,\forall A \in \Omega . \end{aligned}$$

(5.2)

In particular, we may view the problem of obtaining $f_j(A)$ as a problem in the SCI hierarchy, where $\Delta _1$ classification would correspond to the existence of mappings $f_{j,n}: \Omega \rightarrow {\mathbb {D}} + i {\mathbb {D}}$ such that

$$\begin{aligned} \Vert f_{j,n}(A) - f_j(A)\Vert _{\infty } \le 2^{-n} \quad \forall j\in \beta ,\forall A \in \Omega . \end{aligned}$$

(5.3)

This idea is formalised in the following definition.

Definition 5.10

($\Delta _{m}$-information) Let $\{\Xi , \Omega , {\mathcal {M}}, \Lambda \}$ be a computational problem. For $m \in {\mathbb {N}}$, we say that $\Lambda $ has $\Delta _{m+1}$-information if each $f_j \in \Lambda $ is not available directly, but there are mappings $f_{j,n_m,\ldots , n_1}: \Omega \rightarrow {\mathbb {D}} + i {\mathbb {D}}$ such that (5.2) holds. Similarly, $\Lambda $ has $\Delta _1$-information (the case $m = 0$) if there are mappings $f_{j,n}: \Omega \rightarrow {\mathbb {D}} + i {\mathbb {D}}$ such that (5.3) holds. Finally, if $k \in {\mathbb {N}}$ and ${{\hat{\Lambda }}}$ is a collection of such functions for which $\Lambda $ has $\Delta _k$-information, we say that ${{\hat{\Lambda }}}$ provides $\Delta _k$-information for $\Lambda $. Moreover, we denote the family of all such ${{\hat{\Lambda }}}$ by ${\mathcal {L}}^k(\Lambda )$.
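The bound (5.3) can be illustrated for the value $f_j(A)=e^{\frac{\pi }{j} i}$ mentioned above. The sketch below is a hypothetical illustration, not part of the formal definition: it rounds the real and imaginary parts to the nearest multiple of $2^{-(n+1)}$, returning dyadic rationals as `Fraction` objects. It assumes that double-precision evaluation of sine and cosine is accurate enough, which restricts it to moderate n (say $n\le 40$).

```python
from fractions import Fraction
import math

def dyadic_round(x, n):
    """Nearest multiple of 2^-(n+1) to the float x, as an exact dyadic rational."""
    scale = 2 ** (n + 1)
    return Fraction(round(x * scale), scale)

def f_jn(j, n):
    """Dyadic approximation (real part, imaginary part) of exp(i*pi/j).

    Each part is within 2^-(n+2) of the double-precision value, so the
    sup-norm error stays below 2^-n for moderate n.
    """
    angle = math.pi / j
    return dyadic_round(math.cos(angle), n), dyadic_round(math.sin(angle), n)

re_part, im_part = f_jn(3, 20)
```

Each output can be stored exactly with finitely many bits, which is the point of working with ${\mathbb {D}} + i{\mathbb {D}}$ rather than with the exact value of $f_j(A)$.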

We want algorithms that can handle all computational problems $\{\Xi ,\Omega ,{\mathcal {M}},{{\hat{\Lambda }}}\}$ when ${{\hat{\Lambda }}} \in {\mathcal {L}}^m(\Lambda )$. To formalise this, we define a computational problem with $\Delta _m$-information.

Definition 5.11

(Computational problem with $\Delta _m$-information) Given $m \in {\mathbb {N}}$ with $m>1$, a computational problem where $\Lambda $ has $\Delta _m$-information is denoted by $ \{\Xi ,\Omega ,{\mathcal {M}},\Lambda \}^{\Delta _m}:= \{{{\tilde{\Xi }}},{{\tilde{\Omega }}},{\mathcal {M}},{{\tilde{\Lambda }}}\}, $ where

$$\begin{aligned} {{\tilde{\Omega }}} = \Big \{ {{\tilde{A}}} = \{f_{j,n_{m-1},\ldots , n_1}(A)\}_{j,n_{m-1},\ldots , n_1 \in \beta \times {\mathbb {N}}^{m-1}} \, \vert&\, A \in \Omega , \{f_j\}_{j \in \beta }= \Lambda , \\&f_{j,n_{m-1},\ldots , n_1} \text{ satisfy } \text{(5.2) } \Big \} . \end{aligned}$$

Moreover, ${{\tilde{\Xi }}}({{\tilde{A}}}) = \Xi (A)$, and we have $\tilde{\Lambda }= \{{{\tilde{f}}}_{j,n_{m-1},\ldots , n_1}\}_{j,n_{m-1},\ldots , n_1 \in \beta \times {\mathbb {N}}^{m-1}}$ where $\tilde{f}_{j,n_{m-1},\ldots , n_1}({{\tilde{A}}}) = f_{j,n_{m-1},\ldots , n_1}(A)$. Note that ${{\tilde{\Xi }}}$ is well-defined by Definition 2.1 of a computational problem. Similarly, we define $\Delta _1$-information using (5.3).

The SCI and the SCI hierarchy, given $\Delta _m$-information, are then defined in the standard way.

Remark 5.12

(Classifications in this paper) For the problems considered in this paper, the SCI classifications do not change if we consider arithmetic towers with $\Delta _1$-information. This is easy to see through Church’s thesis and an analysis of the stability of our algorithms. For example, when the input is rational we have been careful to restrict all relevant operations to ${\mathbb {Q}}$ rather than ${\mathbb {R}}$, and errors incurred from $\Delta _1$-information can be removed in the first limit. Explicitly, for the algorithms based on DistSpec (see “Appendix A”) it is possible to carry out an error analysis. We can also bound numerical errors (e.g. using interval arithmetic [163]) and incorporate this uncertainty for the estimation of $\left\| R(z,A)\right\| ^{-1}$ to gain the same classification of our problems. The same holds for the other algorithms, which are based on similar functions. In other words, for the results of this paper, it does not matter which model of computation one uses for a definition of ‘algorithm’. From a classification point of view, they are equivalent for these spectral problems. This leads to rigorous $\Sigma _k^\alpha $ or $\Pi _k^\alpha $ type error control suitable for verifiable numerics. In particular, for $\Sigma _1^{\alpha }$ or $\Pi _1^{\alpha }$ towers of algorithms, this could be useful for computer-assisted proofs.$\square $

5.2 Recalling Some Results from Descriptive Set Theory

We briefly recall the definition of the Borel hierarchy as well as some well-known theorems from descriptive set theory. It is beyond the scope of this paper to provide an extensive discussion of descriptive set theory, but we refer the reader to [98, Chapter 2] for an excellent introduction that covers the main ideas.

Let X be a metric space and define

$$\begin{aligned} \Sigma _1^0(X)=\{U\subset X:U\text { is open}\},\quad \Pi _1^0(X)=\sim \Sigma _1^0(X)=\{F\subset X:F\text { is closed}\}, \end{aligned}$$

where for a class ${\mathcal {U}}$, $\sim {\mathcal {U}}$ denotes the class of complements (in X) of elements of ${\mathcal {U}}$. Inductively define

$$\begin{aligned} \Sigma ^0_\xi (X)&=\{\cup _{n\in {\mathbb {N}}}A_n:A_n\in \Pi ^0_{\xi _n},\xi _n<\xi \},\text { if }\xi >1,\\ \Pi _\xi ^0(X)&=\sim \Sigma _\xi ^0(X),\quad \Delta _\xi ^0(X)=\Sigma ^0_\xi (X)\cap \Pi _\xi ^0(X). \end{aligned}$$

The full Borel hierarchy extends to all $\xi <\omega _1$ ($\omega _1$ being the first uncountable ordinal) by transfinite induction but we do not need this here.

Definition 5.13

Given a class of subsets, ${\mathcal {U}}$, of a metric space X and given another metric space Y, we say that the function $f:X\rightarrow Y$ is ${\mathcal {U}}$-measurable if $f^{-1}(U)\in {\mathcal {U}}$ for every open set $U\subset Y$.

Given metric spaces X and Y, the Baire hierarchy is defined as follows. A function $f:X\rightarrow Y$ is of Baire class 1, written $f\in {\mathcal {B}}_1$, if it is $\Sigma _2^0(X)$-measurable. For $1<\xi <\omega _1$, a function $f:X\rightarrow Y$ is of Baire class $\xi $, written $f\in {\mathcal {B}}_\xi $, if it is the pointwise limit of a sequence of functions $f_n$ in ${\mathcal {B}}_{\xi _n}$ with $\xi _n<\xi $. The following theorem is well-known [98, Section 24] and provides a useful link between the Borel and Baire hierarchies.

Theorem 5.14

(Lebesgue, Hausdorff, Banach) Let X, Y be metric spaces with Y separable and $1\le \xi <\omega _1$. Then, $f\in {\mathcal {B}}_\xi $ if and only if it is $\Sigma _{\xi +1}^0(X)$ measurable. Furthermore, if X is zero-dimensional (Hausdorff with a basis of clopen sets) and $f\in {\mathcal {B}}_1$, then f is the pointwise limit of a sequence of continuous functions.

The assumption that X is zero-dimensional in the last statement is important. Without any assumptions, the final statement of the theorem is false, as is easily seen by considering $X={\mathbb {R}}$. Examples of zero-dimensional spaces include products of the discrete space $\{0,1\}$ or the Cantor space. Any such space is necessarily totally disconnected, meaning that the connected components in the space are the one-point sets (the converse is true for locally compact Hausdorff spaces). Our primary interest will be when Y is equal to $\{0,1\}$ or [0, 1], both with their natural topologies.

5.3 Linking the SCI Hierarchy to the Baire Hierarchy in a Special Case

Definition 5.15

Given the triple $\{\Omega ,{\mathcal {M}},\Lambda \}$, a class of algorithms ${\mathcal {A}}$ is closed under search with respect to $\{\Omega ,{\mathcal {M}},\Lambda \}$ if whenever

(1)
${\mathcal {I}}$ is an index set,
(2)
$\{n_i\}_{i\in {\mathcal {I}}}$ a family of natural numbers,
(3)
$\{\Gamma _{i,l}:\Omega \rightarrow {\mathcal {M}}\}_{i\in {\mathcal {I}},l\le n_i}\subset {\mathcal {A}}$,
(4)
$\{U_{i,l}\}_{i\in {\mathcal {I}},l\le n_i}$ a family of basic open sets in ${\mathcal {M}}$ with $\cup _{i\in {\mathcal {I}}}\cap _{l\le n_i}\Gamma _{i,l}^{-1}(U_{i,l})=\Omega ,$
(5)
$\{c_i\}_{i\in {\mathcal {I}}}$ a family of points in some arbitrary dense subset of ${\mathcal {M}}$,

then there is some $\Gamma \in {\mathcal {A}}$ such that for every $x\in \Omega $ there exists some $i\in {\mathcal {I}}$ with $\Gamma (x)=c_i$ and for all $l\le n_i$ we have $\Gamma _{i,l}(x)\in U_{i,l}$.

Proposition 5.16

Suppose that ${\mathcal {A}}$ is closed under search with respect to $\{\Omega ,{\mathcal {M}},\Lambda \}$. Then there exists a topology ${\mathcal {T}}$ on $\Omega $ such that $\Delta _1^{{\mathcal {A}}}$ is precisely the set of continuous functions from $(\Omega ,{\mathcal {T}})$ to ${\mathcal {M}}$.

Proof

Let ${\mathcal {T}}$ be the topology generated by $\{\Gamma ^{-1}(B):\Gamma \in {\mathcal {A}},B\subset {\mathcal {M}}\text { basic open}\}$. Any $\Gamma \in {\mathcal {A}}$ is continuous with respect to this topology. Uniform limits of continuous functions into metric spaces are also continuous, and hence any function in $\Delta _1^{{\mathcal {A}}}$ is continuous with respect to ${\mathcal {T}}$.

For the other direction, suppose that $f: (\Omega ,{\mathcal {T}})\rightarrow {\mathcal {M}}$ is continuous and fix $n\in {\mathbb {N}}$. Choose $\{c_i\}_{i\in {\mathcal {I}}}\subset {\mathcal {M}}$ such that ${\mathcal {M}}\subset \cup _{i\in {\mathcal {I}}}D(c_i,2^{-n})$. Continuity of f implies that the sets $f^{-1}(D(c_i,2^{-n}))$ are open. This implies that there is an index set ${\mathcal {J}}$, natural numbers $\{n_{i,j}\}_{i\in {\mathcal {I}},j\in {\mathcal {J}}}$, a family $\{\Gamma _{i,j,l}\}_{i\in {\mathcal {I}},j\in {\mathcal {J}},l\le n_{i,j}}$ (in ${\mathcal {A}}$) and a family of basic open sets $\{U_{i,j,l}\}_{i\in {\mathcal {I}},j\in {\mathcal {J}},l\le n_{i,j}}$ with the property that

$$\begin{aligned} f^{-1}(D(c_i,2^{-n}))=\bigcup _{j\in {\mathcal {J}}}\bigcap _{l\le n_{i,j}}\Gamma ^{-1}_{i,j,l}(U_{i,j,l}). \end{aligned}$$

It follows that

$$\begin{aligned} \bigcup _{i\in {\mathcal {I}},j\in {\mathcal {J}}}\bigcap _{l\le n_{i,j}}\Gamma ^{-1}_{i,j,l}(U_{i,j,l})=\Omega . \end{aligned}$$

Since ${\mathcal {A}}$ is closed under search, there exists $f_n\in {\mathcal {A}}$ such that for every $x\in \Omega $ there exists some $i\in {\mathcal {I}}$ and $j\in {\mathcal {J}}$ with $f_n(x)=c_i$ and for all $l\le n_{i,j}$, $x\in \Gamma ^{-1}_{i,j,l}(U_{i,j,l}).$ But this implies that $d(f_n(x),f(x))< 2^{-n}$. Since n was arbitrary, we have $f\in \Delta _1^{{\mathcal {A}}}$. $\square $

The generated topology can be very perverse and not every class of algorithms is closed under search. However, we do have the following useful theorem when $\Omega $ (and $\Lambda $) is a particularly simple discrete space.

Theorem 5.17

Suppose that $\Omega =\{0,1\}^{{\mathbb {N}}}=\{\{a_i\}_{i\in {\mathbb {N}}}:a_i\in \{0,1\}\}$ with the set of evaluation functions $\Lambda $ equal to the set of pointwise evaluations $\{\lambda _{j}(a):=a_j:j\in {\mathbb {N}}\}$ and let ${\mathcal {M}}$ be an arbitrary separable metric space with at least two separated points. Endow $\Omega $ with the product topology, $\tilde{{\mathcal {T}}}$, induced by the discrete topology on $\{0,1\}$ and consider the Baire hierarchy, $\{{\mathcal {B}}_{\xi }((\Omega ,\tilde{{\mathcal {T}}}),{\mathcal {M}})={\mathcal {B}}_{\xi }\}_{\xi <\omega _1}$, of functions $f:\Omega \rightarrow {\mathcal {M}}$. Then for any problem function $\Xi :\Omega \rightarrow {\mathcal {M}}$ and $m\in {\mathbb {N}}$,

$$\begin{aligned} \{\Xi ,\Omega ,\Lambda \}\in \Delta ^G_{m+1}\Leftrightarrow \Xi \in {\mathcal {B}}_{m}. \end{aligned}$$

(5.4)

In other words, the SCI corresponds to the Baire hierarchy index.

Remark 5.18

The proof shows that we can replace $\Omega $ by $\{0,1\}^{{\mathbb {N}}\times {\mathbb {N}}}$ or any other such product space (induced by a discrete topology) of the form $A^B$ with A, B countable, with $\Lambda $ the corresponding component-wise evaluations, as long as ${\mathcal {M}}$ has at least $\left| A\right| $ jointly separated points and is separable.$\square $

Proof

First we show that general algorithms are closed under search and that the topology ${\mathcal {T}}$ in Proposition 5.16 is equal to the product topology $\tilde{{\mathcal {T}}}$. Without loss of generality, we can assume that ${\mathcal {I}}$ is well-ordered by $\prec $. Given $x\in \Omega $, let $k\in {\mathbb {N}}$ be minimal such that there exists $i\in {\mathcal {I}}$ with $x\in \cap _{l\le n_i}\Gamma ^{-1}_{i,l}(U_{i,l})$ and $\Lambda _{\Gamma _{i,l}}(x)\subset \{\lambda _j:j\le k\}$ for $l\le n_i$. Let $i_0$ be the $\prec $-least witness for k and then define $\Gamma (x)=c_{i_0}$. The well-ordering of ${\mathcal {I}}$ implies that $\Gamma $ is a general algorithm, and it clearly satisfies the requirements in the definition of closed under search. Note that this part of the proof only uses countability of $\Lambda $.

To equate the topologies, suppose that $\Gamma \in \Delta _0^G$ is a general algorithm. For each $a\in \Omega ,$ $\Lambda _{\Gamma }(a)$ is finite and we can assume without loss of generality that it is equal to $\{\lambda _j:j\le I(a)\}$ for some finite I(a). In particular, there exists an open set $U_a$ such that any $b\in U_a$ has $\lambda _j(b)=\lambda _j(a)$ for $j\le I(a)$ and hence $\Gamma (b)=\Gamma (a)$. Then for any open set $B\subset {\mathcal {M}}$

$$\begin{aligned} \Gamma ^{-1}(B)=\bigcup _{a\in \Gamma ^{-1}(B)}U_a \end{aligned}$$

is open. Hence each $\Gamma $ is continuous with respect to the product topology on $\Omega $. It follows that ${\mathcal {T}}\subset \tilde{{\mathcal {T}}}$. To prove the converse, we must show that each projection map $\lambda _j$ is continuous with respect to ${\mathcal {T}}$. Let $x_1,x_2$ be separated points in ${\mathcal {M}}$ and consider $f:\{0,1\}\rightarrow {\mathcal {M}}$ with $f(0)=x_1$ and $f(1)=x_2$. Then the composition $f\circ \lambda _j$ is a general algorithm and hence continuous with respect to ${\mathcal {T}}$. But this implies that $\lambda _j$ is continuous. It follows from Proposition 5.16 that $\{\Xi ,\Omega ,\Lambda \}\in \Delta ^G_{1}$ if and only if $\Xi $ is continuous.

Now the space $(\Omega ,{\mathcal {T}})$ is zero-dimensional and ${\mathcal {M}}$ is separable, hence by Theorem 5.14, any element of ${\mathcal {B}}_1$ is a limit of continuous functions. The converse holds in greater generality. It follows that $\Xi \in {\mathcal {B}}_m$ if and only if there are $f_{n_m,\ldots ,n_1}\in \Delta ^G_{1}$ with

$$\begin{aligned} \Xi (a)=\lim _{n_m\rightarrow \infty }\cdots \lim _{n_1\rightarrow \infty }f_{n_m,\ldots ,n_1}(a). \end{aligned}$$

(5.5)

If this holds, then there exist general algorithms $\Gamma _{n_m,\ldots ,n_1}$ such that for all $a\in \Omega $,

$$\begin{aligned} d(\Gamma _{n_m,\ldots ,n_1}(a),f_{n_m,\ldots ,n_1}(a))\le 2^{-n_1} \end{aligned}$$

and hence

$$\begin{aligned} \lim _{n_m\rightarrow \infty }\cdots \lim _{n_1\rightarrow \infty }\Gamma _{n_m,\ldots ,n_1}(a)=\Xi (a) \end{aligned}$$

so that $\{\Xi ,\Omega ,\Lambda \}\in \Delta ^G_{m+1}$. Conversely, if $\{\Xi ,\Omega ,\Lambda \}\in \Delta ^G_{m+1}$ with tower of algorithms $\{\Gamma _{n_{m},\ldots ,n_1}\}$, then since each general algorithm is continuous, (5.5) holds with $f_{n_m,\ldots ,n_1}(a)=\Gamma _{n_{m},\ldots ,n_1}(a)$. $\square $

5.4 Combinatorial Problems High up in the SCI Hierarchy

We can now combine the results of the previous two subsections and obtain combinatorial problems high up in the SCI hierarchy. Let $k\in {\mathbb {N}}_{\ge 2}$ and let $\Omega _k$ denote the collection of all infinite arrays $\{a_{m_1,\ldots ,m_k}\}_{m_1,\ldots ,m_k\in {\mathbb {N}}}$ with entries $a_{m_1,\ldots ,m_k}\in \{0,1\}$. As usual, $\Lambda _k$ is the set of component-wise evaluations/projections. Consider the formulas

$$\begin{aligned} P(a,m_1,\ldots ,m_{k-2})&={\left\{ \begin{array}{ll} 1,\quad \text {if }\, \exists i \, \forall j\, \exists n>j \text { s.t. }a_{m_1,\ldots ,m_{k-2},n,i}=1\\ 0,\quad \text {otherwise} \end{array}\right. },\\ Q(a,m_1,\ldots ,m_{k-2})&={\left\{ \begin{array}{ll} 1,\quad \text {if } \, \forall ^{\infty }i\, \forall \, j\, \exists n>j \text { s.t. }a_{m_1,\ldots ,m_{k-2},n,i}=1\\ 0,\quad \text {otherwise} \end{array}\right. }, \end{aligned}$$

where $\forall ^{\infty }$ means “for all but a finite number of”. In words, P decides whether the corresponding matrix has a column with infinitely many 1’s, whereas Q decides whether the matrix has only finitely many columns with only finitely many 1’s. For $R=P$ or Q, consider the problem function for $a\in \Omega _k$

$$\begin{aligned} \Xi _{k,R}(a)={\left\{ \begin{array}{ll} \exists m_1\text { } \forall m_2\text { }\ldots \text { }\forall m_{k-2} R(a,m_1,\ldots ,m_{k-2}),\quad \text {if }k\text { is even}\\ \forall m_1\text { } \exists m_2\text { }\ldots \text { }\forall m_{k-2} R(a,m_1,\ldots ,m_{k-2}),\quad \text {otherwise} \end{array}\right. }, \end{aligned}$$

that is, so that all quantifier types alternate.

Theorem 5.19

Let ${\mathcal {M}}$ be either $\{0,1\}$ with the discrete metric or [0, 1] with the usual metric and consider the above problems $\{\Xi _{k,R},\Omega _k,{\mathcal {M}},\Lambda _k\}$. For $k\in {\mathbb {N}}_{\ge 2}$ and $R=P$ or Q,

$$\begin{aligned} \Delta _{k+1}^G\not \ni \{\Xi _{k,R},\Omega _{k},{\mathcal {M}},\Lambda _k\}\in \Delta _{k+2}^A. \end{aligned}$$

In other words, we can solve the problem via a height $k+1$ arithmetic tower, but it is impossible to do so with a height k general tower.

Remark 5.20

Note that we allow both discrete and continuous spaces ${\mathcal {M}}$, which will be important for our reduction arguments when proving lower bounds for classifications of spectral problems for non-discrete ${\mathcal {M}}$. The lower bound is a strong result in the sense that it holds regardless of the model of computation. In other words, it is the intrinsic combinatorial complexity of the problems that makes them hard.$\square $

Proof

We deal with the case of $R=P$ since the case of $R=Q$ is completely analogous. It is easy to see that $\{\Xi _{k,P},\Omega _{k},{\mathcal {M}},\Lambda _k\}\in \Delta _{k+2}^A$. First consider the case $k=2$ and set

$$\begin{aligned} \Gamma _{n_3,n_2,n_1}(a)=\max _{j\le n_3}\chi _{(n_2,\infty )}\left( \sum _{i=1}^{n_1} a_{i,j}\right) , \end{aligned}$$

where $\chi _C$ denotes the indicator function of a set C. This algorithm decides whether there exists a column with index at most $n_3$ such that there are more than $n_2$ 1’s in the first $n_1$ rows. This is clearly an arithmetic tower and it is straightforward to show that this converges to $\Xi _{2,P}$ in ${\mathcal {M}}$ (in either of the $\{0,1\}$ and [0, 1] cases). For $k>2$, we simply alternate taking products (which corresponds to minima in this case) and maxima. Explicitly, we set
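The base case $k=2$ can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the array is supplied as a hypothetical oracle `a(i, j)` returning the entry in row i and column j (indices starting at 1), and the names `gamma_2P`, `second_column_full` and `diagonal_only` are ours, not part of the construction above.

```python
from typing import Callable


def gamma_2P(a: Callable[[int, int], int], n3: int, n2: int, n1: int) -> int:
    """Return 1 if some column j <= n3 has more than n2 ones in rows 1..n1, else 0."""
    for j in range(1, n3 + 1):
        ones = sum(a(i, j) for i in range(1, n1 + 1))
        if ones > n2:  # indicator function of the interval (n2, infinity)
            return 1
    return 0


# Hypothetical test arrays (not taken from the text).
def second_column_full(i: int, j: int) -> int:
    """Column 2 consists entirely of ones, all other entries are zero."""
    return 1 if j == 2 else 0


def diagonal_only(i: int, j: int) -> int:
    """Every column contains exactly one 1 (on the diagonal)."""
    return 1 if i == j else 0
```

Each call reads only the entries with row index at most $n_1$ and column index at most $n_3$, which is what makes every level of the tower a finite computation. For `diagonal_only`, sending $n_1\rightarrow \infty $ and then $n_2\rightarrow \infty $ gives 0 for every $n_3$, whereas for `second_column_full` the value is 1 as soon as $n_3\ge 2$.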

$$\begin{aligned}{} & {} \Gamma _{n_{k+1},\ldots ,n_1}(a)\\ {}{} & {} ={\left\{ \begin{array}{ll} \displaystyle \max _{m_{1}\le n_{k+1}}\displaystyle \prod _{m_{2}=1}^{n_{k}}\cdots \displaystyle \prod _{m_{k-2}=1}^{n_4}\left\{ \displaystyle \max _{j\le n_3}\chi _{(n_2,\infty )}\left( \sum _{i=1}^{n_1} a_{m_1,\ldots ,m_{k-2},i,j}\right) \right\} ,\quad \text {if }k\text { is even}\\ \displaystyle \prod _{m_{1}=1}^{n_{k+1}}\displaystyle \max _{m_{2}\le n_{k}}\cdots \displaystyle \prod _{m_{k-2}=1}^{n_4}\left\{ \displaystyle \max _{j\le n_3}\chi _{(n_2,\infty )}\left( \sum _{i=1}^{n_1} a_{m_1,\ldots ,m_{k-2},i,j}\right) \right\} ,\quad \text {otherwise}. \end{array}\right. } \end{aligned}$$

Again, this is an arithmetic tower and it is straightforward to show that this converges to $\Xi _{k,P}$ in ${\mathcal {M}}$. It also holds that $\{\Xi _{k,P},\Omega _{k},{\mathcal {M}},\Lambda _k\}\in \Sigma _{k+1}^A$ if k is even and $\{\Xi _{k,P},\Omega _{k},{\mathcal {M}},\Lambda _k\}\in \Pi _{k+1}^A$ if k is odd (not to be confused with the notation for the Borel hierarchy).

Recall the topology ${\mathcal {T}}$ on $\Omega _k$ from Theorem 5.17. For the lower bound, we note that P is $\Sigma _3^0$ complete (in the literature it is known as the problem “$S_3$”, see for example [98, Section 23]). Since $(\Omega _{k},{\mathcal {T}})$ is zero-dimensional, a theorem of Wadge implies that this means that P is the indicator function of a set, also denoted by P, which lies in $\Sigma _3^0(\Omega _k)$ but not $\Pi _3^0(\Omega _k)$. It also follows that $\Xi _{k,P}$ is $\Sigma _{k+1}^0(\Omega _k)$ complete if k is even and $\Pi _{k+1}^0(\Omega _k)$ complete otherwise. Now suppose for a contradiction that $\{\Xi _{k,P},\Omega _{k},{\mathcal {M}},\Lambda _k\}\in \Delta _{k+1}^G$. But then Theorem 5.17 implies that $\Xi _{k,P}\in {\mathcal {B}}_k(\Omega _k,{\mathcal {M}})$ and hence by Theorem 5.14, $\Xi _{k,P}$ is $\Sigma _{k+1}^0(\Omega _k)$ measurable. $\Xi _{k,P}$ is the indicator function of a set, which we denote by $\Xi _{k,P}$ with an abuse of notation, which is either $\Sigma _{k+1}^0(\Omega _k)$ or $\Pi _{k+1}^0(\Omega _k)$ complete depending on the parity of k. But 0 and 1 are separated in ${\mathcal {M}}$ and hence since $\Xi _{k,P}$ is $\Sigma _{k+1}^0(\Omega _k)$ measurable, $\Xi _{k,P}$ and its complement both lie in $\Sigma _{k+1}^0(\Omega _k)$. It follows that $\Xi _{k,P}\in \Sigma _{k+1}^0(\Omega _k)\cap \Pi _{k+1}^0(\Omega _k)$, contradicting the stated completeness. $\square $

For our applications to spectral problems, we will use ${\tilde{\Omega }}$ to denote $\Omega _k$ and consider

$$\begin{aligned} \begin{aligned} {\tilde{\Xi }}_1=\Xi _{2,P},\quad {\tilde{\Xi }}_2=\Xi _{2,Q},\\ {\tilde{\Xi }}_3=\Xi _{3,P},\quad {\tilde{\Xi }}_4=\Xi _{3,Q}. \end{aligned} \end{aligned}$$

(5.6)

Theorem 5.19 holds for a much wider class of decision problems, but these four are the only ones we shall use in the sequel. The decision problems ${{\tilde{\Xi }}}_1$ and ${{\tilde{\Xi }}}_2$ were shown to have $\textrm{SCI}_G=3$ in [18], but only with regard to the discrete space ${\mathcal {M}}=\{0,1\}$ and the proof used a somewhat complicated Baire category argument. Theorem 5.19 is much more general, can be extended to arbitrarily large SCI, and has a much slicker proof, making clear a beautiful connection with the Baire hierarchy for well-behaved $\Omega $.

6 Proofs Concerning Spectral Radii, Essential Spectral Radii, Capacity and Operator Norms

Here we prove the theorems found in Sects. 3.1–3.3. First, we briefly recall $\Sigma _1^A$ algorithms for spectral problems presented in [64], that are sharp in the SCI hierarchy. The algorithms constructed in [64] are shown as pseudocode in “Appendix A”, where we also refer the reader to a more detailed account. The following was proven in [64] and was generalised in [60] to unbounded operators:

Theorem 6.1

For each $\Omega _f$ and $\Omega _f\cap \Omega _g$, consider the family $\Lambda $ consisting of $\Lambda _1$, together with pointwise evaluation of $f,\{c_n\}$ (and evaluation of g at rational points if considering $\Omega _f\cap \Omega _g$). The algorithms presented in “Appendix A” achieve $\Sigma _1^A$ error control. In particular the following classification holds:

$$\begin{aligned} \Delta _1^G\not \ni \{\textrm{Sp}(\cdot ),\Omega _f\cap \Omega _g,\Lambda _1\}\in \Sigma _1^A,\quad \Delta _1^G\not \ni \{\textrm{Sp}_\epsilon (\cdot ),\Omega _f,\Lambda _1\}\in \Sigma _1^A. \end{aligned}$$

We now turn to the proof of Theorem 3.3, dealing first with the evaluation set $\Lambda _1$. Suppose that $\{{\tilde{\Gamma }}_{n_k,\ldots ,n_1}\}$ is a $\Pi ^A_k$ tower of algorithms to compute the spectrum of a class of operators, where the output is a finite set for each $n_1,\ldots ,n_k$. It is then clear that

$$\begin{aligned} \Gamma _{n_k,\ldots ,n_1}(A)=\sup _{z\in {\tilde{\Gamma }}_{n_k,\ldots ,n_1}(A)}\left| z\right| +\frac{1}{2^{n_k}} \end{aligned}$$

provides a $\Pi ^A_k$ tower of algorithms for the spectral radius. Strictly speaking, the above may not be an arithmetic tower owing to the absolute value. But it can be approximated to arbitrary precision (from above say), the error of which can be absorbed in the first limit. In what follows, we always assume this is done without further comment. Similarly if $\{{\tilde{\Gamma }}_{n_k,\ldots ,n_1}\}$ provides a $\Sigma ^A_k$ tower of algorithms for the spectrum (and outputs a finite set for each $n_1,\ldots ,n_k$),

$$\begin{aligned} \Gamma _{n_k,\ldots ,n_1}(A)=\sup _{z\in {\tilde{\Gamma }}_{n_k,\ldots ,n_1}(A)}\left| z\right| -\frac{1}{2^{n_k}} \end{aligned}$$

provides a $\Sigma ^A_k$ tower of algorithms for the spectral radius. If we only have a height k tower with no $\Sigma _k$ or $\Pi _k$ type error control for the spectrum, then taking the supremum of absolute values shows that we get a height k tower for the spectral radius.
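As a minimal sketch of the remark on approximating the absolute value from above, the following Python code uses exact rational arithmetic only. It assumes the output set is non-empty and given as pairs of rational real and imaginary parts; the names `abs_upper` and `radius_from_above`, and the separate precision parameter `n`, are illustrative assumptions rather than part of the algorithms of [64] or [18].

```python
from fractions import Fraction
from math import isqrt


def abs_upper(re: Fraction, im: Fraction, n: int) -> Fraction:
    """Rational r with |z| < r <= |z| + 2**-n, computed exactly."""
    s = (re * re + im * im) * 4**n       # exact rational value of |z|^2 * 4^n
    t = s.numerator // s.denominator     # floor(s), a nonnegative integer
    return Fraction(isqrt(t) + 1, 2**n)  # isqrt(t) + 1 > sqrt(s) >= isqrt(t)


def radius_from_above(points, n_k: int, n: int) -> Fraction:
    """Maximum of |z| over a finite output set, shifted up by 2**-n_k."""
    return max(abs_upper(re, im, n) for re, im in points) + Fraction(1, 2**n_k)
```

For instance, `abs_upper(Fraction(3), Fraction(4), 10)` returns `Fraction(5121, 1024)`, that is $5+2^{-10}$.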

The facts that $\{\Xi _r,\Omega _\textrm{D}\} \in \Sigma ^A_1$, $\{\Xi _r,\Omega _f\cap \Omega _g\} \in \Sigma ^A_1$, $\{\Xi _r,\Omega _g\} \in \Sigma ^A_2$, $\{\Xi _r,\Omega _f\} \in \Pi ^A_2$ and $\{\Xi _r,\Omega _{\textrm{B}}\} \in \Pi ^A_3$ hence follow from Theorem 6.1 and the results of [18]. It is clear that $\{\Xi _r,\Omega _\textrm{D}\}\notin \Delta ^G_1$ and this also shows that $\{\Xi _r,\Omega _\textrm{N}\}\notin \Delta ^G_1$ and $\{\Xi _r,\Omega _f\cap \Omega _g\}\notin \Delta ^G_1$. Hence, we must show the positive result that $\{\Xi _r,\Omega _\textrm{N}\} \in \Sigma ^A_1$ and prove the lower bounds $\{\Xi _r,\Omega _g\}\notin \Delta ^G_2$, $\{\Xi _r,\Omega _f\}\notin \Delta ^G_2$ and $\{\Xi _r,\Omega _{\textrm{B}}\}\notin \Delta ^G_3$.

Proof of Theorem 3.3 for $\Lambda _1$ Throughout this proof, we use the evaluation set $\Lambda _1$, which we drop from the notation for convenience.

Step 1 $\{\Xi _r,\Omega _\textrm{N}\} \in \Sigma ^A_1$. Recall that the spectral radius of a normal operator $A\in \Omega _{\textrm{B}}$ is equal to its operator norm. Consider the finite section matrices $P_nAP_n\in {\mathbb {C}}^{n\times n}$. It is straightforward to show that

$$\begin{aligned} \left\| P_nAP_n\right\| \uparrow \left\| A\right\| \quad \text { as }n\rightarrow \infty . \end{aligned}$$

The norm $\left\| P_nAP_n\right\| $ is the square root of the largest eigenvalue of the positive semi-definite self-adjoint matrix $(P_nAP_n)^*(P_nAP_n)$. This can be estimated from below to an accuracy of 1/n using Corollary 6.9 of [60], which then yields a $\Sigma _1^A$ algorithm for $\{\Xi _r,\Omega _\textrm{N}\}$.
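The monotone convergence of the finite section norms can be illustrated numerically. The sketch below is not the arithmetic procedure of Corollary 6.9 of [60]: it swaps in a floating-point power iteration for real matrices and uses the fact that every Rayleigh quotient of $B^TB$ is a lower bound for $\Vert B\Vert ^2$. The entry oracle `jacobi` and all function names are hypothetical choices for the example.

```python
import math


def finite_section(entry, n):
    """n x n matrix P_n A P_n built from an entry oracle entry(i, j), 1-indexed."""
    return [[entry(i, j) for j in range(1, n + 1)] for i in range(1, n + 1)]


def matvec(M, x):
    """Matrix-vector product for a list-of-rows matrix."""
    return [sum(m * xj for m, xj in zip(row, x)) for row in M]


def norm_lower_bound(B, iters=500):
    """Lower bound for ||B|| from Rayleigh quotients of B^T B (real B only)."""
    n = len(B)
    Bt = [list(col) for col in zip(*B)]
    x = [1.0] * n
    best = 0.0
    for _ in range(iters):
        y = matvec(Bt, matvec(B, x))  # y = B^T B x
        xx = sum(v * v for v in x)
        # The Rayleigh quotient <x, B^T B x> / <x, x> never exceeds ||B||^2.
        best = max(best, sum(u * v for u, v in zip(x, y)) / xx)
        ny = math.sqrt(sum(v * v for v in y))
        if ny == 0.0:
            break
        x = [v / ny for v in y]
    return math.sqrt(best)


# Hypothetical test operator: A e_j = e_{j-1} + e_{j+1}, self-adjoint with ||A|| = 2.
def jacobi(i, j):
    return 1.0 if abs(i - j) == 1 else 0.0
```

For this test operator the exact value is $\Vert P_nAP_n\Vert =2\cos (\pi /(n+1))$, so the computed lower bounds increase towards $\Vert A\Vert =2$ as n grows.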

Step 2 $\{\Xi _r,\Omega _g\}\notin \Delta ^G_2$. Recall that we assumed the existence of a $\delta \in (0,1)$ such that $g(x)\le (1-\delta )x$. Let $\epsilon >0$, then it is easy to see that the matrices

$$\begin{aligned} S_{\pm }(\epsilon )=\begin{pmatrix} 1 &{} 0 \\ \pm \epsilon &{} 1 \end{pmatrix} \end{aligned}$$

have norm bounded by $1+\epsilon +\epsilon ^2$ and are clearly inverses of each other. Choose $\epsilon $ small such that $(1+\epsilon +\epsilon ^2)^2\le 1/(1-\delta )$. If $B\in {\mathbb {C}}^{2\times 2}$ is normal, it follows that ${\hat{B}}:=S_{+}(\epsilon )BS_{-}(\epsilon )$ lies in $\Omega _g$ and has the same spectrum as B. We choose

$$\begin{aligned} {\hat{B}}=S_{+}(\epsilon )\begin{pmatrix} 1 &{} -\epsilon \\ -\epsilon &{} 0 \end{pmatrix}S_{-}(\epsilon )=\begin{pmatrix} 1+\epsilon ^2 &{} -\epsilon \\ \epsilon ^3 &{} -\epsilon ^2 \end{pmatrix}. \end{aligned}$$

The crucial property of ${\hat{B}}$ is that the first entry $1+\epsilon ^2$ is strictly greater in magnitude than the two eigenvalues $(1\pm \sqrt{1+4\epsilon ^2})/2$.
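The similarity transform and the crucial property can be checked numerically. The following Python sketch recomputes ${\hat{B}}$ from $S_{\pm }(\epsilon )$ in floating-point arithmetic and compares its first entry with the eigenvalues; the function names and the value $\epsilon =0.1$ in the usage note are illustrative assumptions.

```python
import math


def matmul2(X, Y):
    """Product of two 2 x 2 matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def S(eps):
    """S_+(eps); S_-(eps) is obtained as S(-eps)."""
    return [[1.0, 0.0], [eps, 1.0]]


def B_hat(eps):
    """S_+(eps) B S_-(eps) for the symmetric matrix B = [[1, -eps], [-eps, 0]]."""
    B = [[1.0, -eps], [-eps, 0.0]]
    return matmul2(matmul2(S(eps), B), S(-eps))


def eigenvalues(eps):
    """Roots of lambda^2 - lambda - eps^2 = 0, the characteristic polynomial of B."""
    r = math.sqrt(1.0 + 4.0 * eps**2)
    return (1.0 + r) / 2.0, (1.0 - r) / 2.0
```

With $\epsilon =0.1$, the first entry of `B_hat(0.1)` is approximately 1.01, while the larger eigenvalue is approximately 1.0099. The gap is of order $\epsilon ^4$, but it is strictly positive for every $\epsilon >0$ since $\sqrt{1+4\epsilon ^2}<1+2\epsilon ^2$.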

Now suppose for a contradiction that a height one tower, $\{\Gamma _n\}$, solves the problem. We will gain a contradiction by showing that $\Gamma _n(A)$ does not converge for an operator of the form,

$$\begin{aligned} A=\bigoplus _{r=1}^\infty A_{l_r},\quad A_{m}:=\begin{pmatrix} 1+\epsilon ^2&{} &{} &{} &{}-\epsilon \\ &{}0&{} &{} &{} \\ &{} &{}\ddots &{} &{} \\ &{} &{} &{}0&{} \\ \epsilon ^3&{} &{} &{} &{}-\epsilon ^2\\ \end{pmatrix} \in {\mathbb {C}}^{m\times m}, \end{aligned}$$

where we only consider $l_r\ge 3$. Each $A_m$ is unitarily equivalent to the matrix ${\hat{B}}\oplus 0\in {\mathbb {C}}^{m\times m}$ and has spectrum equal to $\{0,(1\pm \sqrt{1+4\epsilon ^2})/2\}$. Any A of the above form is unitarily equivalent to a direct sum of an infinite number of ${\hat{B}}$’s and the zero operator and hence lies in $\Omega _g$. Now suppose that $l_1,\ldots ,l_k$ have been chosen and consider the operator

$$\begin{aligned} B_k=A_{l_1}\oplus \dots \oplus A_{l_k}\oplus C,\quad C=\textrm{diag}\{1+\epsilon ^2,0,\ldots \}. \end{aligned}$$

The spectrum of $B_k$ is $\{0,(1\pm \sqrt{1+4\epsilon ^2})/2,1+\epsilon ^2\}$, and hence there exist $\eta >0$ (independent of k) and $n(k)\ge k$ such that $\Gamma _{n(k)}(B_k)>(1+\sqrt{1+4\epsilon ^2})/2+\eta $. But $\Gamma _{n(k)}(B_k)$ can only take the evaluations of the matrix entries $\{B_k\}_{ij}=\langle B_ke_j,e_i \rangle $ with $i,j\le N(B_k,n(k))$ (as well as evaluations of the function g) into account. If we choose $l_{k+1}>N(B_k,n(k))$ then by the assumptions in Definition 5.1, $\Gamma _{n(k)}(A)=\Gamma _{n(k)}(B_k)>(1+\sqrt{1+4\epsilon ^2})/2+\eta $. But $\Gamma _n(A)$ must converge to $(1+\sqrt{1+4\epsilon ^2})/2$, a contradiction.

Step 3 $\{\Xi _r,\Omega _f\}\notin \Delta ^G_2$. Suppose for a contradiction that a height one tower, $\{\Gamma _n\}$, solves the problem. We will gain a contradiction by showing that $\Gamma _n(A)$ does not converge for an operator of the form

$$\begin{aligned} A=\bigoplus _{r=1}^\infty C_{l_r}\oplus A_{l_r},\quad{} & {} A_{m}:=\begin{pmatrix} 0&{} 1&{} &{} &{}\\ &{}0&{} 1&{} &{} \\ &{} &{}\ddots &{} \ddots &{} \\ &{} &{} &{}&{} 1\\ &{} &{} &{} &{}0\\ \end{pmatrix} \in {\mathbb {C}}^{m\times m},\quad \\ {}{} & {} C_m=\textrm{diag}\{0,0,\ldots ,0\}\in {\mathbb {C}}^{m\times m}, \end{aligned}$$

where we assume that $l_r\ge r$ to ensure that the spectrum of A is equal to the unit disc $B_1(0)$. Note that the function $f(n)=n+1$ will do for the bounded dispersion with $c_n=0$. Now suppose that $l_1,\ldots ,l_k$ have been chosen and consider the operator

$$\begin{aligned} B_k=\big (C_{l_1}\oplus A_{l_1}\big )\oplus \cdots \oplus \big (C_{l_k}\oplus A_{l_k}\big )\oplus C,\quad C=\textrm{diag}\{0,0,\ldots \}. \end{aligned}$$

The spectrum of $B_k$ is $\{0\}$ and hence there exists $n(k)\ge k$ such that $\Gamma _{n(k)}(B_k)<1/4$. But $\Gamma _{n(k)}(B_k)$ can only take the evaluations of the matrix entries $\{B_k\}_{ij}=\langle B_ke_j,e_i \rangle $ with $i,j\le N(B_k,n(k))$ (as well as evaluations of the function f) into account. If we choose $l_{k+1}>N(B_k,n(k))$, then by the assumptions in Definition 5.1, $\Gamma _{n(k)}(A)=\Gamma _{n(k)}(B_k)<1/4$. But $\Gamma _n(A)$ must converge to 1, a contradiction.

Step 4 $\{\Xi _r,\Omega _{\textrm{B}}\}\notin \Delta ^G_3$. Suppose for a contradiction that $\{\Gamma _{n_2,n_1}\}$ is a height two (general) tower and without loss of generality, assume it to be nonnegative. We use the results of Sect. 5. Let $({\mathcal {M}},d)$ be the space [0, 1] with the usual metric (note in particular this is not discrete so we use Remark 5.20), let ${{\tilde{\Omega }}}$ denote the collection of all infinite matrices $\{a_{i,j}\}_{i,j\in {\mathbb {N}}}$ with entries $a_{i,j}\in \{0,1\}$ and recall the problem function

$$\begin{aligned} {{\tilde{\Xi }}}_1(\{a_{i,j}\}):\text { Does }\{a_{i,j}\}\text { have a column containing infinitely many nonzero entries?} \end{aligned}$$

Theorem 5.19 in Sect. 5 shows that $\textrm{SCI}({{\tilde{\Xi }}}_1,{{\tilde{\Omega }}})_{G} = 3$. We will gain a contradiction by using the supposed height two tower to solve $\{{{\tilde{\Xi }}}_1,{{\tilde{\Omega }}}\}$.

Without loss of generality, identify $\Omega _{\textrm{B}}$ with ${\mathcal {B}}(X)$ where $X=\bigoplus _{j=1}^{\infty }X_j$ in the $l^2$-sense with $X_j=l^2({\mathbb {N}})$. Now let $\{a_{i,j}\}\in {{\tilde{\Omega }}}$ and define $B_j\in {\mathcal {B}}(X_j)$ with the matrix representation

$$\begin{aligned} (B_{j})_{k,i}= {\left\{ \begin{array}{ll} 1, &{} \text {if } k=i\text { and }a_{k,j}=0\\ 1, &{} \text {if } k<i,\ a_{k,j}=a_{i,j}=1\text { and }a_{l,j}=0\text { for }k<l<i\\ 0, &{} \text {otherwise}. \end{array}\right. } \end{aligned}$$

Let ${\mathcal {I}}_j$ be the index set of all i where $a_{i,j}=1$. $B_j$ acts as a unilateral shift on $\overline{\textrm{span}}\{e_k:k\in {\mathcal {I}}_j\}$ and the identity on its orthogonal complement. It follows that

$$\begin{aligned} \textrm{Sp}(B_j)= {\left\{ \begin{array}{ll} \{1\}, &{} \text {if } {\mathcal {I}}_j=\emptyset \\ \{0,1\}, &{} \text {if } {\mathcal {I}}_j\text { is finite and non-empty}\\ {\mathbb {D}}\quad (\text {the unit disc}),&{} \text {if } {\mathcal {I}}_j\text { is infinite}. \end{array}\right. } \end{aligned}$$

For the matrix $\{a_{i,j}\}$ define $A\in \Omega _{\textrm{B}}$ by

$$\begin{aligned} A=\bigoplus _{j=1}^{\infty }(B_j-\frac{1}{2}I_j), \end{aligned}$$

where $I_j$ denotes the identity operator on $X_j$. Then $\textrm{Sp}(A)=\overline{\cup _{j=1}^\infty \textrm{Sp}(B_j)}-\frac{1}{2}$.

Hence, we see that

$$\begin{aligned} \Xi _r(A)= {\left\{ \begin{array}{ll} \frac{1}{2}, &{} \text {if }{{\tilde{\Xi }}}_1(\{a_{i,j}\})=0\\ \frac{3}{2}, &{} \text {if }{{\tilde{\Xi }}}_1(\{a_{i,j}\})=1. \end{array}\right. } \end{aligned}$$

We then set ${\tilde{\Gamma }}_{n_2,n_1}(\{a_{i,j}\})=\min \{\max \{\Gamma _{n_2,n_1}(A)-1/2,0\},1\}$. It is clear that this defines a general algorithm mapping into [0, 1]. In particular, given N we can evaluate $\{A_{k,l}:k,l\le N\}$ using only finitely many evaluations of $\{a_{i,j}\}$, where we can use a bijection between canonical bases of $l^2({\mathbb {N}})$ and $\bigoplus _{j=1}^{\infty }X_j$ to view A as acting on $l^2({\mathbb {N}})$. But then $\{{{\tilde{\Gamma }}}_{n_2,n_1}\}$ provides a height two tower for $\{{{\tilde{\Xi }}}_1,{{\tilde{\Omega }}}\}$, a contradiction. $\square $
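The two ingredients of this reduction, the entries of $B_j$ and the rescaling of the tower output into [0, 1], can be sketched as follows. This is an illustration under a stated assumption: the shift rule is read as linking consecutive indices $k<i$ of ${\mathcal {I}}_j$ (so $a_{k,j}=a_{i,j}=1$ with zeros strictly in between), which is the reading consistent with the description of $B_j$ as a shift on $\overline{\textrm{span}}\{e_k:k\in {\mathcal {I}}_j\}$. The names `b_entry`, `clamp_tower_output` and the column oracle `col` are hypothetical.

```python
def b_entry(col, k, i):
    """Entry (B_j)_{k,i} for a 0/1 column oracle col(l), l = 1, 2, ...

    Assumption: the off-diagonal 1's link consecutive indices k < i with
    col(k) = col(i) = 1 and col(l) = 0 for all k < l < i.
    """
    if k == i:
        return 1 if col(k) == 0 else 0  # identity part, off the index set
    if k < i and col(k) == 1 and col(i) == 1:
        return 1 if all(col(l) == 0 for l in range(k + 1, i)) else 0
    return 0


def clamp_tower_output(gamma_value):
    """Map an approximation of the spectral radius (1/2 or 3/2 in the limit) into [0, 1]."""
    return min(max(gamma_value - 0.5, 0.0), 1.0)
```

Evaluating `b_entry(col, k, i)` only reads `col` at indices between k and i, so any finite block of A is determined by finitely many evaluations of $\{a_{i,j}\}$, as required of a general algorithm.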

Remark 6.2

The algorithm in step 1 of the above proof works for any operator whose operator norm is equal to its spectral radius. If, instead, the operator is spectraloid, meaning the spectral radius is equal to the numerical radius

$$\begin{aligned} w(A):=\sup \{\left| \left\langle Ax,x\right\rangle \right| :\left\| x\right\| =1\}, \end{aligned}$$

then a similar argument will hold by estimating $w(P_nAP_n)$. To do this, we need a way of computing w(A) to a given accuracy using finitely many arithmetic operations and comparisons (e.g. Lemma 7.1 below).$\square $

Proof of Theorem 3.3 for $\Lambda _2$ Here we prove the changes for $\Xi _r$ when we consider the evaluation set $\Lambda _2$. It is clear that the classifications in $\Sigma _1^A$ do not change. It is also easy to use the algorithm in Theorem 6.1 (now using $\Lambda _2$ to collapse the first limit and approximate $\gamma _n$; see “Appendix A”) to prove $\{\Xi _r,\Omega _g,\Lambda _2\}\in \Sigma _1^A$. Similarly we can use the algorithm for the spectrum of operators in $\Omega _f$ for $\Omega _{\textrm{B}}$ using $\Lambda _2$ to collapse the first limit and hence $\{\Xi _r,\Omega _{\textrm{B}},\Lambda _2\}\in \Pi _2^A$. Since $\Omega _f\subset \Omega _{\textrm{B}}$, it follows that we only need to prove $\{\Xi _r,\Omega _f,\Lambda _2\}\not \in \Delta _2^G$. This can be proven using exactly the same example and a similar argument to step 3 of the proof of Theorem 3.3 (hence omitted). $\square $

Proof of Theorem 3.6

We begin by proving the results for $\Lambda _1$. For the lower bounds, it is enough to show that $\{\Xi _{er},\Omega _\textrm{D},\Lambda _1\}\not \in \Delta _2^G$ and $\{\Xi _{er},\Omega _{\textrm{B}},\Lambda _1\}\not \in \Delta _3^G$. For the upper bounds, we must show that $\{\Xi _{er},\Omega _f,\Lambda _1\}\in \Pi _2^A$, $\{\Xi _{er},\Omega _{\textrm{B}},\Lambda _1\}\in \Pi _3^A$ and $\{\Xi _{er},\Omega _\textrm{N},\Lambda _1\}\in \Pi _2^A$. The lower bounds for $\Lambda _2$ follow from $\{\Xi _{er},\Omega _\textrm{D},\Lambda _1\}\not \in \Delta _2^G$ and for the upper bounds it is enough to prove $\{\Xi _{er},\Omega _{\textrm{B}},\Lambda _2\}\in \Pi _2^A$.

Step 1 $\{\Xi _{er},\Omega _\textrm{D},\Lambda _1\}\not \in \Delta _2^G$. This is the same argument as in step 3 of the proof of Theorem 3.3; however, now we replace $A_m$ by $A_m=\textrm{diag}\{1,1,\ldots ,1\}\in {\mathbb {C}}^{m\times m}$ and use the fact that $\Xi _{er}(B_k)=0$. It follows that given the proposed height one tower $\{\Gamma _n\}$ and the constructed A, $\Xi _{er}(A)=1$ but $\Gamma _{n(k)}(A)<1/4$, the required contradiction.

Step 2 $\{\Xi _{er},\Omega _{\textrm{B}},\Lambda _1\}\not \in \Delta _3^G$. This is the same argument as step 4 of the proof of Theorem 3.3.

Step 3 $\{\Xi _{er},\Omega _f,\Lambda _1\}\in \Pi _2^A$, $\{\Xi _{er},\Omega _{\textrm{B}},\Lambda _1\}\in \Pi _3^A$ and $\{\Xi _{er},\Omega _{\textrm{B}},\Lambda _2\}\in \Pi _2^A$. $\{\Xi _{er},\Omega _f,\Lambda _1\}\in \Pi _2^A$ follows immediately from the existence of a $\Pi _2^A$ tower of algorithms for the essential spectrum of operators in $\Omega _f$ proven in [18]. The output of this tower is a finite collection of rectangles with complex rational vertices; hence, we can gain an approximation of the maximum absolute value over this output to any given precision. This can be used to construct a $\Pi _2^A$ tower for $\{\Xi _{er},\Omega _f,\Lambda _1\}$. Similarly, $\{\Xi _{er},\Omega _{\textrm{B}},\Lambda _1\}\in \Pi _3^A$ follows from the $\Pi _3^A$ tower of algorithms for $\{\textrm{Sp}_{\textrm{ess}},\Omega _{\textrm{B}},\Lambda _1\}$ constructed in [18]. Finally, we can use $\Lambda _2$ to collapse the first limit of the algorithm for the essential spectrum in [18], giving a $\Pi _2^A$ algorithm, and this can be used to show $\{\Xi _{er},\Omega _{\textrm{B}},\Lambda _2\}\in \Pi _2^A$.

Step 4 $\{\Xi _{er},\Omega _\textrm{N},\Lambda _1\}\in \Pi _2^A$. A $\Pi _2^A$ tower is constructed in the proof of Theorem 3.10 for the essential numerical range, $W_e(A)$, of normal operators (using $\Lambda _1$), and this outputs a finite collection of points. For normal operators A, $W_e(A)$ is the convex hull of the essential spectrum and hence $\sup _{z\in W_e(A)}\left| z\right| $ is equal to $\Xi _{er}(A)$. Hence, a $\Pi _2^A$ tower for $\{\Xi _{er},\Omega _\textrm{N},\Lambda _1\}$ follows by taking the maximum absolute value over the tower for $W_e(A)$. $\square $

Proof of Theorem 3.7

Note that given a height k arithmetical tower $\{{{\widehat{\Gamma }}}_{n_k,\ldots ,n_1}(\cdot ,p)\}$ for $\Xi _{r,p}$ and a class $\Omega '$, we can build a $\Pi _{k+1}^A$ tower for $\{\Xi _{cap},\Omega '\}$ as follows. Let $p_1,p_2,\ldots $ be an enumeration of the monic polynomials with rational coefficients and ${{\tilde{\Gamma }}}_{n_k,\ldots ,n_1}(\cdot ,p)$ be an approximation to $|{{\widehat{\Gamma }}}_{n_k,\ldots ,n_1}(\cdot ,p)|^{1/\textrm{deg}(p)}$ to accuracy $1/n_1$ using finitely many arithmetic operations and comparisons. Define

$$\begin{aligned} \Gamma _{n_{k+1},\ldots ,n_1}(A)=\min _{1\le m\le n_{k+1}}{{\tilde{\Gamma }}}_{n_k,\ldots ,n_1}(A,p_m). \end{aligned}$$

The fact that this is a convergent $\Pi _{k+1}^A$ tower is clear. This, together with inclusions of the considered classes of operators, means that to prove the positive results we only need to prove $\{\Xi _{r,p},\Omega _f,\Lambda _1\}\in \Sigma _1^A$, $\{\Xi _{r,p},\Omega _{\textrm{B}},\Lambda _1\}\in \Sigma _2^A$ and $\{\Xi _{r,p},\Omega _{\textrm{B}},\Lambda _2\}\in \Sigma _1^A$. Likewise, for the negative results we only need to prove $\{\Xi _{cap},\Omega _{\textrm{D}},\Lambda _2\}\not \in \Delta ^G_2$ (the fact that $\{\Xi _{r,p},\Omega _{\textrm{D}},\Lambda _2\}\not \in \Delta ^G_1$ is obvious), $\{\Xi _{cap},\Omega _{\textrm{N}},\Lambda _1\}\not \in \Delta ^G_3$ and $\{\Xi _{r,p},\Omega _{\textrm{N}},\Lambda _2\}\not \in \Delta ^G_2$. We shall prove these results with $\Omega _{\textrm{N}}$ replaced by the class of self-adjoint bounded operators denoted by $\Omega _{\textrm{SA}}$.

Step 1 $\{\Xi _{r,p},\Omega _f,\Lambda _1\}\in \Sigma _1^A$. The function f and sequence $\{c_n\}$ allow us to compute the matrix elements of p(A) for any $A\in \Omega _f$ and polynomial p to arbitrary accuracy. We can then use the same argument as step 1 of the proof of Theorem 3.3, approximating $\Vert P_np(A)P_n\Vert $ instead of $\left\| P_nAP_n\right\| $.

Step 2 $\{\Xi _{r,p},\Omega _{\textrm{B}},\Lambda _1\}\in \Sigma _2^A$ and $\{\Xi _{r,p},\Omega _{\textrm{B}},\Lambda _2\}\in \Sigma _1^A$. For the first result, we note that

$$\begin{aligned} \lim _{m\rightarrow \infty }\Vert P_np(P_{m}AP_m)P_n\Vert =\Vert P_np(A)P_n\Vert \end{aligned}$$

and let $\Gamma _{n,m}(A,p)$ be an approximation of $\Vert P_np(P_{m}AP_m)P_n\Vert $ to accuracy 1/m, which can be computed in finitely many arithmetic operations and comparisons. To prove $\{\Xi _{r,p},\Omega _{\textrm{B}},\Lambda _2\}\in \Sigma _1^A$, for any given $A\in \Omega _{\textrm{B}}$ we can use $\Lambda _2$ to compute a function $f_A$ and sequence $\{c_n(A)\}$ bounding the dispersion such that $A\in \Omega _{f_A}$ and use step 1.

Step 3 $\{\Xi _{cap},\Omega _\textrm{SA},\Lambda _1\}\notin \Delta ^G_3$. Suppose for a contradiction that $\{\Gamma _{n_2,n_1}\}$ is a height two (general) tower for the problem and without loss of generality, assume it to be nonnegative. Our strategy will be as in the proof of Theorem 3.3 (recall also the results of Sect. 5). Let $({\mathcal {M}},d)$ be the space [0, 1] with the usual metric (note in particular this is not discrete so we use Remark 5.20), let ${\tilde{\Omega }}$ denote the collection of all infinite matrices $\{a_{i,j}\}_{i,j\in {\mathbb {N}}}$ with entries $a_{i,j}\in \{0,1\}$ and consider the problem function

$$\begin{aligned} {\tilde{\Xi }}_2(\{a_{i,j}\}):\text { Does }\{a_{i,j}\}\text { have (only) finitely many columns with (only) finitely many 1's?} \end{aligned}$$

Recall that it is shown in Sect. 5 that $\textrm{SCI}({\tilde{\Xi }}_2,{\tilde{\Omega }})_{G} = 3$. We will gain a contradiction by using the supposed height two tower to solve $\{{\tilde{\Xi }}_2,{\tilde{\Omega }}\}$. Without loss of generality, identify $\Omega _\textrm{SA}$ with self-adjoint operators in ${\mathcal {B}}(X)$ where $X=\bigoplus _{j=1}^{\infty }X_j$ in the $l^2$-sense with $X_j=l^2({\mathbb {N}})$. To proceed, we need the following elementary lemma, which will be useful in constructing examples of spectral pollution.

Lemma 6.3

Let $z_1,z_2,\ldots ,z_k\in [-1,1]$ and let $a_{j}=\sqrt{1-z_j^2}$ (taking the nonnegative square root, say). Then, the symmetric matrix

$$\begin{aligned} B(z_1,\ldots ,z_k)=\left( \begin{array}{ccccc|ccccc} z_1&{} 0&{} \cdots &{} &{} &{} a_1&{} 0&{} \cdots &{} &{} \\ 0 &{}z_2&{} 0 &{} \cdots &{} &{} 0 &{}a_2&{} 0 &{} \cdots &{} \\ \vdots &{} 0&{}\ddots &{} &{} &{} \vdots &{} 0&{}\ddots &{} &{} \\ &{}\vdots &{} &{} &{} &{} &{}\vdots &{} &{} &{}\\ &{} &{} &{} &{}z_k &{} &{} &{} &{} &{}a_k\\ \hline a_1&{} 0&{} \cdots &{} &{} &{} -z_1&{} 0&{} \cdots &{} &{} \\ 0 &{}a_2&{} 0 &{} \cdots &{} &{} 0 &{}-z_2&{} 0 &{} \cdots &{} \\ \vdots &{} 0&{}\ddots &{} &{} &{} \vdots &{} 0&{}\ddots &{} &{} \\ &{}\vdots &{} &{} &{} &{} &{}\vdots &{} &{} &{}\\ &{} &{} &{} &{}a_k &{} &{} &{} &{} &{}-z_k \end{array}\right) \in {\mathbb {C}}^{2k\times 2k} \end{aligned}$$

has eigenvalues $\pm 1$ (repeated k times).

Proof

By a change of basis, the above matrix is equivalent to a block diagonal matrix with blocks

$$\begin{aligned} \begin{pmatrix} z_j &{} a_j\\ a_j&{} -z_j \end{pmatrix}. \end{aligned}$$

These blocks have eigenvalues $\{-1,1\}$. $\square $
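Lemma 6.3 can be sanity-checked numerically without an eigenvalue solver. The sketch below is illustrative only (the helper names `build_B`, `matmul` and `max_deviation_from_identity` are hypothetical, not from the paper): it verifies that $B^2=I$ and $\textrm{tr}(B)=0$ in floating point. Since $B$ is real symmetric, $B^2=I$ forces every eigenvalue to be $\pm 1$, and a vanishing trace forces the two multiplicities to agree.

```python
import math

def build_B(zs):
    """Return the 2k x 2k symmetric matrix B(z_1,...,z_k) of Lemma 6.3 as nested lists."""
    k = len(zs)
    B = [[0.0] * (2 * k) for _ in range(2 * k)]
    for j, z in enumerate(zs):
        a = math.sqrt(1.0 - z * z)  # positive square root, as in the lemma
        B[j][j] = z
        B[j][k + j] = a
        B[k + j][j] = a
        B[k + j][k + j] = -z
    return B

def matmul(X, Y):
    """Plain matrix product of two square nested-list matrices."""
    n = len(X)
    return [[sum(X[i][m] * Y[m][j] for m in range(n)) for j in range(n)] for i in range(n)]

def max_deviation_from_identity(M):
    """Largest entrywise distance between M and the identity matrix."""
    n = len(M)
    return max(abs(M[i][j] - (1.0 if i == j else 0.0)) for i in range(n) for j in range(n))

zs = [0.3, -0.8, 1.0, 0.0]  # arbitrary test values in [-1, 1]
B = build_B(zs)
deviation = max_deviation_from_identity(matmul(B, B))  # ~0 means B^2 = I
trace = sum(B[i][i] for i in range(len(B)))            # 0 means equal multiplicities
```

The check uses floating-point arithmetic, so it only confirms the identities up to rounding error.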

Now choose a sequence of rational numbers $\{z_j\}_{j\in {\mathbb {N}}}\subset [-1,1]$ that is also dense in $[-1,1]$ and let $B_j=B(z_1,\ldots ,z_j)$. For each column of a given $\{a_{i,j}\}\in {{\tilde{\Omega }}}$, let the infinite matrix $C^{(j)}$ be defined as follows. If $k,l<j+1$ then $C^{(j)}_{kl}=z_{k}\delta _{k,l}$. Let r(i) denote the row of the ith one of the column $\{a_{i,j}\}_{i\in {\mathbb {N}}}$ (with $r(i)=\infty $ if $\sum _{m}a_{m,j}<i$ and $r(0)=0$). If $r(i)<\infty $ then for $k\le l$ define

$$\begin{aligned} C^{(j)}_{kl}={\left\{ \begin{array}{ll} a_p\delta _{k,l-(r(i)-r(i-1)-1)}, &{} p=1,\ldots ,j,l=r(i)+j\cdot (2i-1)+p-1\\ -z_p\delta _{k,l}, &{} p=1,\ldots ,j,l=r(i)+j\cdot (2i-1)+p-1\\ z_p\delta _{k,l}, &{} p=1,\ldots ,j,l=r(i)+2j\cdot i+p-1\\ 0, &{} \text {otherwise}, \end{array}\right. } \end{aligned}$$

and extend $C^{(j)}_{kl}$ below the diagonal to a symmetric matrix. The key property of this matrix is that if the column $\{a_{i,j}\}_{i\in {\mathbb {N}}}$ has infinitely many 1s, then $C^{(j)}$ is unitarily equivalent to an infinite direct sum of copies of $B_j$ together with the zero operator acting on some subspace (whose dimension is equal to the number of zeros in the column). In this case $\textrm{Sp}(C^{(j)})=\{-1,1,0\}$ or $\{-1,1\}$. On the other hand, if $\{a_{i,j}\}_{i\in {\mathbb {N}}}$ has finitely many 1s, then $C^{(j)}$ is unitarily equivalent to the direct sum of a finite number of copies of $B_j$, the diagonal operator $\textrm{diag}\{z_1,\ldots ,z_j\}$ and the zero operator acting on some subspace. In this case $\{z_1,\ldots ,z_j\}\subset \textrm{Sp}(C^{(j)})$. Let $A=\bigoplus _{j=1}^{\infty }C^{(j)}$. It is then clear that if ${{\tilde{\Xi }}}_2(\{a_{i,j}\})=1$, then $\textrm{Sp}(A)$ is a finite set; otherwise it is the entire interval $[-1,1]$.

Now we use the following facts for bounded self-adjoint operators A. If $\textrm{Sp}(A)$ is a finite set then $\Xi _{cap}(A)=0$, whereas if $\textrm{Sp}(A)=[-1,1]$ then $\Xi _{cap}(A)=1/2$ (this can be proven easily using the minimal $l^\infty $ norm property of monic Chebyshev polynomials). We then define ${\tilde{\Gamma }}_{n_2,n_1}(\{a_{i,j}\})=\min \{\max \{1-2\Gamma _{n_2,n_1}(A),0\},1\}$. It is clear that this defines a generalised algorithm. In particular, given N we can evaluate $\{A_{k,l}:k,l\le N\}$ using only finitely many evaluations of $\{a_{i,j}\}$, where we can use a bijection between canonical bases of $l^2({\mathbb {N}})$ and $\bigoplus _{j=1}^{\infty }X_j$ to view A as acting on $l^2({\mathbb {N}})$. We also have the convergence $\lim _{n_2\rightarrow \infty }\lim _{n_1\rightarrow \infty }{\tilde{\Gamma }}_{n_2,n_1}(\{a_{i,j}\})={{\tilde{\Xi }}}_2(\{a_{i,j}\})$, a contradiction.
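The capacity fact used in this step can be illustrated numerically. The sketch below assumes the standard identification of logarithmic capacity with the Chebyshev constant (the limit of the $n$th root of the minimal sup norm over monic polynomials of degree $n$); the function names are hypothetical. On $[-1,1]$ the monic Chebyshev polynomial $T_n/2^{n-1}$ has sup norm $2^{1-n}$, whose $n$th root tends to $1/2$. The helper `tilde_gamma` is the clamping map $\min \{\max \{1-2\Gamma ,0\},1\}$ from the text, which sends capacity $1/2$ to $0$ and capacity $0$ to $1$.

```python
import math

def monic_chebyshev(n, x):
    """Evaluate the monic Chebyshev polynomial T_n(x) / 2^(n-1) for x in [-1, 1], n >= 1."""
    return math.cos(n * math.acos(x)) / 2 ** (n - 1)

def sup_norm_root(n, samples=2001):
    """n-th root of the sampled sup norm of the monic Chebyshev polynomial on [-1, 1]."""
    grid = [-1.0 + 2.0 * i / (samples - 1) for i in range(samples)]
    sup = max(abs(monic_chebyshev(n, x)) for x in grid)
    return sup ** (1.0 / n)

def tilde_gamma(gamma_value):
    """Clamp 1 - 2*Gamma to [0, 1], as in the definition of the translated tower."""
    return min(max(1.0 - 2.0 * gamma_value, 0.0), 1.0)

# The n-th roots decrease towards cap([-1, 1]) = 1/2 as n grows.
estimates = {n: sup_norm_root(n) for n in (1, 2, 8, 32, 128)}
```

The grid contains the endpoint $x=1$, where $|T_n|=1$, so the sampled sup norm coincides with the true one here.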

Step 4 $\{\Xi _{cap},\Omega _{\textrm{D}},\Lambda _2\}\not \in \Delta ^G_2$. This is the same argument as in step 3 of the proof of Theorem 3.3. However, we now replace $A_m$ by $A_m=\textrm{diag}\{d_1,d_2,\ldots ,d_m\}\in {\mathbb {C}}^{m\times m}$, where $\{d_m\}$ is a sequence that is dense in $[-1,1]$, and use the fact that $\Xi _{cap}(B_k)=0$. It follows that given the proposed height one tower $\{\Gamma _n\}$ and the constructed A, $\Xi _{cap}(A)=1/2$ but $\Gamma _{n(k)}(A)<1/4$, the required contradiction.

Step 5 $\{\Xi _{r,p},\Omega _{\textrm{SA}},\Lambda _2\}\not \in \Delta ^G_2$. Recall that we are given some polynomial p of degree at least two. We assume without loss of generality that the zeros of p are $\pm 1$ and $\left| p(0)\right| >1$ (the more general case is similar). The argument is similar to step 3 of the proof of Theorem 3.3, but we spell it out since it uses Lemma 6.3. Suppose for a contradiction that a height one tower, $\{\Gamma _n\}$, solves the problem. We will gain a contradiction by showing that $\Gamma _n(A)$ does not converge for an operator of the form,

$$\begin{aligned} A=\bigoplus _{r=1}^\infty B(z_1,\ldots ,z_{l_r}), \end{aligned}$$

and define

$$\begin{aligned} C=\textrm{diag}\{z_1,z_2,\ldots \}\in \Omega _{\textrm{B}}. \end{aligned}$$

We assume that $l_r\ge r$ to ensure that the spectrum of A is equal to $\{-1,1\}$ and hence $\Xi _{r,p}(A)=0$. Now suppose that $l_1,\ldots ,l_k$ have been chosen and consider the operator

$$\begin{aligned} B_k=B(z_1)\oplus \cdots \oplus B(z_1,\ldots ,z_{l_k})\oplus C. \end{aligned}$$

The spectrum of $B_k$ is $[-1,1]$ so that $\Xi _{r,p}(B_k)>1$ and hence there exists $n(k)\ge k$ such that $\Gamma _{n(k)}(B_k)>1/4$. But $\Gamma _{n(k)}(B_k)$ can only take the evaluations of the matrix entries $\{B_k\}_{ij}=\langle B_ke_j,e_i \rangle $ with $i,j\le N(B_k,n(k))$ (as well as evaluations of the function f) into account. If we choose $l_{k+1}>N(B_k,n(k))$ then by the assumptions in Definition 5.1, $\Gamma _{n(k)}(A)=\Gamma _{n(k)}(B_k)>1/4$. But $\Gamma _n(A)$ must converge to 0, a contradiction. $\square $

Remark 6.4

(Efficiently computing the capacity) Listing the monic polynomials with rational coefficients in the above proof is very inefficient. In practice, it is much better to split the domain of interest into intervals (or squares if in the complex plane, but we stick to the self-adjoint case in the following discussion). Suppose that each interval has dyadic endpoints and a diameter of $2^{-n_2}$ and that our operator is self-adjoint with known bounded dispersion. One can then apply Lemma 3.21 (denoting the index of that tower by $n_1$) to obtain an interval covering of the spectrum which will converge as $n_1\rightarrow \infty $, modulo the possibility of isolated points of the spectrum located at the endpoints of the intervals. Since the capacity of a compact set is unaltered by adding finitely many points, we do not have to worry about the endpoints: the limit of the capacity of this covering as $n_1\rightarrow \infty $ will be the capacity of a covering of the spectrum. As $n_2\rightarrow \infty $, we can use the fact that capacity is right-continuous as a set function (for compact sets $E_n,E$ with $E_n\downarrow E$, one has $\textrm{cap}(E_n)\downarrow \textrm{cap}(E)$) to obtain a $\Pi _2^A$ algorithm. The point of this is that it reduces the computation of the resulting tower $\{\Gamma _{n_2,n_1}\}$ to computing the capacity of finite unions of disjoint closed intervals in ${\mathbb {R}}$. In our computational examples, we made use of the method in [109], which uses conformal mappings and can deal with thousands of intervals. $\square $
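The reduction to finite unions of disjoint closed intervals can be sketched as follows. This is not the conformal-mapping method of [109]; it only merges flagged dyadic cells and returns crude two-sided bounds that follow from monotonicity of capacity together with $\textrm{cap}([a,b])=(b-a)/4$. The function names and the representation of cells by integer indices are assumptions made for illustration.

```python
from fractions import Fraction

def merge_dyadic_cells(cells, n2):
    """Merge flagged cells [j/2^n2, (j+1)/2^n2] into disjoint closed intervals."""
    h = Fraction(1, 2 ** n2)
    merged = []
    for j in sorted(set(cells)):
        left, right = j * h, (j + 1) * h
        if merged and merged[-1][1] >= left:  # adjacent cells share an endpoint
            merged[-1][1] = max(merged[-1][1], right)
        else:
            merged.append([left, right])
    return [tuple(iv) for iv in merged]

def capacity_bounds(intervals):
    """Bounds on cap(union): the longest single interval below, the convex hull above."""
    lower = max(b - a for a, b in intervals) / 4
    upper = (intervals[-1][1] - intervals[0][0]) / 4
    return lower, upper

intervals = merge_dyadic_cells([0, 1, 2, 5], n2=2)  # cells of width 1/4
bounds = capacity_bounds(intervals)
```

For a single merged interval the two bounds coincide and give the exact capacity; for several intervals a dedicated routine is still needed to compute the capacity itself.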

7 Proofs Concerning Essential Numerical Ranges, Essential Spectra and Spectral Pollution

Proof of Theorem 3.10 for $\Xi _{we}$ For the lower bounds, it is enough to note that $\{\Xi _{we},\Omega _\textrm{D},\Lambda _2\}\not \in \Delta _2^G$ by the same argument as step 1 of the proof of Theorem 3.6. The construction is exactly the same but yields $d_{\textrm{H}}(\Gamma _{n(k)}(A),\{0\})\le 1/2$, whereas $\Xi _{we}(A)=[0,1]$. Hence, the proposed height one tower cannot converge. To construct a $\Pi _2^A$ tower for general operators, we need the following lemma:

Lemma 7.1

Let $B\in {\mathbb {C}}^{n\times n}$ and $\epsilon >0$. Then using finitely many arithmetic operations and comparisons, we can compute points $z_1,\ldots ,z_k\in {\mathbb {Q}}+i{\mathbb {Q}}$ such that

$$\begin{aligned} d_{\textrm{H}}(\{z_1,\ldots ,z_k\},W(B))\le \epsilon . \end{aligned}$$

Proof

Recall from step 1 of the proof of Theorem 3.3 that we can compute an upper bound $M\in {\mathbb {Q}}_+$ for $\Vert B\Vert $ in finitely many arithmetic operations and comparisons. Now choose points $x_1,\ldots ,x_k\in {\mathbb {Q}}^n$, each of norm at most 1, such that $d_{\textrm{H}}(\{x_1,\ldots ,x_k\},\{x\in {\mathbb {C}}^n:\Vert x\Vert =1\})<\epsilon /(3M)$. These can be computed in finitely many arithmetic operations and comparisons using generalised polar coordinates and approximations of trigonometric identities. It follows that

$$\begin{aligned} d_{\textrm{H}}(\{\langle Bx_1,x_1\rangle ,\ldots ,\langle Bx_k,x_k\rangle \},W(B))\le 2\epsilon /3. \end{aligned}$$

We then let each $z_j\in {\mathbb {Q}}+i{\mathbb {Q}}$ be an $\epsilon /4$ approximation of $\langle Bx_j,x_j\rangle $, which can be computed in finitely many arithmetic operations and comparisons. $\square $
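The sampling idea behind Lemma 7.1 can be sketched in the $2\times 2$ case. The code below is only an illustration: it uses floating-point arithmetic instead of the rational arithmetic of the lemma, and it parametrises unit vectors as $(\cos t,e^{i\phi }\sin t)$, which suffices because $\langle Bx,x\rangle $ is unchanged by a global phase. The function names and the test matrix are hypothetical.

```python
import cmath
import math

def quadratic_form(B, x):
    """Return <Bx, x> = sum_{i,j} B[i][j] x[j] conj(x[i])."""
    n = len(x)
    return sum(B[i][j] * x[j] * x[i].conjugate() for i in range(n) for j in range(n))

def sample_numerical_range_2x2(B, steps):
    """Sample W(B) for a 2x2 matrix using unit vectors (cos t, e^{i phi} sin t)."""
    points = []
    for a in range(steps + 1):
        t = 0.5 * math.pi * a / steps
        for b in range(steps):
            phi = 2.0 * math.pi * b / steps
            x = [complex(math.cos(t)), cmath.exp(1j * phi) * math.sin(t)]
            points.append(quadratic_form(B, x))
    return points

B = [[0, 2], [0, 0]]  # W(B) is the closed disc of radius 1 centred at 0
points = sample_numerical_range_2x2(B, steps=64)
largest = max(abs(w) for w in points)
```

Refining `steps` plays the role of shrinking the net of the unit sphere in the proof; every sample lies in $W(B)$ by construction.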

Remark 7.2

(Efficient computation) In practice, there are much more efficient methods of computation. For example, the method of Johnson [96] reduces the computation of W(A) for $A\in {\mathbb {C}}^{n\times n}$ to a series of $n\times n$ Hermitian eigenvalue problems. $\square $

It is well known that for $A\in \Omega _{\textrm{B}}$,

$$\begin{aligned} \overline{W(P_nA|_{P_n{\mathcal {H}}})}&\uparrow \overline{W(A)},\\ \overline{W((I-P_n)A|_{(I-P_n){\mathcal {H}}})}&\downarrow W_{e}(A). \end{aligned}$$

Given A, let $\Gamma _{n_2,n_1}(A)$ be a finite collection of points produced by the algorithm in Lemma 7.1 applied to $B=(I-P_{n_2})P_{n_1+n_2+1}A|_{P_{n_1+n_2+1}(I-P_{n_2}){\mathcal {H}}}$ and $\epsilon =1/n_1$. The above limits show that $\{\Gamma _{n_2,n_1}\}$ provides a $\Pi _2^A$ tower for $\{\Xi _{er},\Omega _{\textrm{B}},\Lambda _1\}$. $\square $
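The compression used in this tower only reads finitely many matrix entries. A minimal sketch, assuming 1-based matrix indices and a hypothetical test operator $A=\textrm{diag}(1,1/2,1/3,\ldots )$ (which is compact, so $W_e(A)=\{0\}$), is given below. For a real diagonal block the numerical range is the interval spanned by the diagonal entries, so no sampling is needed in this special case.

```python
def tail_compression(entry, n2, n1):
    """Matrix of (I - P_{n2}) P_{n1+n2+1} A restricted to span{e_{n2+1}, ..., e_{n1+n2+1}}.

    `entry(i, j)` returns the (i, j) matrix entry of A with 1-based indices.
    """
    idx = range(n2 + 1, n1 + n2 + 2)
    return [[entry(i, j) for j in idx] for i in idx]

def diag_entry(i, j):
    """Hypothetical test operator A = diag(1, 1/2, 1/3, ...)."""
    return 1.0 / i if i == j else 0.0

def hull_of_diagonal(block):
    """For a real diagonal block, W(block) is the interval spanned by its diagonal."""
    d = [block[i][i] for i in range(len(block))]
    return min(d), max(d)

block = tail_compression(diag_entry, n2=9, n1=5)  # rows and columns 10, ..., 15
interval = hull_of_diagonal(block)
```

Increasing `n1` enlarges the interval towards $[0,1/(n_2+1)]$, and increasing `n2` then shrinks it to $\{0\}$, matching the two limits in the text.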

Proof of Theorem 3.10 for $\Xi _{poll}^{{\mathbb {F}}}$ We will prove that $\{\Xi _{poll}^{{\mathbb {R}}},\Omega _\textrm{D},\Lambda _i\}\not \in \Delta ^G_3$ and $\{\Xi _{poll}^{{\mathbb {C}}},\Omega _{\textrm{B}},\Lambda _1\}\in \Sigma ^A_3$. The construction of towers for $\Xi _{poll}^{{\mathbb {R}}}$ is similar, as are the arguments for lower bounds.

Step 1 $\{\Xi _{poll}^{{\mathbb {C}}},\Omega _{\textrm{B}},\Lambda _1\}\in \Sigma ^A_3$. Let $\{{{\tilde{\Gamma }}}_{n_2,n_1}\}$ be the $\Pi _2^A$ tower for $\{\Xi _{er},\Omega _{\textrm{B}},\Lambda _1\}$ constructed above. Recall the definition

$$\begin{aligned} \gamma _{n_2,n_1}(z;A)=\min \{\sigma _{\textrm{inf}}(P_{n_1}(A-zI){|_{P_{n_2}{\mathcal {H}}}}),\sigma _{\textrm{inf}}(P_{n_1}(A^*-{\bar{z}}I){|_{P_{n_2}{\mathcal {H}}}})\} \end{aligned}$$

and that this can be approximated to any given accuracy in finitely many arithmetic operations and comparisons (see also “Appendix A”). We assume that we approximate from below to an accuracy of $1/n_1$ and call this approximation $\tilde{\gamma }_{n_2,n_1}$. The function $\gamma _{n_2,n_1}(z;A)$ is Lipschitz continuous with Lipschitz constant bounded by 1. Define the set

$$\begin{aligned} V_{n_1}=\bigcup _{m=1}^{n_1}U_{m}, \end{aligned}$$

where $U_{m}$ are the approximations to the open set U. By taking squares of distances to ball centres, we can decide whether a point $z\in {\mathbb {Q}}+i{\mathbb {Q}}$ has $\textrm{dist}(z,V_{n_1})<\eta $ for any given $\eta \in {\mathbb {Q}}_+$. Let $\Upsilon _{n_2,n_1}(A,U)$ be the finite collection of all $z\in {{\tilde{\Gamma }}}_{n_2,n_1}(A)$ with $\textrm{dist}(z,V_{n_1})<1/n_2-1/n_1$. If $\Upsilon _{n_2,n_1}(A,U)$ is empty then set $Q_{n_2,n_1}(A,U)=0$, otherwise set

$$\begin{aligned} Q_{n_2,n_1}(A,U):=\sup _{z\in \Upsilon _{n_2,n_1}(A,U)} {{\tilde{\gamma }}}_{n_2,n_1}(z;A)-\frac{1}{n_1}. \end{aligned}$$

The above remarks show that this can be computed using finitely many arithmetic operations and comparisons.
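The distance test used to build $\Upsilon _{n_2,n_1}(A,U)$ can be carried out exactly in rational arithmetic. The sketch below assumes that each $U_m$ is a finite union of open balls with rational centres and radii (an assumption about the representation, made here for illustration) and avoids square roots by comparing squared quantities. The function name is hypothetical.

```python
from fractions import Fraction

def dist_to_union_less_than(z, balls, eta):
    """Decide dist(z, union of open balls) < eta exactly, without square roots.

    z and the ball centres are pairs (x, y) of Fractions; radii are positive Fractions.
    For eta > 0: dist(z, D(c, r)) < eta  iff  |z - c|^2 < (eta + r)^2.
    """
    if eta <= 0:  # e.g. 1/n2 - 1/n1 <= 0 when n1 <= n2: no point qualifies
        return False
    for (cx, cy), r in balls:
        if (z[0] - cx) ** 2 + (z[1] - cy) ** 2 < (eta + r) ** 2:
            return True
    return False

balls = [((Fraction(0), Fraction(0)), Fraction(2))]  # open ball of radius 2 about 0
z = (Fraction(3), Fraction(4))                       # |z| = 5, so the distance is 3
near = dist_to_union_less_than(z, balls, Fraction(7, 2))
far = dist_to_union_less_than(z, balls, Fraction(3))
```

Only additions, multiplications and comparisons of rationals are used, in line with the arithmetic model in the text.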

Let $W_{n_2}=\overline{W((I-P_{n_2})A|_{(I-P_{n_2}){\mathcal {H}}})}$ and $W_{n_2,n_1}=W((I-P_{n_2})P_{n_1+n_2+1}A|_{P_{n_1+n_2+1}(I-P_{n_2}){\mathcal {H}}})$. We claim that the set $\Upsilon _{n_2,n_1}(A,U)$ converges to

$$\begin{aligned} \Upsilon _{n_2}(A,U):=\overline{\left\{ z\in W_{n_2}:\textrm{dist}(z,{\overline{U}})<\frac{1}{n_2}\right\} }, \end{aligned}$$

as $n_1\rightarrow \infty $, meaning also if $\Upsilon _{n_2}(A,U)$ is empty then $\Upsilon _{n_2,n_1}(A,U)$ is empty for large $n_1$. If $z\in \Upsilon _{n_2,n_1}(A,U)$, then there exists ${\hat{z}}\in W_{n_2,n_1}\subset W_{n_2}$ with $\left| z-{\hat{z}}\right| \le 1/{n_1}$. Since

$$\begin{aligned} \textrm{dist}(z,{\overline{U}})\le \textrm{dist}(z,V_{n_1})<1/n_2-1/n_1, \end{aligned}$$

it follows that $\textrm{dist}({\hat{z}},{\overline{U}})<{1}/{n_2}$ and hence $\Upsilon _{n_2}(A,U)$ is non-empty. So to prove convergence, we only need to deal with the case $\Upsilon _{n_2}(A,U)\ne \emptyset $. The above argument also shows that any limit point of a subsequence $z_{m(j)}\in \Upsilon _{n_2,m(j)}(A,U)$ must lie in $\Upsilon _{n_2}(A,U)$. Hence to prove the claim, we need only prove that for any $z\in \Upsilon _{n_2}(A,U)$, there exist $z_{n_1}$ that are contained in $\Upsilon _{n_2,n_1}(A,U)$ for large $n_1$ and converge to z.

Let $z\in W_{n_2} $ with $\textrm{dist}(z,{\overline{U}})<{1}/{n_2}$, then there exists $\epsilon >0$ and $j>0$ such that $\textrm{dist}(z,U_j)<{1}/{n_2}-\epsilon $. There also exists $z_{n_1}\in {{\tilde{\Gamma }}}_{n_2,n_1}(A)$ with $z_{n_1}\rightarrow z$. It must hold for $n_1>j$ that

$$\begin{aligned} \textrm{dist}({z}_{n_1},V_{n_1})\le \textrm{dist}({z}_{n_1},V_j)&\le \left| z_{n_1}-z\right| +\textrm{dist}(z,U_j)\\&<\left| z_{n_1}-z\right| +\frac{1}{n_2}-\epsilon . \end{aligned}$$

This last quantity is smaller than $1/n_2-1/n_1$ for large $n_1$ and hence $z_{n_1}\in \Upsilon _{n_2,n_1}(A,U)$ for large $n_1$. It follows that for any $z\in \Upsilon _{n_2}(A,U)$, there exist $z_{n_1}$ that are contained in $\Upsilon _{n_2,n_1}(A,U)$ for large $n_1$ and converge to z.

Define

$$\begin{aligned} Q_{n_2}(A,U):=\sup _{z\in \Upsilon _{n_2}(A,U)} \gamma _{n_2}(z;A), \end{aligned}$$

where we recall that $\gamma _{n_2}(z;A)=\min \{\sigma _{\textrm{inf}}((A-zI){|_{P_{n_2}{\mathcal {H}}}}),\sigma _{\textrm{inf}}((A^*-{\bar{z}}I){|_{P_{n_2}{\mathcal {H}}}})\}.$ If $z\in \Upsilon _{n_2,n_1}(A,U)$, then the above shows that there exists ${\hat{z}}\in \Upsilon _{n_2}(A,U)$ with $\left| z-{\hat{z}}\right| \le 1/{n_1}$. It follows that

$$\begin{aligned} {{\tilde{\gamma }}}_{n_2,n_1}(z;A)-\frac{1}{n_1}&\le \gamma _{n_2,n_1}(z;A)-\frac{1}{n_1}\\&\le \gamma _{n_2,n_1}({\hat{z}};A)\le \gamma _{n_2}({\hat{z}};A), \end{aligned}$$

where we have used the bound on the Lipschitz constant and the fact that $\gamma _{n_2,n_1}$ converge up to $\gamma _{n_2}$ (and uniformly on compact subsets of ${\mathbb {C}}$). It follows that $Q_{n_2,n_1}(A,U)\le Q_{n_2}(A,U)$ and this also covers the case that $\Upsilon _{n_2}(A,U)=\emptyset $ if we define the supremum over the empty set to be 0. The set convergence proven above and uniform convergence of ${{\tilde{\gamma }}}_{n_2,n_1}$ implies that $Q_{n_2,n_1}(A,U)$ converges to $Q_{n_2}(A,U)$. It is also clear that the $\Upsilon _{n_2}(A,U)$ are nested and converge down to $W_e(A)\cap {\overline{U}}$ since $W_{n_2}$ converges down to $W_{e}(A)$. The functions $\gamma _{n_2}$ also converge down to

$$\begin{aligned} \gamma (z;A)=\left\| R(z,A)\right\| ^{-1} \end{aligned}$$

uniformly on compact subsets of ${\mathbb {C}}$ and hence $Q_{n_2}(A,U)$ converges down to

$$\begin{aligned} Q(A,U)=\sup _{z\in W_e(A)\cap {\overline{U}}} \left\| R(z,A)\right\| ^{-1}. \end{aligned}$$

Define

$$\begin{aligned} \Gamma _{n_3,n_2,n_1}(A,U)=1-\chi _{[0,1/{n_3}]}(Q_{n_2,n_1}(A,U))\in \{0,1\}. \end{aligned}$$

The above shows that

$$\begin{aligned} \lim _{n_1\rightarrow \infty } \Gamma _{n_3,n_2,n_1}(A,U)=1-\chi _{[0,1/{n_3}]}(Q_{n_2}(A,U))=:\Gamma _{n_3,n_2}(A,U). \end{aligned}$$

Since $\chi _{[0,1/n_3]}$ has right limits and $Q_{n_2}(A,U)$ are non-increasing,

$$\begin{aligned} \lim _{n_2\rightarrow \infty } \Gamma _{n_3,n_2}(A,U)=1-\chi _{[0,1/{n_3}]}(Q(A,U)\pm )=:\Gamma _{n_3}(A,U), \end{aligned}$$

where ± denotes one of the right or left limits (it is possible to have either). Now if $\Xi _{poll}^{{\mathbb {C}}}(A,U)=0$, then $\Gamma _{n_3}(A,U)=0$ for all $n_3$. But if $\Xi _{poll}^{{\mathbb {C}}}(A,U)=1$, then for large $n_3$, $\Gamma _{n_3}(A,U)=1$. Moreover, in this latter case, $\Gamma _{n_3}(A,U)=1$ signifies the existence of $z\in W_e(A)\cap {\overline{U}}$ with $\gamma (z;A)>0$ and hence $z\not \in \textrm{Sp}(A)$. Hence, $\{\Gamma _{n_3,n_2,n_1}\}$ provides a $\Sigma _3^A$ tower.

Step 2 $\{\Xi _{poll}^{{\mathbb {R}}},\Omega _{\textrm{D}},\Lambda _2\}\not \in \Delta ^G_3$. We will argue for the case that $U=U_1={\mathbb {R}}$ and the restricted case is similar. Assume for a contradiction that this is false and that $\{{\widehat{\Gamma }}_{n_2,n_1}\}$ is a general height two tower for $\{\Xi _{poll}^{{\mathbb {R}}},\Omega _{\textrm{D}},\Lambda _2\}$. We follow the same strategy as step 4 of the proof of Theorem 3.3 (recall also the results of Sect. 5). Let $({\mathcal {M}},d)$ be the discrete space $\{0,1\}$ and ${{\tilde{\Omega }}}$ denote the collection of all infinite matrices $\{a_{i,j}\}_{i,j\in {\mathbb {N}}}$ with entries $a_{i,j}\in \{0,1\}$ and consider the problem function

$$\begin{aligned} {{\tilde{\Xi }}}_1(\{a_{i,j}\}):\text { Does }\{a_{i,j}\}\text { have a column containing infinitely many nonzero entries?} \end{aligned}$$

For $j\in {\mathbb {N}}$, let $\{b_{i,j}\}_{i\in {\mathbb {N}}}$ be a dense subset of $I_j:=[1-1/2^{2j-1},1-1/{2^{2j}}]$. Given a matrix $\{a_{i,j}\}_{i,j\in {\mathbb {N}}}\in {{\tilde{\Omega }}}$, construct a matrix $\{c_{i,j}\}_{i,j\in {\mathbb {N}}}$ by letting $c_{i,j}=a_{i,j}b_{r(i,j),j}$ where

$$\begin{aligned} r(i,j)=\max \left\{ 1,\sum _{k=1}^{i}a_{k,j}\right\} . \end{aligned}$$

Now consider any bijection $\phi :{\mathbb {N}}\rightarrow {\mathbb {N}}^2$ and define the diagonal operator

$$\begin{aligned} A=\textrm{diag}(c_{\phi (1)},c_{\phi (2)},c_{\phi (3)},\ldots ). \end{aligned}$$

The algorithm ${\widehat{\Gamma }}_{n_2,n_1}$ thus translates to an algorithm $\Gamma '_{n_2,n_1}$ for $\{{{\tilde{\Xi }}}_1,{{\tilde{\Omega }}}\}$. Namely, set $\Gamma '_{n_2,n_1}(\{a_{i,j}\}_{i\in {\mathbb {N}}})={\widehat{\Gamma }}_{n_2,n_1}(A)$. The fact that $\phi $ is a bijection shows that the lowest level $\Gamma '_{n_2,n_1}$ are generalised algorithms (and are consistent). In particular, given N, we can find $\{A_{i,j}:i,j\le N\}$ using finitely many evaluations of the matrix values $\{c_{k,l}\}$ (the same is true for $A^*A$ and $AA^*$ since the operator is diagonal). But for any given $c_{k,l}$ we can evaluate this entry using only finitely many evaluations of the matrix values $\{a_{m,n}\}$ by the construction of r. Finally, note that

$$\begin{aligned} \textrm{Sp}(A)=\{1\}\cup \left( \bigcup _{j:\{a_{i,j}\}_{i\in {\mathbb {N}}}\text { has infinitely many 1s}}I_j\right) \cup Q, \end{aligned}$$

where Q lies in the discrete spectrum. The intervals $I_j$ are also separated. It follows that there is a gap in the essential spectrum if and only if there exists a column $\{a_{i,j}\}_{i\in {\mathbb {N}}}$ with infinitely many 1s. Otherwise the essential spectrum is $\{1\}$. It follows that ${{\tilde{\Xi }}}_1(\{a_{i,j}\})=\Xi _{poll}^{{\mathbb {R}}}(A,{\mathbb {R}})$, and hence, we get a contradiction. $\square $
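The reduction in Step 2 only ever needs finitely many entries of $\{a_{i,j}\}$ to produce a given diagonal entry of $A$. The sketch below makes this concrete under illustrative choices that are not taken from the paper: the dense sequence in $I_j$ is a rescaled base-2 van der Corput sequence, and the bijection $\phi $ enumerates anti-diagonals. All function names are hypothetical.

```python
from fractions import Fraction

def van_der_corput(i):
    """i-th term (i >= 1) of the base-2 van der Corput sequence, dense in [0, 1]."""
    value, denom = Fraction(0), 1
    while i:
        denom *= 2
        value += Fraction(i % 2, denom)
        i //= 2
    return value

def b(i, j):
    """Illustrative dense sequence in I_j = [1 - 2^(1-2j), 1 - 2^(-2j)]."""
    left = 1 - Fraction(1, 2 ** (2 * j - 1))
    width = Fraction(1, 2 ** (2 * j))  # length of I_j
    return left + width * van_der_corput(i)

def c(a, i, j):
    """c_{i,j} = a_{i,j} * b_{r(i,j),j} with r(i,j) = max(1, a_{1,j} + ... + a_{i,j})."""
    r = max(1, sum(a(k, j) for k in range(1, i + 1)))
    return a(i, j) * b(r, j)

def phi(n):
    """A bijection N -> N^2 (1-based) that enumerates anti-diagonals i + j = s."""
    s = 2
    while n > s - 1:
        n -= s - 1
        s += 1
    return n, s - n

def first_column_only(i, j):
    """Test input: column 1 consists of ones, every other column of zeros."""
    return 1 if j == 1 else 0

diag = [c(first_column_only, *phi(n)) for n in range(1, 7)]
```

Each call to `c` evaluates `a` at most `i` times in column `j`, which is the finiteness property needed for $\Gamma '_{n_2,n_1}$ to be a generalised algorithm.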

7.1 Essential Numerical Range for Unbounded Operators

The essential numerical range (see (3.1)) was first introduced for a bounded operator A in [152], as the closure of the numerical range of the image of A in the Calkin algebra:

$$\begin{aligned} W_e(A)=\bigcap _{B\in \Omega _K}\overline{W(A+B)}. \end{aligned}$$

Other equivalent characterisations were then given in [82]. The unbounded case is significantly different from the bounded case, and definitions that are equivalent in the bounded case may yield very different sets in the unbounded case. The definition for unbounded operators appeared in [34] and required the development of several new ideas and tools. In this section, we let $\Omega _{{\mathcal {C}}}$ denote the set of closed operators T with domain ${\mathcal {D}}(T)\subset l^2({\mathbb {N}})$ such that the linear span of the canonical basis forms a core of T. This latter condition ensures that we can use the usual matrix representation of the operator T and hence the evaluation functions $\Lambda _1$. We follow [34] and define

$$\begin{aligned} W_e(T)=\left\{ \lambda \in {\mathbb {C}}:\exists \{x_n\}_{n\in {\mathbb {N}}}\subset {\mathcal {D}}(T),\Vert x_n\Vert =1,x_n\xrightarrow {w}0,\lim _{n\rightarrow \infty }\langle Tx_n,x_n\rangle =\lambda \right\} . \end{aligned}\tag{7.1}$$

In [34], it was shown that for any $T\in \Omega _{{\mathcal {C}}}$, $W_e(T)$ consists precisely of the essential spectrum of T together with all possible spectral pollution that may arise by applying projection methods to find the spectrum of T numerically. This result therefore generalises Theorems 3.8 and 3.9. The set $W_e(T)$ is closed and convex, but, unlike the case when T is bounded, $W_e(T)$ may be empty. We first need two simple lemmas.

Lemma 7.3

Let $T\in \Omega _{{\mathcal {C}}}$, then $W(P_nT|_{P_n{\mathcal {H}}})\uparrow \overline{W(T)}$ in the Attouch–Wets topology as $n\rightarrow \infty $.

Proof

It is clear that

$$\begin{aligned} W(P_nT|_{P_n{\mathcal {H}}})\subset W(T):=\{\langle Tx,x\rangle :x\in {\mathcal {D}}(T),\Vert x\Vert =1\}, \end{aligned}$$

and that the sets $W(P_nT|_{P_n{\mathcal {H}}})$ are increasing with n. Now let $\lambda \in \overline{W(T)}$ be arbitrary. It is enough to show that there exists $\lambda _n\in W(P_nT|_{P_n{\mathcal {H}}})$ such that $\lambda _n\rightarrow \lambda $ as $n\rightarrow \infty $. By assumption, there exists $x_n\in {\mathcal {D}}(T)$ such that $\Vert x_n\Vert =1$ and $\lim _{n\rightarrow \infty }\langle Tx_n,x_n\rangle =\lambda $. Since the linear span of the canonical basis forms a core of T, we can assume without loss of generality that each $x_n$ has finite support with respect to the canonical basis. By taking subsequences if necessary, we may assume that $P_nx_n=x_n$ and hence $\langle Tx_n,x_n\rangle \in W(P_nT|_{P_n{\mathcal {H}}})$. The result now follows. $\square $

Lemma 7.4

Let $T\in \Omega _{{\mathcal {C}}}$. If $W_e(T)\ne \emptyset $, then $\overline{W((I-P_n)T|_{(I-P_n){\mathcal {H}}})}\downarrow W_{e}(T)$ in the Attouch–Wets topology as $n\rightarrow \infty $. If $W_e(T)=\emptyset $, then for any compact set K, $K\cap \overline{W((I-P_n)T|_{(I-P_n){\mathcal {H}}})}=\emptyset $ for large n.

Proof

We clearly have that $\overline{W((I-P_n)T|_{(I-P_n){\mathcal {H}}})}$ are non-empty and decreasing in n. It is enough to show the following two results:

(1) If $\lambda \in W_e(T)$, then $\lambda \in \overline{W((I-P_n)T|_{(I-P_n){\mathcal {H}}})}$ for all n.
(2) If $\lambda \notin W_e(T)$, then $\liminf _{n\rightarrow \infty } \textrm{dist}(\lambda ,\overline{W((I-P_n)T|_{(I-P_n){\mathcal {H}}})})>0$.

We first prove (1), so assume that $\lambda \in W_e(T)$. Then, since the linear span of the canonical basis forms a core of T, we can assume that there exists $x_n$ with $\Vert x_n\Vert =1$ such that each $x_n$ has finite support with respect to the canonical basis, $x_n\xrightarrow {w}0$ and $\lim _{n\rightarrow \infty }\langle Tx_n,x_n\rangle =\lambda $. It follows that for any fixed m, $\lim _{n\rightarrow \infty }P_mx_n=0$ and hence $\lambda \in \overline{W((I-P_m)T|_{(I-P_m){\mathcal {H}}})}$.

Finally, to see (2), suppose that this were false for some $\lambda \notin W_e(T)$. We may then choose $\lambda _n\in \overline{W((I-P_n)T|_{(I-P_n){\mathcal {H}}})}$ such that $\liminf _{n\rightarrow \infty } |\lambda -\lambda _n|=0$. By taking subsequences if necessary, we may assume that $\lambda _n\rightarrow \lambda $ and that there exists $x_n$ with $\Vert x_n\Vert =1$, $P_{n}x_n=0$ and $|\langle Tx_n,x_n\rangle -\lambda _n|\rightarrow 0$. But this implies that $x_n\xrightarrow {w}0$ and $\lim _{n\rightarrow \infty }\langle Tx_n,x_n\rangle =\lambda $. Therefore, $\lambda \in W_e(T)$, the required contradiction. $\square $

We have the following corollary, which shows that the SCI classification of computing $W_e(T)$ for $T\in \Omega _{{\mathcal {C}}}$ remains $\Pi _2^A$ (one can make this precise by adding the empty set to the Attouch–Wets topology, but we omit the details).

Corollary 7.5

There exists a height two tower of arithmetic algorithms $\{\Gamma _{n_2,n_1}\}$, using $\Lambda _1$ (the matrix values with respect to the canonical basis) and $\Delta _1$-information (see Definition 5.11), such that for any $T\in \Omega _{{\mathcal {C}}}$, the following hold with respect to the Attouch–Wets topology:

$\Gamma _{n_2,n_1}(T)\uparrow \Gamma _{n_2}(T)\subset \overline{W(T)}$ as $n_1\rightarrow \infty $.
If $W_e(T)\ne \emptyset $, then $\Gamma _{n_2}(T)\downarrow W_{e}(T)$ as $n_2\rightarrow \infty $. If $W_e(T)=\emptyset $, then for any compact set K, $K\cap \Gamma _{n_2}(T)=\emptyset $ for large $n_2$.

Proof

We simply let $\Gamma _{n_2,n_1}(T)$ be an approximation of

$$\begin{aligned} W\left( (I-P_{n_2})P_{n_1+n_2+1}T|_{P_{n_1+n_2+1}(I-P_{n_2}){\mathcal {H}}}\right) \end{aligned}$$

that can be computed in finitely many arithmetic operations and comparisons, even when using inexact input (see Definition 5.11 and Remark 5.12), using the arguments in Sect. 7. The results now follow from Lemmas 7.3 and 7.4. $\square $
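The empty case in Corollary 7.5 can be seen on a simple unbounded example. The sketch below takes the hypothetical operator $T=\textrm{diag}(1,2,3,\ldots )$, for which $W_e(T)=\emptyset $, and uses the fact that the numerical range of a real diagonal block is the interval spanned by its diagonal entries. The function names are illustrative.

```python
def tail_hull(diagonal, n2, n1):
    """Endpoints of the numerical range of the block on e_{n2+1}, ..., e_{n1+n2+1}
    for a real diagonal operator with entries diagonal(1), diagonal(2), ..."""
    d = [diagonal(k) for k in range(n2 + 1, n1 + n2 + 2)]
    return min(d), max(d)

def misses_compact_interval(diagonal, n2, n1, radius):
    """True if the tail numerical range does not meet K = [-radius, radius]."""
    lo, hi = tail_hull(diagonal, n2, n1)
    return lo > radius or hi < -radius

def diagonal_k(k):
    """Hypothetical unbounded test operator T = diag(1, 2, 3, ...)."""
    return k

hull = tail_hull(diagonal_k, n2=10, n1=4)                          # block on e_11..e_15
escaped = misses_compact_interval(diagonal_k, 10, 4, radius=10)    # n2 large enough
not_yet = misses_compact_interval(diagonal_k, 5, 4, radius=10)     # n2 too small
```

For this operator the lower endpoint is $n_2+1$ for every $n_1$, so any fixed compact set is eventually missed as $n_2\rightarrow \infty $.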

8 Proofs Concerning Lebesgue Measure

We use the function $\texttt {DistSpec}$ in “Appendix A”. For ease of notation, we suppress the dispersion function f in calling $\texttt {DistSpec}$, but assume that we know $\{c_n\}$ with $D_{f,n}(A)\le c_n$ and $c_n\rightarrow 0$ as $n\rightarrow \infty $. However, the proof of convergence also works when using $c_n=0$ (which does not necessarily bound $D_{f,n}(A)$). The key observation is the following:

Observation: If $A\in \Omega _f$, then the function $F_n(z):=\texttt {DistSpec}(A,n,z,f(n))+c_n$ converges uniformly to $\left\| R(z,A)\right\| ^{-1}$ from above on compact subsets of ${\mathbb {C}}$. By taking successive minima, we can assume without loss of generality that $F_n$ is non-increasing in n.
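The "successive minima" step in the Observation is a running minimum. A minimal sketch, with made-up sample values standing in for $F_1(z),F_2(z),\ldots $ at a fixed $z$, is shown below. Because every $F_m(z)$ bounds $\left\| R(z,A)\right\| ^{-1}$ from above and the running minimum is at most $F_n(z)$, the modified sequence still converges from above.

```python
from itertools import accumulate

def make_nonincreasing(values):
    """Replace F_1, F_2, ... by the running minima min(F_1, ..., F_n)."""
    return list(accumulate(values, min))

# Illustrative values only; they are not outputs of DistSpec.
monotone = make_nonincreasing([0.9, 0.4, 0.6, 0.35, 0.5])
```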

The other ingredient needed is the following proposition:

Proposition 8.1

Given a finite union of discs in the complex plane, the Lebesgue measure of their intersection with the interior of a rectangle can be computed to arbitrary precision, using finitely many arithmetic operations and comparisons on the centres and radii of the discs, as well as the position of the rectangle.

Proof

Without loss of generality, we assume that the rectangle is $\{x+iy:x,y\in [0,1]\}$. Consider dividing the rectangle into $n^2$ subrectangles using the division of [0, 1] into n equal intervals. Given such a subrectangle, we can easily test via a finite number of arithmetic operations and comparisons whether its centre is in the union of the discs. Let r(n) denote the number of subrectangles whose centre lies in the union. Then, since the boundary of the union of the discs has measure zero, it is easy to see that $r(n)/n^2$ converges to the desired Lebesgue measure. Moreover, we can bound the number of subrectangles that intersect the boundary of any of the discs, and this can be used to obtain any desired precision. $\square $
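The counting argument in this proof translates directly into code. The sketch below is a floating-point illustration (the proof works with exact arithmetic and an explicit error bound, which is omitted here); the function name and the test disc are hypothetical.

```python
import math

def area_in_unit_square(discs, n):
    """Estimate Leb(union of discs intersected with [0,1]^2) by testing the
    centres of the n^2 subsquares obtained from n equal subdivisions."""
    hits = 0
    for i in range(n):
        x = (i + 0.5) / n
        for j in range(n):
            y = (j + 0.5) / n
            # Compare squared distances, so no square roots are needed.
            if any((x - cx) ** 2 + (y - cy) ** 2 <= r * r for cx, cy, r in discs):
                hits += 1
    return hits / (n * n)

# One disc of radius 1/4 centred in the square: exact area is pi/16.
estimate = area_in_unit_square([(0.5, 0.5, 0.25)], n=200)
exact = math.pi / 16
```

The error is controlled by the number of subsquares that meet a disc boundary, which grows like $n$ while the area of each subsquare is $1/n^2$.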

Proof of Theorem 3.13

Step 1 $\{\Xi _1^L,\Omega _f,\Lambda _i\},\{\Xi _1^L,\Omega _\textrm{D},\Lambda _i\}\in \Pi ^A_2$. It is enough to consider $\Lambda _1$. We will estimate $\textrm{Leb}(\textrm{Sp}(A))$ by estimating the Lebesgue measure of the resolvent set on the closed square $[-C,C]^2$, where $\left\| A\right\| \le C$. We do not assume C is known. For $n_1,n_2\in {\mathbb {N}}$, let

$$\begin{aligned} \texttt {Grid}(n_1,n_2)=\left( \frac{1}{2^{n_2}}{\mathbb {Z}}+\frac{1}{2^{n_2}}i{\mathbb {Z}}\right) \cap [-n_1,n_1]^2. \end{aligned}$$

Letting $B(x,r)$ and $D(x,r)$ denote the closed and open balls of radius $r$ around $x$, respectively (see Footnote 18), in ${\mathbb {C}}$ (or ${\mathbb {R}}$ where appropriate), we define

$$\begin{aligned} U(n_1,n_2,A)=[-n_1,n_1]\times [-n_1,n_1]\cap (\cup _{z\in \texttt {Grid}(n_1,n_2)}B(z,F_{n_1}(z))). \end{aligned}$$

Note that $\textrm{Leb}(U(n_1,n_2,A))$ can be computed up to arbitrary predetermined precision using only arithmetic operations and comparisons by Proposition 8.1. Using this, we can define

$$\begin{aligned} \Gamma _{n_2,n_1}(A)=4n_1^2-\textrm{Leb}(U(n_1,n_2,A)) \end{aligned}$$

where, without loss of generality, we assume that we have computed the exact value of the Lebesgue measure (since we can absorb this error in the first limit). $\Gamma _{n_2,n_1}$ are arithmetical algorithms using the fact that $\texttt {DistSpec}$ is and the above proposition. The only non-trivial part is convergence. The algorithm is summarised in the routine LebSpec in § B.3.

We now show that the algorithm LebSpec converges and realises the $\Pi _2^A$ classification. There exists a compact set K such that $\left\| R(z,A)\right\| ^{-1}> 1$ on $K^c$ and without loss of generality we can make C larger, $C\in {\mathbb {N}}$ and take $K=[-C,C]^2$. For $n_1\ge C$

$$\begin{aligned} U(n_1,n_2,A)=([-C,C]^2\cap (\cup _{z\in \texttt {Grid}(n_1,n_2)}B(z,F_{n_1}(z))))\cup ([-n_1,n_1]^2\backslash [-C,C]^2), \end{aligned}$$

since $F_{n}(z)\ge \left\| R(z,A)\right\| ^{-1}$. It follows that for large $n_1$

$$\begin{aligned} \Gamma _{n_2,n_1}(A)=4C^2-\textrm{Leb}([-C,C]^2\cap (\cup _{z\in \texttt {Grid}(n_1,n_2)}B(z,F_{n_1}(z)))). \end{aligned}$$

As $n_1\rightarrow \infty $, $[-C,C]^2\cap (\cup _{z\in \texttt {Grid}(n_1,n_2)}B(z,F_{n_1}(z)))$ converges to the closed set

$$\begin{aligned} K(n_2,A)=[-C,C]^2\cap (\cup _{z\in \texttt {Grid}(C,n_2)}B(z,\left\| R(z,A)\right\| ^{-1})) \end{aligned}$$

from above and hence

$$\begin{aligned} \lim _{n_1\rightarrow \infty }\Gamma _{n_2,n_1}(A)=4C^2-\textrm{Leb}(K(n_2,A)), \end{aligned}$$

from below. Consider the relatively open set

$$\begin{aligned} V(n_2,A)=[-C,C]^2\cap (\cup _{z\in \texttt {Grid}(C,n_2)}D(z,\left\| R(z,A)\right\| ^{-1})). \end{aligned}$$

Clearly, $\textrm{Leb}(K(n_2,A))=\textrm{Leb}(V(n_2,A))$ since the sets differ by a finite collection of circular arcs or points (recall we defined the open ball of radius zero to be the empty set). Hence, we must show that

$$\begin{aligned} \lim _{n_2\rightarrow \infty }\textrm{Leb}(V(n_2,A))=\textrm{Leb}(\rho _{C}(A)), \end{aligned}$$

where $\rho _{C}(A)=[-C,C]^2\backslash \textrm{Sp}(A)$. For $z\in \rho _C(A)$,

$$\begin{aligned} \textrm{dist}(z,\textrm{Sp}(A))\ge \left\| R(z,A)\right\| ^{-1} \end{aligned}$$

and hence we get $V(n_2,A)\subset \rho _{C}(A)$. Since $\rho _C(A)$ is relatively open, a simple density argument using the continuity of $\left\| R(z,A)\right\| ^{-1}$ yields $V(n_2,A)\uparrow \rho _{C}(A)$ as $n_2\rightarrow \infty $ since the grid refines itself. So we get

$$\begin{aligned} \textrm{Leb}(V(n_2,A))\uparrow \textrm{Leb}(\rho _{C}(A)). \end{aligned}$$

This proves the convergence and also shows that $\Gamma _{n_2}(A)\downarrow \Xi _1^L(A)$, thus yielding the $\Pi _2^A$ classification. The same argument works in the one-dimensional case when considering self-adjoint operators $\Omega _{\textrm{D}}$ and $\textrm{Leb}_{{\mathbb {R}}}$. We simply restrict everything to the real line and consider the interval $[-C,C]$ rather than a square.
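The one-dimensional version of this construction can be sketched for a diagonal self-adjoint operator. The code below is not the routine LebSpec and does not call $\texttt {DistSpec}$: as a stand-in for $F_{n_1}$ it uses the distance to the first $2^{n_1}-1$ diagonal entries, which for a diagonal operator bounds $\left\| R(z,A)\right\| ^{-1}=\textrm{dist}(z,\textrm{Sp}(A))$ from above and is non-increasing in $n_1$. The test operator has diagonal given by the base-2 van der Corput sequence, so $\textrm{Sp}(A)=[0,1]$ and $\textrm{Leb}_{{\mathbb {R}}}(\textrm{Sp}(A))=1$. All names are hypothetical.

```python
from fractions import Fraction

def van_der_corput(i):
    """i-th term (i >= 1) of the base-2 van der Corput sequence, dense in [0, 1]."""
    value, denom = Fraction(0), 1
    while i:
        denom *= 2
        value += Fraction(i % 2, denom)
        i //= 2
    return value

def F(n1, z):
    """Stand-in for F_{n1}(z): distance from z to the first 2^n1 - 1 diagonal entries."""
    return min(abs(z - van_der_corput(k)) for k in range(1, 2 ** n1))

def measure_of_union(intervals, lo, hi):
    """Lebesgue measure of (union of closed intervals) within [lo, hi], by sort and sweep."""
    total, reach = Fraction(0), lo
    for a, b in sorted(intervals):
        a, b = max(a, reach), min(b, hi)
        if b > a:  # count only the part not already covered
            total += b - a
            reach = b
    return total

def gamma(n2, n1):
    """1D analogue of Gamma_{n2,n1}(A) = 2*n1 - Leb(U(n1, n2, A))."""
    h = Fraction(1, 2 ** n2)
    grid = [j * h for j in range(-n1 * 2 ** n2, n1 * 2 ** n2 + 1)]
    balls = [(z - F(n1, z), z + F(n1, z)) for z in grid]
    return 2 * n1 - measure_of_union(balls, Fraction(-n1), Fraction(n1))

value = gamma(3, 6)
```

In this example the balls around grid points outside $[0,1]$ reach slightly into the spectrum, so the output approaches $1$ from below as $n_1$ grows, consistent with the direction of the first limit in the proof.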

Step 2 $\{\Xi _1^L,\Omega _f,\Lambda _i\},\{\Xi _1^L,\Omega _\textrm{D},\Lambda _i\}\notin \Delta ^G_2$. It is enough to consider $\Lambda _2$. We will only show that $\textrm{SCI}(\Xi _1^L,\Omega _\textrm{D},\Lambda _2)_{G} \ge 2$ for which we use $\textrm{Leb}_{{\mathbb {R}}}$ and the two-dimensional case is similar. Suppose for a contradiction that there exists a height one tower $\{\Gamma _n\}$, then $\Lambda _{\Gamma _n}(A)$ is finite for each $A\in \Omega _\textrm{D}$. Hence, for every A and n there exists a finite number $N(A,n)\in {\mathbb {N}}$ such that the evaluations from $\Lambda _{\Gamma _n}(A)$ only take the matrix entries $A_{ij} = \left\langle Ae_j, e_i\right\rangle $ with $i,j\le N(A,n)$ into account.

Pick any sequence $a_1,a_2,\ldots $ that is dense in the unit interval [0, 1]. Consider the matrix operators $A_m=\textrm{diag}\{a_1,a_2,\ldots ,a_m\}\in {\mathbb {C}}^{m\times m}$, $B_m=\textrm{diag}\{0,0,\ldots ,0\}\in {\mathbb {C}}^{m\times m}$ and $C=\textrm{diag}\{0,0,\ldots \}$. Set $A=\bigoplus _{m=1}^{\infty }(B_{k_m}\oplus A_{k_m})$, where we choose an increasing sequence $k_m$ inductively as follows. Set $k_1=1$ and suppose that $k_1,\ldots ,k_m$ have been chosen. Then $\textrm{Sp}(B_{k_1}\oplus A_{k_1}\oplus \cdots \oplus B_{k_m}\oplus A_{k_m}\oplus C)=\{0,a_1,a_2,\ldots ,a_{k_m}\}$ and hence $\textrm{Leb}(\textrm{Sp}(B_{k_1}\oplus A_{k_1}\oplus \cdots \oplus B_{k_m}\oplus A_{k_m}\oplus C))=0$, so there exists some $n_m\ge m$ such that if $n\ge n_m$ then

$$\begin{aligned} \Gamma _n(B_{k_1}\oplus A_{k_1}\oplus \cdots \oplus B_{k_m}\oplus A_{k_m}\oplus C)\le \frac{1}{2}. \end{aligned}$$

Now let $k_{m+1}\ge \max \{N(B_{k_1}\oplus A_{k_1}\oplus \cdots \oplus B_{k_m}\oplus A_{k_m}\oplus C,n_m),k_m+1\}$. Any evaluation function $f_{i,j}\in \Lambda $ is simply the $(i,j)^{\textrm{th}}$ matrix entry and hence by construction

$$\begin{aligned} f_{i,j}(B_{k_1}\oplus A_{k_1}\oplus \cdots \oplus B_{k_m}\oplus A_{k_m}\oplus C)=f_{i,j}(A), \end{aligned}$$

for all $f_{i,j}\in \Lambda _{\Gamma _{n_m}}(B_{k_1}\oplus A_{k_1}\oplus \cdots \oplus B_{k_m}\oplus A_{k_m}\oplus C)$. By assumption (iii) in Definition 5.1, it follows that $\Lambda _{\Gamma _{n_m}}(B_{k_1}\oplus A_{k_1}\oplus \cdots \oplus B_{k_m}\oplus A_{k_m}\oplus C)=\Lambda _{\Gamma _{n_m}}(A)$ and hence by assumption (ii) in the same definition that $\Gamma _{n_m}(A)=\Gamma _{n_m}(B_{k_1}\oplus A_{k_1}\oplus \cdots \oplus B_{k_m}\oplus A_{k_m}\oplus C)\le 1/2$. But $\lim _{n\rightarrow \infty }(\Gamma _n(A))=\textrm{Leb}(\overline{\{0,a_1,a_2,\ldots \}})=1$, a contradiction.

Step 3 $\{\Xi _1^L,\Omega ,\Lambda _1\}\in \Pi ^A_3$ for $\Omega =\Omega _{\textrm{B}},\Omega _\textrm{SA}$, $\Omega _\textrm{N}$ or $\Omega _g$. We will deal with the case of $\Omega _{\textrm{B}}$. The cases of $\Omega _\textrm{N}$ and $\Omega _g$ then follow via $\Omega _{\textrm{N}}\subset \Omega _{g}\subset \Omega _{\textrm{B}}$ and the one-dimensional Lebesgue measure case for $\Omega _\textrm{SA}$ is similar. A careful analysis of the proof in step 1 yields that

$\Gamma _{n_2,n_1}(A)$ converges to $\Gamma _{n_2}(A)$ from below as $n_1\rightarrow \infty $.
$\Gamma _{n_2}(A)$ converges to $\textrm{Leb}(\textrm{Sp}(A))$ monotonically from above as $n_2\rightarrow \infty $.

We can ensure that the first limit converges from below by always slightly overestimating the Lebesgue measure of $U(n_1,n_2)$ (with error converging to zero) and using Proposition 8.1. These observations will be used later to answer question 3. We do not need to know $c_n$ for the above proof to work, but we will need it for the first of the above facts. A slight alteration of the proof/algorithm by inserting an additional successive limit deals with the general case.

Define the function

$$\begin{aligned} \gamma _{n,m}(z;A)=\min \{\sigma _{\textrm{inf}}(P_m(A-zI){|_{P_n{\mathcal {H}}}}),\sigma _{\textrm{inf}}(P_m(A^*-{\bar{z}}I){|_{P_n{\mathcal {H}}}})\}, \end{aligned}$$

where $\sigma _{\textrm{inf}}$ denotes the injection modulus/smallest singular value (see also “Appendix A”). One can show that $\gamma _{n,m}$ converges uniformly on compact subsets to

$$\begin{aligned} \gamma _{n}(z;A)=\min \{\sigma _{\textrm{inf}}((A-zI){|_{P_n{\mathcal {H}}}}),\sigma _{\textrm{inf}}((A^*-{\bar{z}}I){|_{P_n{\mathcal {H}}}})\}, \end{aligned}$$

as $m\rightarrow \infty $ and that this converges uniformly down to $\left\| R(z,A)\right\| ^{-1}$ on compact subsets as $n\rightarrow \infty $ [91]. With a slight abuse of notation, we can approximate $\gamma _{n,m}(z;A)$ to within 1/m by $\texttt {DistSpec}(A,n,z,m)$ (where the spacing of the search routine is 1/m, see also “Appendix A”) so that this converges uniformly on compact subsets to $\gamma _{n}(z;A)$. In exactly the same manner as before, define

$$\begin{aligned} U(n_1,n_2,n_3,A)&=[-n_2,n_2]^2\cap (\cup _{z\in \texttt {Grid}(n_2,n_3)}B(z,\gamma _{n_2,n_1}(z;A))),\\ \Gamma _{n_3,n_2,n_1}(A)&=(2n_2)^2-\textrm{Leb}(U(n_1,n_2,n_3,A)). \end{aligned}$$

The stated uniform convergence means that the argument in step 1 carries through and we have a height three tower, realising the $\Pi ^A_3$ classification.
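To make the function $\gamma _{n,m}$ concrete, here is a minimal sketch that evaluates it from matrix entries with NumPy. It computes the smallest singular values exactly via an SVD rather than through the search routine `DistSpec`, and the helper names `truncation`, `gamma_nm` and the toy operator `free_jacobi` are assumptions made for this illustration.

```python
import numpy as np


def truncation(entry, m, n):
    """Matrix of P_m A restricted to P_n H, from entries entry(i, j) = <A e_j, e_i> (0-based)."""
    return np.array([[entry(i, j) for j in range(n)] for i in range(m)], dtype=complex)


def gamma_nm(entry, n, m, z):
    """min of the smallest singular values of P_m(A - z)|P_n H and P_m(A^* - conj(z))|P_n H."""
    if m < n:
        raise ValueError("need m >= n so that the truncations are tall matrices")
    ident = np.eye(m, n)  # matrix of P_m I restricted to P_n H
    a = truncation(entry, m, n)
    a_star = truncation(lambda i, j: np.conj(entry(j, i)), m, n)
    s1 = np.linalg.svd(a - z * ident, compute_uv=False).min()
    s2 = np.linalg.svd(a_star - np.conj(z) * ident, compute_uv=False).min()
    return float(min(s1, s2))


def free_jacobi(i, j):
    """Toy operator (assumed): off-diagonals 1, diagonal 0, acting on l^2(N), spectrum [-2, 2]."""
    return 1.0 if abs(i - j) == 1 else 0.0


# For this tridiagonal example, taking m = n + 1 already captures (A - z) on P_n H exactly.
outside = gamma_nm(free_jacobi, 20, 21, 3.0)    # z = 3 has distance 1 to the spectrum
inside = gamma_nm(free_jacobi, 200, 201, 0.0)   # z = 0 lies in the spectrum
```

In this toy case `outside` stays at or above the distance from $z=3$ to the spectrum, while `inside` becomes small as $n$ grows, in line with the convergence of $\gamma _n$ down to $\left\| R(z,A)\right\| ^{-1}$.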

Step 4 $\{\Xi _1^L,\Omega _\textrm{SA},\Lambda _1\}\notin \Delta ^G_3$. The proof is exactly the same argument as the proof of step 3 of Theorem 3.7. However, to gain the contradiction in this case, we define ${\tilde{\Gamma }}_{n_2,n_1}(\{a_{i,j}\})=\min \{\max \{1-\Gamma _{n_2,n_1}(A)/2,0\},1\}$ where $\{\Gamma _{n_2,n_1}\}$ is the supposed height two tower for $\{\Xi _1^L,\Omega _\textrm{SA},\Lambda _1\}$.

Step 5 $\{\Xi _1^L,\Omega ,\Lambda _1\}\notin \Delta ^G_3$ for $\Omega =\Omega _{\textrm{B}},\Omega _{\textrm{N}}$, or $\Omega _g$. Since $\Omega _\textrm{N}\subset \Omega _g\subset \Omega _{\textrm{B}}$, we only need to deal with $\Omega _\textrm{N}$. We can use a similar argument as in step 4, but now replacing each $C^{(j)}$ by

$$\begin{aligned} D^{(j)}=\bigoplus _{k=1}^j ih_k C^{(j)}, \end{aligned}$$

where $h_1,h_2,\ldots $ is a dense sequence in [0, 1], and these operators act on $X_j=\bigoplus _{k=1}^j l^2({\mathbb {N}})$. This ensures that the spectrum of the operator yields a positive two-dimensional Lebesgue measure if and only if ${{\tilde{\Xi }}}_2(\{a_{i,j}\})=0$. The rest of the argument is entirely analogous.

Step 6 $\Delta ^G_2 \not \ni \{\Xi _1^L,\Omega ,\Lambda _2\}\in \Pi ^A_2$ for $\Omega =\Omega _{\textrm{B}},\Omega _\textrm{SA}$, $\Omega _\textrm{N}$ or $\Omega _g$. The impossibility result follows by considering diagonal operators. For the existence of $\Pi ^A_2$ algorithms, we can use the construction in step 3, but the knowledge of matrix values of $A^*A$ allows us to skip the first limit and approximate $\gamma _n$ directly. $\square $

Proof of Theorem 3.14

Using the convergence

$$\begin{aligned} \lim _{\epsilon \downarrow 0}\textrm{Leb}(\widehat{\textrm{Sp}}_{\epsilon }(A))=\textrm{Leb}(\textrm{Sp}(A)), \end{aligned}$$

the lower bounds in Theorem 3.13 immediately imply the lower bounds in Theorem 3.14. Hence, we only need to construct the appropriate algorithms.

Step 1 $\{\Xi _2^L,\Omega _f,\Lambda _1\},\{\Xi _2^L,\Omega _{\textrm{D}},\Lambda _1\}\in \Sigma _1^A$. Let $A\in \Omega _f$ and

$$\begin{aligned} E_n=\frac{1}{n}\left( {\mathbb {Z}}+i{\mathbb {Z}}\right) \cap \{z\in {\mathbb {C}}:F_n(z)\le \epsilon \}\cap [-n,n]^2. \end{aligned}$$

Clearly, we can compute $E_n$ with finitely many arithmetic operations and comparisons and we set

$$\begin{aligned} \Gamma _n(A)=\textrm{Leb}\left( \cup _{z\in E_n}D(z,\max \{0,\epsilon -F_n(z)\})\right) . \end{aligned}$$

Proposition 8.1 shows that, without loss of generality, we can assume $\Gamma _n(A)$ can be computed exactly using finitely many arithmetic operations and comparisons. The algorithm is presented in the LebPseudoSpec routine in § B.3 and the following shows that this algorithm is sharp in the SCI hierarchy.

Suppose that $F_n(z)<\epsilon $ and that $\left| w\right| <\epsilon -F_n(z)$. If $z\in \textrm{Sp}(A)$, then clearly

$$\begin{aligned} \left\| R(z+w,A)\right\| ^{-1}\le \left| w\right| <\epsilon -F_n(z)\le \epsilon , \end{aligned}$$

and this holds trivially if $z+w\in \textrm{Sp}(A)$. So assume that neither $z$ nor $z+w$ is in the spectrum. The resolvent identity yields

$$\begin{aligned} \left\| R(z+w,A)\right\| \ge \left\| R(z,A)\right\| -\left| w\right| \left\| R(z+w,A)\right\| \left\| R(z,A)\right\| , \end{aligned}$$

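which, after dividing through by $\left\| R(z+w,A)\right\| \left\| R(z,A)\right\| $ and using $\left\| R(z,A)\right\| ^{-1}\le F_n(z)$, rearranges to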

$$\begin{aligned} \left\| R(z+w,A)\right\| ^{-1}\le \left\| R(z,A)\right\| ^{-1}+\left| w\right| <\epsilon . \end{aligned}$$

It follows that $\cup _{z\in E_n}D(z,\max \{0,\epsilon -F_n(z)\})$ is contained in $\widehat{\textrm{Sp}}_{\epsilon }(A)$ and hence that $\Gamma _{n}(A)\le \Xi _2^L(A)$. Without loss of generality, by taking successive maxima, we can assume that $\Gamma _{n}(A)$ is increasing. Together, these will yield the $\Sigma ^A_1$ classification once convergence is shown. Using the uniform convergence of $F_n$ and density of $1/n({\mathbb {Z}}+i{\mathbb {Z}})\cap [-n,n]^2$, we see that pointwise convergence holds:

$$\begin{aligned} \chi _{\cup _{z\in E_n}D(z,\max \{0,\epsilon -F_n(z)\})}\rightarrow \chi _{\widehat{\textrm{Sp}}_{\epsilon }(A)}, \end{aligned}$$

where $\chi _E$ denotes the indicator function of a set E. It follows by the dominated convergence theorem that $\Gamma _n(A)\rightarrow \textrm{Leb}(\widehat{\textrm{Sp}}_{\epsilon }(A))$. The proof for $\Omega _{\textrm{D}}$ is similar by restricting everything to the real line.
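The construction of $\Gamma _n$ can be tried out in the one-dimensional (diagonal) setting with a few lines of code. The sketch below is only an illustration under simplifying assumptions: the toy spectrum `POINTS` is hypothetical, and the exact distance to the spectrum is used in place of the approximations $F_n$, so only the effect of the grid $\frac{1}{n}{\mathbb {Z}}\cap [-n,n]$ is visible.

```python
import math

POINTS = [1.0 / 3.0, -math.sqrt(2.0) / 2.0]  # toy spectrum of a diagonal operator (assumed)
EPS = 0.2


def dist_to_spectrum(z):
    """Stands in for F_n(z); for normal operators this equals ||R(z, A)||^{-1}."""
    return min(abs(z - p) for p in POINTS)


def union_length(intervals):
    """Lebesgue measure of a finite union of intervals (lo, hi)."""
    total, cur_lo, cur_hi = 0.0, None, None
    for lo, hi in sorted(iv for iv in intervals if iv[1] > iv[0]):
        if cur_hi is None or lo > cur_hi:
            if cur_hi is not None:
                total += cur_hi - cur_lo
            cur_lo, cur_hi = lo, hi
        else:
            cur_hi = max(cur_hi, hi)
    if cur_hi is not None:
        total += cur_hi - cur_lo
    return total


def gamma(n):
    """Measure of the union of intervals of radius EPS - F(z) about grid points with F(z) <= EPS."""
    pieces = []
    for k in range(-n * n, n * n + 1):  # the grid (1/n)Z intersected with [-n, n]
        z = k / n
        f = dist_to_spectrum(z)
        if f <= EPS:
            r = max(0.0, EPS - f)
            pieces.append((z - r, z + r))
    return union_length(pieces)


values = [gamma(n) for n in range(1, 51)]
```

Every value in `values` is a lower bound for the measure of the toy pseudospectrum, here two disjoint intervals of total length $4\epsilon $. The raw values need not be monotone in $n$, which is why the proof passes to successive maxima.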

Step 2 $\{\Xi _2^L,\Omega ,\Lambda _1\}\in \Sigma _2^A$ for $\Omega =\Omega _{\textrm{B}},\Omega _\textrm{SA}$, $\Omega _\textrm{N}$ or $\Omega _g$. To prove this, we simply replace $F_{n_1}$ by the functions $\gamma _{n_2,n_1}$ and set

$$\begin{aligned} \Gamma _{n_2,n_1}(A)=\textrm{Leb}\left( \cup _{z\in E_{n_2}}D(z,\max \{0,\epsilon -\gamma _{n_2,n_1}(z;A)\})\right) . \end{aligned}$$

Step 3 $\{\Xi _2^L,\Omega ,\Lambda _2\}\in \Sigma _1^A$ for $\Omega =\Omega _{\textrm{B}},\Omega _\textrm{SA}$, $\Omega _\textrm{N}$ or $\Omega _g$. The knowledge of matrix values of $A^*A$ allows us to skip the first limit in the construction of step 2 and approximate $\gamma _n$ directly. $\square $

Proof of Proposition 3.15

We begin with the proof of 1. Suppose $A\in \Omega _\textrm{D}$ has $\textrm{Leb}_{{\mathbb {R}}}(\textrm{Sp}(A))=0$ and let $A_n\in \Omega _\textrm{D}$ be such that $\left\| A-A_n\right\| \rightarrow 0$ as $n\rightarrow \infty $. This implies that $\textrm{Sp}(A_n)\rightarrow \textrm{Sp}(A)$ since all our operators are normal. To prove that $\textrm{Leb}_{{\mathbb {R}}}(\textrm{Sp}(A_n))\rightarrow 0$, it is enough to prove that

$$\begin{aligned} \textrm{Leb}(F_n)\downarrow 0, \end{aligned}$$

(8.1)

where $F_n=\textrm{Sp}(A)\cup (\cup _{m\ge n}\textrm{Sp}(A_m))$. But $F_n$ decreases to $\textrm{Sp}(A)$ and is bounded in measure, so (8.1) holds. For the converse, let $\textrm{Leb}_{{\mathbb {R}}}(\textrm{Sp}(A))>0$. Without loss of generality, assume that all of A’s entries lie in [0, 1]. Let ${\mathbb {D}}_n$ denote the set $\{j/2^n\}_{j=0}^{2^n}$ and consider the map $ \phi _n:x\mapsto 2^{-n}\left\lceil x2^n\right\rceil $ from [0, 1] to ${\mathbb {D}}_n$. Let $A_n$ be the diagonal operator obtained by applying $\phi _n$ to each of A’s entries. We clearly have that $\left\| A-A_n\right\| \rightarrow 0$ as $n\rightarrow \infty $ but note that $\textrm{Sp}(A_n)$ is finite so has Lebesgue measure 0. Hence $\Xi _1^L$ is discontinuous at A.
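The rounding map in the converse direction is easy to check numerically. The following sketch applies it to a finite list of entries standing in for the diagonal of $A$. The choice of entries (fractional parts of multiples of the golden ratio) and the names `phi`, `entries` and `rounded` are assumptions made for illustration.

```python
import math


def phi(n, x):
    """Round x in [0, 1] up to the nearest multiple of 2^-n."""
    return math.ceil(x * 2 ** n) / 2 ** n


# Toy diagonal entries: fractional parts of multiples of the golden ratio. The full
# sequence is dense in [0, 1]; only finitely many terms can be listed here.
golden = (1 + math.sqrt(5)) / 2
entries = [(j * golden) % 1.0 for j in range(1, 2001)]

n = 4
rounded = [phi(n, x) for x in entries]
max_error = max(abs(x - y) for x, y in zip(entries, rounded))
distinct_values = sorted(set(rounded))
```

Here `max_error` is at most $2^{-n}$, which bounds the norm of the difference of the two diagonal operators, while `distinct_values` contains at most $2^n+1$ points, so the rounded operator has finite spectrum.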

To prove 2, note that for $A\in \Omega _\textrm{D}$, $\textrm{Leb}_{{\mathbb {R}}}(S_{\epsilon }(A))=0$. Let $A_n\in \Omega _\textrm{D}$ have $\left\| A-A_n\right\| \rightarrow 0$. Then given some $0<\delta <\epsilon $ it holds for large n that $ \textrm{Sp}_{\epsilon -\delta }(A)\subset \textrm{Sp}_{\epsilon }(A_n)\subset \textrm{Sp}_{\epsilon +\delta }(A) $ and hence that

$$\begin{aligned} \limsup _{n\rightarrow \infty }\textrm{Leb}_{{\mathbb {R}}}(\textrm{Sp}_{\epsilon }(A_n))&\le \textrm{Leb}_{{\mathbb {R}}}(\textrm{Sp}_{\epsilon +\delta }(A))\\ \liminf _{n\rightarrow \infty }\textrm{Leb}_{{\mathbb {R}}}(\textrm{Sp}_{\epsilon }(A_n))&\ge \textrm{Leb}_{{\mathbb {R}}}(\textrm{Sp}_{\epsilon -\delta }(A)). \end{aligned}$$

Now let $\delta \downarrow 0$ and use the fact that $\Xi _2^L$ is continuous in $\epsilon $. $\square $

Finally, we deal with the question of determining whether the Lebesgue measure is zero. Recall that for this problem, $({\mathcal {M}},d)$ denotes the set $\{0,1\}$ endowed with the discrete topology and we consider the problem function

$$\begin{aligned} \Xi _3^L(A)= {\left\{ \begin{array}{ll} 0,\quad \text { if }\textrm{Leb}(\textrm{Sp}(A))>0\\ 1,\quad \text { otherwise.} \end{array}\right. } \end{aligned}$$

Proof of Theorem 3.16

We will show that $\{\Xi _3^L,\Omega _f,\Lambda _1\}\in \Pi _3^A$ and $\{\Xi _3^L,\Omega _\textrm{D},\Lambda _2\}\notin \Delta _3^G$. The analogous statements $\{\Xi _3^L,\Omega _\textrm{D},\Lambda _1\}\in \Pi _3^A$ and $\{\Xi _3^L,\Omega _f,\Lambda _2\}\notin \Delta _3^G $ follow from similar arguments.

The lower bound argument can also be used when considering $\Lambda _2$ and $\Omega =\Omega _{\textrm{B}},\Omega _\textrm{SA}$, $\Omega _\textrm{N}$ or $\Omega _g$. We will also prove the lower bound $\{\Xi _3^L,\Omega _\textrm{SA},\Lambda _1\}\notin \Delta _4^G$. The remaining lower bounds for $\Lambda _1$ follow from a similar argument and construction as in step 5 of the proof of Theorem 3.13 to ensure we are dealing with two-dimensional Lebesgue measure. Finally, we prove that $\{\Xi _3^L,\Omega _{\textrm{B}},\Lambda _1\}\in \Pi _4^A$. The upper bounds for $\Omega =\Omega _\textrm{SA}$, $\Omega _\textrm{N}$ or $\Omega _g$ and $\Lambda _1$ follow an almost identical argument. When considering $\Lambda _2$, we can collapse the first limit in the same manner as we did for solving $\Xi _1^L$.

Step 1 $\{\Xi _3^L,\Omega _f,\Lambda _1\}\in \Pi _3^A$. First we use the algorithm used to compute $\Xi _1^L$ in Theorem 3.13, which we shall denote by ${\widetilde{\Gamma }}$, to build a height 3 tower for $\{\Xi _3^L,\Omega _f\}$. As above, $\Omega _f$ denotes the set of bounded operators with the usual assumption of bounded dispersion (now with known bounds $c_n$). Recall that we observed

${\widetilde{\Gamma }}_{n_2,n_1}(A)$ converges to ${\widetilde{\Gamma }}_{n_2}(A)$ from below as $n_1\rightarrow \infty $.
${\widetilde{\Gamma }}_{n_2}(A)$ converges to $\textrm{Leb}(\textrm{Sp}(A))$ monotonically from above as $n_2\rightarrow \infty $.

We can alter our algorithms, by taking maxima, so that we can assume without loss of generality that ${\widetilde{\Gamma }}_{n_2,n_1}(A)$ converges to ${\widetilde{\Gamma }}_{n_2}(A)$ monotonically from below as $n_1\rightarrow \infty $. Now let

$$\begin{aligned} \Gamma _{n_3,n_2,n_1}(A)=\chi _{[0,1/n_3]}({\widetilde{\Gamma }}_{n_2,n_1}(A)). \end{aligned}$$

Note that $\chi _{[0,1/n_3]}$ is left continuous on $[0,\infty )$ with right limits. Hence by the assumed monotonicity

$$\begin{aligned} \lim _{n_1\rightarrow \infty }\Gamma _{n_3,n_2,n_1}(A)=\chi _{[0,1/n_3]}({\widetilde{\Gamma }}_{n_2}(A)). \end{aligned}$$

It follows that

$$\begin{aligned} \lim _{n_2\rightarrow \infty }\lim _{n_1\rightarrow \infty }\Gamma _{n_3,n_2,n_1}(A)=\chi _{[0,1/n_3]}(\textrm{Leb}(\textrm{Sp}(A))\pm ), \end{aligned}$$

where ± denotes one of the right or left limits (it is possible to have either). It is then easy to see that

$$\begin{aligned} \lim _{n_3\rightarrow \infty }\lim _{n_2\rightarrow \infty }\lim _{n_1\rightarrow \infty }\Gamma _{n_3,n_2,n_1}(A)=\Xi _3^L(A). \end{aligned}$$

It is also clear that the answer to the question is “No” if $\Gamma _{n_3}(A)=0$, which yields the $\Pi _3^A$ classification.
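The role of the indicator $\chi _{[0,1/n_3]}$ can be seen in a stripped-down sketch that drops the innermost limit and thresholds a sequence of upper estimates of the Lebesgue measure. The estimate sequences `null_case` and `positive_case` are hypothetical stand-ins for ${\widetilde{\Gamma }}_{n_2}(A)$, chosen only to show how the two outer limits behave.

```python
def indicator(value, n3):
    """chi_{[0, 1/n3]}(value)."""
    return 1 if 0.0 <= value <= 1.0 / n3 else 0


def decision(estimate, n3, n2):
    """Threshold an upper estimate of the Lebesgue measure at level 1/n3."""
    return indicator(estimate(n2), n3)


# Hypothetical upper estimates decreasing to the true measure as n2 grows.
def null_case(n2):
    return 1.0 / n2          # true measure 0


def positive_case(n2):
    return 0.3 + 1.0 / n2    # true measure 0.3


large_n2 = 10 ** 6
null_outputs = [decision(null_case, n3, large_n2) for n3 in range(1, 11)]
positive_outputs = [decision(positive_case, n3, large_n2) for n3 in range(1, 11)]
```

For a null set the output is 1 for every threshold once $n_2$ is large, whereas for positive measure the output switches to 0 as soon as $1/n_3$ drops below the measure.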

Step 2 $\{\Xi _3^L,\Omega _\textrm{D},\Lambda _1\}\notin \Delta _3^G$. Assume for a contradiction that this is false and $\{{\widehat{\Gamma }}_{n_2,n_1}\}$ is a general height two tower for $\{\Xi _3^L,\Omega _\textrm{D}\}$. Let $({\mathcal {M}},d)$ be the discrete space $\{0,1\}$, let ${{\tilde{\Omega }}}$ denote the collection of all infinite matrices $\{a_{i,j}\}_{i,j\in {\mathbb {N}}}$ with entries $a_{i,j}\in \{0,1\}$ and consider the problem function

$$\begin{aligned} {{\tilde{\Xi }}}_1(\{a_{i,j}\}):\text { Does }\{a_{i,j}\}\text { have a column containing infinitely many nonzero entries?} \end{aligned}$$

For $j\in {\mathbb {N}}$, let $\{b_{i,j}\}_{i\in {\mathbb {N}}}$ be a dense subset of $I_j:=[1-1/2^{j-1},1-1/{2^j}]$. Given a matrix $\{a_{i,j}\}_{i,j\in {\mathbb {N}}}\in {{\tilde{\Omega }}}$, construct a matrix $\{c_{i,j}\}_{i,j\in {\mathbb {N}}}$ by letting $c_{i,j}=a_{i,j}b_{r(i,j),j}$ where

$$\begin{aligned} r(i,j)=\max \left\{ 1,\sum _{k=1}^{i}a_{k,j}\right\} . \end{aligned}$$

Now consider any bijection $\phi :{\mathbb {N}}\rightarrow {\mathbb {N}}^2$ and define the diagonal operator

$$\begin{aligned} A=\textrm{diag}(c_{\phi (1)},c_{\phi (2)},c_{\phi (3)},\ldots ). \end{aligned}$$

The algorithm ${\widehat{\Gamma }}_{n_2,n_1}$ thus translates to an algorithm $\Gamma '_{n_2,n_1}$ for $\{{{\tilde{\Xi }}}_1,{{\tilde{\Omega }}}\}$. Namely, set $\Gamma '_{n_2,n_1}(\{a_{i,j}\}_{i,j\in {\mathbb {N}}})={\widehat{\Gamma }}_{n_2,n_1}(A)$. The fact that $\phi $ is a bijection shows that the lowest level $\Gamma '_{n_2,n_1}$ are general algorithms (and are consistent). In particular, given N, we can find $\{A_{i,j}:i,j\le N\}$ using finitely many evaluations of the matrix values $\{c_{k,l}\}$. But for any given $c_{k,l}$, we can evaluate this entry using only finitely many evaluations of the matrix values $\{a_{m,n}\}$ by the construction of r. Finally note that

$$\begin{aligned} \textrm{Sp}(A)=\left( \bigcup _{j:\sum _ia_{i,j}=\infty }I_j\right) \cup Q, \end{aligned}$$

where Q is at most countable. Hence,

$$\begin{aligned} \textrm{Leb}_{{\mathbb {R}}}(\textrm{Sp}(A))=\sum _{j:\sum _ia_{i,j}=\infty }\frac{1}{2^j}. \end{aligned}$$

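It follows that ${{\tilde{\Xi }}}_1(\{a_{i,j}\})=1-\Xi _3^L(A)$ (identifying “Yes” with 1), so that $1-\Gamma '_{n_2,n_1}$ provides a height two tower for $\{{{\tilde{\Xi }}}_1,{{\tilde{\Omega }}}\}$, and hence, we get a contradiction.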
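The reduction in Step 2 rests on the fact that each entry $c_{i,j}$ depends on only finitely many entries of $\{a_{i,j}\}$, and that a column with infinitely many ones sweeps out a dense subset of $I_j$. The sketch below illustrates this on a finite array. The particular dense sequence (a rescaled base-2 van der Corput sequence) and the names `van_der_corput`, `b` and `c` are assumptions made for this example, since the proof allows any dense sequence.

```python
def van_der_corput(i):
    """i-th term (i >= 1) of the base-2 van der Corput sequence, which is dense in [0, 1]."""
    value, scale = 0.0, 0.5
    while i > 0:
        value += scale * (i % 2)
        i //= 2
        scale /= 2
    return value


def b(i, j):
    """Assumed dense sequence in I_j = [1 - 2^-(j-1), 1 - 2^-j]."""
    return (1.0 - 2.0 ** (-(j - 1))) + 2.0 ** (-j) * van_der_corput(i)


def c(a, i, j):
    """Entry c_{i,j} = a_{i,j} * b_{r(i,j), j}; it reads only a_{1,j}, ..., a_{i,j}."""
    r = max(1, sum(a[k - 1][j - 1] for k in range(1, i + 1)))
    return a[i - 1][j - 1] * b(r, j)


rows = 50
# Column 1 consists of ones; column 2 has ones only in rows 1, 5 and 9.
a = [[1, 1 if i in (1, 5, 9) else 0] for i in range(1, rows + 1)]
column1 = {c(a, i, 1) for i in range(1, rows + 1)}
column2 = {c(a, i, 2) for i in range(1, rows + 1)}
```

The first column produces a new point of $I_1$ in every row, whereas the second column contributes only three points of $I_2$ together with the value 0, matching the dichotomy used in the formula for $\textrm{Sp}(A)$.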

Step 3 $\{\Xi _3^L,\Omega _\textrm{SA},\Lambda _1\}\notin \Delta _4^G$. Suppose for a contradiction that $\{\Gamma _{n_3,n_2,n_1}\}$ is a height three tower of general algorithms for the problem $\{\Xi _3^L,\Omega _\textrm{SA},\Lambda _1\}$. Let $({\mathcal {M}},d)$ be the space $\{0,1\}$ with the discrete metric, let ${\tilde{\Omega }}$ denote the collection of all infinite arrays $\{a_{m,i,j}\}_{m,i,j\in {\mathbb {N}}}$ with entries $a_{m,i,j}\in \{0,1\}$ and consider the problem function

$$\begin{aligned} \begin{aligned} {\tilde{\Xi }}_4(\{a_{m,i,j}\}):\text { For every { m}, does }\{a_{m,i,j}\}_{i,j}&\text { have (only) finitely many columns}\\&\quad \quad \quad \quad \quad \quad \quad \text {with (only) finitely many 1's?} \end{aligned} \end{aligned}$$

Recall that it is shown in Sect. 5 that $\textrm{SCI}({\tilde{\Xi }}_4,{\tilde{\Omega }})_{G} = 4$. We will gain a contradiction by using the supposed height three tower to solve $\{{\tilde{\Xi }}_4,{\tilde{\Omega }}\}$.

The construction follows step 3 of the proof of Theorem 3.7 closely. For fixed m, recall the construction of the operator $A_m:=A(\{a_{m,i,j}\}_{i,j})$ from that proof, the key property being that if $\{a_{m,i,j}\}_{i,j}$ has (only) finitely many columns with (only) finitely many 1’s then $\textrm{Sp}(A_m)$ is a finite subset of $[-1,1]$, otherwise it is the whole interval $[-1,1]$. Now consider the intervals $I_m=[1-2^{-(m-1)},1-2^{-m}]$ and affine maps, $\alpha _m$, that act as a bijection from $[-1,1]$ to $I_m$. Without loss of generality, identify $\Omega _\textrm{SA}$ with self-adjoint operators in ${\mathcal {B}}(X)$ where $X=\bigoplus _{i=1}^{\infty }\bigoplus _{j=1}^{\infty }X_{i,j}$ in the $l^2$-sense with $X_{i,j}=l^2({\mathbb {N}})$. We then consider the operator

$$\begin{aligned} T(\{a_{m,i,j}\}_{m,i,j})=\bigoplus _{m=1}^\infty \alpha _{m}(A_m). \end{aligned}$$

The same arguments in the proof of Theorem 3.7 show that the map

$$\begin{aligned} {{\tilde{\Gamma }}}_{n_3,n_2,n_1}(\{a_{m,i,j}\}_{m,i,j})=\Gamma _{n_3,n_2,n_1}(T(\{a_{m,i,j}\}_{m,i,j})) \end{aligned}$$

defines a general tower using the relevant pointwise evaluation functions of the array $\{a_{m,i,j}\}_{m,i,j}$. If it holds that ${\tilde{\Xi }}_4(\{a_{m,i,j}\})=1$, then $\textrm{Sp}(T(\{a_{m,i,j}\}_{m,i,j}))$ is countable and hence $\Xi _3^L(T(\{a_{m,i,j}\}_{m,i,j}))=1$. On the other hand, if ${\tilde{\Xi }}_4(\{a_{m,i,j}\})=0$, then there exists m with $\textrm{Sp}(A_m)=[-1,1]$, and hence, $I_m\subset \textrm{Sp}(T(\{a_{m,i,j}\}_{m,i,j}))$ so that $\Xi _3^L(T(\{a_{m,i,j}\}_{m,i,j}))=0$. It follows that $\{{{\tilde{\Gamma }}}_{n_3,n_2,n_1}\}$ provides a height three tower for $\{{\tilde{\Xi }}_4,{\tilde{\Omega }}\}$, a contradiction.

Step 4 $\{\Xi _3^L,\Omega _{\textrm{B}},\Lambda _1\}\in \Pi _4^A$. Recall the tower of algorithms to solve $\{\Xi _1^L,\Omega _{\textrm{B}},\Lambda _1\}$ and denote it by ${\widetilde{\Gamma }}$. Our strategy will be the same as in step 1 but with an additional successive limit. It is easy to show that

${\widetilde{\Gamma }}_{n_3,n_2,n_1}(A)$ converges to ${\widetilde{\Gamma }}_{n_3,n_2}(A)$ from above as $n_1\rightarrow \infty $.
${\widetilde{\Gamma }}_{n_3,n_2}(A)$ converges to ${\widetilde{\Gamma }}_{n_3}(A)$ from below as $n_2\rightarrow \infty $.
${\widetilde{\Gamma }}_{n_3}(A)$ converges to $\textrm{Leb}(\textrm{Sp}(A))$ from above as $n_3\rightarrow \infty $.

Again, by taking successive maxima or minima where appropriate, we can assume that all of these are monotonic. Now let

$$\begin{aligned} \Gamma _{n_4,n_3,n_2,n_1}(A)=\chi _{[0,1/n_4]}({\widetilde{\Gamma }}_{n_3,n_2,n_1}(A)). \end{aligned}$$

Note that $\chi _{[0,1/n_4]}$ is left continuous on $[0,\infty )$ with right limits. Hence by the assumed monotonicity and arguments as in step 1, it is easy to see that

$$\begin{aligned} \lim _{n_4\rightarrow \infty }\lim _{n_3\rightarrow \infty }\lim _{n_2\rightarrow \infty }\lim _{n_1\rightarrow \infty }\Gamma _{n_4,n_3,n_2,n_1}(A)=\Xi _3^L(A). \end{aligned}$$

It is also clear that the answer to the question is “No” if $\Gamma _{n_4}(A)=0$, which yields the $\Pi _4^A$ classification. $\square $

9 Proofs Concerning Fractal Dimensions

We begin with the box-counting dimension. For the construction of towers of algorithms, it is useful to use a slightly different but equivalent [71] definition of the upper and lower box-counting dimensions. Let $F\subset {\mathbb {R}}$ be bounded and $N'_\delta (F)$ denote the number of $\delta $-mesh intervals that intersect F. A $\delta $-mesh interval is an interval of the form $[m\delta ,(m+1)\delta ]$ for $m\in {\mathbb {Z}}$. Then

$$\begin{aligned} \overline{\textrm{dim}}_B(F)=\limsup _{\delta \downarrow {}0}\frac{\log (N'_{\delta }(F))}{\log (1/\delta )},\quad \underline{\textrm{dim}}_B(F)=\liminf _{\delta \downarrow {}0}\frac{\log (N'_{\delta }(F))}{\log (1/\delta )}. \end{aligned}$$

Proof of Theorem 3.18 for box-counting dimension

Since $\Omega ^{BD}_\textrm{D}\subset \Omega ^{BD}_f\subset \Omega ^{BD}_{\textrm{SA}}$, it is enough to prove that $\{\Xi _B,\Omega ^{BD}_f,\Lambda _1\}\in \Pi _2^A$, $\{\Xi _B,\Omega _{\textrm{SA}}^{BD},\Lambda _2\}\in \Pi _2^A$, $\{\Xi _B,\Omega _{\textrm{SA}}^{BD},\Lambda _1\}\in \Pi _3^A$, $\{\Xi _B,\Omega _{\textrm{SA}}^{BD},\Lambda _1\}\not \in \Delta _3^A$ and $\{\Xi _B,\Omega ^{BD}_\textrm{D},\Lambda _2\}\not \in \Delta _2^A$.

Step 1 $\{\Xi _B,\Omega ^{BD}_f,\Lambda _1\}\in \Pi _2^A$. Recall the existence of a height one tower, $\{{{\tilde{\Gamma }}}_n\}$, using $\Lambda _1$ for $\textrm{Sp}(A)$, $A\in \Omega ^{BD}_f$ from “Appendix A”. Furthermore, ${{\tilde{\Gamma }}}_n(A)$ outputs a finite collection $\{z_{1,n},\ldots ,z_{k_n,n}\}\subset {\mathbb {Q}}$ such that $\textrm{dist}(z_{j,n},\textrm{Sp}(A))\le 2^{-n}$. Define the intervals

$$\begin{aligned} I_{j,n}=[z_{j,n}-2^{-n},z_{j,n}+2^{-n}] \end{aligned}$$

and let ${\mathcal {I}}_{m}$ denote the collection of all $2^{-m}$-mesh intervals. Let $\Upsilon _{m,n}(A)$ be any union of finitely many such mesh intervals with minimal length $\left| \Upsilon _{m,n}(A)\right| $ (“length” being the number of intervals $\in {\mathcal {I}}_{m}$ that make up $\Upsilon _{m,n}(A)$) such that

$$\begin{aligned} \Upsilon _{m,n}(A)\cap I_{j,l}\ne \emptyset ,\quad \text {for }1\le l\le n,1\le j\le k_l. \end{aligned}$$

There may be more than one such collection, so we can gain a deterministic algorithm by enumerating each ${\mathcal {I}}_{m}$ and choosing the first such collection in this enumeration. It is then clear that $\left| \Upsilon _{m,n}(A)\right| $ is increasing in n. Furthermore, to determine $\Upsilon _{m,n}(A)$, there are only finitely many intervals in ${\mathcal {I}}_{m}$ to consider, namely those that have non-empty intersection with at least one $I_{j,l}$ with $1\le l\le n,1\le j\le k_l$. It follows that $\Upsilon _{m,n}(A)$ and hence $\left| \Upsilon _{m,n}(A)\right| $ can be computed in finitely many arithmetic operations and comparisons using $\Lambda _1$.

Suppose that $I=[a,b]\in {\mathcal {I}}_{m}$ has $(a,b)\cap \textrm{Sp}(A)\ne \emptyset $. Then for large n there exists $z_{j,n}\in I$ such that $I_{j,n}\subset I$ and hence $I\subset \Upsilon _{m,n}(A)$ for large n. If $z\in \textrm{Sp}(A)\cap 2^{-m}{\mathbb {Z}}$, then a similar argument shows that $z\in \Upsilon _{m,n}(A)$ for large n. Since $\textrm{Sp}(A)$ is bounded and $\textrm{Sp}(A)\cap 2^{-m}{\mathbb {Z}}$ is finite, it follows that $\textrm{Sp}(A)\subset \Upsilon _{m,n}(A)$ for large n and hence

$$\begin{aligned} N_{2^{-m}}(\textrm{Sp}(A))\le \liminf _{n\rightarrow \infty }\left| \Upsilon _{m,n}(A)\right| . \end{aligned}$$

Let $W_m(A)$ be the union of all intervals in ${\mathcal {I}}_m$ that intersect $\textrm{Sp}(A)$. It is clear that $W_m(A)\cap I_{j,l}\ne \emptyset $ for $1\le l\le n,1\le j\le k_l$, and hence, $\left| \Upsilon _{m,n}(A)\right| \le N'_{2^{-m}}(\textrm{Sp}(A))$. It follows that $\lim _{n\rightarrow \infty }\left| \Upsilon _{m,n}(A)\right| =\delta _{m}(A)$ exists with

$$\begin{aligned} N_{2^{-m}}(\textrm{Sp}(A))\le \delta _{m}(A)\le N'_{2^{-m}}(\textrm{Sp}(A)). \end{aligned}$$

(9.1)

For $n_2>n_1$ set $\Gamma _{n_2,n_1}(A)=0$, otherwise set

$$\begin{aligned} \Gamma _{n_2,n_1}(A)=\max _{n_2\le k \le n_1} \max _{1\le j\le n_1}\frac{\log (\left| \Upsilon _{k,j}(A)\right| )}{k\log (2)}. \end{aligned}$$

The above monotone convergence and (9.1) show that

$$\begin{aligned} \lim _{n_1\rightarrow \infty }\Gamma _{n_2,n_1}(A)&=\Gamma _{n_2}(A)=\sup _{k\ge n_2}\frac{\log (\delta _k(A))}{k\log (2)}\ge \limsup _{k\rightarrow \infty }\frac{\log (\delta _k(A))}{k\log (2)},\\ \lim _{n_2\rightarrow \infty }\Gamma _{n_2}(A)&=\limsup _{k\rightarrow \infty }\frac{\log (\delta _k(A))}{k\log (2)}. \end{aligned}$$

Hence, by the assumption that the box-counting dimension exists, we have constructed a $\Pi _2^A$ tower.
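The quantity $\left| \Upsilon _{m,n}(A)\right| $ is the solution of a small combinatorial problem: choose as few $2^{-m}$-mesh intervals as possible so that every $I_{j,l}$ is met. In one dimension this can be done greedily. The sketch below is one possible way to compute the count and is not taken from the paper. The function names and the input format (a plain list of interval endpoints) are assumptions.

```python
import math


def mesh_range(a, b, m):
    """Indices k for which the mesh interval [k 2^-m, (k+1) 2^-m] meets [a, b]."""
    return math.ceil(a * 2 ** m) - 1, math.floor(b * 2 ** m)


def min_mesh_cover(intervals, m):
    """Least number of 2^-m mesh intervals whose union meets every interval in the list."""
    ranges = sorted((mesh_range(a, b, m) for a, b in intervals), key=lambda r: r[1])
    count, last = 0, None
    for lo, hi in ranges:
        if last is None or lo > last:
            last = hi  # greedy: pick the right-most mesh interval that still meets this one
            count += 1
    return count


def dimension_estimate(intervals, m):
    """log(count) / (m log 2) for m >= 1, the ratio entering the tower."""
    return math.log(min_mesh_cover(intervals, m)) / (m * math.log(2))


# Hypothetical intervals I_{j,l} produced by some spectral approximation.
toy_intervals = [(0.1, 0.2), (0.15, 0.3), (0.9, 0.95)]
counts = [min_mesh_cover(toy_intervals, m) for m in range(0, 4)]
```

Only the number of chosen mesh intervals enters the tower, so the sketch returns the count rather than the collection itself.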

Step 2 $\{\Xi _B,\Omega _{\textrm{SA}}^{BD},\Lambda _2\}\in \Pi _2^A$ and $\{\Xi _B,\Omega _{\textrm{SA}}^{BD},\Lambda _1\}\in \Pi _3^A$. The first of these is exactly as in step 1, using $\Lambda _2$ to construct the relevant $\Sigma _1^A$ tower for the spectrum. The proof that $\{\Xi _B,\Omega _{\textrm{SA}}^{BD},\Lambda _1\}\in \Pi _3^A$ uses a height two tower, $\{{{\tilde{\Gamma }}}_{n_2,n_1}\}$, using $\Lambda _1$ for $\textrm{Sp}(A)$, $A\in \Omega _{\textrm{SA}}^{BD}$ (or any self-adjoint A) constructed in [18]. This tower has the property that each ${{\tilde{\Gamma }}}_{n_2,n_1}(A)$ is a finite subset of ${\mathbb {Q}}$ and, for fixed $n_2$, is constant for large $n_1$. Moreover, if $z\in \lim _{n_1\rightarrow \infty }{{\tilde{\Gamma }}}_{n_2,n_1}(A)$, then $\textrm{dist}(z,\textrm{Sp}(A))\le 2^{-n_2}$. It follows that we can use the same construction as step 1 with an additional limit at the start to reach the finite set $\lim _{n_1\rightarrow \infty }{{\tilde{\Gamma }}}_{n_2,n_1}(A)$.

Step 3 $\{\Xi _B,\Omega ^{BD}_\textrm{D},\Lambda _2\}\not \in \Delta _2^A$. This is exactly the same argument as step 2 of the proof of Theorem 3.13 with Lebesgue measure replaced by box-counting dimension.

Step 4 $\{\Xi _B,\Omega _{\textrm{SA}}^{BD},\Lambda _1\}\not \in \Delta _3^A$. This is exactly the same argument as step 4 of the proof of Theorem 3.13 with Lebesgue measure replaced by box-counting dimension. $\square $

We now turn to the Hausdorff dimension. Recall Lemma 3.21 on the problem of determining whether $\textrm{Sp}(A)\cap (a,b)\ne \emptyset $.

Proof of Lemma 3.21

We start with the class $\Omega _f\cap \Omega _\textrm{SA}$. We can interpret this problem as a decision problem and the following algorithm as one that halts on output “Yes”. Let $c=(a+b)/2$ and $\delta =(b-a)/2$, then the idea is to simply test whether $\texttt {DistSpec}(A,n,c,f(n))+c_n<\delta $. If the answer is yes, then we output “Yes”, otherwise we output “No” and increase n by one. Note that $\textrm{Sp}(A)\cap (a,b)\ne \emptyset $ if and only if $\left\| R(c,A)\right\| ^{-1}<\delta $ and hence as $\texttt {DistSpec}(A,n,c,f(n))+c_n$ converges down to $\left\| R(c,A)\right\| ^{-1}$ we see that this provides a convergent algorithm. For $\Omega _\textrm{SA}$, we require an additional successive limit by replacing $\texttt {DistSpec}(A,n,c,f(n))+c_n$ with the function $\gamma _{n_2,n_1}(z;A)$. If we have access to $\Lambda _2$, then this can be avoided in the usual way. $\square $
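The test in this proof is a one-sided search: a “Yes” is certified after finitely many steps, while a “No” is only ever the current output. A minimal sketch of this loop follows, with a finite budget `max_n` added so that it terminates. The callable `upper_bound` is a hypothetical stand-in for $\texttt {DistSpec}(A,n,c,f(n))+c_n$, and the toy bound below is an assumption chosen so that the limit is known.

```python
def spectrum_meets_interval(upper_bound, a, b, max_n=1000):
    """Return the first n with upper_bound(c, n) < delta, or None if the budget runs out.

    upper_bound(c, n) is assumed to be non-increasing in n with limit ||R(c, A)||^{-1}.
    """
    c, delta = (a + b) / 2, (b - a) / 2
    for n in range(1, max_n + 1):
        if upper_bound(c, n) < delta:
            return n   # certificate that the spectrum meets (a, b): output "Yes"
    return None        # output so far is "No"; this is never certified in finite time


# Toy stand-in (assumed): a self-adjoint operator with spectrum {0.3}, for which
# ||R(c, A)||^{-1} = |c - 0.3|, approached from above at rate 1/n.
def toy_bound(c, n):
    return abs(c - 0.3) + 1.0 / n


hit = spectrum_meets_interval(toy_bound, 0.0, 1.0)
miss = spectrum_meets_interval(toy_bound, 0.4, 0.5)
```

In the toy example `hit` is the step at which the bound first drops below $\delta $, and `miss` is `None` because the interval $(0.4,0.5)$ contains no spectrum.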

To build our algorithm for the Hausdorff dimension, we use an alternative, equivalent definition for compact sets. We consider the case of subsets of ${\mathbb {R}}$. Let $\rho _k$ denote the set of all closed binary intervals of the form $ [2^{-k}m,2^{-k}(m+1)], m\in {\mathbb {Z}}. $ Set

$$\begin{aligned} {\mathcal {A}}_k(F)=\left\{ \{U_i\}_{i\in I}:I\text { is finite },F\subset \cup _{i\in I}U_i,U_i\in \cup _{l\ge k}\rho _l\right\} \end{aligned}$$

and define

$$\begin{aligned} \tilde{{\mathcal {H}}}^{d}_{k}(F)=\inf \left\{ \sum _i\textrm{diam}(U_i)^d:\{U_i\}_{i\in I}\in {\mathcal {A}}_{k}(F)\right\} ,\quad \tilde{{\mathcal {H}}}^{d}(F)=\lim _{k\rightarrow \infty }\tilde{{\mathcal {H}}}^{d}_{k}(F). \end{aligned}$$

The following can be found in [81] (Theorem 3.13):

Theorem 9.1

Let F be a bounded subset of ${\mathbb {R}}$. Then, there exists a unique $d'=\textrm{dim}_{H'}(F)$ such that $\tilde{{\mathcal {H}}}^{d}(F)=0$ for $d>d'$ and $\tilde{{\mathcal {H}}}^{d}(F)=\infty $ for $d<d'$. Furthermore, $d'=\textrm{dim}_H({\overline{F}})$.

Denoting the dyadic rationals by ${\mathbb {D}}$, we shall compute $\textrm{dim}_H(\textrm{Sp}(A))$ via approximating the above applied to $F=\textrm{Sp}(A)\cap {\mathbb {D}}^c$ and using Lemma 3.21.
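When the set to be covered is itself a finite union of binary intervals of a fixed level $K$, and the cover may only use binary intervals of levels between $k$ and $K$, the infimum in the definition of $\tilde{{\mathcal {H}}}^{d}_{k}$ becomes a finite minimisation. Since binary intervals are either nested or have disjoint interiors, it can be solved by a bottom-up pass over the binary tree. The following sketch shows one way to do this. It is an illustration under these assumptions, with hypothetical names, and is not claimed to be the implementation used in the paper.

```python
def min_dyadic_cover_cost(leaves, K, k, d):
    """Minimum of sum diam(U_i)^d over covers of a union of level-K binary intervals.

    leaves: set of integers m, each standing for the interval [m 2^-K, (m+1) 2^-K];
    covers may use binary intervals of any level l with k <= l <= K.
    """
    if k > K:
        raise ValueError("need k <= K")
    # Cost of covering each selected leaf by itself.
    cost = {m: 2.0 ** (-K * d) for m in leaves}
    for level in range(K - 1, k - 1, -1):
        children_sum = {}
        for m, value in cost.items():
            parent = m // 2  # floor division also handles negative indices correctly
            children_sum[parent] = children_sum.get(parent, 0.0) + value
        own = 2.0 ** (-level * d)
        # Either use the interval at this level, or keep the best finer cover below it.
        cost = {m: min(own, value) for m, value in children_sum.items()}
    return sum(cost.values())


unit_interval = set(range(8))  # [0, 1] written as eight level-3 intervals
coarse = min_dyadic_cover_cost(unit_interval, 3, 0, 1.0)
fine = min_dyadic_cover_cost(unit_interval, 3, 0, 2.0)
```

For exponents $d\le 1$ a single coarse interval is optimal for the unit interval, while for $d>1$ the finest level wins, which is the mechanism behind the jump of $\tilde{{\mathcal {H}}}^{d}$ at the dimension.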

Proof of Theorem 3.18 for Hausdorff dimension

It is enough to prove the lower bounds $\{\Xi _H,\Omega _\textrm{D},\Lambda _2\}\notin \Delta ^G_3$, $\{\Xi _H,\Omega _\textrm{SA},\Lambda _1\}\notin \Delta ^G_4$ and construct the towers of algorithms for the inclusions $\{\Xi _H,\Omega _f\cap \Omega _\textrm{SA},\Lambda _1\}\in \Sigma ^A_3$, $\{\Xi _H,\Omega _\textrm{SA},\Lambda _1\}\in \Sigma _4^A$ and $\{\Xi _H,\Omega _\textrm{SA},\Lambda _2\}\in \Sigma _3^A$.

Step 1 $\{\Xi _H,\Omega _\textrm{D},\Lambda _2\}\notin \Delta ^G_3$. Suppose for a contradiction that a height two tower, $\{\Gamma _{n_2,n_1}\}$, exists for $\{\Xi _H,\Omega _\textrm{D}\}$ (taking values in [0, 1] without loss of generality). We repeat the argument in the proof of Theorem 3.16. Consider the same problem

$$\begin{aligned} {{\tilde{\Xi }}}_1(\{a_{i,j}\}):\text { Does }\{a_{i,j}\}\text { have a column containing infinitely many nonzero entries?} \end{aligned}$$

However, now we consider the above mapping to [0, 1] with the usual metric. We consider the same operator $ A=\textrm{diag}(c_{\phi (1)},c_{\phi (2)},c_{\phi (3)},\ldots ) $ with

$$\begin{aligned} \textrm{Sp}(A)=\left( \bigcup _{j:\sum _{i}a_{i,j}=\infty }I_j\right) \cup Q, \end{aligned}$$

where Q is at most countable. We use the fact that the Hausdorff dimension satisfies

$$\begin{aligned} \textrm{dim}_H(\cup _{j=1}^\infty X_j)=\sup _{j\in {\mathbb {N}}}\textrm{dim}_H(X_j) \end{aligned}$$

and that $\textrm{dim}_H(Q)=0$ for any countable Q to note that $\Xi _H(A)={{\tilde{\Xi }}}_1(\{a_{i,j}\})$. We set ${\tilde{\Gamma }}_{n_2,n_1}(\{a_{i,j}\}_{i,j})=\Gamma _{n_2,n_1}(A)$ to provide a height two tower for ${{\tilde{\Xi }}}_1$. But this contradicts Theorem 5.19.

Step 2 $\{\Xi _H,\Omega _\textrm{SA},\Lambda _1\}\notin \Delta ^G_4$. Suppose for a contradiction that $\{\Gamma _{n_3,n_2,n_1}\}$ is a height three tower of general algorithms for the problem $\{\Xi _H,\Omega _\textrm{SA},\Lambda _1\}$ (taking values in [0, 1] without loss of generality). Let $({\mathcal {M}},d)$ be the space [0, 1] with the usual metric, let ${\tilde{\Omega }}$ denote the collection of all infinite arrays $\{a_{m,i,j}\}_{m,i,j\in {\mathbb {N}}}$ with entries $a_{m,i,j}\in \{0,1\}$ and consider the problem function

$$\begin{aligned} \begin{aligned}&{\tilde{\Xi }}_4(\{a_{m,i,j}\}):\text { For every }m\text {, does }\{a_{m,i,j}\}_{i,j}\text { have (only) finitely many columns}\\&\quad \text {with (only) finitely many 1's?} \end{aligned} \end{aligned}$$

Recall that it is shown in Sect. 5 that $\textrm{SCI}({\tilde{\Xi }}_4,{\tilde{\Omega }})_{G} = 4$. We will gain a contradiction by using the supposed height three tower to solve $\{{\tilde{\Xi }}_4,{\tilde{\Omega }}\}$. We use the same construction as in step 3 of the proof of Theorem 3.16. If ${\tilde{\Xi }}_4(\{a_{m,i,j}\})=1$, then $\textrm{Sp}(T(\{a_{m,i,j}\}_{m,i,j}))$ is countable, and hence, $\Xi _H(T(\{a_{m,i,j}\}_{m,i,j}))=0$. On the other hand, if ${\tilde{\Xi }}_4(\{a_{m,i,j}\})=0$, then there exists m with $\textrm{Sp}(A_m)=[-1,1]$ and hence $I_m\subset \textrm{Sp}(T(\{a_{m,i,j}\}_{m,i,j}))$ so that $\Xi _H(T(\{a_{m,i,j}\}_{m,i,j}))=1$. It follows that ${{\tilde{\Gamma }}}_{n_3,n_2,n_1}(\{a_{m,i,j}\}_{m,i,j})=1-\Gamma _{n_3,n_2,n_1}(T(\{a_{m,i,j}\}_{m,i,j}))$ provides a height three tower for $\{{\tilde{\Xi }}_4,{\tilde{\Omega }}\}$, a contradiction.

Step 3 $\{\Xi _H,\Omega _f\cap \Omega _\textrm{SA},\Lambda _1\}\in \Sigma _3^A$. To construct a height three tower for $A\in \Omega _f\cap \Omega _\textrm{SA}$, if $n_2<n_3$ set $\Gamma _{n_3,n_2,n_1}(A)=0$. Otherwise, consider the set

$$\begin{aligned} {\mathcal {A}}_{n_3,n_2,n_1}(A)=\left\{ \{U_i\}_{i\in I}:I\text { is finite },S_{n_1,n_2}(A)\subset \cup _{i\in I}U_i,U_i\in \cup _{n_3\le l\le n_2}\rho _l\right\} \end{aligned}$$

where $S_{n_1,n_2}(A)$ is the union of all $S\in \rho _{n_2}$ with $S\subset [-n_1,n_1]$ and such that the algorithm discussed in Lemma 3.21 outputs “Yes” for the interior of S and input parameter $n_1$. We then define

$$\begin{aligned} h_{n_3,n_2,n_1}(A,d)=\inf \left\{ \sum _i\textrm{diam}(U_i)^d:\{U_i\}\in {\mathcal {A}}_{n_3,n_2,n_1}(A)\right\} . \end{aligned}$$
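The quantity $h_{n_3,n_2,n_1}(A,d)$ is a minimum over finitely many coverings, and for dyadic intervals it can be evaluated by a bottom-up pass over the dyadic tree. The following Python sketch illustrates this under stated assumptions: $\rho _l$ is taken to be the collection of closed dyadic intervals $[m/2^l,(m+1)/2^l]$, and the set $S_{n_1,n_2}(A)$ is supplied as a set of integer indices $m$ of level-$n_2$ cells (the calls to the algorithm of Lemma 3.21 are not modelled). The function name and the encoding are hypothetical, and this is not necessarily the implementation discussed in § B.4.

```python
def min_dyadic_cover_cost(flagged, n3, n2, d):
    """Minimal sum of diam(U)^d over covers of the flagged level-n2 cells
    by dyadic intervals of levels l with n3 <= l <= n2.

    flagged: iterable of integers m; each stands for [m/2^n2, (m+1)/2^n2].
    Assumption: rho_l consists of the closed dyadic intervals of length 2^-l.
    """
    if n2 < n3:
        raise ValueError("need n2 >= n3")
    # At the finest level each flagged cell must be paid for by itself.
    costs = {m: 2.0 ** (-n2 * d) for m in set(flagged)}
    # Move up one level at a time: a parent either covers its whole subtree
    # at price diam^d, or we keep the best covers of its two children.
    for level in range(n2 - 1, n3 - 1, -1):
        child_sums = {}
        for m, c in costs.items():
            parent = m >> 1  # floor(m / 2), also correct for negative m
            child_sums[parent] = child_sums.get(parent, 0.0) + c
        diam_pow = 2.0 ** (-level * d)
        costs = {m: min(diam_pow, c) for m, c in child_sums.items()}
    # Intervals coarser than level n3 are not allowed, so level-n3 costs add up.
    return sum(costs.values(), 0.0)
```

The recursion is exact for this choice of $\rho _l$, because an interior point of a level-$n_2$ cell lies only in that cell and its dyadic ancestors. An empty input returns 0, matching the convention for the infimum over an empty $S_{n_1,n_2}(A)$.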

If $S_{n_1,n_2}(A)$ is empty, then we interpret the infimum as 0. There are only finitely many sets to check and hence the infimum is a minimisation problem over finitely many coverings (see § B.4 for a discussion of efficient implementation). It follows that $h_{n_3,n_2,n_1}(A,d)$ defines a general algorithm computable in finitely many arithmetic operations and comparisons. Furthermore, it is easy to see that

$$\begin{aligned} \lim _{n_1\rightarrow \infty }h_{n_3,n_2,n_1}(A,d)=\inf \left\{ \sum _i\textrm{diam}(U_i)^d:\{U_i\}\in {\mathcal {C}}_{n_3,n_2}(A)\right\} =:h_{n_3,n_2}(A,d) \end{aligned}$$

from below (since we are covering larger sets as $n_1$ increases). Here,

$$\begin{aligned} {\mathcal {C}}_{n_3,n_2}(A)=\left\{ \{U_i\}_{i\in I}:I\text { is finite },\textrm{Sp}(A)\cap {\mathbb {D}}_{n_2}^c\subset \cup _{i\in I}U_i,U_i\in \cup _{n_3\le l\le n_2}\rho _l\right\} \end{aligned}$$

and ${\mathbb {D}}_k:=1/{2^k}\cdot {\mathbb {Z}}$ denotes the dyadic rationals of resolution k. We now use the property that ${\mathcal {A}}_k(F)$ consists of collections of finite coverings. As $n_2\rightarrow \infty $, $h_{n_3,n_2}(A,d)$ is non-increasing (since we take the infimum over a larger class of coverings and the sets $\textrm{Sp}(A)\cap {\mathbb {D}}_{n_2}^c$ decrease) and hence converges to some number. Clearly

$$\begin{aligned} \lim _{n_2\rightarrow \infty }h_{n_3,n_2}(A,d)=:h_{n_3}(A,d)\ge \tilde{{\mathcal {H}}}^d_{n_3}(\textrm{Sp}(A)\cap {\mathbb {D}}^c). \end{aligned}$$

For $\epsilon >0$, let $l\in {\mathbb {N}}$ and $\{U_i\}\in {\mathcal {A}}_{n_3}(\textrm{Sp}(A)\cap {\mathbb {D}}_l^c)$ with

$$\begin{aligned} \sum _{i}\textrm{diam}(U_i)^d\le \epsilon +\tilde{{\mathcal {H}}}^d_{n_3}(\textrm{Sp}(A)\cap {\mathbb {D}}_l^c). \end{aligned}$$

For large enough $n_2$, $\{U_i\}\in {\mathcal {C}}_{n_3,n_2}(A)$ and hence since $\epsilon >0$ was arbitrary,

$$\begin{aligned} h_{n_3}(A,d)\le \tilde{{\mathcal {H}}}^d_{n_3}(\textrm{Sp}(A)\cap {\mathbb {D}}_l^c) \end{aligned}$$

for all l. For a fixed A and d, $h_{n_3}(A,d)$ is non-decreasing in $n_3$ and hence converges to a function of d, h(A, d) (possibly taking infinite values). Furthermore,

$$\begin{aligned} \tilde{{\mathcal {H}}}^{d}(\textrm{Sp}(A)\cap {\mathbb {D}}^c)\le h(A,d) \le \tilde{{\mathcal {H}}}^{d}(\textrm{Sp}(A)\cap {\mathbb {D}}_l^c). \end{aligned}$$

Since the set $\textrm{Sp}(A)\cap {\mathbb {D}}$ is countable, its Hausdorff dimension is zero. Using sub-additivity of Hausdorff dimension and Theorem 9.1,

$$\begin{aligned} \textrm{dim}_H(\textrm{Sp}(A))&\le \textrm{dim}_H(\textrm{Sp}(A)\cap {\mathbb {D}}^c)\\&\le \textrm{dim}_H(\overline{\textrm{Sp}(A)\cap {\mathbb {D}}^c})=\textrm{dim}_{H'}(\textrm{Sp}(A)\cap {\mathbb {D}}^c)\\&\le \textrm{dim}_H(\overline{\textrm{Sp}(A)\cap {\mathbb {D}}^c_l})=\textrm{dim}_{H'}(\textrm{Sp}(A)\cap {\mathbb {D}}^c_l)\\&\le \textrm{dim}_H(\textrm{Sp}(A)). \end{aligned}$$

It follows that $h(A,d)=0$ if $d>\textrm{dim}_H(\textrm{Sp}(A))$ and that $h(A,d)=\infty $ if $d<\textrm{dim}_H(\textrm{Sp}(A))$. Define

$$\begin{aligned} \Gamma _{n_3,n_2,n_1}(A)=\sup _{j=1,\ldots ,2^{n_3}}\left\{ \frac{j}{2^{n_3}}:h_{n_3,n_2,n_1}(A,k/{2^{n_3}})+\frac{1}{n_2}>\frac{1}{2}\text { for }k=1,\ldots ,j\right\} , \end{aligned}$$

where in this case we define the supremum over the empty set to be 0.
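The definition of $\Gamma _{n_3,n_2,n_1}(A)$ amounts to a scan over the dyadic grid of candidate dimensions. The following Python sketch assumes that the map $d\mapsto h_{n_3,n_2,n_1}(A,d)$ is available as a callable `h`; the function name `gamma_estimate` is hypothetical and used only for illustration.

```python
def gamma_estimate(h, n3, n2):
    """Largest j/2^n3 such that h(k/2^n3) + 1/n2 > 1/2 for all k = 1, ..., j.

    h:  callable d -> h_{n3,n2,n1}(A, d), assumed to be supplied by the caller.
    Returns 0.0 when the condition already fails at k = 1 (empty set).
    """
    grid = 2 ** n3
    j = 0
    for k in range(1, grid + 1):
        if h(k / grid) + 1.0 / n2 > 0.5:
            j = k
        else:
            break  # the condition must hold for every k up to j
    return j / grid
```

The early exit reflects the requirement that the inequality holds for every $k\le j$, not just for $k=j$.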

Consider $n_2\ge n_3$. Since $h_{n_3,n_2,n_1}(A,d)\uparrow h_{n_3,n_2}(A,d)$, it is clear that

$$\begin{aligned} \lim _{n_1\rightarrow \infty }\Gamma _{n_3,n_2,n_1}(A)= & {} \sup _{j=1,\ldots ,2^{n_3}}\left\{ \frac{j}{2^{n_3}}:h_{n_3,n_2}(A,k/2^{n_3})+\frac{1}{n_2}>\frac{1}{2}\text { for }k=1,\ldots ,j\right\} \\=: & {} \Gamma _{n_3,n_2}(A). \end{aligned}$$

If $h_{n_3}(A,d)\ge 1/2$, then $h_{n_3,n_2}(A,d)+1/{n_2}>1/2$ for all $n_2$; otherwise, $h_{n_3,n_2}(A,d)+1/{n_2}<1/2$ eventually. Hence,

$$\begin{aligned} \lim _{n_2\rightarrow \infty }\Gamma _{n_3,n_2}(A)=\sup _{j=1,\ldots ,2^{n_3}}\left\{ \frac{j}{2^{n_3}}:h_{n_3}(A,k/{2^{n_3}})\ge \frac{1}{2}\text { for }k=1,\ldots ,j\right\} =:\Gamma _{n_3}(A). \end{aligned}$$

Using the monotonicity of $h_{n_3}(A,d)$ in d and the proven properties of the limit function h, it follows that

$$\begin{aligned} \lim _{n_3\rightarrow \infty }\Gamma _{n_3}(A)=\textrm{dim}_H(\textrm{Sp}(A)). \end{aligned}$$

The fact that $h_{n_3}$ is non-decreasing in $n_3$, the fact that the set $\{1/2^{n_3},2/2^{n_3},\ldots ,1\}$ refines itself, and the stated monotonicity collectively show that convergence is monotonic from below, and hence, we get the $\Sigma _3^A$ classification.

Step 4 $\{\Xi _H,\Omega _\textrm{SA},\Lambda _1\}\in \Sigma _4^A$ and $\{\Xi _H,\Omega _\textrm{SA},\Lambda _2\}\in \Sigma _3^A$. The first of these can be proven as in step 3 by replacing $(n_1,n_2,n_3)$ by $(n_2,n_3,n_4)$ and the set $S_{n_1,n_2}(A)$ by the set $S_{n_3,n_2,n_1}(A)$ given by the union of all $S\in \rho _{n_3}$ with $S\subset [-n_2,n_2]$ and such that the $\Sigma _2^A$ tower of algorithms discussed in Lemma 3.21 outputs “Yes” for the interior of S and input parameters $(n_2,n_1)$. To prove $\{\Xi _H,\Omega _\textrm{SA},\Lambda _2\}\in \Sigma _3^A$, we use exactly the same construction as in step 3, now using the $\Sigma _1^A$ algorithm (which uses $\Lambda _2$) given by Lemma 3.21. $\square $

Notes

Many of our algorithms can also be extended to unbounded operators.
For precise notions of algorithm, see Sect. 5.
For related work on practical infinite-dimensional numerical linear algebra, see [63, 85, 93, 126,127,128,129,130, 166], and for rigorous data-driven algorithms for spectral properties of Koopman operators (operators on infinite-dimensional spaces that globally linearise nonlinear dynamical systems), see [56, 58, 65].
For algorithms that compute spectral measures and decompositions, see [53, 61, 63] and their recent physical applications in [62, 97].
This technique is not restricted to spectral problems—it can be adapted to other scenarios.
Sometimes the sequence $\{c_n\}$ is not needed and we will explicitly mention when this is the case.
We fix the polynomial p for the strongest possible negative results. However, the existence of the towers of algorithms also holds when considering the polynomial p itself as an input.
This is an idealisation since the capacity studies operator norms while true Krylov processes look at p(A)x with one or several vectors x. However, from local spectral theory (e.g. [118]) it follows that, generically, the asymptotic speeds are the same.
Arveson [3,4,5,6,7] and Brown [42,43,44] pioneered spectral computations from the point of view of $C^*$-algebras, both for the general spectral computation problem and for Schrödinger operators. This combination can be traced back to the work of Böttcher and Silbermann [39]. Arveson also considered spectral computation in terms of densities, which is related to Szegö’s work [155] on finite section approximations.
Even when the finite section method converges, it typically only yields $\Delta _2^A$ classifications in the SCI hierarchy [37, 38, 45, 46].
If A is hyponormal, then $W_e(A)$ is the convex hull of the essential spectrum [142].
In the non-normal case, it is possible for finite sections to not capture all of the spectrum—parts of the spectrum may be unattainable. This is distinct from spectral pollution. Theorem 3.8 says that, up to a different choice of projections, this can be avoided on $W_e(A)$.
We consider the computation of $\textrm{Leb}(\widehat{\textrm{Sp}}_{\epsilon }(A))$ instead of $\textrm{Leb}({\textrm{Sp}}_{\epsilon }(A))$ since it is not clear that the level sets
$$\begin{aligned} S_{\epsilon }(A):=\{z\in {\mathbb {C}}:\left\| R(z,A)\right\| ^{-1}=\epsilon \} \end{aligned} \tag{3.2}$$
always have Lebesgue measure zero (this is currently an open problem for general bounded operators). This situation is analogous to the case of approximating the pseudospectra of bounded operators, where one uses the crucial property that pseudospectra cannot jump—the resolvent norm cannot be constant on open subsets of ${\mathbb {C}}\backslash \textrm{Sp}(A)$ for a bounded operator A acting on a separable Hilbert space [143]. The question of whether the sets in (3.2) are Lebesgue null is the measure theoretic equivalent. Note, however, that it is straightforward to show that $S_{\epsilon }(A)$ is null for $A\in \Omega _{\textrm{N}}$ through the formula $\Vert R(z,A)\Vert ^{-1}=\textrm{dist}(z,\textrm{Sp}(A))$.
The proofs for general self-adjoint operators can be adapted with an additional successive limit and the use of two-dimensional covering boxes to treat the class of general bounded operators. Some care is needed to deal with the boundaries of covering boxes for the Hausdorff dimension, but we omit the details.
That is, the set of covers $\{U_i\}_{i\in I}$ with I at most countable and with $\textrm{diam}(U_i)\le \delta $.
For this example and other operators on $l^2({\mathbb {Z}})$ below, we reorder the basis so that the operator A acts on $l^2({\mathbb {N}})$.
There do exist examples of operators, typically with a lot of structure, where one can use periodic versions of finite section.
We set $D(x,0)=\emptyset $.

References

Aiena, P.: Fredholm and local spectral theory, with applications to multipliers. Springer Science & Business Media (2007)
Antun, V., Colbrook, M.J., Hansen, A.C.: Proving existence is not enough: Mathematical paradoxes unravel the limits of neural networks in artificial intelligence. SIAM News 55(4), 1–4 (2022)
Arveson, W.: Discretized CCR algebras. Journal of Operator Theory 26(2), 225–239 (1991)
Arveson, W.: Improper filtrations for $C^*$-algebras: spectra of unilateral tridiagonal operators. Acta Sci. Math. (Szeged) 57(1-4), 11–24 (1993)
Arveson, W.: Noncommutative spheres and numerical quantum mechanics. In: Operator algebras, mathematical physics, and low-dimensional topology, Res. Notes Math., vol. 5, pp. 1–10. A K Peters, Wellesley, MA (1993)
Arveson, W.: $C^*$-algebras and numerical linear algebra. Journal of Functional Analysis 122(2), 333–360 (1994)
Arveson, W.: The role of $C^\ast $-algebras in infinite-dimensional numerical linear algebra. In: $C^\ast $-algebras: 1943–1993 (San Antonio, TX, 1993), Contemp. Math., vol. 167, pp. 114–129. Amer. Math. Soc., Providence, RI (1994)
Aubry, S., André, G.: Analyticity breaking and Anderson localization in incommensurate lattices. Ann. Israel Phys. Soc 3(133), 18 (1980)
Avila, A., Jitomirskaya, S.: The Ten Martini Problem. Annals of Mathematics (2) 170(1), 303–342 (2009)
Avila, A., Jitomirskaya, S., Marx, C.: Spectral theory of extended Harper’s model and a question by Erdős and Szekeres. Inventiones mathematicae 210(1), 283–339 (2017)
Avila, A., Krikorian, R.: Reducibility or nonuniform hyperbolicity for quasiperiodic Schrödinger cocycles. Annals of Mathematics (2) 164(3), 911–940 (2006)
Avila, A., Viana, M.: Simplicity of Lyapunov spectra: proof of the Zorich–Kontsevich conjecture. Acta Mathematica 198(1), 1–56 (2007)
Azbel, M.Y.: Energy spectrum of a conduction electron in a magnetic field. Sov. Phys. JETP 19(3), 634–645 (1964)
Bandres, M.A., Rechtsman, M.C., Segev, M.: Topological photonic quasicrystals: Fractal topological spectrum and protected transport. Physical Review X 6(1), 011016 (2016)
Bastounis, A., Hansen, A.C., Vlačić, V.: The extended Smale’s 9th problem—On computational barriers and paradoxes in estimation, regularisation, computer-assisted proofs and learning. arXiv:2110.15734 (2021)
Becker, S., Hansen, A.: Computing solutions of Schrödinger equations on unbounded domains—on the brink of numerical algorithms. arXiv preprint arXiv:2010.16347 (2020)
Beckus, S., Pogorzelski, F.: Spectrum of Lebesgue measure zero for Jacobi matrices of quasicrystals. Mathematical Physics, Analysis and Geometry 16(3), 289–308 (2013)
Ben-Artzi, J., Colbrook, M.J., Hansen, A.C., Nevanlinna, O., Seidel, M.: Computing Spectra—On the Solvability Complexity Index hierarchy and towers of algorithms. arXiv:1508.03280v5 (2020)
Ben-Artzi, J., Hansen, A.C., Nevanlinna, O., Seidel, M.: New barriers in complexity theory: On the solvability complexity index and the towers of algorithms. Comptes Rendus Mathematique 353(10), 931 – 936 (2015)
Ben-Artzi, J., Marletta, M., Rösler, F.: Computing the sound of the sea in a seashell. Foundations of Computational Mathematics pp. 1–35 (2021)
Ben-Artzi, J., Marletta, M., Rösler, F.: Computing scattering resonances. J. Eur. Math. Soc. (to appear)
Ben-Artzi, J., Marletta, M., Rösler, F.: Universal algorithms for computing spectra of periodic operators. Numerische Mathematik (to appear)
Ben-Artzi, J., Rösler, F., Marletta, M.: Universal algorithms for solving inverse spectral problems. arXiv preprint arXiv:2203.13078 (2022)
Benza, V.G., Sire, C.: Band spectrum of the octagonal quasicrystal: Finite measure, gaps, and chaos. Physical Review B 44(18), 10343 (1991)
Berry, M.: Fractal modes of unstable lasers with polygonal and circular mirrors. Optics communications 200(1-6), 321–330 (2001)
Berry, M.: Physics of nonhermitian degeneracies. Czechoslovak journal of physics 54(10), 1039–1047 (2004)
Berry, M., Storm, C., Van Saarloos, W.: Theory of unstable laser modes: edge waves and fractality. Optics communications 197(4-6), 393–402 (2001)
Blum, L., Cucker, F., Shub, M., Smale, S.: Complexity and Real Computation. Springer-Verlag New York, Inc., Secaucus, NJ, USA (1998)
Blum, L., Shub, M., Smale, S.: On a theory of computation and complexity over the real numbers: NP-completeness, recursive functions and universal machines. American Mathematical Society. Bulletin. 21(1), 1–46 (1989)
Boche, H., Pohl, V.: The solvability complexity index of sampling-based Hilbert transform approximations. In: 2019 13th International conference on Sampling Theory and Applications (SampTA), pp. 1–4. IEEE (2019)
Boffi, D., Brezzi, F., Gastaldi, L.: On the problem of spurious eigenvalues in the approximation of linear elliptic problems in mixed form. Mathematics of Computation 69(229), 121–140 (2000)
Boffi, D., Duran, R.G., Gastaldi, L.: A remark on spurious eigenvalues in a square. Appl. Math. Lett. 12(3), 107–114 (1999)
Bögli, S., Brown, B.M., Marletta, M., Tretter, C., Wagenhofer, M.: Guaranteed resonance enclosures and exclosures for atoms and molecules. Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences 470(2171), 20140488 (2014)
Bögli, S., Marletta, M., Tretter, C.: The essential numerical range for unbounded linear operators. Journal of Functional Analysis p. 108509 (2020)
Böttcher, A.: Pseudospectra and singular values of large convolution operators. Journal of Integral Equations and Applications 6(3), 267–301 (1994)
Böttcher, A.: Infinite matrices and projection methods. In: Lectures on operator theory and its applications (Waterloo, ON, 1994), Fields Inst. Monogr., vol. 3, pp. 1–72. Amer. Math. Soc., Providence, RI (1996)
Böttcher, A., Brunner, H., Iserles, A., Nørsett, S.P.: On the singular values and eigenvalues of the Fox-Li and related operators. New York J. Math. 16, 539–561 (2010)
Böttcher, A., Grudsky, S., Iserles, A.: Spectral theory of large Wiener–Hopf operators with complex-symmetric kernels and rational symbols. Mathematical Proceedings of the Cambridge Philosophical Society 151(1), 161–191 (2011)
Böttcher, A., Silbermann, B.: The finite section method for Toeplitz operators on the quarter-plane with piecewise continuous symbols. Mathematische Nachrichten 110, 279–291 (1983)
Böttcher, A., Silbermann, B.: Introduction to large truncated Toeplitz matrices. Universitext. Springer-Verlag, New York (1999)
Böttcher, A., Silbermann, B.: Analysis of Toeplitz operators, second edn. Springer Monographs in Mathematics. Springer-Verlag, Berlin (2006)
Brown, N.: Invariant means and finite representation theory of C*-algebras. Memoirs of the American Mathematical Society 184 (2003). https://doi.org/10.1090/memo/0865
Brown, N.P.: AF embeddings and the numerical computation of spectra in irrational rotation algebras. Numer. Funct. Anal. Optim. 27(5-6), 517–528 (2006)
Brown, N.P.: Quasi-diagonality and the finite section method. Mathematics of Computation 76(257), 339–360 (2007)
Brunner, H., Iserles, A., Nørsett, S.P.: The spectral problem for a class of highly oscillatory Fredholm integral operators. IMA Journal of Numerical Analysis 30(1), 108–130 (2008)
Brunner, H., Iserles, A., Nørsett, S.P.: The computation of the spectra of highly oscillatory Fredholm integral operators. J. Integral Equations Applications 23(4), 467–519 (2011). https://doi.org/10.1216/JIE-2011-23-4-467
Buffa, A., Perugia, I.: Discontinuous Galerkin approximation of the Maxwell eigenproblem. SIAM Journal on Numerical Analysis 44(5), 2198–2226 (2006)
Burke, J.V., Greenbaum, A.: Characterizations of the polynomial numerical hull of degree k. Linear algebra and its applications 419(1), 37–47 (2006)
Carmona, R., Lacroix, J.: Spectral theory of random Schrödinger operators. Probability and its Applications. Birkhäuser Boston, Inc., Boston, MA (1990)
Christiansen, S.H., Winther, R.: On variational eigenvalue approximation of semidefinite operators. IMA J. Numer. Anal. 33(1), 164–189 (2013)
Colbrook, M.J.: The foundations of infinite-dimensional spectral computations. Ph.D. thesis, University of Cambridge (2020)
Colbrook, M.J.: Pseudoergodic operators and periodic boundary conditions. Mathematics of Computation 89(322), 737–766 (2020)
Colbrook, M.J.: Computing spectral measures and spectral types. Communications in Mathematical Physics 384(1), 433–501 (2021)
Colbrook, M.J.: Unscrambling the infinite: Can we compute spectra? IMA Mathematics Today (2021). https://ima.org.uk/16912/unscrambling-the-infinite-can-we-compute-spectra/
Colbrook, M.J.: Computing semigroups with error control. SIAM Journal on Numerical Analysis 60(1), 396–422 (2022)
Colbrook, M.J.: The mpEDMD Algorithm for Data-Driven Computations of Measure-Preserving Dynamical Systems. arXiv preprint arXiv:2104.09444 (2022)
Colbrook, M.J., Antun, V., Hansen, A.C.: The difficulty of computing stable and accurate neural networks: On the barriers of deep learning and Smale’s 18th problem. Proceedings of the National Academy of Sciences 119(12), e2107151119 (2022)
Colbrook, M.J., Ayton, L., Szoke, M.: Residual Dynamic Mode Decomposition: Robust and verified Koopmanism. arXiv preprint (2022)
Colbrook, M.J., Hansen, A.C.: On the infinite-dimensional QR algorithm. Numerische Mathematik 143(1), 17–83 (2019)
Colbrook, M.J., Hansen, A.C.: The foundations of spectral computations via the solvability complexity index hierarchy. Journal of the European Mathematical Society (to appear)
Colbrook, M.J., Horning, A.: Specsolve: Spectral methods for spectral measures. arXiv preprint arXiv:2201.01314 (2022)
Colbrook, M.J., Horning, A., Thicke, K., Watson, A.B.: Computing spectral properties of topological insulators without artificial truncation or supercell approximation. arXiv preprint arXiv:2112.03942 (2021)
Colbrook, M.J., Horning, A., Townsend, A.: Computing spectral measures of self-adjoint operators. SIAM Review 63(3), 489–524 (2021)
Colbrook, M.J., Roman, B., Hansen, A.C.: How to compute spectra with error control. Physical Review Letters 122(25), 250201 (2019)
Colbrook, M.J., Townsend, A.: Rigorous data-driven computation of spectral properties of Koopman operators for dynamical systems. arXiv preprint arXiv:2111.14889 (2021)
Cucker, F.: The arithmetical hierarchy over the reals. J. Logic Comput. 2(3), 375–395 (1992)
Damanik, D., Embree, M., Gorodetski, A.: Spectral properties of Schrödinger operators arising in the study of quasicrystals. In: Mathematics of aperiodic order, pp. 307–370. Springer (2015)
Dean, C.R., Wang, L., Maher, P., Forsythe, C., Ghahari, F., Gao, Y., Katoch, J., Ishigami, M., Moon, P., Koshino, M., et al.: Hofstadter’s butterfly and the fractal quantum Hall effect in Moiré superlattices. Nature 497(7451), 598–602 (2013)
Della Villa, A., Enoch, S., Tayeb, G., Pierro, V., Galdi, V., Capolino, F.: Band gap formation and multiple scattering in photonic quasicrystals with a Penrose-type lattice. Physical Review Letters 94(18), 183903 (2005)
Doyle, P., McMullen, C.: Solving the quintic by iteration. Acta Mathematica 163(3-4), 151–180 (1989)
Falconer, K.: Fractal geometry, second edn. John Wiley & Sons, Inc., Hoboken, NJ (2003)
Fefferman, C., Seco, L.: On the energy of a large atom. Bull. Amer. Math. Soc. (N.S.) 23(2), 525–530 (1990)
Fefferman, C., Seco, L.: Eigenvalues and eigenfunctions of ordinary differential operators. Adv. Math. 95(2), 145–305 (1992)
Fefferman, C., Seco, L.: Aperiodicity of the Hamiltonian flow in the Thomas-Fermi potential. Rev. Mat. Iberoamericana 9(3), 409–551 (1993)
Fefferman, C., Seco, L.: The density in a one-dimensional potential. Adv. Math. 107(2), 187–364 (1994)
Fefferman, C., Seco, L.: The eigenvalue sum for a one-dimensional potential. Adv. Math. 108(2), 263–335 (1994)
Fefferman, C., Seco, L.: On the Dirac and Schwinger corrections to the ground-state energy of an atom. Adv. Math. 107(1), 1–185 (1994)
Fefferman, C., Seco, L.: The density in a three-dimensional radial potential. Adv. Math. 111(1), 88–161 (1995)
Fefferman, C., Seco, L.: The eigenvalue sum for a three-dimensional radial potential. Adv. Math. 119(1), 26–116 (1996)
Fefferman, C., Seco, L.: Interval arithmetic in quantum mechanics. In: Applications of interval computations (El Paso, TX, 1995), Appl. Optim., vol. 3, pp. 145–167. Kluwer Acad. Publ., Dordrecht (1996)
Fernández-Martínez, M., Sánchez-Granero, M.A.: Fractal dimension for fractal structures: a Hausdorff approach revisited. Journal of Mathematical Analysis and Applications 409(1), 321–330 (2014)
Fillmore, P.A., Stampfli, J.G., Williams, J.P.: On the essential numerical range, the essential spectrum, and a problem of Halmos. Acta Sci. Math.(Szeged) 33(197), 179–192 (1972)
Geim, A.K., Grigorieva, I.V.: Van der Waals heterostructures. Nature 499(7459), 419–425 (2013)
Gil, M.I.: Operator functions and localization of spectra. Springer (2003)
Gilles, M.A., Townsend, A.: Continuous analogues of Krylov subspace methods for differential operators. SIAM Journal on Numerical Analysis 57(2), 899–924 (2019)
Gowers, W.: Rough structure and classification. Geom. Funct. Anal. pp. 79–117 (2000)
Hales, T., Adams, M., Bauer, G., Dang, T.D., Harrison, J., Hoang, L.T., Kaliszyk, C., Magron, V., McLaughlin, S., Nguyen, T.T., Nguyen, Q.T., Nipkow, T., Obua, S., Pleso, J., Rute, J., Solovyev, A., Ta, T.H.A., Tran, N.T., Trieu, T.D., Urban, J., Vu, K., Zumkeller, R.: A formal proof of the Kepler conjecture. Forum Math. Pi 5, e2, 29 (2017)
Hales, T.C.: A proof of the Kepler conjecture. Annals of Mathematics (2) 162(3), 1065–1185 (2005)
Halmos, P.R.: Capacity in Banach algebras. Indiana Univ. Math. J. 20, 855–863 (1970/1971)
Han, J., Thouless, D., Hiramoto, H., Kohmoto, M.: Critical and bicritical properties of Harper’s equation with next-nearest-neighbor coupling. Physical Review B 50(16), 11365 (1994)
Hansen, A.C.: On the solvability complexity index, the $n$-pseudospectrum and approximations of spectra of operators. Journal of the American Mathematical Society 24(1), 81–124 (2011)
Hofstadter, D.R.: Energy levels and wave functions of Bloch electrons in rational and irrational magnetic fields. Physical Review B 14(6), 2239 (1976)
Horning, A., Townsend, A.: Feast for differential eigenvalue problems. SIAM Journal on Numerical Analysis 58(2), 1239–1262 (2020)
Hunt, B., Sanchez-Yamagishi, J., Young, A., Yankowitz, M., LeRoy, B.J., Watanabe, K., Taniguchi, T., Moon, P., Koshino, M., Jarillo-Herrero, P., et al.: Massive Dirac fermions and Hofstadter butterfly in a van der Waals heterostructure. Science 340(6139), 1427–1430 (2013)
Jitomirskaya, S.: Critical phenomena, arithmetic phase transitions, and universality: some recent results on the almost Mathieu operator.
Johnson, C.R.: Numerical determination of the field of values of a general complex matrix. SIAM Journal on Numerical Analysis 15(3), 595–602 (1978)
Johnstone, D., Colbrook, M.J., Nielsen, A.E., Öhberg, P., Duncan, C.W.: Bulk localised transport states in infinite and finite quasicrystals via magnetic aperiodicity. Physical Review B 106(4), 045149 (2022)
Kechris, A.S., Louveau, A.: Descriptive set theory and the structure of sets of uniqueness, London Mathematical Society Lecture Note Series, vol. 128. Cambridge University Press, Cambridge (1987)
Ketzmerick, R., Kruse, K., Kraut, S., Geisel, T.: What determines the spreading of a wave packet? Physical Review Letters 79(11), 1959 (1997)
Ketzmerick, R., Petschel, G., Geisel, T.: Slow decay of temporal correlations in quantum systems with Cantor spectra. Physical Review Letters 69(5), 695 (1992)
Killip, R., Kiselev, A., Last, Y.: Dynamical upper bounds on wavepacket spreading. American journal of mathematics 125(5), 1165–1198 (2003)
Klaus, M.: On the point spectrum of Dirac operators. Helv. Phys. Acta 53(3), 453–462 (1981) (1980)
Kohmoto, M., Sutherland, B., Tang, C.: Critical wave functions and a Cantor-set spectrum of a one-dimensional quasicrystal model. Physical Review B 35(3), 1020 (1987)
Last, Y.: Spectral theory of Sturm–Liouville operators on infinite intervals: a review of recent developments. In: Sturm-Liouville Theory, pp. 99–120. Springer (2005)
Laursen, K.B., Laursen, K.B.L., Neumann, M.: An introduction to local spectral theory. 20. Oxford University Press (2000)
Levi, L., Rechtsman, M., Freedman, B., Schwartz, T., Manela, O., Segev, M.: Disorder-enhanced transport in photonic quasicrystals. Science 332(6037), 1541–1544 (2011)
Lewin, M., Séré, E.: Spectral pollution and how to avoid it (with applications to Dirac and periodic Schrödinger operators). Proc. Lond. Math. Soc. (3) 100(3), 864–900 (2010)
Lewin, M., Séré, É.: Spurious modes in Dirac calculations and how to avoid them. In: Many-Electron Approaches in Physics, Chemistry and Mathematics, pp. 31–52. Springer (2014)
Liesen, J., Sète, O., Nasser, M.M.S.: Fast and accurate computation of the logarithmic capacity of compact sets. Computational Methods and Function Theory 17(4), 689–713 (2017)
Luitz, D.J., Lev, Y.B.: The ergodic side of the many-body localization transition. Annalen der Physik 529(7), 1600350 (2017)
Malcolm Brown, B., Langer, M., Marletta, M., Tretter, C., Wagenhofer, M.: Eigenvalue enclosures and exclosures for non-self-adjoint problems in hydrodynamics. LMS Journal of Computation and Mathematics 13, 65–81 (2010)
Marletta, M.: Neumann-Dirichlet maps and analysis of spectral pollution for non-self-adjoint elliptic PDEs with real essential spectrum. IMA J. Numer. Anal. 30(4), 917–939 (2010)
Marletta, M., Scheichl, R.: Eigenvalues in spectral gaps of differential operators. Journal of Spectral Theory 2(3), 293–320 (2012)
Mattila, P.: Geometry of sets and measures in Euclidean spaces, Cambridge Studies in Advanced Mathematics, vol. 44. Cambridge University Press, Cambridge (1995)
McMullen, C.: Families of rational maps and iterative root-finding algorithms. Annals of Mathematics (2) 125(3), 467–493 (1987)
McMullen, C.: Braiding of the attractor and the failure of iterative algorithms. Invent. Math. 91(2), 259–272 (1988)
Miekkala, U., Nevanlinna, O.: Iterative solution of systems of linear differential equations. Acta Numerica 5(1), 259–307 (1996)
Müller, V.: Local behaviour of the polynomial calculus of operators. J. Reine Angew. Math. 430, 61–68 (1992)
Müller, V.: Spectral theory of linear operators: and spectral systems in Banach algebras, vol. 139. Springer Science & Business Media (2007)
Naumis, G.G., Barraza-Lopez, S., Oliva-Leyva, M., Terrones, H.: Electronic and optical properties of strained graphene and other strained 2D materials: a review. Reports on Progress in Physics 80(9), 096501 (2017)
Nevanlinna, O.: Linear acceleration of Picard–Lindelöf iteration. Numerische Mathematik 57(1), 147–156 (1990)
Nevanlinna, O.: Convergence of iterations for linear equations. Lectures in Mathematics ETH Zürich. Birkhäuser Verlag, Basel (1993)
Nevanlinna, O.: Hessenberg matrices in Krylov subspaces and the computation of the spectrum. Numer. Funct. Anal. Optim. 16(3-4), 443–473 (1995)
Nevanlinna, O.: Computing the spectrum and representing the resolvent. Numerical Functional Analysis and Optimization 30(9-10), 1025–1047 (2009)
New, G., Yates, M., Woerdman, J., McDonald, G.: Diffractive origin of fractal resonator modes. Optics communications 193(1-6), 261–266 (2001)
Olver, S.: GMRES for the differentiation operator. SIAM journal on numerical analysis 47(5), 3359–3373 (2009)
Olver, S.: ApproxFun.jl v0.8. github (online) https://github.com/JuliaApproximation/ApproxFun.jl (2018)
Olver, S., Townsend, A.: A fast and well-conditioned spectral method. SIAM Review 55(3), 462–489 (2013)
Olver, S., Townsend, A.: A Practical Framework for Infinite-dimensional Linear Algebra. In: Proceedings of the 1st First Workshop for High Performance Technical Computing in Dynamic Languages, HPTCDL ’14, pp. 57–62. IEEE Press, Piscataway, NJ, USA (2014)
Olver, S., Webb, M.: SpectralMeasures.jl. github (online) https://github.com/JuliaApproximation/SpectralMeasures.jl (2018)
Orland, G.H.: On a class of operators. Proc. Amer. Math. Soc. 15, 75–79 (1964)
Pokrzywa, A.: Method of orthogonal projections and approximation of the spectrum of a bounded operator. Studia Mathematica 65(1), 21–29 (1979)
Ponomarenko, L., Gorbachev, R., Yu, G., Elias, D., Jalil, R., Patel, A., Mishchenko, A., Mayorov, A., Woods, C., Wallbank, J., et al.: Cloning of Dirac fermions in graphene superlattices. Nature 497(7451), 594–597 (2013)
Puelz, C., Embree, M., Fillman, J.: Spectral Approximation for Quasiperiodic Jacobi Operators. Integral Equations and Operator Theory 82(4), 533–554 (2015)
Puig, J.: Cantor spectrum for the almost Mathieu operator. Communications in Mathematical Physics 244(2), 297–309 (2004)
Putnam, C.R.: Operators satisfying a $G_{1}$ condition. Pacific Journal of Mathematics 84(2), 413–426 (1979)
Rappaz, J., Sanchez Hubert, J., Sanchez Palencia, E., Vassiliev, D.: On spectral pollution in the finite element approximation of thin elastic “membrane” shells. Numerische Mathematik 75(4), 473–500 (1997)
Rivera, J.A., Galvin, T.C., Steinforth, A.W., Eden, J.G.: Fractal modes and multi-beam generation from hybrid microlaser resonators. Nature communications 9(1), 1–8 (2018)
Roman-Taboada, P., Naumis, G.G.: Spectral butterfly, mixed Dirac-Schrödinger fermion behavior, and topological states in armchair uniaxial strained graphene. Physical Review B 90(19), 195435 (2014)
Rösler, F.: On the Solvability Complexity Index for Unbounded Selfadjoint and Schrödinger Operators. Integral Equations and Operator Theory 91(6), 54 (2019)
Rösler, F., Stepanenko, A.: Computing eigenvalues of the Laplacian on rough domains. arXiv preprint arXiv:2104.09444 (2021)
Salinas, N.: Operators with essentially disconnected spectrum. Acta Sci. Math. (Szeged) 33, 193–205 (1972)
Shargorodsky, E.: On the level sets of the resolvent norm of a linear operator. Bull. Lond. Math. Soc. 40(3), 493–504 (2008)
Shargorodsky, E.: On the limit behaviour of second order relative spectra of self-adjoint operators. Journal of Spectral Theory 3(4), 535–552 (2013)
Shechtman, D., Blech, I., Gratias, D., Cahn, J.W.: Metallic phase with long-range orientational order and no translational symmetry. Physical Review Letters 53, 1951–1953 (1984)
Simon, B.: Schrödinger operators in the twenty-first century. Mathematical physics 2000, 283–288 (2000)
Sire, C.: Electronic spectrum of a 2D quasi-crystal related to the octagonal quasi-periodic tiling. EPL (Europhysics Letters) 10(5), 483 (1989)
Smale, S.: The fundamental theorem of algebra and complexity theory. Bull. Amer. Math. Soc. (N.S.) 4(1), 1–36 (1981)
Smale, S.: On the efficiency of algorithms of analysis. Bull. Amer. Math. Soc. (N.S.) 13(2), 87–121 (1985)
Smale, S.: Complexity theory and numerical analysis. In: Acta numerica, 1997, Acta Numer., vol. 6, pp. 523–551. Cambridge Univ. Press, Cambridge (1997)
Stadnik, Z.M.: Physical properties of quasicrystals, vol. 126. Springer Science & Business Media (2012)
Stampfli, J.G., Williams, J.P.: Growth conditions and the numerical range in a Banach algebra. Tohoku Mathematical Journal, Second Series 20(4), 417–424 (1968)
Stewart, D.: Towards numerically estimating Hausdorff dimensions. The ANZIAM Journal 42(04), 451–461 (2001)
Sütő, A.: Singular continuous spectrum on a Cantor set of zero Lebesgue measure for the Fibonacci Hamiltonian. Journal of Statistical Physics 56(3-4), 525–531 (1989)
Szegő, G.: Beiträge zur Theorie der Toeplitzschen Formen. Mathematische Zeitschrift 6(3-4), 167–202 (1920)
Tanese, D., Gurevich, E., Baboux, F., Jacqmin, T., Lemaître, A., Galopin, E., Sagnes, I., Amo, A., Bloch, J., Akkermans, E.: Fractal energy spectrum of a polariton gas in a Fibonacci quasiperiodic potential. Physical Review Letters 112(14), 146404 (2014)
Thouless, D.: Bandwidths for a quasiperiodic tight-binding model. Physical Review B 28(8), 4272 (1983)
Thouless, D.: Scaling for the discrete Mathieu equation. Communications in mathematical physics 127(1), 187–193 (1990)
Thouless, D., Tan, Y.: Total bandwidth for the Harper equation. III. corrections to scaling. Journal of Physics A: Mathematical and General 24(17), 4055 (1991)
Thouless, D.J., Kohmoto, M., Nightingale, M.P., den Nijs, M.: Quantized Hall conductance in a two-dimensional periodic potential. Physical Review Letters 49(6), 405 (1982)
Torres-Herrera, E., Santos, L.F.: Dynamics at the many-body localization transition. Physical Review B 92(1), 014208 (2015)
Trefethen, L.N., Embree, M.: Spectra and pseudospectra. Princeton University Press, Princeton, NJ (2005)
Tucker, W.: Validated numerics: a short introduction to rigorous computations. Princeton University Press (2011)
Turing, A.M.: On Computable Numbers, with an Application to the Entscheidungsproblem. Proc. London Math. Soc. (2) 42(3), 230–265 (1936)
Vardeny, Z.V., Nahata, A., Agrawal, A.: Optics of photonic quasicrystals. Nature Phot. 7(3), 177–187 (2013)
Webb, M., Olver, S.: Spectra of Jacobi operators via connection coefficient matrices. Communications in Mathematical Physics 382(2), 657–707 (2021)
Weinberger, S.: Computers, Rigidity, and Moduli: The Large-Scale Fractal Geometry of Riemannian Moduli Space. Princeton University Press, USA (2004)
Zhao, S.: On the spurious solutions in the high-order finite difference methods for eigenvalue problems. Computer methods in applied mechanics and engineering 196(49-52), 5031–5046 (2007)

Acknowledgements

This work was supported by EPSRC Grant EP/L016516/1. I am grateful to Arno Pauly for discussions regarding Definition 5.15 and its use in Proposition 5.16. Finally, I would like to thank Mohamed Nasser for generously sharing the code from [109] for the computation of the capacity of finite unions of intervals.

Author information

Authors and Affiliations

DAMTP, Centre for Mathematical Sciences, University of Cambridge, Cambridge, CB3 0WA, UK
Matthew J. Colbrook

Corresponding author

Correspondence to Matthew J. Colbrook.

Additional information

Communicated by Felipe Cucker.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix A. Routines for Computing Spectra

We describe the SCI-sharp $\Sigma _1^A$ algorithms of [64] and [60] that are used in some of our proofs. In this section, we consider the problem functions $\Xi _1(A)=\textrm{Sp}(A)$ and $\Xi _2(A)=\textrm{Sp}_{\epsilon }(A)$, taking values in the space of non-empty compact subsets of ${\mathbb {C}}$ equipped with the Hausdorff metric. The definitions of the classes $\Omega _g$ and $\Omega _f$ can be found in Sect. 2. As written, the outputs of the algorithms below may be empty for small n (and hence not lie in the correct metric space). This does not affect the classifications and can be avoided by computing successive $\Gamma _n(A)$ and outputting $\Gamma _{m(n)}(A)$, where $m(n)\ge n$ is minimal with $\Gamma _{m(n)}(A)\ne \emptyset $.
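The replacement of $\Gamma _n(A)$ by $\Gamma _{m(n)}(A)$ can be sketched as follows. This is a minimal illustration only: the names `first_nonempty`, `toy_gamma` and the cap `max_steps` are hypothetical and do not appear in [64] or [60], and the toy routine simply stands in for any algorithm whose output is a finite set that may be empty for small n.

```python
from typing import Callable, Set, Tuple


def first_nonempty(gamma: Callable[[int], Set[complex]], n: int,
                   max_steps: int = 10_000) -> Tuple[int, Set[complex]]:
    """Return (m, gamma(m)) for the minimal m >= n with gamma(m) non-empty.

    max_steps is a practical safeguard added for this sketch; the text
    assumes that a non-empty output is eventually produced.
    """
    for m in range(n, n + max_steps):
        out = gamma(m)
        if out:  # an empty set is falsy in Python
            return m, out
    raise RuntimeError("no non-empty output found within max_steps")


def toy_gamma(m: int) -> Set[complex]:
    """Hypothetical stand-in for Gamma_m(A): empty until m reaches 5."""
    return set() if m < 5 else {complex(1.0 / m, 0.0)}


m, out = first_nonempty(toy_gamma, 2)
print(m, out)
```

Since the search starts at n and moves upwards, the index returned is at least n, so the modified sequence inherits the convergence of the original one.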

The methods in [64] and [60] use the function f to approximate the function

$$\begin{aligned} \gamma _{n}(z;A)=\min \{\sigma _{\textrm{inf}}((A-zI){|_{P_n{\mathcal {H}}}}),\sigma _{\textrm{inf}}((A^*-{\bar{z}}I){|_{P_n{\mathcal {H}}}})\}, \end{aligned} \tag{A.1}$$

where $P_m$ denotes the orthogonal projection onto the linear span of the first m basis vectors and $\sigma _{\textrm{inf}}$ denotes the injection modulus. As $n\rightarrow \infty $, the functions $\gamma _n$ converge uniformly on compact subsets down to the continuous function $\gamma (z;A)=\left\| R(z,A)\right\| ^{-1}$, which we interpret as zero if the resolvent $R(z,A)=(A-zI)^{-1}$ does not exist as a bounded operator. The function f and sequence $\{c_n\}$ allow us to approximate $\gamma _n$ to any given precision. To use this to compute the spectrum, we need some control on how the resolvent norm diverges near the spectrum and this is provided by the function g satisfying (2.2). At various points in this paper, we have also made use of the related functions

$$\begin{aligned} \gamma _{n,m}(z;A)=\min \{\sigma _{\textrm{inf}}(P_m(A-zI){|_{P_n{\mathcal {H}}}}),\sigma _{\textrm{inf}}(P_m(A^*-{\bar{z}}I){|_{P_n{\mathcal {H}}}})\}. \end{aligned} \tag{A.2}$$

These can be computed from the rectangular matrices $P_m(A-zI)P_n,P_m(A-zI)^*P_n$ and converge uniformly on compact subsets of ${\mathbb {C}}$ to $\gamma _n$ as $m\rightarrow \infty $.
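As an illustration of (A.2), the following sketch evaluates $\gamma _{n,m}(z;A)$ in floating-point arithmetic from the two rectangular truncations, using the smallest singular value of an $m\times n$ matrix with $m\ge n$ as the injection modulus. It is not the rigorous procedure of [64] or [60], which controls errors using f and $\{c_n\}$. The names `gamma_nm` and `jacobi_entry` are hypothetical, and the test operator (the free Jacobi operator on $l^2({\mathbb {N}})$, with spectrum $[-2,2]$) is chosen here purely for illustration.

```python
import numpy as np


def gamma_nm(entry, z, n, m):
    """Floating-point evaluation of gamma_{n,m}(z; A) from (A.2).

    entry(i, j) returns the matrix element <A e_j, e_i> (0-based indices).
    Requires m >= n so that the smallest singular value of an m x n matrix
    equals its injection modulus.
    """
    if m < n:
        raise ValueError("need m >= n")
    # m x n truncation P_m A P_n
    A_mn = np.array([[entry(i, j) for j in range(n)] for i in range(m)],
                    dtype=complex)
    # m x n truncation P_m A^* P_n, using (A^*)_{ij} = conj(A_{ji})
    Astar_mn = np.array([[np.conj(entry(j, i)) for j in range(n)]
                         for i in range(m)], dtype=complex)
    shift = np.eye(m, n, dtype=complex)  # matrix of P_m I P_n
    # singular values are returned in descending order, so [-1] is smallest
    s1 = np.linalg.svd(A_mn - z * shift, compute_uv=False)[-1]
    s2 = np.linalg.svd(Astar_mn - np.conj(z) * shift, compute_uv=False)[-1]
    return float(min(s1, s2))


def jacobi_entry(i, j):
    """Free Jacobi operator: ones on the first off-diagonals, zero elsewhere."""
    return 1.0 if abs(i - j) == 1 else 0.0


# z = 3 lies at distance 1 from the spectrum [-2, 2]; z = 0 lies inside it.
print(gamma_nm(jacobi_entry, 3.0, 50, 51))
print(gamma_nm(jacobi_entry, 0.0, 50, 51))
```

For this tridiagonal example, taking m = n + 1 already captures every non-zero entry of the first n columns, so $\gamma _{n,n+1}=\gamma _n$. For a general operator, m has to be increased until the truncation error is acceptable.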

Throughout, we use that DistSpec requires only finitely many arithmetic operations and comparisons, as proven in [60] (one can perform the IsPosDef routine using incomplete Cholesky decompositions). Furthermore, as outlined in Remark 5.12, we can make all of the algorithms in this paper and those in this appendix work using $\Delta _1$-information and restricting to arithmetical operations over the rationals.
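To indicate why a positive-definiteness test is enough to bound an injection modulus from below, here is a minimal sketch. It assumes the standard fact that $\sigma _{\textrm{inf}}(B)>\epsilon $ exactly when $B^*B-\epsilon ^2I$ is positive definite, and it uses a complete floating-point Cholesky factorisation from NumPy in place of the exact arithmetic and incomplete Cholesky decompositions mentioned above. The function names are hypothetical, and the code is not the DistSpec or IsPosDef routine of [60].

```python
import numpy as np


def is_pos_def(M):
    """Return True if the Hermitian matrix M is numerically positive definite."""
    try:
        np.linalg.cholesky(M)  # raises LinAlgError if the factorisation fails
        return True
    except np.linalg.LinAlgError:
        return False


def sigma_inf_exceeds(B, eps):
    """Decide whether the smallest singular value of B (m x n, m >= n) exceeds eps."""
    n = B.shape[1]
    return is_pos_def(B.conj().T @ B - eps**2 * np.eye(n))


def sigma_inf_lower_bound(B, grid):
    """Largest k/grid (k a non-negative integer) passing the test at every step."""
    k = 0
    # Terminates because the test must fail once (k + 1)/grid >= sigma_inf(B).
    while sigma_inf_exceeds(B, (k + 1) / grid):
        k += 1
    return k / grid


# Toy 3 x 2 matrix with singular values 2 and 0.55.
B = np.array([[0.55, 0.0], [0.0, 2.0], [0.0, 0.0]])
print(sigma_inf_lower_bound(B, 10))
```

Each step uses only a factorisation and comparisons, which is the feature the text relies on. A rigorous version would replace the floating-point factorisation by one with controlled errors.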

Appendix B. Examples of Computational Routines

We provide short and simplified routines for some of the algorithms in this paper. For example, we have ignored issues like the rigorous approximation of the function $\gamma _{n,m}$ in (A.2) using arithmetical operations. For brevity, we stick to one domain $\Omega $ and the evaluation set $\Lambda _1$ (matrix values) for each problem function $\Xi $. In each case, we have chosen the non-trivial $\Omega $ with the simplest algorithm. For the different algorithms for different classes of operators, see the proofs. In general, different classes of operators and evaluation sets have different SCI classifications and different algorithms for the same problem function.

B.1. Spectral Radii, Capacity and Operator Norms

For the problem functions in Sects. 3.1–3.3, we consider $\Omega _f$ (see (2.1)) and $\Omega _f\cap \Omega _{\textrm{SA}}$ for computing the capacity of the spectrum.

B.2. Essential Numerical Range, Gaps in Essential Spectra and Detecting Algorithm Failure for Finite Section

For the problems in Sect. 3.4, we consider $\Omega _\textrm{B}$.

B.3. Lebesgue Measure

For the problems in Sect. 3.5, we consider $\Omega _f$.

B.4. Fractal Dimensions

For the problems in Sect. 3.6, we consider $\Omega _f^{BD}$ for the box-counting dimension and $\Omega _f\cap \Omega _{\textrm{SA}}$ for the Hausdorff dimension.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Colbrook, M.J. On the Computation of Geometric Features of Spectra of Linear Operators on Hilbert Spaces. Found Comput Math (2022). https://doi.org/10.1007/s10208-022-09598-0

Received: 05 February 2021
Revised: 13 June 2022
Accepted: 16 September 2022
Published: 01 December 2022
DOI: https://doi.org/10.1007/s10208-022-09598-0
