Partial spectral flow and the Aharonov–Bohm effect in graphene

Katsnelson, Mikhail I.; Nazaikinskii, Vladimir

doi:10.1140/epjc/s10052-020-08464-z

Regular Article – Theoretical Physics
Open access
Published: 25 September 2020

Volume 80, article number 888, (2020)

The European Physical Journal C


Abstract

We study the Aharonov–Bohm effect in an open-ended tube made of a graphene sheet whose dimensions are much larger than the interatomic distance in graphene. An external magnetic field vanishes on and in the vicinity of the graphene sheet and its flux through the tube is adiabatically switched on. It is shown that, in the process, the energy levels of the tight-binding Hamiltonian of $\pi $-electrons unavoidably cross the Fermi level, which results in the creation of electron–hole pairs. The number of pairs is proven to be equal to the number of magnetic flux quanta of the external field. The proof is based on the new notion of partial spectral flow, which generalizes the ordinary spectral flow already having well-known applications (such as the Kopnin forces in superconductors and superfluids) in condensed matter physics.


1 Introduction

One of the main trends in contemporary theoretical physics and, in particular, in the theory of condensed matter is the increasing role of geometric and especially topological language [1,2,3,4,5,6,7,8,9]. Subtle and nontrivial topological effects in superfluid helium-3 [4], topologically protected zero-energy states in graphene in a magnetic field [5], and the quickly growing field of topological insulators [7] are just a few examples.

In most cases, the use of topological concepts in condensed matter physics is closely related to the continuum-medium description. For example, the topology of electronic states in graphene, topological insulators, Weyl semimetals, and other “topological quantum matter” [8] is studied for effective Hamiltonians describing the electronic band structure in the close vicinity of some special points in the Brillouin zone. In this approximation, the Hamiltonians are partial differential operators, and one can use well-developed machinery, such as the index of Dirac operators [10] or the spectral flow [11]. Note that the appearance of nonzero spectral flow related to the “Dirac-like” dynamics of fermions in the presence of vortices in rotating superfluid He-3 or in type II superconductors leads to very interesting observable effects such as additional forces acting on moving vortices [4, 12,13,14,15,16,17,18]. However, there also exist natural models in which the Hamiltonians in periodic crystal lattices are matrices, and accordingly the Schrödinger equation for electrons is a finite-difference equation rather than a differential one. Transferring topological concepts to this case is in general a nontrivial mathematical problem. To our knowledge, this field has been rather poorly studied, at least in the context of applications to condensed matter physics. Given the broad use of lattice models in quantum field theory [19], it may be of even more general interest. Here we give a solution of one particular problem of this kind, namely, a modification of the concept of spectral flow that is required when passing from the continuum-medium to the lattice description of the electronic structure of graphene [5].

Consider a flake with several holes containing magnetic fluxes (see Fig. 1). Even when the magnetic field is nonzero only within the holes, it will affect the wave function and the energy spectrum of the electrons in the flake owing to the Aharonov–Bohm effect [20, 21]. The spectrum should be a periodic function of the fluxes; namely, when all fluxes are changed by integers (in units of the flux quantum), the spectrum should coincide with the initial one. If the Hamiltonian with purely discrete spectrum is bounded or at least semibounded above or below, this automatically means that the total number of, say, negative eigenvalues is a periodic function of the fluxes, and the spectral flow is zero.^{Footnote 1} However, for the Dirac operator, which is unbounded on both sides, a shift of the spectrum, e.g., $E_n \longrightarrow E_{n+1}$, is also possible. In this situation the spectral flow is nonzero. It was proven [22, 23] that such a situation arises in graphene for a certain kind of boundary conditions if the electrons in graphene are described by the Dirac approximation. This has important physical consequences [23]. In particular, a nonzero spectral flow means that, for any position of the Fermi energy, one of the energy levels unavoidably coincides with the Fermi energy at some stage as the magnetic fluxes are changed, which entails all kinds of specific many-body effects, potential instabilities, etc. [5].

However, strictly speaking, this cannot be the case for real graphene, because the Dirac model is valid only in a close vicinity of the conical K and $K'$ points. At larger energy scales, one needs to use a tight-binding model with a finite bandwidth [5]. Obviously, the spectral flow as usually defined can only be zero in such a situation.

In this paper, we introduce a concept of partial spectral flow for the tight-binding model of graphene. We will show that despite the vanishing of the total spectral flow the physical conclusion [23] on the unavoidable crossing of energy levels with the Fermi energy at adiabatically growing magnetic flux remains correct.

To make our consideration mathematically rigorous and to avoid unnecessary, purely technical complications, we will consider a situation simpler than that in Fig. 1, namely, a graphene tube (which can be considered as a carbon nanotube of a very large radius). We conjecture that the same holds for a graphene flake with several holes, the case considered in [23].

2 Reminder: Hamiltonians of $\pi $-electrons in an infinite graphene sheet

We use the common model described in [5, Chap. 1]. Recall that graphene has a hexagonal (“honeycomb”) lattice with nearest-neighbor interatomic distance $a\approx 1.42$ Å. The lattice naturally splits into two sublattices A and B, where each atom in sublattice A is surrounded by three atoms of sublattice B, and vice versa.

Geometrically, it will be convenient for us to think of the sheet plane as tiled by $3a\times \sqrt{3}a$ rectangles each containing a single hexagon of the lattice (see Fig. 2). Each of the sublattices A and B is a Bravais lattice with primitive vectors

$$\begin{aligned} a_1 =\biggl (\frac{3a}{2},\frac{a\sqrt{3}}{2}\biggr ),\quad a_2 =\biggl (\frac{3a}{2},-\frac{a\sqrt{3}}{2}\biggr ), \end{aligned}$$

and the reciprocal lattice is generated by the vectors (see Fig. 3)

$$\begin{aligned} b_1 =\biggl (\frac{2\pi }{3a},\frac{2\pi }{a\sqrt{3}}\biggr ),\quad b_2 =\biggl (\frac{2\pi }{3a},-\frac{2\pi }{a\sqrt{3}}\biggr ). \end{aligned}$$
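The two displayed pairs of vectors can be checked against the duality relation $\langle a_i,b_j\rangle =2\pi \delta _{ij}$ that defines a reciprocal lattice. The following is a minimal sketch using only the formulas above; the helper `dot` and the variable `gram` are names introduced here for illustration.

```python
import math

a = 1.42  # nearest-neighbor distance in angstroms (value quoted in the text)

# Primitive vectors of each sublattice and the reciprocal-lattice generators,
# copied from the two displayed formulas above.
a1 = (3 * a / 2, a * math.sqrt(3) / 2)
a2 = (3 * a / 2, -a * math.sqrt(3) / 2)
b1 = (2 * math.pi / (3 * a), 2 * math.pi / (a * math.sqrt(3)))
b2 = (2 * math.pi / (3 * a), -2 * math.pi / (a * math.sqrt(3)))


def dot(u, v):
    """Euclidean inner product <u, v> of two plane vectors."""
    return u[0] * v[0] + u[1] * v[1]


# Matrix of <a_i, b_j> / (2*pi); it should be the 2x2 identity up to rounding.
gram = [[dot(ai, bj) / (2 * math.pi) for bj in (b1, b2)] for ai in (a1, a2)]
print(gram)
```

The check does not depend on the numerical value of `a`, since every entry of `gram` is dimensionless.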

In the tight-binding approximation, the electron $\psi $-function is defined on the lattice, and the Hamiltonian has the form

$$\begin{aligned} {}[{\widehat{H}}\psi ](x)=\gamma _0\sum _y\psi (y), \end{aligned}$$

(1)

where the sum is over the three neighbors y of the lattice point x and $\gamma _0$ is a constant known as the hopping parameter. Note that the sign of $\gamma _0$ does not affect any properties of the Hamiltonian and can be changed just by a redefinition of the basis vectors [5, Chap. 1]. To be specific, we will assume here $\gamma _0 >0$.

The Hamiltonian can be conveniently expressed in terms of the operators ${\widehat{p}}=({\widehat{p}}_1,{\widehat{p}}_2)$, $\widehat{p}_j=-i\frac{\partial }{\partial x_j}$, if the $\psi $-function is represented as a 2-vector $\psi =\bigl ({\begin{matrix} \psi _B \\ \psi _A \end{matrix}}\bigr )$, where $\psi _B$ and $\psi _A$ are the restrictions of $\psi $ to sublattices B and A, respectively. Then

$$\begin{aligned} \begin{aligned} {\widehat{H}}=&H({\widehat{p}}),\quad H(p)= \gamma _0 \begin{pmatrix} 0 &{} T(p) \\ T^*(p) &{} 0 \\ \end{pmatrix}, \\ T(p)&=\sum _{j=1}^3e^{i\langle \delta _j, p\rangle }, \end{aligned} \end{aligned}$$

(2)

where

$$\begin{aligned} \delta _1=\biggl (\frac{a}{2},\frac{a\sqrt{3}}{2}\biggr ),\quad \delta _2=\biggl (\frac{a}{2},-\frac{a\sqrt{3}}{2}\biggr ),\quad \delta _3=(-a,0) \end{aligned}$$

are the vectors joining a point of sublattice B with its nearest A neighbors and $\langle u,v\rangle =u_1v_1+u_2v_2$. Thus, $e^{i\langle \delta _j,{\widehat{p}}\rangle }$ is a shift operator,

$$\begin{aligned} \bigl [e^{i\langle \delta _j,\widehat{p}\rangle }\varphi \bigr ](x)=\varphi (x+\delta _j). \end{aligned}$$

(3)

The function T(p) vanishes at the Dirac points

$$\begin{aligned} K=\biggl (\frac{2\pi }{3a},\frac{2\pi }{3a\sqrt{3}}\biggr ), \quad K'=\biggl (\frac{2\pi }{3a},-\frac{2\pi }{3a\sqrt{3}}\biggr ) \end{aligned}$$

of the reciprocal lattice (see Fig. 3), and for $\psi $-functions localized in the momentum space near these points the Dirac Hamiltonians are used, which are obtained as approximations to the tight-binding Hamiltonian as follows. Make the change of variables

$$\begin{aligned} \begin{pmatrix} \psi _B(x) \\ \psi _A(x) \end{pmatrix}= W\begin{pmatrix} u_B(x) \\ u_A(x) \end{pmatrix},\quad W=e^{i\langle {\widetilde{K}},x\rangle } \begin{pmatrix} 1 &{} 0 \\ 0 &{} e^{-\tfrac{5\pi i}{6}} \end{pmatrix}, \end{aligned}$$

(4)

where ${\widetilde{K}}=K$ or $K'$. Then the Hamiltonian acting on the vector functions $u=\bigl ({\begin{matrix} u_B\\ u_A \end{matrix}}\bigr )$ is

$$\begin{aligned} W^{-1}{\widehat{H}}W=\gamma _0\begin{pmatrix} 0 &{} e^{-\tfrac{5\pi i}{6}}T({\widetilde{K}}+{\widehat{p}}) \\ e^{\tfrac{5\pi i}{6}}T^*({\widetilde{K}}+{\widehat{p}}) &{} 0 \end{pmatrix}. \end{aligned}$$

Assuming that $u_B(x)$ and $u_A(x)$ are smooth functions on ${\mathbb {R}}^2$ varying slowly compared with the exponential $e^{i\langle {\widetilde{K}},x\rangle }$, the symbol $T({\widetilde{K}}+p)$ can be replaced in the first approximation by the linear part of its Taylor expansion at the point $p=0$, and we obtain the Dirac Hamiltonian ${\widehat{D}}=D^{+}({\widehat{p}})$ if ${\widetilde{K}}=K$ or ${\widehat{D}}'=D^{-}({\widehat{p}})$ if ${\widetilde{K}}=K'$, where

$$\begin{aligned} D^{\pm }(p)&=\frac{3a\gamma _0}{2}\begin{pmatrix} 0 &{} p_1\pm ip_2 \\ p_1\mp ip_2 &{} 0 \end{pmatrix},\nonumber \\ D^{\pm }({\widehat{p}})&=\frac{3a\gamma _0}{2}\begin{pmatrix} 0 &{} \displaystyle -i\frac{\partial }{\partial x_1}\pm \frac{\partial }{\partial x_2} \\ \displaystyle -i\frac{\partial }{\partial x_1}\mp \frac{\partial }{\partial x_2} &{} 0 \end{pmatrix}. \end{aligned}$$

(5)
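The passage from the tight-binding symbol to the Dirac symbols can be checked numerically. The sketch below, which uses only formulas (2), (4) and (5), evaluates $T$ at $K$ and $K'$ and compares $e^{-5\pi i/6}T({\widetilde{K}}+p)$ with the upper-right entry of $D^{\pm }(p)$ (both divided by $\gamma _0$) for a small $p$. The function names and the test momentum are choices made here for illustration.

```python
import cmath
import math

a = 1.0  # lattice spacing; any positive value works for this check
s3 = math.sqrt(3)
deltas = [(a / 2, a * s3 / 2), (a / 2, -a * s3 / 2), (-a, 0.0)]
K = (2 * math.pi / (3 * a), 2 * math.pi / (3 * a * s3))
K_prime = (K[0], -K[1])


def T(p):
    """Symbol T(p) = sum_j exp(i <delta_j, p>) from formula (2)."""
    return sum(cmath.exp(1j * (d[0] * p[0] + d[1] * p[1])) for d in deltas)


def rotated_T(valley, p):
    """Upper-right entry of W^{-1} H W divided by gamma_0."""
    shifted = (valley[0] + p[0], valley[1] + p[1])
    return cmath.exp(-5j * math.pi / 6) * T(shifted)


def dirac_entry(sign, p):
    """Upper-right entry of D^{+}(p) (sign=+1) or D^{-}(p) (sign=-1), divided by gamma_0."""
    return 1.5 * a * (p[0] + sign * 1j * p[1])


p = (1e-4, -2e-4)  # small momentum offset from the Dirac point
err_K = abs(rotated_T(K, p) - dirac_entry(+1, p))
err_Kp = abs(rotated_T(K_prime, p) - dirac_entry(-1, p))
print(abs(T(K)), abs(T(K_prime)), err_K, err_Kp)
```

Both errors are of second order in $a|p|$, which is what one expects when only the linear part of the Taylor expansion is kept.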

3 Main results

Consider a graphene tube in the shape of a right circular open-ended cylinder whose length and radius are both much greater than the distance between neighboring carbon atoms. We will study how the $\pi $-electron energy levels in graphene are affected if one adiabatically switches on a magnetic field ${\mathbf {B}}$ whose field lines pass through the tube and which vanishes on the tube surface (see Fig. 4).

3.1 Hamiltonians and boundary conditions

We denote the cylinder by X. Let L and R be the cylinder length and radius, respectively. We assume that $L\gg a$ and $R\gg a$, where a is the nearest-neighbor interatomic distance. The circumference of the tube is $l=2\pi R$. We use the coordinates $(x_1,x_2)$ on X, where $x_1\in [0,L]$ is the coordinate along the cylinder axis and $x_2\in [0,l]$ is the circumferential coordinate (so that the endpoints of [0, l] are glued together) and sometimes identify X with $[0,L]\times [0,l]$.

The unfolded graphene tube is shown in Fig. 5. We assume that the graphene lattice, which we denote by $X_a=X_A\cup X_B\subset X$, has zigzag boundaries at the tube ends.

Then we have $L=3aM$ and $l=\sqrt{3}aN$, where M and N are the numbers of elementary $3a\times \sqrt{3}a$ rectangles (cf. Fig. 2, right) along the $x_1$- and $x_2$-axis, respectively. It is easily seen that the lattice $X_a$ has 4MN vertices. Mathematically, it is convenient to assume that L and l are constant and a is a small parameter. Thus, $a\rightarrow 0$ and accordingly $M,N\rightarrow \infty $ so that the ratio M/N remains constant, $M/N=L/(l\sqrt{3})$. This is the point of view we take in what follows.

For the graphene tube, definition (1) (or, equivalently, (2)) of the tight-binding Hamiltonian fails to work at the boundary sites, where one of the neighboring lattice points is missing (see Fig. 5). To make the definition work, we must somehow define the values of the $\psi $-function at the “fictitious” neighboring sites outside X based on its values at the sites belonging to $X_a$. There are many ways to do this; here we use the simplest rule and define the value at an outer site to be equal to the value at the nearest inner site; i.e., we set

$$\begin{aligned} \begin{aligned} \psi _A\Bigl (-\frac{a}{2},x_2\Bigr )&:=\psi _B\Bigl (\frac{a}{2},x_2\Bigr ),\\ \psi _B\Bigl (L+\frac{a}{2},x_2\Bigr )&:=\psi _A\Bigl (L-\frac{a}{2},x_2\Bigr ). \end{aligned} \end{aligned}$$

(6)

A straightforward computation shows that the operator ${\widehat{H}}$ defined by (1) with the boundary conditions (6) is self-adjoint in the Hilbert space ${{\mathscr {H}}}_a=\ell ^2(X_a)$ with inner product

$$\begin{aligned} (\psi ,{{\widetilde{\psi }}})=\frac{1}{4MN}\sum _{x\in X_a}\overline{\psi (x)}{{\widetilde{\psi }}}(x). \end{aligned}$$

(7)

Now if we substitute (4) into (6) and let $a\rightarrow 0$, then we arrive at the boundary conditions for the Dirac operators (5). They have the form

$$\begin{aligned} -iu_B(0,x_2)=u_A(0,x_2),\quad -iu_B(L,x_2)=u_A(L,x_2)\nonumber \\ \end{aligned}$$

(8)

and are a special case of the Berry–Mondragon boundary conditions [24]

$$\begin{aligned} (n_{x_2}-in_{x_1})u_B=\varkappa u_A, \end{aligned}$$

where ${\mathbf {n}}=(n_{x_1},n_{x_2})$ is the inward normal on the boundary and $\varkappa $ is a nonvanishing real-valued function on the boundary. Indeed, ${\mathbf {n}}=(1,0)$ at the left end of the tube ($x_1=0$), and ${\mathbf {n}}=(-1,0)$ at the right end ($x_1=L$). Thus, $\varkappa =1$ for the first condition in (8), and $\varkappa =-1$ for the second condition. The expressions (5) with the boundary conditions (8) define self-adjoint operators ${\widehat{D}}$ and ${\widehat{D}}'$ on the Hilbert space ${{\mathscr {H}}}_0=L^2(X)\oplus L^2(X)$ with inner product

$$\begin{aligned} (u,v)=\frac{1}{2Ll}\iint _{[0,L]\times [0,l]} \bigr (\overline{u_A(x)}v_A(x)+\overline{u_B(x)}v_B(x)\bigl )\,dx. \end{aligned}$$

(9)

3.2 Switching on the magnetic field

Consider a magnetic field ${\mathbf {B}}$ vanishing on and in the vicinity of the tube surface. (This is the setting in which one speaks of the Aharonov–Bohm effect: the field is zero in the domain where the particles (in our case, the $\pi $-electrons) are confined. However, note that all the subsequent constructions remain valid under the weaker condition that the normal component of ${\mathbf {B}}$ vanishes everywhere on the tube surface.) Let us switch on the field adiabatically. This means that we have a continuous family ${\mathbf {B}}(t)$ of magnetic fields vanishing on X such that ${\mathbf {B}}(0)=0$ and ${\mathbf {B}}(1)={\mathbf {B}}$, and t is slow (“adiabatic”) time; that is, t varies with the ordinary time so slowly that the system can be viewed as passing through a family of stationary states. Physically, this means that the broadening of the energy levels due to the finite duration of the process must be much less than the distance between neighboring energy levels, $\hbar /\tau \ll \varDelta E$, where $\hbar $ is the reduced Planck constant, $\tau $ is the actual (physical) time of the switching-on process, and $\varDelta E$ is the interlevel distance (which in our problem is of the order of the hopping parameter $\gamma _0$ divided by the sample area in lattice units, that is, of the order of $\gamma _0/(MN)$). The simplest example is ${\mathbf {B}}(t)=t{\mathbf {B}}$. We can write ${\mathbf {B}}(t)=\nabla \times {\mathbf {A}}(t)$, where ${\mathbf {A}}(t)$ is the magnetic vector potential. It will be assumed without loss of generality that ${\mathbf {A}}(0)=0$ (which is consistent with the condition ${\mathbf {B}}(0)=0$). Let $A_1(x,t)$ and $A_2(x,t)$, $x\in X$, be the axial and circumferential components, respectively, of the vector potential ${\mathbf {A}}(t)$ restricted to the tube surface. We write ${\mathrm {A}}=(A_1,A_2)$.
(If magnetic potentials are interpreted as differential 1-forms, then $A_1(x,t)\,dx_1+A_2(x,t)\,dx_2$ is just the restriction of ${\mathbf {A}}(t)$ to X.)
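To get a feeling for the adiabaticity condition $\hbar /\tau \ll \varDelta E$ with $\varDelta E\sim \gamma _0/(MN)$, one can evaluate the time scale $\hbar MN/\gamma _0$ for sample parameters. The sketch below is an order-of-magnitude illustration only: the value $\gamma _0=2.8$ eV and the sizes $M=N=1000$ are assumptions made here, not values taken from the paper, and the function name is hypothetical.

```python
HBAR_EV_S = 6.582119569e-16  # reduced Planck constant in eV*s


def min_switching_time(gamma0_ev, M, N):
    """Time scale hbar / Delta E, with Delta E ~ gamma0 / (M*N), in seconds."""
    level_spacing = gamma0_ev / (M * N)  # rough interlevel distance in eV
    return HBAR_EV_S / level_spacing


# Assumed illustrative values: gamma0 = 2.8 eV, M = N = 1000.
tau_scale = min_switching_time(gamma0_ev=2.8, M=1000, N=1000)
print(f"tau must be much larger than about {tau_scale:.2e} s")
```

The estimate scales linearly with the number of elementary rectangles $MN$, so larger tubes require slower switching.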

The condition that ${\mathbf {B}}(t)=0$ on X implies that

$$\begin{aligned} \frac{\partial A_1}{\partial x_2}-\frac{\partial A_2}{\partial x_1}=0,\quad x\in X. \end{aligned}$$

(10)

In the presence of the magnetic field ${\mathbf {B}}(t)$, the boundary conditions remain the same, and the momentum operator occurring in the Hamiltonians is modified as follows [5, Chap. 2]:

$$\begin{aligned} {\widehat{p}}_j=-i\frac{\partial }{\partial x_j}\longmapsto {\widehat{p}}_j-A_j(x,t),\quad j=1,2. \end{aligned}$$

(11)

(We work in a system of units where $e=1$ and $c=1$ and omit the factor e/c.) Thus, in the Dirac approximation we have the Hamiltonians

$$\begin{aligned} {\widehat{D}}_t=D^+({\widehat{p}}-{\mathrm {A}}(x,t)), \quad {\widehat{D}}_t'=D^-({\widehat{p}}-{\mathrm {A}}(x,t)) \end{aligned}$$

(12)

corresponding to the K and $K'$ valleys, respectively, with the boundary conditions (8), and the tight-binding Hamiltonian becomes

$$\begin{aligned} {\widehat{H}}_t=H({\widehat{p}}-{\mathrm {A}}(x,t)) \end{aligned}$$

(13)

with the boundary conditions (6). The symbol H(p) (see (2)) involves exponential functions of p, and so it might be helpful if we explain how the right-hand side of (13) is defined. It suffices to define the exponential $e^{i\langle \delta _j,{\widehat{p}}-{\mathrm {A}}(x,t)\rangle }$. This exponential is none other than the value at $\tau =1$ of the solution of the Cauchy problem for the first-order differential equation

$$\begin{aligned} -i\frac{\partial u}{\partial \tau }=\langle \delta _j,{\widehat{p}}-{\mathrm {A}}(x,t)\rangle u,\quad u|_{\tau =0}=1. \end{aligned}$$

By solving this problem, we find that

$$\begin{aligned}&e^{i\langle \delta _j,{\widehat{p}}-{\mathrm {A}}(x,t)\rangle }\nonumber \\&\quad =\exp \left\{ -i\int _{0}^{1}\langle \delta _j, {\mathrm {A}}(x+\tau \delta _j,t)\rangle \,d\tau \right\} e^{i\langle \delta _j,{\widehat{p}}\rangle }. \end{aligned}$$

(14)
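Formula (14) says that the magnetic field enters the tight-binding Hamiltonian only through a phase factor attached to each shift operator. As a minimal sketch, the phase can be computed by numerical quadrature and compared with the closed form $e^{-i\langle \delta _j,{\mathrm {A}}_0\rangle }$ that holds for a constant potential. The function `peierls_phase`, the midpoint rule, and the constant test potential `A_const` are illustrative choices made here, not part of the paper.

```python
import cmath
import math


def peierls_phase(delta, A, x, t, steps=200):
    """Phase factor exp(-i * int_0^1 <delta, A(x + tau*delta, t)> dtau) from (14),
    with the integral approximated by the midpoint rule."""
    total = 0.0
    for k in range(steps):
        tau = (k + 0.5) / steps
        y = (x[0] + tau * delta[0], x[1] + tau * delta[1])
        A1, A2 = A(y, t)
        total += delta[0] * A1 + delta[1] * A2
    return cmath.exp(-1j * total / steps)


# Hypothetical example: constant circumferential potential with flux 2*pi*q*t.
a, N, q = 1.0, 30, 2
l = math.sqrt(3) * a * N  # circumference of the tube


def A_const(y, t):
    """Constant potential (0, Phi(t)/l) with Phi(t) = 2*pi*q*t."""
    return (0.0, 2 * math.pi * q * t / l)


delta_1 = (a / 2, a * math.sqrt(3) / 2)
phase = peierls_phase(delta_1, A_const, x=(0.0, 0.0), t=1.0)
expected = cmath.exp(-1j * delta_1[1] * 2 * math.pi * q / l)
print(abs(phase - expected))
```

For this particular potential the shift along $\delta _3=(-a,0)$ acquires no phase at all, because $\delta _3$ has no circumferential component.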

Now assume that the magnetic flux $\varPhi $ of the field ${\mathbf {B}}$ through the tube is an integer multiple of $2\pi $:

$$\begin{aligned} \varPhi =\int _{0}^{l} A_2(x_1,x_2,1)\,dx_2=2\pi q,\quad q\in {\mathbb {Z}}. \end{aligned}$$

(15)

(The integral in (15) is independent of $x_1$ by condition (10).) The number q is referred to as the “number of magnetic flux quanta” through the tube. In view of (10), there exists a function S(x) on the rectangle $[0,L]\times [0,l]$ such that $\nabla S(x)=\mathrm {A}(x,1)$, and it follows from (15) that

$$\begin{aligned} S(x_1,l)-S(x_1,0)=2\pi q. \end{aligned}$$

Consequently, $e^{iS(x_1,0)}=e^{iS(x_1,l)}$, the formula

$$\begin{aligned} U(x)=e^{iS(x)} \end{aligned}$$

(16)

gives a well-defined smooth function on the cylinder X, and one has

$$\begin{aligned} \nabla U(x)=i{\mathrm {A}}(x,1)U(x). \end{aligned}$$

It follows that ${\widehat{p}}-{\mathrm {A}}(x,1)=U {\widehat{p}} U^{-1}$, and we see that the gauge transformation by U establishes a unitary equivalence between the Hamiltonians at $t=0$ and $t=1$:

$$\begin{aligned} \begin{aligned} {\widehat{H}}&\equiv {\widehat{H}}_0=U^{-1} {\widehat{H}}_1 U,\\ {\widehat{D}}&\equiv {\widehat{D}}_0=U^{-1} {\widehat{D}}_1 U,\quad \widehat{D}'\equiv {\widehat{D}}_0'=U^{-1} {\widehat{D}}_1' U. \end{aligned} \end{aligned}$$

(17)

Thus, the spectrum of each of these Hamiltonians without the magnetic field is the same as that of the same Hamiltonian with the magnetic field fully switched on. But what happens with the spectrum in between, that is, as t varies from 0 to 1? Do the eigenvalues cross the zero level? How many of them do so, and in what direction?

3.3 Aharonov–Bohm effect for the Dirac Hamiltonians

The answer for the case of Dirac Hamiltonians was given in [22, 23]. An adequate tool for describing the motion of eigenvalues is given by the notion of spectral flow introduced by Atiyah, Patodi, and Singer [11], which can be informally described as follows. Consider a family $\{B_t\}_{t\in [0,1]}$ of self-adjoint operators that in some sense continuously depend on t and whose spectrum in a neighborhood of zero is purely discrete. Then the spectral flow ${\text {sf}}\{B_t\}$ is the net number of eigenvalues crossing zero in the positive direction as t varies from 0 to 1 (see Fig. 6).

The rigorous definition can be found in [11] and, in a different form, in [25] (see also [26] and Remark 1 in the next subsection). The spectral flow is homotopy invariant in the class of families such that $B_0$ and $B_1$ are isospectral (i.e., have the same spectrum) and hence can be computed by topological means. A formula for the spectral flow of Dirac Hamiltonians on an arbitrary graphene “flake” was conjectured in [22] and then shown to be true in [23], where a general theorem on the spectral flow of families of Dirac type operators with classical boundary conditions on a compact manifold with boundary was proved. In our situation, this formula is as follows.

Proposition 1

(Special case of [23, Theorem 1]) Let condition (15) be satisfied. Then the spectral flow of the families (12) is given by the formula

$$\begin{aligned} {\text {sf}}\{{\widehat{D}}_t'\}=-{\text {sf}}\{\widehat{D}_t\}=q. \end{aligned}$$

(18)

Thus, the spectral flow coincides (up to the sign) with the number of magnetic flux quanta.

3.4 Partial spectral flow

If we try to apply the same tool—spectral flow—to the case of the tight-binding Hamiltonian, then we immediately see that such an approach fails. Indeed, the tight-binding Hamiltonian acts on the finite-dimensional space ${{\mathscr {H}}}_a$, and hence the spectral flow of the family ${\widehat{H}}_t$ (as well as of any operator family $\{B_t\}$ on a finite-dimensional space with isospectral $B_0$ and $B_1$) is necessarily zero.

That is why we introduce a finer notion of partial spectral flow along a subspace, which takes into account not only the eigenvalues themselves but also how close the corresponding eigenvectors are to a given subspace.
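A one-dimensional toy model may help to see why the total spectral flow vanishes while a flow restricted to part of the momentum space does not. The sketch below is not the graphene tube of this paper: it is a nearest-neighbor hopping ring with $N$ sites threaded by flux $2\pi qt$, whose levels are $2\cos (2\pi (j+qt)/N)$ and whose eigenvectors (plane waves) do not depend on t. All names and parameters are assumptions made for illustration.

```python
import math


def ring_levels(N, q, t):
    """Eigenvalues 2*cos(2*pi*(j + q*t)/N), j = 0..N-1, of a nearest-neighbor
    hopping ring threaded by flux 2*pi*q*t (toy model, not the graphene tube)."""
    return [2 * math.cos(2 * math.pi * (j + q * t) / N) for j in range(N)]


def net_crossings(start, end, modes):
    """Net number of branches in `modes` that go from negative to nonnegative.
    For a continuous branch this equals its net number of upward zero crossings."""
    up = sum(1 for j in modes if start[j] < 0 <= end[j])
    down = sum(1 for j in modes if end[j] < 0 <= start[j])
    return up - down


N, q = 50, 2  # N is chosen so that no level is exactly zero at t = 0
E0, E1 = ring_levels(N, q, 0.0), ring_levels(N, q, 1.0)

all_modes = range(N)
# Modes near the zero of the dispersion at k = 3*pi/2, where the levels rise.
near_rising_zero = [j for j in range(N) if abs(j - 3 * N / 4) <= 5]
complement = [j for j in range(N) if abs(j - 3 * N / 4) > 5]

total = net_crossings(E0, E1, all_modes)
partial = net_crossings(E0, E1, near_rising_zero)
rest = net_crossings(E0, E1, complement)
print(total, partial, rest)
```

With these parameters the three numbers are 0, 2 and -2: the spectrum at $t=1$ coincides with that at $t=0$ and the total flow is zero, yet $q$ levels cross zero upward among the modes near one zero of the dispersion and $q$ cross downward near the other.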

Let ${{\mathscr {H}}}$ be a Hilbert space, and let ${{\mathscr {L}}}\subset {{\mathscr {H}}}$ be a (closed) subspace. The orthogonal projection onto ${{\mathscr {L}}}$ in ${{\mathscr {H}}}$ will be denoted by $P_{{\mathscr {L}}}$.

Consider a family $\{B_t\}$, $t\in [0,1]$, of self-adjoint operators on ${{\mathscr {H}}}$. By $E(B_t,J)$, where $J\subset {\mathbb {R}}$ is an arbitrary interval, we denote the orthogonal projection in ${{\mathscr {H}}}$ onto the closed linear span of eigenvectors of $B_t$ corresponding to the eigenvalues lying in J.

Definition 1

The family $\{B_t\}$ is said to be ${{\mathscr {L}}}$-tame if the following conditions are satisfied:

(i)
The resolvent $(i-B_t)^{-1}$ continuously depends on $t\in [0,1]$ in the operator norm.

Next, there exists a $\delta >0$ such that

(ii)
For each $t\in [0,1]$, the spectrum of $B_t$ on the interval $(-\delta ,\delta )$ is purely discrete.
(iii)
For any $t\in [0,1]$ and any interval $J\subset (-\delta ,\delta )$, one has
$$\begin{aligned} \left\| [P_{{\mathscr {L}}},E(B_t,J)] \right\| <\frac{1}{4}. \end{aligned}$$
(19)

Here $[P_{{\mathscr {L}}},E(B_t,J)]=P_{{\mathscr {L}}} E(B_t,J)-E(B_t,J)P_{{\mathscr {L}}}$ is the commutator of $P_{{\mathscr {L}}}$ and $E(B_t,J)$.

Let $\{B_t\}$, $t\in [0,1]$, be an ${{\mathscr {L}}}$-tame family. By (i) and (ii), for some n there exists a partition $0=t_0<t_1<t_2<\cdots <t_{n+1}=1$ of the interval [0, 1] and numbers $\gamma _1,\dotsc ,\gamma _{n+1}\in (-\delta ,\delta )$ such that $\gamma _j$ does not lie in the spectrum ${\text {Spec}}(B_t)$ of the operator $B_t$ for $t\in [t_{j-1},t_j]$, $\gamma _1=\gamma _{n+1}\le 0$, and if $\gamma _1<0$, then the half-open interval $[\gamma _1,0)$ does not contain any points of spectrum of $B_0$ and $B_1$. Let ${{\mathscr {V}}}_j={{\mathscr {V}}}(B_{t_j},\gamma _j,\gamma _{j+1})$ be the linear span of eigenvectors of $B_{t_j}$ corresponding to the eigenvalues lying between $\gamma _j$ and $\gamma _{j+1}$. On the subspace ${{\mathscr {V}}}_j$, consider the quadratic form

$$\begin{aligned} A_j[u]=(u,(2P_{{\mathscr {L}}}-1)u),\quad u\in {{\mathscr {V}}}_j. \end{aligned}$$

(20)

Let $m_{j+}=\sigma _+(A_j)$ be the positive index of inertia of the form (20), i.e., the dimension of the positive subspace ${{\mathscr {V}}}_{j+}\subset {{\mathscr {V}}}_j$ of this form.

Definition 2

The partial spectral flow of the ${{\mathscr {L}}}$-tame family $\{B_t\}$, $t\in [0,1]$, along ${{\mathscr {L}}}$ is the number

$$\begin{aligned} {\text {sf}}_{{\mathscr {L}}}\{B_t\} =\sum _{j=1}^nm_{j+}{\text {sign}}(\gamma _j-\gamma _{j+1}). \end{aligned}$$

(21)

Remark 1

The definition of “traditional” spectral flow in the form given in [25, 26] is the special case of Definition 2 for ${{\mathscr {L}}}={{\mathscr {H}}}$. Here condition (iii) in Definition 1 is satisfied automatically, and the numbers $m_{j+}$ become the dimensions $m_j$ of the eigenspaces ${{\mathscr {V}}}_j$.

For the general case of ${{\mathscr {L}}}\subsetneq {{\mathscr {H}}}$, the subspace ${{\mathscr {V}}}_{j+}$ can be thought of as the part of ${{\mathscr {V}}}_j$ “close” to the subspace ${{\mathscr {L}}}$.

Some properties of the partial spectral flow are stated in the following theorem.

Theorem 1

(a)
Let $\{B_t\},$ $t\in [0,1],$ be an ${{\mathscr {L}}}$-tame family of self-adjoint operators. The partial spectral flows ${\text {sf}}_{{\mathscr {L}}}\{B_t\}$ and ${\text {sf}}_{{{\mathscr {L}}}^\perp }\{B_t\}$ are well defined, and
$$\begin{aligned} {\text {sf}}_{{\mathscr {L}}}\{B_t\} +{\text {sf}}_{{{\mathscr {L}}}^\perp }\{B_t\}={\text {sf}}\{B_t\}. \end{aligned}$$
(22)
(b)
(homotopy invariance of the partial spectral flow) Let $\{B(t,\tau )\}$ be a two-parameter family of self-adjoint operators satisfying conditions (i)–(iii) in Definition 1 in which $t\in [0,1]$ is everywhere replaced with $(t,\tau )\in [0,1]\times [0,1]$. If
$$\begin{aligned} {\text {sf}}_{{\mathscr {L}}}\{B(0,\tau )\}= {\text {sf}}_{{\mathscr {L}}}\{B(1,\tau )\}, \end{aligned}$$
(23)
then
$$\begin{aligned} {\text {sf}}_{{\mathscr {L}}}\{B(t,0)\}= {\text {sf}}_{{\mathscr {L}}}\{B(t,1)\}. \end{aligned}$$

The proof of this theorem, as well as some more details concerning the partial spectral flow, will be given in Sect. 4.

3.5 Aharonov–Bohm effect for the tight-binding Hamiltonian

Here we will show that although the spectral flow ${\text {sf}}\{{\widehat{H}}_t\}$ is zero, there is nonetheless a nontrivial motion of eigenvalues as t varies from 0 to 1. Namely, there exist subspaces ${{\mathscr {L}}},{{\mathscr {L}}}'\subset {{\mathscr {H}}}_a$ consisting of functions localized in the momentum space near the Dirac points K and $K'$, respectively, and such that the partial spectral flows of the family $\{{\widehat{H}}_t\}$ along these subspaces coincide with the spectral flows (18) of the respective families of Dirac operators.

Our first task will be to define these subspaces, and to this end we introduce a basis in ${{\mathscr {H}}}_a$. Consider the set $G_0$ of pairs (m, n) of integers such that

(a)
$-N\le n\le N-1$;
(b)
$M+1\le m\le 3M-1$ if $-N\le n\le -N/2$ or $N/2<n\le N-1$;
(c)
$M\le m\le 3M$ if $-N/2<n\le N/2$.

It is easily seen that $G_0$ contains exactly 4MN elements.
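The count of elements of $G_0$ can be confirmed by direct enumeration. The sketch below lists the pairs $(m,n)$ satisfying conditions (a)–(c) and compares their number with $4MN$ for a few sample sizes; the function name and the sample values of M and N are chosen here for illustration.

```python
def index_set_G0(M, N):
    """Pairs (m, n) of integers satisfying conditions (a)-(c) above."""
    pairs = []
    for n in range(-N, N):                  # (a): -N <= n <= N-1
        if -N < 2 * n <= N:                 # (c): -N/2 < n <= N/2
            m_range = range(M, 3 * M + 1)   #      M <= m <= 3M
        else:                               # (b): the remaining values of n
            m_range = range(M + 1, 3 * M)   #      M+1 <= m <= 3M-1
        pairs.extend((m, n) for m in m_range)
    return pairs


for M, N in [(2, 6), (5, 12), (7, 30)]:
    assert len(index_set_G0(M, N)) == 4 * M * N
```

The comparison `-N < 2 * n <= N` is written with integers only, so that the half-integer bounds $\pm N/2$ are handled without rounding.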

Lemma 1

(See Sect. A.1 for the proof) The functions

$$\begin{aligned} \varphi _{mn}(x)= {\left\{ \begin{array}{ll} e^{i\tfrac{\pi m}{L} x_1 +i\tfrac{2\pi n}{l} x_2},&{} x\in X_B,\\ e^{-i\tfrac{\pi m}{L} x_1 +i\tfrac{2\pi n}{l} x_2},&{} x\in X_A, \end{array}\right. } \quad (m,n)\in G_0,\nonumber \\ \end{aligned}$$

(24)

form an orthonormal basis in ${{\mathscr {H}}}_a$.

To simplify the exposition, we will assume that N is a multiple of 3. Set

$$\begin{aligned} {\overline{m}}=2M,\quad {\overline{n}}=\frac{N}{3}. \end{aligned}$$

Note that the $\varphi _{mn}$ can be rewritten in the form

$$\begin{aligned} \begin{aligned} \varphi _{mn}(x)&=e^{i\langle K,x\rangle } {\left\{ \begin{array}{ll} e^{i\tfrac{\pi (m-{\overline{m}})}{L} x_1 +i\tfrac{2\pi (n-{\overline{n}})}{l} x_2},&{} x\in X_B,\\ e^{\tfrac{2\pi i}{3}}e^{i\tfrac{\pi ({\overline{m}}-m)}{L} x_1 +i\tfrac{2\pi (n-{\overline{n}})}{l} x_2},&{} x\in X_A, \end{array}\right. }\\&=e^{i\langle K',x\rangle } {\left\{ \begin{array}{ll} e^{i\tfrac{\pi (m-{\overline{m}})}{L} x_1 +i\tfrac{2\pi (n+{\overline{n}})}{l} x_2},&{} x\in X_B,\\ e^{\tfrac{2\pi i}{3}}e^{i\tfrac{\pi ({\overline{m}}-m)}{L} x_1 +i\tfrac{2\pi (n+{\overline{n}})}{l} x_2},&{} x\in X_A. \end{array}\right. } \end{aligned}\nonumber \\ \end{aligned}$$

(25)

Thus, the function $\varphi _{mn}$ with $m={\overline{m}}$ and $n={\overline{n}}$ (or $n=-{\overline{n}}$) is just the exponential $e^{i\langle K,x\rangle }$ (or $e^{i\langle K',x\rangle }$) with the additional phase factor $e^{\tfrac{2\pi i}{3}}$ on sublattice A. Accordingly, the $\varphi _{mn}$ with (m, n) close to $({\overline{m}},\pm {\overline{n}})$ are localized in the momentum space near the Dirac points K and $K'$.
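The identification of $({\overline{m}},{\overline{n}})$ with the Dirac point K can be checked by comparing the wave vector $(\pi {\overline{m}}/L,\,2\pi {\overline{n}}/l)$ appearing in (24) with the coordinates of K given in Sect. 2. The following is a minimal numerical sketch; the sample values of a, M and N are assumptions made here.

```python
import math

a, M, N = 1.0, 4, 9  # N is a multiple of 3, as assumed in the text
L, l = 3 * a * M, math.sqrt(3) * a * N
m_bar, n_bar = 2 * M, N // 3

# Dirac point K from Sect. 2 and the wave vector of phi_{m_bar, n_bar} on sublattice B.
K = (2 * math.pi / (3 * a), 2 * math.pi / (3 * a * math.sqrt(3)))
wave_vector = (math.pi * m_bar / L, 2 * math.pi * n_bar / l)
print(wave_vector, K)
```

Replacing `n_bar` by `-n_bar` flips the sign of the second component and gives the coordinates of $K'$.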

Take some $d>0$ and define subspaces ${{\mathscr {L}}},{{\mathscr {L}}}'\subset {{\mathscr {H}}}_a$ as the linear spans

$$\begin{aligned} {{\mathscr {L}}}&={\text {Lin}}\{\varphi _{mn}:(m-{\overline{m}})^2+(n-{\overline{n}})^2\le d^2\}, \end{aligned}$$

(26)

$$\begin{aligned} {{\mathscr {L}}}'&={\text {Lin}}\{\varphi _{mn}:(m-{\overline{m}})^2+(n+{\overline{n}})^2\le d^2\}. \end{aligned}$$

(27)

The domains corresponding to ${{\mathscr {L}}}$ and ${{\mathscr {L}}}'$ in the momentum space are shown in Fig. 7.

Now we are in a position to state the main theorem of the present paper.

Theorem 2

There exists a $d>0$ (which may depend on the family ${\mathbf {B}}(t)$) such that, for all sufficiently small $a>0,$ the family ${\widehat{H}}_t$ is ${{\mathscr {L}}}$-, ${{\mathscr {L}}}'$-, and $({{\mathscr {L}}}\oplus {{\mathscr {L}}}')$-tame, and

$$\begin{aligned}&{\text {sf}}_{{{\mathscr {L}}}}\{{\widehat{H}}_t\} ={\text {sf}} \{{\widehat{D}}_t\}, \quad {\text {sf}}_{{{\mathscr {L}}}'}\{{\widehat{H}}_t\} ={\text {sf}} \{{\widehat{D}}'_t\}, \\&{\text {sf}}_{({{\mathscr {L}}} \oplus {{\mathscr {L}}}')^\perp }\{{\widehat{H}}_t\}=0. \end{aligned}$$

Thus, informally speaking, all nontrivial spectral flow is concentrated near the Dirac points K and $K'$ in the momentum space, and the partial spectral flows of the tight-binding Hamiltonian near these points are equal to the spectral flows provided by the respective Dirac approximations.

3.6 Proof of Theorem 2

We will only prove the assertion of the theorem for the subspace ${{\mathscr {L}}}$. The proof for the subspace ${{\mathscr {L}}}'$ is, mutatis mutandis, essentially the same. As to the claim for the subspace $({{\mathscr {L}}}\oplus {{\mathscr {L}}}')^\perp $, it readily follows from Lemmas 2 and 3 below; we omit the details.

To make the proof more readable, we have transferred some technical computations to Appendix A.

A. First, note that the specific value of $\gamma _0$ does not affect the assertion of the theorem in any way, because the spectral flow, as well as the partial spectral flow, does not change if the operator family is multiplied by a positive constant. Thus, we can take any $\gamma _0>0$ convenient to us instead of the actual, physically meaningful value, and from now on we set $\gamma _0=\frac{2}{3a}$ so as to ensure that the factor $\frac{3a\gamma _0}{2}$ occurring in formulas (5) for the Dirac operators is equal to unity.

B. Let $\varPhi (t)=2\pi q(t)$ be the flux of the field ${\mathbf {B}}(t)$ through the tube, $q(0)=0$, $q(1)=q\in {\mathbb {Z}}$. The potentials ${\mathrm {A}}(x,t)$ and ${\mathrm {A}}_0(t)=(0,\varPhi (t) l^{-1})$ (the latter being independent of x) generate the same flux, and hence there exists a smooth real-valued function F(x, t) on $X\times [0,1]$ such that $\nabla _x F={\mathrm {A}}-{\mathrm {A}}_0$. The corresponding gauge transformation $\psi \mapsto U_t^{-1}\psi $, where $U_t$ is the operator of multiplication by $e^{iF(x,t)}$, reduces the family ${\widehat{H}}_t=H({\widehat{p}}-{\mathrm {A}}(x,t))$ to the family ${\widehat{H}}_{0t}=H({\widehat{p}}-{\mathrm {A}}_0(t))$ of operators with constant magnetic potential:

$$\begin{aligned} {\widehat{H}}_t =U_t H({\widehat{p}}-{\mathrm {A}}_0(t)) U_t^{-1}\equiv U_t {\widehat{H}}_{0t} U_t^{-1}. \end{aligned}$$

(28)

C. It follows from (28) that any eigenvector of ${\widehat{H}}_t$ has the form $U_t\psi $, where $\psi $ is an eigenvector of ${\widehat{H}}_{0t}$ with the same eigenvalue. Let us study the eigenvalue problem for the operator ${\widehat{H}}_{0t}$. The operator ${\widehat{H}}_{0t}$ acts on the basis vectors $\varphi _{mn}$ by the formulas

$$\begin{aligned} {\widehat{H}}_{0t}\varphi _{mn}&=\mu (m,n,t)\varphi _{2{\overline{m}}-m,n}, \end{aligned}$$

(29)

$$\begin{aligned} {\widehat{H}}_{0t}\varphi _{2{\overline{m}}-m,n}&=\mu (2{\overline{m}}-m,n,t)\varphi _{mn} =\mu ^*(m,n,t)\varphi _{mn}, \end{aligned}$$

(30)

$(m,n)\in G_0$. (These formulas are proved in Sect. A.2, where we give explicit expressions for $\mu (m,n,t)$.) It follows from (29) and (30) that ${{\mathscr {H}}}_a$ splits into the orthogonal direct sum of two-dimensional invariant subspaces

$$\begin{aligned} {{\mathscr {V}}}_{mn}={\text {Lin}}\{\varphi _{mn},\varphi _{2{\overline{m}}-m,n}\},\qquad (m,n)\in G_0,\quad m>{\overline{m}}, \end{aligned}$$

and one-dimensional invariant subspaces

$$\begin{aligned} {{\mathscr {W}}}_n={\text {Lin}}\{\varphi _{{\overline{m}} n}\},\quad -N\le n\le N-1. \end{aligned}$$

On the subspace ${{\mathscr {V}}}_{mn}$, the operator ${\widehat{H}}_{0t}$ is represented by the $2\times 2$ antidiagonal matrix with antidiagonal entries $\mu (m,n,t)$ and $\mu ^*(m,n,t)$, and hence the eigenvalues of ${\widehat{H}}_{0t}$ on ${{\mathscr {V}}}_{mn}$ are $\pm |\mu (m,n,t)|$. The eigenvalue of ${\widehat{H}}_{0t}$ on ${{\mathscr {W}}}_n$ is $\mu ({\overline{m}},n,t)$.
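The eigenvalue claim on ${{\mathscr {V}}}_{mn}$ can be checked numerically. The following is only a sketch: the value of `mu` is an arbitrary stand-in for $\mu (m,n,t)$ (whose explicit form is given in Sect. A.2), and the helper name `antidiagonal_eigenvalues` is hypothetical. In the basis $(\varphi _{mn},\varphi _{2{\overline{m}}-m,n})$, formulas (29) and (30) give the Hermitian matrix with rows $(0,\mu ^*)$ and $(\mu ,0)$.

```python
import cmath

def antidiagonal_eigenvalues(mu):
    """Eigenvalues of the Hermitian matrix [[0, conj(mu)], [mu, 0]].

    The characteristic polynomial is lambda**2 - |mu|**2, so the roots
    are -|mu| and +|mu|.
    """
    # For this matrix the trace is 0 and the determinant is -|mu|^2.
    trace = 0.0
    det = -(mu * mu.conjugate()).real
    disc = cmath.sqrt(trace * trace - 4.0 * det)
    lam_minus = ((trace - disc) / 2.0).real
    lam_plus = ((trace + disc) / 2.0).real
    return lam_minus, lam_plus

mu = 0.3 - 0.4j  # arbitrary sample value with |mu| = 0.5 (assumption)
lam_minus, lam_plus = antidiagonal_eigenvalues(mu)
```

The two roots are symmetric about zero, so a level of ${{\mathscr {V}}}_{mn}$ can approach zero only where $|\mu (m,n,t)|$ itself is small.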

Lemma 2

(See Sect. A.3 for the proof) There exist numbers $\delta ,{\overline{q}},a_0>0$ such that if $a<a_0$ and $\psi $ is an eigenvector of ${\widehat{H}}_{0t}$ with eigenvalue $\lambda $ satisfying $-\delta<\lambda <\delta $, then

$$\begin{aligned} \psi \in \bigoplus _{j=-{\overline{q}}}^{{\overline{q}}} \bigl ({{\mathscr {W}}}_{j+{\overline{n}}}\oplus {{\mathscr {W}}}_{j-{\overline{n}}}\bigr ). \end{aligned}$$

D. We need to prove that the family ${\widehat{H}}_{0t}$ of self-adjoint operators is ${{\mathscr {L}}}$-tame for sufficiently large d. Conditions (i) and (ii) in Definition 1 are trivially satisfied, because the family continuously depends on t and acts on the finite-dimensional space ${{\mathscr {H}}}_a$. To verify (iii), take an arbitrary interval $J\subset (-\delta ,\delta )$ and fix a $t\in [0,1]$. The orthogonal projection $E({\widehat{H}}_t,J)$ onto the linear span of eigenvectors of ${\widehat{H}}_t$ corresponding to the eigenvalues lying in J has the form

$$\begin{aligned} E({\widehat{H}}_t,J)=U_t E({\widehat{H}}_{0t},J) U_t^{-1}. \end{aligned}$$

In turn, it follows from Lemma 2 and the invariance of the subspaces ${{\mathscr {W}}}_n$ with respect to ${\widehat{H}}_{0t}$ that

$$\begin{aligned} E({\widehat{H}}_{0t},J)=\sum _{k\in R}P_k, \end{aligned}$$

where $R \subset R_{{\overline{q}}}=\{k\in {\mathbb {Z}}:|{\overline{n}}-k|\le {\overline{q}}\text { or } |{\overline{n}}+k|\le {\overline{q}}\}$ is some subset (depending on t and J) and $P_k$ is the orthogonal projection onto ${{\mathscr {W}}}_k$. Accordingly,

$$\begin{aligned} {}[P_{{\mathscr {L}}},E({\widehat{H}}_t,J)]=\sum _{k\in R}[P_{{\mathscr {L}}},{\widetilde{P}}_k],\quad \text {where}\ \widetilde{P}_k=U_t P_k U_t^{-1}. \end{aligned}$$

(31)

Lemma 3

(See Sect. A.5 for the proof) There exists an integer $d>0$ such that, for the space ${\mathscr {L}}$ defined in (26) with this d,

$$\begin{aligned} \left\| [P_{{\mathscr {L}}},\smash {\widetilde{P}_k}] \right\| <\frac{1}{4(4{\overline{q}}+2)}\quad \text {for all}\quad k\in R_{{\overline{q}}} \end{aligned}$$

for all sufficiently small a. Similar estimates hold for the commutators with $P_{{{\mathscr {L}}}'}$.

Since the number of terms in the sum in (31) does not exceed $4{\overline{q}}+2$, we see that condition (iii) holds.
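The counting behind this step can be made explicit. The sketch below (with illustrative values of ${\overline{n}}$ and ${\overline{q}}$ chosen as an assumption, and a hypothetical helper name) enumerates the index set $R_{{\overline{q}}}$ and applies the triangle inequality to the sum in (31) using the bound of Lemma 3.

```python
def near_dirac_indices(n_bar, q_bar):
    """Indices k with |n_bar - k| <= q_bar or |n_bar + k| <= q_bar."""
    around_plus = set(range(n_bar - q_bar, n_bar + q_bar + 1))
    around_minus = set(range(-n_bar - q_bar, -n_bar + q_bar + 1))
    return around_plus | around_minus

n_bar, q_bar = 10, 2  # illustrative values only (assumption)
R = near_dirac_indices(n_bar, q_bar)
max_terms = 4 * q_bar + 2
# Each commutator norm is below 1/(4*max_terms) by Lemma 3, so the norm
# of the whole sum is below len(R)/(4*max_terms), which is at most 1/4.
bound_on_sum = len(R) / (4 * max_terms)
```

The set has exactly $4{\overline{q}}+2$ elements when the two index windows are disjoint and fewer when they overlap, so in either case the commutator in (31) has norm less than $\frac{1}{4}$.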

E. Consider the two-parameter family ${\widehat{H}}_{t,\tau }$ defined by the formula ${\widehat{H}}_{t,\tau }=U_{\tau t}{\widehat{H}}_{0t}U_{\tau t}^{-1}$. This is a homotopy between the ${{\mathscr {L}}}$-tame families ${\widehat{H}}_{0t}={\widehat{H}}_{t,0}$ and ${\widehat{H}}_t={\widehat{H}}_{t,1}$. We have

$$\begin{aligned} {\text {sf}}_{{\mathscr {L}}} \widehat{H}_t={\text {sf}}_{{\mathscr {L}}}{\widehat{H}}_{0t} \end{aligned}$$

(32)

by Theorem 1(b).

F. It readily follows from (29), (30), and the definition of ${{\mathscr {L}}}$ that ${{\mathscr {L}}}$ is an invariant subspace of the operators ${\widehat{H}}_{0t}$. Hence the partial spectral flow of the family $\{{\widehat{H}}_{0t}\}$ along ${{\mathscr {L}}}$ is equal to the usual spectral flow of the restriction of this family to ${{\mathscr {L}}}$,

$$\begin{aligned} {\text {sf}}_{{\mathscr {L}}}\{{\widehat{H}}_{0t}\}={\text {sf}} \{{\widehat{H}}_{0t}\big |_{{{\mathscr {L}}}}\}. \end{aligned}$$

(33)

(Although the space ${{\mathscr {L}}}$ is finite-dimensional, the right-hand side need not be zero, because the restrictions of the operators ${\widehat{H}}_{00}$ and ${\widehat{H}}_{01}$ to ${{\mathscr {L}}}$ are not necessarily isospectral.)
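To see how a finite-dimensional restriction can carry nonzero spectral flow, consider a toy model (an assumption made for illustration, not the operator of this paper): the eigenvalues $\lambda _n(t)=n-qt$ with the index cut off at $|n|\le d$. The sketch counts the net number of levels crossing a reference value $\gamma $; the sign convention (upward crossings counted positively) is also an assumption.

```python
def toy_spectral_flow(q, d, gamma=-0.5):
    """Net number of levels n - q*t, |n| <= d, crossing gamma upward on [0, 1].

    For a continuous family on a fixed finite-dimensional space this equals
    the change in the number of eigenvalues above gamma between t = 0 and
    t = 1, provided gamma is not an eigenvalue at either endpoint.
    """
    levels = range(-d, d + 1)
    above_start = sum(1 for n in levels if n > gamma)
    above_end = sum(1 for n in levels if n - q > gamma)
    return above_end - above_start

flow = toy_spectral_flow(q=2, d=5)
```

In this toy model the spectrum is $\{-5,\dotsc ,5\}$ at $t=0$ and $\{-7,\dotsc ,3\}$ at $t=1$, so the truncated family is not isospectral at the endpoints and the flow does not have to vanish.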

G. Now let us study the spectral flow of the Dirac operator. The same gauge transformation as in B (see Note 2), $\psi \mapsto U_t^{-1}\psi $, where $U_t$ is the operator of multiplication by $e^{iF(x,t)}$, reduces the family ${\widehat{D}}_t=D({\widehat{p}}-{\mathrm {A}}(x,t))$ to the family ${\widehat{D}}_{0t}=D({\widehat{p}}-{\mathrm {A}}_0(t))$,

$$\begin{aligned} {\widehat{D}}_t =U_t D({\widehat{p}}-{\mathrm {A}}_0(t)) U_t^{-1}\equiv U_t {\widehat{D}}_{0t} U_t^{-1}. \end{aligned}$$

Using the homotopy ${\widehat{D}}_{t,\tau }=U_{t\tau } {\widehat{D}}_{0t} U_{t\tau }^{-1}$, we conclude that

$$\begin{aligned} {\text {sf}}\{{\widehat{D}}_t\}={\text {sf}}\{\widehat{D}_{0t}\}. \end{aligned}$$

(34)

The vector functions

$$\begin{aligned} u_{mn}(x)= \begin{pmatrix} e^{i\tfrac{\pi m}{L} x_1 +i\tfrac{2\pi n}{l} x_2}\\ -ie^{-i\tfrac{\pi m}{L} x_1 +i\tfrac{2\pi n}{l} x_2} \end{pmatrix},\quad x\in X, \ m,n\in {\mathbb {Z}},\nonumber \\ \end{aligned}$$

(35)

form an orthonormal basis in ${{\mathscr {H}}}_0$ and satisfy the boundary conditions (8) (see Sect. A.4). Hence they lie in the domain of the Dirac operators. The subspace

$$\begin{aligned} \widetilde{{\mathscr {L}}} ={\text {Lin}}\{u_{mn}:m^2+n^2\le d^2\}\subset {{\mathscr {H}}}_0, \end{aligned}$$

as well as its orthogonal complement $\widetilde{{\mathscr {L}}}^\perp $, is invariant with respect to ${\widehat{D}}_{0t}$, and the restriction of ${\widehat{D}}_{0t}$ to $\widetilde{{\mathscr {L}}}^\perp $ is boundedly invertible (see Sect. A.6). Hence the spectral flow of $\{{\widehat{D}}_{0t}\}$ is equal to that of its restriction to $\widetilde{{\mathscr {L}}}$,

$$\begin{aligned} {\text {sf}}\{{\widehat{D}}_{0t}\}={\text {sf}}\{\widehat{D}_{0t}\big |_{\widetilde{{\mathscr {L}}}}\}. \end{aligned}$$

(36)

H. Consider the mapping $W:{{\mathscr {H}}}\rightarrow {{\mathscr {H}}}_a$ given by the formula

$$\begin{aligned} W\begin{pmatrix} u_B \\ u_A \end{pmatrix} =\begin{pmatrix} \varphi _B \\ \varphi _A \end{pmatrix}, \end{aligned}$$

where

$$\begin{aligned} \varphi _B(x)&=\bigl [e^{i\langle K,x\rangle }u_B(x)\bigr ]\big |_{X_B},\\ \varphi _A(x)&=e^{-\tfrac{5\pi }{6}i}\bigl [e^{i\langle K,x\rangle }u_A(x)\bigr ]\big |_{X_A}. \end{aligned}$$

This mapping can also be described by the formula

$$\begin{aligned} W(u_{mn})=\varphi _{{\overline{m}}+m,{\overline{n}}+n}, \quad m,n\in {\mathbb {Z}}, \end{aligned}$$

and hence its restriction to $\widetilde{{\mathscr {L}}}$ (which we denote by the same letter W) is an isomorphism onto the subspace ${{\mathscr {L}}}$.

I. Since ${{\mathscr {L}}}$ is ${\widehat{H}}_{0t}$-invariant, it follows that the operator

$$\begin{aligned} {\widehat{R}}_t=W^{-1}{\widehat{H}}_{0t}\big |_{{\mathscr {L}}} W:\widetilde{{\mathscr {L}}}\longrightarrow \widetilde{{\mathscr {L}}} \end{aligned}$$

is well defined, and

$$\begin{aligned} {\text {sf}}\{\widehat{H}_{0t}\big |_{{\mathscr {L}}}\}={\text {sf}}\{{\widehat{R}}_t\}. \end{aligned}$$

(37)

K. Now note that ${\widehat{R}}_t\rightarrow \widehat{D}_{0t}\big |_{\widetilde{{\mathscr {L}}}}$ in the operator norm uniformly with respect to $t\in [0,1]$ as $a\rightarrow 0$ (see Sect. A.7). This also implies the resolvent convergence, because $\widetilde{{\mathscr {L}}}$ is finite-dimensional. It follows that

$$\begin{aligned} {\text {sf}}\{{\widehat{R}}_t\}={\text {sf}}\{\widehat{D}_{0t}\big |_{\widetilde{{\mathscr {L}}}}\} \end{aligned}$$

(38)

for sufficiently small a, because the spectral projections of ${\widehat{R}}_t$ converge to those of $\widehat{D}_{0t}\big |_{\widetilde{{\mathscr {L}}}}$ and hence the partition $0=t_0<t_1<t_2<\cdots <t_{n+1}=1$ of the interval [0, 1] and the numbers $\gamma _1,\dotsc ,\gamma _{n+1}\in (-\delta ,\delta )$ in the definition of spectral flow can be chosen to be the same for $\{{\widehat{R}}_t\}$ and $\{\widehat{D}_{0t}\big |_{\widetilde{{\mathscr {L}}}}\}$.

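Now we combine (32)–(34) and (36)–(38) and conclude that

$$\begin{aligned} {\text {sf}}_{{\mathscr {L}}}\{{\widehat{H}}_t\}&={\text {sf}}_{{\mathscr {L}}}\{{\widehat{H}}_{0t}\} ={\text {sf}}\{\widehat{H}_{0t}\big |_{{\mathscr {L}}}\} ={\text {sf}}\{{\widehat{R}}_t\}\\&={\text {sf}}\{\widehat{D}_{0t}\big |_{\widetilde{{\mathscr {L}}}}\} ={\text {sf}}\{{\widehat{D}}_{0t}\} ={\text {sf}}\{{\widehat{D}}_t\}. \end{aligned}$$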

The proof of Theorem 2 is complete. $\square $

4 Partial spectral flow: details

The aim of this section is to give more insight into the notion of partial spectral flow and provide a proof of Theorem 1. A key point in the concept of partial spectral flow is given by condition (iii) in Definition 1, which states that the commutator of projections onto two subspaces is sufficiently small. We study some properties following from such smallness in Sect. 4.1 and then use the results in Sect. 4.2 to prove Theorem 1.

4.1 Almost reducible subspaces

Let ${{\mathscr {H}}}$ be a Hilbert space with inner product $(\,\varvec{\cdot }\,,\,\varvec{\cdot }\,)$, and let ${{\mathscr {L}}}\subset {{\mathscr {H}}}$ be a subspace. A subspace ${{\mathscr {V}}}\subset {{\mathscr {H}}}$ is said to be reducible (with respect to ${{\mathscr {L}}}$, or, more precisely, with respect to the decomposition ${{\mathscr {H}}}={{\mathscr {L}}}\oplus {{\mathscr {L}}}^\perp $) if

$$\begin{aligned} {{\mathscr {V}}}=({{\mathscr {V}}}\cap {{\mathscr {L}}})\oplus ({{\mathscr {V}}}\cap {{\mathscr {L}}}^\perp ). \end{aligned}$$

This is obviously equivalent to the condition $[P_{{\mathscr {L}}},P_{{\mathscr {V}}}]=0$, where $[A,B]=AB-BA$ is the commutator of operators A and B.

Definition 3

Let $\varepsilon \ge 0$. We say that a subspace ${{\mathscr {V}}}\subset {{\mathscr {H}}}$ is $\varepsilon $-reducible with respect to ${{\mathscr {L}}}$ (or simply $\varepsilon $-reducible, provided that ${{\mathscr {L}}}$ is clear from the context) if

$$\begin{aligned} \left\| [P_{{\mathscr {L}}},P_{{\mathscr {V}}}] \right\| \le \varepsilon . \end{aligned}$$

We will also say for brevity that ${{\mathscr {V}}}$ is almost reducible if it is $\varepsilon $-reducible with a sufficiently small $\varepsilon $, where “sufficiently small” means that $\varepsilon <\varepsilon _0$ for some $\varepsilon _0>0$ depending on the context. Namely, each of the subsequent assertions is true for some $\varepsilon _0>0$, and we need all of them (or part of them) to be true for almost reducible subspaces, so we just take the minimum of all the corresponding $\varepsilon _0$.

Consider the quadratic form $A[u]=A(u,u)$ on ${{\mathscr {H}}}$ associated with the Hermitian form

$$\begin{aligned} A(u,v)=(u,P_{{\mathscr {L}}} v)-(u,P_{{{\mathscr {L}}}^\perp }v) \equiv (u,(2P_{{\mathscr {L}}}-1)v).\nonumber \\ \end{aligned}$$

(39)

Let ${{\mathscr {V}}}\subset {{\mathscr {H}}}$ be a finite-dimensional subspace. By $A_{{\mathscr {V}}}[u]$ we denote the restriction of the form A[u] to ${{\mathscr {V}}}$.

Lemma 4

Assume that ${{\mathscr {V}}}$ is $\varepsilon $-reducible with $\varepsilon <\frac{1}{2}$. Then the form $A_{{\mathscr {V}}}$ is nonsingular, and if ${{\mathscr {V}}}={{\mathscr {V}}}_+\oplus {{\mathscr {V}}}_-$ is the decomposition of ${{\mathscr {V}}}$ into the positive and negative subspaces of this form, then

$$\begin{aligned} |A_{{\mathscr {V}}}[u]|\ge (1-2\varepsilon )\left\| u \right\| ^2, \quad u\in {{\mathscr {V}}}_\pm . \end{aligned}$$

(40)

Proof

For brevity, write $P:=P_{{\mathscr {L}}}$ and $Q:=P_{{\mathscr {V}}}$. One has

$$\begin{aligned} A_{{\mathscr {V}}}[u]=(u,Cu),\quad u\in {{\mathscr {V}}}, \end{aligned}$$

where the self-adjoint operator $C:{{\mathscr {V}}}\longrightarrow {{\mathscr {V}}}$ corresponding to the quadratic form $A_{{\mathscr {V}}}$ is given by $C=Q(2P-1)$. Let $u\in {{\mathscr {V}}}$ be an eigenvector of C, $Cu=\lambda u$. Thus, we have

$$\begin{aligned}&\lambda u=Q(2P-1)u=(2P-1)u+2[Q,P]u, \quad \text {or}\\&\quad (2P-1)u=\lambda u+2[P,Q]u. \end{aligned}$$

The operator $2P-1$ is unitary, and $\left\| [P,Q] \right\| \le \varepsilon $. Hence, by the triangle inequality,

$$\begin{aligned} \left\| u \right\| \le |\lambda |\left\| u \right\| +2\varepsilon \left\| u \right\| \quad \Longrightarrow \quad |\lambda |\ge 1-2\varepsilon . \end{aligned}$$

Since $\varepsilon <\frac{1}{2}$, we see that $\lambda \ne 0$ (hence the form $A_{{\mathscr {V}}}$ is nonsingular) and moreover,

$$\begin{aligned} \pm A_{{\mathscr {V}}}[u]\ge (1-2\varepsilon )\left\| u \right\| ^2\quad \text {on }{{\mathscr {V}}}_\pm . \end{aligned}$$

The proof of the lemma is complete. $\square $

We see that $\varepsilon _0=\frac{1}{2}$ for this lemma.
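The smallest nontrivial instance of Lemma 4 can be worked out by hand and checked numerically. In the sketch below (an illustrative special case chosen here, with a hypothetical helper name), ${{\mathscr {H}}}={\mathbb {R}}^2$, ${{\mathscr {L}}}$ is the first coordinate axis, and ${{\mathscr {V}}}$ is the line spanned by $u=(\cos \theta ,\sin \theta )$. Then $\left\| [P_{{\mathscr {L}}},P_{{\mathscr {V}}}] \right\| =|\cos \theta \sin \theta |$ and $A_{{\mathscr {V}}}[u]=\cos 2\theta $, so estimate (40) reduces to $|\cos 2\theta |\ge 1-|\sin 2\theta |$.

```python
import math

def line_in_plane(theta):
    """Data of Lemma 4 for V = span{(cos theta, sin theta)}, L = span{(1, 0)}.

    Returns (eps, form_value, dim_along_L), where eps is the operator norm
    of the commutator [P_L, P_V], form_value is A_V[u] for the unit vector
    u spanning V, and dim_along_L is the positive index of inertia of A_V.
    The last entry is meaningful only when form_value is nonzero (eps < 1/2).
    """
    c, s = math.cos(theta), math.sin(theta)
    # [P_L, P_V] = [[0, c*s], [-c*s, 0]], whose operator norm is |c*s|.
    eps = abs(c * s)
    # A[u] = |P_L u|^2 - |P_{L^perp} u|^2 = c^2 - s^2 for the unit vector u.
    form_value = c * c - s * s
    dim_along_L = 1 if form_value > 0 else 0
    return eps, form_value, dim_along_L

for theta in (0.1, 0.6, 1.2):
    eps, value, dim_l = line_in_plane(theta)
    # Estimate (40): |A_V[u]| >= (1 - 2*eps) * |u|^2 with |u| = 1.
    assert abs(value) >= 1.0 - 2.0 * eps - 1e-12
```

In this example the form degenerates exactly at $\theta =\pi /4$, where the commutator norm equals $\frac{1}{2}$, which illustrates why the lemma requires the strict inequality $\varepsilon <\frac{1}{2}$.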

Definition 4

If a finite-dimensional subspace ${{\mathscr {V}}}\subset {{\mathscr {H}}}$ satisfies the assumptions of Lemma 4, then the dimension of ${{\mathscr {V}}}$ along ${{\mathscr {L}}}$ is defined as

$$\begin{aligned} \dim _{{\mathscr {L}}} {{\mathscr {V}}}=\sigma _+(A_{{\mathscr {V}}}), \end{aligned}$$

where $\sigma _+(A_{{\mathscr {V}}})$ is the positive index of inertia of the form $A_{{\mathscr {V}}}$ (i.e., the dimension of the subspace ${{\mathscr {V}}}_+$ in the decomposition of ${{\mathscr {V}}}$ in Lemma 4).

Lemma 5

If a subspace ${{\mathscr {V}}}\subset {{\mathscr {H}}}$ is $\varepsilon $-reducible with respect to ${{\mathscr {L}}},$ then it is $\varepsilon $-reducible with respect to ${{\mathscr {L}}}^\perp $. Further, if ${{\mathscr {V}}}$ is finite-dimensional and $\varepsilon <\frac{1}{2},$ then

$$\begin{aligned} \dim _{{\mathscr {L}}} {{\mathscr {V}}}+\dim _{{{\mathscr {L}}}^\perp }{{\mathscr {V}}}=\dim {{\mathscr {V}}}. \end{aligned}$$

Proof

It suffices to note that $P_{{{\mathscr {L}}}^\perp }=1-P_{{\mathscr {L}}}$, so that

$$\begin{aligned} {}[P_{{\mathscr {L}}},P_{{\mathscr {V}}}] =-[P_{{{\mathscr {L}}}^\perp },P_{{\mathscr {V}}}], \end{aligned}$$

and further that ${{\mathscr {V}}}_+$ and ${{\mathscr {V}}}_-$ exchange places when we pass from $\varepsilon $-reducibility with respect to ${{\mathscr {L}}}$ to that with respect to ${{\mathscr {L}}}^\perp $. $\square $

Lemma 6

Let ${{\mathscr {V}}}_j\subset {{\mathscr {H}}},$ $j=1,2,$ be orthogonal $\varepsilon $-reducible subspaces. Then their direct sum ${{\mathscr {V}}}_1\oplus {{\mathscr {V}}}_2$ is $2\varepsilon $-reducible. If, moreover, they are finite-dimensional and $\varepsilon <\frac{1}{4},$ then

$$\begin{aligned} \dim _{{\mathscr {L}}}({{\mathscr {V}}}_1\oplus {{\mathscr {V}}}_2)=\dim _{{\mathscr {L}}} {{\mathscr {V}}}_1+\dim _{{\mathscr {L}}} {{\mathscr {V}}}_2. \end{aligned}$$

(41)

Proof

Let ${{\mathscr {V}}}={{\mathscr {V}}}_1\oplus {{\mathscr {V}}}_2$, $Q=P_{{\mathscr {V}}}$, and $Q_j=P_{{{\mathscr {V}}}_j}$, $j=1,2$. We have $Q=Q_1+Q_2$, and so the first assertion is obvious. To prove the second assertion, consider the subspace ${{\mathscr {W}}}={{\mathscr {V}}}_{1+}\oplus {{\mathscr {V}}}_{2+}\subset {{\mathscr {V}}}$. We cannot claim that ${{\mathscr {W}}}={{\mathscr {V}}}_+$; however, we will show that the restriction of the form $A_{{\mathscr {V}}}$ to this subspace (i.e., just the form $A_{{\mathscr {W}}}$) is positive definite. Indeed, let $u\in {{\mathscr {W}}}$. Then $u=u_1+u_2$, $u_j\in {{\mathscr {V}}}_{j+}$, and we have

$$\begin{aligned} A[u]=A[u_1]+A[u_2]+2{\text {Re}}(u_1,(2P-1)u_2), \end{aligned}$$

where $P=P_{{\mathscr {L}}}$. Next,

$$\begin{aligned} (u_1,(2P-1)u_2)&=(u_1,(2P-1)Q_2u_2)\\&=(Q_2u_1,(2P-1)u_2)+2(u_1,[P,Q_2]u_2). \end{aligned}$$

The first term is zero, because $Q_2u_1=0$, and we obtain

$$\begin{aligned} |(u_1,(2P-1)u_2)|\le 2\varepsilon \left\| u_1 \right\| \left\| u_2 \right\| . \end{aligned}$$

Finally,

$$\begin{aligned} A[u]\ge (1-2\varepsilon )\left\| u_1 \right\| ^2+(1-2\varepsilon )\left\| u_2 \right\| ^2 -4\varepsilon \left\| u_1 \right\| \left\| u_2 \right\| . \end{aligned}$$

The discriminant

$$\begin{aligned} D(\varepsilon )=16\varepsilon ^2-4(1-2\varepsilon )^2=16\varepsilon -4 \end{aligned}$$

of the quadratic form on the right-hand side is negative for $\varepsilon <\frac{1}{4}$, and hence the form $A_{{\mathscr {W}}}=A_{{\mathscr {V}}}\big |_{{{\mathscr {W}}}}$ itself is positive definite. We conclude that

$$\begin{aligned} \sigma _+(A_{{\mathscr {V}}})\ge \dim {{\mathscr {W}}}=\dim _{{\mathscr {L}}}{{{\mathscr {V}}}_1}+\dim _{{\mathscr {L}}}{{{\mathscr {V}}}_2}. \end{aligned}$$

(42)

The same reasoning with ${{\mathscr {L}}}$ and ${{\mathscr {L}}}^\perp $ interchanged shows that

$$\begin{aligned} \sigma _-(A_{{\mathscr {V}}})\ge \dim _{{{\mathscr {L}}}^\perp }{{{\mathscr {V}}}_1}+\dim _{{{\mathscr {L}}}^\perp }{{{\mathscr {V}}}_2}. \end{aligned}$$

(43)

Assume that the inequality in (42) is strict. We add (43) to (42) and use Lemma 5 to obtain

$$\begin{aligned} \dim {{\mathscr {V}}}&=\sigma _+(A_{{\mathscr {V}}})+\sigma _-(A_{{\mathscr {V}}})\\&>\dim _{{\mathscr {L}}}{{{\mathscr {V}}}_1}+\dim _{{\mathscr {L}}}{{{\mathscr {V}}}_2} +\dim _{{{\mathscr {L}}}^\perp }{{{\mathscr {V}}}_1}+\dim _{{{\mathscr {L}}}^\perp }{{{\mathscr {V}}}_2}\\&=\dim {{\mathscr {V}}}_1+\dim {{\mathscr {V}}}_2=\dim {{\mathscr {V}}}, \end{aligned}$$

which is a contradiction. Thus, we have the equality in (42), relation (41) holds, and the proof of the lemma is complete. $\square $
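The algebra behind the threshold $\varepsilon <\frac{1}{4}$ in the proof above is easy to verify numerically. The sketch below (helper names are hypothetical) evaluates the discriminant in unexpanded form, compares it with $16\varepsilon -4$, and checks the lower bound for $A[u]$ at $\left\| u_1 \right\| =\left\| u_2 \right\| =1$.

```python
def discriminant(eps):
    """Discriminant of (1-2e)*x^2 - 4e*x*y + (1-2e)*y^2, left unexpanded."""
    return (4.0 * eps) ** 2 - 4.0 * (1.0 - 2.0 * eps) ** 2

def lower_bound_form(eps, x, y):
    """Right-hand side of the estimate for A[u], with x = |u_1|, y = |u_2|."""
    return (1 - 2 * eps) * x * x + (1 - 2 * eps) * y * y - 4 * eps * x * y

for eps in (0.0, 0.1, 0.2, 0.24):
    # The unexpanded discriminant agrees with the simplified form 16*eps - 4.
    assert abs(discriminant(eps) - (16.0 * eps - 4.0)) < 1e-12
    # It is negative below the threshold 1/4, so the bound is positive.
    assert discriminant(eps) < 0
    assert lower_bound_form(eps, 1.0, 1.0) > 0
```

At $\varepsilon =\frac{1}{4}$ and $\left\| u_1 \right\| =\left\| u_2 \right\| $ the lower bound equals zero, so this particular estimate gives no positivity beyond the stated threshold.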

Lemma 7

Let ${{\mathscr {V}}}_t,$ $t\in [a,b],$ be a continuous family of finite-dimensional $\varepsilon $-reducible subspaces, where $\varepsilon <\frac{1}{2}$ and the continuity is understood as the norm continuity of the corresponding family of projections $Q(t)=P_{{{\mathscr {V}}}_t}$. Then $\dim _{{\mathscr {L}}} {{\mathscr {V}}}_t$ is independent of $t\in [a,b]$.

Proof

It is well known that there exists a unitary U(t) continuously depending on t such that the space ${{\mathscr {V}}}=U(t){{\mathscr {V}}}_t$ is independent of t. The operator

$$\begin{aligned} C(t)=U(t)Q(t)(2P-1)U^{-1}(t):{{\mathscr {V}}}\longrightarrow {{\mathscr {V}}}, \end{aligned}$$

which determines the form $A_{{{\mathscr {V}}}_t}$ transferred by U(t) to the fixed subspace ${{\mathscr {V}}}$, continuously depends on t and is nonsingular for all t. Hence $\sigma _+(A_{{{\mathscr {V}}}_t})={\text {const}}$, as desired. The proof of the lemma is complete. $\square $

4.2 Proof of Theorem 1

(a) We need to prove that the right-hand side of (21) is independent of the choice of the partition of the interval [0, 1] and the numbers $\gamma _j$. To compare two such choices, it suffices to consider the case in which both partitions are the same (just take a new partition containing the points of both). Further, we can change the numbers $\gamma _j$ one by one, so it suffices to see what happens if we change just one of them, i.e., replace $\gamma _j$ by some ${{\widetilde{\gamma }}}_j$ on the interval $[t_{j-1},t_j]$. The points $\gamma _j$ and ${{\widetilde{\gamma }}}_j$ do not lie in the spectrum of $B_t$ for any $t\in [t_{j-1},t_j]$. The projection onto the linear span ${{\mathscr {V}}}(B_t,\gamma _j,{{\widetilde{\gamma }}}_j)$ of eigenvectors of $B_t$ corresponding to eigenvalues lying between $\gamma _j$ and ${{\widetilde{\gamma }}}_j$ can be expressed as the contour integral of the resolvent of $B_t$ over a loop crossing the real line at the points $\gamma _j$ and ${{\widetilde{\gamma }}}_j$ (see Fig. 8) and hence continuously depends on $t\in [t_{j-1},t_j]$.

In other words, ${{\mathscr {V}}}(t)={{\mathscr {V}}}(B_t,\gamma _j,{{\widetilde{\gamma }}}_j)$ depends on t continuously on that interval, and $\dim _{{\mathscr {L}}}{{\mathscr {V}}}(t_{j-1})=\dim _{{\mathscr {L}}}{{\mathscr {V}}}(t_j)$ by Lemma 7. Now let us see what changes occur in the sum (21) when replacing $\gamma _j$ by ${{\widetilde{\gamma }}}_j$. Only the $(j-1)$st and jth terms are affected; the number $\dim _{{\mathscr {L}}} {{\mathscr {V}}}(t_{j-1})=\dim _{{\mathscr {L}}}{{\mathscr {V}}}(t_j)$ is added to one of these terms and subtracted from the other by Lemma 6, and so the sum remains unchanged.

The ${{\mathscr {L}}}^\perp $-tameness is a straightforward consequence of Lemma 5, and formula (22) follows from the fact that the sum of positive and negative indices of inertia of a nondegenerate quadratic form is the total dimension of the space where the form is considered. The proof of (a) is complete.

(b) It suffices to prove that

$$\begin{aligned}&{\text {sf}}_{{\mathscr {L}}}\{B(t,0)\} +{\text {sf}}_{{\mathscr {L}}}\{B(1,t)\}\\&\qquad -{\text {sf}}_{{\mathscr {L}}}\{B(t,1)\} -{\text {sf}}_{{\mathscr {L}}}\{B(0,t)\}=0. \end{aligned}$$

The left-hand side of this equation is just the partial spectral flow along ${{\mathscr {L}}}$ of the family obtained by the restriction of $B(t,\tau )$ to the boundary of the unit square (with the counterclockwise sense). The closed contour (the boundary) can be contracted into a point within the unit square, without changing the partial spectral flow. (Indeed, for sufficiently small changes of the contour the partition of the interval and the $\gamma _j$ can remain unchanged, so the partial spectral flow remains constant.) The partial spectral flow of the constant family is obviously zero, and the theorem follows.

The proof of Theorem 1 is complete. $\square $

Remark 2

Condition (23) is a generalization of the isospectrality condition, which looks as follows for the case of partial spectral flow:

For any $\gamma ,{{\widetilde{\gamma }}}\in (-\delta ,\delta )$, where $\delta >0$ is the same as in Definition 1, one has

$$\begin{aligned} \dim _{{\mathscr {L}}} {{\mathscr {V}}}(B(0,\tau ),\gamma ,{{\widetilde{\gamma }}})=\dim _{{\mathscr {L}}} {{\mathscr {V}}}(B(1,\tau ),\gamma ,{{\widetilde{\gamma }}}) \end{aligned}$$

(44)

for all $\tau \in [0,1]$.

In the case of the usual spectral flow (${{\mathscr {L}}}={{\mathscr {H}}}$), this condition becomes the common isospectrality condition: the spectra of $B(0,\tau )$ and $B(1,\tau )$ in a neighborhood of $\lambda =0$ are the same for each $\tau $. Although condition (23) is much weaker than the isospectrality condition (44), it is sufficient for the homotopy invariance to hold.

5 Conclusions

To conclude, let us look at our results from a more general point of view. The transfer of concepts between condensed matter physics and “fundamental physics” such as high-energy physics and cosmology is an important source of innovation in both fields. It is probably enough to mention such concepts as spontaneously broken symmetry and the renormalization group, which revolutionized both fields. Closer to our specific subject, one can refer to the role of graphene as “CERN on the desk,” with the long-awaited physical realizations of the Klein paradox and relativistic atomic collapse [5]. There is, however, a fundamental difference: while in high-energy physics and quantum field theory space-time is assumed to be continuous (although lattices are an extremely useful technical tool [19]), in condensed matter physics the discreteness of crystal lattices is a crucial fact. The difference is especially important when transferring topological concepts to condensed matter physics: from the point of view of topology, a continuum and a discrete lattice are dramatically different. In this paper we have demonstrated, using a specific simple example, that in some cases this transfer can be rigorously justified. Namely, one can conclude that, under certain circumstances, adiabatically growing magnetic fluxes induce electron–hole pair creation in graphene because of the nonvanishing spectral flow of the Dirac operator [23]. The spectral flow of the tight-binding Hamiltonian on the honeycomb lattice is obviously zero; nevertheless, the physical conclusion formulated above remains valid and can be justified via the new concept of partial spectral flow. Although globally the (unbounded, differential) Dirac operator and the (bounded, finite-matrix) Hamiltonian on the honeycomb lattice are completely different, their topological properties are connected in a nontrivial way.
We believe that this example may be of interest for the much more general question of the connections between lattice and continuous models in physics.

Data Availability Statement

This manuscript has no associated data or the data will not be deposited. [Authors’ comment: The work does not involve any computations or experiments, all the information relevant for the proof of our results is contained in the paper.]

Notes

1. Note, however, that the spectral flow occurring in the construction of the Kopnin spectral flow force [17, 18] may well be nonzero even for a finite-dimensional Hamiltonian, because the periodicity condition is not satisfied there.
2. Strictly speaking, not exactly the same; here we deal with functions defined on X, while step B deals with lattice functions defined on $X_a\subset X$.

References

1. A. Shapere, F. Wilczek (eds.), Geometric Phases in Physics (World Scientific, Singapore, 1989)
2. D.J. Thouless, Topological Quantum Numbers in Nonrelativistic Physics (World Scientific, Singapore, 1998)
3. M. Nakahara, Geometry, Topology, and Physics, 2nd edn. (Taylor & Francis, London, 2003)
4. G.E. Volovik, The Universe in a Helium Droplet (Clarendon Press, Oxford, 2003)
5. M.I. Katsnelson, The Physics of Graphene, 2nd edn. (Cambridge University Press, Cambridge, 2020)
6. N.D. Mermin, Rev. Mod. Phys. 51, 591 (1979)
7. X.-L. Qi, S.-C. Zhang, Rev. Mod. Phys. 83, 1057 (2011)
8. F.D.M. Haldane, Rev. Mod. Phys. 89, 040502 (2017)
9. J.M. Kosterlitz, Rev. Mod. Phys. 89, 040501 (2017)
10. M.F. Atiyah, I.M. Singer, Bull. Am. Math. Soc. 69, 422 (1963)
11. M.F. Atiyah, V.K. Patodi, I.M. Singer, Math. Proc. Camb. Philos. Soc. 79, 71 (1976)
12. N.B. Kopnin, V.E. Kravtsov, JETP Lett. 23(11), 578 (1976)
13. G.E. Volovik, JETP Lett. 43(9), 551 (1986)
14. M. Stone, F. Gaitan, Ann. Phys. 178, 89 (1987)
15. N.B. Kopnin, G.E. Volovik, Ü. Parts, Europhys. Lett. 32(8), 651 (1995)
16. T.D.C. Bevan, A.J. Manninen, J.B. Cook, J.R. Hook, H.E. Hall, T. Vachaspati, G.E. Volovik, Nature 386, 689 (1997)
17. N.B. Kopnin, Rep. Prog. Phys. 65, 1633 (2002)
18. G.E. Volovik, JETP Lett. 98, 753 (2014)
19. M. Creutz, Quarks, Gluons and Lattices (Cambridge University Press, Cambridge, 1983)
20. Y. Aharonov, D. Bohm, Phys. Rev. 115, 485 (1959)
21. S. Olariu, I.I. Popescu, Rev. Mod. Phys. 57, 339 (1985)
22. M. Prokhorova, Commun. Math. Phys. 322, 385 (2013)
23. M.I. Katsnelson, V.E. Nazaikinskii, Theor. Math. Phys. 172(3), 1263 (2012)
24. M.V. Berry, R.J. Mondragon, Proc. Roy. Soc. Lond. A 412, 53 (1987)
25. B. Booss-Bavnbek, M. Lesch, J. Phillips, Can. J. Math. 57(2), 225 (2005)
26. V. Nazaikinskii et al., Elliptic Theory on Singular Manifolds (CRC Press, Boca Raton, 2005)

Acknowledgements

The work of MIK was supported by the JTC-FLAGERA Project GRANSPORT. The work of VN was supported by the Ministry of Science and Higher Education of the Russian Federation within the framework of the Russian State Assignment under Contract no. AAAA-A20-120011690131-7. We are grateful to the participants of the joint online seminar between the Theory of Solid State Departments of the University of Hamburg, Uppsala University, and the Radboud University and in particular to Professor Walter van Suijlekom for valuable discussion.

Author information

Authors and Affiliations

Institute for Molecules and Materials, Radboud University, Heyendaalseweg 135, 6525AJ, Nijmegen, The Netherlands
Mikhail I. Katsnelson
Ishlinsky Institute for Problems in Mechanics RAS, 101-1 Vernadsky Ave., Moscow, 119526, Russia
Vladimir Nazaikinskii
Moscow Institute of Physics and Technology, Institutsky Lane 9, Dolgoprudny, Moscow Region, 141700, Russia
Vladimir Nazaikinskii

Corresponding author

Correspondence to Mikhail I. Katsnelson.

Appendix A: Some technical computations

A.1 Proof of Lemma 1

Consider the mapping $\omega :{\mathbb {R}}^2\rightarrow {\mathbb {R}}^2$, $(x_1,x_2)\mapsto (-x_1,x_2)$, and the rectangle

$$\begin{aligned} {\widetilde{X}}=X\cup \omega (X)=[-L,L]\times [0,l], \end{aligned}$$

which we identify with the torus obtained by pasting together the endpoints of each of the two intervals. The lattice $\widetilde{X}_B=X_B\cup \omega (X_A)$ is the natural extension of the lattice $X_B$ from X to ${\widetilde{X}}$, and the mapping $V:{{\mathscr {H}}}_a\rightarrow \ell ^2({\widetilde{X}}_B)$ given by

$$\begin{aligned} {}[Vf](x)={\left\{ \begin{array}{ll} f(x),&{}x_1>0,\\ f(\omega (x))\equiv f(-x_1,x_2),&{}x_1<0, \end{array}\right. } \quad x\in {\widetilde{X}}_B, \end{aligned}$$

is a unitary isomorphism. Note that

$$\begin{aligned} {}[V\varphi _{mn}](x)=e^{i\tfrac{\pi m}{L} x_1 +i\tfrac{2\pi n}{l} x_2},\quad x\in {\widetilde{X}}_B, \end{aligned}$$

(A.1)

and so it suffices to prove that the functions (A.1), where $(m,n)\in G_0$, form an orthonormal basis in $\ell ^2(\widetilde{X}_B)$. To show this, we reduce $G_0$ to a more convenient indexing set. Consider the vectors

$$\begin{aligned} e_1=(2M,N),\quad e_2=(2M,-N). \end{aligned}$$

(A.2)

One can show by a straightforward computation that the functions $\varphi _{mn}(x)$, $x\in X_a$, obey the transformation rule

$$\begin{aligned} \begin{aligned} \varphi _{{\widetilde{m}}\widetilde{n}}(x)&=e^{-\tfrac{2\pi (j+k)i}{3}}\varphi _{mn}(x)\\ \text {if}\quad ({\widetilde{m}},{\widetilde{n}})&=(m,n)+je_1+ke_2,\quad j,k\in {\mathbb {Z}}, \end{aligned} \end{aligned}$$

(A.3)

and so do the functions (A.1); hence we can transform $G_0$ by shifting each element $(m,n)\in G_0$ by some vector of the integer lattice generated by $e_1$ and $e_2$. It is an elementary but tiresome exercise to show that such shifts can be used to reduce $G_0$ to the set $G_1=\{(m,n):-M\le m<M ,\;-N\le n<N\}$. Since the lattice ${\widetilde{X}}_B$ on the torus ${\widetilde{X}}$ is the (skew) product of two one-dimensional lattices on circles with 2M and 2N points, respectively, it readily follows that the functions (A.1) with $(m,n)\in G_1$ (and hence with $(m,n)\in G_0$) indeed form an orthonormal basis in $\ell ^2(\widetilde{X}_B)$. $\square $

A.2 Action of ${\widehat{H}}_{0t}$ on basis vectors

We have

$$\begin{aligned} {\widehat{H}}_{0t}=H({\widehat{p}}-{\mathrm {A}}_0(t)) =\frac{2}{3a} \begin{pmatrix} 0 &{} T({\widehat{p}}-{\mathrm {A}}_0(t)) \\ T^*({\widehat{p}}-{\mathrm {A}}_0(t)) &{} 0 \end{pmatrix}. \end{aligned}$$

The basis functions $\varphi _{mn}(x)$ given by (24) agree with the boundary conditions (6) in the sense that the values of components of these functions prescribed by the boundary conditions at the fictitious nodes outside X are given by the same exponential expressions as the components themselves. As a consequence, the application of a function of ${\widehat{p}}$ to these components amounts to the replacement of ${\widehat{p}}$ by the corresponding wave number. In particular, we have

$$\begin{aligned} {\widehat{H}}_{0t}\varphi _{mn}&=\frac{2}{3a} \begin{pmatrix} T\biggl (-\dfrac{\pi m}{L},\dfrac{2\pi (n-q(t))}{l}\biggr ) e^{-i\tfrac{\pi m}{L} x_1 +i\tfrac{2\pi n}{l} x_2}\\ T^*\biggl (\dfrac{\pi m}{L},\dfrac{2\pi (n-q(t))}{l}\biggr ) e^{i\tfrac{\pi m}{L} x_1 +i\tfrac{2\pi n}{l} x_2} \end{pmatrix}\\&=\frac{2}{3a} T\biggl (-\dfrac{\pi m}{L},\dfrac{2\pi (n-q(t))}{l}\biggr ) \varphi _{-m,n}\\&=\frac{2}{3a}e^{\tfrac{4\pi i}{3}} T\biggl (-\dfrac{\pi m}{L},\dfrac{2\pi (n-q(t))}{l}\biggr ) \varphi _{2{\overline{m}}-m,n}\\&\equiv \mu (m,n,t)\varphi _{2{\overline{m}}-m,n}, \end{aligned}$$

because $T^*(p_1,p_2)=T(-p_1,p_2)$ and in view of the transformation formula (A.3). (Note that ${\overline{m}}=2M$ and so $(2{\overline{m}}-m,n)=(-m,n)+e_1+e_2$.) A straightforward computation using definition (2) of T(p) shows that

$$\begin{aligned} \mu (m,n,t)=\frac{2}{3a}e^{-\tfrac{1}{3}i\alpha } (e^{i\alpha }-2\cos \beta ), \end{aligned}$$

(A.4)

where

$$\begin{aligned} \alpha =\frac{\pi (m-{\overline{m}})}{2M},\quad \beta =\frac{\pi (n-q(t))}{N}. \end{aligned}$$

Further,

$$\begin{aligned} \widehat{H}_{0t}\varphi _{2{\overline{m}}-m,n}=\mu (2{\overline{m}}-m,n,t)\varphi _{mn}, \end{aligned}$$

and we note that

$$\begin{aligned} \frac{\pi ((2{\overline{m}}-m)-{\overline{m}})}{2M}=\frac{\pi ({\overline{m}}-m)}{2M}=-\alpha \end{aligned}$$

and hence $\mu (2{\overline{m}}-m,n,t)=\mu ^*(m,n,t)$.
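The conjugation symmetry just stated can be checked numerically. The following is a minimal sketch that evaluates formula (A.4) directly; the function name `mu` and the sample values of `a`, `M`, `N`, and `q` are hypothetical choices made only for illustration.

```python
import cmath
import math


def mu(m, n, q, a, M, N, m_bar):
    """Evaluate (A.4): mu = (2/(3a)) e^{-i alpha/3} (e^{i alpha} - 2 cos beta)."""
    alpha = math.pi * (m - m_bar) / (2 * M)
    beta = math.pi * (n - q) / N
    phase = cmath.exp(-1j * alpha / 3)
    return (2.0 / (3.0 * a)) * phase * (cmath.exp(1j * alpha) - 2.0 * math.cos(beta))


# Hypothetical sample parameters (not taken from the article).
a = 0.01
M, N = 30, 45
m_bar = 2 * M          # as noted in the text, m_bar = 2M
q = 0.37               # an arbitrary value of q(t)

samples = [(m_bar + 5, 3), (m_bar + 1, -7), (m_bar, 15)]
for m, n in samples:
    lhs = mu(2 * m_bar - m, n, q, a, M, N, m_bar)
    rhs = mu(m, n, q, a, M, N, m_bar).conjugate()
    print(f"(m, n)=({m}, {n}):  |mu(2m_bar-m) - conj(mu(m))| = {abs(lhs - rhs):.2e}")
```

The check works because replacing $m$ by $2{\overline{m}}-m$ flips the sign of $\alpha$ while leaving the real quantity $\cos \beta $ unchanged.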

A.3 Proof of Lemma 2

First, consider a subspace ${{\mathscr {V}}}_{mn}$ with $m>{\overline{m}}$, $(m,n)\in G$. In this case,

$$\begin{aligned} \frac{\pi }{2}\ge \alpha =\frac{\pi (m-{\overline{m}})}{2M}\ge \frac{\pi }{2M}=\frac{3\pi a}{2L}. \end{aligned}$$

Consequently,

$$\begin{aligned} |\mu (m,n,t)|\ge \frac{2}{3a}{\text {Im}} e^{i\alpha } \ge \frac{2}{3a}\sin \frac{3\pi a}{2L} \xrightarrow {a\rightarrow 0}\frac{\pi }{L}, \end{aligned}$$

and the right-hand side is greater than $\pi /(2L)$ for small a.
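For completeness, the first inequality in the last display can be spelled out as follows. This is a short worked step based only on formula (A.4), using $|e^{-i\alpha /3}|=1$ and the fact that $\cos \beta $ is real:

$$\begin{aligned} |\mu (m,n,t)|&=\frac{2}{3a}\bigl |e^{i\alpha }-2\cos \beta \bigr | \ge \frac{2}{3a}\bigl |{\text {Im}}\,(e^{i\alpha }-2\cos \beta )\bigr | =\frac{2}{3a}\sin \alpha ,\\ \sin \alpha&\ge \sin \frac{3\pi a}{2L}\quad \text {for}\quad \frac{3\pi a}{2L}\le \alpha \le \frac{\pi }{2}, \end{aligned}$$

where the second line holds because the sine is increasing on $[0,\pi /2]$.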

Next, consider the subspace ${{\mathscr {W}}}_n$. The eigenvalue in question has the form

$$\begin{aligned} \begin{aligned} \mu ({\overline{m}},n,t)&=\frac{2}{3a}(1-2\cos \beta )\\&=\frac{2}{3a} \biggl [1-2\cos \biggl [\mp \frac{\pi }{3}+\frac{\pi }{N}(n\pm {\overline{n}}-q(t))\biggr ]\biggr ]. \end{aligned} \end{aligned}$$

(A.5)

(Recall that $N=3{\overline{n}}$.) If $|n-q(t)\pm {\overline{n}}|\ge 1$, then

$$\begin{aligned} \frac{2\pi }{3}+\frac{\pi |q(t)|a\sqrt{3}}{l}\ge \left| \beta \pm \frac{\pi }{3}\right| \ge \frac{\pi a\sqrt{3}}{l}, \end{aligned}$$

and hence

$$\begin{aligned} |\mu ({\overline{m}},n,t)|\ge \frac{2}{3a} \frac{\pi a\sqrt{3}}{l}\sin \frac{\pi }{3} =\frac{\pi }{l} \end{aligned}$$

for sufficiently small a. Now we see that it suffices to set

$$\begin{aligned} \delta =\min \biggl \{\frac{\pi }{2L},\frac{\pi }{l}\biggr \}, \quad {\overline{q}}=\max _{t\in [0,1]}|q(t)| \end{aligned}$$

and take a sufficiently small $a_0$. The proof of the lemma is complete. $\square $
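The bound on the subspaces ${{\mathscr {W}}}_n$ can be illustrated numerically. The sketch below assumes the relations $N=l/(a\sqrt{3})$ and $N=3{\overline{n}}$ used in this appendix, takes $n$ in the range $-N\le n<N$, and uses hypothetical sample values for $l$, ${\overline{n}}$, and $q$; it is a sanity check for one lattice size, not a proof.

```python
import math


def mu_bar(n, q, a, l):
    """mu(m_bar, n, t) = (2/(3a)) (1 - 2 cos beta) with beta = pi (n - q) / N."""
    N = l / (math.sqrt(3) * a)   # relation between N, l and a used in the appendix
    beta = math.pi * (n - q) / N
    return (2.0 / (3.0 * a)) * (1.0 - 2.0 * math.cos(beta))


# Hypothetical sample geometry (not taken from the article).
l = 1.0
n_bar = 40
N = 3 * n_bar
a = l / (math.sqrt(3) * N)


def worst_case(q):
    """Smallest |mu(m_bar, n, t)| over all n with |n - q +/- n_bar| >= 1."""
    values = [
        abs(mu_bar(n, q, a, l))
        for n in range(-N, N)
        if abs(n - q - n_bar) >= 1 and abs(n - q + n_bar) >= 1
    ]
    return min(values)


for q in (0.0, 0.25, 0.5, 1.0):
    print(f"q={q}: min |mu| = {worst_case(q):.4f},  pi/l = {math.pi / l:.4f}")
```

In each case the printed minimum should stay above $\pi /l$, in line with the estimate obtained above.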

A.4 Orthogonal basis in ${{\mathscr {H}}}_0$

By analogy with Sect. A.1, the mapping $V_0:{{\mathscr {H}}}_0\rightarrow L^2({\widetilde{X}})$ given by

$$\begin{aligned} {}[V_0f](x)={\left\{ \begin{array}{ll} f(x),&{}x_1\ge 0,\\ f(\omega (x))\equiv f(-x_1,x_2),&{}x_1<0, \end{array}\right. } \quad x\in {\widetilde{X}}, \end{aligned}$$

is a unitary isomorphism. Further, the exponentials

$$\begin{aligned} {}[V_0u_{mn}](x)=e^{i\tfrac{\pi m}{L} x_1 +i\tfrac{2\pi n}{l} x_2}=:e_{mn}(x),\quad x\in {\widetilde{X}}, \end{aligned}$$

(A.6)

with $(m,n)\in {\mathbb {Z}}^2$ form an orthonormal basis in $L^2({\widetilde{X}})$. Hence the original functions $u_{mn}(x)$ form an orthonormal basis in ${{\mathscr {H}}}_0$, as desired. Further, the boundary conditions (8) require that $iu_A(x_1,x_2)=u_B(x_1,x_2)$ for $x_1=0$ and $x_1=L$. For the functions $u_{mn}(x)$, this amounts to the requirement that

$$\begin{aligned} e^{i\tfrac{\pi m}{L} x_1}=e^{-i\tfrac{\pi m}{L} x_1} \end{aligned}$$

for $x_1=0$ and $x_1=L$, which is obviously true.

A.5 Proof of Lemma 3

The proof is based on some properties of the function $e^{iF(x,t)}$ occurring in the definition of the operator $U_t$ (Sect. 3.6, B). Consider the expansion of the function $e^{iF(x,t)}$ restricted to the lattice $X_a$ in the basis functions $\varphi _{mn}$:

$$\begin{aligned} e^{iF(x,t)}=\sum _{(m,n)\in G_0} b(m,n,t)\varphi _{mn}(x),\quad x\in X_a. \end{aligned}$$

We need estimates for the coefficients b(m, n, t). To state these estimates, we introduce the function

$$\begin{aligned} \rho (m,n)=\min _{j,k\in {\mathbb {Z}}} {}[(m+2M(j+k))^2+(n+N(j-k))^2]^{1/2}. \end{aligned}$$

This function is none other than the distance from (m, n) to the nearest point of the integer lattice generated by the vectors $e_1=(2M,N)$ and $e_2=(2M,-N)$ (see (A.2)).
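As an illustration, $\rho (m,n)$ can be evaluated by a brute-force search over a small window of $(j,k)$. This is a minimal sketch; the function name `rho`, the `window` parameter, and the sample values of `M` and `N` are hypothetical, and the window is assumed to be large enough for the arguments used (it is for $|m|\le 3M$, $|n|\le N$).

```python
import math


def rho(m, n, M, N, window=3):
    """Distance from (m, n) to the nearest point of the lattice spanned by
    e1 = (2M, N) and e2 = (2M, -N), searched over |j|, |k| <= window."""
    best = math.inf
    for j in range(-window, window + 1):
        for k in range(-window, window + 1):
            dm = m + 2 * M * (j + k)
            dn = n + N * (j - k)
            best = min(best, math.hypot(dm, dn))
    return best


M, N = 30, 45  # hypothetical sample sizes
print(rho(0, 0, M, N))              # the origin is a lattice point
print(rho(2 * M, N, M, N))          # e1 itself is a lattice point
print(rho(2 * M + 3, N - 4, M, N))  # offset (3, -4) from e1
```

Points that differ by an integer combination of $e_1$ and $e_2$ give the same value, which is the property used in the estimates below.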

Proposition 2

There exists a constant $C>0$ independent of a such that

$$\begin{aligned} |b(m,n,t)|\le \frac{C}{1+\rho ^2(m,n)}. \end{aligned}$$

(A.7)

Proof

Let us continue the function $e^{iF(x,t)}$ from X to $\widetilde{X}$ as an even function of $x_1$. We use the same notation for the continuation as for the function itself. Thus,

$$\begin{aligned} e^{iF(x_1,x_2,t)}=e^{iF(-x_1,x_2,t)},\quad x\in {\widetilde{X}}. \end{aligned}$$

We view ${\widetilde{X}}$ as a torus. Let

$$\begin{aligned} c(m,n,t)=\frac{1}{2lL}\iint _{{\widetilde{X}}} e^{-i\tfrac{\pi m}{L} x_1 -i\tfrac{2\pi n}{l} x_2}e^{iF(x_1,x_2,t)}\,dx_1\,dx_2\nonumber \\ \end{aligned}$$

(A.8)

be the Fourier coefficients of the function $e^{iF(x,t)}$ in the system of exponentials $\{e_{mn}(x)\}$, $(m,n)\in {\mathbb {Z}}^2$. These coefficients satisfy the estimates

$$\begin{aligned} |c(m,n,t)|\le \frac{C_1}{(1+m^2)(1+n^2)}, \quad (m,n)\in {\mathbb {Z}}^2, \end{aligned}$$

which can be derived in a standard way by integration by parts in (A.8) with respect to $x_1$ for $m\ne 0$ and with respect to $x_2$ for $n\ne 0$. The function $e^{iF(x_1,x_2,t)}$ is continuous, but its first derivative with respect to $x_1$ may have jump discontinuities at $x_1=0$ and $x_1=L$. Hence we can integrate at most twice by parts with respect to $x_1$: the second time we get the integrated term, and the factor $(1+m^2)^{-1}$ cannot be improved further. We can integrate as many times as desired with respect to $x_2$, but we just do not need a better estimate than $(1+n^2)^{-1}$. In view of the construction in Sect. A.1, the coefficients b(m, n, t) coincide with the coefficients in the expansion of the restriction of $e^{iF(x_1,x_2,t)}$ to $\widetilde{X}_B$ in the functions (A.1), $(m,n)\in G_0$. In view of the transformation rule (A.3) for the functions (A.1), we have

$$\begin{aligned} b(m,n,t)=\sum _{j,k=-\infty }^{\infty } e^{i\tfrac{2(j+k)\pi }{3}} c\bigl (m+2M(j+k),n+N(j-k)\bigr ), \end{aligned}$$

and so

$$\begin{aligned}&|b(m,n,t)|\nonumber \\&\quad \le \sum _{j=-\infty }^{\infty } \sum _{\begin{array}{c} k=-\infty \\ k=j\bmod 2 \end{array}}^{\infty } \frac{C_1}{(1+(m+2Mj)^2)(1+(n+Nk)^2)}. \end{aligned}$$

(A.9)

Note that $M\le m\le 3M$ and $-N\le n<N$; hence we have

$$\begin{aligned} |m+2Mj|&\ge \frac{1}{2}M|j|\quad \text {for } j\notin \{-1,0\},\\ |n+Nk|&\ge \frac{1}{2}N|k|\quad \text {for } k\notin \{-1,0,1\}. \end{aligned}$$

Now we split the sum (A.9) into four sums $\varSigma _1+\varSigma _2+\varSigma _3+\varSigma _4$, where the summation is

over the set $\varDelta _1$: $j\in \{-1,0\}$ and $k\in \{-1,0,1\}$ for $\varSigma _1$;

over the set $\varDelta _2$: $j\in \{-1,0\}$ and $k\notin \{-1,0,1\}$ for $\varSigma _2$;

over the set $\varDelta _3$: $j\notin \{-1,0\}$ and $k\in \{-1,0,1\}$ for $\varSigma _3$;

over the set $\varDelta _4$: $j\notin \{-1,0\}$ and $k\notin \{-1,0,1\}$ for $\varSigma _4$.

Of course, we also have in mind the condition $k=j\bmod 2$.

The sum $\varSigma _1$ contains three terms,

$$\begin{aligned} \varDelta _1=\{(-1,-1),(0,0),(-1,1)\}. \end{aligned}$$

Since

$$\begin{aligned}&(1+(m+2Mj)^2)(1+(n+Nk)^2)\\&\quad = 1+(m+2Mj)^2 +(n+Nk)^2\\&\qquad {}+(m+2Mj)^2(n+Nk)^2\\&\quad \ge 1+(m+2Mj)^2 +(n+Nk)^2 \ge 1+\rho ^2(m,n) \end{aligned}$$

for any $j=k\bmod 2$, we have

$$\begin{aligned} \varSigma _1\le \frac{3C_1}{1+\rho ^2(m,n)}. \end{aligned}$$

Further,

$$\begin{aligned} \varSigma _2&\le 4C_1\sum _{k\in {\mathbb {Z}}\setminus \{0\}} \frac{1}{N^2k^2} = \frac{C_2}{N^2} \\ \varSigma _3&\le 6C_1\sum _{j\in {\mathbb {Z}}\setminus \{0\}} \frac{1}{(2M)^2j^2} = \frac{C_3}{M^2}\\ \varSigma _4&\le C_1\sum _{j,k\in {\mathbb {Z}}\setminus \{0\}} \frac{1}{N^2k^2}\frac{1}{M^2j^2}=\frac{C_4}{M^2N^2}. \end{aligned}$$

Since the ratio M/N is equal to $L/(l\sqrt{3})$ and does not vary as $a\rightarrow 0$ and $M,N\rightarrow \infty $, we readily see that there exists a constant $C_5$ such that

$$\begin{aligned} 1+\rho ^2(m,n)\le C_5M^2=C_5\frac{L^2}{3l^2}N^2 \end{aligned}$$

for any (m, n). Hence we arrive at (A.7). The proof of the proposition is complete. $\square $

Now we can prove the lemma. One has

$$\begin{aligned} {}[P_{{\mathscr {L}}},{\widetilde{P}}_k]&=P_{{\mathscr {L}}} U_t P_k U_t^{-1}- U_t P_k U_t^{-1}P_{{\mathscr {L}}} \end{aligned}$$

(A.10)

$$\begin{aligned}&=U_t P_k U_t^{-1}(1-P_{{\mathscr {L}}})-(1-P_{{\mathscr {L}}}) U_t P_k U_t^{-1}, \end{aligned}$$

(A.11)

and hence

$$\begin{aligned} \left\| [P_{{\mathscr {L}}},\smash {{\widetilde{P}}_k}] \right\|&\le 2\left\| P_{{\mathscr {L}}} U_t P_k \right\| , \end{aligned}$$

(A.12)

$$\begin{aligned} \left\| [P_{{\mathscr {L}}},\smash {{\widetilde{P}}_k}] \right\|&\le 2\left\| (1-P_{{\mathscr {L}}})U_t P_k \right\| . \end{aligned}$$

(A.13)
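The passage from (A.10) to (A.12) can be made explicit as follows, assuming (as the definition of $U_t$ via multiplication by $e^{iF(x,t)}$ suggests) that $U_t$ is unitary and that $P_{{\mathscr {L}}}$ and $P_k$ are orthogonal projections. The second term in (A.10) is the adjoint of the first, and so

$$\begin{aligned} \left\| U_t P_k U_t^{-1}P_{{\mathscr {L}}} \right\| =\left\| (P_{{\mathscr {L}}} U_t P_k U_t^{-1})^* \right\| =\left\| P_{{\mathscr {L}}} U_t P_k U_t^{-1} \right\| =\left\| P_{{\mathscr {L}}} U_t P_k \right\| , \end{aligned}$$

where the last equality uses the fact that right multiplication by the unitary operator $U_t^{-1}$ does not change the norm. The estimate (A.13) follows from (A.11) in the same way with $P_{{\mathscr {L}}}$ replaced by $1-P_{{\mathscr {L}}}$.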

If Q is a projection, then

$$\begin{aligned} Q U_t P_k u=(\varphi _{{\overline{m}} k}, u)Q (e^{iF(x,t)}\varphi _{{\overline{m}} k}(x)), \end{aligned}$$

and hence

$$\begin{aligned} \left\| Q U_t P_k \right\| =\left\| Q (e^{iF(x,t)}\varphi _{{\overline{m}} k}(x)) \right\| . \end{aligned}$$

We will use the estimate (A.12) if $|k+{\overline{n}}|\le {\overline{q}}$ and the estimate (A.13) if $|k-{\overline{n}}|\le {\overline{q}}$. Consider the latter case. We have

where the prime indicates that the sum is over $(m,n)\in G_0$ satisfying $\rho (m,n+k-{\overline{n}})>d$. (Indeed, $1-P_{{\mathscr {L}}}$ annihilates any basis function $\varphi _{js}$ with $\rho (j-{\overline{m}},s-{\overline{n}})\le d$.) If $\rho (m,n+k-{\overline{n}})>d$, then, by the triangle inequality for the metric generated by $\rho $,

$$\begin{aligned} \rho (m,n)\ge \rho (m,n+k-{\overline{n}})-\rho (0,k-{\overline{n}})>d-{\overline{q}}, \end{aligned}$$

and we have

$$\begin{aligned}&\left\| (1-P_{{\mathscr {L}}})(e^{iF(x,t)}\varphi _{{\overline{m}} k}(x)) \right\| ^2\\&\quad \le \sum _{\begin{array}{c} (m,n)\in G_0\\ \rho (m,n)>d-{\overline{q}} \end{array}} |b(m,n,t)|^2 \le C\sum _{\begin{array}{c} (m,n)\in G_0\\ \rho (m,n)> d-{\overline{q}} \end{array}}\frac{1}{(1+\rho ^2(m,n))^2}\\&\quad \le C\sum _{\begin{array}{c} (m,n)\in {\mathbb {Z}}^2\\ m^2+n^2> (d-{\overline{q}})^2 \end{array}}\frac{1}{(1+m^2+n^2)^2}. \end{aligned}$$

(The last transition can be explained as follows: we shift each point of $G_0$ by some integer linear combination of $e_1$ and $e_2$ so as to ensure that $\rho ^2(m,n)=m^2+n^2$ and then extend the summation to all $(m,n)\in {\mathbb {Z}}^2$ with $m^2+n^2>(d-{\overline{q}})^2$ by adding infinitely many positive terms to the sum.) Since the series $\sum (1+m^2+n^2)^{-2}$ converges, we can find d such that the right-hand side of the last inequality is less than $4^{-1}(4{\overline{q}}+2)^{-1}$.
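How fast the tail of the series $\sum (1+m^2+n^2)^{-2}$ shrinks as the cutoff radius grows can be gauged with a rough numerical sketch. The function name `tail` and the truncation of the infinite sum to a finite box $|m|,|n|\le K$ are assumptions made for illustration; the truncation slightly underestimates the true tail.

```python
def tail(R, K=200):
    """Sum of (1 + m^2 + n^2)^(-2) over integer (m, n) with m^2 + n^2 > R^2,
    truncated to the box |m|, |n| <= K (an approximation of the infinite tail)."""
    total = 0.0
    for m in range(-K, K + 1):
        for n in range(-K, K + 1):
            r2 = m * m + n * n
            if r2 > R * R:
                total += 1.0 / (1.0 + r2) ** 2
    return total


tails = {R: tail(R) for R in (5, 10, 20)}
for R, value in tails.items():
    print(f"R={R:2d}  truncated tail = {value:.5f}")
```

Comparing with the integral $\int _{r>R}2\pi r\,(1+r^2)^{-2}\,dr=\pi /(1+R^2)$ suggests that the tail decays like $R^{-2}$, so a suitable $d$ always exists once the constant $C$ is fixed.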

The proof for the case of $|k+{\overline{n}}|\le {\overline{q}}$ goes along the same lines. Here we use formula (A.12) instead of (A.13), and the role of $d-{\overline{q}}$ is now played by $2{\overline{n}}-d-{\overline{q}}$ (where d has already been computed in the preceding case). Since ${\overline{n}}\rightarrow \infty $ as $a\rightarrow 0$, it remains to take $a$ small enough that $2{\overline{n}}-d-{\overline{q}}>d-{\overline{q}}$, i.e., ${\overline{n}}>d$.

The proof of Lemma 3 is complete. $\square $

A.6 Decomposition of ${\widehat{D}}_{0t}$

The symbol of the operator ${\widehat{D}}_{0t}$ has the form

$$\begin{aligned} D_{0t}(p)=\begin{pmatrix} 0 &{} p_1+ip_2-\frac{2\pi i q(t)}{l} \\ p_1-ip_2+\frac{2\pi i q(t)}{l} &{} 0 \end{pmatrix}. \end{aligned}$$

Using this expression, one can readily compute

$$\begin{aligned} \begin{aligned} {\widehat{D}}_{0t}u_{mn}&=\mu _0(m,n,t)u_{-m,n},\\ {\widehat{D}}_{0t}u_{-m,n}&=\mu _0^*(m,n,t)u_{m,n}, \end{aligned} \end{aligned}$$

(A.14)

where

$$\begin{aligned} \mu _0(m,n,t)=\frac{2\pi (n-q(t))}{l}+i\frac{\pi m}{L}. \end{aligned}$$

We see that the space ${{\mathscr {H}}}_0$ splits into the direct sum of two-dimensional invariant subspaces spanned by $u_{mn}$ and $u_{-m,n}$ for $m>0$ and one-dimensional invariant subspaces spanned by $u_{0n}$. The eigenvalues are $\pm |\mu _0(m,n,t)|\ne 0$ on the two-dimensional subspaces and

$$\begin{aligned} \mu _0(0,n,t)=\frac{2\pi (n-q(t))}{l} \end{aligned}$$

on the one-dimensional subspaces. The latter are obviously nonzero if $|n|>{\overline{q}}$. Since $d>{\overline{q}}$, it follows that all eigenvectors corresponding to zero eigenvalues lie in the space

$$\begin{aligned} \widetilde{{\mathscr {L}}} ={\text {Lin}}\{u_{mn}:m^2+n^2\le d^2\}\subset {{\mathscr {H}}}_0, \end{aligned}$$

which is obviously invariant, because it contains $u_{mn}$ and $u_{-m,n}$ simultaneously. One can readily show that the operator ${\widehat{D}}_{0t}$ is invertible on $\widetilde{{\mathscr {L}}}^\perp $.
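The spectrum on a two-dimensional invariant subspace can be checked directly from (A.14). The following minimal sketch represents a vector $c_+u_{mn}+c_-u_{-m,n}$ by the pair $(c_+,c_-)$ and verifies that $(|\mu _0|,\pm \mu _0)$ are eigenvectors with eigenvalues $\pm |\mu _0|$; the function names and sample values are hypothetical.

```python
import math


def mu0(m, n, q, L, l):
    """mu_0(m, n, t) = 2 pi (n - q) / l + i pi m / L."""
    return complex(2.0 * math.pi * (n - q) / l, math.pi * m / L)


def block_apply(mu, v):
    """Apply the operator from (A.14) to c_plus * u_{mn} + c_minus * u_{-m,n}.
    By (A.14): u_{mn} -> mu * u_{-m,n} and u_{-m,n} -> conj(mu) * u_{mn}."""
    c_plus, c_minus = v
    return (mu.conjugate() * c_minus, mu * c_plus)


def residual(mu, sign):
    """Norm of (D v - sign * |mu| * v) for the candidate eigenvector v = (|mu|, sign * mu)."""
    v = (abs(mu), sign * mu)
    w = block_apply(mu, v)
    lam = sign * abs(mu)
    return math.hypot(abs(w[0] - lam * v[0]), abs(w[1] - lam * v[1]))


# Hypothetical sample values (not taken from the article).
sample = mu0(m=2, n=1, q=0.4, L=1.0, l=1.5)
print("eigenvalue +|mu0| residual:", residual(sample, +1))
print("eigenvalue -|mu0| residual:", residual(sample, -1))
```

Since $m>0$ makes the imaginary part of $\mu _0$ nonzero, neither eigenvalue of such a block can vanish; zero eigenvalues can only come from the one-dimensional subspaces with $n=q(t)$.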

A.7 Convergence of the tight-binding Hamiltonian to the Dirac Hamiltonian on $\widetilde{{\mathscr {L}}}$

It follows from the formula

$$\begin{aligned} {\widehat{H}}_{0t}\varphi _{mn}=\mu (m,n,t)\varphi _{2{\overline{m}}-m,n} \end{aligned}$$

(see (29)) for the tight-binding Hamiltonian and the formula

$$\begin{aligned} W(u_{mn})=\varphi _{{\overline{m}}+m,{\overline{n}}+n} \end{aligned}$$

for the isomorphism $W:\widetilde{{\mathscr {L}}}\rightarrow {{\mathscr {L}}}$ that the operator

$$\begin{aligned} {\widehat{R}}_t=W^{-1}{\widehat{H}}_{0t}\big |_{{\mathscr {L}}} W \end{aligned}$$

acts by the formula

$$\begin{aligned} {\widehat{R}}_t u_{mn}=\mu (m+{\overline{m}},n+{\overline{n}},t)u_{-m,n}. \end{aligned}$$

Thus, by (A.14), to prove the uniform convergence $\widehat{R}_t\rightarrow {\widehat{D}}_{0t}$ as $a\rightarrow 0$, it suffices to prove that

$$\begin{aligned} \mu (m+{\overline{m}},n+{\overline{n}},t)\xrightarrow {a\rightarrow 0}\mu _0(m,n,t) \end{aligned}$$

uniformly with respect to $t\in [0,1]$.

We have (see (A.4))

$$\begin{aligned} \mu (m+{\overline{m}},n+{\overline{n}},t)&=\frac{2}{3a}e^{-\tfrac{1}{3}i\alpha } (e^{i\alpha }-2\cos \beta ),\\ \alpha&=\frac{\pi m}{2M}=3a\frac{\pi m}{2L},\\ \beta&=\frac{\pi }{3}+\frac{\pi (n-q(t))}{N}\\&=\frac{\pi }{3}+\sqrt{3}a\frac{\pi (n-q(t))}{l}. \end{aligned}$$

We expand each factor to first order in $a$ as $a\rightarrow 0$ and obtain

$$\begin{aligned}&\mu (m+{\overline{m}},n+{\overline{n}},t)\\&=\frac{2}{3a} \biggl [1-ia\frac{\pi m}{2L}\biggr ] \biggl [1+3ia\frac{\pi m}{2L}-1+3a\frac{\pi (n-q(t))}{l}\biggr ]\\&\qquad {}+O(a)\\&=\frac{2\pi (n-q(t))}{l}+i\frac{\pi m}{L}+O(a)= \mu _0(m,n,t) +O(a). \end{aligned}$$

Thus, we have arrived at the desired result.
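The $O(a)$ convergence can also be observed numerically. The sketch below evaluates $\mu (m+{\overline{m}},n+{\overline{n}},t)$ from the expressions for $\alpha $ and $\beta $ given above and compares it with $\mu _0(m,n,t)$; it treats $a$ as a continuous parameter (ignoring the requirement that $M$ and $N$ be integers) and uses hypothetical sample values of $L$, $l$, $m$, $n$, and $q$.

```python
import cmath
import math


def mu_shifted(m, n, q, a, L, l):
    """mu(m + m_bar, n + n_bar, t) with alpha = 3 a pi m / (2L) and
    beta = pi/3 + sqrt(3) a pi (n - q) / l."""
    alpha = 3.0 * a * math.pi * m / (2.0 * L)
    beta = math.pi / 3.0 + math.sqrt(3.0) * a * math.pi * (n - q) / l
    phase = cmath.exp(-1j * alpha / 3.0)
    return (2.0 / (3.0 * a)) * phase * (cmath.exp(1j * alpha) - 2.0 * math.cos(beta))


def mu0(m, n, q, L, l):
    """Dirac value mu_0(m, n, t) = 2 pi (n - q) / l + i pi m / L."""
    return complex(2.0 * math.pi * (n - q) / l, math.pi * m / L)


# Hypothetical sample values (not taken from the article).
L, l = 1.0, 1.0
m, n, q = 1, 2, 0.3

errors = {}
for a in (1e-2, 1e-3, 1e-4):
    errors[a] = abs(mu_shifted(m, n, q, a, L, l) - mu0(m, n, q, L, l))
    print(f"a={a:g}  |mu - mu0| = {errors[a]:.3e}  error / a = {errors[a] / a:.2f}")
```

A roughly constant ratio of error to $a$ across the three values is what a first-order remainder would produce.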

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Funded by SCOAP³


About this article

Cite this article

Katsnelson, M.I., Nazaikinskii, V. Partial spectral flow and the Aharonov–Bohm effect in graphene. Eur. Phys. J. C 80, 888 (2020). https://doi.org/10.1140/epjc/s10052-020-08464-z


Received: 06 August 2020
Accepted: 10 September 2020
Published: 25 September 2020
DOI: https://doi.org/10.1140/epjc/s10052-020-08464-z
