Abstract
We investigate some foundational issues in the quantum theory of spin transport, in the general case when the unperturbed Hamiltonian operator \(H_0\) does not commute with the spin operator in view of Rashba interactions, as in the typical models for the quantum spin Hall effect. A gapped periodic one-particle Hamiltonian \(H_0\) is perturbed by adding a constant electric field of intensity \(\varepsilon \ll 1\) in the j-th direction, and the linear response in terms of a S-current in the i-th direction is computed, where S is a generalized spin operator. We derive a general formula for the spin conductivity that covers both the choice of the conventional and of the proper spin current operator. We investigate the independence of the spin conductivity from the choice of the fundamental cell (unit cell consistency), and we isolate a subclass of discrete periodic models where the conventional and the proper S-conductivity agree, thus showing that the controversy about the choice of the spin current operator is immaterial as far as models in this class are concerned. As a consequence of the general theory, we obtain that whenever the spin is (almost) conserved, the spin conductivity is (approximately) equal to the spin-Chern number. The method relies on the characterization of a non-equilibrium almost-stationary state (NEASS), which well approximates the physical state of the system (in the sense of space-adiabatic perturbation theory) and allows moreover to compute the response of the adiabatic S-current as the trace per unit volume of the S-current operator times the NEASS. This technique can be applied in a general framework, which includes both discrete and continuum models.
Similar content being viewed by others
1 Introduction and Main Results
The aim of this paper is to shed some light on the theory of spin transport in gapped (non-interacting) fermionic systems, a problem which is highly relevant to the research on topological insulators (see the end of Sect. 1.2).
The theory of spin transport, as compared to charge transport, is still in a preliminary stage. First, despite two decades of scientific debate, no general consensus has been reached yet about the correct form of the operator representing the spin current density. Denoting by \(H_0\) the unperturbed Hamiltonian operator, by \(\mathbf {X}=(X_1, \ldots , X_d)\) the position operator and by \(S_z\) the operator representing the z-component of the spin, one may considerFootnote 1
-
(i)
the “conventional” spin current operator
$$\begin{aligned} \mathbf {J} _{\mathrm {conv}}^{S_z}:= \frac{1}{2}\big ( \mathrm {i}[H_0,\mathbf {X}] \, S_z + \mathrm {i}S_z \, [H_0,\mathbf {X}] \big ) \end{aligned}$$(1.1) -
(ii)
the “proper” spin current operator
$$\begin{aligned} \mathbf {J}_{\mathrm {prop}}^{S_z} := \mathrm {i}[H_0, \mathbf {X} S_z] \end{aligned}$$(1.2)
Whenever \([H_0,S_z]=0\), the two above definitions agree and the theory of spin transport reduces to the theory of charge transport. However, in general \([H_0,S_z] \ne 0\) in topological insulators, as it happens e.g. in the model proposed by Kane and Mele in view of the so-called Rashba term [26, 33]. As we will explain now (see the related discussion [43]), the lack of commutativity poses technical and conceptual problems for the theory of spin transport, and the main objective of our paper is to clarify some of these issues.
As a second issue, whenever \([H_0,S_z]=0\) the spin conductivity is given, in analogy with charge transport, by a double commutator formula, namely
where \(\tau \) is the trace per unit volume and \(\varPi _0\) the Fermi projector of the gapped system. Formula (1.3), equivalently rewritten in terms of Bloch orbitals, has been considered as the starting point for further analysis of the robustness of the spin conductivity [60, 64], or for a mathematical comparison of spin conductivity and spin conductance [43].
In this paper, we address two foundational questions in spin transport theory:
-
(Q1)
is it possible to derive from the first principles of quantum theory, in the general case \([H_0,S_z]\ne 0\), a double commutator formula for the spin conductivity similar to (1.3)?
-
(Q2)
to which extent is such a formula affected by a different choice of the spin current operator, namely \(\mathbf {J} _{\mathrm {conv}}^{S_z}\) versus \(\mathbf {J}_{\mathrm {prop}}^{S_z}\)?
Moreover, any formula for spin transport coefficients should satisfy the so-called unit cell consistency (UCC), namely the requirement that any prediction on macroscopic transport must be independent of the choice of the fundamental cell [69].
In order to answer these questions, we reconsider the whole approach to quantum transport theory.
1.1 Two Paradigms for Quantum Transport
The usual paradigm is based on the adiabatic switching-on of the perturbing electric field. More specifically, one considers the time-dependent Hamiltonian operator
where \(f :{\mathbb {R}}\rightarrow [0,1]\) is a smooth function such that \(f(s) = 0\) for all \(s\le -1\) and \(f(s) = 1\) for all \(s \ge 0\), i.e. the Hamiltonian describes the process where the perturbation is switched on during the finite time interval \([-1/\eta ,0]\), for \(\eta >0\). As \(\eta \rightarrow 0^+\), the process becomes adiabatic. One assumes that the system is prepared, at some time \(t \le -1/\eta \), in the equilibrium state \(\varPi _0\) and that the switching occurs adiabatically. The state \(\rho _{\varepsilon , \eta }(s)\) at macroscopic time \(s = \eta t\) is given by the solution to the time-dependent Schrödinger equation
The linear response coefficient \(\sigma _A\) of an extensive observable A is defined by comparing the expectation value of A at time \(t_* \ge 0\) (when the perturbation is completely switched on) and in the far past (when the system is in the unperturbed equilibrium state). By considering the adiabatic limit, one defines \(\sigma _A\) by setting
The real part appears in the formula since one does not know a priori whether the conditional cyclicity of the trace per unit volume can be invoked.Footnote 2 The standard approach for obtaining a tractable formula for \(\sigma _A\) is to first approximate \(\rho _{\varepsilon , \eta }(0)\) by first-order time-dependent perturbation theory, and then to formally exchange the small field limit and the adiabatic limit, see e.g. [1, 28]. Choosing \(f(\eta t) = \mathrm {e}^{\eta t} \chi _{(- \infty , 0]}(t) + \chi _{(0, + \infty )}(t)\), this results in Kubo’s formula [39] for the linear response coefficientsFootnote 3
In the case of charge transport, one considers the response of a charge current in the i-th direction, \(i \in \left\{ 1, \ldots , d \right\} \), whose corresponding quantum mechanical operator is \( J_i^{\mathrm {c}} := \mathrm {i}[H_0, X_i]\,,\) and from (1.6) one obtains the formula
The importance of the double commutator formula (1.7) (sometimes dubbed Kubo–Chern formula) cannot be overstated, as it implies e.g. quantization of Hall conductivity in two-dimensional systems [5, 10, 22, 37]. When considering spin transport, we had to face the fact that even the algebra which leads formally to (1.7) becomes cumbersome for spin currents, whenever \([H_0, S_z] \ne 0\). Moreover, the fact that the formula is intrinsic (i.e. does not depend on the choice of the switching function appearing in (1.4) and on the choice of \(t_* \ge 0\)) is not obvious as far as spin currents are concerned.
Thus, we propose an alternative way of computing linear response coefficients based on the non-equilibrium almost-stationary states (NEASS) (see Sect. 4), a concept related to the almost-invariant subspaces in space-adiabatic perturbation theory [52, 53, 67]. Assuming, for the moment, that the state of the system at times when the perturbation has been turned on is approximately given by the NEASS \(\varPi ^\varepsilon \), we find a simple prescription for computing linear (and also higher order) response coefficients: Let \(\varPi ^\varepsilon = \varPi _0 + \varepsilon \varPi _1 + o(\varepsilon )\), then from
one concludes that
In this paper, we will show how to compute formulas for the spin-conductivities based on formula (1.8) for linear response coefficients, instead of (1.6). The advantage of this method is that the operator \(\varPi _1\) is rather explicit, namely \(\varPi _1 = {\mathcal {I}}\big (\overline{[ X_j,\varPi _0]} \big )\), where the overline denotes the operator closure and \({\mathcal {I}}\) is the inverse of the Liouvillian operator \( B \mapsto [H_0, B]\), with integral representation (4.2).
Of course one expects, and formally it is also easy to see, that the two expressions (1.6) and (1.8) agree. However, in the present setting—where expectations are obtained via a trace per unit volume which is only conditionally cyclic—this is not straightforward to prove. Moreover, both formulas are somewhat heuristic: For (1.6) we assumed applicability of time-dependent perturbation theory also for long adiabatic time-scales, while for (1.8) we just postulated that the perturbed system is in the state \(\varPi ^\varepsilon \).
In order to reconcile and justify both approaches, one needs to prove that in the adiabatic regime the dynamical switching drives the initial equilibrium state \(\varPi _0\) approximately into the NEASS \(\varPi ^\varepsilon \), i.e. that the state \(\rho _{\varepsilon , \eta }(t )\) is close to \(\varPi ^\varepsilon \). Indeed, it is shown in [41] that for times \(t \ge 0\) and any \(n,m\in {\mathbb {N}}^*\)
uniformly on bounded intervals in (macroscopic) time. Here \(I_{m,\varepsilon } = [\varepsilon ^m ,\varepsilon ^{1/m}]\) is an interval of admissible time-scales for the switching. Too slow switching (\(\eta \ll \varepsilon ^m\) for all \(m\in {\mathbb {N}}^{*}\)) must be excluded, because due to tunneling the NEASS decays on such long times-scales, while too fast switching (\(1\gg \eta \gg \varepsilon ^{1/m}\) for all \(m\in {\mathbb {N}}^*\)) would merely yield an error o(1) on the right hand side of (1.9) (see also [28, 68]). In other words, the initial equilibrium state \(\varPi _0\) dynamically evolves into the NEASS independently of the shape of the switching function up to lower order errors.
1.2 Main Results on Spin Transport and Conductivity
By using the NEASS paradigm, we will answer the questions (\(Q_1\)) and (\(Q_2\)) stated before, at least in the periodic setting. Let us shortly summarize the main results in the paper.
We consider a crystalline system of non-interacting fermions, whose one-body Hamiltonian \(H_0\) is periodic. This operator acts on the Hilbert space \({\mathcal {H}}= L^2({\mathcal {X}}) \otimes {\mathbb {C}}^N\), where either \({\mathcal {X}}={\mathbb {R}}^d\) (continuum case) or \({\mathcal {X}}\subset {\mathbb {R}}^d\) is a discrete set (discrete case), and N is the number of internal degrees of freedom of the particle, which may include spin; periodicity of \(H_0\) is understood with respect to (magnetic) translations along vectors in a Bravais lattice \(\varGamma \simeq {\mathbb {Z}}^d \). We assume that the Hamiltonian \(H_0\) has a spectral gap and that the initial state of the system is given by the spectral projection \(\varPi _0\) on the bands below this gap (Fermi projector). The system is driven out of equilibrium by applying a constant electric field of intensity \(\varepsilon \ll 1\) pointing in the j-th direction, \(j \in \left\{ 1,\ldots ,d \right\} \). Hence, the stationary Hamiltonian of the perturbed system is \(H^\varepsilon = H_0 - \varepsilon X_j\), where \(X_j\) is the j-th component of the position operator.
We consider a generalized spin operator in the form \(S = \mathbb {1}_{L^2({\mathcal {X}})} \otimes s\) and—denoting by \(J_{\mathrm {conv},i }^S:=\frac{1}{2}\big ( \mathrm {i}[H_0, X_i] \, S + \mathrm {i}S \, [H_0, X_i] \big )\) and \(J_{\mathrm {prop},i }^S = \mathrm {i}[H_0, X_i S]\) the corresponding conventional and proper S-current operator—we define the conventional and proper S-conductivity, respectively, asFootnote 4
In view of the controversy on the choice of the spin current operator discussed at the beginning of Introduction, we find convenient the decomposition
where we have defined the operators
In this decomposition, the S-orbital term \({\mathsf {O}}\) contains the contribution associated with the conventional S-current operator, while the S-rotation term \(X_i{\mathsf {R}}\) contains corrections related to the replacement of the latter with the proper S-current operator. More precisely, we prove in Theorem 5.6 that splitting (1.11) leads to
where
with \(A^{\mathrm {D}}\) (resp. \(A^{\mathrm {OD}}\)) referring to the diagonal (resp. off-diagonal) part of the operator A with respect to the orthogonal decomposition induced by \(\varPi _0\), and the rotation S-conductivity is
Notice that the first line of (1.14) is in the form of a current-current correlation at the equilibrium, involving the conventional S-current and the charge current, while the second line involves \(\varPi _1\). Moreover, the trace per unit volume in (1.14) and (1.15) can be replaced with the ordinary trace of the operator restricted to the fundamental cell, up to a volume factor, as in the statement of Theorem 5.6, even if the operator appearing in (1.15) is not periodic.
In order to analyze the S-rotation contribution \(\sigma _{\mathrm {rot},ij }^S\), we preliminary prove in Proposition 5.4(i) that for any bounded periodic observable B, satisfying suitable regularity properties, the expectation of the B-torque operator \(\mathrm {i}[H_0, B]\) on \(\varPi _1\) is given by a double commutator formula, namely
where the operator \({{\mathcal {T}}_B}\) may be dubbed B-torque response in agreement with [43]. If, in addition, \([B,X_j]=0\) then \(\tau \big ( \mathrm {i}[H_0, B] \, \varPi _1 \big )=0\), as stated in Proposition 5.4(ii). Physically, this result means that even if \(\mathrm {i}[H_0, B]\ne 0\), the fact that B commutes with the perturbation \(-\varepsilon X_j\) implies the mesoscopic conservation of the observable B, at least within first order approximation in the NEASS. In particular, when \(B=S_z\) (or for any generalized spin operator S, see Corollary 5.5), we have that the expectation of the spin-torque on \(\varPi _1\) equals the expectation of the spin-torque response \({\mathcal {T}}_{S_z}\) and that the latter vanishes, in agreement with [43, Theorem 2.8]. Notice that the vanishing of the expectation of the spin-torque response is a condition singled out in [43] to obtain the equality of spin conductivity and spin conductance in two-dimensional systems.
As a further step, we consider the unit cell consistency (UCC) of both the contributions to the proper S-conductivity appearing in (1.13). We prove in Proposition 5.8 that \(\sigma _{\mathrm {conv}}^S\) always satisfies UCC, while for the additional contribution \(\sigma ^S_{\mathrm {rot}}\) we can prove UCC only if the model enjoys a discrete rotational symmetry, in agreement with the claim in [62] that the use of \(\mathbf {J}_{\mathrm {prop}}^{S_z}\) is “possible for systems where the spin generation in the bulk is absent due to symmetry reasons”. In Proposition 5.9, we isolate a subclass of discrete models, enjoying a discrete rotational symmetry and a further property, such that \(\sigma _{\mathrm {conv}}^S = \sigma _{\mathrm {prop}}^S\). Remarkably, the paradigmatic model proposed by Kane and Mele is in this class. A crucial consequence is that, for this class of models, the choice of the spin current operator (either \(\mathbf {J}^S_{\mathrm {conv}}\) or \(\mathbf {J}^S_{\mathrm {prop}}\)) is immaterial as far as the S-conductivity is concerned.
While the paper is focused on transport theory, one of our long-term goals is to clarify the relation between the spin transport coefficients and the topological invariants associated with quantum spin Hall (QSH) insulators. These materials, theoretically predicted in [33, 34] and soon experimentally realized [38, 65], display dissipationless edge spin currents, which are robust against continuous deformations of the model and disorder [60]. A crucial issue, both for fundamental understanding and for potential applications, is whether there exists a bulk topological invariant “protecting” the QSH effect. Two candidates have been extensively investigated in the literature. First, the \({\mathbb {Z}}_2\)-valued index proposed by Fu, Kane and Mele [21, 34], whose definition and geometric properties rely on the fermionic time-reversal symmetry of the system [15, 20, 23]. Second, the (half-)integer-valued spin-Chern number, introduced in [61] via spin dependent boundary conditions, and later intrinsically redefined by Prodan as a bulk invariant [56], which relies instead on the almost-conservation of spin, and is associated with robust spin edge currents [36, 55, 58, 59].
Our analysis establishes a direct relation between the bulk spin conductivity and the spin-Chern number, in agreement with the (recent) discovery that QSH plateaux may persist under broken time-reversal symmetry [18]. Indeed, whenever spin is conserved, our results yield that the (bulk) spin conductivity equals the spin-Chern number (Remark 5.12). Moreover, the result is robust: If spin is approximately conserved, with errors of order \({{\mathcal {O}}}(\lambda )\), then the mentioned equality holds true up to a correction of order \({{\mathcal {O}}}(\lambda )\) (Proposition 5.13), in analogy with the persistence of edge spin currents proved in [58].
In summary, our paper contributes to put spin transport theory on a firm mathematical ground: We derive a new formula for the spin conductivity which covers both the choice of the conventional and the proper spin current operator; we isolate conditions under which UCC is satisfied and additional conditions which guarantee that \(\sigma _{\mathrm {conv}}^S = \sigma _{\mathrm {prop}}^S\); we make connection with the spin-Chern number. We hope that our mathematical investigations will contribute to clarify some of the controversies in the emerging and promising field of spintronics, and will stimulate a fruitful exchange of ideas between mathematicians and solid state physicists. While, for technical reasons, this paper focuses on the case of periodic non-interacting systems, we are confident that our approach can be suitably generalized to random and interacting systems.
1.3 Further References to the Literature
Several different mathematical problems have been labeled “proving Kubo’s formula” and a short review highlighting the differences appears elsewhere [28]. Here we only mention a few works without going into any detail: A similar approach to the one we use was employed in [66], where Kubo’s formula for the Hall conductivity of simple isolated bands is derived using semi-classical methods. The rigorous derivation of Kubo’s formula for interacting fermionic systems on the lattice has recently been done in [7, 46, 68], where [7, 46] consider only situations where the perturbation does not close the spectral gap. A similar result for non-interacting fermions in the continuum is in preparation [42, 43], generalizing a previous result [19] which also assumes a non-closing-gap condition. In many other works, Kubo’s formula for the Hall conductivity is taken as a starting point and the objective is to prove quantization of the Hall plateaux also in presence of disorder, assuming the Fermi energy lies in a mobility gap [1, 10, 12], or including interaction effects [6, 24, 27], with the aim of proving universality of the Hall conductivity. Moreover, the linear response to a quenched perturbation has been recently analyzed in [14]. Finally, in [13, 17] (and references therein) mathematical frameworks are developed, within which the applicability of linear response theory in very general random resp. interacting systems can be established. However, a rigorous justification of Kubo’s formula for the quantum Hall conductivity in situations with mobility gap is still a completely open problem, even in the case of non-interacting systems on the lattice.
Linear response theory can also be considered in the case of heat or charge fluxes induced by thermodynamical (i.e. non-mechanical) driving forces, such as deviations of temperature or chemical potential from their equilibrium values. In this context, the validity of the Green–Kubo formula has been extensively investigated in algebraic quantum statistical mechanics, by relating it to the structure of non-equilibrium steady states [29,30,31,32].
The field of spintronics and of quantum transport of spin is relatively new, but has already attracted a lot of attention both in the physics and mathematics communities. Results concerning the quantization and robustness of spin Hall currents in the presence of disorder [56, 58] and of interactions [3, 44] also rely, to some extent, on a Kubo-like formula. We foresee the possibility of adapting the techniques developed in [46, 68] to derive such formulas from first principles also in the context of interacting fermions on a lattice.
2 Periodic Operators and Trace Per Unit Volume
In condensed matter physics, it is customary to describe crystalline solids by means of periodic Hamiltonian operators. The appropriate trace-like functional used to compute thermodynamic expectations of periodic observables is given by the trace per unit volume. This Section is devoted to recall some generalities about this framework.
Let \({\mathcal {X}}\) denote the configuration space of a d-dimensional crystal. We will treat both continuum models, in which \({\mathcal {X}}={\mathbb {R}}^d\) equipped with the Lebesgue measure, and discrete models, in which \({\mathcal {X}}\subset {\mathbb {R}}^d\) is a discrete set of points arranged in a crystalline structure, equipped with the counting measure (in \(d=2\) think of the square lattice \({\mathbb {Z}}^2\) or of the honeycomb structure, for example). In general, “crystalline structure” means that we assume the existence of a Bravais lattice
that acts on \({\mathcal {X}}\) by translations, i.e. \(\mathrm {T}_\gamma x:= x+\gamma \) for \(\gamma \in \varGamma \) defines a group action \(\mathrm {T}:\varGamma \times {\mathcal {X}}\rightarrow {\mathcal {X}}\).
We consider the one-particle Hilbert space
for a particle moving on \({\mathcal {X}}\) and having N internal degrees of freedom (e.g. spin). In the following we will write elements of \({\mathcal {H}}\) as \({\mathbb {C}}^N\)-valued functions on \({\mathcal {X}}\). We assume that there is a unitary representation T of \(\varGamma \) on \({\mathcal {H}}\) by (magnetic) translation operators
where \(M: \varGamma \times {\mathcal {X}}\rightarrow {\mathcal {U}}({\mathbb {C}}^N)\) are unitaries satisfying the cocycle conditionFootnote 5
Position operators for \(j \in \, \left\{ 1, \ldots , d \right\} \) are defined via
An operator A on \({\mathcal {H}}\) is called periodic or, more specifically, \(\varGamma \)-periodic if \([A, T_\gamma ] = 0\) for all \(\gamma \in \varGamma \). The following simple observation, whose proof is omitted, is very useful.
Lemma 2.1
For any periodic operator A, the operator \([A, X_j]\) is also periodic.
Notice that, in general, the operator \([A, X_j]\) might be non-densely defined or even defined on the trivial subspace \( \left\{ 0 \right\} \), as pointed out in [35, III-§5.1]. This pathology will not appear for the specific operators we will consider in the following sections.
The analysis of periodic operators is best performed in the so-called (magnetic) Bloch–Floquet–Zak representation (see e.g. [40, 45, 50] and references therein). The (magnetic) Bloch–Floquet–Zak transform is initially defined on compactly supported functions \(\psi \in C_0 ({\mathcal {X}},{\mathbb {C}}^N)\subset L^2({\mathcal {X}},{\mathbb {C}}^N)\) as
By construction, for fixed \(k\in {\mathbb {R}}^d\), the function \(({\mathcal {U}}_{\mathrm {BF}}\psi )(k,\cdot )\) is periodic with respect to the magnetic translations (2.2); hence, it defines an element in the Hilbert space
where the norm refers to a fundamental cell \({{\mathcal {C}}}_1\) for \(\varGamma \) (see (2.7)). As functions of k, elements in the range of \({\mathcal {U}}_{\mathrm {BF}}\) are not periodic with respect to the reciprocal lattice \(\varGamma ^*\), but rather \(\varrho \)-equivariant, namely
whereFootnote 6
defines a unitary representation of \(\varGamma ^*\) on \({\mathcal {H}}_{\mathrm {f}}\). The map defined by (2.4) extends to a unitary operator
where \({\mathcal {H}}_\varrho \equiv L^2_{\varrho }({\mathbb {R}}^d,{\mathcal {H}}_{\mathrm {f}})\) is the space of locally-\(L^2\), \({\mathcal {H}}_{\mathrm {f}}\)-valued, \(\varrho \)-equivariant functions on \({\mathbb {R}}^d\). Denoting by \({\mathbb {B}}^d\) a fundamental domain for \(\varGamma ^*\), the inverse transformation \({\mathcal {U}}_{\mathrm {BF}}^{-1} :{\mathcal {H}}_\varrho \rightarrow {\mathcal {H}}\), sometimes dubbed Wannier transform, is explicitly given by
At least formally, a periodic operator A on \({\mathcal {H}}\) becomes a covariant fibered operator on \({\mathcal {H}}_\varrho \). More precisely, taking into account the following inclusion and natural isomorphism
one has
where each A(k) acts on \({\mathcal {H}}_{\mathrm {f}}\) and satisfies the covariance property \(A(k+\gamma ^*) = \varrho (\gamma ^*) \, A(k) \, \varrho (\gamma ^*)^{-1} \) for all \(k\in {\mathbb {R}}^d\) and \(\gamma ^*\in \varGamma ^*\).
Most relevant extensive observables in crystalline systems are periodic self-adjoint operators. However, in an infinite system neither these periodic extensive observables nor translation invariant states are trace class. The appropriate functional is instead given by the trace per unit volume \(\tau \), which is well suited to take into account invariance or covariance by discrete lattice translations in the setting of periodic or more generally ergodic operators [2, 9, 12, 54]. The trace per unit volume is defined as follows (compare [12, Prop. 3.20]). Denote by \(\chi _\varOmega \) the orthogonal projection on \({\mathcal {H}}\) which multiplies by the characteristic function of \(\varOmega \subset {\mathcal {X}}\). For any \(L\in 2{\mathbb {N}}+1\), we set
and \(\chi _L:=\chi _{{{\mathcal {C}}}_L}\). The set \({{\mathcal {C}}}_1\) is called a fundamental or primitive (unit) cell. It is not unique since the choice of the spanning vectors \(\{a_j\}_{1\le j\le d}\) for \(\varGamma \) (see (2.1)) is not unique. Notice that, restricting to odd integers \(L\in 2{\mathbb {N}}+1\), one has the convenient decompositionFootnote 7
We call an operator A acting in \({\mathcal {H}}\) trace class on compact sets if \(\chi _K A \chi _K\) is trace class for all compact sets \(K \subset {\mathcal {X}}\).Footnote 8
Definition 2.2
(Trace per unit volume). Let A be an operator acting in \({\mathcal {H}}\) such that A is trace class on compact sets. The trace per unit volume of A is defined as
whenever the limit exists.
Let us denote
We will refer to operators in \({\mathcal {B}}_1^\tau \) as the operators of trace-per-unit-volume class or \(\tau \)-class for simplicity. Moreover, in view of [12, Proposition 3.17] we have \({\mathcal {B}}_\infty ^\tau \,\cdot \, {\mathcal {B}}_1^\tau \subset {\mathcal {B}}_1^\tau \) and \({\mathcal {B}}_1^\tau \,\cdot \, {\mathcal {B}}_\infty ^\tau \subset {\mathcal {B}}_1^\tau \), and
The following lemma recalls some useful properties of \(\tau \)-class operators.
Lemma 2.3
Let \(A\in {\mathcal {B}}_1^\tau \). Then
and
In particular, we have that A is trace class on compact sets.
Proof
Inequality (2.11) is proved in [12, Lemma 3.10] and the proof given there easily generalizes to obtain also (2.12). \(\square \)
The next result allows to compute the trace per unit volume of operators which are periodic and trace class on compact sets.
Proposition 2.4
-
(i)
Let A be periodic and trace class on compact setsFootnote 9. Then, \(\tau (A)\) is well defined and
$$\begin{aligned} \tau (A) =\frac{1}{\left|{{\mathcal {C}}}_1\right|} {{\,\mathrm{Tr}\,}}(\chi _1 A \chi _1). \end{aligned}$$(2.13) -
(ii)
Let A be a periodic and bounded operator acting on \({\mathcal {H}}\). Denoting by
$$\begin{aligned} {\mathcal {U}}_{\mathrm {BF}}\, A \, {\mathcal {U}}_{\mathrm {BF}}^{-1} = \int _{{\mathbb {R}}^d}^{\oplus }\mathrm {d}k\, A(k) \end{aligned}$$its Bloch–Floquet–Zak decomposition, assume that A(k) is trace class and that \({{\,\mathrm{Tr}\,}}_{{\mathcal {H}}_{\mathrm {f}}}(\left|A(k)\right|)<C\) for all \(k\in {\mathbb {B}}^d\). Then
$$\begin{aligned} {{\,\mathrm{Tr}\,}}(\chi _1 A \chi _1) = \frac{1}{|{\mathbb {B}}^d|} \int _{{\mathbb {B}}^d} \mathrm {d}k\,{{\,\mathrm{Tr}\,}}_{{\mathcal {H}}_{\mathrm {f}}}(A(k)). \end{aligned}$$(2.14)
Proof
(i). In view of the decomposition (2.8) and the hypotheses on A, one has
Since \(\left|{{\mathcal {C}}}_L\right|=L^d\left|{{\mathcal {C}}}_1\right|=\mathrm {card}(\varGamma \cap {{\mathcal {C}}}_L)\left|{{\mathcal {C}}}_1\right|\) for every \(L\in 2{\mathbb {N}}+1\), one obtains
(ii). This is proved e.g. in [51, Lemma 3]. \(\square \)
In the following result, we introduce a class of operators which are not necessarily in \({\mathcal {B}}_1^\tau \), but have finite trace per unit volume.
Proposition 2.5
Let A be periodic and trace class on compact sets\(^{(9)}\). Then
-
(i)
the operator \(X_jA\) for \(j \in \left\{ 1,\ldots ,d \right\} \) has finite trace per unit volume and
$$\begin{aligned} \tau (X_j A ) = \frac{1}{\left|{{\mathcal {C}}}_1\right|}{{\,\mathrm{Tr}\,}}\left( \chi _1 X_j A \chi _1 \right) . \end{aligned}$$(2.15) -
(ii)
If, in addition \(\tau (A)=0\), then \(\tau (X_j A)\) does not depend on the exhaustionFootnote 10\({{\mathcal {C}}}_L\nearrow {\mathcal {X}}\) set in Definition 2.2 and on the choice of the origin, in the sense that
$$\begin{aligned} \tau ((X_j +\alpha ) A )=\tau (X_j A )\quad \forall \,\alpha \in {\mathbb {R}}. \end{aligned}$$
Proof
(i). Since \(\chi _L X_j \chi _L\) is bounded for every \(L\in 2{\mathbb {N}}+1\) and A is trace class on compact sets by hypothesis, we have that \(\chi _L X_j A \chi _L=\chi _L X_j\chi _L A\chi _L\) is trace class. Therefore, in view of the decomposition (2.8), one has that
Using that A is periodic, that \([T_{\gamma },X_j]=-\gamma _j T_{\gamma }\), and the result from Proposition 2.4(i), we obtain that
Consequently, we get that
Since both \(\gamma \) and \(-\gamma \) are in \(\varGamma \cap {{\mathcal {C}}}_L\) for all \(L\in 2{\mathbb {N}}+1\), the sum in brackets on the right-hand side of the above vanishes, and the thesis follows immediately.
(ii). The statement follows from (2.16) and the hypothesis \(\tau (A)=0.\) \(\square \)
A property which will be fundamental for all the following analysis is the conditional cyclicity of the trace per unit volume. We state it in the following lemma, whose proof can be found in [12, Lemma 3.22].
Lemma 2.6
(Conditional cyclicity of the trace per unit volume). If \(A\in {\mathcal {B}}_1^\tau \) and \(B\in {\mathcal {B}}_\infty ^\tau \), then \(\tau (AB)=\tau (BA).\)
The trace per unit volume is defined in (2.9) through a specific choice of the cell \({{\mathcal {C}}}_L\), which in turn depends via (2.7) on the choice of a particular linear basis \(\{a_1,\ldots ,a_d\}\) for \(\varGamma \). The term unit cell consistency refers to the requirement that physically relevant quantities are independent of the latter choice. Precisely, one considers a different linear basis \( \left\{ \tilde{a}_1, \ldots , \tilde{a}_d \right\} \) for \(\varGamma \) and the corresponding cell, defined by
and sets \(\widetilde{\chi }_L:=\chi _{\widetilde{{{\mathcal {C}}}}_L}\). Denoting by \(\tau (\,\cdot \,)\) and \(\widetilde{\tau }(\,\cdot \,)\), respectively, the trace per unit volume induced by the choice of the primitive cells \({{\mathcal {C}}}_1\) and \(\widetilde{{{\mathcal {C}}}}_1\), we prove in Proposition A.2(i) that for any periodic operator A, which is trace class on compact sets, one has that
When a contribution to the transport coefficient is in the form \(\tau (X_i A)\) for a periodic operator A, as in formula (1.15), a more careful analysis is needed, as discussed at the end of Sect. 5.1 and in Appendix A.
3 The Unperturbed Model
Our goal is to study the linear response of a crystalline system to the application of an external electric field of small intensity. Before considering the perturbed system, we state our assumptions on the unperturbed one.
Assumption 3.1
We assume the following:
-
(\(\hbox {H}_{1}\)) the Hamiltonian \(H_0\) of the unperturbed system is a self-adjoint periodic operator on \({\mathcal {H}}\), bounded from below, such that in Bloch–Floquet–Zak representation its fibration
$$\begin{aligned} H_0: {\mathbb {R}}^d\rightarrow {\mathcal {L}}( {\mathcal {D}}_{\mathrm {f}},{\mathcal {H}}_{\mathrm {f}})\,,\quad k\mapsto H_0(k) \,, \end{aligned}$$is a smooth equivariant map taking values in the self-adjoint operators with dense domain \({\mathcal {D}}_{\mathrm {f}}\subset {\mathcal {H}}_{\mathrm {f}}\), such that \(\varrho (\gamma ^*):{\mathcal {D}}_{\mathrm {f}}\rightarrow {\mathcal {D}}_{\mathrm {f}}\) for every \(\gamma ^*\in \varGamma ^*\) (compare (2.5)). Here \({\mathcal {L}}( {\mathcal {D}}_{\mathrm {f}},{\mathcal {H}}_{\mathrm {f}})\) denotes the space of bounded linear operators from \({\mathcal {D}}_{\mathrm {f}}\), equipped with the graph norm of \(H_0(0)\) denoted by \(\left\| \,\cdot \, \right\| _{{\mathcal {D}}_{\mathrm {f}}}\)Footnote 11 , to \({\mathcal {H}}_{\mathrm {f}}\simeq L^2({{\mathcal {C}}}_1) \otimes {\mathbb {C}}^N\);
-
(\(\hbox {H}_{2}\)) let \(\mu \in {\mathbb {R}}\) (Fermi energy) be in a spectral gapFootnote 12 of \(H_0\). We denote by \(\varPi _0 = \chi _{(-\infty , \mu )}(H_0)\) the corresponding spectral projector (Fermi projector). We assume that its fibration \(k\mapsto \varPi _0(k)\) takes values in the finite-rank projections on \({\mathcal {H}}_{\mathrm {f}}\).Footnote 13
\(\diamond \)
We shortly discuss sufficient conditions implying that Assumption (\(\hbox {H}_{1}\)) holds true. As far as discrete models are concerned, \({\mathcal {H}}_{\mathrm {f}}\) is finite dimensional and \(H_0(k)\) are self-adjoint matrices. The smoothness of the map \(k \mapsto H_0(k)\) follows from the fact that the hopping amplitudes in the model \( \left\{ t_{\gamma } \right\} _{\gamma \in \varGamma }\) decay sufficiently fast as \(|\gamma | \rightarrow \infty \). In all the most popular discrete models of topological insulators [25, 33] the hopping amplitudes have finite range, namely \(t_\gamma =0\) if \(|\gamma | > R\) for some R, hence assumption (\(\hbox {H}_{1}\)) is automatically satisfied.
As for the continuum case \({\mathcal {X}}= {\mathbb {R}}^d\), we first consider a Bloch–Landau operator in the form
acting in \(L^2({\mathbb {R}}^d)\), where A and \(V_\varGamma \) are the magnetic and electrostatic potentials, respectively (the charge Q of the particle is reabsorbed in A and in V). For the sake of simplicity, we consider only \(d \le 3\) and we ignore the “spin space” \({\mathbb {C}}^N\), but similar results holds true if, for example, \(V_\varGamma \) is matrix-valued and acts non-trivially on these degrees of freedom. With the help of Kato’s theory [35], and arguing as in [45] on the basis [11], it is not difficult to prove that, if \(A = A_\varGamma \) is \(\varGamma \)-periodic, and \({\mathcal {C}}_1\) denotes the fundamental cell of the lattice, then for the validity of (\(\hbox {H}_{1}\)) it is sufficient to assume either of the following two sets of hypotheses:
-
(A)
\(A \in L^\infty ({\mathcal {C}}_1,{\mathbb {R}}^2)\) when \(d=2\) or \(A \in L^4({\mathcal {C}}_1,{\mathbb {R}}^3)\) when \(d=3\), and \(\mathrm {div}\, A, V_\varGamma \in L^2_{\mathrm {loc}}({\mathbb {R}}^d)\) when \(d \in \left\{ 2,3 \right\} \);
-
(B)
\(A \in L^r({\mathcal {C}}_1,{\mathbb {R}}^2)\) with \(r>2\) and \(V_\varGamma \in L^p({\mathcal {C}}_1)\) with \(p>1\) when \(d=2\), or \(A \in L^3({\mathcal {C}}_1,{\mathbb {R}}^3)\) and \(V_\varGamma \in L^{3/2}({\mathcal {C}}_1)\) when \(d=3\).
If instead \(A = A_b\) is a linear potential inducing a constant uniform magnetic field, it is enough to assume that \(V_\varGamma \) is infinitesimally form bounded with respect to \(- \Delta \) on \({\mathcal {H}}_{\mathrm {f}}\), and that the magnetic flux per unit cell is a rational multiple of the magnetic flux quantum. As a further example, we consider the Hamiltonian
acting on \(L^2({\mathbb {R}}^2) \otimes {\mathbb {C}}^2\). In the above, \(\mathbf {p} \equiv (p_1, p_2) = - \mathrm {i}\nabla \) denotes the momentum operator, \(\mathbf {E} \equiv (E_1, E_2, E_3)\) is a constant vector (which we interpret as a constant electric field), and \(\mathbf {s} \equiv (s_x, s_y, s_z)\) denotes the vector of spin matrices (half of the Pauli matrices). Thus, the first term in \(H_0\) represents the kinetic energy, the second one a spin–orbit coupling, and the third one is a Rashba term: This Hamiltonian represents a continuum analogue of the Kane–Mele Hamiltonian proposed in [33]. One can argue that the above operator can be fibered via Bloch–Floquet–Zak transform leading to a family of fiber Hamiltonians \(H_0(k)\) as in Assumption (\(\hbox {H}_{1}\)). Moreover, since
and since the momentum operator is relatively bounded with respect to the Laplacian, one can see that \([H_0,S]\) is relatively bounded with respect to \(H_0\), and hence the assumptions on S (compare Definition 5.1) are satisfied as well.
The following spaces of operators and functions turn out to be useful for our analysis.
Definition 3.2
Let \({\mathcal {H}}_1,{\mathcal {H}}_2\in \{ {\mathcal {D}}_{\mathrm {f}}\, ,\,{\mathcal {H}}_{\mathrm {f}}\}\). We denote by \({\mathcal {L}}({\mathcal {H}}_1,{\mathcal {H}}_2)\) the space of bounded linear operators from \({\mathcal {H}}_1\) to \({\mathcal {H}}_2\) and by \({\mathcal {L}}({\mathcal {H}}_1):={\mathcal {L}}({\mathcal {H}}_1,{\mathcal {H}}_1)\). We define
equipped with the norm \(\left\| A \right\| _{{\mathcal {P}}({\mathcal {H}}_1,{\mathcal {H}}_2)}:=\max _{k\in {\mathbb {B}}^d}\left\| A(k) \right\| _{{\mathcal {L}}({\mathcal {H}}_1,{\mathcal {H}}_2)}\). We also set \({\mathcal {P}}({\mathcal {H}}_1) :={\mathcal {P}}({\mathcal {H}}_1,{\mathcal {H}}_1)\).
Since the Fréchet derivative follows the usual rules of the differential calculus, we have that \({\mathcal {P}}({\mathcal {H}}_1,{\mathcal {H}}_2)\) is a linear space, \({\mathcal {P}}({\mathcal {H}}_{\mathrm {f}})\), \({\mathcal {P}}({\mathcal {D}}_{\mathrm {f}})\) and \({\mathcal {P}}( {\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\) are normed algebras, and e.g. for \(A\in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\) and \(B\in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}})\) we have
Definition 3.3
Let \({\mathcal {H}}_1\in \{ {\mathcal {D}}_{\mathrm {f}}\, ,\,{\mathcal {H}}_{\mathrm {f}}\}\). We set
Notice that \(C^\infty _\varrho ({\mathbb {R}}^d,{\mathcal {H}}_{\mathrm {f}})\supset C^\infty _\varrho ({\mathbb {R}}^d,{\mathcal {D}}_{\mathrm {f}})\), \(C^\infty _\varrho ({\mathbb {R}}^d,{\mathcal {H}}_{\mathrm {f}})\) is dense in \({\mathcal {H}}_\varrho \) with respect to \(\left\| \,\cdot \, \right\| _{{\mathcal {H}}_\varrho }\), and \(C^\infty _\varrho ({\mathbb {R}}^d,{\mathcal {D}}_{\mathrm {f}})\) is dense in \(L^2_{\varrho }({\mathbb {R}}^d,{\mathcal {D}}_{\mathrm {f}})\) with respect to the norm on \({\mathcal {H}}_\varrho \) induced by the graph norm \(\left\| \,\cdot \, \right\| _{{\mathcal {D}}_{\mathrm {f}}}\).
Since we are interested in computing \([A,X_j]\) where A is in one of the above spaces of operators, the following proposition will be relevant. It states their invariance under the derivation \(\overline{[\,\cdot \,,X_j]}\), where the overline denotes the operator closure.
Proposition 3.4
Let \({\mathcal {H}}_1,{\mathcal {H}}_2\in \{ {\mathcal {D}}_{\mathrm {f}}\, ,\,{\mathcal {H}}_{\mathrm {f}}\}\), and \(A\in {\mathcal {P}}({\mathcal {H}}_1,{\mathcal {H}}_2)\). Then, \(\overline{[A,X_j]}\) is in \({\mathcal {P}}({\mathcal {H}}_1,{\mathcal {H}}_2)\), and
Proof
Notice that
thus for every \(\varphi \in C^\infty _\varrho ({\mathbb {R}}^d,{\mathcal {H}}_1)\) one has that
Since \({\mathcal {U}}_{\mathrm {BF}}\, A \, {\mathcal {U}}_{\mathrm {BF}}^{-1} \, C^\infty _\varrho ({\mathbb {R}}^d,{\mathcal {H}}_1)\subset C^\infty _\varrho ({\mathbb {R}}^d,{\mathcal {H}}_2)\) (see [35, III-§3.1, Problem (3.11)]), the commutator appearing on the right-hand side of the first equality is densely defined on \(C^\infty _\varrho ({\mathbb {R}}^d,{\mathcal {H}}_1)\) and so by unitary conjugation the commutator on the left-hand side is densely defined as well. Thus, Lemma 2.1 implies that \([A,X_j]\) acting on \({{\mathcal {U}}_{\mathrm {BF}}^{-1}C^\infty _\varrho ({\mathbb {R}}^d,{\mathcal {H}}_1)}\) is periodic.
Observe that
for every \(f(k,\cdot ) \in {\mathcal {H}}_1\). By (3.3) and (3.4), one obtains
for all \(\varphi \in C^\infty _\varrho ({\mathbb {R}}^d,{\mathcal {H}}_1)\). Therefore, as \({\mathcal {H}}_2\) is a Banach space, the extension principle implies the thesis. \(\square \)
Lemma 3.5
Under Assumption 3.1 we have that \((H_0 - z \mathbb {1})^{-1}\in {\mathcal {P}}( {\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\) for every \(z\in \rho (H_0)\), and that \(\varPi _0 \in {\mathcal {P}}( {\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\).
Proof
The first claim is evident because of \((H_0 - z \mathbb {1})^{-1}(k) = (H_0(k) - z \mathbb {1})^{-1}\). Since \({\mathcal {D}}_{\mathrm {f}}\) is a Banach space, the second one follows from Riesz’s formula
where C is a positively-oriented complex contour intersecting the real axis at the Fermi energy (so, in the gap) and below the bottom of the spectrum of \(H_0\). \(\square \)
Corollary 3.6
Under Assumption 3.1 we have that
-
(i)
\(\overline{[\varPi _0,X_j]\Big |_ {{\mathcal {U}}_{\mathrm {BF}}^{-1}{C^\infty _\varrho ({\mathbb {R}}^d,{\mathcal {H}}_{\mathrm {f}})}}}\in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\),
-
(ii)
\(\overline{\big [ [\varPi _0,X_j] , X_i \big ]\Big |_ {{\mathcal {U}}_{\mathrm {BF}}^{-1}{C^\infty _\varrho ({\mathbb {R}}^d,{\mathcal {H}}_{\mathrm {f}})}}}\in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\),
-
(iii)
\(\overline{[H_0,X_j]\Big |_{{\mathcal {U}}_{\mathrm {BF}}^{-1}{C^\infty _\varrho ({\mathbb {R}}^d,{\mathcal {D}}_{\mathrm {f}})} }}\in {\mathcal {P}}( {\mathcal {D}}_{\mathrm {f}},{\mathcal {H}}_{\mathrm {f}})\).
Proof
(i). By Lemma 3.5, one has that \(\varPi _0 \in {\mathcal {P}}( {\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\). Proposition 3.4 implies the statement.
(ii). In view of Lemma 3.5, one has that \(\varPi _0 \in {\mathcal {P}}( {\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\). Using an argument similar to the one presented in the proof of Proposition 3.4 one deduces the thesis.
(iii). Since by hypothesis \(H_0\in {\mathcal {P}}( {\mathcal {D}}_{\mathrm {f}},{\mathcal {H}}_{\mathrm {f}})\), Proposition 3.4 concludes the proof. \(\square \)
For the sake of readability, we introduce the concise notation
4 Non-equilibrium Almost-Stationary States
Now that we have established the model for the unperturbed system, we consider the perturbed Hamiltonian
where \( \varepsilon \in [0,1]\) is the strength of the external electric field pointing in the j-direction.
As discussed in Introduction, we are interested in the linear response of the system to such a perturbation when it starts initially in the zero-temperature equilibrium state \(\varPi _0\). While it is clear that the perturbation given by the linear electric potential has the effect of driving the system out of equilibrium, the perturbation is slowly varying and thus acts locally merely as a shift in energy. Hence it is expected that the initial equilibrium state \(\varPi _0\) changes continuously into a nearby non-equilibrium almost-stationary state (NEASS). A detailed discussion and justification of the concepts of NEASS can be found in [41, 68].
For the following construction of the NEASS in the present setting we only need to know that the operator \(\varPi ^\varepsilon \), representing the NEASS, is determined uniquely (up to terms of order \({{\mathcal {O}}}(\varepsilon ^{M+1})\)) by the following two properties:
-
(\(\hbox {SA}_{1}\)) \(\varPi ^\varepsilon = {\mathrm {e}}^{-\mathrm {i}\varepsilon {\mathcal {S}}^\varepsilon } \, \varPi _0 \, {\mathrm {e}}^{\mathrm {i}\varepsilon {\mathcal {S}}^\varepsilon }\) for some bounded, periodic and self-adjoint operator \({\mathcal {S}}^\varepsilon \);
-
(\(\hbox {SA}_{2}\)) \(\varPi ^\varepsilon \) almost-commutes with the Hamiltonian \(H^\varepsilon \), namely \([H^\varepsilon ,\varPi ^\varepsilon ] = {{\mathcal {O}}}(\varepsilon ^{M+1})\).
Here \({{\mathcal {O}}}(\varepsilon ^{M+1})\) is understood in the sense of the operator norm.
Proposition 4.1
Consider the Hamiltonian \(H^\varepsilon = H_0 - \varepsilon X_j\) with \(H_0\) satisfying Assumption 3.1.
-
(i)
Let \({\mathcal {S}}:=\mathrm {i}\, {\mathcal {I}}\big ([\overline{[X_j,\varPi _0]},\varPi _0]\big )\), where
$$\begin{aligned} {\mathcal {I}}(A) := \frac{\mathrm {i}}{2 \pi } \oint _C \mathrm {d}z \, (H_0 - z \mathbb {1})^{-1} \, [A,\varPi _0] \, (H_0 - z \mathbb {1})^{-1}, \end{aligned}$$(4.2)with C a positively-oriented contour in the complex energy plane enclosing the part of the spectrum of \(H_0\) below the gap. Then \({\mathcal {S}}\) is in \({\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\) and is self-adjoint.
-
(ii)
Let \(\varPi ^\varepsilon := {\mathrm {e}}^{-\mathrm {i}\varepsilon {\mathcal {S}}} \,\varPi _0\, {\mathrm {e}}^{\mathrm {i}\varepsilon {\mathcal {S}}}.\) Then, \(\varPi ^\varepsilon = \varPi _0 + \varepsilon \varPi _1 + \varepsilon ^2 \varPi _r^\varepsilon \), where both \(\varPi _1 = {\mathcal {I}}\big ( \overline{[ X_j,\varPi _0]} \big )\) and \( \varPi _r^\varepsilon \) are in \({\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\), and the map \({[0,1]}\ni \varepsilon \mapsto \varPi _r^\varepsilon \in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\) is bounded. Moreover, \(\overline{[H^\varepsilon , \varPi ^\varepsilon ]} = \varepsilon ^{2} R^\varepsilon \), where \(R^\varepsilon \) is in \({\mathcal {P}}({\mathcal {H}}_{\mathrm {f}})\) and the map \({[0,1]} \ni \varepsilon \mapsto R^\varepsilon \in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}})\) is bounded.
We postpone the proof of the above proposition to Sect. 6.3. It is already clear from the statement that the map \({\mathcal {I}}(\,\cdot \,)\) plays a crucial role: Its properties are summarized in Sect. 6.2, where we recall in particular the well-known fact from perturbation theory that \({\mathcal {I}}(A)\) is the unique solution to the equation \([H_0,{\mathcal {I}}(A)]=A\) whenever A is off-diagonal in the orthogonal decomposition induced by \(\varPi _0\).
5 Results on the S-conductivity
As stated in the previous sections, we want to investigate quantum S-currents induced by the perturbation given by an external electric field and compute their S-conductivities as linear response coefficients. To fix the ideas, the reader can think of the case \(S = \mathbb {1}_{\mathcal {H}}\) (which corresponds physically to the charge current, in appropriate units, e.g. in quantum Hall systems) or to \(S = \mathbb {1}_{L^2({\mathcal {X}})} \otimes s_z\), where \(s_z = \sigma _z/2\) is half of the third Pauli matrix (which corresponds to the spin current e.g. in quantum spin Hall systems).
Definition 5.1
(S-current and S-conductivity). Let \(S = \mathbb {1}_{L^2({\mathcal {X}})} \otimes s\) be a self-adjoint operator on \({\mathcal {H}}= L^2({\mathcal {X}}) \otimes {\mathbb {C}}^N\). Furthermore, assume that S is periodicFootnote 14 and its fibration \(\mathbb {1}_{L^2({{\mathcal {C}}}_1)} \otimes s\) is in \({\mathcal {L}}( {\mathcal {D}}_{\mathrm {f}})\).
The conventional and the proper S-current operator are defined, respectively, as
where \(H_0\) satisfies Assumption 3.1. The conventional and proper S-conductivity are defined, respectively, as
\(\diamond \)
Since \(\varPi ^\varepsilon =\varPi _0 + \varepsilon \varPi _1 + \varepsilon ^2 \varPi _r^\varepsilon \) by Proposition 4.1.(ii), we have that
In order to prove that \({{\,\mathrm{Re}\,}}\tau (J_{\mathrm {prop},i }^S\,\varPi _1)=\sigma _{\mathrm {prop},ij }^S\) according to (5.1), it suffices to show that all the traces per unit volume above are well defined and finite, and that the term carrying a prefactor \(\varepsilon ^2\) is uniformly bounded in \(\varepsilon \). While the control of the remainder term will be done in Sect. 6.4, we focus now on the linear response coefficient, namely \({{\,\mathrm{Re}\,}}\tau (J_{\mathrm {prop},i }^S\,\varPi _1)\).
5.1 The Linear Response Coefficient
In order to compute the linear response coefficient, we employ directly Definition 2.2 for \(\tau (J_{\mathrm {prop},i }^S \,\varPi _1)\), and start by localizing this operator on the cell \({{\mathcal {C}}}_L\), defined in (2.7), through the projection \(\chi _L\) which multiplies by the characteristic function of \({{\mathcal {C}}}_L\). It is convenient to notice at this point the following
Remark 5.2
The range of \({\mathcal {U}}_{\mathrm {BF}}\chi _L\) is contained in \(C^\infty _\varrho ({\mathbb {R}}^d, {\mathcal {H}}_{\mathrm {f}})\) (compare Definition 3.3) for every \(L>0\). Indeed, for all \(f \in {\mathcal {H}}= L^2({\mathcal {X}}) \otimes {\mathbb {C}}^N\) and all \({r} \in {\mathbb {N}}\), the function \(\langle X \rangle ^{{r}} \chi _L f\) is still in \({\mathcal {H}}\), where \((\langle X \rangle \psi )(x) := (1 + |x|^2)^{1/2} \psi (x)\) for \(\psi \in {\mathcal {D}}(\langle X \rangle )\). By standard Bloch–Floquet theory [45, Appendix A], this is equivalent to requiring that \({\mathcal {U}}_{\mathrm {BF}}(\chi _L f)\) is in the space \(H^{{r}}_\varrho ({\mathbb {R}}^d,{\mathcal {H}}_{\mathrm {f}})\) of \(\varrho \)-covariant maps \(\varphi :{\mathbb {R}}^d \rightarrow {\mathcal {H}}_{\mathrm {f}}\) with Sobolev regularity r: It is a classical result that the intersection of all these Sobolev spaces is contained in \(C^\infty _\varrho ({\mathbb {R}}^d,{\mathcal {H}}_{\mathrm {f}})\). \(\diamond \)
Notice that, since \(\varPi _1 \in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\), it maps \({\mathcal {U}}_{\mathrm {BF}}^{-1} C^\infty _\varrho ({\mathbb {R}}^d,{\mathcal {H}}_{\mathrm {f}})\) to \({\mathcal {U}}_{\mathrm {BF}}^{-1} C^\infty _\varrho ({\mathbb {R}}^d,{\mathcal {D}}_{\mathrm {f}})\). Thus, we have that
and this allows for the following simple manipulations. Using the Leibniz rule, we obtain the following chain of equalities on \({{\,\mathrm{Ran}\,}}(\chi _L)\):
where the operators \({\mathsf {O}}\) and \({\mathsf {R}}\) have been defined in (1.12). We call \({\mathsf {O}}\) the S-orbital part and \(X_i{\mathsf {R}}\) the S-rotation part of the operator related to linear response of the proper S-current \(J_{\mathrm {prop},i }^S\). The latter terminology is due to the fact that, when S is the spin operator and \([H_0,S]\ne 0\), the spin transport is the result of two contributions, that is, the S-orbital part coming from the center-of-mass drift and the S-rotation part due to the spin non-conservation. Notice that both \({\mathsf {O}}\) and \({\mathsf {R}}\) are periodic, instead obviously \(X_i{\mathsf {R}}\) is not periodic and thus a more careful analysis of its trace per unit volume is required. We begin by handling the S-orbital part \({\mathsf {O}}\). In view of the defining relation \(\varPi _1 = {\mathcal {I}}(\overline{[X_j,\varPi _0]})\) (compare Propositions 4.1 and 6.3), we obtain on \({{\,\mathrm{Ran}\,}}(\chi _L)\):
where we have defined the operators
We call \({\mathsf {C}}\) the Chern-like term and \({\mathsf {E}}_\ell \) the \(\ell \)-th extra or beyond-Chern-like term for \(\ell \in \{1,\dots ,4\}\). This terminology is motivated by the fact that whenever the spin is conserved, for \(d=2\), \(i=1\) and \(j=2\) in quantum (spin) Hall systems the Chern-like term \({\mathsf {C}}\) corresponds to the (spin-)Chern number (see Remark 5.12). In general, whenever \([H_0,S]=0\), all extra terms have trace per unit volume zero (see Sect. 5.2) and obviously the S-rotation part vanishes.
In the following proposition, we analyze the trace per unit volume of the operators resulting from the previous algebraic manipulations.
Proposition 5.3
Under Assumption 3.1 and hypotheses on S in Definition 5.1, we have that the Chern-like term \({\mathsf {C}}\), the extra terms \({\mathsf {E}}_\ell \) for any \(\ell \in \{1,\dots ,4\}\) and \(X_i{\mathsf {R}}\), defined in (1.12) and (5.6) have finite traces per unit volume. Moreover, one has
and
where the diagonal and off-diagonal parts of the above operators refer to the orthogonal decomposition induced by the Fermi projection \(\varPi _0\).
The proof of the above proposition is postponed to Sect. 6.5.
We are going to prove that trace per unit volume of the operator \(X_i{\mathsf {R}}\) is well defined and finite. In view of Proposition 2.5.(ii), it suffices to show that \(\tau ({\mathsf {R}})\) is zero. The latter result is an immediate consequence of the following
Proposition 5.4
If \(H_0\) satisfies Assumption 3.1, and B is in \({\mathcal {P}}({\mathcal {H}}_{\mathrm {f}})\cap {\mathcal {P}}({\mathcal {D}}_{\mathrm {f}})\) (in particular, B is a bounded periodic operator) the following holds:
-
(i)
$$\begin{aligned} \tau \big (\mathrm {i}[H_0,B] \varPi _1 \big )=\tau \big ( \mathrm {i}\varPi _0\big [ [B,\varPi _0] , [X_j,\varPi _0] \big ] \big ). \end{aligned}$$
-
(ii)
If, in addition, \([B,X_j]=0\) then
$$\begin{aligned} \tau (\mathrm {i}[H_0,B]\varPi _1)=0. \end{aligned}$$
The above proposition, whose proof is deferred to Sect. 6.5, immediately implies the following
Corollary 5.5
Under Assumption 3.1 and hypotheses on S in Definition 5.1, we have that
We are now in position to state one of our main results.
Theorem 5.6
(General formula for the S-conductivity) Let \(H^\varepsilon = H_0 - \varepsilon X_j\) be acting in \(L^2({\mathcal {X}}) \otimes {\mathbb {C}}^N\), with \(H_0\) and \(\varPi _0\) as in Assumption 3.1. Let \(\varPi ^\varepsilon \) be the NEASS defined in Sect. 4. Consider the conventional (resp. proper) S-conductivity \(\sigma _{\mathrm {conv},ij }\) (resp. \(\sigma _{\mathrm {prop},ij }\)) as in Definition 5.1. Then
where
and the rotation contribution to the proper S-conductivity is defined as
Moreover, the trace per unit volume appearing in (5.13) does not depend on the particular exhaustion\(^{(10)}\) \({{\mathcal {C}}}_L\nearrow {\mathcal {X}}\) chosen in Definition 2.2 and on the choice of the origin, in the sense that \(\tau (X_i{\mathsf {R}})=\tau ((X_i+\alpha ){\mathsf {R}})\) for every \(\alpha \in {\mathbb {R}}\).
The proof of the above theorem is postponed to Sect. 6.
Remark 5.7
Some comments about the above result.
-
(i)
Notice that one can rewrite the above formula for the proper S-conductivity \(\sigma _{\mathrm {prop},ij }^S\), summing the two contributions \(\sigma _{\mathrm {conv},ij }^S\) and \(\sigma _{\mathrm {rot},ij }^S\) as follows:
$$\begin{aligned} \sigma _{\mathrm {prop},ij }^S&=\frac{1}{\left|{{\mathcal {C}}}_1\right|}{{\,\mathrm{Re}\,}}{{\,\mathrm{Tr}\,}}\Big ( \chi _1\left( \mathrm {i}\varPi _0\big [[X_i,\varPi _0] S, [X_j,\varPi _0] \big ] +\mathrm {i}[H_0,X_i^{\mathrm {D}}] S^{\mathrm {OD}} \varPi _1 \right) \chi _1\Big ) \end{aligned}$$(5.14)$$\begin{aligned}&\quad + \frac{1}{\left|{{\mathcal {C}}}_1\right|}{{\,\mathrm{Re}\,}}{{\,\mathrm{Tr}\,}}\Big (\chi _1 \mathrm {i}X_i^{\mathrm {D}} [H_0, S ] \varPi _1 \chi _1 \Big ). \end{aligned}$$(5.15)While (5.11) emphasizes the splitting between the drift contribution coming from the center-of-mass momentum and the one resulting from the spin rotation, the latter decomposition isolates the contribution coming from a periodic operator, in (5.14), and the one deriving from a non-periodic operator, in (5.15).
-
(ii)
The real part is needed in both (5.12) and (5.13), even if on \({{\,\mathrm{Ran}\,}}(\chi _1)\) one has that \(J_{\mathrm {prop},i }^S\, \varPi ^\varepsilon = {\left( J_{\mathrm {prop},i }^S\right) }^*\,\varPi ^\varepsilon .\) On the other hand, if \([H_0,S]=0\) then \(\tau (J_{\mathrm {prop},i }^S\, \varPi ^\varepsilon )\) is automatically real. Moreover, for systems with a fermionic time-reversal symmetry \(\varTheta \) such that \(\varTheta S \varTheta ^{-1} = -S\), the number \(\tau (J_{\mathrm {prop},i }^S\, \varPi _1)\) is real, so the real part is redundant.
It is worth to investigate how the contributions to the proper S-conductivity, appearing in (5.11), behave under a change of primitive cell.
Proposition 5.8
(Unit Cell Consistency of the S-conductivity). Under the hypotheses of Theorem 5.6, we have that
-
(i)
\(\sigma _{\mathrm {conv},ij }^S\) satisfies UCC.
-
(ii)
If, in addition, the model enjoys a discrete rotational symmetry satisfying the hypotheses of Proposition A.3, then \(\sigma _{\mathrm {rot},ij }^S\) satisfies UCC.
Proof
(i). Since all operators involved in the trace per unit volume computing \(\sigma _{\mathrm {conv},ij }^S\) are periodic, Proposition A.2.(i) implies the thesis. (ii). By applying Proposition A.3.(i) along with Proposition A.2.(ii), the conclusion follows. \(\square \)
The next proposition shows that in some discrete models with discrete rotational symmetry, one has that \(\sigma _{\mathrm {rot},ij }^S=0\), and hence the choice between \(J_{\mathrm {prop},i }^S\) and \(J_{\mathrm {conv},i }^S\) becomes immaterial. Remarkably, the Kane–Mele model is in this class.
Proposition 5.9
(Equality of conventional and proper S-conductivity). Let \(H_0\) be a discrete Hamiltonian with finite range hopping amplitudes and S be as in Definition 5.1. Assume that the model satisfies the hypotheses of Proposition A.3 and \(\mathrm {Rank}{\chi _{P_\gamma }}=1\), where \( \left\{ P_\gamma \right\} _{\gamma \in I}\subset {\mathcal {X}}\) is the family of subsets defined in (A.1). Then
Proof
By direct computation, since there exists \(\lambda _{i,\gamma }\in {\mathbb {R}}\) such that \(X_i\chi _{P_\gamma }=\lambda _{i,\gamma }\chi _{P_\gamma }\), we have that
because by Proposition A.3(i) \({{\,\mathrm{Tr}\,}}(\chi _{P_\gamma }\, \mathrm {i}[H_0, S ] \varPi _1\, \chi _{P_\gamma })=0\) for every \(\gamma \in I\). \(\square \)
5.2 When S is (Approximately) Conserved
The computation of the linear response coefficient \(\sigma _{ij}^S\) simplifies considerably if we assume that S is a conserved quantity of the system, namely that
Under this assumption, then \([\varPi _0, S]=0\) as well, since \(\varPi _0\) is a spectral projection associated with \(H_0\), and thus S is diagonal in the decomposition induced by \(\varPi _0\).
If (5.16) holds, then \(J_i^S:=J_{\mathrm {prop},i }^S=J_{\mathrm {conv},i }^S= \mathrm {i}\overline{[H_0, X_i]} S\) is in \({\mathcal {P}}({\mathcal {D}}_{\mathrm {f}},{\mathcal {H}}_{\mathrm {f}})\) by Corollary 3.6.(iii) and the hypothesis \(S\in {\mathcal {P}}({\mathcal {D}}_{\mathrm {f}})\). Hence, since \(\varPi ^\varepsilon \in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\) by Proposition 4.1.(ii), we have that \(J_i^S\varPi ^\varepsilon \in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}})\) and furthermore applying Proposition 6.6.(i), we deduce that \(J_i^S\varPi ^\varepsilon \) is \(\tau \)-class. Thus, the trace per unit volume of \(J_i^S\varPi ^\varepsilon \) is well defined and only the Chern-like term contributes to it. Indeed, by Proposition 5.3 the extra term \({\mathsf {E}}_\ell \) does not contribute for \(\ell \in \{2,4\}\) and the next Lemma shows that \(\tau ({\mathsf {E}}_1)=0\). Obviously, \({\mathsf {E}}_3=0={\mathsf {R}}\) whenever (5.16) holds.
Lemma 5.10
Under Assumption 3.1 and the hypotheses on S in Definition 5.1, assume further that \([H_0,S] =0\). Then,
where \({\mathsf {E}}_1\), \({\mathsf {E}}_3\) and \({\mathsf {R}}\) are defined in (5.6) and (1.12).
The proof of Lemma 5.10 is also deferred to Sect. 6.5, but it is easily seen to imply the following
Theorem 5.11
(S-conductivity in the S conserved case) Let \(H^\varepsilon = H_0 - \varepsilon X_j\) be acting in \(L^2({\mathcal {X}}) \otimes {\mathbb {C}}^N\), with \(H_0\) and \(\varPi _0\) as in Assumption 3.1. Let \(\varPi ^\varepsilon \) be the NEASS defined in Sect. 4 and \(J_i^S\) be as in Definition 5.1. Assume moreover that \([H_0,S]=0\). Then, the S-conductivity is
Proof
In view of Lemma 5.10, the extra terms \({\mathsf {E}}_1\) and \({\mathsf {E}}_3\), and the S-rotation part \(\sigma _{\mathrm {rot},ij }^S\) do not contribute to the trace per unit volume of \(J_i^S\, \varPi _1\). Therefore, using Proposition 2.4.(ii) we are going to compute the k-space representation of the trace of \({{\mathsf {C}}} \chi _1 =\mathrm {i}S \varPi _0 \big [ [X_i,\varPi _0],[X_j,\varPi _0] \big ] \chi _1\). To this end, it suffices to notice that the fiber operator associated with \(\overline{[X_j,\varPi _0]}\) in the Bloch–Floquet–Zak representation is given by \(\mathrm {i}\partial _{k_j} \varPi _0(k)\) (Proposition 3.4) and that \(|{\mathcal {C}}_1| \, |{\mathbb {B}}^d| = (2\pi )^d\). \(\square \)
This theorem applies in particular to the transverse charge current in quantum Hall systems (\(S=\mathbb {1}_{\mathcal {H}}\)), and to the transverse spin current in quantum spin Hall systems (\(S=\mathbb {1}_{L^2({\mathcal {X}})} \otimes s_z\)) whenever the z-component of the spin is conserved. In particular, in the latter case we recover the formula for the spin conductivity proposed in [16, 63], which was derived assuming that the unperturbed Hamiltonian \(H_0\) has an identically degenerate Bloch band, where the degeneracy comes from the spin degrees of freedom. So in that model effectively \((\varPi _0 H_0 \varPi _0)(k) = E_0(k) \mathbb {1}_{L^2({\mathcal {X}})} \otimes \mathbb {1}_{{\mathbb {C}}^2}\), and (5.16) is in particular satisfied after projection to the relevant spectral subspace. Our argument used only (5.16) and no spectral assumption (other than the gap condition) on the Hamiltonian.
Remark 5.12
(Spin conductivity and spin-Chern number). Let \(S=\mathbb {1}_{L^2({\mathcal {X}})} \otimes s_z\) with \(s_z\) a spin operator for non-integer spin r, i.e. with spectrum \(\{ -r, -r+1,\ldots , r-1, r\}\), acting on \({\mathbb {C}}^N\) with \(N=2r+1\) (e.g. half the third Pauli matrix \(\sigma _z\) for \(r=\frac{1}{2}\) and \(N=2\)). Denote by \(s_z = \sum _{\ell =0}^{2r} ( \ell -r ) p_\ell \) its spectral decomposition.
Then, the commutation relation \([\varPi _0,S] = 0\) implies that \(\varPi _0\) admits a splitting in the decomposition induced by S:
The formula for the S-conductivity \(\sigma _{ij}^S \) in Theorem 5.11 simplifies then to
This spin-Chern number S-Chern\((\varPi _0)_{ij}\), proposed in [61] and intrinsically defined in [56], is in general a half integer.Footnote 15 It becomes an integer if the system enjoys time-reversal symmetry. Even in time-reversal invariant systems it can be different from zero while the Chern number
necessarily vanishes. Our approach to spin transport shows then that for \(d=2\) the bulk spin Hall conductivity (measured in units of \(\frac{e}{2\pi } \equiv \frac{1}{2\pi }\)) equals the spin-Chern number, as long as \([H_0, S]=0\). On the other hand, when \(S=\mathbb {1}_{L^2({\mathcal {X}})}\otimes \mathbb {1}_{{\mathbb {C}}^N}\) and \(d=2\) the integral in Theorem 5.11 computes, up to a factor \(1/2\pi \), the Chern number Chern\((\varPi _0)\) of the family of projections \( \left\{ \varPi _0(k) \right\} _{k \in {\mathbb {R}}^2}\), implying the quantization of the Hall conductivity measured in units of \(\frac{e^2}{h} \equiv \frac{1}{2\pi }\) (see [22] and references therein). \(\diamond \)
Abstracting from the previous remark, we consider now any operator in the form \(S=\mathbb {1}_{L^2({\mathcal {X}})} \otimes s\), with s as in Definition 5.1. If S is approximately conserved, i.e. if \(\lambda := \Vert [H_0,S]\Vert _{{\mathcal {P}}( {\mathcal {D}}_{\mathrm {f}},{\mathcal {H}}_{\mathrm {f}})}\) is sufficiently small, then one can still define a spin-Chern number related to \(\varPi _0\) [56, 58] and the S-conductivity is still approximately given by the spin-Chern number. To see this, let \(s = \sum _{\ell =1}^k s_\ell \,p_\ell \) be the spectral representation of s (we need no assumptions on the spectrum of s here), \( {\widetilde{H}}_0 := \sum _{\ell =1}^k (\mathbb {1}\otimes p_\ell ) H_0 (\mathbb {1}\otimes p_\ell ) \quad \text{ and }\quad V:= ( H_0 - {\widetilde{H}}_0) = \sum _{\ell _1 \not = \ell _2} (\mathbb {1}\otimes p_{\ell _1})H_0 (\mathbb {1}\otimes p_{\ell _2})\,.\) Then, \(H_0 = \widetilde{H}_0 + V\), where \( [{\widetilde{H}}_0,S]=0\) and \( \Vert V\Vert _{{\mathcal {P}}( {\mathcal {D}}_{\mathrm {f}},{\mathcal {H}}_{\mathrm {f}})} = \left\| \sum _{\ell _1 \not = \ell _2} (\mathbb {1}\otimes p_{\ell _1}) [H_0, (\mathbb {1}\otimes p_{\ell _2})]\right\| _{{\mathcal {P}}( {\mathcal {D}}_{\mathrm {f}},{\mathcal {H}}_{\mathrm {f}})} \le \lambda \,C_s \,,\) with a constant \(C_s\) that depends only on S.
The spin-conserving Hamiltonian \({\widetilde{H}}_0\) is \(H_0\)-bounded with relative bound \(\lambda C_s\). For \(\lambda <\frac{1}{C_s}\), \({\widetilde{H}}_0\) is thus self-adjoint on the domain of \(H_0\), and for \(\lambda \) small enough, by standard perturbation theory, the Fermi energy \(\mu \) lies also in a gap of \({\widetilde{H}}_0\). Thus we can define the gapped Fermi projection \({\widetilde{\varPi }}_0 := \chi _{(-\infty ,\mu ]}({\widetilde{H}}_0)\) of \({\widetilde{H}}_0\) and, in analogy with Remark 5.12, its associated spin-Chern number. More precisely, let
It is straightforward to see thatFootnote 16\({\widetilde{\varPi }}_0^{(\ell )}\in {\mathcal {P}}( {\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\cap {\mathcal {B}}_1^\tau \) and thus the Chern numbers
are well defined and integer. The S-Chern number of \( \varPi _0 \) is finally defined as
We now show that the S-conductivity is given at leading order in \(\lambda \) by \(S-\text{ Chern }( \varPi _0 )\), a result which coherently complements the robustness of edge spin currents proved by Schulz-Baldes [58].
To formulate such a perturbative statement precisely, we slightly change perspective and notation and introduce a \(\lambda \)-dependent family of Hamiltonians: Let \( H_0\) satisfy Assumption 3.1 and \([ H_0, S] =0\) and assume \(V\in {\mathcal {P}}( {\mathcal {D}}_{\mathrm {f}},{\mathcal {H}}_{\mathrm {f}})\). Then, for \(\lambda _0>0\) sufficiently small, it holds that \(H_\lambda := H_0 + \lambda V\,\) is self-adjoint on the domain of \(H_0\) and has a spectral gap at \(\mu \) for each \(\lambda \in [0,\lambda _0)\). As before we consider the gapped Fermi projection \( \varPi _\lambda := \chi _{(-\infty ,\mu ]}( H_\lambda )\) of \( H_\lambda \), put \( \varPi _0^{(\ell )} := \varPi _0(\mathbb {1}\otimes p_\ell )\) and the associated Chern numbers Chern\(( \varPi _0^{(\ell )} )_{ij}\). The \(\lambda \)-independent S-Chern number associated with \(\varPi _\lambda \) is again \( S\text{-Chern }( \varPi _\lambda )_{ij}:= \sum _{\ell =1}^k s_\ell \cdot \text{ Chern }( \varPi _0^{(\ell )} )_{ij} \equiv S\text{-Chern }( \varPi _0 )_{ij}\,. \)
Proposition 5.13
Let \(H_\lambda = H_0+\lambda V\), be a perturbation of a spin-commuting Hamiltonian \(H_0\) as defined above. Then, the S-conductivity \(\sigma _{ij,\lambda }^S \) of \(H_\lambda \) satisfies
Proof
By standard perturbation theory we obtain
Observe that \(\left\| \varPi _\lambda \right\| _{1,\tau }=\left\| \varPi _0 \right\| _{1,\tau }\) using the smallness argument in \(^{(16)}\). Hence, starting from (5.14) and (5.15), we find
In the second to last equality we used that \(\mathbb {1}\otimes p_{\ell }\) commutes with \(X_i\) and \(\varPi _0\). \(\square \)
6 Proofs
6.1 Diagonal and Off-Diagonal Operators
The Fermi projection \(\varPi _0\) of the unperturbed Hamiltonian \(H_0\) clearly induces a decomposition of \(L^2({\mathcal {X}}) \otimes {\mathbb {C}}^N\) into \({{\,\mathrm{Ran}\,}}\varPi _0 \oplus ({{\,\mathrm{Ran}\,}}\varPi _0)^\perp \). Correspondingly, operators acting in \(L^2({\mathcal {X}}) \otimes {\mathbb {C}}^N\) will admit a block decomposition. We review in this section some properties of this decomposition, heading toward the proof of a well-known formula from asymptotic perturbation theory which allows to invert the Liouvillian \([H_0, \, \cdot \,]\) acting on operators which only have off-diagonal blocks.
Definition 6.1
(Diagonal and off-diagonal parts). Given an operator A and an orthogonal projection \(\varPi \), i.e. \(\varPi = \varPi ^* = \varPi ^2\), such that \(A\,\varPi \) is densely definedFootnote 17, one defines its diagonal and off-diagonal parts as
respectively. The operator A is called diagonal (resp. off-diagonal) if \(A = A^{\mathrm {D}}\) (resp. \(A = A^{\mathrm {OD}}\)).
We collect in the following lemma two simple properties of diagonal and off-diagonal operators in a general Hilbert space \({\mathcal {H}}\), whose proof is elementary.
Lemma 6.2
Let A be an operator acting in \({\mathcal {H}}\) and \(\varPi \) an orthogonal projection on \({\mathcal {H}}\) such that \(A \varPi \) is densely defined.
-
(i)
A is diagonal if and only if \([A,\varPi ] = 0\). A is off-diagonal if and only if \(A = A \varPi + \varPi A\).
-
(ii)
\(A^{\mathrm {OD}} = [[A,\varPi ],\varPi ]\).
6.2 Inverse Liouvillian
We study here the Liouvillian (super-)operator \(B \mapsto [H_0,B]\) associated with the unperturbed Hamiltonian, and in particular the possibility to invert it away from its kernel. We look in other words for the solution B to the equation \([H_0,B] = A\), where \(A\in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}})\) is off-diagonal with respect to the decomposition \({\mathcal {H}}= \varPi _0 {\mathcal {H}}\oplus (\mathbb {1}-\varPi _0){\mathcal {H}}\). We state in the following proposition the solution to this problem, which traces back at least to [47, 48].
Proposition 6.3
Under Assumption 3.1, let \(A\in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}})\) be such that \(A=A^{\mathrm {OD}}\) with respect to \(\varPi _0\). Then, the operator \({\mathcal {I}}(A)\), defined in (4.2), is the unique off-diagonal solution in \({\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\) to the equation
Proof
From the very definition (4.2) and our hypotheses on A, we have that \({\mathcal {I}}(A)\) is off-diagonal and is in \({\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\) by Lemma 3.5.
Thus, we need only to prove (6.1). Since \(H_0(k)\in {\mathcal {L}}({\mathcal {D}}_{\mathrm {f}},{\mathcal {H}}_{\mathrm {f}})\) and \({{\,\mathrm{Ran}\,}}(H_0(k) - z \mathbb {1})^{-1}\subset {\mathcal {D}}_{\mathrm {f}}\) for any \(z\in \rho (H_0)\), applying [70, §V.5 Corollary 2] we have that on \( {\mathcal {D}}_{\mathrm {f}}\)
Hence, we obtain that on the domain \({\mathcal {U}}_{\mathrm {BF}}^{-1}L^2_{\varrho }({\mathbb {R}}^d,{\mathcal {D}}_{\mathrm {f}})\)
using the Riesz formula (compare (3.5)) and Lemma 6.2.(ii).
Finally, notice that \({\mathcal {I}}(A)\) is the unique off-diagonal solution in \({\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\) to equation (6.1) for any off-diagonal operator \(A\in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}})\). Indeed, if \(B\in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\) is another solution to (6.1), then \({\mathcal {I}}(A)-B\) commutes with \(H_0\), and hence with \(\varPi _0\). By Lemma 6.2.(i), \({\mathcal {I}}(A)-B\) is diagonal, and hence \(B=B^{\mathrm {OD}} = {\mathcal {I}}(A)^{\mathrm {OD}} = {\mathcal {I}}(A)\). \(\square \)
6.3 NEASS
This section is devoted to the proof of Proposition 4.1 and thus to the explicit construction of the NEASS \(\varPi ^\varepsilon \) satisfying (\(\hbox {SA}_{1}\)) and (\(\hbox {SA}_{2}\)). In order to give this proof, we first need the following preparatory lemma.
Lemma 6.4
If \(A\in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\), then both \(({\mathrm {e}}^{\varepsilon A} -\mathbb {1})\) and \(\overline{[{\mathrm {e}}^{\varepsilon A},X_j]}\) are in \({\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\) and their norms in this space are bounded uniformly in \(\varepsilon \in [0,1]\).
Proof
Clearly, \(({\mathrm {e}}^{\varepsilon A} -\mathbb {1})\) is periodic as A is periodic. Since \({\mathcal {L}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\) is a Banach space,
converges in \({\mathcal {L}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\) uniformly in \(k\in K\) for any compact set \(K\subset {\mathbb {R}}^d\) and \(\varepsilon \in [0,1]\), as the sequence \(\{\sum _{n=1}^N \frac{\varepsilon ^n A^n(k)}{n!}\}_{N\in {\mathbb {N}}}\subset {\mathcal {L}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\) converges absolutely in \({\mathcal {L}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\) uniformly in \(k\in K\) and \(\varepsilon \in [0,1]\). Moreover, observe that each summand is such that
and
converges in \({\mathcal {L}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\) uniformly in \(\varepsilon \in [0,1]\) and in \(k\in K\) for any compact set \(K\subset {\mathbb {R}}^d\) due to the assumption that \(A\in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\). Therefore, we are allowed to interchange the derivation in k and the series in n. Iterating this argument implies that \(({\mathrm {e}}^{\varepsilon A}-\mathbb {1})\) is in \({\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\) and that its norm in this space is uniformly bounded with respect to \(\varepsilon \in [0,1]\). Thus, by Proposition 3.4 we deduce that \(\overline{[{\mathrm {e}}^{\varepsilon A},X_j]}=\overline{[{\mathrm {e}}^{\varepsilon A}-\mathbb {1}, X_j]}\in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\) again with uniform bounds on its norm for \(\varepsilon \in [0,1]\). \(\square \)
We are now ready to tackle the
Proof of Proposition 4.1
In this proof, we will abbreviate the expression “the map \([0,1] \ni \varepsilon \mapsto A^\varepsilon \in {\mathcal {P}}\) is uniformly bounded” for some space of operators \({\mathcal {P}}\) by just saying that “\(A^\varepsilon \) is in \({\mathcal {P}}\) uniformly in \(\varepsilon \in [0,1]\).”
(i), By Corollary 3.6.(i) one has that \([\overline{[X_j,\varPi _0]},\varPi _0]\) is in \({\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\subset {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}})\) and is off-diagonal with respect to \(\varPi _0\); hence, Proposition 6.3 implies that \({\mathcal {S}}\in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\). Self-adjointness of \({\mathcal {S}}\) is evident.
(ii), By Taylor’s formula, we find that for any \(\varepsilon >0\)
for some \({\tilde{\varepsilon }}\in (0,\varepsilon )\). Thus, in view of Lemma 6.2.(ii) and of the fact that \(\overline{[ X_j,\varPi _0]}={\overline{[ X_j,\varPi _0]}}^{\mathrm {OD}}\), one has
Moreover,
In view of Corollary 3.6.(i) and of Proposition 6.3, we have \(\varPi _1\in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\). Notice now that \([\varPi _1,{\mathcal {S}}]\) is in \({\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\) (because \(\varPi _1, {\mathcal {S}}\in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\)), and \(({\mathrm {e}}^{-\mathrm {i}{\tilde{\varepsilon }} {\mathcal {S}}}-\mathbb {1})\in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\) uniformly in \({\tilde{\varepsilon }}\in (0,\varepsilon )\subseteq [0,1]\) by Lemma 6.4. Therefore, we conclude that
uniformly in \(\tilde{\varepsilon } \in (0,\varepsilon )\subseteq [0,1]\). Finally, on the domain \({\mathcal {U}}_{\mathrm {BF}}^{-1}{C^\infty _\varrho ({\mathbb {R}}^d,{\mathcal {D}}_{\mathrm {f}})}\) we have that
On the right-hand side, the first term vanishes in view of equation (6.1):
As for the second term, we recognize that
extends to a bounded operator in \({\mathcal {P}}({\mathcal {H}}_{\mathrm {f}})\) uniformly in \(\varepsilon \in [0,1]\). Indeed, the second summand \([X_j,\varPi _1]\Big |_ {{\mathcal {U}}_{\mathrm {BF}}^{-1}{C^\infty _\varrho ({\mathbb {R}}^d,{\mathcal {D}}_{\mathrm {f}})}}\) in (6.4) extends to an operator in \({\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\subset {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}})\) by Proposition 3.4. We split instead the first summand in (6.4) as
The first of the terms above satisfies, in view of (6.3),
while \([X_j,\varPi _r^\varepsilon ]\Big |_ {{\mathcal {U}}_{\mathrm {BF}}^{-1}{C^\infty _\varrho ({\mathbb {R}}^d,{\mathcal {D}}_{\mathrm {f}})}}\) extends to an operator in \({\mathcal {P}}({\mathcal {H}}_{\mathrm {f}})\) applying again Proposition 3.4. By using Leibniz’s rule, Lemma 6.4 and Proposition 3.4 one concludes that \(\left\| \overline{[X_j,\varPi _r^\varepsilon ]} \right\| _{{\mathcal {P}}({\mathcal {H}}_{\mathrm {f}})}\) is bounded uniformly in \(\varepsilon \in [0,1]\). \(\square \)
6.4 Well-Posedness of the Proper S-conductivity
By using (5.2) and combining the following results, here we prove that \({{\,\mathrm{Re}\,}}\tau (J_{\mathrm {prop},i }^S\,\varPi _1)=\sigma _{\mathrm {prop},ij }^S\).
Lemma 6.5
Under Assumption 3.1 we have that \(\varPi _0,\varPi _1,\varPi _r^\varepsilon \in {\mathcal {B}}_1^\tau .\)
Proof
By Assumption 3.1\(\varPi _0(k)\) is a finite-rank projection on \({\mathcal {H}}_{\mathrm {f}}\) with rank m (independent of k). Thus, in view of Proposition 2.4, we get
hence \( \varPi _0 \in {\mathcal {B}}_1^\tau \). In view of Propositions 4.1 and 6.3, we have that \(\varPi _1=\varPi _1^{\mathrm {OD}}\) is off-diagonal with respect to the orthogonal decomposition induced by \(\varPi _0\), and hence Lemma 6.2.(i) implies that \(\varPi _1=\varPi _1\varPi _0+\varPi _0\varPi _1.\) Since \(\varPi _1\in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}})\subset {\mathcal {B}}_\infty ^\tau \) and \(\varPi _0\in {\mathcal {B}}_1^\tau \) by (6.5), we deduce that the right-hand side of the above is in \({\mathcal {B}}_1^\tau \) as \({\mathcal {B}}_\infty ^\tau \cdot {\mathcal {B}}_1^\tau \cdot {\mathcal {B}}_\infty ^\tau \subset {\mathcal {B}}_1^\tau \).
Finally, recall from (6.2) that \( \varPi _r^\varepsilon =\tfrac{\mathrm {i}}{2} {\mathrm {e}}^{-\mathrm {i}{\tilde{\varepsilon }} {\mathcal {S}}} \,[\varPi _1,{\mathcal {S}}]\, {\mathrm {e}}^{\mathrm {i}{\tilde{\varepsilon }} {\mathcal {S}}} \) for some \({\tilde{\varepsilon }}\in (0,\varepsilon )\). As we just proved \(\varPi _1\in {\mathcal {B}}_1^\tau \), and the other operators which appear on the right-hand side of the above are in \({\mathcal {P}}({\mathcal {H}}_{\mathrm {f}})\subset {\mathcal {B}}_\infty ^\tau \), we conclude that \(\varPi _r^\varepsilon \in {\mathcal {B}}_1^\tau \). \(\square \)
Proposition 6.6
Under Assumption 3.1 and the hypotheses on S in Definition 5.1, for \(\varPi _\sharp \in \{ \varPi _0,\varPi _1,\varPi _r^\varepsilon \}\) we have that
-
(i)
the operators \( \overline{[H_0,X_i]} S \varPi _\sharp \) and \([H_0, S]\varPi _\sharp \) are in \( {\mathcal {B}}_1^\tau \),
-
(ii)
the operators \( X_i [H_0, S]\varPi _\sharp \) and \(\overline{[H_0,X_i]}S\varPi _\sharp \) have finite trace per unit volume.
Proof
(i). We have that \(\varPi _\sharp \in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\) by Proposition 4.1, \(S\in {\mathcal {P}}( {\mathcal {D}}_{\mathrm {f}})\) by hypothesis, and \(\overline{[H_0,X_i]}\in {\mathcal {P}}({\mathcal {D}}_{\mathrm {f}},{\mathcal {H}}_{\mathrm {f}})\) by Corollary 3.6.(iii); thus, we deduce that
On the other hand, since
we get that
Now we are going to show that the operators in (6.6) and (6.8) are in \({\mathcal {B}}_1^\tau \), using the previous results.
First we analyze the case \(\varPi _\sharp =\varPi _0\). In view of Lemma 6.5 we have \(\varPi _0\in {\mathcal {B}}_1^\tau \), thus we deduce that \(\overline{[H_0, X_i ]} S\varPi _0=\overline{[H_0, X_i ]} S\varPi _0\cdot \varPi _0\in {\mathcal {B}}_\infty ^\tau \cdot {\mathcal {B}}_1^\tau \subset {\mathcal {B}}_1^\tau \) and similarly \([H_0, S]\varPi _0=[H_0, S]\varPi _0\cdot \varPi _0 \in {\mathcal {B}}_\infty ^\tau \cdot {\mathcal {B}}_1^\tau \subset {\mathcal {B}}_1^\tau .\)
We proceed with the case \(\varPi _\sharp =\varPi _1\). By virtue of Lemma 6.5, of the construction in Proposition 4.1, and of Lemma 6.2.(i), we have that \({\mathcal {B}}_1^\tau \ni \varPi _1=\varPi _1\varPi _0+\varPi _0\varPi _1\); hence, we obtain that
One can argue in an analogous way to conclude that \([H_0, S]\varPi _1\in {\mathcal {B}}_1^\tau \) using (6.7).
Finally, we analyze the case \(\varPi _\sharp =\varPi _r^\varepsilon =\tfrac{\mathrm {i}}{2} {\mathrm {e}}^{-\mathrm {i}{\tilde{\varepsilon }} {\mathcal {S}}} \,[\varPi _1,{\mathcal {S}}]\, {\mathrm {e}}^{\mathrm {i}{\tilde{\varepsilon }} {\mathcal {S}}}\). Notice that
Observe that on the right-hand side of the last equality each summand is in \({\mathcal {B}}_1^\tau \). Indeed, since \(({\mathrm {e}}^{-\mathrm {i}{\tilde{\varepsilon }} {\mathcal {S}}}-\mathbb {1}) \in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\) by Lemma 6.4 and \({\mathrm {e}}^{\mathrm {i}\tilde{\varepsilon } {\mathcal {S}}} \varPi _r^\varepsilon \in {\mathcal {B}}_1^\tau \) by Lemma 6.5, we deduce that \(\overline{[H_0, X_i ]} S({\mathrm {e}}^{-\mathrm {i}\tilde{\varepsilon }{\mathcal {S}}}-\mathbb {1}) \cdot {\mathrm {e}}^{\mathrm {i}{\tilde{\varepsilon }} {\mathcal {S}}} \varPi _r^\varepsilon \in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}}) \cdot {\mathcal {B}}_1^\tau \subset {\mathcal {B}}_1^\tau .\) Using (6.9) and \({\mathcal {S}} {\mathrm {e}}^{\mathrm {i}\tilde{\varepsilon }{\mathcal {S}}}\in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}})\) by Proposition 4.1(i), we infer that \(\overline{[H_0, X_i ]} S \varPi _1\cdot {\mathcal {S}} {\mathrm {e}}^{\mathrm {i}{\tilde{\varepsilon }} {\mathcal {S}}}\in {\mathcal {B}}_1^\tau \cdot {\mathcal {B}}_\infty ^\tau \subset {\mathcal {B}}_1^\tau .\) As for the third summand in (6.10), observe that \({\mathcal {S}}\in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\) by Proposition 4.1(i) and \(\varPi _1 {\mathrm {e}}^{\mathrm {i}\tilde{\varepsilon }{\mathcal {S}}}\in {\mathcal {B}}_1^\tau \) by Lemma 6.5, therefore we get that \(\overline{[H_0, X_i ]} S {\mathcal {S}} \cdot \varPi _1 {\mathrm {e}}^{\mathrm {i}{\tilde{\varepsilon }} {\mathcal {S}}}\in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}}) \cdot {\mathcal {B}}_1^\tau \subset {\mathcal {B}}_1^\tau .\) From a similar argument, it follows that \([H_0, S]\varPi _r^\varepsilon \in {\mathcal {B}}_1^\tau \) owing to (6.7).
(ii). The conclusion follows by applying part (i) of the statement (which we just proved), Lemma 2.3, Proposition 2.4(i), and Proposition 2.5.(i). \(\square \)
In the following lemma, we prove that some expectation values of \(J_{\mathrm {conv},i }^S\) which are relevant to transport theory, reduce to the real part of the expectation value of \(J_{\mathrm {naive},i }^S := \mathrm {i}\overline{[H_0, X_i]} S\).
Lemma 6.7
Under Assumption 3.1 and the hypotheses on S in Definition 5.1, let \(\varPi _\sharp \in \{ \varPi _0,\varPi _1,\varPi _r^\varepsilon , \varPi ^\varepsilon \}\). Then
Proof
We first prove the claim for \(\varPi _\sharp \in \{ \varPi _0, \varPi ^\varepsilon \}\), exploiting the fact that \(\varPi _\sharp ^2 = \varPi _\sharp = \varPi _\sharp ^*\) in this case. Using in addition that \(\varPi _\sharp \in {\mathcal {B}}_1^\tau \) by Lemma 6.5, \(J_{\mathrm {naive},i }^S \varPi _\sharp \in {\mathcal {B}}_{\infty }^\tau \) by Proposition 6.6.(i), the cyclicity of the trace per unit volume, \(\overline{{{\,\mathrm{Tr}\,}}(A)}={{\,\mathrm{Tr}\,}}(A^*)\), one has that
The case \(\varPi _\sharp = \varPi _1\) is subtler, and we crucially use the fact that \(\varPi _1=\varPi _1\varPi _0+\varPi _0\varPi _1\), and \(\varPi _1^* = \varPi _1\). Indeed, by using Lemma 2.6, Lemma 6.5, and Proposition 6.6 we obtain that
Finally, it remains to prove the claim for \(\varPi _\sharp = \varPi _r^\varepsilon \). The latter follows by \({\mathbb {R}}\)-linearity from the previous cases, as \(\varPi _r^\varepsilon = \varPi ^\varepsilon - \varPi _0 - \varepsilon \varPi _1\). \(\square \)
Proposition 6.8
(Bounds on the remainder terms) Under Assumption 3.1, there exist \(C_1,C_2 \in {\mathbb {R}}\) such that
Proof
We begin by showing the first inequality. In view of Lemma 6.7 and the triangle inequality, we obtain that
By using the inequalities in (2.10) and Lemma 6.4, the first summand on the right-hand side can be bounded uniformly in \(\varepsilon \) as
where C is a constant independent of \(\varepsilon \). Applying again the inequalities in (2.10), we get for the second and third summand in (6.12), respectively
using Proposition 6.6.(i) and
in view of Lemma 6.5.
Now, to obtain the second inequality of the thesis, notice that
The first summand on the right-hand side is bounded uniformly in \(\varepsilon \), as it is shown before, while for the second one we proceed as follows. Since \([H_0, S]\varPi _r^\varepsilon \) is \(\tau \)-class by Proposition 6.6.(i), Proposition 2.5.(i) implies that
Applying the inequality \(\left|{{\,\mathrm{Tr}\,}}(AB)\right|\le \left\| A \right\| {{\,\mathrm{Tr}\,}}(\left|B\right|)\) for a bounded operator A and a trace class operator B, estimate (2.11) and Proposition 2.4.(i), we have that
Finally, since \([H_0, S]\in {\mathcal {P}}({\mathcal {D}}_{\mathrm {f}},{\mathcal {H}}_{\mathrm {f}})\) as shown in (6.7), one can reason as in (6.12) to conclude that \(\left\| [H_0, S]\varPi _r^\varepsilon \right\| _{1,\tau }\le D\), for a constant D independent of \(\varepsilon \), which yields the second inequality in the thesis. \(\square \)
6.5 Chern-Like and Extra Contributions to the S-conductivity
In this subsection, we prove Proposition 5.3, Proposition 5.4, Theorem 5.6 and Lemma 5.10.
Proof of Proposition 5.3
We are going to show that \({\mathsf {C}}\) and \({\mathsf {E}}_\ell \) are in \({\mathcal {B}}_1^\tau \) for any \(\ell \in \{1,\dots ,4\}\) and then equality (5.7) follows from Proposition 2.4(i). We begin by looking at the Chern-like term \({\mathsf {C}}\). We can write on \({{\,\mathrm{Ran}\,}}(\chi _L)\) that \({\mathsf {C}} = \varPi _0\big [\, \overline{[X_i,\varPi _0]}, S\overline{[X_j,\varPi _0]}\, \big ]\), by using Remark 5.2 and (3.6). In view of Corollary 3.6.(i) and Lemma 6.5, we have that \(\varPi _0\cdot \big [\, \overline{[X_i,\varPi _0]}, S\overline{[X_j,\varPi _0]}\, \big ] \in {\mathcal {B}}_1^\tau \,\cdot \,{\mathcal {P}}({\mathcal {H}}_{\mathrm {f}}).\) Therefore, the above operators are \(\tau \)-class as \({\mathcal {P}}({\mathcal {H}}_{\mathrm {f}})\,\cdot \,{\mathcal {B}}_1^\tau \subset {\mathcal {B}}_1^\tau \). In view of the cyclicity of the trace per unit volume and the off-diagonality of \([X_i,\varPi _0]\), one can rewrite \(\tau ({\mathsf {C}})\) as
We now analyze the first extra term \({\mathsf {E}}_1\). Similarly to the previous computation, we have on \({{\,\mathrm{Ran}\,}}(\chi _L)\) that \({\mathsf {E}}_1= \mathrm {i}\varPi _0 \overline{[H_0,X_i]}\varPi _0 S \varPi _1 +\mathrm {i}\varPi _0^{\perp }\overline{[H_0,X_i]}\varPi _0^{\perp } S \varPi _1\). In view of Corollary 3.6.(iii), Lemma 3.5, Lemma 6.5 and Proposition 6.6.(i), we get that \(\varPi _0 \overline{[H_0,X_i]}\varPi _0S\cdot \varPi _1 \in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}})\,\cdot \,{\mathcal {B}}_1^\tau \) and \(\varPi _0^{\perp }\overline{[H_0,X_i]}\varPi _0^{\perp } S\varPi _1=\varPi _0^{\perp }\overline{[H_0,X_i]} S \varPi _1-\varPi _0^{\perp }\overline{[H_0,X_i]}\varPi _0 S \varPi _1\in {\mathcal {B}}_1^\tau \). Then, noticing that \(\varPi _1=\varPi _1^{\mathrm {OD}}\) by construction (see Propositions 4.1.(ii) and 6.3) and applying Lemma 2.6, we obtain the final expression for \(\tau ({\mathsf {E}}_1)\) in (5.9). The proof concerning the statement that \({\mathsf {E}}_2, {\mathsf {E}}_3\) and \({\mathsf {E}}_4\) are in \({\mathcal {B}}_1^\tau \) is obtained by similar estimates. By applying Proposition 6.6.(ii), we infer that \(X_i{\mathsf {R}}\) has finite trace per unit volume for any \(i\in \{1,\dots ,d\}\) and equality (5.7) is implied by Proposition 2.5.(i).
Finally, Lemma 2.6 implies that \(\tau ({\mathsf {E}}_2)=0=\tau ({\mathsf {E}}_4)\). Indeed, for the first identity observe that \(\tau ( X_i^{\mathrm {OD}}S\varPi _1H_0)=\tau ( \varPi _1H_0 X_i^{\mathrm {OD}}S)=\tau ( H_0 X_i^{\mathrm {OD}}S\varPi _1)\), and for the second one just notices that \({\mathsf {E}}_4\) is the commutator of \(\overline{[X_i,\varPi _0]}\in {\mathcal {B}}_\infty ^\tau \) and \(\varPi _0 S \overline{[\varPi _0,X_j]}\in {\mathcal {B}}_1^\tau \). \(\square \)
Proof of Proposition 5.4
(i) First of all, notice that \([H_0,B]\varPi _1\in {\mathcal {B}}_1^\tau \) since \(\varPi _1=\varPi _1\varPi _0+\varPi _0\varPi _1\) and, for \(\varPi _\sharp \in \left\{ \varPi _0, \varPi _1 \right\} \), one has \([H_0,B]\cdot \varPi _\sharp \in {\mathcal {P}}({\mathcal {D}}_{\mathrm {f}},{\mathcal {H}}_{\mathrm {f}})\cdot {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\subset {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}})\) and \(\varPi _\sharp \in {\mathcal {P}}({\mathcal {H}}_{\mathrm {f}},{\mathcal {D}}_{\mathrm {f}})\cap {\mathcal {B}}_1^\tau \). Since
by construction (see Propositions 4.1.(ii) and 6.3), we have that
where in the first equality we have used the cyclicity of the trace per unit volume, based on the fact that \([H_0,B] (H_0 - z \mathbb {1})^{-1}\in {\mathcal {B}}_\infty ^\tau \) by Lemma 3.5, and that \(\big [\,\overline{[X_j,\varPi _0]},\varPi _0\,\big ] \in {\mathcal {B}}_1^\tau \) by Corollary 3.6.(i) and Lemma 6.5.
Finally, in virtue of \([B,\varPi _0]\overline{[X_j,\varPi _0]}\varPi _0=\varPi _0 [B,\varPi _0]\overline{[X_j,\varPi _0]}\) and by using again the cyclicity of the trace per unit volume since \(\varPi _0\overline{[X_j,\varPi _0]}\in {\mathcal {B}}_1^\tau \) and \([B,\varPi _0]\in {\mathcal {B}}_\infty ^\tau \), we conclude that
(ii). In view of intermediate formula (6.14), the claim is equivalent to show that \(\tau \left( [B,\varPi _0] \big [\,\overline{[X_j,\varPi _0]},\varPi _0\,\big ]\right) =0\). By algebraic manipulations, exploiting the fact that \(\varPi _0\) is a projection, we obtain that on \({{\,\mathrm{Ran}\,}}(\chi _1)\)
Therefore, since each summand on the right-hand side above is in \({\mathcal {B}}_1^\tau \), using again the cyclicity of the trace per unit volume and the off-diagonality of \(\overline{[\varPi _0,X_j]}\), we get that
where we have used that \([B,X_j]=0\) and the conditional cyclicity of the trace, since \(\chi _1 B\varPi _0 \chi _1\) is trace class in view of the fact that \(B\varPi _0\in {\mathcal {B}}_1^\tau \) and of Lemma 2.3. \(\square \)
Proof of Theorem 5.6
First of all, Proposition 6.6.(ii) implies that \(J_{\mathrm {prop},i }^S\varPi _\sharp \) has a finite trace per unit volume for \(\varPi _\sharp \in \{ \varPi _0,\varPi _1,\varPi _r^\varepsilon \}\), thus all the terms appearing in (5.2) are finite. By virtue of Proposition 6.8, we obtain that \(\sigma _{\mathrm {prop},ij }^S = {{\,\mathrm{Re}\,}}\tau (J_{\mathrm {prop},i }^S\,\varPi _1)\). Now, notice that \({{\,\mathrm{Re}\,}}\tau (\mathrm {i}\overline{[H_0,X_i]}S\,\varPi _1) =\tau (J_{\mathrm {conv},i }^S\,\varPi _1)\) by Lemma 6.7. Therefore, by previous computation in (5.3) and (5.4), we get that
Observe that Proposition 6.8 implies that \(\sigma _{\mathrm {conv},ij }^S = {{\,\mathrm{Re}\,}}\tau (J_{\mathrm {conv},i }^S\,\varPi _1) = \tau (J_{\mathrm {conv},i }^S\,\varPi _1)\). From equation (5.5) and Proposition 5.3 we derive formula (5.12). Corollary 5.5 along with Proposition 2.5.(ii) implies the well-posedness of the rotation S-conductivity, as it appears in (5.13). \(\square \)
Proof of Lemma 5.10
Obviously, \({\mathsf {E}}_3=0\) and \({\mathsf {R}}=0\) by using (5.16). Proposition 5.3 implies that \(\tau ({\mathsf {E}}_1)=\mathrm {i}\tau ( [H_0,X_i^{\mathrm {D}}] S^{\mathrm {OD}} \varPi _1)=0\), since \(S^{\mathrm {OD}}=0\). \(\square \)
Notes
All over the paper we use Hartree units, so that the reduced Planck constant \(\hbar \), the mass of the electron \(m_{\mathrm {e}}\) and the charge of the positron e are equal to 1. With this choice, both the unit of charge conductivity \(\frac{e^2}{h}\) and of spin conductivity \(\frac{e}{2\pi }\) reduce to \(\frac{1}{2\pi }\).
A similar phenomenon appears in quaternionic quantum mechanics, where the Hilbert space trace fails to be cyclic [49].
Note that the specific choice \(f(\eta t) = {\mathrm {e}}^{\eta t}\) for \(t\le 0\) has the computational advantage that the integral in (1.6) becomes the inverse Liouvillian, i.e. that the right hand side of (1.6) for finite \(\eta >0\) equals, at least formally, \(\tau (({\mathcal {L}}_{H_0} - \mathrm {i}\eta )^{-1}([X_j,\varPi _0]) A)\). However, in the adiabatic limit any other integrable and smooth choice for the switching function f leads to the same value for \( \sigma _A^{\mathrm{Kubo}}\).
Notice that the j-dependence of \(\sigma _{\mathrm {conv/prop},ij }^S\) is hidden on the left-hand side of the following definition in \(\varPi _1\), see its definition in Proposition 4.1(ii).
The case of magnetic translations [71] is included in this framework, and thus, the Bloch–Landau Hamiltonian can be considered in our setting, assuming a rationality condition on the magnetic flux per unit cell.
We denote by \({\mathcal {U}}({\mathcal {H}}_{\mathrm {f}})\) the group of the unitary operators on \({\mathcal {H}}_{\mathrm {f}}\).
The symbol \(\bigsqcup \) denotes the disjoint union up to zero-measure sets.
This condition is automatically satisfied in the discrete case for any operator A, since the range of \(\chi _K\) is finite-dimensional.
The condition that A is trace class on compact sets is satisfied whenever A is in \({\mathcal {B}}_1^\tau \), as proved in Lemma 2.3.
Notice that this particular choice of the exhaustion \({{\mathcal {C}}}_L\nearrow {\mathcal {X}}\) is such that \({{\mathcal {C}}}_L \cap \varGamma \) is symmetric with respect to the involution \(x \mapsto - x\).
From now on, \({\mathcal {D}}_{\mathrm {f}}\) is understood to be equipped with the norm \(\left\| \,\cdot \, \right\| _{{\mathcal {D}}_{\mathrm {f}}}\).
In the following, when we refer to “the” spectral gap of \(H_0\), we will refer to this specific gap.
From the smoothness assumption (\(\hbox {H}_{1}\)), it follows that \(\mathrm {Rank}(\varPi _0(k)) =m\in {\mathbb {N}}\cup \left\{ + \infty \right\} \) is independent of k. Therefore, in view of the fact that \(\varPi _0\) is an orthogonal projection, \(m<+ \infty \) is equivalent to the assumption \(\varPi _0\in {\mathcal {B}}_1^\tau \).
Notice that this assumption is not automatically satisfied since the (magnetic) translation operators (see (2.2)) may act non-trivially on the factor \({\mathbb {C}}^N\). Obviously, for either the standard magnetic translations [71] or translation operators with a trivial action on the factor \({\mathbb {C}}^N\) the periodicity of S is ensured.
4 By choosing \(\lambda \) small enough, one has that \(\left\| {\widetilde{\varPi }}_0(k)- \varPi _0(k) \right\| <1\) and thus the two projections have the same rank for every k. Then, by the argument in the proof of Lemma 6.5, it follows that \({\widetilde{\varPi }}_0\in {\mathcal {B}}_1^\tau \) and hence \(\widetilde{\varPi }_0^{(\ell )}\in {\mathcal {B}}_1^\tau \) as well.
The operator A may be unbounded and thus a careful analysis is required. In particular, we want to avoid pathological examples and have \(A \supseteq A^{\mathrm {D}} + A^{\mathrm {OD}}\) on a dense domain. Later, we will, for example, apply the block decomposition with respect to the Fermi projection \(\varPi _0\) to the operator \(X_i\) for \(i\in \left\{ 1, \ldots , d \right\} \) (see Sect. 5.1), and \(X_i \, \varPi _0\) is densely defined under Assumption 3.1 on the unperturbed Hamiltonian (compare Corollary 3.6).
In two-level systems one defines \(\rho _{\vartheta ,(12)}:={\mathrm {e}}^{-\mathrm {i}\vartheta s_z}\) to encode the rotation of angle \(\theta \) around the z-axis on \({\mathbb {C}}^2\).
References
Aizenman, M., Graf, G.M.: Localization bounds for an electron gas. J. Phys. A 31, 6783–6806 (1998)
Aizenman, M., Warzel, S.: Random Operators. Vol. 168 in Graduate Studies in Mathematics. American Mathematical Society, Providence (2015)
Antinucci, G., Mastropietro, V., Porta, M.: Universal edge transport in interacting Hall systems. Commun. Math. Phys. 362, 295–362 (2018)
Ashcroft, N.W., Mermin, N.D.: Solid State Physics, College edn. Harcourt College Publishers, New York (1976)
Avron, J., Seiler, R., Simon, B.: Charge deficiency, charge transport and comparison of dimensions. Commun. Math. Phys. 159, 399–422 (1994)
Bachmann, S., Bols, A., De Roeck, W., Fraas, M.: Quantization of conductance in gapped interacting systems. Ann. H. Poincaré 19, 695–708 (2018)
Bachmann, S., De Roeck, W., Fraas, M.: The adiabatic theorem and linear response theory for extended quantum systems. Commun. Math. Phys. 361, 997–1027 (2018)
Bachmann, S., Bols, A., De Roeck, W., Fraas, M.: A many-body index for quantum charge transport. Commun. Math. Phys. Online First (2019)
Bellissard, J.: \(K\)-theory of \(C^*\)-algebras in solid state physics. In: Dorlas, T.C., Hugenholtz, N.M., Winnink, M. (eds.) Statistical Mechanics and Field Theory: Mathematical Aspects. Vol. 257 in Lecture Notes in Physics, pp. 99–156. Springer, Berlin (1986)
Bellissard, J., van Elst, A., Schulz-Baldes, H.: The non-commutative geometry of the quantum Hall effect. J. Math. Phys. 35, 5373–5471 (1994)
Birman, S.H., Suslina, T.A.: Periodic magnetic Hamiltonian with a variable metric. The problem of absolute continuity. Algebra i Analiz 11, 1–40 (1999). English translation in St. Petersburg Math. J. 11, 1–30 (2000)
Bouclet, J.-M., Germinet, F., Klein, A., Schenker, J.H.: Linear response theory for magnetic Schrödinger operators in disordered media. J. Funct. Anal. 226, 301–372 (2005)
Bru, J.-B., de Siqueira Pedra, W.: Lieb–Robinson Bounds for Multi-commutators and Applications to Response Theory. Vol. 13 in Springer Briefs in Mathematical Physics. Springer, Berlin (2016)
Cánces, E., Fermanian Kammerer, C., Levitt, A., Siraj-Dine, S.: Coherent electronic transport in periodic systems. Preprint arXiv:2002.01990 (2020)
Cornean, H.D., Monaco, D., Teufel, S.: Wannier functions and \({\mathbb{Z}}_2\) invariants in time-reversal symmetric topological insulators. Rev. Math. Phys. 29, 1730001 (2017)
Culcer, D., Yao, Y., Niu, Q.: Coherent wave-packet evolution in coupled bands. Phys. Rev. B 72, 085110 (2005)
De Nittis, G., Lein, M.: Linear Response Theory: A Modern Analytic-Algebraic Approach. Vol. 21 in Springer Briefs in Mathematical Physics. Springer, Berlin (2017)
Du, L., Knez, I., Sullivan, G., Dui, R.R.: Robust helical edge transport in gated InAs/GaSb bilayers. Phys. Rev. Lett. 114, 096802 (2015)
Elgart, A., Schlein, B.: Adiabatic charge transport and the Kubo formula for Landau-type Hamiltonians. Commun. Pure Appl. Math. 57, 590–615 (2004)
Fiorenza, D., Monaco, D., Panati, G.: \({\mathbb{Z}}_2\)-invariants of topological insulators as geometric obstructions. Commun. Math. Phys. 343, 1115–1157 (2016)
Fu, L., Kane, C.L.: Time reversal polarization and a \({\mathbb{Z}}_2\) adiabatic spin pump. Phys. Rev. B 74, 195312 (2006)
Graf, G.M.: Aspects of the integer quantum hall effect. In: Gesztesy, F., Deift, P., Galvez, C., Perry, P., Schlag, W. (eds.) Spectral Theory and Mathematical Physics: A Festschrift in Honor of Barry Simon’s 60th Birthday. No. 76 in Proceedings of Symposia in Pure Mathematics, pp. 429–442. American Mathematical Society, Providence (2007)
Graf, G.M., Porta, M.: Bulk-edge correspondence for two-dimensional topological insulators. Commun. Math. Phys. 324, 851–895 (2013)
Giuliani, A., Mastropietro, V., Porta, M.: Universality of the Hall conductivity in interacting electron systems. Commun. Math. Phys. 349, 1107–1161 (2017)
Haldane, F.D.M.: Model for a quantum Hall effect without Landau levels: condensed-matter realization of the “parity anomaly”. Phys. Rev. Lett. 61, 2015 (1988)
Hasan, M.Z., Kane, C.L.: Colloquium: topological insulators. Rev. Mod. Phys. 82, 3045–3067 (2010)
Hastings, M., Michalakis, S.: Quantization of Hall conductance for interacting electrons on a torus. Commun. Math. Phys. 334, 433–471 (2015)
Henheik, J., Teufel, S.: Justifying Kubo’s formula for gapped systems at zero temperature: a brief review and some new results. Rev. Math. Phys. Online Ready (2020)
Jakšić, V., Ogata, Y., Pillet, C.A.: The Green–Kubo formula and Onsager reciprocity relations in quantum statistical mechanics. Commun. Math. Phys. 265, 721–738 (2006)
Jakšić, V., Ogata, Y., Pillet, C.A.: Linear response theory for thermally driven quantum open systems. J. Stat. Phys. 123, 547–569 (2006)
Jakšić, V., Ogata, Y., Pillet, C.A.: The Green–Kubo formula for the spin-fermion system. Commun. Math. Phys. 268, 369–401 (2006)
Jakšić, V., Ogata, Y., Pillet, C.A.: The Green–Kubo formula for locally interacting fermionic systems. Ann. H. Poincaré 8, 1013–1036 (2007)
Kane, C.L., Mele, E.J.: Quantum spin Hall effect in graphene. Phys. Rev. Lett. 95, 226801 (2005)
Kane, C.L., Mele, E.J.: \({\mathbb{Z}}_2\) topological order and the quantum spin Hall effect. Phys. Rev. Lett. 95, 146802 (2005)
Kato, T.: Perturbation Theory for Linear Operators. Springer, Berlin (1966)
Katsura, H., Koma, T.: The \({\mathbb{Z}}_2\) index of disordered topological insulators with time-reversal symmetry. J. Math. Phys. 57, 021903 (2016)
Klein, M., Seiler, R.: Power-law corrections to the Kubo formula vanish in quantum Hall systems. Commun. Math. Phys. 128, 141–160 (1990)
König, M., et al.: The quantum spin Hall effect: theory and experiment. J. Phys. Soc. Jpn. 77, 031007 (2008)
Kubo, R.: Statistical–mechanical theory of irreversible processes I: general theory and simple applications to magnetic and conduction problems. J. Phys. Soc. Jpn. 12, 570–586 (1957)
Kuchment, P.: An overview of periodic ellipic operators. Bull. AMS 53, 343–414 (2016)
Marcelli, G., Teufel, S.: Non-equilibrium almost-stationary states and linear response for gapped non-interacting quantum systems. In preparation (2020)
Marcelli, G.: Improved energy estimates for a class of time-dependent perturbed Hamiltonians. Preprint https://arxiv.org/abs/1904.11300 (2019)
Marcelli, G., Panati, G., Tauber, C.: Spin conductance and spin conductivity in topological insulators: analysis of Kubo-like terms. Ann. H. Poincaré 20, 2071–2099 (2019)
Mastropietro, V., Porta, M.: Spin Hall insulators beyond the helical Luttinger model. Phys. Rev. B 96, 245135 (2017)
Monaco, D., Panati, G., Pisante, A., Teufel, S.: Optimal decay of Wannier functions in Chern and quantum Hall insulators. Commun. Math. Phys. 359, 61–100 (2018)
Monaco, D., Teufel, S.: Adiabatic currents for interacting electrons on a lattice. Rev. Math. Phys. 31, 1950009 (2019)
Nenciu, G.: Linear adiabatic theory, exponential estimates. Commun. Math. Phys. 152, 479–496 (1993)
Nenciu, G.: On asymptotic perturbation theory for quantum mechanics: almost invariant subspaces and gauge invariant magnetic perturbation theory. J. Math. Phys. 43, 1273 (2002)
Moretti, V., Oppio, M.: The correct formulation of Gleason’s theorem in quaternionic Hilbert spaces. Ann. H. Poincaré 19, 3321–3355 (2018)
Panati, G., Pisante, A.: Bloch bundles, Marzari–Vanderbilt functional and maximally localized Wannier functions. Commun. Math. Phys. 322, 835–875 (2013)
Panati, G., Sparber, C., Teufel, S.: Geometric currents in piezoelectricity. Arch. Rat. Mech. Anal. 191, 387–422 (2009)
Panati, G., Spohn, H., Teufel, S.: Effective dynamics for Bloch electrons: Peierls substitution and beyond. Commun. Math. Phys. 242, 547–578 (2003)
Panati, G., Spohn, H., Teufel, S.: Space-adiabatic perturbation theory. Adv. Theor. Math. Phys. 7, 145–204 (2003)
Pastur, L., Figotin, A.: Spectra of Random and Almost-Periodic Operators. Springer, Berlin (1992)
Prodan, E.: An edge index for the quantum spin Hall effect. J. Phys. A: Math. Theor. 42, 1 (2009)
Prodan, E.: Robustness of the spin-Chern number. Phys. Rev. B 80, 125327 (2009)
Rashba, E.I.: Spin currents in thermodynamic equilibrium: the challenge of discerning transport currents. Phys. Rev. B 68, 241315 (2003)
Schulz-Baldes, H.: Persistence of spin edge currents in disordered quantum spin Hall systems. Commun. Math. Phys. 324, 589–600 (2013)
Schulz-Baldes, H.: \({\mathbb{Z}}_2\)-indices and factorization properties of odd symmetric Fredholm operators. Documenta Math. 20, 1481–1500 (2015)
Sheng, L., Sheng, D.N., Ting, C.S., Haldane, F.D.M.: Nondissipative spin Hall effect via quantized edge transport. Phys. Rev. Lett. 95, 136602 (2005)
Sheng, D.N., Weng, Z.Y., Sheng, L., Haldane, F.D.M.: Quantum spin-Hall effect and topologically invariant Chern numbers. Phys. Rev. Lett. 97, 036808 (2006)
Shi, J., Zhang, P., Xiao, D., Niu, Q.: Proper definition of spin current in spin–orbit coupled systems. Phys. Rev. Lett. 96, 076604 (2006)
Shindou, R., Imura, K.-I.: Noncommutative geometry and non-abelian Berry phase in the wave-packet dynamics of Bloch electrons. Nucl. Phys. B 720, 399–435 (2005)
Sinova, J., Culcer, D., Niu, Q., Sinitsyn, N.A., Jungwirth, T., MacDonald, A.H.: Universal intrinsic spin Hall effect. Phys. Rev. Lett. 92, 126603 (2004)
Sinova, J., Valenzuela, S.O., Wunderlich, J., Back, C.H., Jungwirth, T.: Spin Hall effects. Rev. Mod. Phys. 87, 1213 (2015)
Stiepan, H.-M., Teufel, S.: Semiclassical approximations for hamiltonians with operator-valued symbols. Commun. Math. Phys. 320, 821–849 (2013)
Teufel, S.: Adiabatic Perturbation Theory in Quantum Dynamics. No. 1821 in Lecture Notes in Mathematics. Springer, Berlin (2003)
Teufel, S.: Non-equilibrium almost-stationary states for interacting electrons on a lattice. Commun. Math. Phys. 373, 621–653 (2020)
Yang, X., He, Z., Zheng, X.: Unit cell consistency of maximally localized Wannier functions. Electron. Struct. 2, 1 (2020)
Yosida, K.: Functional Analysis. Springer, Berlin (1980)
Zak, J.: Magnetic translation group. Phys. Rev. 134, A1602 (1964)
Zhang, P., Wang, Z., Shi, J., Xiao, D., Niu, Q.: Theory of conserved spin current and its application to a two-dimensional hole gas. Phys. Rev. B 77, 075304 (2008)
Acknowledgements
We are very grateful to Domenico Monaco for intensive discussions and exchange of ideas in an early stage of our project. We are thankful to Luca Fresta, Vojkan Jakšić, Marcello Porta, and Clément Tauber for intensive and fruitful discussions. S. T. also benefited from useful and stimulating interactions with Giuseppe De Nittis and Max Lein. Financial support from the German Science Foundation within the Research Training Group 1838 on “Spectral theory and dynamics of quantum systems” is gratefully acknowledged.
Funding
Open access funding provided by UniversitA degli Studi di Roma La Sapienza within the CRUI-CARE Agreement.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Vieri Mastropietro.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix A. Unit Cell Consistency and Vanishing of Persistent S-currents
Appendix A. Unit Cell Consistency and Vanishing of Persistent S-currents
1.1 Results on the Unit Cell Consistency
As a preliminary step, we prove that the trace per unit volume, acting on a suitable class of operators, is independent of the choice of the fundamental cell \({{\mathcal {C}}}_1\). First, notice that chosen any two primitive cells of arbitrary shape, it is possible to cut the first up into pieces, which, when translated through suitable lattice vectors, can be reassembled to give the second. This fact, well-known to solid state physicists [4], can be reformulated in the following lemma, whose proof is elementary. Recall that \({{\mathcal {C}}}_L\) is defined in (2.7) with reference to a linear basis \( \left\{ a_1, \ldots , a_d \right\} \) of \(\varGamma \), while \(\widetilde{{{\mathcal {C}}}}_L\) in (2.17) refers to another linear basis \( \left\{ \tilde{a}_1, \ldots , \tilde{a}_d \right\} \) of \(\varGamma \).
Lemma A.1
Let \({{\mathcal {C}}}_1\) and \(\widetilde{{{\mathcal {C}}}}_1\) be two primitive cells. Then there exist a finite set \(I\subset \varGamma \) and a family of subsets \( \left\{ P_\gamma \right\} _{\gamma \in I}\subset {\mathcal {X}}\) such that\(^{(7)}\)
In particular, one may choose
Denoting by \(\tau (\,\cdot \,)\) and \(\widetilde{\tau }(\,\cdot \,)\), respectively, the trace per unit volume induced by the choice of the primitive cells \({{\mathcal {C}}}_1\) and \(\widetilde{{{\mathcal {C}}}}_1\), we have the following result.
Proposition A.2
Consider an operator A which is periodic and trace class on compact sets.
-
(i)
Then, \( \tau (A)=\widetilde{\tau }(A)\).
-
(ii)
If, in addition, \({{\,\mathrm{Tr}\,}}(\chi _{P_\gamma } A\, \chi _{P_\gamma })=0\) for all \(\gamma \in I\), where \( \left\{ P_\gamma \right\} _{\gamma \in I}\) are the sets defined in (A.2), then \(\tau (X_iA)=\widetilde{\tau }(X_iA)\).
Proof
(i) In view of Proposition 2.4.(i), it suffices to prove
Obviously, from decompositions (A.1) and the translation invariance of the Lebesgue measure, it follows that \(\left|{{\mathcal {C}}}_1\right|=|\widetilde{{{\mathcal {C}}}}_1|\). The first identity in (A.1), conditional cyclicity of the trace and identity \(\chi _{\mathrm {T}_{\gamma } P_\gamma }^2=\chi _{\mathrm {T}_{\gamma } P_\gamma }\) imply that
Now, by the invariance of trace under unitary conjugation and the periodicity of A, one has that
where in the last equality we used the second decomposition in (A.1).
(ii) After arguing as in the steps leading to (A.3), one notices that
where we have used \([T_\gamma ^*,X_i]=\gamma _i \,T_\gamma ^*\). Therefore, in view of the decompositions (A.1), we conclude that
which yields the claim. \(\square \)
For any \(\gamma \in I\), consider the operator \(\mathbb {1}_{ \left\{ T_\eta T_\gamma P_\gamma \,:\, \eta \in \varGamma \right\} }=\mathbb {1}_{ \left\{ T_\eta P_\gamma \,:\, \eta \in \varGamma \right\} }\), which is periodic by its very definition (here \(\mathbb {1}_\varOmega \) is an alternative notation for the characteristic function of the set \(\varOmega \)). By applying (2.13), one has
for every operator A which is periodic and trace class on compact sets. Using the previous rewriting, we are in position to prove that \(\chi _{P_\gamma }\, {\mathsf {R}}\, \chi _{P_\gamma }\), defined in (1.12), has vanishing trace whenever the model enjoys a discrete rotational symmetry.
1.2 Models with Discrete Rotational Symmetries
Let us fix indices \(i\ne j \in \left\{ 1,\ldots ,d \right\} \), and denote by \(\mathrm {R}_{\vartheta ,(ij)}\) the counterclockwise rotation of angle \(\vartheta \in [0,2\pi )\) in the plane \((x_i,x_j)\):
Rotation operators in the plane \((x_i,x_j)\) on \({\mathcal {H}}\) are defined via
where \(\rho _{\vartheta ,(ij)}\) is a unitary matrix acting on \({\mathbb {C}}^N\).Footnote 18
Suppose that the d-dimensional crystal under consideration is invariant under a rotation of angle \(\vartheta = 2\pi /n\), for some \(n \in {\mathbb {N}}^*\), in the plane \((x_i,x_j)\), namely \(\gamma \in \varGamma \) if and only if \(\mathrm {R}_{\vartheta , (ij)} \gamma \in \varGamma \) (then it trivially follows that \(x\in {\mathcal {X}}\) if and only if \(\mathrm {R}_{\vartheta , (ij)} x\in {\mathcal {X}}\)). A periodic Hamiltonian \(H_0\) is said to be itself rotationally symmetric or invariant under rotation of angle \(\vartheta \) in the plane \((x_i,x_j)\) if and only if \(R_{\vartheta ,(ij)}^{-1} \, H_0 \, R_{\vartheta ,(ij)} = H_0\). For example, several models on the honeycomb structure, including e.g. the Kane–Mele model (see [33] or [43, Appendix A]), are invariant under the rotation \(R_{2\pi /3, (12)}\).
Proposition A.3
Let \(\vartheta =2\pi /n\) for some \(n\in {\mathbb {N}}^*\). Let the Bravais lattice \(\varGamma \) be invariant under the rotation \(\mathrm {R}_{\vartheta ,(ij)}\), i.e. \(\gamma \in \varGamma \) if and only if \(\mathrm {R}_{\vartheta , (ij)} \gamma \in \varGamma \), and
Let the operator \(H_0\), as in Assumption 3.1, be rotationally symmetric of angle \(\vartheta \) in the plane \((x_i,x_j)\). Let \(S= \mathbb {1}_{L^2({\mathcal {X}})} \otimes s\), as in Definition 5.1, be such that \(\rho _{\vartheta ,(ij)}^{-1}s\rho _{\vartheta ,(ij)}=s\). Then
-
(i)
\({{\,\mathrm{Tr}\,}}(\chi _{P_\gamma }\, {\mathsf {R}} \,\chi _{P_\gamma })=0 \quad \forall \gamma \in I\), where \({\mathsf {R}}=\mathrm {i}[H_0, S ] \varPi _1\) and \( \left\{ P_\gamma \right\} _{\gamma \in I}\) are the sets defined in (A.2).
-
(ii)
the persistent conventional S-current vanishes, namely \(\tau (J_{\mathrm {conv},i }^S\varPi _0)=0.\)
The vanishing of persistent S-currents is not expected to hold true in general, i.e. for systems without a discrete symmetry. However, it has been argued that such persistent currents do not contribute to transport [57].
Proof
(i) In view of Proposition 6.6.(i) \({\mathsf {R}}\in {\mathcal {B}}_1^\tau \) and thus applying (A.4) the thesis is equivalent to show that \( \tau (\mathbb {1}_{ \left\{ T_\eta P_\gamma \,:\, \eta \in \varGamma \right\} } {\mathsf {R}} )=\mathrm {i}\tau (\mathbb {1}_{ \left\{ T_\eta P_\gamma \,:\, \eta \in \varGamma \right\} } [H_0, S ] \varPi _1 )=0. \) Using the invariance of the trace under unitary conjugation and the identities \(R_{\vartheta , (ij)}^{-1}\chi _1 R_{\vartheta , (ij)}=\widetilde{\chi }_1\), \(R_{\vartheta , (ij)}^{-1} H_0 R_{\vartheta , (ij)}=H_0\) and \(R_{\vartheta , (ij)}^{-1} S R_{\vartheta , (ij)}=S\) and (A.5), and \( \varPi _1 = {\mathcal {I}}\left( \overline{[ X_j,\varPi _0]}\right) \) by Proposition 4.1.(ii), we obtain that
where we have used Proposition 2.4.(i) and Proposition A.2.(i). Therefore, by iterating the previous computation we have that
Now we are going to compute \(\sum _{k=0}^{n-1} R_{\vartheta , (ij)}^{-k} X_j R_{\vartheta , (ij)}^{k}\). The rotation of angle \(\vartheta \) acts non-trivially only in the plane \((x_i,x_j)\), which we parametrize with the complex coordinate \(z := x_i + \mathrm {i}x_j\). In this parametrization, the rotation of angle \(\vartheta \) is implemented as \(\hat{\mathrm {R}}_{\vartheta }z:= {\mathrm {e}}^{\mathrm {i}\vartheta }z\). Introducing the complex position operator \(Z:=X_i+\mathrm {i}X_j\), one has then that \(\hat{R}_{\vartheta , (ij)}^{-k}\,\mathrm {i}X_j\,\hat{R}_{\vartheta , (ij)}^{k}={{\,\mathrm{Im}\,}}\left( {\mathrm {e}}^{\mathrm {i}k\vartheta }Z\right) \) and thus
As \(\sum _{k=0}^{n-1}{\mathrm {e}}^{\mathrm {i}2\pi k/n}=0\), we deduce that the term in (A.6) vanishes. This concludes the proof.
(ii) In view of decomposition (A.1), it suffices to show that
whose proof is analogous to the previous one since \([H_0,X_i]S\varPi _0\in {\mathcal {B}}_1^\tau .\) \(\square \)
Remark A.4
In general, even exploiting the peculiar discrete rotational symmetries in the hypotheses of Proposition A.3, it is not obvious that the persistent proper S-current vanishes, i.e. \(\tau (J_{\mathrm {prop},i }^S\varPi _0)=0\), since the argument we used relies on the periodicity of the operators involved and \(J_{\mathrm {prop},i }^S\) is not periodic. Nevertheless, in the Kane–Mele model this property holds true thanks to the specific structure of the model.
1.3 Vanishing of Persistent S-current When S is Conserved
When S is a conserved quantity, namely when \([H_0, S]=0\), the vanishing of the persistent S-current \(J_i^S=J_{\mathrm {prop},i }^S=J_{\mathrm {conv},i }^S\) holds true without any symmetry assumption on \(H_0\) (compare [10, 12], where similar results are deduced in the case \(S = \mathbb {1}\), and [8], which offers a proof in the context of many-body quantum spin systems).
To show this, notice first that \([\varPi _0,S]=0\) as well and that \(J_i^S = \mathrm {i}\overline{[H_0, X_i]}\, S\) is a periodic operator in view of Lemma 2.1. Moreover, \(J_i^S \varPi _0=\mathrm {i}\overline{[H_0, X_i]}\, S\varPi _0\) is \(\tau \)-class in view of Proposition 6.6.(i). Consequently, by the identity \(\varPi _0^2=\varPi _0\), Lemma 2.6 and Proposition 2.4.(i), we have that
By Remark 5.2, after some algebra, we get that
Notice that the trace of the first summand above vanishes:
where both summands inside the trace are trace class. Indeed, \(\varPi _0 \chi _1\) is an Hilbert–Schmidt operator, since \({{\,\mathrm{Tr}\,}}((\varPi _0 \chi _1)^* \varPi _0 \chi _1) = {{\,\mathrm{Tr}\,}}(\chi _1 \varPi _0 \chi _1) = |{{\mathcal {C}}}_1| \tau (\varPi _0) < \infty \) by Lemma 6.5. This implies that the adjoint \(\chi _1 \varPi _0\) is Hilbert–Schmidt as well. Since S, \(\varPi _0 H_0 \varPi _0\) and \(\chi _1 X_i \chi _1\) are all bounded operators, the desired claim follows in view of the conditional cyclicity of the trace \({{\,\mathrm{Tr}\,}}(\,\cdot \,)\). Finally, we have that the trace of the second summand in (A.7) vanishes as well. Indeed, by Remark 5.2, Lemma 6.2(ii) and definition (3.6) we obtain that
and thus, using Proposition 2.4(i), we deduce
The conditional cyclicity of \(\tau \) implies the conclusion, since \(\varPi _0 H_0 S \varPi _0\in {\mathcal {B}}_\infty ^\tau \) and \([\,\overline{[X_i,\varPi _0]}\,,\varPi _0]\in {\mathcal {B}}_1^\tau \).
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Marcelli, G., Panati, G. & Teufel, S. A New Approach to Transport Coefficients in the Quantum Spin Hall Effect. Ann. Henri Poincaré 22, 1069–1111 (2021). https://doi.org/10.1007/s00023-020-00974-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00023-020-00974-6