Abstract
Starting from the construction of the free quantum scalar field of mass \(m\ge 0\), we give mathematically precise and rigorous versions of three different approaches to computing the Casimir forces between compact obstacles. We then prove that they are equivalent.
Similar content being viewed by others
1 Introduction
Casimir interactions are forces between objects such as perfect conductors. They can be either understood as quantum fluctuations of the vacuum or as the total effect of van der Waals forces. Hendrik Casimir predicted and computed this effect in the special case of two planar conductors in 1948 using an infinite mode summation [1]. This force was measured experimentally by Sparnaay about 10 years later [2]. Since then also, more precise measurements have been performed with good agreement to the theoretical prediction of Casimir [3,4,5,6]. Other geometric situations, such as for example the Casimir force between a sphere and a plane, were also considered in precision experiments [7, 8].
The classical way to compute Casimir forces, and indeed the way it was done by Casimir himself, is by performing a zeta function regularisation of the vacuum energy. This has been carried out for a number of particular geometric situations (see [9,10,11,12,13] and references therein). Since this method requires knowledge of the spectrum of the Laplace operator in order to perform the analytic continuation, it has long been a very difficult problem to compute the Casimir force in a generic geometric situation even from a non-rigorous point of view. Already, it has been realised by quantum field theorists (see, e.g. [14,15,16,17,18]) that the Casimir force can also be understood by considering the renormalised stress–energy tensor of the electromagnetic field. This tensor is defined by comparing the induced vacuum states of the quantum field with boundary conditions and the free theory. Once the renormalised stress–energy tensor is mathematically defined, the computation of the Casimir energy density becomes a problem of spectral geometry (see, e.g. [19]) and numerical analysis. The renormalised stress–energy tensor and its relation to the Casimir effect can be understood at the level of rigour of axiomatic algebraic quantum field theory. Currently, this is still the subject of ongoing research in mathematical physics (see, e.g. [20]) in particular when it comes to the effect in curved backgrounds and fields that are not scalar, or in situations when the objects move.
Recently progress was made in the non-rigorous numerical computation of Casimir forces between objects (see for example [21,22,23]). This approach uses a formalism that relates the Casimir energy to a determinant computed from boundary layer operators. Such determinant formulae result in finite quantities that do not require further regularisation and have been obtained and justified in the physics literature [24,25,26,27,28,29,30,31,32,33]. This has not only resulted in more efficient numerical algorithms but also in various asymptotic formulae for the Casimir forces for large and small separations. Many of the justifications and derivations of these formulae are based on physics considerations of macroscopic properties of matter or of van der Waals forces. As such they often involve ill-defined path integrals. From a mathematical point of view, considerations that link the determinant formulae to the spectral approach initially taken by Casimir have been largely formal computations or involve ad hoc cut-offs and regularisation procedures. We note that in the Appendix of [30] it is proved correctly that the Fredholm determinant in the final formula of the Casimir energy is well defined. Only recently a mathematical justification of these formulae was given in [34], relating it to the trace of a linear combination of powers of the Laplace operator. The determinant formulae are directly related to the multi-reflection expansion of Balian and Duplantier [35] that also yields a finite Casimir energy. We mention [36] in the mathematical literature where the Casimir energy of a piston configuration is expressed in terms of the zeta regularised Fredholm determinant of the Dirichlet-to-Neumann operator.
Summarising, we list several ways to compute the Casimir force acting on a compact object that have been proposed and carried out:
-
(1)
Using a total energy obtained in some way by regularising the spectrally defined zeta function. This can be done either directly or by first considering a compact problem in a box and then taking the adiabatic limit.
-
(2)
By integrating the renormalised stress–energy tensor around any surface enclosing the object.
-
(3)
Using formulae for the energy in terms of a determinant of boundary layer operator.
The list is non-exhaustive and other methods exist, such as for example the worldline approach (see, e.g. [37] and references). Here, we will restrict ourselves to the listed methods.
The aim of the present paper is to establish, in the case of finitely many compact objects, the precise mathematical meaning of each of the listed methods for the case of the scalar field and then prove that they give the same answer for the force (not necessarily for the energy). The main tool to achieve this will be the relative stress–energy tensor. This tensor mimics the definition of the relative trace of [34] and seems to not have been defined or studied previously in the literature. Note that the renormalised stress tensor becomes unbounded and non-integrable [16, 38] near the boundaries of objects and this makes it unsuitable to compute the total energy component from the energy density \(T_{00}\). In contrast, the relative stress–energy tensor is smooth up to the obstacles and is much more regular when considering boundary variations. This relative stress–energy tensor does not satisfy Dirichlet or Neumann boundary conditions and therefore integration by parts involves boundary contributions. The relative energy density can be defined entirely in terms of functional calculus of the Laplace operator. This relative energy density has been introduced in [34]. It was shown to be integrable and its integral can be interpreted as the trace of a certain operator. The main result of [34] states that this trace can be expressed as the determinant of an operator constructed from boundary layer operators, thus providing a rigorous justification of the method (3) linking it with method (1). To show the equivalence of methods (2) and (3) we must provide a formula for the variation of the relative energy when one of the objects is moved. To compute this variation, we prove and use a special case of the Hadamard variation formula ([39,40,41]) adapted to the non-compact setting. We then show that as a consequence of this formula that the variation of the total energy equals the surface integral of the spatial components of the relative stress–energy tensor (see Theorem 5.9). This surface integral is also equal to the surface integral over the renormalised tensor (see Theorem 5.10). We will now give a more precise formulation of our main result.
We consider d-dimensional Euclidean space with \(d \ge 2\). Let \({\mathcal {O}}\) be a bounded open subset of \(\mathbb {R}^d\) with smooth boundary such that the complement \(\mathcal {E} = \mathbb {R}^d \setminus \overline{\mathcal {O}}\) is connected. The domain \(\mathcal {O}\) will be assumed to consist of N many connected components \(\mathcal {O}_1,\ldots ,\mathcal {O}_N\). The space \(X = \mathbb {R}^d \setminus \partial \mathcal {O}\) therefore consists of the \(N+1\)-many connected components \(\mathcal {O}_1,\ldots ,\mathcal {O}_N,\mathcal {E}\). We think of \(\mathcal {O}\) as obstacles placed in \(\mathbb {R}^d\), and \(\mathcal {E}\) then corresponds to the exterior region of these obstacles. The set \(X \subset \mathbb {R}^d\) consists of the interior and the exterior of the obstacles, separated by \(\partial \mathcal {O}\). See Fig. 1 for an example with three obstacles.
For the free scalar field of mass \(m \ge 0\), let \(T_\mathrm {ren}\) be the renormalised stress–energy tensor as defined in Sect. 2.2. In QFT terms, this stress–energy tensor is equivalent to the usual stress–energy tensor obtained from canonical quantisation when normal ordering is used with respect to the free vacuum state. This is a smooth symmetric two-tensor away from \(\partial \mathcal {O}\), but it is singular at \(\partial \mathcal {O}\) and the integral of \(T_{00}\) over \(\mathcal {E}\) does not converge. Let \(T_\mathrm {rel}\) be the relative stress–energy tensor as given in Definition 3.3. The relative total energy \(E_\mathrm {rel}\) is defined as the integral of \((T_\mathrm {rel})_{00}(x)\) over \(\mathbb {R}^d\) which can be shown to exist and to be equal the trace of a certain combination of operators. In case \(m>0\), the regularised energy \(E_\mathrm {reg}\) is defined in Sect. 6, Definition 6.3 via zeta function regularisation. We would like to compute the force on an object \(\mathcal {O}_j\) due to the presence of the other objects. Approach (2) is to directly compute
where \(T_\mathrm {ren}\) is the renormalised stress–energy tensor given in Definition 2.3 and \(\Sigma \) is any smooth surface enclosing \(\mathcal {O}_j\) (i.e. homologous to \(\partial \mathcal {O}_j\) in \(\overline{\mathcal {E}}\)), \(n^k\) is the exterior unit normal vector field, and \(\mathrm {d}\sigma \) is the surface measure on \(\Sigma \). If energy conservation holds, i.e. no energy is radiated off to infinity, one expects this force to be the directional derivative of the total energy when \(\mathcal {O}_j\) is moved rigidly. Thus, let Z be a constant vector field on \(\mathbb {R}^d\). Let Y be a vector field that equals Z near \(\mathcal {O}_j\) and vanishes near \(\mathcal {O}_l\) when \(l \not = j\). The vector field generates a flow \(\Phi _\epsilon \) that, for \(\epsilon \) near zero, moves the object \(\mathcal {O}_j\) rigidly and we end up with a configuration that depends on the parameter \(\epsilon \). The total energies \(E_\mathrm {rel}\) and \(E_\mathrm {reg}\) then also depend on \(\epsilon \) in that way and become functions of \(\epsilon \) in a small interval around zero. The change of these energies with respect to the flow can be interpreted as the change of energy needed to move object \(\mathcal {O}_j\) rigidly relative to the other objects in the direction of Z. Our main result may be stated as follows.
Theorem 1.1
The relative energy \(E_\mathrm {rel}(\epsilon )\) is differentiable in \(\epsilon \) at \(\epsilon =0\) and its derivative equals \(F_i Z^i\), where
Moreover,
where \(Q_\mathrm {rel}(\omega ) = Q_{\mathrm {i}\omega } ({\tilde{Q}}_{\mathrm {i}\omega })^{-1}\) is defined in detail in Sect. 3 and constructed out of boundary layer operators for the Laplacian. If \(m>0\), then \(E_\mathrm {rel}(\epsilon ) - E_{\mathrm {reg}}(\epsilon )\) is constant near \(\epsilon =0\).
We note that this mathematical theorem simply shows that all these proposed computational methods give the same Casimir interactions in the case of separated rigid bodies. The statement does not say anything about the actual origin of the Casimir force or its existence, which needs to be determined from experiments or physics considerations. There is however a strong argument for the expression
to be a directly relevant physical quantity. Our point of view is that the stress–energy tensor does not have an absolute meaning in this context, but rather is used to compare two vacuum states (normal ordering depends on a comparison state). If we would like to know the effect for the rigid objects \(\mathcal {O}_2,\ldots \mathcal {O}_N\) on the rigid object \(\mathcal {O}_1\), the states to compare are not the ground state with Dirichlet conditions and the free ground state. It is rather the vacuum states obtained from the Laplacian with Dirichlet conditions imposed on \(\mathcal {O}_1\) alone and with Dirichlet conditions on all the objects. The comparison of these two states yields a stress–energy tensor that is completely regular near \(\mathcal {O}_1\), and the computation of the force based on this tensor leads directly to the above formula without regularisation.
The paper is organised as follows. In Sect. 2, we review the rigorous construction of the free scalar field of mass \(m\ge 0\) in the presence of boundaries and show how this leads to a natural definition of the renormalised stress–energy tensor, which is given in Sect. 2.2. We also review its most important properties and express it in terms of spectral quantities for the Laplace operator. Section 3 introduces the relative setting and gives the definition of the relative stress–energy tensor and its basic properties. Some norm estimates on the relative resolvent are given in Sect. 4, which provides mathematical justifications for later proofs. In Sect. 5, we prove a Hadamard variation formula and compute the variation of the relative energy to establish the first part of the main theorem. In Sect. 6, we show that for \(m>0\) the renormalised version of the zeta function has a meromorphic continuation and can thus be used to define the regularised energy. This section also contains a proof that variations of the regularised energy and the relative energy coincide. To illustrate the method and relate it to the classical computations we treat the easier example of the one-dimensional Casimir energy explicitly in Appendix 7. This example also illustrates that a divergence term for the time-component of the renormalised stress–energy tensor that is normally neglected needs to be taken into account to obtain the correct result (see Remark 2.5).
In a follow-up paper, we will establish a similar theorem for the electromagnetic field. We note here that the stress–energy for the electromagnetic field is quite different from the scalar field and there are additional complications such as zero modes [42] that are absent for the scalar field. Moreover, the boundary conditions for the electromagnetic field are slightly more complicated, and cannot be reduced to Dirichlet boundary conditions. We therefore decided to not attempt a unified treatment which would obscure the result by additional notations.
Our approach is expected to carry over to other boundary conditions such a Neumann, mixed Dirichlet–Neumann or Robin boundary conditions with the single-layer operators replaced by the appropriate layer operators. As in the electromagnetic case, additional technical problems need to be overcome in these cases due to the possible appearance of zero modes and singularities of the Dirichlet-to-Neumann map at zero.
1.1 Notations
Let \(M \subset \mathbb {R}^d\) be an open subset. By the Schwartz kernel theorem, continuous linear operators \(A: C^\infty _0(M) \rightarrow \mathcal {D}'(M)\) are in one to one correspondence to distributions in \(\mathcal {D}'(M \times M)\), i.e. for every such A there exists a unique Schwartz kernel in \(\mathcal {D}'(M \times M)\). In this paper, the Schwartz kernel of A will be denoted by \(\breve{A}\).
2 Scalar Quantum Field Theory with Dirichlet Boundary Conditions
Let \(-\Delta \) be the (positive) Dirichlet Laplacian imposed on the codimension one submanifold \(\partial \mathcal {O}\). By definition, this is the unbounded self-adjoint operator defined on the Hilbert space \(L^2(\mathbb {R}^d)\) associated with the Dirichlet quadratic form \(q_D(f,f) = \int _X \Vert \nabla f \Vert ^2 \mathrm {d}x \) with form domain being the Sobolev space \(H^1_0(X)\).
As a consequence of elliptic regularity ([43]*Sect. 7.2), we have for the domain of \(\mathrm {dom}((-\Delta )^{\frac{s}{2}})\) equipped with its graph norm for any \(s \in 2 \mathbb {N}_0\) the continuous inclusions
One can use complex interpolation ([44]*Sect. 4 and Theorem 4.2) to extend this to any \(s \ge 0\). Here, \(H^{s}_{\mathrm {comp}}(X)\) denotes the space of functions in \(H^{s}(X)\) with compact support in X.
The Hilbert space \(L^2(\mathbb {R}^d)\) then decomposes into a direct sum
and each subspace is an invariant subspace for \(-\Delta \) in the sense that any bounded function of the operator as defined by spectral calculus will leave these subspaces invariant. The restriction of \(-\Delta \) to \(L^2(\mathcal {O}_j)\) is the Dirichlet Laplacian on the interior of \(\mathcal {O}_j\) and therefore has compact resolvent. The restriction of \(-\Delta \) to \(L^2(\mathcal {E})\) has purely absolutely continuous spectrum \([0,\infty )\). By comparison, we also have the free Laplacian \(-\Delta _\text {free}\) on \(\mathbb {R}^d\) which corresponds to the case \(\mathcal {O}=\emptyset \). Throughout the paper, we fix a mass parameter \(m \ge 0\).
Definition 2.1
The relativistic Hamiltonian H is defined to be the self-adjoint operator \(H = (-\Delta + m^2)^{\frac{1}{2}}\).
The space-time M we consider is the Lorentzian space-time \(\mathbb {R}\times X\) with Minkowski metric. The forward and backward fundamental solutions \(G_{+/-} : C^\infty _0(M) \rightarrow C^\infty (M)\) of the Klein–Gordon operator \(\Box + m^2\) with Dirichlet boundary conditions are given by
where \(\theta \) is the Heaviside step function. As usual in canonical quantisation, one considers the difference \(G = G_+ - G_-\) given by
Here, \(H^{-1} \sin (H(t-t'))\) is defined by spectral calculus. Since the function \(| x^{-1} \sin (x t) |\) is bounded by |t|, the operator \(H^{-1} \sin (H(t-t'))\) defines for any \(s\ge 0\) a bounded map from \(H^s_{\mathrm {comp}}(X)\) to \(\mathrm {dom}(-\Delta )^s \subset H^s(\mathcal {O}_1) \oplus \ldots H^s(\mathcal {O}_N) \oplus H^s(\mathcal {E})\). Here, the inclusion of the domain in the Sobolev spaces follows for \(s \in 2 \mathbb {N}_0\) from elliptic regularity up to the boundary ( [45]*Theorem 4.18) and for general \(s \ge 0\) by interpolation. In particular, this means that \(H^{-1} \sin (H(t-t'))\) has a distributional integral kernel. We can define a symplectic structure \(\sigma \) on \(\mathcal {W}=C^\infty _0(M)/\left( (\Box +m^2)C^\infty _0(M) \right) \) by
This induces a symplectic structure on \(G C^\infty _0(M)\) that is well known to coincide with the standard symplectic structure on the space of solutions. Indeed, if we define \(u = G f\) and \(v = G g\) for \(f,g \in C^\infty _0(M)\) then u and v solve the Klein–Gordon equation with Dirichlet boundary conditions and
In this equality, the right-hand side is independent of t.
2.1 Field Algebra and the Vacuum State
The field algebra of the real Klein–Gordon field is then the (unbounded) CCR \(*\)-algebra of this symplectic space. Instead of using the symplectic space \(\mathcal {W}\) one can describe this algebra \(\mathcal {A}\) directly as the complex unital \(*\)-algebra generated by symbols \(\phi (f)\) with \(f \in C^\infty _0(M,\mathbb {R})\) satisfying the relation
Physical states of this quantum system are states on this \(*\)-algebra. The construction and physical interpretations of such states usually relies on a Fock representation of \(\mathcal {A}\). This representation is chosen on physical grounds as a positive energy representation.
We briefly explain this now. The group of time translations \((T_s f)(t,x)=f(t-s,x)\) commutes with \(\Box + m^2\) and G and therefore \(\alpha _t(\phi (f)):=\phi (T_t f)\) defines a group of \(*\)-automorphisms of \(\mathcal {A}\). If a state \(\omega :\mathcal {A} \rightarrow \mathbb {C}\) is invariant then this group lifts to a group of unitary transformations U(t) on the GNS-Hilbert space which is uniquely determined by
We say that \(\pi \) is a positive energy representation if this group is strongly continuous and its infinitesimal generator has non-negative spectrum.
We will focus in this paper on the quasi-free ground state. This means that the state is completely determined by its two point distribution
which is given explicitly as
i.e. \(\omega _2\) is the integral kernel of the operator \(H^{-1} e^{-\mathrm {i}H(t-t')}\). In case \(m=0\), the spectrum of H contains zero and this expression needs to be interpreted in the sense of quadratic forms. Namely, it follows from general resolvent expansions (for example [42]*Theorems 1.5, 1.6 and 1.7) that \(C^\infty _0(X)\) is contained in the domain of the operator \(H^{-\frac{1}{2}}\). This follows from the formula
In particular, the space of test functions \(C^\infty _0(X)\) is contained in the form domain of \(H^{-1}\) and therefore, by the Schwartz kernel theorem, the operator \(H^{-1}\) has a distributional kernel in \(\mathcal {D}'(X \times X)\). This is of course also the case for general \(m>0\). We will denote the integral kernel of \(H^{-1}\) by \(\breve{H}^{-1}\), mildly abusing notation.
One can check directly that \(\omega _2\) defines a positive energy representation. Instead of using \(-\Delta \), we could also have used \(-\Delta _\text {free}\), the free Laplace operator. This also defines a positive energy state on the free algebra of observables \(\mathcal {A}_\text {free}\) which we will denote by \(\omega _{\text {free}}\), and similarly we use the notation \(H_\text {free}= (-\Delta _\text {free}+ m^2)^\frac{1}{2}\). There states can be compared by restricting them to certain subalgebras that are contained in both the algebra of observables and the free algebra of observables. For example if U is contained in a double cone in M then \(\mathcal {A}(U)\) which is generated by \(\phi (f), \mathrm {supp}(f) \subset U\) can be thought of as a subset of both \(\mathcal {A}\) and \(\mathcal {A} _\text {free}\) and therefore both states can be restricted to this algebra.
2.2 The Renormalised Stress–energy Tensor
The classical stress–energy tensor of the Klein–Gordon field for a smooth real-valued solution u is given by
This is a symmetric 2-tensor on M and one can easily show, using the Klein–Gordon equation, that it is divergence-free. Here, g is the Minkowski metric on M with signature \((-1,1,\ldots ,1)\). The Euclidean metric on \(\mathbb {R}^d\) will be denoted by h. The components \(T_{ij}^{cl}\) of the stress–energy tensor are the restrictions to the diagonal of the functions \(Q^{cl}_{ij}(x,y)\) defined on \(M \times M\) by
The quantum field theory counterpart of Q can be written in the field algebra as a field-algebra-valued bilinear form in the test functions \(f_1,f_2 \in C^\infty (M)\) as
The expectation value of \(Q_{ij}\) with respect to the state \(\omega \) is then given in terms of the two point function \(\omega _2\) as
Let K(t, x, y) be the distributional kernel of \(\frac{1}{2}H^{-1}e^{-\mathrm {i}Ht}\). Then, the distribution \(\omega (Q_{ij})\) with respect to the ground state is given by
or in short,
The above expressions are formal and make sense only when paired with test functions. We will use such formal notation throughout the paper when there is danger of confusion. The expectation value of the stress–energy tensor would correspond to the restriction of \(\omega (Q_{ij})\) to the diagonal as a distribution. Unfortunately, the distribution \(\omega (Q_{ij})\) is singular and cannot be restricted to the diagonal in a straightforward manner. If one is interested in relative quantities only then one can still define the renormalised expectation value of the stress–energy tensor between the states. Both states \(\omega \) and \(\omega _\text {free}\) are positive energy states and therefore satisfy the Hadamard condition (for example [46]*Theorem 6.3). By uniqueness of such Hadamard states, the difference of the two-point distributions is smooth near the diagonal in \(M \times M\). In the present case, this can also be seen more directly as follows. Let \(K_\text {free}(t,x,y)\) be the kernel of \(\frac{1}{2}H_\text {free}^{-1}e^{-\mathrm {i}H_\text {free}t}\). We will consider the difference \(\mathcal {K}(t,x,y) =K(t,x,y) - K_\text {free}(t,x,y)\).
Theorem 2.2
The distribution \(\mathcal {K}\) is smooth near the set \(\{ (0,x,y) \mid x,y \in X\} \subset \mathbb {R}\times X \times X\). In particular \(\breve{H}^{-1} - \breve{H}^{-1}_\text {free}\) is smooth in \(X \times X\).
Proof
The distribution \(\mathcal {K}(t,x,y)\) is a solution of the wave equation
on \(\mathbb {R}\times X \times X\) with initial conditions
By [34]*Lemma 5.1, integral kernel \(\kappa _\lambda \) of the resolvent difference
is smooth and satisfies on any compact subset U of \(X \times X\) a \(C^k\)-norm bound of the form
for some \(\ell \ge 0\) and \(\delta '>0\). Therefore, the integral representation
converges in the \(C^\infty (X \times X)\) topology. Thus, the distribution \(\breve{H}^{-1} - \breve{H}^{-1}_\text {free}\) is smooth in \(X \times X\). Since the initial conditions are smooth the solution \(\mathcal {K}\) is smooth where it is uniquely determined by the initial data. This is the case in a neighbourhood of \(t=0\) in \(\mathbb {R}\times X \times X\). \(\square \)
Hence, the distribution \(\omega (Q_{ij})- \omega _\text {free}(Q_{ij})\) is a smooth function in a neighbourhood of the diagonal \(\mathtt {\Delta }\subset M\times M\).
Definition 2.3
The components \((T_\mathrm {ren})_{ij}\) of the renormalised stress–energy tensor \(T_\mathrm {ren}\) are defined to be the restriction to the diagonal of the function \(\omega (Q_{ij})- \omega _\text {free}(Q_{ij})\).
Theorem 2.4
The renormalised stress–energy tensor is symmetric and is given by
and
for \(1\le i,j \le d\). Note that here \(\breve{H}^{-1}\) and \(\breve{H}^{-1}_\text {free}\) are the integral kernels of \(H^{-1}\) and \(H^{-1}_\text {free}\), respectively. Moreover, the expression \(\breve{A}|_\mathtt {\Delta }\) means the restriction of the integral kernel, \(\breve{A}\), to the diagonal (See Sect. 1.1).
Remark 2.5
The terms of divergence forms in the renormalised stress–energy tensor are commonly neglected in the literature, as one may naively think that they have zero contribution when integrating over the whole space. However, this is not the case. As it is not integrable due to the singular behaviour near the boundary, the divergence theorem does not apply in this case. See Appendix 7 for the simplest case. The problem disappears when we work with the relative stress–energy tensor given in Definition 3.3.
Proof of Theorem 2.4
Let \(\mathcal {K}(t,x,x')\) be the kernel of \(\frac{1}{2}(H^{-1}e^{-\mathrm {i}Ht}-H_\text {free}^{-1}e^{-\mathrm {i}H_\text {free}t})\) and \(\mathscr {K}(t,t',x,x')=\mathcal {K}(t-t',x,x')\). We have that
where \(\partial _0=\partial _t\), \(\partial '_0=\partial _{t'}\), \(\partial _i=\partial _{x^i}\) and \(\partial _i'=\partial _{x'^i}\) for \(1\le i \le d\). By theorem 2.2 the distribution \(\mathscr {K}(t,t',x,x')\) is smooth in a neighbourhood of the diagonal \(\mathtt {\Delta }\subset M\times M\). Moreover, \(\mathcal {K}(t-t',\cdot ,\cdot )\) is the kernel of a symmetric operator on X with respect to the real inner product \((\cdot ,\cdot )\) and therefore satisfies
This implies
Using product rules, we have
which gives
That is
Applying Eq. (8) to (10), we have
From Eqs. (8) and (9), we have
In other words, \((T_\mathrm {ren})_{0i}=(T_\mathrm {ren})_{i0}=0\) for \(1\le i \le d\). Hence, equations (6), (11) and (12) show that \((T_\mathrm {ren})_{ij}\) is symmetric tensor on M. Moreover,
and
Since \(\partial _k\partial ^k \mathscr {K}-m^2\mathscr {K}=0\), we have
When \(i=0\),
Also, we have
which yields the expressions for the renormalised stress–energy tensor. \(\square \)
Theorem 2.6
The renormalised stress–energy tensor is divergence-free and independent of t.
Proof
Let \(\mathcal {K}(t,x,y)\) and \(\mathscr {K}(t,t',x,y)\) be the same as the in the previous theorem. Recall that the shorthand expression of (3) is given by
Then, we have
Now we use product rules to get
In particular, we have
Hence, one has
which means \((T_\mathrm {ren})_{ij}\) is independent of time.
We use the same trick to show that
Now use the symmetric properties of \(\mathscr {K}\) in Eqs. (7) and (8), we have
The function \(\mathscr {K}\) solves the Klein–Gordon equation, i.e.
Altogether, we have
which shows the divergence-free property of the renormalised stress–energy tensor. \(\square \)
3 The Relative Trace-formula and the Casimir Energy
As mentioned in the introduction the renormalised stress–energy tensor \(T_\mathrm {ren}(x)\) becomes unbounded and non-integrable when x approaches the boundary of obstacles [14, 16, 19, 38]. This prevents us from defining a renormalised total energy. One way to circumvent the problem is to introduce the relative framework of [34] which we now summarise. The main advantage of this construction is that it completely avoids ill-defined quantities and the need for regularisation.
Relative quantities are defined with respect to different obstacle configurations where instead of \(\mathcal {O}\) only one obstacle \(\mathcal {O}_j\) is present, i.e. where \(\mathcal {O}\) is replaced by \(\mathcal {O}_j\). If an operator is defined with respect to such a configuration we use the subscript \(\mathcal {O}_j\), and we use the subscript \(\mathcal {O}\) to distinguish it from the original configuration. For instance the renormalised stress–energy tensor \(T_\mathrm {ren}\) in Theorem 2.4 will be denoted by \((T_\mathrm {ren})_\mathcal {O}\), which shows its dependence on the presence of obstacles \(\mathcal {O}=\mathcal {O}_1 \cup \dots \cup \mathcal {O}_N\). Similarly, \((T_\mathrm {ren})_{\mathcal {O}_j}\) denotes the renormalised stress–energy tensor when only obstacle \(\mathcal {O}_j\) is present and \(\Delta _{\mathcal {O}_j}\) denotes the Laplace operator with Dirichlet boundary condition on \(\partial \mathcal {O}_j\). The operator \(H_{\mathcal {O}_j}\) is defined in the same way.
Now we introduce a relative operator
More generally one defines the relative operator associated with a polynomially bounded function \(f:[0,\infty ) \rightarrow \mathbb {R}\), i.e.
Since all our operators are densely defined operators on the same Hilbert space \(L^2(\mathbb {R}^d)\) this combination makes sense. As a consequence of f being polynomially bounded, the space \(C^\infty _0(X)\) is in the domain of \(D_f\) and therefore \(D_f\) has a distribution kernel in \(\mathcal {D}'(X \times X)\).
To simplify our analysis later, we absorb the dependence of mass m in the functional f. We could write \(f_m\) to emphasise the dependence on m, but the later analysis will not be affected by m. Therefore, we omit the m-dependence and have
The main result of [34] is that for a large class of functions f, including the functions \(f(\lambda )=\sqrt{\lambda ^2}\) and \(f(\lambda )=\sqrt{\lambda ^2+m^2}\) which are of interest in our context, the operator \(D_f\) is trace-class and its trace can be computed by integrating the kernel on the diagonal. We now explain the precise statement of this result and its relation to the determinant of the boundary layer operator.
In the following, we will denote by \(G_{\mathcal {O},\lambda } \in \mathcal {D}'(X \times X)\) the distributional kernel of the resolvent \(R_{\mathcal {O},\lambda }=(-\Delta _\mathcal {O}- \lambda ^2)^{-1}\). The kernels \(G_{\mathcal {O}_j,\lambda }=\breve{R}_{\mathcal {O}_j,\lambda }\) and \(G_{\text {free},\lambda }=\breve{R}_{\text {free},\lambda }\) are defined in an analogous way. By elliptic regularity, these Green’s distributions are smooth away from the diagonal \(x=y\).
Recall that for \(\lambda \in \mathbb {C}\) we have the single-layer potential operator
given by
where \(\mathrm {d}\sigma \) is the surface measure. Let \(\gamma :H^{s+\frac{1}{2}}(\mathbb {R}^d)\rightarrow H^s(\partial \mathcal {O})\) be the Sobolev trace operator for \(s>0\). Properties of the Sobolev trace operator can be found, for instance, in [45]. One can write the above also as \(\mathcal {S}_\lambda =G_{\text {free},\lambda }\circ \gamma ^*\). Restriction of \(\mathcal {S}_{\lambda }\) to the boundary defines an operator
The operator \(Q_\lambda \) is known to have the following properties.
Since the boundary \(\partial \mathcal {O}\) consists of N connected components \(\partial \mathcal {O}_j\), we therefore have an orthogonal decomposition \(L^2(\partial \mathcal {O})=\oplus _{j=1}^N L^2(\partial \mathcal {O}_j)\). Let \(p_j: L^2(\partial \mathcal {O}) \rightarrow L^2(\partial \mathcal {O}_j)\) be the corresponding orthogonal projection. Now we have
In short, \(\tilde{Q}_\lambda \) and \(T_\lambda \) are, respectively, the diagonal and off diagonal part of \(Q_\lambda \). Now let \(\mathfrak {D}_\nu \) to be a sector in the upper half plane and it is given by
where it suffices to consider \(0<\epsilon < \frac{\pi }{2}\) for our applications.
The operator \(Q_\lambda \) is invertible for \({\text {Im}}(\lambda )>0\). Moreover \(Q_\lambda {\tilde{Q}}_\lambda ^{-1} -1\) is trace-class and the Fredholm determinant \(\det (Q_\lambda {\tilde{Q}}_\lambda ^{-1})\) of \(Q_\lambda {\tilde{Q}}_\lambda ^{-1}\) can be used to define a function
which is holomorphic in the upper half space and for some \(\delta '>0\) satisfies the bound
See Theorem 1.6 of [34] for the above bound in the sector of the form \(\{ z \mid {\text {Im}}(z) \ge b |{\text {Re}}(z)| \}\) for some \(\delta >0\).
Assume \(0<\theta < \frac{\pi }{2}\) and let \(\mathfrak {S}_\theta \) be the open sector
Let \(\mathcal {P}_\theta \) be the set of functions that are holomorphic and polynomially bounded in \(\mathfrak {S}_\theta \).
Definition 3.1
The space \(\tilde{\mathcal {P}}_\theta \) is defined to be the space of functions f such that \(f(\lambda )=g(\lambda ^2)\) for some \(g\in \mathcal {P}_{\theta +\epsilon }\) for some \(\epsilon >0\) and there exists \(a>0\) such that \(f = O(|z|^a)\) if \(|z|<1\)
We then have the following theorem (see Fig. 2 for the contour).
Theorem 3.2
([34], Theorems 1.6 and 1.7). Let \(f\in \tilde{\mathcal {P}}_\theta \). Then, \(D_f\) extends to a trace-class operator with integral kernel that is smooth on X and has continuous inner and outer boundary values on \(\partial \mathcal {O}\). The trace of \(D_f\) equals the integral over the diagonal of its integral kernel over \(\mathbb {R}^d\). Moreover, it is equal to
In particular, choosing the function \(f(\lambda ) = \sqrt{\lambda ^2 + m^2}-m\) one obtains \(D_f = H_\mathrm {rel}\) and therefore \(H_\mathrm {rel}\) is trace-class with trace equal to
This follows immediately by deforming the contour integral using the exponential decay of \(\Xi \) in the upper half plane, considering the branch cut of \(\sqrt{z+m^2}\) at \(\mathrm {i}\sqrt{m}\).
Definition 3.3
The relative stress–energy tensor is the renormalised stress–energy tensor in the relative setting and it is defined as
where \((T_\mathrm {ren})_\mathcal {O}\) is the renormalised stress–energy tensor for obstacle \(\mathcal {O}\) and \((T_\mathrm {ren})_{\mathcal {O}_i}\) is the renormalised stress–energy tensor for obstacle \(\mathcal {O}_i\), which are defined at the beginning of this section.
Remark 3.4
One can also consider other versions of a relative stress–energy tensor. For instance, one can define \(\mathcal {O}_A=\mathcal {O}_1\cup \cdots \cup \mathcal {O}_j\) and \(\mathcal {O}_B=\mathcal {O}_{j+1}\cup \cdots \cup \mathcal {O}_N\) for some \(1\le j < N\), dropping the connectedness requirement of obstacles and work with
The corresponding energy encodes the amount of work needed to separate the two obstacle configurations A and B. This quantity can also be expressed in terms of \(T_\mathrm {rel}\) for A and B. It is easy to see that this equals
and therefore working with \(T_\mathrm {rel}\) only does not result in a loss of generality.
Theorem 3.5
\(T_\mathrm {rel}\) is smooth on X and extends smoothly to \(\overline{\mathcal {E}}\) as well as to \(\overline{\mathcal {O}}\). The function \((T_\mathrm {rel})_{00}\) is integrable on \(\mathbb {R}^d\).
Proof
By Theorem 5, we have
The theorem was shown in [34] for the part \(\frac{1}{2} \breve{H}_\mathrm {rel}|_\mathtt {\Delta }\) and the same method of proof can also be applied to the second term. We provide the full details here for the sake of completeness. We use two estimates proved in [34] which we now explain. Recall that the relative resolvent is given by
and \(R_{\mathcal {O},\lambda }=(-\Delta _\mathcal {O}-\lambda ^2)^{-1}\), \(R_{\text {free},\lambda }=(-\Delta _\text {free}-\lambda ^2)^{-1}\), and \(R_{\mathcal {O}_i,\lambda }=(-\Delta _{\mathcal {O}_i}-\lambda ^2)^{-1}\). For the integral kernel \(\breve{R}_{\mathrm {rel},\lambda }\) we write \(G_{\mathrm {rel},\lambda }\).
As shown in [34] in the proof of Theorem 1.5, the integral kernel \(G_{\mathrm {rel},\lambda }\) of \(R_{\mathrm {rel}, \lambda }\) is smooth up to the boundary on \(\overline{\mathcal {E}}\) as well as to \(\overline{\mathcal {O}}\) and its \(C^k\)-norms on compact subsets \(K \subset \overline{\mathcal {E}}\times \overline{\mathcal {E}}\) satisfy the bound
for some \(\ell \ge 0\) for \(\lambda \) in the sector containing the imaginary axis.
We consider the operator
A similar bound holds for \(K \subset \overline{\mathcal {O}} \times \overline{\mathcal {E}}\), \(K \subset \overline{\mathcal {E}} \times \overline{\mathcal {O}}\), and \(K \subset \overline{\mathcal {O}} \times \overline{\mathcal {O}}\). The \(\log \)-factor in the estimate is only needed in dimension two. It then follows that \((H^{-1})_\mathrm {rel}\) has an integral kernel that extends smoothly to \(\overline{\mathcal {E}}\) as well as to \(\overline{\mathcal {O}}\) (i.e. smooth up to the boundary).
Similarly, \(\breve{H}_\mathrm {rel}\) is also smooth up to the boundary. Hence, by Theorem 5 and Definition 3.3, we obtain the smoothness of \(T_\mathrm {rel}\) up to the boundary. In order to show integrability, we recall an estimate for the diagonal of the integral kernel of the resolvent difference, in particular [34]*Theorem 2.9, Equ. (21) and (22).
Let \(\lambda \in \mathfrak {D}_\nu \) (See (15), i.e. a sector in the upper half plane) and \(k_{\mathcal {O},\lambda }(x,y)\) denote the integral of \(R_{\mathcal {O},\lambda }-R_{\text {free},\lambda }\), then we have
which implies
and
Let \(\rho (x)=\mathrm {dist}(x,\partial \mathcal {O})\). By Lemma A.2 of [34] we have for \(\rho (x)\lambda \le 1\),
and \(\ln (\rho (x)\lambda )\) disappears when \(|\alpha |+|\beta |\ge 1\). Moreover, for \(\rho (x)\lambda > 1\), one has
This shows
and
for some positive \(C_1\) and \(C_2\). By Corollary 2.8 of [34] we have
Now we can conclude that
and
Let \(h_\mathrm {rel}=(\breve{H}^{-1})_\mathrm {rel}|_\mathtt {\Delta }\). That is \(h_\mathrm {rel}(x)=\frac{2}{\pi }\int _m^\infty \frac{\lambda }{\sqrt{\lambda ^2-m^2}} G_{\mathrm {rel},\mathrm {i}\lambda }(x,x)\mathrm {d}\lambda \). For \(m\ne 0\), one can use Eqs. (17) and (18) to get the decay rate of \(h_\mathrm {rel}(x)\) by integrating over \(\lambda \). That is, for \(\rho (x)>1\), \(h_\mathrm {rel}(x)\) has a decay of \(\rho ^{-k}(x)e^{-Cm\rho (x)}\) with both k and C being positive. This warrants the integrability of \(\Delta h_\mathrm {rel}(x)\) for \(d\ge 2\), \(m\ne 0\) and \(\rho (x)>1\). Therefore, we will now focus on the case \(m=0\). By integrability of the integrand we can interchange differentiation and integration and therefore get
Again, by integrating over \(\lambda \), we have
Let \(\Omega \subset \mathbb {R}^d\) be an open set with \(\mathrm {dist}(\Omega ,\partial \mathcal {O})>1\) and \(\varphi \in C_0^\infty (\mathbb {R}^d)\) satisfy \(0\le \varphi \le 1\) and \(\varphi =1\) in a neighbourhood of \(\mathbb {R}^d\backslash \Omega \). Then, we have the decomposition
The integrability of \(\Delta (\varphi h_\mathrm {rel})\) in Eq. (20) follows from the smoothness property of the kernel of \((\breve{H}^{-1})_\mathrm {rel}\) at the diagonal as shown above. Thereby the integrability of \(\Delta h_\mathrm {rel}\) on \(\mathbb {R}^d\) is equivalent to the one of \(\Delta [(1-\varphi )h_\mathrm {rel}]\) on \({{\,\mathrm{supp}\,}}(1-\varphi )\). This follows immediately from Eq. (19). Therefore, we have shown the integrability of \(\Delta ((\breve{H}^{-1})_\mathrm {rel}|_\mathtt {\Delta })\) on \(\mathbb {R}^d\). Finally, the integrability of \(T_\mathrm {rel}\) follows from Theorem 2.4 and the definition of relative stress–energy tensor (i.e. Definition 3.3). \(\square \)
Definition 3.6
The relative energy is defined as
Theorem 3.7
We have the equality
Proof
We have
where \(B_r\) is the ball of radius r centred at the origin. As \(\Delta ((\breve{H}^{-1})_\mathrm {rel}|_\mathtt {\Delta })\) has only jump-type discontinuity across \(\partial {X}\), we can apply the divergence theorem to the integral on \(B_r\) for sufficiently large r. That is
where \((\cdot )_+\) and \((\cdot )_-\) are the exterior and interior limits, respectively. From equation (19), we also have
which implies the contribution of the integral over \(\partial B_r\) vanished as \(r \rightarrow \infty \) and therefore
From Eq. (16), we then have
This shows
We start by showing that for the restrictions to \(\partial \mathcal {O}_q\) we have the following identity
To see this, we temporarily denote by k(x, y) the integral kernel of \(\left( H_{\mathcal {O}}^{-1} - H_{\mathcal {O}_q}^{-1} \right) \). This kernel vanishes \(\mathcal {O}_q \times \mathcal {O}_q\) and the interior normal derivative therefore vanishes trivially. We therefore only need to concern ourself with the exterior normal derivative. As shown in the proof of Theorem 3.5 the kernel \((H^{-1})_\mathrm {rel}\) is smooth. One concludes from this, using Theorem 2.2, that \(\left( H_{\mathcal {O}}^{-1} - H_{\mathcal {O}_q}^{-1} \right) \) is smooth near \(\mathcal {O}_q \times \mathcal {O}_q\) in \(\overline{\mathcal {E}}\times \overline{\mathcal {E}}\). The kernel k satisfies Dirichlet boundary conditions in both variables in the sense that \(k(x,y)=0\) if \(y \in \partial \mathcal {O}_q\) or if \(x \in \partial \mathcal {O}_q\). By the chain rule \(\left[ \left. \partial _\nu \left( \breve{H}_{\mathcal {O}}^{-1} - \breve{H}_{\mathcal {O}_q}^{-1} \right) \right| _\mathtt {\Delta }\right] _\pm (x)\) equals \((\partial _{\nu ,x} k(x,y) + \partial _{\nu ,y} k(x,y))|_{y=x}\), which therefore vanishes on \(\partial \mathcal {O}_q\). We then have
By Theorem 2.2, we know that \(\left. \partial _\nu \left( \breve{H}_{\mathcal {O}_p}^{-1}- \breve{H}^{-1}_\text {free}\right) \right| _\mathtt {\Delta }\) is smooth across the boundary \(\partial \mathcal {O}_q\) for \(p \ne q\), which implies
Hence, we have \(\int _{\mathbb {R}^d} \Delta ((\breve{H}^{-1})_\mathrm {rel}|_\mathtt {\Delta }) \mathrm {d}x=0\), which verifies the first Eq. in (21). The representation of the trace of \(H_{\mathrm {rel}}\) in terms of \(\Xi \) follows via \( \Xi '(\lambda ) = -2\lambda \mathrm {Tr}\left( R_{\mathrm {rel},\lambda } \right) \) from Theorem 4.2 in [34], Theorem 3.2 and equation (16). \(\square \)
4 Estimates on the Relative Resolvent
In preparation for the proof of the variational formula, we will need some additional estimates on the relative resolvent \(R_{\mathrm {rel},\lambda }\), which we collect in this section. We have the following well-known layer potential representation (see for example [34]*Equ. (19))
which gives
Also, let \(\rho \) be the function defined as
For \(i,j,k\in \mathbb {Z}\), let \(\rho _{i,j;k}\) be the functions defined as
The following proposition partially follows from [34, Proposition 2.1] and extends [34, Proposition 2.2]. Let us summarise some mapping properties of the layer potential operators.
Proposition 4.1
Let \(d\ge 2\) and \(-\frac{3}{2}\le s \le -\frac{1}{2}\). Then, we have the following properties of \(\mathcal {S}_{\lambda }\) for \(\lambda \in \mathfrak {D}_\nu \) (See (15), i.e. a sector in the upper half plane).
-
(1)
If \({\text {Re}}(\lambda ) \ne 0\) then \(\Vert \mathcal {S}_{\lambda }\Vert _{H^{s}(\partial \mathcal {O})\rightarrow L^2(\mathbb {R}^d)}\le C_s \frac{\sqrt{1+|\lambda |^2}}{{\text {Re}}(\lambda ){\text {Im}}(\lambda )}\).
-
(2)
Let \(\chi \in C^\infty (\mathbb {R}^d)\) be supported in V, where \(V\subset \mathbb {R}^d\) is an open set with smooth boundary and \(\mathrm {dist}(V,\partial \mathcal {O})>0\). For \(0<\delta < \mathrm {dist}(V,\partial \mathcal {O})\), we have that \(\chi \mathcal {S}_{\lambda }: H^{s}(\partial \mathcal {O})\rightarrow L^2(V)\) is a Hilbert–Schmidt operator whose Hilbert–Schmidt norm is bounded by
$$\begin{aligned} \Vert \chi \mathcal {S}_{\lambda }\Vert _{HS(H^{s}(\partial \mathcal {O})\rightarrow L^2(\mathbb {R}^d))} \le C_{\delta ,\nu ,k}\, \sqrt{\rho ({\text {Im}}\lambda )} e^{-\delta {\text {Im}}\lambda } \end{aligned}$$which implies
$$\begin{aligned} \Vert \mathcal {S}_{\lambda }\Vert _{H^{s}(\partial \mathcal {O})\rightarrow L^2(\mathbb {R}^d)} \le C_{\delta ,\nu ,s}\left( 1+\sqrt{\rho ({\text {Im}}\lambda )}\right) . \end{aligned}$$ -
(3)
\(\Vert Q_\lambda ^{-1}\Vert _{H^{\frac{1}{2}}(\partial \mathcal {O}) \rightarrow H^{-\frac{1}{2}}(\partial \mathcal {O})} \le C_\nu (1+({\text {Im}}\lambda )^2)^2.\)
Proof
Recall that we could also write the single-layer potential operator as \(\mathcal {S}_\lambda =G_{\text {free},\lambda }\circ \gamma ^*\). Note that \(\gamma ^*: H^{s}(\partial \mathcal {O}) \rightarrow H^{s-\frac{1}{2}}(\mathbb {R}^d)\), \(G_{\text {free},\lambda }: H^{s-\frac{1}{2}}(\mathbb {R}^d) \rightarrow H^{s+\frac{3}{2}}(\mathbb {R}^d)\) and the natural inclusion map \(\iota : H^{s+\frac{3}{2}}(\mathbb {R}^d) \rightarrow L^2(\mathbb {R}^d)\) are bounded maps. For \(-\frac{3}{2}\le s \le -\frac{1}{2}\), their norms are bounded by
The second bound follows from the spectral representation of \(-\Delta _\text {free}\). Therefore, we conclude the proof for part (1).
Part (3) follows immediately from the bound on the Dirichlet-to-Neumann operator in [34]. For part (2), let \(\mathcal {O}\subset V'\subset \mathbb {R}^d\) be an open set with \(\mathrm {dist}(V,V')>0\) and choose \(0<\delta <\mathrm {dist}(V,V')\). In particular, \(0<\delta <\mathrm {dist}(V,\partial \mathcal {O})\). Let \(k\in \mathbb {N}\). Now we have from [34] the estimate
where \(\Delta =\Delta _x+\Delta _y\) is the Laplace operator on \(V\times V'\). In other words, we have
Now by taking Sobolev trace, we have
Statement (2) follows, using the properties of the Hilbert–Schmidt norm (for example Sect. A.3.1 in [43]). Since \(-\frac{3}{2}\le s \le -\frac{1}{2}\), we have
where \(E_s=(\sqrt{-\Delta _{\partial \mathcal {O}}}+1)^s\). To prove the last property of \(\mathcal {S}_{\lambda }\), we are left with proving the bound for \(\phi \, \mathcal {S}_{\lambda }\), where \(\phi \in C^\infty (V'')\) is supported in \(V''\) with \(\mathcal {O}\subset V'\subset V''\), \(\phi (x)=1\) on \(\mathcal {O}\) and \(1-\phi \) is supported in V. It suffices to bound \(\phi \, G_{\text {free},\lambda } \, \phi : H^{s}(\mathbb {R}^d) \rightarrow L^2(\mathbb {R}^d)\) for \(-\frac{3}{2}\le s \le -\frac{1}{2}\). Note that the explicit kernel of \(\phi G_{\text {free},\lambda }\phi \), denoted by \(K_\lambda (x,y)\), is given by
where \(H^{(1)}_\nu \) is the Hankel function. By the Schur test and estimates on the free resolvent (see Appendix A for details), we have
Taking the adjoint, we get the same bound for \(H^{-2}\rightarrow L^2\). Finally, using the estimate (24) with \(\chi =1-\phi \), we obtain
\(\square \)
Remark 4.2
In the case of odd dimensions, an easier argument can be used to provide a weaker estimate that is still sufficient for the purposes of this paper. Using the strong Huygens principle, one deduces ( [47, Sect. 3.1])
which implies
This gives \(\Vert \phi \, \mathcal {S}_{\lambda }\Vert _{H^{s} \rightarrow L^2}\le C_{\phi ,\nu }(1+{\text {Im}}\lambda )\) for \(-\frac{3}{2}\le s \le -\frac{1}{2}\). Applying inequality (24) for \(\chi =1-\phi \) and using \(\frac{-2s+1}{2} \ge 1\), we get
5 Hadamard Variation Formula and Equivalence of Approaches (2) and (3)
In this section, we will show that a version of Hadamard variation of the renormalised stress–energy tensor \((T_\mathrm {ren})_{ij}\) defined in Sect. 2.2 is related to the Hadamard variation formula for the resolvent. We will follow the methods developed in [39, 41, 48, 49] and derive a Hadamard variation formula for the relative resolvent, then apply it to the relative stress–energy tensor. Short proofs of Hadamard variation formula can be found in [50, 51] for the case of bounded domains. Since we are dealing with an unbounded domain, we extend theory to non-compact setting for the special case of boundary translation flows (see Definition 5.1) in Theorem 5.3, which are sufficient for our purposes.
5.1 Hadamard Variation Formula
Let U be a possibly unbounded open subset in \(\mathbb {R}^d\) with smooth compact boundary and Y be a smooth vector field on \(\mathbb {R}^d\). The flow, denoted by \(\varphi _\epsilon \) and generated by Y, gives, for small \(|\epsilon |\), a one-parameter family of smooth manifolds in \(\mathbb {R}^d\), which is denoted by \(U_\epsilon =\varphi _\epsilon (U)\). For our application, U would be either \(\mathcal {E}\) or \(\mathcal {O}\) as defined in Sect. 1 and we will only consider flows that generate rigid translations of obstacles.
Definition 5.1
A flow \(\varphi _\epsilon \) associated with vector field Y is called a boundary translation if Y is locally constant near \(\partial U\).
Following Peetre’s derivation of Hadamard variation formula, we define the following variational derivative.
Definition 5.2
Let \(u_\epsilon \) be a (weak-\(*\)) \(C^1\) curve of functions in \(\mathcal {D}'(U_\epsilon )\). The variational derivative at \(\epsilon =0\) is given by
where \( \theta _Y u=\lim _{\epsilon \rightarrow 0}\frac{\varphi _\epsilon ^*u_\epsilon -u_0}{\epsilon } \) is the variational derivative defined by Garabedian–Schiffer’s in [39]. Here, the derivative \(\theta _Y u\) is understood in the weak-\(*\)-sense and the action of the vector field Y is understood in the sense of distribution.
Note that \(\theta _Y u\) is different from the standard (conventional) Lie derivative, as \(u_\epsilon \) may have an additional dependence on \(\epsilon \). In fact, the last term, Yu in Eq. (26), should be understood as the conventional Lie derivative of \(u_0\).
The derivation of Hadamard variational formula for the resolvent associated with the Dirichlet Laplace operator usually starts with the energy quadratic form (see [39, 41, 49]). The energy quadratic form associated with the Dirichlet Laplacian \(-\Delta _{U_\epsilon }\) on \(U_\epsilon \) is given by
where \(\lambda \in \mathbb {C}\) and \({\text {Im}}{\lambda }>0\). Using the diffeomorphism flow \(\varphi _\epsilon \), one can pull-back the quadratic form from \(U_\epsilon \) to \(U_0=U\), which gives a one-parameter family of quadratic forms on U, i.e.
Note that the energy form (27) and the induced energy forms (28) are related by
The operator associated with the energy form (27) is the Dirichlet Laplace operator, whereas \(\tilde{E}_\epsilon \) defines a one-parameter families of elliptic operators on U for sufficiently small \(\epsilon \). Let \(G_{\lambda ,\epsilon } = G_{U_\epsilon ,\lambda }\) be kernel of the resolvent for the Dirichlet Laplacian on \(U_\epsilon \). Then, from Eqs. (27), (28) and (29), we have in the sense of distributions
where y is in the interior of \(U_\epsilon \) and by elliptic regularity \(G_{\lambda ,\epsilon }(x,y)\) is then smooth at the boundary and therefore it makes sense to define its boundary value.
As we would like to study the variation of resolvents, it is convenient to consider the variation as distributions on \(U \times U\). In other words, for \(R_\epsilon \in \mathcal {D}'(U_\epsilon \times U_\epsilon )\) and \(u, v\in C_0^\infty (U)\), we have, from the Schwartz kernel theorem,
where the first two brackets correspond to the pairing between distributions and test functions while the third and the forth brackets are the \(L^2\) inner products on \(U_\epsilon \) and U, respectively. It is not hard to see that the existence of the variational derivative of \(R_\epsilon \) in the sense of (26) is implied by the existence of \(\theta _Y R\). From Eq. (31), the existence of \(\theta _Y R=\lim _{\epsilon \rightarrow 0}\frac{\varphi _\epsilon ^*R_ \epsilon -R_0}{\epsilon }\) in the weak-\(*\)-sense is equivalent to the existence of the standard derivative of \(r(\epsilon )=\langle u , \varphi _{\epsilon }^{*} R_ \epsilon \varphi _{-\epsilon }^{*} v \rangle \) with respect to \(\epsilon \) for all \(u, v\in C_0^\infty (U)\).
The following theorem is the well-known Hadamard variation formula for the resolvent extended to case of unbounded domains in our setting.
Theorem 5.3
Let \(\varphi _\epsilon \) be a boundary translation flow, then the variational derivative of \(R_{\lambda ,\epsilon }\) exists in the weak-\(*\) topology. Let \(G_{\lambda ,\epsilon }(x,y)\) be the kernel of \(R_{\lambda ,\epsilon }\), then its variational derivative is given by
Proof
Firstly, we prove the existence of \(\theta _Y R_\lambda \) in weak-\(*\)-sense. We know that, from Eq. (23), \(\varphi _{\epsilon }^{*} R_{\lambda ,\epsilon } \varphi _{-\epsilon }^{*}-R_{\text {free},\lambda }=-\varphi _{\epsilon }^{*} \mathcal {S}_{\lambda ,\epsilon } Q_{\lambda ,\epsilon }^{-1}\mathcal {S}_{\lambda ,\epsilon }^t \varphi _{-\epsilon }^{*}\). We will therefore establish differentiability of
for any fixed test functions \(g,f \in C^\infty _0(U)\) and compute its derivative. In the last term of Eq. (33), the operator \(\mathcal {S}_{\lambda ,\epsilon }^t \) is the transpose operator to \(\mathcal {S}_{\lambda ,\epsilon } \) obtained from the real inner product, i.e. \(\mathcal {S}_{\lambda ,\epsilon }^t f = \overline{\mathcal {S}_{\lambda ,\epsilon }^t \overline{f}}\). Since the free resolvent is smooth off the diagonal and
the kernel of \(\mathcal {S}_{\lambda ,\epsilon }\) is smooth on \(U \times \partial U\). Here, \(f \in C^\infty (\partial U)\), \(g \in C_0^\infty (U)\) and \(G_{\text {free},\lambda }\) is the kernel of the free resolvent. To establish differentiability by the product rule, it is sufficient to prove the existence of \(\frac{\mathrm {d}}{\mathrm {d}\epsilon }\left( \varphi _{\epsilon }^{*}\mathcal {S}_{\lambda ,\epsilon } \varphi _{-\epsilon }^{*} \right) \) in the \(C^\infty \)-topology of integral kernels on \(U \times \partial U\), and the existence of \(\frac{\mathrm {d}}{\mathrm {d}\epsilon }\left( \varphi _{\epsilon }^{*} Q_{\lambda ,\epsilon }^{-1} \varphi _{-\epsilon }^{*}\right) \) at \(\epsilon =0\) in the weak-\(*\)-sense. The free resolvent kernel is smooth off the diagonal and therefore the above formula for \(\varphi _{\epsilon }^{*}\mathcal {S}_{\lambda ,\epsilon } \varphi _{-\epsilon }^*\) shows differentiability of the smooth kernel in the parameter \(\epsilon \) at \(\epsilon =0\) and the classical sense. We are thus left with proving the existence of \(\frac{\mathrm {d}}{\mathrm {d}\epsilon }\left( \varphi _{\epsilon }^{*} Q_{\lambda ,\epsilon }^{-1} \varphi _{-\epsilon }^{*}\right) \) at \(\epsilon =0\) in the weak-\(*\)-sense. Note that \(\varphi _{\epsilon }^{*} Q_{\lambda ,\epsilon }^{-1} \varphi _{-\epsilon }^{*}\) is a one-parameter family of maps from \(C^\infty (\partial U)\) to \(C^\infty (\partial U)\), i.e. the spaces do not depend on \(\epsilon \). Similar to Eq. (14), we have the splitting
where \(p_j\) are the orthogonal projections \(L^2(\partial U) \rightarrow L^2(\partial U_j)\) and \(\partial U_j, j=1,\ldots ,N\) are the connected components of \(\partial U\). Define \(\mathcal {Q}_{\lambda ,\epsilon }=\varphi _{\epsilon }^* Q_{\lambda ,\epsilon } \varphi _{-\epsilon }^*\), \(\tilde{\mathcal {Q}}_{\lambda ,\epsilon }=\varphi _{\epsilon }^* \tilde{Q}_{\lambda ,\epsilon } \varphi _{-\epsilon }^*\) and \(\mathcal {T}_{\lambda ,\epsilon }=\varphi _{\epsilon }^* T_{\lambda ,\epsilon } \varphi _{-\epsilon }^*\). By the definition of \(\tilde{Q}_{\lambda ,\epsilon }\), we have
where \(f\in C^\infty (\partial U)\) and \(f_i=p_i f\). As \(\varphi _\epsilon \) is a boundary translation flow, we obtain the following relationships.
Now, from the decomposition of \(Q_\lambda \) in Eq. (14), one obtains
The family \(\mathcal {T}_{\lambda ,\epsilon }\) is a differentiable family of smoothing operators for sufficiently small \(\epsilon \) (i.e. no obstacles are overlapping) and its derivative in \(\epsilon \) therefore, by Taylor’s remainder estimate, exits as a family of smooth kernels. Hence, \(\mathcal {Q}_{\lambda ,\epsilon }\) is differentiable in \(\epsilon \) at \(\epsilon =0\) as a family of operators from \(H^s(\partial U)\) to \(H^{s+1}(\partial U)\) for any \(s \in \mathbb {R}\). We now use that \(\mathcal {Q}_{\lambda ,0}\) is invertible and the inverse \(\mathcal {Q}_{\lambda ,0}^{-1}\) is a pseudodifferential operator of order one, and maps \(H^s(\partial U)\) to \(H^{s-1}(\partial U)\). Since the space of invertible operators is open the inverses \(\mathcal {Q}_{\lambda ,\epsilon }^{-1}\) exist near \(\epsilon =0\) as maps from \(H^s(\partial U)\) to \(H^{s-1}(\partial U)\). Hence, \(\mathcal {Q}_{\lambda ,\epsilon }^{-1}\) is differentiable in \(\epsilon \) at \(\epsilon =0\) as a family of operators from \(H^s(\partial U)\) to \(H^{s-1}(\partial U)\). In particular the derivatives \(\frac{\mathrm {d}}{\mathrm {d}\epsilon }\mathcal {Q}_{\lambda ,\epsilon }\) and \(\frac{\mathrm {d}}{\mathrm {d}\epsilon } \mathcal {Q}_{\lambda ,\epsilon }^{-1}\) exist in the weak-\(*\)-sense. Hence, the variational derivative of \(R_{\lambda ,\epsilon }\) exists in the weak-\(*\) sense and it is given by
where \(Y'\) means the action of Y on the second variable. It remains to compute the derivative. To do this, we consider the inhomogeneous problem
Let \(\nu \) be the exterior unit normal of U, y be an interior point of U such that \(\varphi _{\epsilon }(y)=\tilde{y}\) and \(e(u,v)=\nabla u\cdot \nabla v -\lambda ^2 uv\). By applying
to equation (35), taking derivative in \(\epsilon \) of Eq. (35) and using Eq. (30) and Peetre’s computations [41], one has
Let \(v(x)=G_{\lambda ,0}(x,z)\), we have
Using the symmetric property of \(G_{\lambda ,\epsilon }(x,y)=G_{\lambda ,\epsilon }(y,x)\) and Eqs. (34), (36) and (37), we obtain
Using the boundary conditions \(G_{\lambda ,\epsilon }(\varphi _\epsilon (x),y)=0\) for \(x \in \partial U\), we arrive at the Hadamard variation formula for the Dirichlet resolvent.
\(\square \)
5.2 Application of the Hadamard Variation Formula to the Relative Resolvent
We now apply the Hadamard variation formula to our setting with finitely many obstacles, combining the above formulae for \(U=\mathcal {E}\) and \(U=\mathcal {O}\). We have from Eq. (32)
where \((\cdot )_+\) means taking limits from \(\mathcal {E}\) to the boundary \(\partial \mathcal {E}\) and \((\cdot )_-\) means taking limits from \(\mathcal {O}\) to the boundary \(\partial \mathcal {O}\). We will now use the variational formula for the relative resolvent to prove the following theorem. Hence, we define \(\mathcal {O}_\epsilon = \varphi _\epsilon (\mathcal {O})\) and similarly \(\mathcal {O}_{j,\epsilon }= \varphi _\epsilon (\mathcal {O}_j)\). In this way, we can define the relative resolvent
and its integral kernel \(G_{\mathrm {rel},\lambda ,\epsilon }\) depending on the parameter \(\epsilon \).
Theorem 5.4
Let \(\varphi _\epsilon \) be a boundary translation flow, \(d\ge 2\) and \(\lambda \in \mathfrak {D}_\nu \). Then, \(R_{\mathrm {rel},\lambda ,\epsilon }\) is for each \(\lambda \in \mathfrak {D}_\nu \) a \(C^{1}\) trace-class operator-valued function of \(\epsilon \) near the point \(\epsilon =0\). Its derivative \(\dot{R}_{\mathrm {rel},\lambda ,0}\) equals \(\delta _Y R_{\mathrm {rel},\lambda }\) and there exists \(\delta >0\) such that \(L^2\)-trace-norm of \(\dot{R}_{\mathrm {rel},\lambda ,0}\) is bounded by
Its kernel, \(\dot{G}_{\mathrm {rel},\lambda ,0}=\breve{\dot{R}}_{\mathrm {rel},\lambda ,0}\), is given by
for \(x,y \in \mathcal {E}\) or \(x,y \in \mathcal {O}\), where \(\nu _{\mathcal {E}_i}\) is the exterior unit normal of \(\mathcal {E}_i\) and \(\mathcal {E}_i = \mathbb {R}^d \setminus \overline{\mathcal {O}_i}\).
Proof
We start by showing that the family is Fréchet differentiable in the Banach space of trace-class operators with continuous derivative, i.e. the function is \(C^1\) as a trace-class operator-valued function on I, where is a fixed sufficiently small compact interval. As in the previous section we have \(R_{\mathrm {rel},\lambda }=-\mathcal {S}_\lambda (Q_\lambda ^{-1}-\tilde{Q}_\lambda ^{-1}) \mathcal {S}_\lambda ^t\). We can decompose its variation form as in the proof of Theorem 5.3
Then, we split the last term into a product of three terms, i.e. \(\mathcal {S}_{\lambda ,\epsilon } \varphi _{-\epsilon }^*\), \(\varphi _{\epsilon }^* (Q_{\lambda ,\epsilon }^{-1}-\tilde{Q}_{\lambda ,\epsilon }^{-1}) \varphi _{-\epsilon }^*\), and \(\varphi _{\epsilon }^* \mathcal {S}_{\lambda ,\epsilon }^t \). The first operator \(\mathcal {S}_{\lambda ,\epsilon } \varphi _{-\epsilon }^*\) is given by
Since \(\partial \mathcal {O}\) is a disjoint union of the components \(\partial \mathcal {O}_j\) the operators \(\mathcal {S}_{\lambda ,\epsilon } \varphi _{-\epsilon }^*\) splits into a sum \(\sum _j \mathcal {S}_{j,\lambda ,\epsilon } = \sum _j T_j(\epsilon ) \mathcal {S}_{j,\lambda }\), where \(\mathcal {S}_{j,\lambda }: L^2(\partial \mathcal {O}_j) \rightarrow H^1(\mathbb {R}^d)\) and \(T_j(\epsilon ): L^2(\mathbb {R}^d) \rightarrow L^2(\mathbb {R}^d)\) is the translation \(T_j(\epsilon ) f(x) = f(x - Z_j \epsilon )\). Here, we used the fact that Z is constant and equal to \(Z_j\) near \(\partial \mathcal {O}_j\) and that the free Green’s function is translation invariant. Since \(T_j(\epsilon )\) is \(C^1\) as a family of maps \(H^1(\mathbb {R}^d) \rightarrow L^2(\mathbb {R}^d)\), this shows that \(\mathcal {S}_{\lambda ,\epsilon } \varphi _{-\epsilon }^*\) and its adjoint are \(C^1\) as families of bounded operators \(L^2(\partial \mathcal {O}) \rightarrow L^2(\mathbb {R}^d)\). As shown in the proof of Theorem 5.3, the operator \(\varphi _{\epsilon }^*{\tilde{Q}}_{\lambda ,\epsilon } \varphi _{-\epsilon }^*\) is independent of \(\epsilon \) and therefore
The map \( \mathcal {T}_{\lambda ,\epsilon }\) has smooth integral kernel that depends smoothly on \(\epsilon \) for sufficiently small \(\epsilon \). This family is therefore \(C^1\) as a family of trace-class operators. We temporarily denote \(\varphi _{\epsilon }^* Q_{\lambda ,\epsilon } \varphi _{-\epsilon }^*\) by \(J_\epsilon \), As \(G_{\text {free},\lambda }\) is translation invariant, \(\varphi _{\epsilon }^* {\tilde{Q}}_{\lambda ,\epsilon } \varphi _{-\epsilon }^*\) is independent of \(\epsilon \) and hence we denote it by \({\tilde{J}}\). Then, by the above \(J_\epsilon = J_0 + r_\epsilon \) and \(J_0 - \tilde{J}\) is trace-class. Moreover, the remainder term \(r_\epsilon \) is of the form \(r_\epsilon = \dot{J}_0 \epsilon + \rho (\epsilon )\), where \(\Vert \rho (\epsilon ) \Vert _1 = o(\epsilon )\) as \(\epsilon \rightarrow 0\). By the Neumann series, we have
where again \(\Vert \tilde{\rho }(\epsilon ) \Vert _1 = o(\epsilon )\). We have used here that trace-class operators form an ideal in the algebra of bounded operators, and the norm estimate \(\Vert A B \Vert _1 \le \Vert A \Vert \Vert B \Vert _1\) holds. We conclude that \(\varphi _{\epsilon }^* (Q_{\lambda ,\epsilon }^{-1}-\tilde{Q}_{\lambda ,\epsilon }^{-1}) \varphi _{-\epsilon }^*\) is \(C^1\) as a family of trace-class operators.
We now compute the derivative in \(\epsilon \) of all the three terms. We obtain
For the derivative of the second term, we have as in Theorem 5.3
and
Therefore, the variation of the relative resolvent is given by
To estimate the trace-norm of \(\dot{R}_{\mathrm {rel},\lambda ,0}\), note that the first term in the above equation can be estimated by
where \(E_s=(\sqrt{-\Delta _{\partial \mathcal {O}}}+1)^s\) and \(\Vert E_{1} T_\lambda \Vert _1 \) is the trace-norm from \(H^{-\frac{1}{2}} \rightarrow H^{-\frac{1}{2}}\). Now by Proposition 4.1, we have
The third term in Eq. (41) can be bounded the same as the first term. For the second term, one can estimate it by
Combining Eqs. (42) and (43), one has
In order to prove the bound (39), it suffices to prove that
As in Theorem 5.3, \(\mathcal {T}_{\lambda ,\epsilon }\) is a smoothing operator that depends smoothly on \(\epsilon \), as long as we have \(\mathrm {dist}(\partial \mathcal {O}_{j,\epsilon },\partial \mathcal {O}_{k,\epsilon })>0\) for all pairs of obstacles. Since the obstacles are compact, \(\mathcal {T}_{\lambda ,\epsilon }\) is a smoothing operator on compact domains and hence it is also a trace-class operator from \(H^s(\partial \mathcal {O}) \rightarrow H^s(\partial \mathcal {O})\) for all \(s \in \mathbb {R}\). This proves the first part of the theorem.
Also, from Theorem 5.3, we know that \(\delta _Y G_{\mathrm {rel},\lambda ,0}\) exists in the weak-\(*\) sense. By Eqs. (33) and (34), we know that the kernel of \(\dot{R}_{\mathrm {rel},\lambda ,0}\) coincides with \(\delta _Y G_{\mathrm {rel},\lambda ,0}\). In other words, the variational derivative exists in a stronger sense. We can therefore apply the variation formula (38) to the relative resolvent, which gives
Since the interior parts of \(G_{\mathcal {O},\lambda ,\epsilon }\) and \(G_{\mathcal {O}_j,\lambda ,\epsilon }\) are the same, the interior contributions of \(\delta _Y G_{\mathcal {O},\lambda ,0}\) in the expression (38) cancel out with the ones of \(\delta _Y G_{\mathcal {O}_j,\lambda ,0}\). Therefore, we are left with only the exterior contributions as shown in Eq. (40). \(\square \)
Definition 5.5
Let M be a Riemannian manifold with A be an operator on \(L^2(M)\) with continuous kernel Schwartz kernel \(\breve{A}\in C(M \times M)\). For an open subset \(V \subset M\), we define the localised trace on V as
whenever the integral exists. If A is trace-class and has continuous kernel, the trace of A on \(L^2(M)\) then equals \(\mathrm {Tr}_M(A)\) by Mercer’s theorem. We also write \(\mathrm {Tr}_V(\breve{A})\).
Our next proposition gives a relationship between the trace of the variation of the relative resolvent with a local trace on the boundary.
Proposition 5.6
The trace of the variation of the relative resolvent on \(L^2(X)\) is also given by
Proof
Let \(\gamma _{\mathcal {E},\nu } : H^{\frac{3}{2}}(\mathcal {E})\rightarrow L^2(\partial \mathcal {O})\) be the Sobolev trace after taking the exterior normal derivative (\(\nu \) is the exterior normal vector field) and \(\mathcal {B}_{\mathcal {O},\lambda }: L^2(\partial \mathcal {O}) \rightarrow L^2(\mathbb {R}^d)\) is defined as \(\mathcal {B}_{\mathcal {O},\lambda }= R_{\mathcal {O},\lambda } \circ \gamma _{\mathcal {E},\nu }^*\). To see that this map is well defined, we note that
where \(\mathcal {S}_{\mathcal {O},\lambda }\) and \(Q_{\mathcal {O},\lambda }\) are the same as \(\mathcal {S}_\lambda \) and \(Q_\lambda \) defined in (22), but with emphasis on the dependence on \(\mathcal {O}\). Then, \(R_{\text {free},\lambda } \gamma _{\mathcal {E},\nu }^*\) maps \(L^2(\partial \Omega )\) continuously to \(L^2(\mathbb {R}^d)\). The operator \(\mathcal {S}_{\lambda }^t \gamma _{\mathcal {E},\nu }^*\) is the double layer operator on the boundary \(\partial \mathcal {O}\) and maps continuously \(L^2(\partial \Omega ) \rightarrow L^2(\partial \Omega )\) (see for example [52]). Since \(Q_\lambda ^{-1}\) is a pseudodifferential operator of order one, and \( \mathcal {S}_{\lambda }\) continuously maps \(H^{-1}(\partial \mathcal {O})\) to \(H^\frac{1}{2}(\mathbb {R}^d) \subset L^2(\mathbb {R}^d)\) we see that \(\mathcal {B}_{\mathcal {O},\lambda }: L^2(\partial \mathcal {O}) \rightarrow L^2(\mathbb {R}^d)\) is indeed well defined and continuous.
Similarly, we define \(\gamma _{\mathcal {O}_i,\nu }\) and \(\mathcal {B}_{\mathcal {O}_i,\lambda }\). Let \(p_{i}\) be the orthogonal projection \(L^2(\partial \mathcal {O}) \rightarrow L^2(\partial \mathcal {O}_i)\). Then, from (44) then have for \((\mathcal {B}_{\mathcal {O},\lambda }-\mathcal {B}_{\mathcal {O}_i,\lambda }) p_{i}\) the representation
where \(Q_{\mathcal {O}_i,\lambda }\) is the operator \(Q_\lambda \) in Eq. (22) when \(\mathcal {O}\) is replaced by \(\mathcal {O}_i\) in the definition. As in (14) we have used the decomposition \(Q_\lambda = {\tilde{Q}}_\lambda + T_\lambda \). Since \(T_\lambda \) is smoothing, so is \(Q_\lambda ^{-1}-{\tilde{Q}}_\lambda ^{-1} = - Q_\lambda ^{-1} T_\lambda {\tilde{Q}}_\lambda ^{-1}\). Similarly, as the free Green’s function is smooth off the diagonal the operator \(p_k S_\lambda ^t \gamma _{\mathcal {E},\nu }^* p_i\) has smooth integral kernel for \(k \not = i\). In particular, these operators are trace-class as maps from \(L^2(\partial \mathcal {O})\) to \(L^2(\partial \mathcal {O})\). Since \(S_\lambda \) as well as \(S_\lambda \tilde{Q}_{\lambda }^{-1}\) is bounded from \(L^2(\partial \mathcal {O})\) to \(L^2(\mathbb {R}^d)\) this shows that for every \(i \in \{1,\ldots , N\}\) the operator \((\mathcal {B}_{\mathcal {O},\lambda }-\mathcal {B}_{\mathcal {O}_i,\lambda }) p_{i}\) is nuclear.
Equation (40) can be rewritten as
where \(Y_i=\langle Y,\nu _{\mathcal {E}_i} \rangle |_{\partial \mathcal {O}_i}\) is viewed as a multiplication operator acting on \(L^2(\partial \mathcal {O}_i) \subset L^2(\partial \mathcal {O})\). Taking the trace, we have
Here, the cyclic permutation under the trace is justified because of the nuclearity of \((\mathcal {B}_{\mathcal {O},\lambda }-\mathcal {B}_{\mathcal {O}_i,\lambda }) p_i\). Since \(\mathcal {B}_{\mathcal {O},\lambda }^* \mathcal {B}_{\mathcal {O},\lambda } = \frac{1}{2\lambda } \gamma _{\mathcal {E},\nu } \frac{\mathrm {d}}{\mathrm {d}\lambda } R_{\mathcal {O},\lambda }\gamma _{\mathcal {E},\nu }^*\) its integral kernel is \(\frac{1}{2\lambda }\partial _\nu \partial _\nu '\frac{\partial G_{\mathcal {O},\lambda }}{\partial \lambda }\). We then obtain
\(\square \)
By Theorem 3.2, one can define trace-class operators \(g_\mathrm {rel}=D_f=\frac{\mathrm {i}}{\pi }\int _{\tilde{\Gamma }} \lambda f(\lambda ) R_{\mathrm {rel},\lambda } \mathrm {d}\lambda \) for \(f(\lambda )=g(\lambda ^2)\) and \(g\in {\mathcal {P}}_\theta \). We now have the following.
Proposition 5.7
Let \(g\in {\mathcal {P}}_\theta \) and \(\varphi _\epsilon \) be a boundary translation flow. Then, \(\delta _Y g_\mathrm {rel}\) is a \(C^{1}\) trace-class operator-valued function of \(\epsilon \) near the point \(\epsilon =0\). Its derivative \(\dot{g}_\mathrm {rel}= \delta _Y g_\mathrm {rel}\) satisfies
where \(g'(z)=\frac{\mathrm {d}g}{\mathrm {d}z}(z)\).
Proof
By Theorem 5.4, the operator \(R_{\mathrm {rel}}(z)\) is, for fixed \(z \notin \mathfrak {S}_\theta \), in the Banach space \(C^1(I, \mathcal {L}_1)\) of trace-class operator-valued \(C^1\)-functions on a compact interval I containing zero. Differentiation defines a closed operator from on \(C(I, \mathcal {L}_1)\) with domain \(C^1(I, \mathcal {L}_1)\). By Theorem 5.4, the derivative of \(R_{\mathrm {rel},\lambda }\) is integrable in \(\lambda \). An application of Hille’s theorem to the Bochner integral defining \(g_\mathrm {rel}\) in the Banach space of trace-class operators shows that differentiation in \(\epsilon \) commutes with integration. We therefore know that \(g_\mathrm {rel}\) is differentiable and
Let \(f(\lambda )=g(\lambda ^2)\). Using Proposition 5.6 and integration by parts in \(\lambda \), we have
\(\square \)
In the special case \(g(z) = \sqrt{z+m^2}\), Proposition 5.7 shows differentiability of \(H_\mathrm {rel}\) with respect to \(\epsilon \) in the space of trace-class operators at \(\epsilon =0\). Using Theorem 3.7 and differentiating under the trace then gives the following theorem.
Theorem 5.8
The variation of the relative energy is given by \(\delta _Y E_\mathrm {rel}=\frac{1}{2}\mathrm {Tr}(\delta _Y H_\mathrm {rel}).\)
We will now use the Hadamard variation formula to compute this variation.
5.3 Variation of the Klein–Gordon Energy
Theorem 5.9
Let Y be a smooth boundary translation vector field. The variation of the Klein–Gordon energy generated by Y is equal to the boundary integral of its spatial tensor contracted with Y. That is
where the integration on the right-hand side is at the exterior boundary and \(\nu \) is the exterior normal for \(\mathcal {E}\).
Proof
From Eq. (5), we have
We know that
-
\(((T_\mathrm {ren})_{\mathcal {O}_p})_{ij}\) is smooth on a neighbourhood of \(\mathcal {O}_q\) for \(p\ne q\) (by Theorem 2.6),
-
\(((T_\mathrm {ren})_{\mathcal {O}_p})_{ij}\) is divergence-free (by Theorem 2.6),
-
Y is constant on a neighbourhood of \(\mathcal {O}_q\) (by assumptions) for all \(q \in \{1, \ldots N\}\).
Therefore, \(F_i=( (T_\mathrm {ren})_{\mathcal {O}_p})_{ij} Y^j\) is smooth and divergence-free on a neighbourhood of \(\mathcal {O}_q\). In other words, we have
That is
As \(((T_\mathrm {ren})_\mathcal {O}-(T_\mathrm {ren})_{\mathcal {O}_p})_{ij}\) is vanishing at the boundary \(\partial \mathcal {O}_p\), we have
Now, from Theorem 2.4 we get
Altogether, we have
The second term in the above equation can be expressed as
where we used the properties (11) and (30) of \(G_{\mathcal {O}}(\lambda ^2)=G_{\mathcal {O},\lambda }\) and \(G_{\mathcal {O}_p}(\lambda ^2)=G_{\mathcal {O}_p,\lambda }\). Now we obtain
Since \(\int _{\partial \mathcal {O}} (T_\mathrm {rel})_{ij}\nu ^i Y^j \mathrm {d}\sigma \) is integrating at the exterior boundary, we have
Applying Proposition 5.7 to \(g(z)=\sqrt{z+m^2}\), we have
Now \(\mathrm {Tr}_{\partial \mathcal {O}}[ \partial _\nu \partial _\nu ' \breve{H}_\mathcal {O}^{-1}\langle Y,\nu \rangle ]_--\sum _{p=1}^N\mathrm {Tr}_{\partial \mathcal {O}_p}[ \partial _\nu \partial _\nu ' \breve{H}_{\mathcal {O}_p}^{-1}\langle Y,\nu \rangle ]_-=0\) as Y is a boundary translation vector field. Therefore, we have
That is
Finally, Eqs. (47), (48) and Theorem 5.8 complete the proof. \(\square \)
An application of an analogue of Theorem 5.9 to calculate the Casimir force in dimension one can be found in Appendix 7. Finally, we have the following theorem.
Theorem 5.10
Let \(T_\mathrm {ren}\) be the renormalised stress–energy tensor in Theorem 2.3 and let Y be a boundary translation flow as in Theorem 5.9. We assume further that Y is constant near \(\mathcal {O}_p\) for some \(p \in \{1,\dots ,N\}\) and vanishes near \(\mathcal {O}_q\) if \(q \not = p\). Let S be any smooth hypersurface in \(\mathcal {E}\) that is homologous to \(\partial \mathcal {O}_p\) in \(\overline{\mathcal {E}}\) and let \(\nu \) be the exterior normal vector field of S. Then, the variation of the relative energy generated by Y is equal to
where Z is the unique constant vector field on \(\mathbb {R}^d\) whose restriction to \(\partial \mathcal {O}_q\) equals Y.
Proof
As in the proof of Theorem 5.9, we have
We also know that \(\partial \mathcal {O}_p\) is homologous to \(S_{R}\), a sphere with sufficiently large radius R in \(\mathbb {R}^d\backslash \mathcal {O}_p\). Because \((T_\mathrm {ren})_{\mathcal {O}_p}\) is divergent-free, one has
To get a decay property of \((T_\mathrm {ren})_{\mathcal {O}_p}\) at infinity, we first recall that
For \(m\ne 0\), we would have an exponential decay of \(e^{-Cm \rho (x)}\) for \(\left( \breve{H}_{\mathcal {O}} - \breve{H}_\text {free}\right) (x,x)\), as explained in the proof of Theorem 3.5. For \(m=0\), one could use the estimates of (17) and (18) to obtain that, for \(d\ge 2\),
Moreover, estimate (19) also implies
Applying the above estimates to Theorem 2.4, one concludes that
This implies \(\lim _{R\rightarrow \infty } \int _{S_R}((T_\mathrm {ren})_{\mathcal {O}_p})_{ij}\nu ^i \nu ^j \langle Y,\nu \rangle \mathrm {d}\sigma =0\).
As \((T_\mathrm {ren})_\mathcal {O}=T_\mathrm {ren}\) (see Definition 2.3 and the first two paragraphs of Section 3), this completes the proof. \(\square \)
6 The Zeta Regularised Energy and the Equivalence of (1) and (3)
In this section, we assume that \(m>0\). In that case it is well known that for \({\text {Re}}(s)>(d+1)/2\) the operator
is trace-class (see for example [53]) and we can therefore define the renormalised zeta functions \(\zeta _\mathcal {O}(s)\) as
where \(\xi (\lambda )\) is the spectral shift function of the problem. The Birman–Krein formula applies to this setting and we have
where \(S(\lambda )\) is the stationary scattering matrix of the problem.
One can also use the Mellin transform to write
where
The following lemma should be well known but we could not find a reference for the precise statement. It is a simple consequence of heat kernel expansions [54,55,56] and Kac’s principle of not feeling the boundary [57, 58]. We also refer to [59, 60] for more details on obstacle scattering theory and the Birman–Krein formula.
Lemma 6.1
The function \(h_\mathcal {O}(t)\) is exponentially decaying as \(t \rightarrow \infty \) and has a full asymptotic expansion as \(t \searrow 0\) of the form
where the infinite sum is understood in the sense of asymptotic summation. The coefficients \(a_k\) are integrals over \(\partial \mathcal {O}\) of locally determined quantities expressed in terms of the extrinsic and intrinsic curvature of the boundary and its derivatives. In particular,
Proof
The exponential decay follows immediately from the representation by means of the spectral shift function and \(m>0\). Now \(h_\mathcal {O}(t)\) is the trace of the difference of the two heat operators \(e^{-t H^2_\mathcal {O}}\) and \(e^{-t H^2_{\text {free}}}\) with integral kernels \(K_\mathcal {O}(t,x,y)\) and \(K_\text {free}(t,x,y)\), respectively. Since the difference is trace-class and the integral kernel is smooth we have
where \(B_R\) is a ball of radius R, i.e. integration is over a large ball of radius R with the obstacles removed. The heat kernel difference satisfies not feeling the boundary estimates. For example, a general finite propagation speed estimate [58] gives
where \(\rho (x) = \mathrm {dist}(x,\mathcal {O})\). This shows that
Let \(\mathcal {U}\subset \mathcal {E}\) be an open neighbourhood that contains \(\partial \mathcal {E}\), i.e. \(\partial \mathcal {E}\subset \mathcal {U}\subset \mathcal {E}\). Then, we have for \(|t|<1\) that
for any \(N>0\). This computation is therefore purely local. Using another not feeling the boundary estimate we can replace \(K_\mathcal {O}(t,x,x)\) in this integral by the Dirichlet heat kernel on the compact manifold obtained by removing \(\mathcal {O}\) from the large flat torus. The coefficients are therefore the same as the heat kernel coefficients on a domain with boundary. It is well-known that heat kernel coefficients are determined by local invariants of the jets of the symbols of the operators, i.e. jets of Riemannian metric and second fundamental forms ([56, Lemma 2.6] or [54, Lemma 2.1]). As our interior geometry is Euclidean, only the first heat coefficients corresponding to the interior is nonzero. This first coefficient is the same for both operators and therefore only boundary terms contribute to the expansion. The first non-trivial term is given by \(a_0 = -\frac{1}{4}(4 \pi )^{-\frac{d-1}{2}} \mathrm {Vol}_{d-1}(\partial \mathcal {O})\) (see for example [54, Theorem 1.1]). \(\square \)
Remark 6.2
The general form of the heat expansion for a compact Riemannian manifold M with boundary \(\partial M\) is of the form
where the \(b_{2k}\) are integrals of locally defined quantities over M, and the \(b_{2k+1}\) are integrals of locally defined quantities over \(\partial M\) which are determined by the boundary conditions. When considering differences heat kernels of Laplace operators with different boundary conditions the \(b_{2k}\) terms cancel and only the terms containing \(b_{2k+1}\) remain.
It follows as usual (for example [61, Sect. 1.12]) that \(\zeta _\mathcal {O}\) has a meromorphic continuation to the complex plane. If d is odd then there are finitely many poles at \(\{\frac{d-1}{2},\frac{d-1}{2}-1,\ldots , 1\}\) with residue at \(\frac{d-1}{2}-k\) determined by the coefficients \(a_k\). In this case, the values at non-positive integers are also expressible in terms of \(a_k\). In case d is even, poles may be located at the points \(\{\frac{d-1}{2}-k \;\mid \; k \in \mathbb {N}_0 \}\).
Definition 6.3
The regularised energy \(E_\mathrm {reg}\) is then defined
where \( \mathrm {FP}_{z=a} f(z)\) denotes the finite part of the meromorphic function f at the point a, i.e. the constant term in the Laurent expansion of f about the point a.
In particular, in case d is odd we have \(E_\mathrm {reg}= \frac{1}{2} \zeta _\mathcal {O}(-\frac{1}{2})\).
We can also define a zeta regularised energy \(E^j_\mathrm {reg}\) for every object \(\mathcal {O}_j\). Obviously, \(E^j_\mathrm {reg}\) does not depend on the position of \(\mathcal {O}_j\) in \(\mathbb {R}^d\) and is also invariant under active rotations of the object. Since the heat coefficients are local quantities the relative zeta function
is an entire function. Since the relative quantities
are trace-class for \(s<0\) we also have that \(E_\mathrm {rel}= \frac{1}{2} \zeta _\mathrm {rel}(-\frac{1}{2})\) and therefore
Thus, \(E_\mathrm {rel}- E_\mathrm {reg}\) does not change if the individual objects are translated or rotated.
7 Proof of Main Theorem
In this section, we will prove our main theorem (Theorem 1.1) by combining the results we obtained in the previous sections.
Proof of Theorem 1.1
The differentiability of \(E_\mathrm {rel}(\epsilon )\) follows from Theorems 5.3 and 5.4. As shown in Section 3, we have \(\Xi (\lambda ) = \log \det (Q_\lambda {\tilde{Q}}_\lambda ^{-1})\) and Theorem 3.7. Recall that equation (21), says
By substituting \(\Xi (\lambda )\), one obtains Eq. (2) of our main theorem. Equation (1) follows immediately from Theorems 5.9 and 5.10. Since the flow \(\Phi _\epsilon \) described right above Theorem 1.1 is exactly a boundary translation flow (see Definition 5.1), we know from the end of Sect. 6 that \(E^j_\mathrm {reg}\) is constant if the individual objects are translated. Hence, the fact that \(E_\mathrm {rel}(\epsilon ) - E_{\mathrm {reg}}(\epsilon )\) is constant near \(\epsilon =0\) for \(m>0\) follows. \(\square \)
References
Casimir, H.B.G.: On the attraction between two perfectly conducting plates. Indag. Math. 10, 261–263 (1948)
Sparnaay, M.J.: Measurements of attractive forces between flat plates. Physica 24(6), 751–764 (1958)
Bressi, G., Carugno, G., Onofrio, R., Ruoso, G.: Measurement of the Casimir force between parallel metallic surfaces. Phys. Rev. Lett. 88, 041804 (2002)
Chan, H.B., Aksyuk, V.A., Kleiman, R.N., Bishop, D.J., Capasso, F.: Quantum mechanical actuation of microelectromechanical systems by the Casimir force. Science 291(5510), 1941–1944 (2001)
Ederth, T.: Template-stripped gold surfaces with 0.4 nm rms roughness suitable for force measurements: application to the Casimir force in the 20–100 nm range. Phys. Rev. A 62, 062104 (2000)
Lamoreaux, S.K.: Demonstration of the Casimir force in the \(0.6\) to 6 \(\mu m\) range. Phys. Rev. Lett. 78(1), 5 (1997)
Mohideen, U., Roy, A.: Precision measurement of the Casimir force from \(0.1\) to 0.9 \(\mu m\). Phys. Rev. Lett. 81(21), 4549 (1998)
Roy, A., Lin, C.-Y., Mohideen, U.: Improved precision measurement of the Casimir force. Phys. Rev. D 60(11), 111101 (1999)
Bordag, M., Klimchitskaya, G.L., Mohideen, U., Mostepanenko, V.M.: New developments in the Casimir effect. Phys. Rep. 353(1–3), 1–205 (2001)
Bordag, M., Klimchitskaya, G.L., Mohideen, U., Mostepanenko, V.M.: Advances in the Casimir effect. Oxford University Press, Oxford (2009)
Elizalde, E., Romeo, A.: Expressions for the zeta-function regularized Casimir energy. J. Math. Phys. 30(5), 1133–1139 (1989)
Elizalde, E., Romeo, A.: Heat-kernel approach to the zeta-function regularization of the Casimir energy for domains with curved boundaries. Int. J. Mod. Phys. A 5(09), 1653–1669 (1990)
Kirsten, K.: Spectral Functions in Mathematics and Physics. Chapman & Hall/CRC, Boca Raton (2002)
Brown, L.S., Maclay, G.J.: Vacuum stress between conducting plates: an image solution. Phys. Rev. 184(5), 1272 (1969)
Dzyaloshinskii, I.E., Lifshitz, E.M., Pitaevskii, L.P.: General theory of Van der Waals’ forces. Physics-Uspekhi 4(2), 153–176 (1961)
Deutsch, D., Candelas, P.: Boundary effects in quantum field theory. Phys. Rev. D 20(12), 3063–3080 (1979)
Kay, B.S.: Casimir effect in quantum field theory. Phys. Rev. D 20(12), 3052 (1979)
Scharf, G., Wreszinski, W.F.: On the Casimir effect without cutoff. Found. Phys. Lett. 5(5), 479–487 (1992)
Fulling, S.A.: Vacuum energy as spectral geometry. Sym. Integrab. Geom.: Meth. Appl 3, 094 (2007)
Dappiaggi, C., Nosari, G., Pinamonti, N: The Casimir effect from the point of view of algebraic quantum field theory. arXiv preprint arXiv:1412.1409, (2014)
Johnson, S.G.: Numerical methods for computing Casimir interactions. In: Dalvit, D., Milonni, P., Roberts, D., da Rosa, F. (eds.) Casimir Physics, pp. 175–218. Springer, Berlin (2011)
Reid, M.T.H., Rodriguez, A.W., White, J., Johnson, S.G.: Efficient computation of Casimir interactions between arbitrary \(3d\) objects. Phys. Rev. Lett. 103(4), 040401 (2009)
Rodriguez, A.W., Capasso, F., Johnson, S.G.: The Casimir effect in microstructured geometries. Nat. Photon. 5(4), 211–221 (2011)
Bimonte, G., Emig, T., Kardar, M., Krüger, M.: Nonequilibrium fluctuational quantum electrodynamics: heat radiation, heat transfer, and force. Ann. Rev. Condens. Matter Phys. 8, 119–143 (2017)
Emig, T., Graham, N., Jaffe, R.L., Kardar, M.: Casimir forces between arbitrary compact objects. Phys. Rev. Lett. 99(17), 170403 (2007)
Emig, T., Graham, N., Jaffe, R.L., Kardar, M.: Casimir forces between compact objects: the scalar case. Phys. Rev. Lett. 77(2), 025005 (2008)
Emig, T., Jaffe, R.L.: Casimir forces between arbitrary compact objects. J. Phys. A: Math. Theor. 41(16), 164001 (2008)
Emig, T., Jaffe, R.L., Kardar, M., Scardicchi, A.: Casimir interaction between a plate and a cylinder. Phys. Rev. Lett. 96(8), 080403 (2006)
Kenneth, O., Klich, I.: Opposites attract: a theorem about the Casimir force. Phys. Rev. Lett. 97, 060401 (2006)
Kenneth, O., Klich, I.: Casimir forces in a T operator approach. Phys. Rev. B 78, 014103 (2008)
Milton, K.A., Wagner, J.: Multiple scattering methods in Casimir calculations. J. Phys. A: Math. Theor. 41, 155402 (2008)
Rahi, S.J., Emig, T., Graham, N., Jaffe, R.L., Kardar, M.: Scattering theory approach to electrodynamic Casimir forces. Phys. Rev. D 80(8), 085021 (2009)
Renne, M.J.: Microscopic theory of retarded van der Waals forces between macroscopic dielectric bodies. Physica 56, 125–137 (1971)
Hanisch, F., Strohmaier, A., Waters, A.: A relative trace formula for obstacle scattering. arxiv: pdf/2002.07291 (2020)
Balian, R., Duplantier, B.: Electromagnetic waves near perfect conductors II. Casimir effect. Ann. Phys. 112, 165–208 (1978)
Kirsten, K., Lee, Y.: Gluing formula for Casimir energies. Symmetry 10(1), 31 (2018)
Gies, H., Langfeld, K., Moyaerts, L.: Casimir effect on the worldline. JHEP 06, 018 (2003)
Candelas, P.: Vacuum energy in the presence of dielectric and conducting surfaces. Ann. Phys. 143(2), 241–295 (1982)
Garabedian, P.R., Schiffer, M.: Convexity of domain functionals. In Journal d’Analyse Mathématique 2, 281–368 (1952)
Hadamard, J Mémoire Sur Le Problème D’analyse Relatif À L’équilibre Des Plaques Élastiques Encastrées . Mémoires des Savants Etrangers, 1908
Peetre, J.: On Hadamard’s variational formula. J. Differ. Equ. 36(3), 335–346 (1980)
Strohmaier, A., Waters, A.: Geometric and obstacle scattering at low energy. Comm. Partial Differ. Equ. 45, 1451–1511 (2020)
Shubin, M.A.: Pseudodifferential Operators and Spectral Theory, 2nd edn. Springer, Berlin (2001)
Taylor, M.E.: Pseudodifferential Operators. Princeton Mathematical Series, vol. 34. Princeton University Press, Princeton (1981)
McLean, W.: Strongly Elliptic Systems and Boundary Integral Equations. Cambridge University Press, Cambridge (2000)
Strohmaier, A., Verch, R., Wollenberg, M.: Microlocal analysis of quantum fields on curved space-times: analytic wave front sets and Reeh-Schlieder theorems. J. Math. Phys. 43(11), 5514–5530 (2002)
Dyatlov, S, Zworski, M.: Mathematical Theory of Scattering Resonances. http://math.mit.edu/~dyatlov/res/res_final.pdf, (2019)
Henry, D.: Perturbation of the Boundary in Boundary-Value Problems of Partial Differential Equations. London Mathematical Society Lecture Note Series (318), Cambridge University Press, Cambridge (2005)
Hezari, H., Zelditch, S.: \(C^{\infty }\) Spectral rigidity of the ellipse. Analysis & PDE 5(5), 1105–1132 (2012)
Ozawa, S.: Hadamard’s variation of the Green kernels of heat equations and their traces I. J. Math. Soc. Jpn. 34(3), 455–473 (1982)
Suzuki, T., Tsuchiya, T.: First and second Hadamard variational formulae of the Green function for general domain perturbations. J. Math. Soc. Jpn. 68(4), 1389–1419 (2016)
Chandler-Wilde, S.N., Graham, I.G., Langdon, S., Spence, E.A.: Numerical-asymptotic boundary integral methods in high-frequency acoustic scattering. Acta Numer. 21, 89–305 (2012)
Borisov, N.V., Müller, W., Schrader, R.: Relative index theorems and supersymmetric scattering theory. Comm. Math. Phys. 114(3), 475–513 (1988)
Branson, T.P., Gilkey, P.B.: The asymptotics of the Laplacian on a manifold with boundary. Comm. Partial Differ. Equ. 15, 245–272 (1990)
Greiner, P.: An asymptotic expansion for the heat equation. Arch. Rat. Mech. Anal. 41, 163–218 (1971)
Gilkey, P.B., Smith, L.: The eta invariant for a class of elliptic boundary value problems. Comm. Pure Appl. Math. 36, 85–131 (1983)
Kac, M.: On some connections between probability theory and differential and integral equations. In: Proceedings of Second Berkeley Symposium on Mathematical Statistics and Probability, pp. 189–215. University of California Press, Berkeley and Los Angeles, (1951)
Li, L., Strohmaier, A.: Heat kernel estimates for general boundary problems. J. Spectr. Theory 6(4), 903–919 (2016)
Jensen, A., Kato, T.: Asymptotic behavior of the scattering phase for exterior domains. Comm. Partial Differ. Equ. 3(12), 1165–1195 (1978)
Kato, T.: Monotonicity theorems in scattering theory. Hadron. J. 1, 134–154 (1978)
Gilkey, P.B.: Invariance Theory, the Heat Equation, and the Atiyah-Singer Index Theorem. Studies in Advanced Mathematics, 2nd edn. CRC Press, Boca Raton (1994)
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Karl-Henning Rehren.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Y-L. Fang: Supported by Leverhulme Grant RPG-2017-329.
Appendices
Appendix A: Bounds on the Free Resolvent
In this appendix, we will give some estimates on the kernels of \(\phi G_{\text {free},\lambda }\phi \) and \(\phi G_{\text {free},\lambda }\chi \), which are denoted by \(K_\lambda (x,y)\) and \(\tilde{K}_\lambda (x,y)\), respectively. They are given by
where \(H^{(1)}_\nu \) is the Hankel function. Moreover, we assume \(\phi \) has compact support whereas the support of \(\chi \) is unbounded with \(\mathrm {dist}({{\,\mathrm{supp}\,}}\phi ,{{\,\mathrm{supp}\,}}\chi )=\delta >0\). Now we have
Similarly,
Recall that Hankel functions have the following asymptotic
Therefore, for small \(\lambda \) such that \(r_\phi |\lambda | < r_0\), we have
For large \(\lambda \) such that \(r_\phi |\lambda | \ge r_0\), we have
The above terms can be bounded by using the asymptotic of Hankel functions in Eqs. (49) and (50).
Replacing \(r_\phi |\lambda |\) by \(\infty \) in the integral, we obtain
Combining with the asymptotic for small \(|\lambda |\), we have
For \(\tilde{\mathcal {K}}_{\mu ,\nu ,p,d}\), we have
For small \(\lambda \) such that \(\delta |\lambda | < r_0\), we also have
This can be bounded by
Replacing \(\delta |\lambda |\) by 0, one has
For large \(\lambda \) such that \(\delta |\lambda | \ge r_0\), we have
This implies
By the Schur test and Eq. (51), we have
Using the recurrence relations of Hankel functions \((H^{(1)}_{\nu })'(z)=-H^{(1)}_{\nu +1}(z)+\frac{\nu }{z}H^{(1)}_{\nu }(z)\) and similar calculations, one has
This is the same bound for \(\Vert \phi G_{\text {free},\lambda }\phi \Vert _{L^2\rightarrow H^1}\). To get the bound for \(\Vert \phi G_{\text {free},\lambda }\phi \Vert _{L^2\rightarrow H^2}\), we recall that \((-\Delta -\lambda ^2)G_{\text {free},\lambda }=I\), hence
where \(\tilde{\phi }=1\) near \({{\,\mathrm{supp}\,}}\phi \) and \({\text {diam}}({{\,\mathrm{supp}\,}}\tilde{\phi })\) is bounded. Combining with the estimates on \(\Vert \phi G_{\text {free},\lambda }\phi \Vert _{L^2\rightarrow H^1}\) and \(\Vert \phi G_{\text {free},\lambda }\phi \Vert _{L^2\rightarrow L^2}\), we have
For the operator \(\chi G_{\text {free},\lambda }\phi \), we obtain from Eq. (52) that
which gives the verification of the estimate (24).
Appendix B: The Method Illustrated for the 1-D Casimir Effect
In this appendix, we illustrate Theorem 5.9 in its simplest form, i.e. for the case of the one-dimensional Casimir effect. This will also illustrate the advantages of the relative framework. Let \(a_1<b_1<a_2<b_2\), where \(\mathcal {O}_1=(a_1,b_1)\) and \(\mathcal {O}_2=(a_2,b_2)\) are the obstacles. Then, we have
Then, \(G_\mathrm {rel}=G_\mathcal {O}-G_{\mathcal {O}_1}-G_{\mathcal {O}_2}+G_\text {free}\) is given by
In particular,
For \(x,y>b\), we have
which implies
The same calculation yields for \(x,y<b\). That is
When restricting to the diagonal, we have
Now for \(a<x,y<b\), we have
which implies
Note that
and
implies
When restricting to the diagonal, we have
Equation (53) gives
From Eqs. (54) and (55), we have
This equation shows that \(H_\mathrm {rel}(x)\) is continuous, which is consistent with the claim in the proof of Theorem 5.8. Integrating over \(\mathbb {R}\), we have
Similarly, one has the renormalised counterpart of \(H_\mathrm {rel}(x)\), which is given by \((H_\mathcal {O})_{\text {ren}}=[\Delta _{\mathcal {O}}^{\frac{1}{2}}-\Delta _\text {free}^{\frac{1}{2}}]|_\mathtt {\Delta }\). Note that this only corresponds to the first term in \(T_{00}\) of (5), i.e. \( \frac{1}{2}(H-H_\text {free})|_\mathtt {\Delta }\). It is given by
It is easy to see that \(H_{\text {ren}}\) is not integrable. Therefore, some regularisation schemes would be needed at this point. One way is by heat kernel regularisation (see, for instance, [19]). However, this only resolves the non-integrability problem of the first term of \(T_{00}\). We also need to integrate the term \(\frac{1}{8}\Delta [(H^{-1}-H_\text {free}^{-1})|_\mathtt {\Delta }]\) in Eq. (5) over \(\mathbb {R}\). This is also ill-defined, as it is not integrable. We will see that these problems disappear when we work in the relative setting.
Restricting Eq. (53) to the diagonal and then taking the action of Laplacian, we have
Integrating spectral variable k along \(\tilde{\Gamma }\) and then over the space variable x, we have
hence
Note that using heat kernel regularisation, one would also obtain \(E_\mathrm {rel}\), see [19]. Equations (56) and (58) agree with Theorem 3.7. Note that equation (57) also shows that
where all the three terms are ill-defined as they are not integrable. For instance, \(\Delta [(H_{\mathcal {O}_1}^{-1}-H_\text {free}^{-1})|_\mathtt {\Delta }](x)\) has \(\frac{1}{(a_1-x)^2}\) singularity when approaching \(x=a_1\). This justifies Remark 2.5.
Now let X be a smooth vector field that generates a movement of (right) obstacle 2 to the right with a constant speed v. Moreover, X is zero around (left) obstacle 1. In other words, we move the obstacle 2 to the right by \(v\epsilon \) and keep the obstacle 1 stationary. Now the variation of the relative energy is given by \(\delta _X E=v\cdot \partial _{a_2} E\). The left-hand side of Eq. in Theorem 5.8 becomes
Now the identity (48) used in the proof of Theorem 5.9 says
It becomes
Note that
therefore
Combining Eqs. (58), (60) and (61), we have verified the identity (48) in one-dimensional cases. Moreover, Eqs. (59), (60) and (61) are consistent with Theorems 5.8 and 5.9.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Fang, YL., Strohmaier, A. A Mathematical Analysis of Casimir Interactions I: The Scalar Field. Ann. Henri Poincaré 23, 1399–1449 (2022). https://doi.org/10.1007/s00023-021-01119-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00023-021-01119-z